Skip to content

The Science of Data

Statistics, Data Science and everything in between

  • About the Author
  • Blogroll
  • Contact Me

Author: Junaid

Posted on 17th July 2021

The Bootstrap

Recently I’ve had occasion to use the bootstrap and have been reminded at what a remarkably powerful technique this is despite it’s simplicity. I thought…

Continue reading → The Bootstrap

Posted on 7th July 2021

Look what the Cat dragged in: Catboost with Tidymodels

I’ve been learning the Tidymodels framework for building Machine Learning models in R pioneered by Max Kuhn and Julia Silge. After spending a few weeks…

Continue reading → Look what the Cat dragged in: Catboost with Tidymodels

Posted on 18th February 202119th February 2021

This one’s for the Oddballs

Introduction A very interesting paper was brought to my attention which proposes an adjustment to the traditional Stochastic Gradient Descent approach called Oddball Stochastic Gradient…

Continue reading → This one’s for the Oddballs

Posted on 16th February 2021

The Likelihood Function

Introductory Concepts In the field of statistics, researchers are interested in making inferences from data. The data is collected from a population; the data drawn…

Continue reading → The Likelihood Function

Posted on 5th October 20206th October 2020

Definitely Interesting Matrices

This is a post I've been wanting to write for a while - Quadratic forms and Definite matrices are everywhere in linear algebra and they…

Continue reading → Definitely Interesting Matrices

Posted on 26th September 202026th September 2020

Batch Updating with Plumber and Google Scheduler

Recently, I’ve had a chance to play with R’s plumber library and used it to run scripts on a schedule. This post will show how…

Continue reading → Batch Updating with Plumber and Google Scheduler

Posted on 20th July 202021st July 2020

Why You’re Not as Data-Driven as You Think You Are

How many times have you heard someone say they are data-driven or data centric or that “data is the heart of everything they do”? I’ve…

Continue reading → Why You’re Not as Data-Driven as You Think You Are

Posted on 25th June 2020

Shortest Paths: Dijkstra’s Algorithm

Introduction I have a startling admission to make. When I was a student, I scoffed at Dijkstra's algorithm - I had paid it no mind…

Continue reading → Shortest Paths: Dijkstra’s Algorithm

Posted on 24th May 2020

Word2vec vs Fasttext – A First Look

Introduction Recently, I've had a chance to play with word embedding models. Word embedding models involve taking a text corpus and generating vector representations for…

Continue reading → Word2vec vs Fasttext – A First Look

Posted on 25th February 2020

Being a Data Scientist at a Start-Up

Although data science as a job function is relatively new compared to roles like software engineer or database administrator, in the age of “Big Data”,…

Continue reading → Being a Data Scientist at a Start-Up

Posts navigation

1 2 3 Next →
Powered by WordPress.com.
  • About the Author
  • Blogroll
  • Contact Me
The Science of Data
Proudly powered by WordPress Theme: TheFour.
 

Loading Comments...