Skip to content

The Science of Data

Statistics, Data Science and everything in between

  • About the Author
  • Blogroll
  • Contact Me

Author: Junaid

Posted on 18th February 202119th February 2021

This one’s for the Oddballs

Introduction A very interesting paper was brought to my attention which proposes an adjustment to the traditional Stochastic Gradient Descent approach called Oddball Stochastic Gradient…

Continue reading → This one’s for the Oddballs

Posted on 16th February 2021

The Likelihood Function

Introductory Concepts In the field of statistics, researchers are interested in making inferences from data. The data is collected from a population; the data drawn…

Continue reading → The Likelihood Function

Posted on 5th October 20206th October 2020

Definitely Interesting Matrices

This is a post I've been wanting to write for a while - Quadratic forms and Definite matrices are everywhere in linear algebra and they…

Continue reading → Definitely Interesting Matrices

Posted on 26th September 202026th September 2020

Batch Updating with Plumber and Google Scheduler

Recently, I’ve had a chance to play with R’s plumber library and used it to run scripts on a schedule. This post will show how…

Continue reading → Batch Updating with Plumber and Google Scheduler

Posted on 20th July 202021st July 2020

Why You’re Not as Data-Driven as You Think You Are

How many times have you heard someone say they are data-driven or data centric or that “data is the heart of everything they do”? I’ve…

Continue reading → Why You’re Not as Data-Driven as You Think You Are

Posted on 25th June 2020

Shortest Paths: Dijkstra’s Algorithm

Introduction I have a startling admission to make. When I was a student, I scoffed at Dijkstra's algorithm - I had paid it no mind…

Continue reading → Shortest Paths: Dijkstra’s Algorithm

Posted on 24th May 2020

Word2vec vs Fasttext – A First Look

Introduction Recently, I've had a chance to play with word embedding models. Word embedding models involve taking a text corpus and generating vector representations for…

Continue reading → Word2vec vs Fasttext – A First Look

Posted on 25th February 2020

Being a Data Scientist at a Start-Up

Although data science as a job function is relatively new compared to roles like software engineer or database administrator, in the age of “Big Data”,…

Continue reading → Being a Data Scientist at a Start-Up

Posted on 23rd February 2020

ML Classifier Evaluation – A First Look

Once you’ve built a machine learning classifier, the next step is to validate it and see how well it fits the data. This short post…

Continue reading → ML Classifier Evaluation – A First Look

Posted on 22nd December 2019

Naive and Proud: Introducing the Naive Bayes Algorithm

The Naive Bayes Algorithm is a simple and elegant approach for tackling supervised learning problems in Machine Learning. This post will be a brief introduction…

Continue reading → Naive and Proud: Introducing the Naive Bayes Algorithm

Posts navigation

1 2 Next →
Powered by WordPress.com.
  • About the Author
  • Blogroll
  • Contact Me
The Science of Data
Proudly powered by WordPress Theme: TheFour.