paper-summary compositionality embeddings natural-language-processing

Paper Summary: Context is Everything: Finding Meaning Statistically in Semantic Spaces

01 May 2018   Summary of the 2018 article "Context is Everything: Finding Meaning Statistically in Semantic Spaces" by Zelikman, where the author introduces CoSal weighting for bag-of-words vectors.

data-science peopleware data-newsletter-5 machine-learning-engineering

Podcast Episode Overview: What Machine Learning Engineers Need to Know

23 Apr 2018   Overview of a great podcast episode on whether, and to what extent, data teams need a new role: the Machine Learning Engineer.

matplotlib machine-learning scikit-learn

Visualizing Machine Learning Models: Examples with Scikit-learn, XGB and Matplotlib

23 Apr 2018   Examples of how to use matplotlib and Scikit-learn together to visualize the behaviour of machine learning models and to conduct exploratory analysis.


Pandas Dataframe: Merge and Join Examples

17 Apr 2018   Examples of how to use pandas.merge to do SQL-style joins on pandas dataframes.

machine-learning data-science model-evaluation

Introduction to AUC and Calibrated Models with Examples using Scikit-Learn

15 Apr 2018   Inspired by a Linear Digressions podcast episode that covers what AUC is, what it is not, and why you need well-calibrated models if you want to treat their outputs as probabilities.

dlt jvm

Corda Framework Overview + Examples

07 Apr 2018   Overview of the main concepts of the Corda framework for building decentralized applications based on Distributed Ledger Technology (DLT).


Python Dict Examples

31 Mar 2018   Examples of common dict operations. Includes mapping, filtering, dict comprehension and how to work with ordered dicts.


Similarity measures and distances: Basic reference and examples for data science practitioners

10 Mar 2018   Measuring how far apart two points are is not as simple as you might think, and choosing the right distance measure can make predictive or exploratory models perform very well or very poorly. Reference and examples covering Euclidean distance, Manhattan distance, Mahalanobis distance, and others.

python documentation testing

Python Docstrings: Reference & Examples

24 Feb 2018   Examples to help you document your Python code using any of the commonly used docstring styles.


Crypto Asset Overview: Augur Project

18 Feb 2018   Short overview of the minimum you need to know about the Augur project and the REP token.
