paper-summary compositionality embeddings natural-language-processing
Paper Summary: Context is Everything: Finding Meaning Statistically in Semantic Spaces
01 May 2018 Summary of the 2018 article "Context is Everything: Finding Meaning Statistically in Semantic Spaces" by Zelikman, where the author introduces CoSal weighting for bag-of-words vectors.
data-science peopleware data-newsletter-5 machine-learning-engineering
Podcast Episode Overview: What Machine Learning Engineers need to Know
23 Apr 2018 Overview of a great podcast episode on how much (if at all) we need a new role for data teams, namely Machine Learning Engineers.
matplotlib machine-learning scikit-learn
Visualizing Machine Learning Models: Examples with Scikit-learn, XGB and Matplotlib
23 Apr 2018 Examples on how to use matplotlib and Scikit-learn together to visualize the behaviour of machine learning models, conduct exploratory analysis, etc.
Read More ›Pandas Dataframe: Merge and Join Examples
17 Apr 2018 Examples on how to use pandas.merge to do SQL-style joins on pandas dataframes.
Read More ›machine-learning data-science model-evaluation
Introduction to AUC and Calibrated Models with Examples using Scikit-Learn
15 Apr 2018 Inspired by a podcast episode by Linear Digressions, which talks about what AUC is and what it is not and why you need well calibrated models if you want to treat their outputs as probabilities.
Corda Framework Overview + Examples
07 Apr 2018 Overview of the main concepts of the Corda framework for building decentralized applications based on Distributed Ledger Technology (DLT).
Python Dict Examples
31 Mar 2018 Examples of common dict operations. Includes mapping, filtering, dict comprehension and how to work with ordered dicts.
Read More ›Similarity measures and distances: Basic reference and examples for data science practitioners
10 Mar 2018 Measuring how far apart two points are is not as simple as you think and knowing how to use each can make predictive or exploratory models perform either very poorly or very well. Reference and examples including euclidean distance, manhattan distance, mahalanobis distance, etc.
Read More ›Python Docstrings: Reference & Examples
24 Feb 2018 Examples to help you document your Python code using any of the commonly used docstring styles.
Read More ›Crypto Asset Overview: Augur Project
18 Feb 2018 Short overview with the least you need to know about the Augur project and the REP token.