queirozf.com

Evaluation Metrics for Regression Problems: Quick examples + Reference

26 May 2018 Regression problems are evaluated against specific metrics that analyze whether the residuals (difference between actual and predicted values) indicate that a fitted model is a good fit for the data. Here are some of the most commonly-used metrics in that domain.

Read More ›

vim

Vim Examples: Search and Replace

24 May 2018 Examples on how to search and replace text on Vim; simple exmaples, using regexes, etc.

Paper Summary: A Simple but Tough-to-beat Baseline for Sentence Embeddings

13 May 2018 Summary of the 2017 article "A Simple but Tough-to-beat Baseline for Sentence Embeddings" by Arora et al.

Read More ›

scikit-learn

Scikit-Learn examples: Making Dummy Datasets

02 May 2018 Make dummy datasets to test out classifiers and/or parameter configurations in Scikit-learn.

Paper Summary: Context is Everything: Finding Meaning Statistically in Semantic Spaces

01 May 2018 Summary of the 2018 article "Context is Everything: Finding Meaning Statistically in Semantic Spaces" by Zelikman, where the author introduces CoSal weighting for bag-of-words vectors.

Podcast Episode Overview: What Machine Learning Engineers need to Know

23 Apr 2018 Overview of a great podcast episode on how much (if at all) we need a new role for data teams, namely Machine Learning Engineers.

Visualizing Machine Learning Models: Examples with Scikit-learn, XGB and Matplotlib

23 Apr 2018 Examples on how to use matplotlib and Scikit-learn together to visualize the behaviour of machine learning models, conduct exploratory analysis, etc.

Read More ›

pandas

Pandas Dataframe: Merge and Join Examples

17 Apr 2018 Examples on how to use pandas.merge to do SQL-style joins on pandas dataframes.

Introduction to AUC and Calibrated Models with Examples using Scikit-Learn

15 Apr 2018 Inspired by a podcast episode by Linear Digressions, which talks about what AUC is and what it is not and why you need well calibrated models if you want to treat their outputs as probabilities.

Read More ›

dlt jvm

Corda Framework Overview + Examples

07 Apr 2018 Overview of the main concepts of the Corda framework for building decentralized applications based on Distributed Ledger Technology (DLT).

Read More ›