python pandas

Pandas Dataframe Examples: Styling Cells and Conditional Formatting

09 May 2021   Some examples on how to highlight and style cells in pandas dataframes when some criteria is met. Useful for analytics and presenting data.

Read More ›

python regex

Python Regular Expressions: Lookahead and lookbehind examples

09 May 2021   Examples for how and when to use lookaheads and lookbehinds in python regular expressions. These are ways to create matches that depend on what came before or what came after the pattern you want to match.

Read More ›

nlp preprocessing python

Normalize Text for Natural Language Processing Tasks: Reference and Examples

02 May 2021   A couple of common preprocessing tasks you need in order to be able to use raw text in NLP tools.

Read More ›


Git-LFS: Reference and Examples

11 Apr 2021   Common tasks and examples related to using Github's LFS (Large File Storage) functionality

Read More ›

project-review sequence-learning generative-models

Project Review: Generating Article Titles from Keywords

31 Mar 2021   Another project summary, this time for a client who needed an API-accessible ML system to generate text to be used as article titles, given a keyword as input.

Read More ›


Working with Functions in Bash: Reference and Examples

31 Mar 2021   Simple examples and common use cases for functions in shell scripting using bash (bourne-again shell).

Read More ›


Markdown cheatsheet and Reference: Using Images

31 Mar 2021   Quick cheatsheet with common fixes to use images in markdown documents, such as github READMEs.

Read More ›

gradient-boosting feature-engineering

Features in Boosted Tree Algorithms: Engineering, Encoding, Interaction, etc

30 Mar 2021   How to encode and engineer features correctly for use in tree boosting algorithms? How are features allowed to interact and sort of information do data scientists need to provide via manual engineering? How?

Read More ›

paper-summary model-evaluation

Paper Summary: The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

29 Mar 2021   Summary of the 2015 article "The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets" by Saito and Hemsmeier.

Read More ›

The Great Financial Crisis was Triggered by a Series of Unrelated Factors

21 Mar 2021   Thoughts on the Great Financial Crisis. Techy summary of the book "The Big Short" by Michael Lewis

Read More ›