
paper-summary language-models

Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

18 Jun 2023   Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.

paper-summary language-modeling instruction-following

Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

04 Jun 2023   Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.

paper-summary llms

Paper Summary: LLaMA: Open and Efficient Foundation Language Models

04 Jun 2023   Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.

paper-summary language-modeling alignment

Paper Summary: Self-instruct: Aligning Language Models with Self-generated Instructions

03 Jun 2023   Summary of the 2022 article "Self-instruct: Aligning Language Models with Self-generated Instructions" by Wang et al.


As a Manager: Is it Worthwhile? How Worthwhile?

28 May 2023   Some thoughts about the frame of mind you need to get into as you transition to a technical management role.


As a Manager: Stating the Obvious is Important

28 May 2023   Examples and an overview of why managers should state the obvious even when it may seem unnecessary.


As a Manager: Drive Growth by Asking Open-Ended Questions

22 May 2023   One of the ways you can foster growth in reports is to ask open-ended questions.

paper-summary language-models alignment

Paper Summary: Training language models to follow instructions with human feedback

05 Feb 2023   Summary of the 2022 article "Training language models to follow instructions with human feedback" by Ouyang et al., AKA the InstructGPT article.

paper-summary language-models

Paper Summary: Language Models are Few-Shot Learners

01 Jan 2023   Summary of the 2020 article "Language Models are Few-Shot Learners" by Brown et al., AKA the GPT-3 paper.

paper-summary language-models

Paper Summary: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

01 Jan 2023   Summary of the 2018 article "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" by Devlin et al.
