paper-summary instruction-following
Paper Summary: Direct Preference Optimization: Your Language Model is Secretly a Reward Model
23 Jun 2023 Summary of the 2023 article "Direct Preference Optimization: Your Language Model is Secretly a Reward Model" by Rafailov et al.
Pyenv Examples: Managing multiple Python versions and Virtualenvs
21 Jun 2023 Examples of how to use pyenv to manage virtualenvs and Python versions on Unix-like systems such as Ubuntu and macOS.
Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
18 Jun 2023 Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.
paper-summary language-modeling instruction-following
Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
04 Jun 2023 Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.
Paper Summary: LLaMA: Open and Efficient Foundation Language Models
04 Jun 2023 Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.
paper-summary language-modeling alignment
Paper Summary: Self-instruct: Aligning Language Models with Self-generated Instructions
03 Jun 2023 Summary of the 2022 article "Self-instruct: Aligning Language Models with Self-generated Instructions" by Wang et al.
As a Manager: Is it Worthwhile? How Worthwhile?
28 May 2023 Some thoughts about the frame of mind you need to get into as you transition to a technical management role.
As a Manager: Stating the Obvious is Important
28 May 2023 Examples and an overview of why managers should state the obvious, even when it may seem unnecessary.
As a Manager: Drive Growth by Asking Open-Ended Questions
22 May 2023 One of the ways you can foster growth in reports is to ask open-ended questions.
paper-summary language-models alignment
Paper Summary: Training language models to follow instructions with human feedback
05 Feb 2023 Summary of the 2022 article "Training language models to follow instructions with human feedback" by Ouyang et al., also known as the InstructGPT article.