queirozf.com

paper-summary instruction-following

Paper Summary: Direct Preference Optimization: Your Language Model is Secretly a Reward Model

23 Jun 2023   Summary of the 2023 article "Direct Preference Optimization: Your Language Model is Secretly a Reward Model" by Rafailov et al.

Read More ›

pyenv

Pyenv Examples: Managing multiple Python versions and Virtualenvs

21 Jun 2023   Examples on how to use pyenv to handle Virtualenvs and python versions on Unix-like systems such as Ubuntu and MacOS.

Read More ›

paper-summary language-models

Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

18 Jun 2023   Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.

Read More ›

paper-summary language-modeling instruction-following

Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

04 Jun 2023   Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.

Read More ›

paper-summary llms

Paper Summary: LLaMA: Open and Efficient Foundation Language Models

04 Jun 2023   Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.

Read More ›

paper-summary language-modeling alignment

Paper Summary: Self-instruct: Aligning Language Models with Self-generated Instructions

03 Jun 2023   Summary of the 2022 article "Self-instruct: Aligning Language Models with Self-generated Instructions" by Wang et al.

Read More ›

management

As a Manager: Is it Worthwhile? How Worthwhile?

28 May 2023   Some thoughts about the frame of mind you need to get into as you transition to a technical management role.

Read More ›

management

As a Manager: Stating the Obvious is Important

28 May 2023   Examples and an overview on why managers should state the obvious even when it may seem unnecessary.

Read More ›

management

As a Manager: Drive Growth by Asking Open-Ended Questions

22 May 2023   One of the ways you can foster growth in reports is to ask open-ended questions.

Read More ›

paper-summary language-models alignment

Paper Summary: Training language models to follow instructions with human feedback

05 Feb 2023   Summary of the 2022 article "Training language models to follow instructions with human feedback" by Ouyang et al. AKA the InstructGPT article

Read More ›