queirozf.com

paper-summary reinforcement-learning rlhf

Paper Summary: Deep Reinforcement Learning from Human Preferences

15 Jul 2023   Summary of the 2017 article "Deep Reinforcement Learning from Human Preferences" by Christiano et al. AKA the RLHF article.

Read More ›

jenv

Jenv Examples on MacOS

07 Jul 2023   Examples on how to config multiple java versions on MacOS using jenv

Read More ›

paper-summary instruction-following

Paper Summary: Fine-tuned Language models are Zero-Shot Learners

02 Jul 2023   Summary of the 2022 article "Fine-tuned Language models are Zero-Shot Learners" by Wei et al, aka the FLAN article.

Read More ›

paper-summary

Paper Summary: Cross-Task Generalization via Natural Language Crowdsourcing Instructions

25 Jun 2023   Summary of the 2022 article "Cross-Task Generalization via Natural Language Crowdsourcing Instructions" by Mishra et al.

Read More ›

python-3 regex

Python 3 Regex: Named Capture Examples

25 Jun 2023   Examples on how to use named capture gropus in Python regular expressions.

Read More ›

paper-summary instruction-following

Paper Summary: Direct Preference Optimization: Your Language Model is Secretly a Reward Model

23 Jun 2023   Summary of the 2023 article "Direct Preference Optimization: Your Language Model is Secretly a Reward Model" by Rafailov et al.

Read More ›

pyenv

Pyenv Examples: Managing multiple Python versions and Virtualenvs

21 Jun 2023   Examples on how to use pyenv to handle Virtualenvs and python versions on Unix-like systems such as Ubuntu and MacOS.

Read More ›

paper-summary language-models

Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

18 Jun 2023   Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.

Read More ›

paper-summary language-modeling instruction-following

Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

04 Jun 2023   Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.

Read More ›

paper-summary llms

Paper Summary: LLaMA: Open and Efficient Foundation Language Models

04 Jun 2023   Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.

Read More ›