queirozf.com
Navigation
Tags
Archive
Archive
Newsletters
Data Newsletter
Contact
Contact
About
About
QUEIROZF.COM
Home
Entries by tag:
language-models
Including child/synonym tags
Paper Summary: Llama 2: Open Foundation and Fine-Tuned Chat Models
01 Aug 2023
paper-summary
instruction-following
language-modeling
Summary of the 2023 article "Llama 2: Open Foundation and Fine-Tuned Chat Models" by Touvron et al.
Read More ›
Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
18 Jun 2023
paper-summary
language-models
Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.
Read More ›
Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
04 Jun 2023
paper-summary
language-modeling
instruction-following
Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.
Read More ›
Paper Summary: LLaMA: Open and Efficient Foundation Language Models
04 Jun 2023
paper-summary
llms
Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.
Read More ›
Paper Summary: Self-instruct: Aligning Language Models with Self-generated Instructions
03 Jun 2023
paper-summary
language-modeling
alignment
Summary of the 2022 article "Self-instruct: Aligning Language Models with Self-generated Instructions" by Wang et al.
Read More ›
Paper Summary: Training language models to follow instructions with human feedback
05 Feb 2023
paper-summary
language-models
alignment
Summary of the 2022 article "Training language models to follow instructions with human feedback" by Ouyang et al. AKA the InstructGPT article
Read More ›
Paper Summary: Language Models are Few-Shot Learners
01 Jan 2023
paper-summary
language-models
Summary of the 2020 article "Language Models are Few-Shot Learners" by Brown et al. AKA the GPT-3 Paper.
Read More ›
Paper Summary: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
01 Jan 2023
paper-summary
language-models
Summary of the 2018 article "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" by Devlin et al.
Read More ›
Paper Summary: Language Models are Unsupervised Multitask Learners
31 Aug 2019
paper-summary
language-models
Summary of the 2019 article "Language Models are Unsupervised Multitask Learners" by Radford et al. AKA the GPT-2 Article.
Read More ›