queirozf.com
Navigation
Tags
Archive
Archive
Other Writing
Contact
Contact
About
About
QUEIROZF.COM
Home
Entries by tag:
language-modeling
Including child/synonym tags
Paper Summary: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
06 Oct 2024
paper-summary
alignment
instruction-tuning
Summary of the 2024 article "Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study" by Xu et al.
Read More ›
Paper Summary: The Science of Detecting LLM-Generated Texts
28 Jul 2024
paper-summary
language-modeling
Summary of the 2023 article "The Science of Detecting LLM-Generated Texts" by Tang et al.
Read More ›
Paper Summary: Multitask Prompted Training Enables Zero-Shot Task Generalization
31 Mar 2024
paper-summary
instruction-tuning
language-modeling
Summary of the 2021 article "Multitask Prompted Training Enables Zero-Shot Task Generalization" by Sahn et al. AKA the T0 (T-zero) article
Read More ›
Paper Summary: Constitutional AI
16 Nov 2023
paper-summary
instruction-tuning
language-models
Summary of the 2022 article "Constitutional AI" by Anthropic.
Read More ›
Paper Summary: Llama 2: Open Foundation and Fine-Tuned Chat Models
01 Aug 2023
paper-summary
instruction-following
language-modeling
Summary of the 2023 article "Llama 2: Open Foundation and Fine-Tuned Chat Models" by Touvron et al.
Read More ›
Paper Summary: Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
18 Jun 2023
paper-summary
language-models
Summary of the 2023 article "Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling" by Biderman et al.
Read More ›
Paper Summary: LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
04 Jun 2023
paper-summary
language-modeling
instruction-following
Summary of the 2023 article "LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention" by Zhang et al.
Read More ›
Paper Summary: LLaMA: Open and Efficient Foundation Language Models
04 Jun 2023
paper-summary
llms
Summary of the 2023 article "LLaMA: Open and Efficient Foundation Language Models" by Touvron et al.
Read More ›
Paper Summary: Self-instruct: Aligning Language Models with Self-generated Instructions
03 Jun 2023
paper-summary
language-modeling
alignment
Summary of the 2022 article "Self-instruct: Aligning Language Models with Self-generated Instructions" by Wang et al.
Read More ›
Paper Summary: Training language models to follow instructions with human feedback
05 Feb 2023
paper-summary
language-models
alignment
Summary of the 2022 article "Training language models to follow instructions with human feedback" by Ouyang et al. AKA the InstructGPT article
Read More ›
Paper Summary: Language Models are Few-Shot Learners
01 Jan 2023
paper-summary
language-models
Summary of the 2020 article "Language Models are Few-Shot Learners" by Brown et al. AKA the GPT-3 Paper.
Read More ›
Paper Summary: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
01 Jan 2023
paper-summary
language-models
Summary of the 2018 article "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" by Devlin et al.
Read More ›
Paper Summary: Language Models are Unsupervised Multitask Learners
31 Aug 2019
paper-summary
language-models
Summary of the 2019 article "Language Models are Unsupervised Multitask Learners" by Radford et al. AKA the GPT-2 Article.
Read More ›