paper-summary alignment instruction-tuning
Paper Summary: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
06 Oct 2024 Summary of the 2024 article "Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study" by Xu et al.
Read More ›paper-summary language-modeling
Paper Summary: The Science of Detecting LLM-Generated Texts
28 Jul 2024 Summary of the 2023 article "The Science of Detecting LLM-Generated Texts" by Tang et al.
Read More ›Paper Summary: Few-shot Fine-Tuning vs In-context Learning: a Fair Comparison and Evaluation
22 Jul 2024 Summary of the 2023 article "Few-shot Fine-Tuning vs In-context Learning: a Fair Comparison and Evaluation" by Mosbach et al.
Read More ›paper-summary instruction-tuning language-modeling
Paper Summary: Multitask Prompted Training Enables Zero-Shot Task Generalization
31 Mar 2024 Summary of the 2021 article "Multitask Prompted Training Enables Zero-Shot Task Generalization" by Sahn et al. AKA the T0 (T-zero) article
Read More ›Paper Summary: Learning to summarize from human feedback
09 Mar 2024 Summary of the 2020 article "Learning to summarize from human feedback" by Stiennon et al.
Read More ›paper-summary instruction-tuning
Paper Summary: Zephyr: Direct Distillation of LM Alignment
02 Jan 2024 Summary of the 2023 article "Zephyr: Direct Distillation of LM Alignment" by Tunstall et al.
Read More ›Examples: Installing and Updating Packages with Apt
01 Jan 2024 Examples on how to use apt-get and dpkg to install, uninstall, and list packages on Ubuntu and similar systems.
Read More ›Git Examples: Reverting a File from a Branch
27 Dec 2023 Many ways to reset files to what they look like on other branches, both locally and on remotes.
Read More ›paper-summary instruction-tuning language-models
Paper Summary: Constitutional AI
16 Nov 2023 Summary of the 2022 article "Constitutional AI" by Anthropic.
Read More ›Pytest Examples: Handling Exceptions
18 Oct 2023 Examples on how to use Pytest to handle exceptions
Read More ›