Posts, articles, and discussions
Community posts

StackLLaMA: A hands-on guide to train LLaMA with RLHF
By April 5, 2023

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
By March 9, 2023

Red-Teaming Large Language Models
By February 24, 2023

What Makes a Dialog Agent Useful?
By January 24, 2023

Illustrating Reinforcement Learning from Human Feedback (RLHF)
By December 9, 2022