Posts, articles, and discussions
Community posts
StackLLaMA: A hands-on guide to train LLaMA with RLHF
April 5, 2023
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
March 9, 2023
Red-Teaming Large Language Models
February 24, 2023
What Makes a Dialog Agent Useful?
January 24, 2023
Illustrating Reinforcement Learning from Human Feedback (RLHF)
December 9, 2022