CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python
Stars: 132 Issues: 3 Forks: 11
https://github.com/CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) - GitHub - CarperAI/trlx: A repo for distributed training of language models with Reinforcem...