Article A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 26
Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective LinkedIn • Jan 27 • 76
Post Sharing our paper and library for building LLM agents. The library is fewer than 1K lines of code! https://github.com/SalesforceAIResearch/AgentLite https://arxiv.org/abs/2402.15538