Hemant's picture

3

Hemant

kumarh1982

·

AI & ML interests

None yet

Recent Activity

upvoted an article 16 days ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

upvoted an article 4 months ago

Forge: Scalable Agent RL Framework and Algorithm

upvoted an article 4 months ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

View all activity

Organizations

None yet

upvoted an article 16 days ago

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

karina-zadorozhny

•

Jan 19

• 26

upvoted 2 articles 4 months ago

Article

Forge: Scalable Agent RL Framework and Algorithm

MiniMax-AI

•

Feb 13

• 155

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

LinkedIn

•

Jan 27

• 76

reacted to jimzhiwei's post with ❤️ about 2 years ago

Post

Sharing our paper and library for building LLM agent. The library is less than 1K code lines!
https://github.com/SalesforceAIResearch/AgentLite
https://arxiv.org/abs/2402.15538