ReLIFT, a training method that interleaves RL with online FT, achieving superior performance and efficiency compared to using RL or SFT alone.
RoadMa
RoadQAQ
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Foundation Protocol: A Coordination Layer for Agentic Society upvoted a paper 3 months ago
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters liked a model 4 months ago
stepfun-ai/Step-3.5-Flash