OpenDataArena/ODA-Fin-RL-8B
Reinforcement Learning β’ 8B β’ Updated β’ 48 β’ 1
Data-centric AI, LLM, MLLM
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training