Submitted by Kaicheng Yang 8 UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards DeepGlint 4 2