VALOR
Collection
[ICLR 2026] Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" • 3 items • Updated
• 1
This is the RL-tuned Qwen3-8B model from the paper: No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers
For further information please refer to the project webpage, paper, and repository.
If you use VALOR in your research, please consider citing our work:
BibTeX:
@misc{marsili2025labelsproblemtrainingvisual,
title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers},
author={Damiano Marsili and Georgia Gkioxari},
year={2025},
eprint={2512.08889},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.08889},
}