Tags: visual-programming, program-synthesis, visual-reasoning

Model Card for VALOR-8B

VALOR-8B is the RL-tuned Qwen3-8B model from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers".

For further information please refer to the project webpage, paper, and repository.
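
This card does not include usage instructions, so the following is only a minimal loading sketch. It assumes the checkpoint loads with the standard `transformers` causal-LM classes (plausible for a fine-tune of Qwen3-8B-Base, but not confirmed here) and that `torch`, `transformers`, and `accelerate` are installed. The helper names `load_valor` and `generate` are hypothetical, and the prompt format used in the paper is not stated on this card, so the prompt is passed as raw text; consult the repository for the actual inference pipeline.

```python
"""Minimal loading sketch for glab-caltech/VALOR-8B (assumptions labeled inline)."""

MODEL_ID = "glab-caltech/VALOR-8B"  # repository id as shown on this model card


def load_valor(model_id: str = MODEL_ID):
    """Load tokenizer and model in BF16, the tensor type listed on the card.

    Assumption: the checkpoint works with the generic Auto* classes.
    """
    # Imports live inside the function so this module can be imported
    # even when torch/transformers are not installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        device_map="auto",  # needs the `accelerate` package
    )
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 512) -> str:
    """Greedy-decode a completion for a plain-text prompt.

    Assumption: raw-text prompting; the authors' prompt template is not
    given on this card and may differ.
    """
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=False
    )
    # Drop the prompt tokens so only newly generated text is returned.
    prompt_len = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

A typical call would be `tokenizer, model = load_valor()` followed by `generate(tokenizer, model, "...")`. The first call downloads the weights from the Hub.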

Citation

If you use VALOR in your research, please consider citing our work:

BibTeX:

@misc{marsili2025labelsproblemtrainingvisual,
      title={No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers}, 
      author={Damiano Marsili and Georgia Gkioxari},
      year={2025},
      eprint={2512.08889},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.08889}, 
}

Safetensors

Model size: 8B params

Tensor type: BF16

Model tree for glab-caltech/VALOR-8B

Base model: Qwen/Qwen3-8B-Base

Finetuned: this model

Collection including glab-caltech/VALOR-8B

VALOR: [ICLR 2026] Models from the paper "No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers" (3 items, updated Feb 22)

Paper for glab-caltech/VALOR-8B

No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers (arXiv 2512.08889, published Dec 9, 2025)