None defined yet.
CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding
MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods