Solaris: Building a Multiplayer Video World Model in Minecraft • Paper • 2602.22208 • Published 14 days ago • 28
SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding • Paper • 2511.04668 • Published Nov 6, 2025 • 6
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders • Paper • 2601.16208 • Published Jan 22 • 54
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts • Paper • 2511.04655 • Published Nov 6, 2025 • 9
Solaris-Data • Collection • Training and evaluation datasets collected for Solaris: Building a Multiplayer Video World Model in Minecraft • 2 items • Updated 16 days ago • 3
Diffusion Transformers with Representation Autoencoders • Paper • 2510.11690 • Published Oct 13, 2025 • 168