Sejong Yang
bismute
ยท
AI & ML interests
computer vision, video generation, world model for agent
Recent Activity
upvoted a paper about 15 hours ago
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning upvoted a paper about 15 hours ago
MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding upvoted a paper about 15 hours ago
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMsOrganizations
None yet