JavisGPT-dev

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Jungang authored a paper 1 day ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

kkail8 submitted a paper 3 days ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Guan123 authored a paper 3 months ago

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

View all activity

Jungang

authored a paper 1 day ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published 6 days ago • 15

kkail8

submitted a paper to Daily Papers 3 days ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published 6 days ago • 15

Guan123

authored a paper 3 months ago

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction

Paper • 2510.03117 • Published Oct 3, 2025 • 11

Jungang

authored 2 papers 3 months ago

Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios

Paper • 2411.02708 • Published Nov 5, 2024 • 1

MOSS-ChatV: Reinforcement Learning with Process Reasoning Reward for Video Temporal Reasoning

Paper • 2509.21113 • Published Sep 25, 2025 • 5

Jungang

authored a paper 4 months ago

Mind the Third Eye! Benchmarking Privacy Awareness in MLLM-powered Smartphone Agents

Paper • 2508.19493 • Published Aug 27, 2025 • 11

Jungang

authored 6 papers 7 months ago

VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models

Paper • 2504.16359 • Published Apr 23, 2025 • 3

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

Paper • 2505.21191 • Published May 27, 2025 • 3

SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context

Paper • 2411.16213 • Published Nov 25, 2024 • 2

RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video

Paper • 2505.02064 • Published May 4, 2025 • 4

UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models

Paper • 2505.14679 • Published May 20, 2025 • 5

PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions

Paper • 2505.15472 • Published May 21, 2025 • 3

kkail8

authored a paper 9 months ago

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30, 2025 • 57

Guan123

authored a paper 9 months ago

BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain

Paper • 2409.20075 • Published Sep 30, 2024 • 2

ycsun

authored a paper 10 months ago

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Paper • 2503.16867 • Published Mar 21, 2025 • 11

Guan123

authored a paper 10 months ago

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

Paper • 2503.16867 • Published Mar 21, 2025 • 11

DwanZhang

authored 4 papers about 1 year ago

GPT-4V(ision) as A Social Media Analysis Engine

Paper • 2311.07547 • Published Nov 13, 2023 • 1

Video Understanding with Large Language Models: A Survey

Paper • 2312.17432 • Published Dec 29, 2023 • 3

DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

Paper • 2307.05628 • Published Jul 11, 2023 • 10

Cross Contrasting Feature Perturbation for Domain Generalization

Paper • 2307.12502 • Published Jul 24, 2023

AI & ML interests

Recent Activity

Team members 6

JavisGPT-dev's activity