15 69 32

Bohan Zeng

zbhpku

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

upvoted a paper 7 days ago

WorldOlympiad: Can Your World Model Survive a Triathlon?

authored a paper 9 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

View all activity

Organizations

None yet

upvoted a paper 3 days ago

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Paper • 2606.13432 • Published 7 days ago • 99

upvoted a paper 7 days ago

WorldOlympiad: Can Your World Model Survive a Triathlon?

Paper • 2606.11129 • Published 9 days ago • 31

authored 7 papers 9 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

Towards Next-Generation LLM Training: From the Data-Centric Perspective

Paper • 2603.14712 • Published Mar 16

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

Paper • 2605.15186 • Published May 14 • 26

TraceAV-Bench: Benchmarking Multi-Hop Trajectory Reasoning over Long Audio-Visual Videos

Paper • 2605.07593 • Published May 8

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 28 days ago • 46

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 24 days ago • 38

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 10 days ago • 44

upvoted a paper 9 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 10 days ago • 44

upvoted a paper 16 days ago

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

Paper • 2606.02564 • Published 17 days ago • 29

upvoted a paper 17 days ago

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Paper • 2605.31336 • Published 20 days ago • 12

upvoted a paper 22 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 24 days ago • 38

upvoted a paper 27 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 28 days ago • 46

submitted a paper to Daily Papers 27 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 28 days ago • 46

upvoted a paper 29 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published May 18 • 22

upvoted 3 papers about 1 month ago

VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction

Paper • 2605.15186 • Published May 14 • 26

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization

Paper • 2605.10780 • Published May 12 • 33

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Paper • 2605.13062 • Published May 13 • 33

liked a dataset about 2 months ago

KlingTeam/HM-World

Updated Apr 22 • 515 • 7

Bohan Zeng

AI & ML interests

Recent Activity

Organizations

zbhpku's activity