Little Brains, Big Feats: Exploring Compact Language Models Paper • 2606.30062 • Published 7 days ago • 14
BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding Paper • 2606.31315 • Published 6 days ago • 73
Bridging VideoQA and Video-Guided Agentic Tasks via Generalized Keyframe Extraction Paper • 2606.29445 • Published 8 days ago • 26
TACO: Tool-Augmented Credit Optimization for Agentic Tool Use Paper • 2606.30251 • Published 7 days ago • 20
Agentic Abstention: Do Agents Know When to Stop Instead of Act? Paper • 2606.28733 • Published 9 days ago • 143
CogniRoute: Learning to Route Social Evidence in Omni-Modal Models Paper • 2606.20970 • Published 18 days ago • 4
ProMSA:Progressive Multimodal Search Agents for Knowledge-Based Visual Question Answering Paper • 2606.27974 • Published 10 days ago • 12
SingGuard: A Policy-Adaptive Multimodal LLM Guardrail with Dynamic Reasoning Paper • 2606.22873 • Published 14 days ago • 15
V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for Fine-Grained Visual Reasoning Paper • 2606.25319 • Published 12 days ago • 27
DiffusionBench: On Holistic Evaluation of Diffusion Transformers Paper • 2606.24888 • Published 13 days ago • 11
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 13 days ago • 144
Dense Reward for Multi-View 3D Reasoning with Global Maps and Local Views Paper • 2606.23557 • Published 14 days ago • 5
Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City Paper • 2606.20980 • Published 18 days ago • 3
CalVerT: Augmenting Agents with Calibrated Verifier Telemetry Improves Action and Learning in Knowledge-Intensive Tasks Paper • 2606.21777 • Published 17 days ago • 4
WorldLines: Benchmarking and Modeling Long-Horizon Stateful Embodied Agents Paper • 2606.18847 • Published 19 days ago • 5
PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models Paper • 2606.19534 • Published 19 days ago • 64