Dockerless: Environment-Free Program Verifier for Coding Agents Paper • 2606.28436 • Published 6 days ago • 95
LISA: Likelihood Score Alignment for Visual-condition Controllable Generation Paper • 2606.27192 • Published 7 days ago • 13
Running the Gauntlet: Re-evaluating the Capabilities of Agents Beyond Familiar Environments Paper • 2606.14397 • Published 7 days ago • 18
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It Paper • 2606.26027 • Published 8 days ago • 18
GUI vs. CLI: Execution Bottlenecks in Screen-Only and Skill-Mediated Computer-Use Agents Paper • 2606.24551 • Published 10 days ago • 28
JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting Paper • 2606.18394 • Published 7 days ago • 34
The Verification Horizon: No Silver Bullet for Coding Agent Rewards Paper • 2606.26300 • Published 8 days ago • 46
Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence Paper • 2606.15932 • Published 16 days ago • 38
MemGUI-Agent: An End-to-End Long-Horizon Mobile GUI Agent with Proactive Context Management Paper • 2606.19926 • Published 14 days ago • 42
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 9 days ago • 144
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning Paper • 2606.26790 • Published 7 days ago • 53
V-Zero: Answer-Label-Free On-Policy Distillation with Contrastive Evidence Gating for Fine-Grained Visual Reasoning Paper • 2606.25319 • Published 8 days ago • 27
CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents Paper • 2606.22883 • Published 10 days ago • 37
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 12 days ago • 24