-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2604.24026
-
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Paper • 2603.17187 • Published • 140 -
Attention Residuals
Paper • 2603.15031 • Published • 187 -
MOSS-TTS Technical Report
Paper • 2603.18090 • Published • 16 -
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
Paper • 2603.23516 • Published • 50
-
UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG
Paper • 2510.03663 • Published • 17 -
LLM-guided Hierarchical Retrieval
Paper • 2510.13217 • Published • 21 -
AnyUp: Universal Feature Upsampling
Paper • 2510.12764 • Published • 13 -
katanemo/Arch-Router-1.5B
Text Generation • 2B • Updated • 1.5k • • 264
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 26
-
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 21 -
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
Paper • 2604.22446 • Published • 121 -
The Last Harness You'll Ever Build
Paper • 2604.21003 • Published • 5
-
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
Paper • 2509.24832 • Published -
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 84 -
Map2World: Segment Map Conditioned Text to 3D World Generation
Paper • 2605.00781 • Published • 25 -
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 21
-
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 328 -
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Paper • 2512.23988 • Published • 19 -
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Paper • 2512.25075 • Published • 16 -
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Paper • 2512.24176 • Published • 8
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 31 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 15 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 45 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 24
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 37 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 26
-
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 21 -
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
Paper • 2604.22446 • Published • 121 -
The Last Harness You'll Ever Build
Paper • 2604.21003 • Published • 5
-
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
Paper • 2509.24832 • Published -
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 84 -
Map2World: Segment Map Conditioned Text to 3D World Generation
Paper • 2605.00781 • Published • 25 -
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 21
-
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Paper • 2603.17187 • Published • 140 -
Attention Residuals
Paper • 2603.15031 • Published • 187 -
MOSS-TTS Technical Report
Paper • 2603.18090 • Published • 16 -
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
Paper • 2603.23516 • Published • 50
-
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 328 -
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
Paper • 2512.23988 • Published • 19 -
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
Paper • 2512.25075 • Published • 16 -
Guiding a Diffusion Transformer with the Internal Dynamics of Itself
Paper • 2512.24176 • Published • 8
-
UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG
Paper • 2510.03663 • Published • 17 -
LLM-guided Hierarchical Retrieval
Paper • 2510.13217 • Published • 21 -
AnyUp: Universal Feature Upsampling
Paper • 2510.12764 • Published • 13 -
katanemo/Arch-Router-1.5B
Text Generation • 2B • Updated • 1.5k • • 264
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 31 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 15 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 45 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 24