SWE-FastContext Collection A family of code-search models powering the Explore subagent for coding agents. • 2 items • Updated 2 days ago • 9
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 4 days ago • 157
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 23 days ago • 35
LongDS-Bench: On the Failure of Long-Horizon Agentic Data Analysis Paper • 2605.30434 • Published 19 days ago • 23
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Paper • 2605.31264 • Published 18 days ago • 112
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published 20 days ago • 32
Granite Speech Collection Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 7 items • Updated Apr 29 • 34
Granite Embedding Collection Embedding models (bi‑encoders and rerankers) for RAG, semantic search, and retrieval tasks. • 9 items • Updated Apr 30 • 45
Retrieval from Within: An Intrinsic Capability of Attention-Based Models Paper • 2605.05806 • Published May 8 • 7
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published May 10 • 53
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 59