view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 5 days ago • 62
Accelerating RL for LLM Reasoning with Optimal Advantage Regression Paper • 2505.20686 • Published May 27, 2025 • 3
GlucoFM: A Dual-Stream Foundation Model for Continuous Glucose Monitoring Paper • 2605.30865 • Published 16 days ago • 7
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 16 days ago • 103
X-Token: Projection-Guided Cross-Tokenizer Knowledge Distillation Paper • 2605.21699 • Published 25 days ago • 1
view article Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +6 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • 18 days ago • 41
🧬 Carbon Collection Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 7 items • Updated 12 days ago • 43
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI Paper • 2605.06651 • Published May 7 • 16
Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization Paper • 2405.16681 • Published May 26, 2024 • 4
view article Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52
view article Article Introducing Storage Buckets on the Hugging Face Hub +10 Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 195
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 163