Nathan Habib's picture

Building on HF

Nathan Habib PRO

SaylorTwift

huggingface

·

AI & ML interests

Evals

Recent Activity

liked a dataset about 7 hours ago

crosbylegal/RedlineBench

liked a model about 9 hours ago

MiniMaxAI/MiniMax-M3

upvoted an article about 9 hours ago

GLM-5.2: Built for Long-Horizon Tasks

View all activity

Organizations

liked a dataset about 7 hours ago

crosbylegal/RedlineBench

Viewer • Updated 1 day ago • 140 • 444 • 8

liked a model about 9 hours ago

MiniMaxAI/MiniMax-M3

Image-Text-to-Text • 427B • Updated 4 days ago • 67.8k • • 1.13k

upvoted an article about 9 hours ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org

•

2 days ago

• 76

liked a model about 11 hours ago

poolside/Laguna-M.1

Text Generation • 226B • Updated about 22 hours ago • 431 • 69

upvoted an article 1 day ago

Article

Is it agentic enough? Benchmarking open models on your own tooling

+1

lysandre, SaylorTwift, pcuenq

•

2 days ago

• 14

published an article 2 days ago

Article

Is it agentic enough? Benchmarking open models on your own tooling

+1

lysandre, SaylorTwift, pcuenq

•

2 days ago

• 14

liked 2 models 5 days ago

nex-agi/Nex-N2-Pro

Text Generation • 397B • Updated 8 days ago • 7.51k • 335

prefeitura-rio/Rio-3.5-Open-397B

Image-Text-to-Text • 403B • Updated 5 days ago • 191k • 325

New activity in CohereLabs/North-Mini-Code-1.0 8 days ago

Add eval results for SWE-bench Verified, SWE-bench Pro, and Terminal-Bench v2

#7 opened 8 days ago by

New activity in CohereLabs/North-Mini-Code-1.0 9 days ago

Add evaluation results (SWE-bench Verified, SWE-bench Pro, Terminal-Bench v2)

#6 opened 9 days ago by

liked a model 9 days ago

CohereLabs/North-Mini-Code-1.0

Text Generation • 30B • Updated 5 days ago • 17.7k • 458

upvoted a changelog 9 days ago

Hugging Face Changelog

Publish models from CI without HF_TOKEN

11 days ago

• 100

upvoted an article 9 days ago

Article

The Open Source Community is backing OpenEnv for Agentic RL

+16

burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego

•

12 days ago

• 88

upvoted a paper 9 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 23 days ago • 91

updated a dataset 11 days ago

SaylorTwift/harbor-assets

Updated 11 days ago • 22

published a dataset 11 days ago

SaylorTwift/harbor-assets

Updated 11 days ago • 22

New activity in MMMU/MMMU_Pro 12 days ago

Update eval.yaml

#7 opened 15 days ago by

upvoted an article 12 days ago

Article

Designing the hf CLI as an agent-optimized way to work with the Hub

celinah, Wauplin

•

16 days ago

• 57

New activity in nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4 14 days ago

Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)

#6 opened 14 days ago by

New activity in nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 14 days ago

Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)

#3 opened 14 days ago by