view article Article Is it agentic enough? Benchmarking open models on your own tooling +1 lysandre, SaylorTwift, pcuenq • 2 days ago • 14
view article Article Is it agentic enough? Benchmarking open models on your own tooling +1 lysandre, SaylorTwift, pcuenq • 2 days ago • 14
view article Article The Open Source Community is backing OpenEnv for Agentic RL +16 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 12 days ago • 88
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research Paper • 2606.07591 • Published 23 days ago • 91
view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 16 days ago • 57