🏗️ Building on HF

Dimitris Roussis

droussis

https://www.ilsp.gr/en/members/roussis-dimitris/

AI & ML interests

All things data for LLMs, NMT, evaluation, safety, alignment, and more

Recent Activity

liked a dataset 5 days ago

geoskyr/arena-expert-scored

upvoted a paper 14 days ago

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

liked a dataset about 1 month ago

microsoft/Orchard

upvoted a paper 14 days ago

LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents

Paper • 2605.29559 • Published 26 days ago • 17

upvoted a paper about 1 month ago

WebWorld: A Large-Scale World Model for Web Agent Training

Paper • 2602.14721 • Published Feb 16 • 19

upvoted a paper about 2 months ago

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 68

upvoted a paper 2 months ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 84

upvoted a changelog 3 months ago

Hugging Face Papers for AI Agents

Hugging Face Changelog • Mar 18 • 142

upvoted a paper 3 months ago

When AI Navigates the Fog of War

Paper • 2603.16642 • Published Mar 17 • 31

upvoted a paper 4 months ago

DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder

Paper • 2602.00592 • Published Jan 31 • 2

upvoted 2 papers 11 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7, 2025 • 133

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

upvoted an article 12 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 780

upvoted a paper 12 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 79

upvoted 2 papers about 1 year ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

Krikri: Advancing Open Large Language Models for Greek

Paper • 2505.13772 • Published May 19, 2025 • 6

upvoted 3 collections about 1 year ago

upvoted a paper about 1 year ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 173

upvoted a paper over 1 year ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Paper • 2503.11751 • Published Mar 14, 2025 • 17

upvoted an article over 1 year ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497