Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
FSMBench
university
Activity Feed
Follow
7
AI & ML interests
Evaluating and Benchmarking Large Multimodal Models
Recent Activity
taesiri
submitted
a paper
about 15 hours ago
SkillCoach: Self-Evolving Rubrics for Evaluating and Enhancing Agentic Skill-Use
taesiri
submitted
a paper
about 15 hours ago
PACE: A Proxy for Agentic Capability Evaluation
taesiri
submitted
a paper
about 15 hours ago
Representation Distribution Matching for One-Step Visual Generation
View all activity
Team members
5
FSMBench
's models
None public yet