Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
Zhuoran Jin
jinzhuoran
AI & ML interests
NLP
Recent Activity
upvoted a paper about 13 hours ago
Critique of Agent Model submitted a paper 3 days ago
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix ItOrganizations
None yet