IBM

company

Verified

https://www.ibm.com/

AI & ML interests

Enterprise AI and ML, Foundation Models, Responsible AI

Recent Activity

DhavalPatel submitted a paper 16 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

DhavalPatel submitted a paper about 1 month ago

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

DhavalPatel submitted a paper about 2 months ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

View all activity

Papers

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

View all Papers

submitted a paper to Daily Papers 16 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Paper • 2606.19704 • Published 17 days ago • 41

submitted a paper to Daily Papers about 1 month ago

Evaluating Temporal Semantic Caching and Workflow Optimization in Agentic Plan-Execute Pipelines

Paper • 2605.20630 • Published May 20 • 12

submitted 3 papers to Daily Papers about 2 months ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

Paper • 2605.18827 • Published May 12 • 7

DiagnosticIQ: A Benchmark for LLM-Based Industrial Maintenance Action Recommendation from Symbolic Rules

Paper • 2605.08614 • Published May 9 • 7

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

Paper • 2605.14051 • Published May 13 • 1

authored 3 papers 2 months ago

ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation

Paper • 2507.19492 • Published May 31, 2025 • 1

Composition-Grounded Instruction Synthesis for Visual Reasoning

Paper • 2510.15040 • Published Oct 16, 2025

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

submitted a paper to Daily Papers 3 months ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published Mar 23 • 57

submitted a paper to Daily Papers 4 months ago

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Paper • 2603.08397 • Published Mar 9 • 24

in ibm/risk-atlas-nexus 4 months ago

iv_chart_updates

#16 opened 4 months ago by

in ibm/risk-atlas-nexus 4 months ago

iv_update_version

#15 opened 6 months ago by

in ibm/risk-atlas-nexus 4 months ago

iv_update_version

#15 opened 6 months ago by

in ibm/biomed-multi-alignment 6 months ago

revive_space

#1 opened 6 months ago by

in ibm/risk-atlas-nexus 7 months ago

iv_repo_renaming

#14 opened 7 months ago by

Update to the graph view

#13 opened 7 months ago by

in ibm/risk-atlas-nexus 7 months ago

iv_repo_renaming

#14 opened 7 months ago by

Update to the graph view

#13 opened 7 months ago by

in ibm/risk-atlas-nexus 9 months ago

iv_spacing

#12 opened 9 months ago by

in ibm/risk-atlas-nexus 9 months ago

iv_spacing

#12 opened 9 months ago by