From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents Paper • 2603.22386 • Published 10 days ago • 54
Time Series Models Collection A collection of time series models trained by IBM • 4 items • Updated Feb 25 • 1
Granite Time Series Models Collection Time series models for forecasting, anomaly detection, classification, and more. • 9 items • Updated about 15 hours ago • 47
Article IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST Feb 18 • 18
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks Paper • 2502.05352 • Published Feb 7, 2025 • 2
Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments Feb 12 • 31
Enterprise Agents and Benchmarks Collection Enterprise agent ecosystem featuring AssetOpsBench (industrial), ITBench (SRE, FinOps, CISO), and CUGA to accelerate AI automation • 14 items • Updated 8 days ago • 15
Article AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality Jan 21 • 31
From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production Paper • 2510.23856 • Published Oct 27, 2025 • 5
Article Granite Embedding R2: Setting New Standards for Enterprise Retrieval Oct 14, 2025 • 16
Granite Docling Models Collection Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated about 16 hours ago • 60