Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ibm-research 's Collections
Enterprise Agents and Benchmarks
REAL-MM-RAG-Bench_BEIR
AI-Agent-4-Industry-4.0
Otter-Knowledge
REAL-MM-RAG-Bench
Granite 3.2 Models (GGUF)
Materials

Enterprise Agents and Benchmarks

updated 7 days ago

Enterprise agent ecosystem featuring AssetOpsBench (industrial) and ITBench (SRE, FinOps, CISO), CUGA to accelerate AI Automation

Upvote
13

  • Running
    16

    AssetOpsBench

    🚀
    16

    Generate and benchmark machine learning models with ease


  • Running
    Featured
    94

    CUGA Agent

    🤖
    94

    Configurable Generalist Agent, leader in AppWorld Benchmark


  • Running
    3

    ITBench-Lite-Space

    🚀
    3

    Develop and run interactive code notebooks with JupyterLab


  • ibm-research/AssetOpsBench

    Viewer • Updated Jan 10 • 152 • 820 • 9

  • ibm-research/ITBench-Lite

    Updated 13 days ago • 9.11k • 5

  • ibm-research/ITBench-Trajectories

    Updated 24 days ago • 96 • 3

  • AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

    Paper • 2506.03828 • Published Jun 4, 2025 • 16

  • ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

    Paper • 2502.05352 • Published Feb 7, 2025 • 1

  • Survey on Evaluation of LLM-based Agents

    Paper • 2503.16416 • Published Mar 20, 2025 • 96
Upvote
13
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs