api-debug-env / openenv.yaml
yadnyeshkolte's picture
Resubmission: strict grader, tests, randomization, improved rewards
486044c
raw
history blame contribute delete
792 Bytes
spec_version: 1
name: api_debug_env
type: space
runtime: fastapi
app: server.app:app
port: 8000
description: >
API Integration Debugging Environment — an AI agent must diagnose and fix
broken API integrations by reading error logs, inspecting configurations,
and submitting corrected API calls. Supports 3 difficulty levels with
seed-based randomization for reproducible evaluation.
tasks:
- id: easy
description: "Fix missing Authorization header and wrong Content-Type in a payment API client"
max_steps: 15
- id: medium
description: "Debug a webhook chain with rate limiting, retry, and signature validation failures"
max_steps: 25
- id: hard
description: "Diagnose cascading failures across a 5-service order processing pipeline"
max_steps: 40