Robert Zhang
0xrobertzhang
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
about 5 hours ago
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
updated
a dataset
2 days ago
DCAgent/perturbed-docker-exp-nl2bash-tasks-1
published
a dataset
2 days ago
DCAgent/perturbed-docker-exp-nl2bash-tasks-1