Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
96
1
1
Alex Shaw
alexgshaw
Follow
lincolnhuj's profile picture
evalstate's profile picture
bhargavi909's profile picture
8 followers
·
7 following
https://www.tbench.ai/
alexgshaw
alexgshaw
alexgshaw
AI & ML interests
None yet
Recent Activity
new
activity
1 day ago
harborframework/terminal-bench-2-leaderboard:
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
new
activity
1 day ago
harborframework/terminal-bench-2-leaderboard:
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
new
activity
1 day ago
harborframework/terminal-bench-2-leaderboard:
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
View all activity
Organizations
alexgshaw
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
harborframework/terminal-bench-2-leaderboard
1 day ago
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
1
#111 opened 1 day ago by
chefeckert
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
1
#110 opened 1 day ago by
chefeckert
Add WozCode (Claude Opus 4.6) submission - 68.1% on terminal-bench 2.0
1
#109 opened 1 day ago by
chefeckert
New activity in
harborframework/terminal-bench-2-leaderboard
2 days ago
Add Pilot (Claude Opus 4.6) — 82.0% on Terminal Bench 2.0
1
#108 opened 2 days ago by
alekspetrov
New activity in
harborframework/terminal-bench-2-leaderboard
3 days ago
Add WozCode (Claude Opus 4.6) submission - 63.1% on terminal-bench 2.0
1
#107 opened 3 days ago by
chefeckert
New activity in
harborframework/terminal-bench-2-leaderboard
4 days ago
Add Meta-Harness (Claude Opus 4.6) submission
1
#106 opened 4 days ago by
yoonholee
Add Meta-Harness (Claude Opus 4.6) submission
2
#105 opened 4 days ago by
yoonholee
New activity in
harborframework/terminal-bench-2-leaderboard
5 days ago
Add WozCode (Claude Opus 4.6) submission - 63.1% on terminal-bench 2.0
1
#101 opened 5 days ago by
chefeckert
New activity in
harborframework/terminal-bench-2-leaderboard
6 days ago
Add WozCode (Claude Opus 4.6) submission - 63.1% on terminal-bench 2.0
2
#100 opened 6 days ago by
chefeckert
New activity in
harborframework/terminal-bench-2-leaderboard
9 days ago
Add IndusAGI Coding Agent submission (MiniMax-M2.7)
1
#97 opened 9 days ago by
varun324242
Add Codelia GPT-5.3-Codex submission
1
#96 opened 9 days ago by
kousw
New activity in
harborframework/terminal-bench-2-leaderboard
11 days ago
Add Kiro CLI + Claude Opus 4.6 submission
2
#93 opened 11 days ago by
MingxuanWang
Add Kiro CLI + Claude Opus 4.6 submission
2
#92 opened 11 days ago by
mxWWWWWWw
New activity in
harborframework/terminal-bench-2-leaderboard
12 days ago
Add BashAgent + TermiGen-32B submission for Terminal-Bench 2.0
9
#91 opened 12 days ago by
yuzhounie
New activity in
harborframework/terminal-bench-2-leaderboard
13 days ago
Add IndusAGI Coding Agent submission (gpt-5.3-codex)
#90 opened 13 days ago by
varun324242
New activity in
harborframework/terminal-bench-2-leaderboard
15 days ago
Add cchuter__minimax-m2.5 submission
2
#87 opened 15 days ago by
cchuter
New activity in
harborframework/terminal-bench-2-leaderboard
17 days ago
Add OpenSage submission
2
#79 opened 22 days ago by
3rdn4
Add TongAgents gemini 3.1 pro submission
1
#80 opened 22 days ago by
dunyiguo
New activity in
harborframework/terminal-bench-2-leaderboard
18 days ago
MAYA-V2 Adya Submission
1
#83 opened 19 days ago by
thilak9
Add ForgeCode__Opus-4.6 submission
1
#85 opened 18 days ago by
ssddtc
Load more