FINAL_Bench


Recent Activity

SeaWolf-AI  updated a Space about 23 hours ago
FINAL-Bench/Darwin-4B-Opus
SeaWolf-AI  updated a model about 24 hours ago
FINAL-Bench/Darwin-4B-Opus

Articles

SeaWolf-AI 
published an article about 22 hours ago

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

11
SeaWolf-AI 
published an article 9 days ago

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

14
SeaWolf-AI 
published an article 10 days ago

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

13
SeaWolf-AI 
published an article 30 days ago

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

38
SeaWolf-AI 
published an article about 1 month ago

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

15
SeaWolf-AI 
published an article about 1 month ago

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

12
SeaWolf-AI 
published an article about 1 month ago

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

17
SeaWolf-AI 
published an article about 2 months ago

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

20