FINAL_Bench

Team
community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

SeaWolf-AI  updated a dataset about 15 hours ago
FINAL-Bench/service-urls
SeaWolf-AI  updated a collection 1 day ago
DARWIN-Family
SeaWolf-AI  updated a collection 1 day ago
DARWIN-Family
View all activity

Articles

SeaWolf-AI 
published an article 10 days ago
view article
Article

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

FINAL-Bench
18
SeaWolf-AI 
published an article about 1 month ago
view article
Article

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

FINAL-Bench
13
SeaWolf-AI 
published an article about 1 month ago
view article
Article

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

FINAL-Bench
13
SeaWolf-AI 
published an article about 2 months ago
view article
Article

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

FINAL-Bench
11
SeaWolf-AI 
published an article about 2 months ago
view article
Article

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

FINAL-Bench
14
SeaWolf-AI 
published an article about 2 months ago
view article
Article

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

FINAL-Bench
13
SeaWolf-AI 
published an article 3 months ago
view article
Article

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

FINAL-Bench
38
SeaWolf-AI 
published an article 3 months ago
view article
Article

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

FINAL-Bench
16
SeaWolf-AI 
published an article 3 months ago
view article
Article

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

FINAL-Bench
12
SeaWolf-AI 
published an article 3 months ago
view article
Article

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL-Bench
17
SeaWolf-AI 
published an article 3 months ago
view article
Article

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

FINAL-Bench
20