916Company

916COM

·

https://916company.com/

AI & ML interests

None yet

Organizations

None yet

upvoted 3 articles 4 months ago

Article

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

FINAL-Bench

•

Mar 10

• 38

Article

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

FINAL-Bench

•

Mar 8

• 12

Article

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

FINAL-Bench

•

Mar 9

• 16

upvoted 2 articles 5 months ago

Article

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

FINAL-Bench

•

Feb 21

• 20

Article

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL-Bench

•

Feb 24

• 17