Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
new activity about 3 hours ago
CohereLabs/North-Mini-Code-1.0:Add eval results for SWE-bench Verified, SWE-bench Pro, and Terminal-Bench v2 new activity about 12 hours ago
CohereLabs/North-Mini-Code-1.0:Add evaluation results (SWE-bench Verified, SWE-bench Pro, Terminal-Bench v2) liked a model about 24 hours ago
CohereLabs/North-Mini-Code-1.0Organizations
benchmarks
RULER Datasets Falcon-H1-3B-Base
RULER Datasets
RULER Datasets Lamma3-Instruct
RULER Datasets
RULER Datasets Qwen2.5-Instruct
RULER Datasets
RULER Datasets Qwen-3-Instruct
RULER Datasets
RULER Datasets Qwen-3
RULER Datasets
agents
Agents ressources
All the ressources I found / used when getting up to speed with agents.