Jarl
Phyte987
AI & ML interests
None yet
Organizations
interesting
- meta-llama/Llama-3.2-1B-Instruct
  Text Generation • 1B • Updated • 5.03M • 1.38k
- stabilityai/stable-diffusion-3.5-large-turbo
  Text-to-Image • Updated • 8.68k • 667
- When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research
  Paper • 2505.11855 • Published • 10
- ArxivBench: Can LLMs Assist Researchers in Conducting Research?
  Paper • 2504.10496 • Published • 2
models 0
None public yet
datasets 0
None public yet