🧬 Carbon Collection Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 6 items • Updated 1 day ago • 28
Zero-To-CAD Collection Datasets (1M & 100K) and model for synthesizing executable CAD programs from an LLM in a CadQuery environment. No real data used. • 3 items • Updated 27 days ago • 18
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs nielsr • Apr 7 • 62
GR00T-N1.7 Collection NVIDIA Isaac GR00T N1.7 open vision-language-action (VLA) model for generalized humanoid • 5 items • Updated 3 days ago • 10
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 16 • 71
PGC Psychiatric GWAS Summary Statistics Collection ~1 billion rows of genome-wide association study (GWAS) NOTE: We are in the process to transfer these datasets to the Psychiatric Genomics Consortiu • 12 items • Updated Apr 14 • 91
view article Article Custom Kernels for All from Codex and Claude +2 burtenshaw, sayakpaul, ariG23498, evalstate • Feb 13 • 78
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published Aug 20, 2025 • 69
Build error Agents 3 Qwen2.5‑Coder‑7B‑Instruct‑Omni1.1 (Isaac Sim Assistant) 🚀 3 Chat with an AI assistant for Isaac Sim robotics development