s-nlp/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 20 days ago • 16
s-nlp/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 20 days ago • 16
s-nlp/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 20 days ago • 10
s-nlp/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 20 days ago • 10
s-nlp/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 20 days ago • 132
s-nlp/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 20 days ago • 132
ssurface/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 20 days ago • 45
ssurface/tool-calling-hallucination-modernbert-base-glaive-100pct Token Classification • 0.1B • Updated 20 days ago • 45
ssurface/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 20 days ago • 52
ssurface/tool-calling-hallucination-modernbert-large-glaive-100pct Token Classification • 0.4B • Updated 20 days ago • 52
ssurface/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 20 days ago • 60
ssurface/tool-calling-hallucination-modernbert-base-unified-final Token Classification • 0.1B • Updated 20 days ago • 60
Qwen3-4B CoT Compression Study Collection LoRA adapters trained for 5 progressively shorter chain-of-thought styles on GSM8K, plus the eval artifacts behind the Pareto curve. • 6 items • Updated 28 days ago • 1