How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models Paper • 2604.21106 • Published 7 days ago • 7
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5 Updated 11 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5 Updated 11 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-3e-5 Updated 15 days ago
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 15 days ago
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 15 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-3e-5 Updated 15 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-5e-5 Updated 17 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-5e-5 Updated 17 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 19 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 19 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-2e-5 Updated 20 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-2e-5 Updated 20 days ago