v13: Filled mechanism memo with seed-42 analysis — all 7 hypothesis verdicts computed 3e3663f verified narcolepticchicken commited on 2 days ago
Update README with honest results, clean structure, paper reference 0f60772 verified narcolepticchicken commited on 2 days ago
Add workshop paper: Compute Is Not Neutral 5a85f0b verified narcolepticchicken commited on 3 days ago
Upload results/debate_local_results_qwen3coder30b.json with huggingface_hub 61a7833 verified narcolepticchicken commited on May 14
Collapse mechanism: seed=42,cond=adversary_weak 764c541 verified narcolepticchicken commited on May 12
Collapse mechanism: seed=42,cond=confidence_weighted_3round bd13a3c verified narcolepticchicken commited on May 12
Collapse mechanism: seed=42,cond=judge_vote_3round 14a65ef verified narcolepticchicken commited on May 12
Collapse mechanism: seed=42,cond=randomized_order_3round a6484ad verified narcolepticchicken commited on May 12
Collapse mechanism: seed=42,cond=equal_token_unequal_turn 4040cf9 verified narcolepticchicken commited on May 11
Collapse mechanism: seed=42,cond=equal_3round_traced b1e5c8c verified narcolepticchicken commited on May 11
Collapse mechanism: seed=42,cond=baseline_1round_traced d4fa9a7 verified narcolepticchicken commited on May 11
Fix: handle both v2 (per_seed) and v3 (seeds) output formats a3dbabc verified narcolepticchicken commited on May 11
Upload jobs/occ_debate_collapse_mechanism_v3.py 26a56c7 verified narcolepticchicken commited on May 11
Upload jobs/occ_debate_collapse_mechanism_v2.py d48ed63 verified narcolepticchicken commited on May 11
Upload reports/debate_extended_baselines_2seed.json 0755d5f verified narcolepticchicken commited on May 10
Upload reports/truthfulqa_allenai_results.json ef70478 verified narcolepticchicken commited on May 10
Upload reports/debate_global_pool_v2_results.json ae5370d verified narcolepticchicken commited on May 7