Commit History

Upload aipf_dm_metric_succ_v1.zip with huggingface_hub
012567a
verified

Wendy-Fly committed on

Upload Human Autonomous Teaming - Next-Generation Human–AI Co-Intelligence_with_logo.mkv with huggingface_hub
e4adefe
verified

Wendy-Fly committed on

Upload run_eval_flex.sh with huggingface_hub
a3bcb12
verified

Wendy-Fly committed on

Upload README_eval_flex.sh with huggingface_hub
112ed8a
verified

Wendy-Fly committed on

Upload aipf_dm_metric_4T_Succ_0514.zip with huggingface_hub
b1bbe7e
verified

Wendy-Fly committed on

Upload ansa_case_results_20260418.jsonl with huggingface_hub
e9749ec
verified

Wendy-Fly committed on

Upload pairwise_comparison.py with huggingface_hub
91c518d
verified

Wendy-Fly committed on

Upload AIPF_4T_Succ_0513.zip with huggingface_hub
b8df358
verified

Wendy-Fly committed on

Upload aipf_example.zip
9718a2b
verified

Wendy-Fly committed on

add warm 4 + top5/top10 results, K sweet spot found at K=5
4c0516e
verified

Wendy-Fly committed on

warm-start experiment results vs cold 4/8
d7a1e7b
verified

Wendy-Fly committed on

default output to AIPF csv path (in-place overwrite, no cp needed)
2c780f9
verified

Wendy-Fly committed on

default col name back to 'estimated_position' (overwrite each run)
500dff7
verified

Wendy-Fly committed on

support arbitrary K for estimated_position (top1/top5/top100)
de652ba
verified

Wendy-Fly committed on

full experiment summary: AIPF / embedding / LLM all methods
92b8e5c
verified

Wendy-Fly committed on

warm-start patch 3/3: true_skill_ranking.py
d9c9653
verified

Wendy-Fly committed on

warm-start patch 2/3: find_positions.py
d399f19
verified

Wendy-Fly committed on

warm-start patch 1/3: prepare_local_eval_data.py
58c42b0
verified

Wendy-Fly committed on

AIPF warm-start patch: use embedding_position as binary search start
75f3322
verified

Wendy-Fly committed on

drop tabulate dep: write markdown table manually
3ffa31d
verified

Wendy-Fly committed on

compare embedding estimated_position vs LLM positions/scores
20981bf
verified

Wendy-Fly committed on

simplify: only add one estimated_position column (int)
bf13d1c
verified

Wendy-Fly committed on

round estimated position to integer (align with AIPF ruler_position)
588cdd6
verified

Wendy-Fly committed on

add script: write top-100 estimated position back to golden_set
2b769ad
verified

Wendy-Fly committed on

compare 4096D cosine top-100 vs t-SNE 2D top-100
f4eceff
verified

Wendy-Fly committed on

add full t-SNE: 1000 golden + 200 ruler
1670e6a
verified

Wendy-Fly committed on

add LLM (gemini/gpt) prediction columns to comparison
bddfa08
verified

Wendy-Fly committed on

fix syntax: remove Chinese quotes inside Python strings
58b909f
verified

Wendy-Fly committed on

Upload embedding_transform_eval.py with huggingface_hub
0cfad12
verified

Wendy-Fly committed on

Upload golden_top100.jsonl with huggingface_hub
add7b72
verified

Wendy-Fly committed on

Upload batch_top100_match.py with huggingface_hub
c01a125
verified

Wendy-Fly committed on

Upload golden_top5.jsonl with huggingface_hub
8a775f6
verified

Wendy-Fly committed on

Upload batch_top5_match.py with huggingface_hub
9d30467
verified

Wendy-Fly committed on

Upload ruler_tsne.py with huggingface_hub
37c2c5b
verified

Wendy-Fly committed on

Upload aipf_dm_metric.zip with huggingface_hub
a7d7258
verified

Wendy-Fly committed on

Upload 2 files
3451841
verified

Wendy-Fly committed on

Upload aipf_dm_metric_code.zip with huggingface_hub
281da15
verified

Wendy-Fly committed on

Upload AIPF.tar.gz with huggingface_hub
bc7bc77
verified

Wendy-Fly committed on

Upload aipf_dm_metric_code.zip with huggingface_hub
fb6dd26
verified

Wendy-Fly committed on

Upload requirements_extra.txt
15208b4
verified

Wendy-Fly committed on

initial commit
a5ff14b
verified

Wendy-Fly committed on