AI & ML interests
None yet
Organizations
None yet
ShikangWang/mistral_12b_sft_dpo
12B • Updated
ShikangWang/mistral_12b_sft_300k
12B • Updated
ShikangWang/pk_family_0.3_grpo_sft_filter_kl_0.001
12B • Updated
ShikangWang/pk_grpo_sft_filter_kl_0.001
12B • Updated
ShikangWang/mistral_12b_sft_roleplay
12B • Updated
• 1
• 1
ShikangWang/smo-family-v2-0.3-filter-1127
2B • Updated
ShikangWang/mistral_12b_sft_1125
12B • Updated
• 1
ShikangWang/smo-family-v2-0.3-filter-1126
2B • Updated
ShikangWang/pk_grpo_sft_filter_kl_0.002_en_0.005
12B • Updated
ShikangWang/smo-family-0.3-filter_ep1
2B • Updated
ShikangWang/pk_family_0.3_grpo_sft_filter_kl_0.02_en_0.01
12B • Updated
ShikangWang/pk_family_0.3_grpo_sft_filter
12B • Updated
ShikangWang/mistral_12b_sft
12B • Updated
• 1
ShikangWang/smo-family-0.3-filter
2B • Updated
ShikangWang/pk_family_0.3_grpo_src
12B • Updated
ShikangWang/pk_family_0.0_grpo
12B • Updated
ShikangWang/pk_family_0.3_grpo
12B • Updated
2B • Updated
ShikangWang/smo-pk-family
2B • Updated
ShikangWang/mistral_12b_grpo_safe20k
12B • Updated
• 169
0.4B • Updated
ShikangWang/model110_grpo_safe_20kv2
12B • Updated
ShikangWang/model110_grpo_safe_20k
12B • Updated
• 7
ShikangWang/model110_grpo_50k
12B • Updated
• 3
ShikangWang/model110_grpo_10k
12B • Updated
ShikangWang/model110_dpo_ftx_10_filter20_step7500
12B • Updated
ShikangWang/model110_dpo_ftx_5_filter20
12B • Updated