rasdani PRO

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago

Arcee Becomes the First Major American AI Lab to Replace AWS S3 with Hugging Face Private Storage, in a Multi-Million Dollar Commercial Partnership

published a dataset 11 days ago

PrimeIntellect/Multi-SWE-RL

updated a dataset about 1 month ago

PrimeIntellect/Multi-SWE-RL

rasdani's models 37

rasdani/deepseek_r1_qwen14b_swe_rl_8k

15B • Updated Jul 12, 2025 • 2 • 1

rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs

8B • Updated Jul 10, 2025 • 5 • 1

rasdani/qwen3_8b_swe_rl_8k

8B • Updated Jul 7, 2025 • 2

rasdani/deepseek_r1_7b_gh_patches_2k_fixed_reward

8B • Updated Jun 29, 2025 • 2

rasdani/deepseek_r1_7b_gh_patches_2k

8B • Updated Jun 28, 2025 • 2

rasdani/crux-eval_math-eval-logs

Updated Jun 25, 2025

rasdani/git-diff-Qwen-4B-10k

4B • Updated Jun 25, 2025 • 1

rasdani/git-diff-Qwen-4B-10k-checkpoints

Updated Jun 25, 2025

rasdani/git-diff-Qwen-4B-32k-checkpoints

Updated Jun 23, 2025

rasdani/git-diff-Qwen-4B-30k

4B • Updated Jun 22, 2025 • 4

rasdani/git-diff-Qwen-4B

4B • Updated Jun 17, 2025 • 1

rasdani/git-diff-Qwen-1.7B

2B • Updated Jun 16, 2025 • 2

rasdani/git-diff-Qwen-1.7-B

2B • Updated Jun 16, 2025 • 3

rasdani/simple-math-Qwen-1.5B

2B • Updated Jun 15, 2025 • 2

rasdani/qwen3_0_6b_function_rm

0.8B • Updated May 22, 2025 • 3

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-8192k

0.5B • Updated Apr 8, 2025 • 2

rasdani/Qwen2.5-0.5B-simpleRL-Zoo

Text Generation • 0.5B • Updated Apr 6, 2025 • 3

rasdani/smolR1-Qwen2.5-0.5B

Text Generation • 0.5B • Updated Mar 31, 2025 • 6

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-no-KL

Updated Mar 31, 2025

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-3072k

Updated Mar 31, 2025

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-4096k

Updated Mar 31, 2025

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2560k

Updated Mar 31, 2025

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2048k

Updated Mar 31, 2025

rasdani/Qwen2.5-0.5B-simpleRL-Zoo-first-try

0.5B • Updated Mar 29, 2025 • 2

rasdani/Qwen-1.5B-Distill-GRPO

Text Generation • 2B • Updated Mar 28, 2025 • 1

rasdani/Qwen-0.5B-Instruct-GRPO

Updated Mar 27, 2025

rasdani/gsm8k_qwen2.5-0.5b

0.5B • Updated Mar 11, 2025 • 1

rasdani/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated Mar 9, 2025

rasdani/Qwen2.5-0.5B-Open-R1-Code-GRPO

Text Generation • 0.6B • Updated Mar 8, 2025 • 5

rasdani/Qwen2.5-7B-Instruct-GRPO-unsloth

Text Generation • 8B • Updated Mar 2, 2025 • 1