Arjun Jagota's picture

Arjun Jagota PRO

ajagota71

·

ajagota7

AI & ML interests

None yet

Recent Activity

updated a dataset 19 days ago

ajagota71/fairl-openrlhf-features

published a dataset 19 days ago

ajagota71/fairl-openrlhf-features

updated a dataset 28 days ago

ajagota71/irl-experiment-v2

View all activity

Organizations

None yet

ajagota71 's models 652

ajagota71/qwen3b-alpaca-sft

Updated Nov 14, 2025

ajagota71/smollm2-360m-saferlhf-ppo-lag-10epoch

Text Generation • 0.4B • Updated Sep 25, 2025 • 1

ajagota71/smollm2-360m-saferlhf-ppo-lag-3epoch

Text Generation • 0.4B • Updated Sep 25, 2025 • 1

ajagota71/smollm2-360m-saferlhf-ppo-1epoch

Text Generation • 0.4B • Updated Sep 24, 2025 • 2

ajagota71/tinyllama-saferlhf-ppo-1epoch

Text Generation • 1B • Updated Sep 24, 2025 • 1

ajagota71/smol-sft-135m

Text Generation • 0.1B • Updated Sep 24, 2025 • 1

ajagota71/gemma-3-270m-detox

Reinforcement Learning • 0.3B • Updated Aug 16, 2025

ajagota71/gemma-3-270m-detox-checkpoint-epoch-100

Reinforcement Learning • 0.3B • Updated Aug 16, 2025 • 1

ajagota71/gemma-3-270m-detox-checkpoint-epoch-80

Reinforcement Learning • 0.3B • Updated Aug 16, 2025 • 1

ajagota71/Qwen2.5-0.5B-detox

Reinforcement Learning • 0.5B • Updated Aug 15, 2025 • 1

ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-100

Reinforcement Learning • 0.5B • Updated Aug 15, 2025 • 1

ajagota71/gemma-3-270m-detox-checkpoint-epoch-60

Reinforcement Learning • 0.3B • Updated Aug 15, 2025 • 1

ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-80

Reinforcement Learning • 0.5B • Updated Aug 15, 2025 • 1

ajagota71/gemma-3-270m-detox-checkpoint-epoch-40

Reinforcement Learning • 0.3B • Updated Aug 15, 2025 • 1

ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-60

Reinforcement Learning • 0.5B • Updated Aug 15, 2025 • 1

ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-40

Reinforcement Learning • 0.5B • Updated Aug 15, 2025

ajagota71/gemma-3-270m-detox-checkpoint-epoch-20

Reinforcement Learning • 0.3B • Updated Aug 15, 2025

ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-20

Reinforcement Learning • 0.5B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox-checkpoint-epoch-100

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox-checkpoint-epoch-80

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox-checkpoint-epoch-100

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox-checkpoint-epoch-80

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox-checkpoint-epoch-60

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox-checkpoint-epoch-60

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox-checkpoint-epoch-40

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox-checkpoint-epoch-40

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-360M-detox-checkpoint-epoch-20

Reinforcement Learning • 0.4B • Updated Aug 15, 2025 • 1

ajagota71/SmolLM2-135M-detox-checkpoint-epoch-20

Reinforcement Learning • 0.1B • Updated Aug 15, 2025 • 1