·
AI & ML interests
None yet
Organizations
None yet
gurpreetmukker/smollm2-finetuned-chat-instruct-lora-adapters
Updated
gurpreetmukker/full_model_fine_tuned
Text Generation
•
0.1B
•
Updated
•
1
gurpreetmukker/full_model_fine_tuned2
0.1B
•
Updated
gurpreetmukker/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
gurpreetmukker/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
1
gurpreetmukker/Reinforce-cartpole-default-hp
Reinforcement Learning
•
Updated
gurpreetmukker/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
gurpreetmukker/ppo-LunarLander-v2-3M
Reinforcement Learning
•
Updated
•
1
gurpreetmukker/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
gurpreetmukker/taxiv3-qlearning
Reinforcement Learning
•
Updated
gurpreetmukker/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
gurpreetmukker/ppo-LunarLander-v2-0.0012
Updated