·
AI & ML interests
None yet
Organizations
view article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
upvoted an article about 1 year ago view article Open-R1: a fully open reproduction of DeepSeek-R1


- +1
upvoted a paper almost 2 years ago upvoted an article almost 2 years ago view article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)