RL with verify reward
Hert4
beyoru
AI & ML interests
None yet
Recent Activity
updated a dataset about 2 hours ago
beyoru/Deepseek-v4-pro-max-distill-1000x liked a model 2 days ago
RikkaBotan/stable-static-embedding-fast-retrieval-mrl-ja published a dataset 4 days ago
beyoru/Deepseek-v4-pro-max-distill-1000x