Minju Gwak
PRO
talzoomanzoo
1 follower · 1 following
https://minjugwak.netlify.app/
AI & ML interests
None yet
Recent Activity
Updated a dataset 32 minutes ago: talzoomanzoo/Superior-Reasoning-SFT-gpt-oss-120b-5000
Published a dataset 32 minutes ago: talzoomanzoo/Superior-Reasoning-SFT-gpt-oss-120b-5000
Authored a paper 1 day ago: Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Organizations
talzoomanzoo's models (126)
talzoomanzoo/qwen3-30b-base-um-math • Updated 5 days ago
talzoomanzoo/qwen3-8b-base-superior • Updated 5 days ago
talzoomanzoo/qwen3-8b-base-um-math • Updated 5 days ago
talzoomanzoo/qwen3-4b-base-math-32k • Text Generation • Updated 6 days ago • 25
talzoomanzoo/eurus_grpo_rlmia_epoch_2 • 8B • Updated 7 days ago • 14
talzoomanzoo/eurus_grpo_rlmia_epoch_1 • 8B • Updated 7 days ago • 37
talzoomanzoo/eurus_grpo_rlmia_epoch_0 • 8B • Updated 7 days ago • 43
talzoomanzoo/eurus_member3_new_epoch2 • 8B • Updated 8 days ago • 11
talzoomanzoo/eurus_member3_new_epoch1 • Updated 8 days ago • 20
talzoomanzoo/eurus_member3_new_epoch0 • 8B • Updated 8 days ago • 22
talzoomanzoo/eurus_member2_new_epoch2 • 8B • Updated 8 days ago • 13
talzoomanzoo/eurus_member2_new_epoch1 • 8B • Updated 8 days ago • 19
talzoomanzoo/eurus_member2_new_epoch0 • 8B • Updated 8 days ago • 29
talzoomanzoo/qwen3-4b-base-rewrite-filtered-32k • Text Generation • Updated 8 days ago • 15
talzoomanzoo/qwen3-4b-base-distill-sft-32k • Text Generation • Updated 8 days ago • 22
talzoomanzoo/qwen3-4b-base-rewrite-sft-32k • Text Generation • Updated 8 days ago • 17
talzoomanzoo/qwen3-4b-instruct-rewrite-sft • Text Generation • Updated 11 days ago • 23
talzoomanzoo/qwen3-4b-instruct-distill-lora • Text Generation • Updated 11 days ago • 39
talzoomanzoo/pa2 • 1B • Updated 13 days ago • 32
talzoomanzoo/limr_grpo_rlmia_new_epoch_2 • 8B • Updated 13 days ago • 17
talzoomanzoo/limr_grpo_rlmia_new_epoch_1 • 8B • Updated 13 days ago • 16
talzoomanzoo/limr_grpo_rlmia_new_epoch_0 • 8B • Updated 13 days ago • 39
talzoomanzoo/cas4133-assn2-final-p • 1B • Updated 16 days ago • 34
talzoomanzoo/limr_grpo_rlmia_epoch_2 • 8B • Updated 16 days ago • 13
talzoomanzoo/limr_grpo_rlmia_epoch_1 • 8B • Updated 16 days ago • 25
talzoomanzoo/limr_grpo_rlmia_epoch_0 • 8B • Updated 16 days ago • 22
talzoomanzoo/cas4133-assn2-dpo-p • 1B • Updated 17 days ago • 22
talzoomanzoo/cas4133-assn2-sft-p • 1B • Updated 17 days ago • 16
talzoomanzoo/dpo-trained • Updated 26 days ago
talzoomanzoo/dpo-final • Updated 26 days ago