Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
4
Shouren Wang
PRO
ShourenWSR
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 20 hours ago
ShourenWSR/Qwen3-4B-Instruct2507-PLE-Initialized
published
a model
about 20 hours ago
ShourenWSR/Qwen3-4B-Instruct2507-PLE-Initialized
updated
a dataset
about 20 hours ago
ShourenWSR/superior_reasoning_20k_1to1
View all activity
Organizations
None yet
ShourenWSR
's models
83
Sort: Recently updated
ShourenWSR/Qwen3-4B-Instruct2507-PLE-Initialized
7B
•
Updated
about 20 hours ago
ShourenWSR/Qwen3-4B-Dense-Merged-Soup-30k
Text Generation
•
4B
•
Updated
2 days ago
•
38
ShourenWSR/Qwen3-4B-Base-PLE-Superior-Hybrid-30k
7B
•
Updated
2 days ago
•
30
ShourenWSR/Qwen3-4B-Dense-Think-30k
196k
•
Updated
2 days ago
•
35
ShourenWSR/Qwen3-4B-Dense-NoThink-30k
196k
•
Updated
2 days ago
•
35
ShourenWSR/Phi4-Mini-PL-MoE-20k
6B
•
Updated
2 days ago
•
15
ShourenWSR/LLaMA3.1-8B-PL-MoE-Superior-27k-27k
14B
•
Updated
2 days ago
•
15
ShourenWSR/LLaMA3.1-8B-PL-MoE-20k
14B
•
Updated
2 days ago
•
17
ShourenWSR/Qwen3-4B-PLE-Superior-Hybrid-30k
7B
•
Updated
5 days ago
•
57
ShourenWSR/Qwen3-4B-PLE-Initialized
7B
•
Updated
8 days ago
•
14
ShourenWSR/Qwen3-4B-Base-PLE-Initialized
7B
•
Updated
8 days ago
•
19
ShourenWSR/Qwen3-8B-Base-Instruct-Stage1-to-be-deleted
308k
•
Updated
May 1
•
7
ShourenWSR/Qwen3-4B-Base-Instruct-Stage2-Superior-65k-27k-to-be-deleted
7B
•
Updated
May 1
•
6
ShourenWSR/Qwen3-4B-Base-Instruct-Stage2-Superior-27k-27k-to-be-deleted
7B
•
Updated
May 1
•
7
ShourenWSR/Qwen3-4B-Base-Instruct-Stage1-to-be-deleted
4B
•
Updated
May 1
•
3
ShourenWSR/Qwen3-4B-V2-Superior-Hybrid-30k
7B
•
Updated
May 1
•
9
ShourenWSR/Qwen3-4B-Base-Superior-65k-27k
7B
•
Updated
May 1
•
5
ShourenWSR/Qwen3-4B-Instruct-Superior-65k-27k
7B
•
Updated
May 1
•
8
ShourenWSR/Qwen3-4B-Instruct-Superior-27k-27k
7B
•
Updated
May 1
•
6
ShourenWSR/Qwen3-4B-Instruct-NaiveMix-140k
7B
•
Updated
May 1
•
7
ShourenWSR/Qwen3-4B-Base-NaiveMix-140k
7B
•
Updated
May 1
•
5
ShourenWSR/Qwen3-4B-NaiveMix-140k
7B
•
Updated
May 1
•
7
ShourenWSR/Qwen3-4B-Baseline-2Phase-Superior-65k-27k-Phase2
196k
•
Updated
May 1
•
7
ShourenWSR/Qwen3-4B-Baseline-2Phase-Superior-65k-27k-Phase1
196k
•
Updated
May 1
•
8
ShourenWSR/Qwen3-4B-Baseline-2Phase-NaiveMix-140k-Phase2
196k
•
Updated
May 1
•
10
ShourenWSR/Qwen3-4B-Baseline-2Phase-NaiveMix-140k-Phase1
196k
•
Updated
May 1
•
5
ShourenWSR/Qwen2.5-7B-PL-MoE-Superior-27k-27k-v2
13B
•
Updated
Mar 30
•
2
ShourenWSR/Qwen3-4B-PL-MoE-V2-Superior-Hybrid-8k
Feature Extraction
•
7B
•
Updated
Mar 30
•
9
ShourenWSR/Qwen2.5-7B-PL-MoE-Superior-27k-27k-ckpt2400
13B
•
Updated
Mar 30
•
9
ShourenWSR/Qwen3-4B-PL-MoE-Initialized-V2
7B
•
Updated
Mar 29
•
2
Previous
1
2
3
Next