JustinLeee/GrandLine_LLM
Tags: Question Answering, Chinese, English, chatbot, LLM, Pretrain, SFT, Distill, GRPO, CoT, Pytorch, Deepseek-MoE, Qwen3-Dense
License: mit
Branch: main
Path: GrandLine_LLM/dense
1 contributor, History: 1 commit
Latest commit: JustinLeee, "Update images via Git LFS", 5158eb1, 1 day ago
grpo_768.pth
pickle, 197 MB, xet
Detected Pickle imports (3): "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Update images via Git LFS, 1 day ago
pretrain_768.pth
pickle, 197 MB, xet
Detected Pickle imports (3): "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Update images via Git LFS, 1 day ago
sft_768.pth
pickle, 197 MB, xet
Detected Pickle imports (3): "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Update images via Git LFS, 1 day ago
thinking_distill_768.pth
pickle, 197 MB, xet
Detected Pickle imports (3): "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Update images via Git LFS, 1 day ago
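
The "Detected Pickle imports" entries above name the globals each checkpoint's pickle stream would import when loaded. A minimal sketch of how such a list could be reproduced locally without unpickling anything, using only the Python standard library. It assumes the .pth files were written by torch.save in its zip-based format, where the pickle is stored as a member named data.pkl; the helper names pickle_imports and checkpoint_imports are hypothetical, and this is not necessarily how the Hub's own scanner works.

```python
import pickletools
import zipfile


def pickle_imports(stream):
    """List 'module.name' globals referenced by one pickle stream, without unpickling it."""
    found = set()
    strings = []  # string constants seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(stream):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name", separated by a space
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Heuristic: assume the two most recent string constants are
            # module and name; memoized strings can defeat this.
            found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(found)


def checkpoint_imports(path):
    """Scan a checkpoint file; handles zip archives and a bare pickle stream."""
    if zipfile.is_zipfile(path):
        found = set()
        with zipfile.ZipFile(path) as zf:
            for member in zf.namelist():
                if member.endswith("data.pkl"):
                    found.update(pickle_imports(zf.read(member)))
        return sorted(found)
    # Fallback: read only the first pickle stream in the file
    with open(path, "rb") as f:
        return pickle_imports(f)
```

Calling checkpoint_imports("grpo_768.pth") on a downloaded copy would be expected to return names like those listed above. The three imports shown are what a plain state dict of half-precision tensors typically produces, so loading with torch.load(path, map_location="cpu", weights_only=True) is a reasonable default for these files.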