Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ryan-u's picture
6 8 11

ryan-u

ryan-u
louie-m's profile picture Zerohertz's profile picture 21world's profile picture
·
  • bzantium

AI & ML interests

None yet

Organizations

Kakao Corp.'s profile picture

New activity in kakaocorp/kanana-2-30b-a3b-thinking-2601 2 months ago

Kanana serving docs: vLLM version difference for RoPE/YaRN config

1
#1 opened 2 months ago by
seongsubae
New activity in Qwen/Qwen3-Next-80B-A3B-Instruct 6 months ago

fix the blog link

1
#6 opened 6 months ago by
ryan-u
New activity in OpenCoder-LLM/opc-annealing-corpus 8 months ago

algorithmic_corpus and synthetic_qa are swapped

#6 opened 8 months ago by
ryan-u
commented a paper 9 months ago

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 96 •
5
commented 3 papers over 1 year ago

Upcycling Large Language Models into Mixture of Experts

Paper • 2410.07524 • Published Oct 10, 2024 • 4 •
3

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 80 •
12

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 80 •
12
New activity in EleutherAI/polyglot-ko-5.8b over 1 year ago

모델 저작권 문의 [copyrights]

2
#1 opened over 1 year ago by
hyerong
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs