Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

LLaMA-MoE

https://github.com/pjlab-sys4nlp/llama-moe
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

tongjingqiĀ  authored a paper 3 days ago
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
tongjingqiĀ  authored a paper 3 days ago
Adaptive Fast-and-Slow Visual Program Reasoning for Long-Form VideoQA
tongjingqiĀ  authored a paper 3 days ago
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm
View all activity

Tong Zhu's profile pictureXiaoye Qu's profile pictureJiacheng Ruan's profile pictureDaize Dong's profile picturetongjingqi(SII)'s profile pictureXuyang Hu's profile picture

llama-moe 's models 8

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

8B • Updated Dec 3, 2024 • 4 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

8B • Updated Dec 3, 2024 • 263 • 6

llama-moe/LLaMA-MoE-v1-3_0B-2_16

Text Generation • Updated Jun 25, 2024 • 134 • 11

llama-moe/LLaMA-MoE-v1-3_5B-4_16

Text Generation • Updated Jun 25, 2024 • 159 • 16

llama-moe/LLaMA-MoE-v1-3_0B-2_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 4 • 2

llama-moe/LLaMA-MoE-v1-3_5B-2_8-sft

Text Generation • 7B • Updated Jun 25, 2024 • 18 • 3

llama-moe/LLaMA-MoE-v1-3_5B-4_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 10 • 1

llama-moe/LLaMA-MoE-v1-3_5B-2_8

Text Generation • Updated Jun 25, 2024 • 445 • 15
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs