Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MoYoYoTech
/
llm_mutil_npu

Model card Files Files and versions
xet
Community
llm_mutil_npu / src
86.2 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 1 commit
xianglarry's picture
xianglarry
Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU
4b9fefd 9 days ago
  • device_weights.cpp
    10.7 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago
  • main_cli.cpp
    41.1 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago
  • model_config.cpp
    5.41 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago
  • runner.cpp
    17.6 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago
  • safetensors_loader.cpp
    5.08 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago
  • tokenizer.cpp
    6.21 kB
    Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU 9 days ago