MoYoYoTech/llm_mutil_npu
main
llm_mutil_npu/include (66.4 kB)
2 contributors
History: 1 commit
xianglarry: Initial C++ aclnn EAGER inference for Qwen3-235B-A22B MoE on Ascend 910 × 16 NPU (4b9fefd, 9 days ago)
acl_common.h            3.9 kB
acl_runtime.h           1.16 kB
aclnn_ops.h             15.8 kB
device_weights.h        3.51 kB
engine.h                17.9 kB
hccl_comm.h             3.84 kB
model_config.h          2.29 kB
rope.h                  4.87 kB
runner.h                5.84 kB
safetensors_loader.h    2.63 kB
tokenizer.h             1.51 kB
workspace_pool.h        3.09 kB