stiv14/audio-caption-categorizer-model

Audio Classification
ONNX
ExecuTorch
English
audio
audio-captioning
mobile
arm
audio-caption-categorizer-model / audio-caption
46.1 MB
  • 2 contributors
History: 1 commit
stivenDR14
feat: Introduce audio captioning and categorization model with ONNX/ExecuTorch hybrid inference and category embedding generation.
5c8d855 6 months ago
  • effb2_decoder_5sec.pte (15.1 MB, xet)
  • effb2_encoder_preprocess-2.onnx (30.9 MB, xet)
  • export_decoder_executorch.py (9.67 kB)
  • export_encoder_preprocess_onnx.py (6.6 kB)
  • generate_caption_hybrid.py (4.68 kB)