Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

JackIsNotInTheBox
/
Generate_Audio_for_Video_Checkpoints

Diffusers
Safetensors
Model card Files Files and versions
xet
Community

Instructions to use JackIsNotInTheBox/Generate_Audio_for_Video_Checkpoints with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Diffusers

    How to use JackIsNotInTheBox/Generate_Audio_for_Video_Checkpoints with Diffusers:

    pip install -U diffusers transformers accelerate
    import torch
    from diffusers import DiffusionPipeline
    
    # switch to "mps" for apple devices
    pipe = DiffusionPipeline.from_pretrained("JackIsNotInTheBox/Generate_Audio_for_Video_Checkpoints", dtype=torch.bfloat16, device_map="cuda")
    
    prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
    image = pipe(prompt).images[0]
  • Notebooks
  • Google Colab
  • Kaggle
Generate_Audio_for_Video_Checkpoints
Ctrl+K
Ctrl+K
  • 1 contributor
History: 97 commits
JackIsNotInTheBox's picture
JackIsNotInTheBox
Mirror tokenizer/tokenizer_config.json from cvssp/audioldm2@c8e7e189
0d6fc01 verified 10 days ago
  • HunyuanVideo-Foley
    Upload 4 files about 1 month ago
  • MMAudio
    Upload 3 files about 1 month ago
  • TARO
    Delete TARO/630k-audioset-best.pt about 1 month ago
  • encoders
    Mirror tokenizer/tokenizer_config.json from cvssp/audioldm2@c8e7e189 10 days ago
  • upstream
    Mirror weights/mmaudio_small_44k.pth from hkchengrex/MMAudio@eb13a1a9 10 days ago
  • .gitattributes
    2.82 kB
    Mirror tokenizer.json from google/siglip2-base-patch16-512@a89f5c50 10 days ago