Text-to-Audio
Diffusers
Safetensors
English
Chinese
MossSoundEffectPipeline
diffusion
flow-matching
sound-effects
audio-generation
Instructions to use OpenMOSS-Team/MOSS-SoundEffect-v2.0 with libraries and notebooks.
- Libraries
- Diffusers
How to use OpenMOSS-Team/MOSS-SoundEffect-v2.0 with Diffusers:

pip install -U diffusers transformers accelerate

import torch
from diffusers import DiffusionPipeline

# Load the pipeline; switch device_map to "mps" for Apple devices.
pipe = DiffusionPipeline.from_pretrained(
    "OpenMOSS-Team/MOSS-SoundEffect-v2.0",
    dtype=torch.bfloat16,
    device_map="cuda",
)

# Describe the sound effect to generate (this is a text-to-audio model).
prompt = "Heavy rain on a tin roof with distant thunder"

# The field holding the generated waveform is not documented here; inspect `output`.
output = pipe(prompt)

- Notebooks
- Google Colab
- Kaggle
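Once a waveform has been generated, it usually needs to be written to disk. The following is a minimal sketch, not the pipeline's own export path: it assumes the audio is available as a mono sequence of floats in [-1.0, 1.0] and uses the 48000 Hz `sample_rate` listed in the pipeline config. The helper name `write_wav` is hypothetical, and a 440 Hz sine tone stands in for real model output.

```python
import math
import os
import struct
import tempfile
import wave

SAMPLE_RATE = 48_000  # "sample_rate" from the pipeline config


def write_wav(path, samples, sample_rate=SAMPLE_RATE):
    """Write mono float samples in [-1.0, 1.0] as 16-bit PCM WAV (hypothetical helper)."""
    frames = bytearray()
    for s in samples:
        s = max(-1.0, min(1.0, float(s)))  # clip to the valid range
        frames += struct.pack("<h", int(round(s * 32767)))  # little-endian int16
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)        # mono (assumption)
        wav.setsampwidth(2)        # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(bytes(frames))
    return len(samples)


# Stand-in for model output: one second of a 440 Hz tone at half amplitude.
tone = [0.5 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]
wav_path = os.path.join(tempfile.gettempdir(), "moss_tone_example.wav")
n_written = write_wav(wav_path, tone)
```

If the pipeline returns a tensor or NumPy array instead, converting it to a flat list of floats first would let the same helper be reused.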
{
  "_class_name": "MossSoundEffectPipeline",
  "_diffusers_version": "0.32.0",
  "transformer": [
    "WanAudioModel",
    "transformer"
  ],
  "vae": [
    "DAC",
    "vae"
  ],
  "text_encoder": [
    "Qwen3TextEncoder",
    "text_encoder"
  ],
  "tokenizer": [
    "AutoTokenizer",
    "tokenizer"
  ],
  "scheduler": [
    "FlowMatchScheduler",
    "scheduler"
  ],
  "dit_variant": "1.3B",
  "sample_rate": 48000,
  "max_inference_seconds": 30,
  "vae_type": "dac",
  "text_encoder_type": "qwen3"
}
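To make the config above easier to read programmatically, here is a small sketch that parses it with the standard `json` module, separates component entries from scalar settings, and derives the longest clip in samples. Two readings are assumptions rather than documented facts: that each two-element list is a (class name, subfolder) pair, and that `max_inference_seconds` is the upper bound on generated clip length.

```python
import json

# The pipeline config shown above, embedded so the example is self-contained.
MODEL_INDEX = """
{
  "_class_name": "MossSoundEffectPipeline",
  "_diffusers_version": "0.32.0",
  "transformer": ["WanAudioModel", "transformer"],
  "vae": ["DAC", "vae"],
  "text_encoder": ["Qwen3TextEncoder", "text_encoder"],
  "tokenizer": ["AutoTokenizer", "tokenizer"],
  "scheduler": ["FlowMatchScheduler", "scheduler"],
  "dit_variant": "1.3B",
  "sample_rate": 48000,
  "max_inference_seconds": 30,
  "vae_type": "dac",
  "text_encoder_type": "qwen3"
}
"""

config = json.loads(MODEL_INDEX)

# Component entries are the two-element lists; interpreting them as
# (class name, subfolder) is an assumption based on how the values look.
components = {
    name: tuple(value)
    for name, value in config.items()
    if isinstance(value, list) and len(value) == 2
}

# Everything else that is not underscore-prefixed metadata is a scalar setting.
settings = {
    key: value
    for key, value in config.items()
    if not key.startswith("_") and not isinstance(value, list)
}

# Longest clip in samples, assuming max_inference_seconds bounds the clip length.
max_samples = config["sample_rate"] * config["max_inference_seconds"]

for name, (class_name, subfolder) in components.items():
    print(f"{name}: {class_name} ({subfolder})")
print(f"max samples per clip: {max_samples}")
```

Under these assumptions, a 30-second clip at 48000 Hz comes to 1,440,000 samples per channel.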