How to use ACE-Step/acestep-captioner with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-audio", model="ACE-Step/acestep-captioner")
# Load model directly from transformers import AutoProcessor, AutoModelForTextToWaveform processor = AutoProcessor.from_pretrained("ACE-Step/acestep-captioner") model = AutoModelForTextToWaveform.from_pretrained("ACE-Step/acestep-captioner")