Now it is possible to train models conditional on an encoding (of text or audio, for example).
1.3.2
🤗 Exciting news! AudioDiffusionPipeline has been migrated to the Hugging Face diffusers package so that it is even easier for others to use and contribute.