Stable Audio Open
Preview:
Introduce:
Stable Audio Open is an open source text-to-audio model optimized for generating short audio samples, sound effects, and production elements. It allows users to generate up to 47 seconds of high quality audio data with simple text prompts, and is particularly suitable for creating music production and sound design such as drum beats, instrumental improvisation, ambient sounds, and mimic recordings. The key benefit of open source distribution is that users can fine-tune the model according to their own custom audio data.
Stakeholders:
Stable Audio Open’s target audience includes sound designers, musicians and the creative community. It provides these users with a powerful tool to quickly generate the desired audio samples through text prompts, thus speeding up the process of music production and sound design, while maintaining the diversity and creativity of the audio.
Usage Scenario Examples:
- Generates warm analog synthesizer arpeggios, progressively rising filter cutoff and reverb endings
- Rock beats played in a treated studio, with conversational drum playing using acoustic drum sets
- The blackbird song of a summer evening in the forest
The features of the tool:
- Generate high quality audio samples up to 47 seconds
- Create drum beats, instrumental improvisations, ambient sounds, and more
- Audio sample style conversion and audio variant generation
- Users can fine-tune the model to fit their own audio data
- Support for text prompts to generate specific styles of audio
- Respect creator rights and train with audio data from FreeSound and the Free Music Archive
Steps for Use:
- Visit the Hugging Face website to download Stable Audio Olien model weights
- Fine-tune the model to fit specific audio data according to individual needs
- Use text prompts to generate the desired audio sample
- Explore different features of the model, such as style conversion of audio samples
- Join Stable AI’s community to get feedback and participate in further research and development
Tool’s Tabs: Audio generation, open source model