Audio AI Tools
Free audio AI tools for sound design, music composition, and voice synthesis, helping creatives produce unique audio experiences effortlessly.
Audio AI Tools
Audio Captioning
Audio Classification
Audio Editing
Audio Generation
Audio Inpainting
Audio Outpainting / Continuation
Audio Separation
Audio-to-3D
Audio-to-Motion
Audio-to-Text
Controllable Audio Generation
Image-to-Audio
Personalized Audio Generation
Speech Recognition
Text-to-Audio
Text-to-Audio
Image-to-Audio
Video-to-Audio
Text-to-Music
Text-to-Music
Text-to-SFX
Text-to-SFX
Text-to-Speech
Video-to-Audio
I Hear Your True Colors: Image Guided Audio Generation can generate audio that matches images using a two-stage Transformer model. It produces high-quality sound and introduces the ImageHear dataset for testing future image-to-audio models.
AudioLM can generate high-quality audio by treating it like a language task. It produces coherent speech and piano music continuations while keeping the speaker’s voice and style consistent, even for new speakers.