Audio Generation
Generate custom music, voiceovers, or sound effects for your videos and social media content — all powered by AI.
Getting to Audio Generation
- Open Media Studio from the sidebar.
- Click Create Audio (or Create Sound).

Types of Audio You Can Generate
Background Music
Generate original background music tracks for your videos.
Use cases:
- Background music for Instagram Reels or TikTok videos.
- Mood-setting background audio for product showcase videos.
- Upbeat tracks for promotional content.
Prompt examples:
- "Upbeat, energetic pop track for a fitness brand, 30 seconds"
- "Calm ambient background music for a wellness product video"
- "Corporate background music, professional and modern, 60 seconds"
Voice Clone Tab
Two sub-tabs are available within this tab:
Generate Speech
- Type or paste your script into the Text to Speak field (up to 10 000 characters).
- Choose a voice:
- Preset Voices — select from the built-in Minimax voice library.
- Cloned Voices — select a voice you have previously cloned.
- Configure optional parameters:
- Model — e.g.
speech-02-hd - Speed, Volume, Pitch sliders
- Emotion — neutral, happy, sad, angry, etc.
- Language Boost — improves accuracy for a specific language
- Format — MP3 or other supported formats
- Click Generate Speech. The job is queued and you are redirected to Media Jobs.

Clone Voice
Create a custom voice model from a recording.
- Switch to the Clone Voice sub-tab.
- Provide a reference audio sample — upload a file or choose one from your Asset Library.
- Enter a Custom Voice ID (must start with a letter, contain letters and digits, and be at least 8 characters).
- Configure optional settings:
- Model
speech-02-hd, etc.) - Noise Reduction toggle
- Volume Normalization toggle
- Accuracy slider
- Preview Text — played back after cloning to verify the result
- Language Boost
- Click Clone Voice. The job is queued in Media Jobs.
Your cloned voices appear in the cloned-voice selector in the Generate Speech sub-tab once processing completes.

Multilingual Tab
Use this tab for high-quality speech in Arabic and other languages.
- Type or paste your script (up to 5 000 characters).
- Select a Voice from the ElevenLabs voice library.
- Adjust quality sliders:
- Stability — how consistent the voice sounds across sentences
- Similarity — how closely the output matches the selected voice
- Speaker Boost toggle — enhances voice clarity
- Click Generate. Audio is processed and plays back in-browser when ready, with a Download button.

Generate Music Tab
Generates original, royalty-free background music.
- Enter a Prompt describing the style, mood, tempo, and genre.
- Example: "An upbeat electronic dance track with heavy bass drops and synth melodies"
- Optionally enter a Negative Prompt to steer the model away from unwanted elements.
- Expand Advanced Options for:
- Sample Count (1–4 variations; disabled when a seed is set)
- Seed — for reproducible results (disabled when sample count > 1)
- Click Generate. A status badge shows Pending → Processing → Completed.
- Once complete, play the track in-browser and click Download to save it.

Prompt examples:
"Calm ambient background music for a wellness product video""Corporate background music, professional and modern""Upbeat, energetic pop track for a fitness brand"
Sound Effects Tab
Generates short SFX clips. Choose between two AI models:
Beatoven
- Prompt — describe the sound (e.g. "Powerful helicopter takeoff with rotor blade chop")
- Negative Prompt — elements to avoid
- Duration slider (1–60 seconds)
- Refinement slider — how closely the output follows the prompt
- Creativity slider — variation in the output
- Seed — optional, for reproducibility
- An Enhance button rewrites your prompt with AI to improve results.
CassetteAI
- Prompt — describe the sound (e.g. "dog barking in the rain")
- Duration slider (1–60 seconds)
After clicking Generate, the job is queued and you are redirected to Media Jobs.

Reviewing and Downloading Audio
- For Generate Music and Multilingual, completed audio streams directly in the browser via an inline audio player.
- For Voice Clone and Sound Effects, completed files are available in Media Jobs where you can preview, download, or save to your asset repository.
Attaching Audio to a Video
After generating audio, you can combine it with a video in the Video Editor:
- Download your audio file.
- Open the Video Editor in Media Studio.
- Import your video and audio files.
- Sync and export.