Audio Generation

Generate custom music, voiceovers, or sound effects for your videos and social media content — all powered by AI.

Getting to Audio Generation

Open Media Studio from the sidebar.
Click Create Audio (or Create Sound).

Types of Audio You Can Generate

Background Music

Generate original background music tracks for your videos.

Use cases:

Background music for Instagram Reels or TikTok videos.
Mood-setting background audio for product showcase videos.
Upbeat tracks for promotional content.

Prompt examples:

"Upbeat, energetic pop track for a fitness brand, 30 seconds"
"Calm ambient background music for a wellness product video"
"Corporate background music, professional and modern, 60 seconds"

Voice Clone Tab

Two sub-tabs are available within this tab:

Generate Speech

Type or paste your script into the Text to Speak field (up to 10 000 characters).
Choose a voice:
Preset Voices — select from the built-in Minimax voice library.
Cloned Voices — select a voice you have previously cloned.
Configure optional parameters:
Model — e.g. speech-02-hd
Speed, Volume, Pitch sliders
Emotion — neutral, happy, sad, angry, etc.
Language Boost — improves accuracy for a specific language
Format — MP3 or other supported formats
Click Generate Speech. The job is queued and you are redirected to Media Jobs.

Clone Voice

Create a custom voice model from a recording.

Switch to the Clone Voice sub-tab.
Provide a reference audio sample — upload a file or choose one from your Asset Library.
Enter a Custom Voice ID (must start with a letter, contain letters and digits, and be at least 8 characters).
Configure optional settings:
Model speech-02-hd, etc.)
Noise Reduction toggle
Volume Normalization toggle
Accuracy slider
Preview Text — played back after cloning to verify the result
Language Boost
Click Clone Voice. The job is queued in Media Jobs.

Your cloned voices appear in the cloned-voice selector in the Generate Speech sub-tab once processing completes.

Multilingual Tab

Use this tab for high-quality speech in Arabic and other languages.

Type or paste your script (up to 5 000 characters).
Select a Voice from the ElevenLabs voice library.
Adjust quality sliders:
Stability — how consistent the voice sounds across sentences
Similarity — how closely the output matches the selected voice
Speaker Boost toggle — enhances voice clarity
Click Generate. Audio is processed and plays back in-browser when ready, with a Download button.

Generate Music Tab

Generates original, royalty-free background music.

Enter a Prompt describing the style, mood, tempo, and genre.
Example: "An upbeat electronic dance track with heavy bass drops and synth melodies"
Optionally enter a Negative Prompt to steer the model away from unwanted elements.
Expand Advanced Options for:
Sample Count (1–4 variations; disabled when a seed is set)
Seed — for reproducible results (disabled when sample count > 1)
Click Generate. A status badge shows Pending → Processing → Completed.
Once complete, play the track in-browser and click Download to save it.

Prompt examples:

"Calm ambient background music for a wellness product video"
"Corporate background music, professional and modern"
"Upbeat, energetic pop track for a fitness brand"

Sound Effects Tab

Generates short SFX clips. Choose between two AI models:

Beatoven

Prompt — describe the sound (e.g. "Powerful helicopter takeoff with rotor blade chop")
Negative Prompt — elements to avoid
Duration slider (1–60 seconds)
Refinement slider — how closely the output follows the prompt
Creativity slider — variation in the output
Seed — optional, for reproducibility
An Enhance button rewrites your prompt with AI to improve results.

CassetteAI

Prompt — describe the sound (e.g. "dog barking in the rain")
Duration slider (1–60 seconds)

After clicking Generate, the job is queued and you are redirected to Media Jobs.

Reviewing and Downloading Audio

For Generate Music and Multilingual, completed audio streams directly in the browser via an inline audio player.
For Voice Clone and Sound Effects, completed files are available in Media Jobs where you can preview, download, or save to your asset repository.

Attaching Audio to a Video

After generating audio, you can combine it with a video in the Video Editor:

Download your audio file.
Open the Video Editor in Media Studio.
Import your video and audio files.
Sync and export.