Audio Generation

A
admin@zorgsocial.com
4 min readUpdated Jul 1, 2026v5

Generate custom music, voiceovers, or sound effects for your videos and social media content — all powered by AI.


Getting to Audio Generation

  1. Open Media Studio from the sidebar.
  2. Click Create Audio (or Create Sound).
Screenshot from 2026-05-18 18-41-41.png

Types of Audio You Can Generate

Background Music

Generate original background music tracks for your videos.

Use cases:

  • Background music for Instagram Reels or TikTok videos.
  • Mood-setting background audio for product showcase videos.
  • Upbeat tracks for promotional content.

Prompt examples:

  • "Upbeat, energetic pop track for a fitness brand, 30 seconds"
  • "Calm ambient background music for a wellness product video"
  • "Corporate background music, professional and modern, 60 seconds"

Voice Clone Tab

Two sub-tabs are available within this tab:

Generate Speech

  1. Type or paste your script into the Text to Speak field (up to 10 000 characters).
  2. Choose a voice:
  3. Preset Voices — select from the built-in Minimax voice library.
  4. Cloned Voices — select a voice you have previously cloned.
  5. Configure optional parameters:
  6. Model — e.g. speech-02-hd
  7. Speed, Volume, Pitch sliders
  8. Emotion — neutral, happy, sad, angry, etc.
  9. Language Boost — improves accuracy for a specific language
  10. Format — MP3 or other supported formats
  11. Click Generate Speech. The job is queued and you are redirected to Media Jobs.
Screenshot from 2026-05-18 18-43-04.png

Clone Voice

Create a custom voice model from a recording.

  1. Switch to the Clone Voice sub-tab.
  2. Provide a reference audio sample — upload a file or choose one from your Asset Library.
  3. Enter a Custom Voice ID (must start with a letter, contain letters and digits, and be at least 8 characters).
  4. Configure optional settings:
  5. Model speech-02-hd, etc.)
  6. Noise Reduction toggle
  7. Volume Normalization toggle
  8. Accuracy slider
  9. Preview Text — played back after cloning to verify the result
  10. Language Boost
  11. Click Clone Voice. The job is queued in Media Jobs.

Your cloned voices appear in the cloned-voice selector in the Generate Speech sub-tab once processing completes.

Screenshot from 2026-05-18 18-57-03.png

Multilingual Tab

Use this tab for high-quality speech in Arabic and other languages.

  1. Type or paste your script (up to 5 000 characters).
  2. Select a Voice from the ElevenLabs voice library.
  3. Adjust quality sliders:
  4. Stability — how consistent the voice sounds across sentences
  5. Similarity — how closely the output matches the selected voice
  6. Speaker Boost toggle — enhances voice clarity
  7. Click Generate. Audio is processed and plays back in-browser when ready, with a Download button.
Screenshot from 2026-05-18 18-57-26.png

Generate Music Tab

Generates original, royalty-free background music.

  1. Enter a Prompt describing the style, mood, tempo, and genre.
  2. Example: "An upbeat electronic dance track with heavy bass drops and synth melodies"
  3. Optionally enter a Negative Prompt to steer the model away from unwanted elements.
  4. Expand Advanced Options for:
  5. Sample Count (1–4 variations; disabled when a seed is set)
  6. Seed — for reproducible results (disabled when sample count > 1)
  7. Click Generate. A status badge shows Pending → Processing → Completed.
  8. Once complete, play the track in-browser and click Download to save it.
Screenshot from 2026-05-18 19-01-11.png

Prompt examples:

  • "Calm ambient background music for a wellness product video"
  • "Corporate background music, professional and modern"
  • "Upbeat, energetic pop track for a fitness brand"

Sound Effects Tab

Generates short SFX clips. Choose between two AI models:

Beatoven

  • Prompt — describe the sound (e.g. "Powerful helicopter takeoff with rotor blade chop")
  • Negative Prompt — elements to avoid
  • Duration slider (1–60 seconds)
  • Refinement slider — how closely the output follows the prompt
  • Creativity slider — variation in the output
  • Seed — optional, for reproducibility
  • An Enhance button rewrites your prompt with AI to improve results.

CassetteAI

  • Prompt — describe the sound (e.g. "dog barking in the rain")
  • Duration slider (1–60 seconds)

After clicking Generate, the job is queued and you are redirected to Media Jobs.

Screenshot from 2026-05-18 19-01-49.png

Reviewing and Downloading Audio

  • For Generate Music and Multilingual, completed audio streams directly in the browser via an inline audio player.
  • For Voice Clone and Sound Effects, completed files are available in Media Jobs where you can preview, download, or save to your asset repository.

Attaching Audio to a Video

After generating audio, you can combine it with a video in the Video Editor:

  1. Download your audio file.
  2. Open the Video Editor in Media Studio.
  3. Import your video and audio files.
  4. Sync and export.

Related Reading

Still need assistance?

If this article didn't clear things up, our support engineers are standing by.