Back to Blog

Seamlessly Merge Images and Audio with FFMPEGAPI.net

June 2026 FFMPEG API Team

Creating engaging video content often requires the perfect combination of images and audio. With FFMPEGAPI.net, developers can leverage a powerful hosted REST API to merge images and audio seamlessly. This article explores the Image and Audio Merge endpoint, its capabilities, and how it can be integrated into your applications.

What is the Image and Audio Merge Endpoint?

The Image and Audio Merge endpoint allows you to create an MP4 video from pairs of images and audio files. This is especially useful for creating video presentations, tutorials, or any content where images need to be displayed alongside audio.

By sending a simple POST request to the /api/merge_image_audio endpoint, you can combine multiple image and audio files, each displayed for the duration of its corresponding audio track.

  • Supports multi-pair image/audio combinations.
  • Optional image transitions for smooth visuals.
  • Default slow zoom effect to enhance viewer engagement.

Using the Merge Endpoint: Parameters and Options

The endpoint accepts various parameters that can be tailored to your specific needs. Below are the key parameters you can use:

You can provide image URLs and audio URLs in arrays for multiple pairs or use single image and audio strings for a simple merge.

  • image_urls: Array of image URLs for multi-pair requests.
  • audio_urls: Array of audio URLs that dictate the duration of image display.
  • transition_effect: An optional effect for transitioning between images.
  • zoom_effect: Control if a slow zoom should be applied to images.

Practical Examples of Merging Images and Audio

Here’s how you can use cURL to make a request to the FFMPEGAPI.net Image and Audio Merge endpoint:

This example shows how to combine images and audio with a fade transition effect.

curl -X POST https://www.ffmpegapi.net/api/merge_image_audio \
-H 'Content-Type: application/json' \
-d '{ \
  "image_urls": ["https://example.com/intro.jpg", "https://example.com/chapter-1.jpg"], \
  "audio_urls": ["https://example.com/intro.mp3", "https://example.com/chapter-1.mp3"], \
  "transition_effect": "fade", \
  "transition_duration": 0.75, \
  "dimensions": "1280x720", \
  "zoom_effect": true \
}'
import requests

url = 'https://www.ffmpegapi.net/api/merge_image_audio'
data = {
    'image_urls': ['https://example.com/intro.jpg', 'https://example.com/chapter-1.jpg'],
    'audio_urls': ['https://example.com/intro.mp3', 'https://example.com/chapter-1.mp3'],
    'transition_effect': 'fade',
    'transition_duration': 0.75,
    'dimensions': '1280x720',
    'zoom_effect': True
}
response = requests.post(url, json=data)
print(response.json())

FFMPEGAPI.net provides a robust solution for developers looking to merge images and audio effortlessly. By utilizing our hosted REST API, you can save time and resources, allowing you to focus on building exceptional applications. With features like image transitions and zoom effects, your video content will stand out without the hassle of managing infrastructure. Start integrating the Image and Audio Merge endpoint today and elevate your video processing capabilities.

Ready to Start Processing Videos?

Get your free API key and start merging videos in minutes

Get Started Free