Ffmpeg
Ffmpeg
ffmpeg Documentation
Table of Contents
1 Synopsis
2 Description
3 Detailed description
3.1 Streamcopy
3.2 Transcoding
3.3 Filtering
3.3.1 Simple filtergraphs
3.3.2 Complex filtergraphs
3.4 Loopback decoders
4 Stream selection
4.1 Description
4.1.1 Automatic stream selection
4.1.2 Manual stream selection
4.1.3 Complex filtergraphs
4.1.4 Stream handling
4.2 Examples
5 Options
5.1 Stream specifiers
5.2 Generic options
5.3 AVOptions
5.4 Main options
5.5 Video Options
5.6 Advanced Video options
5.7 Audio Options
5.8 Advanced Audio options
5.9 Subtitle options
5.10 Advanced Subtitle options
5.11 Advanced options
5.12 Preset files
5.12.1 ffpreset files
5.12.2 avpreset files
5.13 vstats file format
6 Examples
6.1 Video and Audio grabbing
6.2 X11 grabbing
6.3 Video and Audio file format conversion
7 See Also
8 Authors
[Link] 1/65
1 Synopsis
2/16/26, 2:47 PM ffmpeg Documentation
2 Description
ffmpeg is a universal media converter. It can read a wide variety of inputs - including live
grabbing/recording devices - filter, and transcode them into a plethora of output formats.
ffmpeg reads from an arbitrary number of inputs (which can be regular files, pipes, network streams,
grabbing devices, etc.), specified by the -i option, and writes to an arbitrary number of outputs,
which are specified by a plain output url. Anything found on the command line which cannot be
interpreted as an option is considered to be an output url.
Each input or output can, in principle, contain any number of elementary streams of different types
(video/audio/subtitle/attachment/data), though the allowed stream counts and/or types may be limited
by the container format. Selecting which streams from which inputs will go into which output is either
done automatically or with the -map option (see the Stream selection chapter).
To refer to inputs/outputs in options, you must use their indices (0-based). E.g. the first input is 0 ,
the second is 1 , etc. Similarly, streams within an input/output are referred to by their indices. E.g.
2:3 refers to the fourth stream in the third input or output. Also see the Stream specifiers chapter.
As a general rule, options are applied to the next specified file. Therefore, order is important, and you
can have the same option on the command line multiple times. Each occurrence is then applied to the
next input or output file. Exceptions from this rule are the global options (e.g. verbosity level), which
should be specified first.
Do not mix input and output files – first specify all input files, then all output files. Also do not mix
options which belong to different files. All options apply ONLY to the next input or output file and are
reset between files.
Force the frame rate of the input file (valid for raw formats only) to 1 fps and the frame rate of the
output file to 24 fps:
[Link] 2/65
2/16/26, 2:47 PM ffmpeg Documentation
3 Detailed description
ffmpeg builds a transcoding pipeline out of the components listed below. The program’s operation
then consists of input data chunks flowing from the sources down the pipes towards the sinks, while
being transformed by the components they encounter along the way.
In other literature, demuxers are sometimes called splitters, because their main function is
splitting a file into elementary streams (though some files only contain one elementary stream).
┌──────────┬───────────────────────┐
│ demuxer │ │ packets for stream 0
╞══════════╡ elementary stream 0 ├──────────────────────►
│ │ │
│ global ├───────────────────────┤
│properties│ │ packets for stream 1
│ and │ elementary stream 1 ├──────────────────────►
│ metadata │ │
│ ├───────────────────────┤
│ │ │
│ │ ........... │
│ │ │
│ ├───────────────────────┤
│ │ │ packets for stream N
│ │ elementary stream N ├──────────────────────►
│ │ │
└──────────┴───────────────────────┘
▲
│
│ read from file, network stream,
│ grabbing device, etc.
│
Decoders receive encoded (compressed) packets for an audio, video, or subtitle elementary
stream, and decode them into raw frames (arrays of pixels for video, PCM for audio). A decoder is
typically associated with (and receives its input from) an elementary stream in a demuxer, but
sometimes may also exist on its own (see Loopback decoders).
A schematic representation of a decoder looks like this:
[Link] 3/65
2/16/26, 2:47 PM ffmpeg Documentation
┌─────────┐
packets │ │ raw frames
─────────►│ decoder ├────────────►
│ │
└─────────┘
Filtergraphs process and transform raw audio or video frames. A filtergraph consists of one or
more individual filters linked into a graph. Filtergraphs come in two flavors - simple and complex,
configured with the -filter and -filter_complex options, respectively.
A simple filtergraph is associated with an output elementary stream; it receives the input to be
filtered from a decoder and sends filtered output to that output stream’s encoder.
A simple video filtergraph that performs deinterlacing (using the yadif deinterlacer) followed by
┌────────────────────────┐
│ simple filtergraph │
frames from ╞════════════════════════╡ frames for
a decoder │ ┌───────┐ ┌───────┐ │ an encoder
────────────►├─►│ yadif ├─►│ scale ├─►│────────────►
│ └───────┘ └───────┘ │
└────────────────────────┘
A complex filtergraph is standalone and not associated with any specific stream. It may have
multiple (or zero) inputs, potentially of different types (audio or video), each of which receiving
data either from a decoder or another complex filtergraph’s output. It also has one or more
outputs that feed either an encoder or another complex filtergraph’s input.
The following example diagram represents a complex filtergraph with 3 inputs and 2 outputs (all
video):
┌─────────────────────────────────────────────────┐
│ complex filtergraph │
╞═════════════════════════════════════════════════╡
frames ├───────┐ ┌─────────┐ ┌─────────┐ ┌────────┤ frames
─────────►│input 0├─►│ overlay ├─────►│ overlay ├─►│output 0├────────►
├───────┘ │ │ │ │ └────────┤
frames ├───────┐╭►│ │ ╭►│ │ │
─────────►│input 1├╯ └─────────┘ │ └─────────┘ │
├───────┘ │ │
frames ├───────┐ ┌─────┐ ┌─────┬─╯ ┌────────┤ frames
─────────►│input 2├►│scale├►│split├───────────────►│output 1├────────►
├───────┘ └─────┘ └─────┘ └────────┤
└─────────────────────────────────────────────────┘
Frames from second input are overlaid over those from the first. Frames from the third input are
rescaled, then the duplicated into two identical streams. One of them is overlaid over the
combined first two inputs, with the result exposed as the filtergraph’s first output. The other
duplicate ends up being the filtergraph’s second output.
[Link] 4/65
2/16/26, 2:47 PM ffmpeg Documentation
Encoders receive raw audio, video, or subtitle frames and encode them into encoded packets. The
encoding (compression) process is typically lossy - it degrades stream quality to make the output
smaller; some encoders are lossless, but at the cost of much higher output size. A video or audio
encoder receives its input from some filtergraph’s output, subtitle encoders receive input from a
decoder (since subtitle filtering is not supported yet). Every encoder is associated with some
muxer’s output elementary stream and sends its output to that muxer.
A schematic representation of an encoder looks like this:
┌─────────┐
raw frames │ │ packets
────────────►│ encoder ├─────────►
│ │
└─────────┘
Muxers (short for "multiplexers") receive encoded packets for their elementary streams from
encoders (the transcoding path) or directly from demuxers (the streamcopy path), interleave them
(when there is more than one elementary stream), and write the resulting bytes into the output file
(or pipe, network stream, etc.).
A schematic representation of a muxer looks like this:
┌──────────────────────┬───────────┐
packets for stream 0 │ │ muxer │
──────────────────────►│ elementary stream 0 ╞═══════════╡
│ │ │
├──────────────────────┤ global │
packets for stream 1 │ │properties │
──────────────────────►│ elementary stream 1 │ and │
│ │ metadata │
├──────────────────────┤ │
│ │ │
│ ........... │ │
│ │ │
├──────────────────────┤ │
packets for stream N │ │ │
──────────────────────►│ elementary stream N │ │
│ │ │
└──────────────────────┴─────┬─────┘
│
write to file, network stream, │
grabbing device, etc. │
│
▼
3.1 Streamcopy
The simplest pipeline in ffmpeg is single-stream streamcopy, that is copying one input elementary
stream’s packets without decoding, filtering, or encoding them. As an example, consider an input file
called [Link] with 3 elementary streams, from which we take the second and write it to file
[Link] 5/65
2/16/26, 2:47 PM ffmpeg Documentation
OUTPUT.mp4. A schematic representation of such a pipeline looks like this:
┌──────────┬─────────────────────┐
│ demuxer │ │ unused
╞══════════╡ elementary stream 0 ├────────╳
│ │ │
│[Link] ├─────────────────────┤ ┌──────────────────────┬───────────┐
│ │ │ packets │ │ muxer │
│ │ elementary stream 1 ├─────────►│ elementary stream 0 ╞═══════════╡
│ │ │ │ │OUTPUT.mp4 │
│ ├─────────────────────┤ └──────────────────────┴───────────┘
│ │ │ unused
│ │ elementary stream 2 ├────────╳
│ │ │
└──────────┴─────────────────────┘
In this commandline
Streamcopy is useful for changing the elementary stream count, container format, or modifying
container-level metadata. Since there is no decoding or encoding, it is very fast and there is no quality
loss. However, it might not work in some cases because of a variety of factors (e.g. certain information
required by the target container is not available in the source). Applying filters is obviously also
impossible, since filters work on decoded frames.
More complex streamcopy scenarios can be constructed - e.g. combining streams from two input files
into a single output:
┌──────────┬────────────────────┐ ┌────────────────────┬───────────┐
│ demuxer 0│ │ packets │ │ muxer │
╞══════════╡elementary stream 0 ├────────►│elementary stream 0 ╞═══════════╡
│[Link]│ │ │ │OUTPUT.mp4 │
└──────────┴────────────────────┘ ├────────────────────┤ │
┌──────────┬────────────────────┐ │ │ │
│ demuxer 1│ │ packets │elementary stream 1 │ │
╞══════════╡elementary stream 0 ├────────►│ │ │
│[Link]│ │ └────────────────────┴───────────┘
└──────────┴────────────────────┘
[Link] 6/65
2/16/26, 2:47 PM ffmpeg Documentation
that can be built by the commandline
The output -map option is used twice here, creating two streams in the output file - one fed by the first
input and one by the second. The single instance of the -c option selects streamcopy for both of those
streams. You could also use multiple instances of this option together with Stream specifiers to apply
different values to each stream, as will be demonstrated in following sections.
A converse scenario is splitting multiple streams from a single input into multiple outputs:
┌──────────┬─────────────────────┐ ┌───────────────────┬───────────┐
│ demuxer │ │ packets │ │ muxer 0 │
╞══════════╡ elementary stream 0 ├─────────►│elementary stream 0╞═══════════╡
│ │ │ │ │OUTPUT0.mp4│
│[Link] ├─────────────────────┤ └───────────────────┴───────────┘
│ │ │ packets ┌───────────────────┬───────────┐
│ │ elementary stream 1 ├─────────►│ │ muxer 1 │
│ │ │ │elementary stream 0╞═══════════╡
└──────────┴─────────────────────┘ │ │OUTPUT1.mp4│
└───────────────────┴───────────┘
built with
ffmpeg -i [Link] -map 0:0 -c copy OUTPUT0.mp4 -map 0:1 -c copy OUTPUT1.mp4
Note how a separate instance of the -c option is needed for every output file even though their values
are the same. This is because non-global options (which is most of them) only apply in the context of
the file before which they are placed.
These examples can of course be further generalized into arbitrary remappings of any number of inputs
into any number of outputs.
3.2 Transcoding
Transcoding is the process of decoding a stream and then encoding it again. Since encoding tends to be
computationally expensive and in most cases degrades the stream quality (i.e. it is lossy), you should
only transcode when you need to and perform streamcopy otherwise. Typical reasons to transcode are:
applying filters - e.g. resizing, deinterlacing, or overlaying video; resampling or mixing audio;
you want to feed the stream to something that cannot decode the original codec.
Note that ffmpeg will transcode all audio, video, and subtitle streams unless you specify -c copy for
them.
Consider an example pipeline that reads an input file with one audio and one video stream, transcodes
the video and copies the audio into a single output file. This can be schematically represented as
follows
[Link] 7/65
2/16/26, 2:47 PM ffmpeg Documentation
┌──────────┬─────────────────────┐
│ demuxer │ │ audio packets
╞══════════╡ stream 0 (audio) ├─────────────────────────────────────╮
│ │ │ │
│[Link] ├─────────────────────┤ video ┌─────────┐ raw │
│ │ │ packets │ video │ video frames │
│ │ stream 1 (video) ├─────────►│ decoder ├──────────────╮ │
│ │ │ │ │ │ │
└──────────┴─────────────────────┘ └─────────┘ │ │
▼ ▼
│ │
┌──────────┬─────────────────────┐ video ┌─────────┐ │ │
│ muxer │ │ packets │ video │ │ │
╞══════════╡ stream 0 (video) │◄─────────┤ encoder ├──────────────╯ │
│ │ │ │(libx264)│ │
│OUTPUT.mp4├─────────────────────┤ └─────────┘ │
│ │ │ │
│ │ stream 1 (audio) │◄────────────────────────────────────╯
│ │ │
└──────────┴─────────────────────┘
ffmpeg -i [Link] -map 0:v -map 0:a -c:v libx264 -c:a copy OUTPUT.mp4
Note how it uses stream specifiers :v and :a to select input streams and apply different values of
the -c option to them; see the Stream specifiers section for more details.
3.3 Filtering
When transcoding, audio and video streams can be filtered before encoding, with either a simple or
complex filtergraph.
Taking the transcoding example from above, adding filtering (and omitting audio, for clarity) makes it
look like this:
[Link] 8/65
2/16/26, 2:47 PM ffmpeg Documentation
┌──────────┬───────────────┐
│ demuxer │ │ ┌─────────┐
╞══════════╡ video stream │ packets │ video │ frames
│[Link] │ ├─────────►│ decoder ├─────►───╮
│ │ │ └─────────┘ │
└──────────┴───────────────┘ │
╭───────────◄───────────╯
│ ┌────────────────────────┐
│ │ simple filtergraph │
│ ╞════════════════════════╡
│ │ ┌───────┐ ┌───────┐ │
╰──►├─►│ yadif ├─►│ scale ├─►├╮
│ └───────┘ └───────┘ ││
└────────────────────────┘│
│
│
┌──────────┬───────────────┐ video ┌─────────┐ │
│ muxer │ │ packets │ video │ │
╞══════════╡ video stream │◄─────────┤ encoder ├───────◄───────╯
│OUTPUT.mp4│ │ │ │
│ │ │ └─────────┘
└──────────┴───────────────┘
A trivial example of a complex filtergraph is the overlay filter, which has two video inputs and one
video output, containing one video overlaid on top of the other. Its audio counterpart is the amix
filter.
[Link] 9/65
2/16/26, 2:47 PM ffmpeg Documentation
Decoding AVOptions can be passed to loopback decoders by placing them before -dec , analogously
to input/output options.
ffmpeg -i INPUT \
-map 0:v:0 -c:v libx264 -crf 45 -f null - \
-threads 3 -dec 0:0 \
-filter_complex '[0:v][dec:0]hstack[stack]' \
-map '[stack]' -c:v ffv1 OUTPUT
┌──────────┬───────────────┐
│ demuxer │ │ ┌─────────┐ ┌─────────┐ ┌────────────────────┐
╞══════════╡ video stream │ │ video │ │ video │ │ null muxer │
│ INPUT │ ├──►│ decoder ├──┬────────►│ encoder ├─┬─►│(discards its input)│
│ │ │ └─────────┘ │ │(libx264)│ │ └────────────────────┘
└──────────┴───────────────┘ │ └─────────┘ │
╭───────◄──╯ ┌─────────┐ │
│ │loopback │ │
│ ╭─────◄──────┤ decoder ├────◄──╯
│ │ └─────────┘
│ │
│ │
│ │ ┌───────────────────┐
│ │ │complex filtergraph│
│ │ ╞═══════════════════╡
│ │ │ ┌─────────────┐ │
╰─╫─►├─►│ hstack ├─►├╮
╰─►├─►│ │ ││
│ └─────────────┘ ││
└───────────────────┘│
│
┌──────────┬───────────────┐ ┌─────────┐ │
│ muxer │ │ │ video │ │
╞══════════╡ video stream │◄─┤ encoder ├───────◄──────────╯
│ OUTPUT │ │ │ (ffv1) │
│ │ │ └─────────┘
└──────────┴───────────────┘
[Link] 10/65
4 Stream selection
2/16/26, 2:47 PM ffmpeg Documentation
ffmpeg provides the -map option for manual control of stream selection in each output file. Users
can skip -map and let ffmpeg perform automatic stream selection as described below. The
-vn / -an / -sn / -dn options can be used to skip inclusion of video, audio, subtitle and data streams
respectively, whether manually mapped or automatically selected, except for those streams which are
outputs of complex filtergraphs.
4.1 Description
The sub-sections that follow describe the various rules that are involved in stream selection. The
examples that follow next show how these rules are applied in practice.
While every effort is made to accurately reflect the behavior of the program, FFmpeg is under
continuous development and the code may have changed since the time of this writing.
In the case where several streams of the same type rate equally, the stream with the lowest index is
chosen.
Data or attachment streams are not automatically selected and can only be included using -map .
When -map is used, only user-mapped streams are included in that output file, with one possible
exception for filtergraph outputs described below.
[Link] 11/65
2/16/26, 2:47 PM ffmpeg Documentation
If there are any complex filtergraph output streams with unlabeled pads, they will be added to the first
output file. This will lead to a fatal error if the stream type is not supported by the output format. In the
absence of the map option, the inclusion of these streams leads to the automatic stream selection of
their types being skipped. If map options are present, these filtergraph streams are included in addition
to the mapped streams.
Complex filtergraph output streams with labeled pads must be mapped once and exactly once.
An exception exists for subtitles. If a subtitle encoder is specified for an output file, the first subtitle
stream found of any type, text or image, will be included. ffmpeg does not validate if the specified
encoder can convert the selected stream or if the converted stream is acceptable within the output
format. This applies generally as well: when the user sets an encoder manually, the stream selection
process cannot check if the encoded stream can be muxed into the output file. If it cannot, ffmpeg will
abort and all output files will fail to be processed.
4.2 Examples
The following examples illustrate the behavior, quirks and limitations of ffmpeg’s stream selection
methods.
[Link] 12/65
2/16/26, 2:47 PM ffmpeg Documentation
ffmpeg -i [Link] -i B.mp4 [Link] [Link] -map 1:a -c:a copy [Link]
There are three output files specified, and for the first two, no -map options are set, so ffmpeg will
select streams for these two files automatically.
[Link] is a Matroska container file and accepts video, audio and subtitle streams, so ffmpeg will try
to select one of each type.
For video, it will select stream 0 from B.mp4, which has the highest resolution among all the input
video streams.
For audio, it will select stream 3 from B.mp4, since it has the greatest number of channels.
For subtitles, it will select stream 2 from B.mp4, which is the first subtitle stream from among [Link]
and B.mp4.
[Link] accepts only audio streams, so only stream 3 from B.mp4 is selected.
For [Link], since a -map option is set, no automatic stream selection will occur. The -map 1:a
option will select all audio streams from the second input B.mp4. No other streams will be included in
this output file.
For the first two outputs, all included streams will be transcoded. The encoders chosen will be the
default ones registered by each output format, which may not match the codec of the selected input
streams.
For the third output, codec option for audio streams has been set to copy , so no decoding-filtering-
encoding operations will occur, or can occur. Packets of selected streams shall be conveyed from the
input file and muxed within the output file.
Although [Link] is a Matroska container file which accepts subtitle streams, only a video and audio
stream shall be selected. The subtitle stream of [Link] is image-based and the default subtitle encoder
of the Matroska muxer is text-based, so a transcode operation for the subtitles is expected to fail and
hence the stream isn’t selected. However, in [Link], a subtitle encoder is specified in the command
and so, the subtitle stream is selected, in addition to the video stream. The presence of -an disables
audio stream selection for [Link].
A filtergraph is setup here using the -filter_complex option and consists of a single video filter. The
overlay filter requires exactly two video inputs, but none are specified, so the first two available video
streams are used, those of [Link] and [Link]. The output pad of the filter has no label and so is sent to
the first output file out1.mp4. Due to this, automatic selection of the video stream is skipped, which
[Link] 13/65
2/16/26, 2:47 PM ffmpeg Documentation
would have selected the stream in B.mp4. The audio stream with most channels viz. stream 3 in B.mp4,
is chosen automatically. No subtitle stream is chosen however, since the MP4 format has no default
subtitle encoder registered, and the user hasn’t specified a subtitle encoder.
The 2nd output file, [Link], only accepts text-based subtitle streams. So, even though the first
subtitle stream available belongs to [Link], it is image-based and hence skipped. The selected stream,
stream 2 in B.mp4, is the first text-based subtitle stream.
The above command will fail, as the output pad labelled [outv] has been mapped twice. None of the
output files shall be processed.
This command above will also fail as the hue filter output has a label, [outv] , and hasn’t been
mapped anywhere.
The video stream from B.mp4 is sent to the hue filter, whose output is cloned once using the split filter,
and both outputs labelled. Then a copy each is mapped to the first and third output files.
The overlay filter, requiring two video inputs, uses the first two unused video streams. Those are the
streams from [Link] and [Link]. The overlay output isn’t labelled, so it is sent to the first output file
out1.mp4, regardless of the presence of the -map option.
The aresample filter is sent the first unused audio stream, that of [Link]. Since this filter output is also
unlabelled, it too is mapped to the first output file. The presence of -an only suppresses automatic or
manual stream selection of audio streams, not outputs sent from filtergraphs. Both these mapped
streams shall be ordered before the mapped stream in out1.mp4.
The video, audio and subtitle streams mapped to [Link] are entirely determined by automatic
stream selection.
[Link] consists of the cloned video output from the hue filter and the first audio stream from B.mp4.
[Link] 14/65
5 Options
2/16/26, 2:47 PM ffmpeg Documentation
All the numerical options, if not specified otherwise, accept a string representing a number as input,
which may be followed by one of the SI unit prefixes, for example: ’K’, ’M’, or ’G’.
If ’i’ is appended to the SI unit prefix, the complete prefix will be interpreted as a unit prefix for binary
multiples, which are based on powers of 1024 instead of powers of 1000. Appending ’B’ to the SI unit
prefix multiplies the value by 8. This allows using, for example: ’KB’, ’MiB’, ’G’ and ’B’ as number
suffixes.
Options which do not take arguments are boolean options, and set the corresponding value to true.
They can be set to false by prefixing the option name with "no". For example using "-nofoo" will set the
boolean option with name "foo" to false.
Options that take arguments support a special syntax where the argument given on the command line is
interpreted as a path to the file from which the actual argument value is loaded. To use this feature, add
a forward slash ’/’ immediately before the option name (after the leading dash). E.g.
A stream specifier is a string generally appended to the option name and separated from it by a colon.
E.g. -co[Link] ac3 contains the a:1 stream specifier, which matches the second audio stream.
Therefore, it would select the ac3 codec for the second audio stream.
A stream specifier can match several streams, so that the option is applied to all of them. E.g. the
stream specifier in -b:a 128k matches all audio streams.
An empty stream specifier matches all streams. For example, -codec copy or -codec: copy would
copy all the streams without reencoding.
stream_index
Matches the stream with this index. E.g. -threads:1 4 would set the thread count for the second
stream to 4. If stream_index is used as an additional stream specifier (see below), then it selects
stream number stream_index from the matching streams. Stream numbering is based on the order of
the streams as detected by libavformat except when a stream group specifier or program ID is also
specified. In this case it is based on the ordering of the streams in the group or program.
stream_type[:additional_stream_specifier]
[Link] 15/65
2/16/26, 2:47 PM ffmpeg Documentation
stream_type is one of following: ’v’ or ’V’ for video, ’a’ for audio, ’s’ for subtitle, ’d’ for data, and ’t’
for attachments. ’v’ matches all video streams, ’V’ only matches video streams which are not
attached pictures, video thumbnails or cover arts. If additional_stream_specifier is used, then it
matches streams which both have this type and match the additional_stream_specifier. Otherwise, it
matches all streams of the specified type.
g:group_specifier[:additional_stream_specifier]
Matches streams which are in the group with the specifier group_specifier. if
additional_stream_specifier is used, then it matches streams which both are part of the group and
match the additional_stream_specifier. group_specifier may be one of the following:
group_index
Match the stream with this group index.
#group_id or i:group_id
Match the stream with this group id.
p:program_id[:additional_stream_specifier]
Matches streams which are in the program with the id program_id. If additional_stream_specifier is
used, then it matches streams which both are part of the program and match the
additional_stream_specifier.
#stream_id or i:stream_id
Match the stream by stream id (e.g. PID in MPEG-TS container).
m:key[:value]
Matches streams with the metadata tag key having the specified value. If value is not given, matches
streams that contain the given tag with any value. The colon character ’:’ in key or value needs to be
backslash-escaped.
disp:dispositions[:additional_stream_specifier]
Matches streams with the given disposition(s). dispositions is a list of one or more dispositions (as
printed by the -dispositions option) joined with ’+’.
u
Matches streams with usable configuration, the codec must be defined and the essential information
such as video dimension or audio sample rate must be present.
Note that in ffmpeg , matching by metadata will only work properly for input files.
-L, -license
Show license.
[Link] 16/65
2/16/26, 2:47 PM ffmpeg Documentation
Possible values of arg are:
long
Print advanced tool options in addition to the basic tool options.
full
Print complete list of options, including shared and private options for encoders, decoders,
demuxers, muxers, filters, etc.
decoder=decoder_name
Print detailed information about the decoder named decoder_name. Use the -decoders option to
get a list of all decoders.
encoder=encoder_name
Print detailed information about the encoder named encoder_name. Use the -encoders option to
get a list of all encoders.
demuxer=demuxer_name
Print detailed information about the demuxer named demuxer_name. Use the -formats option to
get a list of all demuxers and muxers.
muxer=muxer_name
Print detailed information about the muxer named muxer_name. Use the -formats option to get a
list of all muxers and demuxers.
filter=filter_name
Print detailed information about the filter named filter_name. Use the -filters option to get a list
of all filters.
bsf=bitstream_filter_name
Print detailed information about the bitstream filter named bitstream_filter_name. Use the -bsfs
option to get a list of all bitstream filters.
protocol=protocol_name
Print detailed information about the protocol named protocol_name. Use the -protocols option to
get a list of all protocols.
-version
Show version.
-buildconf
Show the build configuration, one option per line.
-formats
Show available formats (including devices).
-demuxers
Show available demuxers.
-muxers
Show available muxers.
-devices
Show available devices.
-codecs
Show all codecs known to libavcodec.
[Link] 17/65
2/16/26, 2:47 PM ffmpeg Documentation
Note that the term ’codec’ is used throughout this documentation as a shortcut for what is more
correctly called a media bitstream format.
-decoders
Show available decoders.
-encoders
Show all available encoders.
-bsfs
Show available bitstream filters.
-protocols
Show available protocols.
-filters
Show available libavfilter filters.
-pix_fmts
Show available pixel formats.
-sample_fmts
Show available sample formats.
-layouts
Show channel names and standard channel layouts.
-dispositions
Show stream dispositions.
-colors
Show recognized color names.
-sources device[,opt1=val1[,opt2=val2]...]
Show autodetected sources of the input device. Some devices may provide system-dependent source
names that cannot be autodetected. The returned list cannot be assumed to be always complete.
-sinks device[,opt1=val1[,opt2=val2]...]
Show autodetected sinks of the output device. Some devices may provide system-dependent sink
names that cannot be autodetected. The returned list cannot be assumed to be always complete.
‘repeat’
Indicates that repeated log output should not be compressed to the first line and the "Last
message repeated n times" line will be omitted.
‘level’
[Link] 18/65
2/16/26, 2:47 PM ffmpeg Documentation
Indicates that log output should add a [level] prefix to each message line. This can be used as
an alternative to log coloring, e.g. when dumping the log to file.
‘time’
Indicates that log lines should be prefixed with time information.
‘datetime’
Indicates that log lines should be prefixed with date and time information.
Flags can also be used alone by adding a ’+’/’-’ prefix to set/reset a single flag without affecting
other flags or changing loglevel. When setting both flags and loglevel, a ’+’ separator is expected
between the last flags value and before loglevel.
‘quiet, -8’
Show nothing at all; be silent.
‘panic, 0’
Only show fatal errors which could lead the process to crash, such as an assertion failure. This is
not currently used for anything.
‘fatal, 8’
Only show fatal errors. These are errors after which the process absolutely cannot continue.
‘error, 16’
Show all errors, including ones which can be recovered from.
‘warning, 24’
Show all warnings and errors. Any message related to possibly incorrect or unexpected events will
be shown.
‘info, 32’
Show informative messages during processing. This is in addition to warnings and errors. This is
the default value.
‘verbose, 40’
Same as info , except more verbose.
‘debug, 48’
Show everything, including debugging information.
‘trace, 56’
For example to enable repeated log output, add the level prefix, and set loglevel to verbose :
Another example that enables repeated log output without affecting current state of level prefix
flag or loglevel:
[Link] 19/65
2/16/26, 2:47 PM ffmpeg Documentation
By default the program logs to stderr. If coloring is supported by the terminal, colors are used to
mark errors and warnings. Log coloring can be disabled setting the environment variable
AV_LOG_FORCE_NOCOLOR , or can be forced setting the environment variable AV_LOG_FORCE_COLOR .
-report
Dump full command line and log output to a file named [Link] in the current
directory. This file can be useful for bug reports. It also implies -loglevel debug .
Setting the environment variable FFREPORT to any value has the same effect. If the value is a ’:’-
separated key=value sequence, these options will affect the report; option values must be escaped if
they contain special characters or the options delimiter ’:’ (see the “Quoting and escaping” section in
the ffmpeg-utils manual).
file
set the file name to use for the report; %p is expanded to the name of the program, %t is
level
set the log verbosity level using a numerical value (see -loglevel ).
For example, to output a report to a file named [Link] using a log level of 32 (alias for log
level info ):
Errors in parsing the environment variable are not fatal, and will not appear in the report.
-hide_banner
Suppress printing banner.
All FFmpeg tools will normally show a copyright notice, build options and library versions. This
option can be used to suppress printing this information.
[Link] 20/65
2/16/26, 2:47 PM ffmpeg Documentation
‘x86’
‘mmx’
‘mmxext’
‘sse’
‘sse2’
‘sse2slow’
‘sse3’
‘sse3slow’
‘ssse3’
‘atom’
‘sse4.1’
‘sse4.2’
‘avx’
‘avx2’
‘xop’
‘fma3’
‘fma4’
‘3dnow’
‘3dnowext’
‘bmi1’
‘bmi2’
‘cmov’
‘ARM’
‘armv5te’
‘armv6’
‘armv6t2’
‘vfp’
‘vfpv3’
‘neon’
‘setend’
‘AArch64’
‘armv8’
‘vfp’
‘neon’
‘PowerPC’
‘altivec’
‘Specific Processors’
‘pentium2’
‘pentium3’
‘pentium4’
[Link] 21/65
2/16/26, 2:47 PM ffmpeg Documentation
‘k6’
‘k62’
‘athlon’
‘athlonxp’
‘k8’
-cpucount count (global)
Override detection of CPU count. This option is intended for testing. Do not use it unless you know
what you’re doing.
ffmpeg -cpucount 2
-max_alloc bytes
Set the maximum size limit for allocating a block on the heap by ffmpeg’s family of malloc functions.
Exercise extreme caution when using this option. Don’t use if you do not understand the full
consequence of doing so. Default is INT_MAX.
5.3 AVOptions
These options are provided directly by the libavformat, libavdevice and libavcodec libraries. To see the
list of available AVOptions, use the -help option. They are separated into two categories:
generic
These options can be set for any container, codec or device. Generic options are listed under
AVFormatContext options for containers/devices and under AVCodecContext options for codecs.
private
These options are specific to the given container, device or codec. Private options are listed under
their corresponding containers/devices/codecs.
For example to write an ID3v2.3 header instead of a default ID3v2.4 to an MP3 file, use the
id3v2_version private option of the MP3 muxer:
All codec AVOptions are per-stream, and thus a stream specifier should be attached to them:
ffmpeg -i [Link] -map 0:v:0 -map [Link] -map [Link] -[Link] ac3 -[Link] 640k -[Link] 2 -[Link]
In the above example, a multichannel audio stream is mapped twice for output. The first instance is
encoded with codec ac3 and bitrate 640k. The second instance is downmixed to 2 channels and
encoded with codec aac. A bitrate of 128k is specified for it using absolute index of the output stream.
Note: the -nooption syntax cannot be used for boolean AVOptions, use -option 0/-option 1.
Note: the old undocumented way of specifying per-stream AVOptions by prepending v/a/s to the
options name is now obsolete and will be removed soon.
[Link] 22/65
2/16/26, 2:47 PM ffmpeg Documentation
-i url (input)
input file url
-y (global)
Overwrite output files without asking.
-n (global)
Do not overwrite output files, and exit immediately if a specified output file already exists.
-recast_media (global)
Allow forcing a decoder of a different media type than the one detected or designated by the
demuxer. Useful for decoding media data muxed as data streams.
For example
encodes all video streams with libx264 and copies all audio streams.
will copy all the streams except the second video, which will be encoded with libx264, and the 138th
audio, which will be encoded with libvorbis.
-t duration (input/output)
When used as an input option (before -i ), limit the duration of data read from the input file.
When used as an output option (before an output url), stop writing the output after its duration
reaches duration.
duration must be a time duration specification, see the Time duration section in the ffmpeg-utils(1)
manual (./[Link]#time-duration-syntax).
[Link] 23/65
2/16/26, 2:47 PM ffmpeg Documentation
-to and -t are mutually exclusive and -t has priority.
-to position (input/output)
Stop writing the output or reading the input at position. position must be a time duration
specification, see the Time duration section in the ffmpeg-utils(1) manual (./ffmpeg-
[Link]#time-duration-syntax).
formats it is not possible to seek exactly, so ffmpeg will seek to the closest seek point before
position. When transcoding and -accurate_seek is enabled (the default), this extra segment between
the seek point and position will be decoded and discarded. When doing stream copy or when -
noaccurate_seek is used, it will be preserved.
When used as an output option (before an output url), decodes but discards input until the
timestamps reach position.
position must be a time duration specification, see the Time duration section in the ffmpeg-utils(1)
manual (./[Link]#time-duration-syntax).
This will take the difference between the start times of the target and reference inputs and offset the
timestamps of the target file by that difference. The source timestamps of the two inputs should
derive from the same clock source for expected results. If copyts is set then start_at_zero must
also be set. If either of the inputs has no starting timestamp then no sync adjustment is made.
Acceptable values are those that refer to a valid ffmpeg input index. If the sync reference is the
target index itself or -1, then no adjustment is made to target timestamps. A sync reference may not
itself be synced to any other input.
offset must be a time duration specification, see the Time duration section in the ffmpeg-utils(1)
manual (./[Link]#time-duration-syntax).
The offset is added to the timestamps of the input files. Specifying a positive offset means that the
corresponding streams are delayed by the time duration specified in offset.
[Link] 24/65
2/16/26, 2:47 PM ffmpeg Documentation
Rescale input timestamps. scale should be a floating point number.
date must be a date specification, see the Date section in the ffmpeg-utils(1) manual (./ffmpeg-
[Link]#date-syntax).
An optional metadata_specifier may be given to set metadata on streams, chapters or programs. See
-map_metadata documentation for details.
This option overrides metadata set with -map_metadata . It is also possible to delete metadata by
Default value: by default, all disposition flags are copied from the input stream, unless the output
stream this option applies to is fed by a complex filtergraph - in that case no disposition flags are
set by default.
value is a sequence of disposition flags separated by ’+’ or ’-’. A ’+’ prefix adds the given
disposition, ’-’ removes it. If the first flag is also prefixed with ’+’ or ’-’, the resulting disposition is
the default value updated by value. If the first flag is not prefixed, the resulting disposition is value.
It is also possible to clear the disposition by setting it to 0.
If no -disposition options were specified for an output file, ffmpeg will automatically set the
’default’ disposition flag on the first stream of each type, when there are multiple streams of this
type in the output file and no stream of that type is already marked as default.
For example, to make the second audio stream the default stream:
To make the second subtitle stream the default stream and remove the default disposition from the
first subtitle stream:
[Link] 25/65
2/16/26, 2:47 PM ffmpeg Documentation
To add an embedded cover/thumbnail:
ffmpeg -i in.mp4 -i IMAGE -map 0 -map 1 -c copy -c:v:1 png -disposition:v:1 attached_pic out.mp4
To add the ’original’ and remove the ’comment’ disposition flag from the first audio stream without
removing its other disposition flags:
To remove the ’original’ and add the ’comment’ disposition flag to the first audio stream without
removing its other disposition flags:
To set only the ’original’ and ’comment’ disposition flags on the first audio stream (and remove its
other disposition flags):
Not all muxers support embedded thumbnails, and those who do, only support a few formats, like
JPEG or PNG.
-program [title=title:][program_num=program_num:]st=stream[:st=stream...] (output)
Creates a program with the specified title, program_num and adds the specified stream(s) to it.
-stream_group [map=input_file_id=stream_group][type=type:]st=stream[:st=stream]
[:stg=stream_group][:id=stream_group_id...] (output)
Creates a stream group of the specified type and stream_group_id, or by mapping an input group,
adding the specified stream(s) and/or previously defined stream_group(s) to it.
iamf_audio_element
Groups streams that belong to the same IAMF Audio Element
audio_element_type
The Audio Element type. The following values are supported:
channel
Scalable channel audio representation
scene
Ambisonics representation
demixing
[Link] 26/65
2/16/26, 2:47 PM ffmpeg Documentation
Demixing information used to reconstruct a scalable channel audio representation. This option
must be separated from the rest with a ’,’, and takes the following key=value options
parameter_id
An identifier parameters blocks in frames may refer to
dmixp_mode
A pre-defined combination of demixing parameters
recon_gain
Recon gain information used to reconstruct a scalable channel audio representation. This
option must be separated from the rest with a ’,’, and takes the following key=value options
parameter_id
An identifier parameters blocks in frames may refer to
layer
A layer defining a Channel Layout in the Audio Element. This option must be separated from
the rest with a ’,’. Several ’,’ separated entries can be defined, and at least one must be set.
ch_layout
The layer’s channel layout
flags
The following flags are available:
recon_gain
Whether to signal if recon_gain is present as metadata in parameter blocks within frames
output_gain
output_gain_flags
Which channels output_gain applies to. The following flags are available:
FL
FR
BL
BR
TFL
TFR
ambisonics_mode
The ambisonics mode. This has no effect if audio_element_type is set to channel.
mono
Each ambisonics channel is coded as an individual mono stream in the group
default_w
[Link] 27/65
2/16/26, 2:47 PM ffmpeg Documentation
Default weight value
iamf_mix_presentation
Groups streams that belong to all IAMF Audio Element the same IAMF Mix Presentation references
submix
A sub-mix within the Mix Presentation. This option must be separated from the rest with a ’,’.
Several ’,’ separated entries can be defined, and at least one must be set.
parameter_id
An identifier parameters blocks in frames may refer to, for post-processing the mixed audio
signal to generate the audio signal for playback
parameter_rate
The sample rate duration fields in parameters blocks in frames that refer to this
parameter_id are expressed as
default_mix_gain
Default mix gain value to apply when there are no parameter blocks sharing the same
parameter_id for a given frame
element
References an Audio Element used in this Mix Presentation to generate the final output
audio signal for playback. This option must be separated from the rest with a ’|’. Several ’|’
separated entries can be defined, and at least one must be set.
stg
The stream_group_id for an Audio Element which this sub-mix refers to
parameter_id
An identifier parameters blocks in frames may refer to, for applying any processing to the
referenced and rendered Audio Element before being summed with other processed
Audio Elements
parameter_rate
The sample rate duration fields in parameters blocks in frames that refer to this
parameter_id are expressed as
default_mix_gain
Default mix gain value to apply when there are no parameter blocks sharing the same
parameter_id for a given frame
annotations
A key=value string describing the sub-mix element where "key" is a string conforming to
BCP-47 that specifies the language for the "value" string. "key" must be the same as the
one in the mix’s annotations
headphones_rendering_mode
[Link] 28/65
2/16/26, 2:47 PM ffmpeg Documentation
Indicates whether the input channel-based Audio Element is rendered to stereo
loudspeakers or spatialized with a binaural renderer when played back on headphones.
This has no effect if the referenced Audio Element’s audio_element_type is set to
channel.
stereo
binaural
layout
Specifies the layouts for this sub-mix on which the loudness information was measured.
This option must be separated from the rest with a ’|’. Several ’|’ separated entries can be
defined, and at least one must be set.
layout_type
loudspeakers
The layout follows the loudspeaker sound system convention of ITU-2051-3.
binaural
The layout is binaural.
sound_system
Channel layout matching one of Sound Systems A to J of ITU-2051-3, plus 7.1.2 and
3.1.2 This has no effect if layout_type is set to binaural.
integrated_loudness
The program integrated loudness information, as defined in ITU-1770-4.
digital_peak
The digital (sampled) peak value of the audio signal, as defined in ITU-1770-4.
true_peak
The true peak of the audio signal, as defined in ITU-1770-4.
dialog_anchored_loudness
The Dialogue loudness information, as defined in ITU-1770-4.
album_anchored_loudness
The Album loudness information, as defined in ITU-1770-4.
annotations
A key=value string string describing the mix where "key" is a string conforming to BCP-47 that
specifies the language for the "value" string. "key" must be the same as the ones in all sub-mix
element’s annotationss
E.g. to create an scalable 5.1 IAMF file from several WAV input files
[Link] 29/65
2/16/26, 2:47 PM ffmpeg Documentation
To copy the two stream groups (Audio Element and Mix Presentation) from an input IAMF file with
four streams into an mp4 output
-target type (output)
Specify target file type ( vcd , svcd , dvd , dv , dv50 ). type may be prefixed with pal- ,
ntsc- or film- to use the corresponding standard. All the format options (bitrate, codecs, buffer
sizes) are then set automatically. You can just type:
Nevertheless you can specify additional options as long as you know they do not conflict with the
standard, as in:
VCD
[Link] 30/65
2/16/26, 2:47 PM ffmpeg Documentation
pal:
-f vcd -muxrate 1411200 -muxpreload 0.44 -packetsize 2324
-s 352x288 -r 25
-codec:v mpeg1video -g 15 -b:v 1150k -maxrate:v 1150k -minrate:v 1150k -bufsize:v 327680
-ar 44100 -ac 2
-codec:a mp2 -b:a 224k
ntsc:
-f vcd -muxrate 1411200 -muxpreload 0.44 -packetsize 2324
-s 352x240 -r 30000/1001
-codec:v mpeg1video -g 18 -b:v 1150k -maxrate:v 1150k -minrate:v 1150k -bufsize:v 327680
-ar 44100 -ac 2
-codec:a mp2 -b:a 224k
film:
-f vcd -muxrate 1411200 -muxpreload 0.44 -packetsize 2324
-s 352x240 -r 24000/1001
-codec:v mpeg1video -g 18 -b:v 1150k -maxrate:v 1150k -minrate:v 1150k -bufsize:v 327680
-ar 44100 -ac 2
-codec:a mp2 -b:a 224k
SVCD
pal:
-f svcd -packetsize 2324
-s 480x576 -pix_fmt yuv420p -r 25
-codec:v mpeg2video -g 15 -b:v 2040k -maxrate:v 2516k -minrate:v 0 -bufsize:v 1835008 -scan_offs
-ar 44100
-codec:a mp2 -b:a 224k
ntsc:
-f svcd -packetsize 2324
-s 480x480 -pix_fmt yuv420p -r 30000/1001
-codec:v mpeg2video -g 18 -b:v 2040k -maxrate:v 2516k -minrate:v 0 -bufsize:v 1835008 -scan_offs
-ar 44100
-codec:a mp2 -b:a 224k
film:
-f svcd -packetsize 2324
-s 480x480 -pix_fmt yuv420p -r 24000/1001
-codec:v mpeg2video -g 18 -b:v 2040k -maxrate:v 2516k -minrate:v 0 -bufsize:v 1835008 -scan_offs
-ar 44100
-codec:a mp2 -b:a 224k
DVD
[Link] 31/65
2/16/26, 2:47 PM ffmpeg Documentation
pal:
-f dvd -muxrate 10080k -packetsize 2048
-s 720x576 -pix_fmt yuv420p -r 25
-codec:v mpeg2video -g 15 -b:v 6000k -maxrate:v 9000k -minrate:v 0 -bufsize:v 1835008
-ar 48000
-codec:a ac3 -b:a 448k
ntsc:
-f dvd -muxrate 10080k -packetsize 2048
-s 720x480 -pix_fmt yuv420p -r 30000/1001
-codec:v mpeg2video -g 18 -b:v 6000k -maxrate:v 9000k -minrate:v 0 -bufsize:v 1835008
-ar 48000
-codec:a ac3 -b:a 448k
film:
-f dvd -muxrate 10080k -packetsize 2048
-s 720x480 -pix_fmt yuv420p -r 24000/1001
-codec:v mpeg2video -g 18 -b:v 6000k -maxrate:v 9000k -minrate:v 0 -bufsize:v 1835008
-ar 48000
-codec:a ac3 -b:a 448k
DV
pal:
-f dv
-s 720x576 -pix_fmt yuv420p -r 25
-ar 48000 -ac 2
ntsc:
-f dv
-s 720x480 -pix_fmt yuv411p -r 30000/1001
-ar 48000 -ac 2
film:
-f dv
-s 720x480 -pix_fmt yuv411p -r 24000/1001
-ar 48000 -ac 2
The dv50 target is identical to the dv target except that the pixel format set is yuv422p for all
three standards.
Any user-set value for a parameter above will override the target preset value. In that case, the
output may not comply with the target standard.
-dn (input/output)
As an input option, blocks all data streams of a file from being filtered or being automatically
selected or mapped for any output. See -discard option to disable streams individually.
As an output option, disables data recording i.e. automatic selection or mapping of any data stream.
For full manual control see the -map option.
[Link] 32/65
2/16/26, 2:47 PM ffmpeg Documentation
-dframes number (output)
Set the number of data frames to output. This is an obsolete alias for -frames:d , which you should
use instead.
-q[:stream_specifier] q (output,per-stream)
-qscale[:stream_specifier] q (output,per-stream)
Use fixed quality scale (VBR). The meaning of q/qscale is codec-dependent. If qscale is used without
a stream_specifier then it applies only to the video stream, this is to maintain compatibility with
previous behavior and as specifying the same codec specific value to 2 different codecs that is audio
and video generally is not what is intended when no stream_specifier is used.
filtergraph is a description of the filtergraph to apply to the stream, and must have a single input and
a single output of the same type of the stream. In the filtergraph, the input is associated to the label
in , and the output to the label out . See the ffmpeg-filters manual for more information about
See the -filter_complex option if you want to create filtergraphs with multiple inputs and/or outputs.
-stats (global)
Log encoding progress/statistics as "info"-level log (see -loglevel ). It is on by default, to explicitly
-print_graphs (global)
Prints execution graph details to stderr in the format set via -print_graphs_format.
Progress information is written periodically and at the end of the encoding process. It is made of
"key=value" lines. key consists of only alphanumeric characters. The last key of a sequence of
progress information is always "progress" with the value "continue" or "end".
-stdin
Enable interaction on standard input. On by default unless standard input is used as an input. To
explicitly disable interaction you need to specify -nostdin .
Disabling interaction on standard input is useful, for example, if ffmpeg is in the background process
group. Roughly the same result can be achieved with ffmpeg ... < /dev/null but it requires a shell.
-debug_ts (global)
Print timestamp/latency information. It is off by default. This option is mostly useful for testing and
debugging purposes, and the output format may change from one version to another, so it should
not be employed by portable scripts.
[Link] 34/65
2/16/26, 2:47 PM ffmpeg Documentation
usual way. Attachment streams created with this option will be created after all the other streams
(i.e. those created with -map or automatic mappings).
Note that for Matroska you also have to set the mimetype metadata tag:
(assuming that the attachment stream will be third in the output file).
-dump_attachment[:stream_specifier] filename (input,per-stream)
Extract the matching attachment stream into a file named filename. If filename is empty, then the
value of the filename metadata tag will be used.
Technical note – attachments are implemented as codec extradata, so this option can actually be
used to extract extradata from any stream, not just attachments.
As an input option, ignore any timestamps stored in the file and instead generate timestamps
assuming constant frame rate fps. This is not the same as the -framerate option used for some input
formats like image2 or v4l2 (it used to be the same in older versions of FFmpeg). If in doubt use -
framerate instead of the input option -r.
As an output option:
video encoding
Duplicate or drop frames right before encoding them to achieve constant output frame rate fps.
video streamcopy
Indicate to the muxer that fps is the stream frame rate. No data is dropped or duplicated in this
case. This may produce invalid files if fps does not match the actual stream frame rate as
determined by packet timestamps. See also the setts bitstream filter.
[Link] 35/65
2/16/26, 2:47 PM ffmpeg Documentation
-fpsmax[:stream_specifier] fps (output,per-stream)
Set maximum frame rate (Hz value, fraction or abbreviation).
Clamps output frame rate when output framerate is auto-set and is higher than this value. Useful in
batch processing or when input framerate is wrongly detected as very high. It cannot be set together
with -r . It is ignored during streamcopy.
As an input option, this is a shortcut for the video_size private option, recognized by some
demuxers for which the frame size is either not stored in the file or is configurable – e.g. raw video
or video grabbers.
As an output option, this inserts the scale video filter to the end of the corresponding filtergraph.
Please use the scale filter directly to insert it at the beginning or some other place.
aspect can be a floating point number string, or a string of the form num:den, where num and den
are the numerator and denominator of the aspect ratio. For example "4:3", "16:9", "1.3333", and
"1.7777" are valid argument values.
If used together with -vcodec copy, it will affect the aspect ratio stored at container level, but not the
aspect ratio stored in encoded frames, if it exists.
rotation is a decimal number specifying the amount in degree by which the video should be rotated
counter-clockwise before being displayed.
This option overrides the rotation/display transform metadata stored in the file, if any. When the
video is being transcoded (rather than copied) and -autorotate is enabled, the video will be rotated
at the filtering stage. Otherwise, the metadata will be written into the output file if the muxer
supports it.
If the -display_hflip and/or -display_vflip options are given, they are applied after the rotation
specified by this option.
-display_hflip[:stream_specifier] (input,per-stream)
Set whether on display the image should be horizontally flipped.
-display_vflip[:stream_specifier] (input,per-stream)
Set whether on display the image should be vertically flipped.
[Link] 36/65
2/16/26, 2:47 PM ffmpeg Documentation
-vn (input/output)
As an input option, blocks all video streams of a file from being filtered or being automatically
selected or mapped for any output. See -discard option to disable streams individually.
As an output option, disables video recording i.e. automatic selection or mapping of any video
stream. For full manual control see the -map option.
-pass[:stream_specifier] n (output,per-stream)
Select the pass number (1 or 2). It is used to do two-pass video encoding. The statistics of the video
are recorded in the first pass into a log file (see also the option -passlogfile), and in the second pass
that log file is used to generate the video at the exact requested bitrate. On pass 1, you may just
deactivate audio and set output to null, examples for Windows and Unix:
-autorotate
Automatically rotate the video according to file metadata. Enabled by default, use -noautorotate to
disable it.
-autoscale
Automatically scale the video according to the resolution of first frame. Enabled by default, use -
noautoscale to disable it. When autoscale is disabled, all output frames of filter graph might not be
in the same resolution and may be inadequate for some encoder/muxer. Therefore, it is not
recommended to disable it unless you really know what you are doing. Disable autoscale at your own
risk.
[Link] 37/65
2/16/26, 2:47 PM ffmpeg Documentation
can not be selected, and automatic conversions inside filtergraphs are disabled. If pix_fmt is a single
+ , ffmpeg selects the same pixel format as the input (or graph output) and automatic conversions
are disabled.
-sws_flags flags (input/output)
Set default flags for the libswscale library. These flags are used by automatically inserted scale
filters and those within simple filtergraphs, if not overridden within the filtergraph definition.
-vstats
Dump video coding statistics to vstats_HHMMSS.log. See the vstats file format section for the format
description.
-vstats_file file
Dump video coding statistics to file. See the vstats file format section for the format description.
-vstats_version file
Specify which version of the vstats format to use. Default is 2 . See the vstats file format section for
the format description.
time[,time...]
If the argument consists of timestamps, ffmpeg will round the specified times to the nearest
output timestamp as per the encoder time base and force a keyframe at the first frame having
timestamp equal or greater than the computed timestamp. Note that if the encoder time base is
too coarse, then the keyframes may be forced on frames with timestamps lower than the specified
time. The default encoder time base is the inverse of the output framerate but may be set
otherwise via -enc_time_base .
If one of the times is " chapters [delta]", it is expanded into the time of the beginning of all
chapters in the file, shifted by delta, expressed as a time in seconds. This option can be useful to
ensure that a seek point is present at a chapter mark or any other designated place in the output
file.
For example, to insert a key frame at 5 minutes, plus key frames 0.1 second before the beginning
of every chapter:
[Link] 38/65
2/16/26, 2:47 PM ffmpeg Documentation
-force_key_frames [Link],chapters-0.1
expr:expr
If the argument is prefixed with expr: , the string expr is interpreted like an expression and is
evaluated for each frame. A key frame is forced in case the evaluation is non-zero.
n
the number of current processed frame, starting from 0
n_forced
the number of forced frames
prev_forced_n
the number of the previous forced frame, it is NAN when no keyframe was forced yet
prev_forced_t
the time of the previous forced frame, it is NAN when no keyframe was forced yet
t
the time of the current processed frame
For example to force a key frame every 5 seconds, you can specify:
-force_key_frames expr:gte(t,n_forced*5)
To force a key frame 5 seconds after the time of the last forced one, starting from second 13:
-force_key_frames expr:if(isnan(prev_forced_t),gte(t,13),gte(t,prev_forced_t+5))
source
If the argument is source , ffmpeg will force a key frame if the current frame being encoded is
marked as a key frame in its source. In cases where this particular source frame has to be
dropped, enforce the next available frame to become a key frame instead.
scd_metadata
If the argument is scd_metadata , ffmpeg will force a key frame if the current frame contains a
metadata entry with the key [Link] . The metadata can be added by filters like scdet
and scdet_vulkan . Avoid inserting filters that duplicate frames after scdet , as this can cause
duplicate metadata for multiple frames and repeated insertion of key frames.
Note that forcing too many keyframes is very harmful for the lookahead algorithms of certain
encoders: using fixed-GOP options or similar would be more efficient.
-apply_cropping[:stream_specifier] source (input,per-stream)
Automatically crop the video after decoding according to file metadata. Default is all.
[Link] 39/65
2/16/26, 2:47 PM ffmpeg Documentation
none (0)
Don’t apply any cropping metadata.
all (1)
Apply both codec and container level croppping. This is the default mode.
codec (2)
Apply codec level croppping.
container (3)
Apply container level croppping.
-copyinkf[:stream_specifier] (output,per-stream)
When doing stream copy, copy also non-key frames found at the beginning.
-init_hw_device type[=name][:device[,key=value...]]
Initialise a new hardware device of type type called name, using the given device parameters. If no
name is specified it will receive a default name of the form "type%d".
The meaning of device and the following arguments depends on the device type:
cuda
device is the number of the CUDA device.
primary_ctx
If set to 1, uses the primary device context instead of creating a new one.
Examples:
-init_hw_device cuda:1
Choose the second device on the system.
-init_hw_device cuda:0,primary_ctx=1
Choose the first device and use the primary device context.
dxva2
device is the number of the Direct3D 9 display adapter.
d3d11va
device is the number of the Direct3D 11 display adapter. If not specified, it will attempt to use the
default Direct3D 11 display adapter or the first Direct3D 11 display adapter whose hardware
VendorId is specified by ‘vendor_id’.
Examples:
-init_hw_device d3d11va
Create a d3d11va device on the default Direct3D 11 display adapter.
-init_hw_device d3d11va:1
Create a d3d11va device on the Direct3D 11 display adapter specified by index 1.
-init_hw_device d3d11va:,vendor_id=0x8086
[Link] 40/65
2/16/26, 2:47 PM ffmpeg Documentation
Create a d3d11va device on the first Direct3D 11 display adapter whose hardware VendorId is
0x8086.
vaapi
device is either an X11 display name, a DRM render node or a DirectX adapter index. If not
specified, it will attempt to open the default X11 display ($DISPLAY) and then the first DRM render
node (/dev/dri/renderD128), or the default DirectX adapter on Windows.
kernel_driver
When device is not specified, use this option to specify the name of the kernel driver associated
with the desired device. This option is available only when the hardware acceleration method
drm and vaapi are enabled.
vendor_id
When device and kernel_driver are not specified, use this option to specify the vendor id
associated with the desired device. This option is available only when the hardware acceleration
method drm and vaapi are enabled and kernel_driver is not specified.
Examples:
-init_hw_device vaapi
Create a vaapi device on the default device.
-init_hw_device vaapi:/dev/dri/renderD129
Create a vaapi device on DRM render node /dev/dri/renderD129.
-init_hw_device vaapi:1
Create a vaapi device on DirectX adapter 1.
-init_hw_device vaapi:,kernel_driver=i915
Create a vaapi device on a device associated with kernel driver ‘i915’.
-init_hw_device vaapi:,vendor_id=0x8086
Create a vaapi device on a device associated with vendor id ‘0x8086’.
vdpau
device is an X11 display name. If not specified, it will attempt to open the default X11 display
($DISPLAY).
qsv
device selects a value in ‘MFX_IMPL_*’. Allowed values are:
auto
sw
hw
auto_any
hw_any
hw2
hw3
hw4
[Link] 41/65
2/16/26, 2:47 PM ffmpeg Documentation
If not specified, ‘auto_any’ is used. (Note that it may be easier to achieve the desired result for
QSV by creating the platform-appropriate subdevice (‘dxva2’ or ‘d3d11va’ or ‘vaapi’) and then
deriving a QSV device from that.)
child_device
Specify a DRM render node on Linux or DirectX adapter on Windows.
child_device_type
Choose platform-appropriate subdevice type. On Windows ‘d3d11va’ is used as default
subdevice type when --enable-libvpl is specified at configuration time, ‘dxva2’ is used as
Examples:
-init_hw_device qsv:hw,child_device=/dev/dri/renderD129
Create a QSV device with ‘MFX_IMPL_HARDWARE’ on DRM render node /dev/dri/renderD129.
-init_hw_device qsv:hw,child_device=1
Create a QSV device with ‘MFX_IMPL_HARDWARE’ on DirectX adapter 1.
-init_hw_device qsv:hw,child_device_type=d3d11va
Choose the GPU subdevice with type ‘d3d11va’ and create QSV device with ‘MFX_IMPL_HARDWARE’.
-init_hw_device qsv:hw,child_device_type=dxva2
Choose the GPU subdevice with type ‘dxva2’ and create QSV device with ‘MFX_IMPL_HARDWARE’.
-init_hw_device qsv:hw,child_device=1,child_device_type=d3d11va
Create a QSV device with ‘MFX_IMPL_HARDWARE’ on DirectX adapter 1 with subdevice type
‘d3d11va’.
The set of devices can also be filtered using the key-value pairs to find only devices matching
particular platform or device strings.
platform_profile
platform_version
platform_name
platform_vendor
platform_extensions
device_name
device_vendor
driver_version
[Link] 42/65
2/16/26, 2:47 PM ffmpeg Documentation
device_version
device_profile
device_extensions
device_type
Examples:
-init_hw_device opencl:0.1
Choose the second device on the first platform.
-init_hw_device opencl:,device_name=Foo9000
Choose the device with a name containing the string Foo9000.
-init_hw_device opencl:1,device_type=gpu,device_extensions=cl_khr_fp16
Choose the GPU device on the second platform supporting the cl_khr_fp16 extension.
vulkan
If device is an integer, it selects the device by its index in a system-dependent list of devices. If
device is any other string, it selects the first device with a name containing that string as a
substring.
debug
If set to 1, enables the validation layer, if installed.
linear_images
If set to 1, images allocated by the hwcontext will be linear and locally mappable.
instance_extensions
A plus separated list of additional instance extensions to enable.
device_extensions
A plus separated list of additional device extensions to enable.
Examples:
-init_hw_device vulkan:1
Choose the second device on the system.
-init_hw_device vulkan:RADV
Choose the first device with a name containing the string RADV.
-init_hw_device vulkan:0,instance_extensions=VK_KHR_wayland_surface+VK_KHR_xcb_surface
Choose the first device and enable the Wayland and XCB instance extensions.
-init_hw_device type[=name]@source
Initialise a new hardware device of type type called name, deriving it from the existing device with
the name source.
-init_hw_device list
List all hardware device types supported in this build of ffmpeg.
-filter_hw_device name
[Link] 43/65
2/16/26, 2:47 PM ffmpeg Documentation
Pass the hardware device called name to all filters in any filter graph. This can be used to set the
device to upload to with the hwupload filter, or the device to map to with the hwmap filter. Other
filters may also make use of this parameter when they require a hardware device. Note that this is
typically only required when the input is not already in hardware frames - when it is, filters will
derive the device they require from the context of the frames they receive as input.
This is a global setting, so all filters will receive the same device.
none
Do not use any hardware acceleration (the default).
auto
Automatically select the hardware acceleration method.
vdpau
Use VDPAU (Video Decode and Presentation API for Unix) hardware acceleration.
dxva2
Use DXVA2 (DirectX Video Acceleration) hardware acceleration.
d3d11va
Use D3D11VA (DirectX Video Acceleration) hardware acceleration.
vaapi
Use VAAPI (Video Acceleration API) hardware acceleration.
qsv
Use the Intel QuickSync Video acceleration for video transcoding.
Unlike most other values, this option does not enable accelerated decoding (that is used
automatically whenever a qsv decoder is selected), but accelerated transcoding, without copying
the frames into the system memory.
For it to work, both the decoder and the encoder must support QSV acceleration and no filters
must be used.
videotoolbox
Use Video Toolbox hardware acceleration.
This option has no effect if the selected hwaccel is not available or not supported by the chosen
decoder.
Note that most acceleration methods are intended for playback and will not be faster than software
decoding on modern CPUs. Additionally, ffmpeg will usually need to copy the decoded frames from
the GPU memory into the system memory, resulting in further performance loss. This option is thus
mainly useful for testing.
[Link] 44/65
2/16/26, 2:47 PM ffmpeg Documentation
This option only makes sense when the -hwaccel option is also specified. It can either refer to an
existing device created with -init_hw_device by name, or it can create a new device as if ‘-
init_hw_device’ type:hwaccel_device were called immediately before.
-hwaccels
List all hardware acceleration components enabled in this build of ffmpeg. Actual runtime availability
depends on the hardware and its suitable driver being installed.
-fix_sub_duration_heartbeat[:stream_specifier]
Set a specific output video stream as the heartbeat stream according to which to split and push
through currently in-progress subtitle upon receipt of a random access packet.
This lowers the latency of subtitles for which the end packet or the following subtitle has not yet
been received. As a drawback, this will most likely lead to duplication of subtitle events in order to
cover the full duration, so when dealing with use cases where latency of when the subtitle event is
passed on to output is not relevant this option should not be utilized.
Requires -fix_sub_duration to be set for the relevant input subtitle stream for this to have any
effect, as well as for the input subtitle stream having to be directly mapped to the same output in
which the heartbeat stream resides.
-aq q (output)
Set the audio quality (codec-specific, VBR). This is an alias for -q:a.
-an (input/output)
As an input option, blocks all audio streams of a file from being filtered or being automatically
selected or mapped for any output. See -discard option to disable streams individually.
As an output option, disables audio recording i.e. automatic selection or mapping of any audio
stream. For full manual control see the -map option.
[Link] 45/65
2/16/26, 2:47 PM ffmpeg Documentation
-sample_fmt[:stream_specifier] sample_fmt (output,per-stream)
Set the audio sample format. Use -sample_fmts to get a list of supported sample formats.
-sn (input/output)
As an input option, blocks all subtitle streams of a file from being filtered or being automatically
selected or mapped for any output. See -discard option to disable streams individually.
As an output option, disables subtitle recording i.e. automatic selection or mapping of any subtitle
stream. For full manual control see the -map option.
[Link] 46/65
2/16/26, 2:47 PM ffmpeg Documentation
Note that this option will delay the output of all data until the next subtitle packet is decoded: it may
increase memory consumption and latency a lot.
-canvas_size size
Set the size of the canvas used to render subtitles.
In the first form, an output stream is created for every stream from the input file with the index
input_file_id. If stream_specifier is given, only those streams that match the specifier are used (see
the Stream specifiers section for the stream_specifier syntax).
A - character before the stream identifier creates a "negative" mapping. It disables matching
streams from already created mappings.
An optional view_specifier may be given after the stream specifier, which for multiview video
specifies the view to be used. The view specifier may have one of the following formats:
view:view_id
select a view by its ID; view_id may be set to ’all’ to use all the views interleaved into one stream;
vidx:view_idx
select a view by its index; i.e. 0 is the base view, 1 is the first non-base view, etc.
vpos:position
select a view by its display position; position may be left or right
The default for transcoding is to only use the base view, i.e. the equivalent of vidx:0 . For
streamcopy, view specifiers are not supported and all views are always copied.
[Link] 47/65
2/16/26, 2:47 PM ffmpeg Documentation
A trailing ? after the stream index will allow the map to be optional: if the map matches no
streams the map will be ignored instead of failing. Note the map will still fail if an invalid input file
index is used; such as if the map refers to a non-existent input.
An alternative [linklabel] form will map outputs from complex filter graphs (see the -filter_complex
option) to the output file. linklabel must correspond to a defined output link label in the graph.
This option may be specified multiple times, each adding more streams to the output file. Any given
input stream may also be mapped any number of times as a source for different output streams, e.g.
in order to use different encoding options and/or filters. The streams are created in the output in the
same order in which the -map options are given on the commandline.
Using this option disables the default mappings for this output file.
Examples:
map everything
To map ALL streams from the first input file to output
will map the second input stream in INPUT to the (single) output stream in [Link].
negative map
To map all the streams except the second audio, use negative mappings
optional map
To map the video and audio streams from the first input, and using the trailing ? , ignore the
map by language
To pick the English audio stream:
-ignore_unknown
Ignore input streams with unknown type instead of failing if copying such streams is attempted.
-copy_unknown
Allow input streams with unknown type to be copied instead of failing if copying such streams is
attempted.
g
global metadata, i.e. metadata that applies to the whole file
s[:stream_spec]
per-stream metadata. stream_spec is a stream specifier as described in the Stream specifiers
chapter. In an input metadata specifier, the first matching stream is copied from. In an output
metadata specifier, all matching streams are copied to.
c:chapter_index
per-chapter metadata. chapter_index is the zero-based chapter index.
p:program_index
per-program metadata. program_index is the zero-based program index.
By default, global metadata is copied from the first input file, per-stream and per-chapter metadata
is copied along with streams/chapters. These default mappings are disabled by creating any
mapping of the relevant type. A negative file index can be used to create a dummy mapping that just
disables automatic copying.
For example to copy metadata from the first stream of the input file to global metadata of the output
file:
Note that simple 0 would work as well in this example, since global metadata is assumed by
default.
-benchmark (global)
Show benchmarking information at the end of an encode. Shows real, system and user time used and
maximum memory consumption. Maximum memory consumption is not supported on all systems, it
will usually display as 0 if not supported.
-benchmark_all (global)
Show benchmarking information during the encode. Shows real, system and user time used in
various steps (audio/video encode/decode).
-dump (global)
Dump each input packet to stderr.
-hex (global)
When dumping packets, also dump the payload.
Its value is a floating-point positive number which represents the maximum duration of media, in
seconds, that should be ingested in one second of wallclock time. Default value is zero and
represents no imposed limitation on speed of ingestion. Value 1 represents real-time speed and is
equivalent to -re .
Mainly used to simulate a capture device or live input stream (e.g. when reading from a file). Should
not be used with a low value when input is an actual capture device or live stream as it may cause
packet loss.
It is useful for when flow speed of output packets is important, such as live streaming.
-re (input)
Read input at native frame rate. This is equivalent to setting -readrate 1 .
-readrate_initial_burst seconds
Set an initial read burst time, in seconds, after which -re/-readrate will be enforced.
For compatibility reasons some of the values for vsync can be specified as numbers (shown in
parentheses in the following table).
[Link] 50/65
2/16/26, 2:47 PM ffmpeg Documentation
passthrough (0)
Each frame is passed with its timestamp from the demuxer to the muxer.
cfr (1)
Frames will be duplicated and dropped to achieve exactly the requested constant frame rate.
vfr (2)
Frames are passed through with their timestamp or dropped so as to prevent 2 frames from
having the same timestamp.
auto (-1)
Chooses between cfr and vfr depending on muxer capabilities. This is the default method.
Note that the timestamps may be further modified by the muxer, after this. For example, in the case
that the format option avoid_negative_ts is enabled.
With -map you can select from which stream the timestamps should be taken. You can leave either
video or audio unchanged and sync the remaining stream(s) to the unchanged one.
-frame_drop_threshold parameter
Frame drop threshold, which specifies how much behind video frames can be before they are
dropped. In frame rate units, so 1.0 is one frame. The default is -1.1. One possible usecase is to
avoid framedrops in case of noisy timestamps or to increase frame drop precision in case of exact
timestamps.
parameters composed the same as with the apad filter. -shortest must be set for this output for
the option to take effect.
-copyts
Do not process input timestamps, but keep their values without trying to sanitize them. In particular,
do not remove the initial start time offset value.
Note that, depending on the vsync option or on specific muxer processing (e.g. in case the format
option avoid_negative_ts is enabled) the output timestamps may mismatch with the input
timestamps even when this option is selected.
-start_at_zero
When used with copyts, shift input timestamps so they start at zero.
This means that using e.g. -ss 50 will make output timestamps start at 50 seconds, regardless of
what timestamp the input file started at.
-copytb mode
Specify how to set the encoder timebase when stream copying. mode is an integer numeric value,
and can assume one of the following values:
1
Use the demuxer timebase.
[Link] 51/65
2/16/26, 2:47 PM ffmpeg Documentation
The time base is copied to the output encoder from the corresponding input demuxer. This is
sometimes required to avoid non monotonically increasing timestamps when copying video
streams with variable frame rate.
0
Use the decoder timebase.
The time base is copied to the output encoder from the corresponding input decoder.
-1
Try to make the choice automatically, in order to generate a sane output.
0
Assign a default value according to the media type.
demux
Use the timebase from the demuxer.
filter
Use the timebase from the filtergraph.
a positive number
Use the provided number as the timebase.
This field can be provided as a ratio of two integers (e.g. 1:24, 1:48000) or as a decimal number
(e.g. 0.04166, 2.0833e-5)
Default value is 0.
-bitexact (input/output)
Enable bitexact mode for (de)muxer and (de/en)coder
-shortest (output)
Finish encoding when the shortest output stream ends.
Note that this option may require buffering frames, which introduces extra latency. The maximum
amount of this latency may be controlled with the -shortest_buf_duration option.
This option controls the maximum duration of buffered frames in seconds. Larger values may allow
the -shortest option to produce more accurate results, but increase memory use and latency.
-dts_delta_threshold threshold
Timestamp discontinuity delta threshold, expressed as a decimal number of seconds.
[Link] 52/65
2/16/26, 2:47 PM ffmpeg Documentation
The timestamp discontinuity correction enabled by this option is only applied to input formats
accepting timestamp discontinuity (for which the AVFMT_TS_DISCONT flag is enabled), e.g. MPEG-TS
and HLS, and is automatically disabled when employing the -copyts option (unless wrapping is
detected).
If a timestamp discontinuity is detected whose absolute value is greater than threshold, ffmpeg will
remove the discontinuity by decreasing/increasing the current DTS and PTS by the corresponding
delta value.
The timestamp correction enabled by this option is only applied to input formats not accepting
timestamp discontinuity (for which the AVFMT_TS_DISCONT flag is not enabled).
If a timestamp discontinuity is detected whose absolute value is greater than threshold, ffmpeg will
drop the PTS/DTS timestamp value.
The default value is 3600*30 (30 hours), which is arbitrarily picked and quite conservative.
For example, to set the stream 0 PID to 33 and the stream 1 PID to 36 for an output mpegts file:
filter[=optname0=optval0:optname1=optval1:...]
Any of the ’,=:’ characters that are to be a part of an option value need to be escaped with a
backslash.
E.g.
[Link] 53/65
2/16/26, 2:47 PM ffmpeg Documentation
applies the h264_mp4toannexb bitstream filter (which converts MP4-encapsulated H.264 stream to
Annex B) to the input video stream.
ffmpeg -i [Link] -an -vn -bsf:s mov2textsub -c:s copy -f rawvideo [Link]
applies the mov2textsub bitstream filter (which extracts text from MOV subtitles) to the output
subtitle stream. Note, however, that since both examples use -c copy , it matters little whether the
filters are applied on input or output - that would change if transcoding was happening.
-tag[:stream_specifier] codec_tag (input/output,per-stream)
Force a tag/fourcc for matching streams.
-timecode hh:mm:ssSEPff
Specify Timecode for writing. SEP is ’:’ for non drop timecode and ’;’ (or ’.’) for drop.
Inputs to a complex filtergraph may come from different source types, distinguished by the format of
the corresponding link label:
To connect an input stream, use [file_index:stream_specifier] (i.e. the same syntax as -map).
If stream_specifier matches multiple streams, the first one will be used. For multiview video,
the stream specifier may be followed by the view specifier, see documentation for the -map
option for its syntax.
To connect a loopback decoder use [dec:dec_idx], where dec_idx is the index of the loopback
decoder to be connected to given input. For multiview video, the decoder index may be
followed by the view specifier, see documentation for the -map option for its syntax.
To connect an output from another complex filtergraph, use its link label. E.g the following
example:
ffmpeg -i [Link] \
-filter_complex '[0:v]scale=size=hd1080,split=outputs=2[for_enc][orig_scaled]' \
-c:v libx264 -map '[for_enc]' [Link] \
-dec 0:0 \
-filter_complex '[dec:0][orig_scaled]hstack[stacked]' \
-map '[stacked]' -c:v ffv1 [Link]
[Link] 54/65
2/16/26, 2:47 PM ffmpeg Documentation
(line 2) uses a complex filtergraph with one input and two outputs to scale the video to
1920x1080 and duplicate the result to both outputs;
(line 3) encodes one scaled output with libx264 and writes the result to [Link];
(line 4) decodes this encoded stream with a loopback decoder;
(line 5) places the output of the loopback decoder (i.e. the libx264 -encoded video) side
by side with the scaled original input;
(line 6) combined video is then losslessly encoded and written into [Link].
Note that the two filtergraphs cannot be combined into one, because then there would be a
cycle in the transcoding pipeline (filtergraph output goes to encoding, from there to decoding,
then back to the same graph), and such cycles are not allowed.
An unlabeled input will be connected to the first unused input stream of the matching type.
Output link labels are referred to with -map. Unlabeled outputs are added to the first output file.
Note that with this option it is possible to use only lavfi sources without normal input files.
Here [0:v] refers to the first video stream in the first input file, which is linked to the first (main)
input of the overlay filter. Similarly the first video stream in the second input is linked to the second
(overlay) input of overlay.
Assuming there is only one video stream in each input file, we can omit input labels, so the above is
equivalent to
Furthermore we can omit the output label and the single output from the filter graph will be added to
the output file automatically, so we can simply write
As a special exception, you can use a bitmap subtitle stream as input: it will be converted into a
video with the same size as the largest video in the file, or 720x576 if no video is present. Note that
this is an experimental and temporary solution. It will be removed once libavfilter has proper support
for subtitles.
For example, to hardcode subtitles on top of a DVB-T recording stored in MPEG-TS format, delaying
the subtitles by 1 second:
[Link] 55/65
2/16/26, 2:47 PM ffmpeg Documentation
(0x2d0, 0x2dc and 0x2ef are the MPEG-TS PIDs of respectively the video, audio and subtitles
streams; 0:0, 0:3 and 0:7 would have worked too)
-accurate_seek (input)
This option enables or disables accurate seeking in input files with the -ss option. It is enabled by
default, so seeking is accurate when transcoding. Use -noaccurate_seek to disable it, which may be
useful e.g. when copying some streams and transcoding the others.
-seek_timestamp (input)
This option enables or disables seeking by timestamp in input files with the -ss option. It is disabled
by default. If enabled, the argument to the -ss option is considered an actual timestamp, and is not
offset by the start time of the file. This matters only for files which do not start from timestamp 0,
such as transport streams.
For output, this option specified the maximum number of packets that may be queued to each
muxing thread.
-discard (input)
Allows discarding specific streams or frames from streams. Any input stream can be fully discarded,
using value all whereas selective discarding of frames from a stream occurs at the demuxer and is
not supported by all demuxers.
none
Discard no frame.
default
Default, which discards no frames.
noref
Discard all non-reference frames.
[Link] 56/65
2/16/26, 2:47 PM ffmpeg Documentation
bidir
Discard all bidirectional frames.
nokey
Discard all frames excepts keyframes.
all
Discard all frames.
-abort_on flags (global)
Stop and abort on various conditions. The following flags are available:
empty_output
No packets were passed to the muxer, the output is empty.
empty_output_stream
No packets were passed to the muxer in some of the output streams.
-max_error_rate (global)
Set fraction of decoding frame failures across all inputs which when crossed ffmpeg will return exit
code 69. Crossing this threshold does not terminate processing. Range is a floating-point number
between 0 to 1. Default is 2/3.
-xerror (global)
Stop and exit on error
The default value of this option should be high enough for most uses, so only touch this option if
you are sure that you need it.
-auto_conversion_filters (global)
Enable automatically inserting format conversion filters in all filter graphs, including those defined by
-vf, -af, -filter_complex and -lavfi. If filter format negotiation requires a conversion, the
initialization of the filters will fail. Conversions can still be performed by inserting the relevant
conversion filter (scale, aresample) in the graph. On by default, to explicitly disable it you need to
specify -noauto_conversion_filters .
[Link] 57/65
2/16/26, 2:47 PM ffmpeg Documentation
Write per-frame encoding information about the matching streams into the file given by path.
-stats_enc_pre writes information about raw video or audio frames right before they are sent for
encoding, while -stats_enc_post writes information about encoded packets as they are received
from the encoder. -stats_mux_pre writes information about packets just as they are about to be sent
to the muxer. Every frame or packet produces one line in the specified file. The format of this line is
controlled by -stats_enc_pre_fmt / -stats_enc_post_fmt / -stats_mux_pre_fmt.
When stats for multiple streams are written into a single file, the lines corresponding to different
streams will be interleaved. The precise order of this interleaving is not specified and not guaranteed
to remain stable between different invocations of the program, even with the same options.
format_spec is a string that may contain directives of the form {fmt}. format_spec is backslash-
escaped — use \{, \}, and \\ to write a literal {, }, or \, respectively, into the output.
fidx
Index of the output file.
sidx
Index of the output stream in the file.
n
Frame number. Pre-encoding: number of frames sent to the encoder so far. Post-encoding:
number of packets received from the encoder so far. Muxing: number of packets submitted to the
muxer for this stream so far.
ni
Input frame number. Index of the input frame (i.e. output by a decoder) that corresponds to this
output frame or packet. -1 if unavailable.
tb
Timebase in which this frame/packet’s timestamps are expressed, as a rational number num/den.
Note that encoder and muxer may use different timebases.
tbi
Timebase for ptsi, as a rational number num/den. Available when ptsi is available, 0/1 otherwise.
pts
Presentation timestamp of the frame or packet, as an integer. Should be multiplied by the
timebase to compute presentation time.
ptsi
Presentation timestamp of the input frame (see ni), as an integer. Should be multiplied by tbi to
compute presentation time. Printed as (2^63 - 1 = 9223372036854775807) when not available.
t
Presentation time of the frame or packet, as a decimal number. Equal to pts multiplied by tb.
ti
[Link] 58/65
2/16/26, 2:47 PM ffmpeg Documentation
Presentation time of the input frame (see ni), as a decimal number. Equal to ptsi multiplied by tbi.
Printed as inf when not available.
dts (packet)
Decoding timestamp of the packet, as an integer. Should be multiplied by the timebase to
compute presentation time.
dt (packet)
Decoding time of the frame or packet, as a decimal number. Equal to dts multiplied by tb.
sn (frame,audio)
Number of audio samples sent to the encoder so far.
samp (frame,audio)
Number of audio samples in the frame.
size (packet)
Size of the encoded packet in bytes.
br (packet)
Current bitrate in bits per second.
abr (packet)
Average bitrate for the whole stream so far, in bits per second, -1 if it cannot be determined at
this point.
key (packet)
Character ’K’ if the packet contains a keyframe, character ’N’ otherwise.
Directives tagged with packet may only be used with -stats_enc_post_fmt and -stats_mux_pre_fmt.
Directives tagged with audio may only be used with audio streams.
pre-encoding
{fidx} {sidx} {n} {t}
post-encoding
{fidx} {sidx} {n} {t}
In the future, new items may be added to the end of the default formatting strings. Users who
depend on the format staying exactly the same, should prescribe it manually.
Note that stats for different streams written into the same file may have different formats.
ffpreset files are specified with the vpre , apre , spre , and fpre options. The fpre option takes
the filename of the preset instead of a preset name as input and can be used for any kind of codec. For
the vpre , apre , and spre options, the options specified in a preset file are applied to the currently
selected codec of the same type as the preset option.
The argument passed to the vpre , apre , and spre preset options identifies the preset file to use
according to the following rules:
First ffmpeg searches for a file named [Link] in the directories $FFMPEG_DATADIR (if set), and
$HOME/.ffmpeg, and in the datadir defined at configuration time (usually PREFIX/share/ffmpeg) or in a
ffpresets folder along the executable on win32, in that order. For example, if the argument is
libvpx-1080p , it will search for the file [Link].
If no such file is found, then ffmpeg will search for a file named codec_name-[Link] in the above-
mentioned directories, where codec_name is the name of the codec to which the preset file options will
be applied. For example, if you select the video codec with -vcodec libvpx and use -vpre 1080p ,
avpreset files are specified with the pre option. They work similar to ffpreset files, but they only allow
encoder- specific options. Therefore, an option=value pair specifying an encoder cannot be used.
When the pre option is specified, ffmpeg will look for files with the suffix .avpreset in the directories
$AVCONV_DATADIR (if set), and $HOME/.avconv, and in the datadir defined at configuration time (usually
PREFIX/share/ffmpeg), in that order.
First ffmpeg searches for a file named codec_name-[Link] in the above-mentioned directories,
where codec_name is the name of the codec to which the preset file options will be applied. For
example, if you select the video codec with -vcodec libvpx and use -pre 1080p , then it will search
If no such file is found, then ffmpeg will search for a file named [Link] in the same directories.
[Link] 60/65
2/16/26, 2:47 PM ffmpeg Documentation
The -vstats_version option controls the format version of the generated file.
frame= FRAME q= FRAME_QUALITY PSNR= PSNR f_size= FRAME_SIZE s_size= STREAM_SIZEkB time= TIMESTAMP
out= OUT_FILE_INDEX st= OUT_FILE_STREAM_INDEX frame= FRAME_NUMBER q= FRAME_QUALITYf PSNR= PSNR f_s
avg_br
average bitrate expressed in Kbits/s
br
bitrate expressed in Kbits/s
frame
number of encoded frame
out
out file index
PSNR
Peak Signal to Noise Ratio
q
quality of the frame
f_size
encoded packet size expressed as number of bytes
s_size
stream size expressed in KiB
st
out file stream index
time
time of the packet
type
picture type
See also the -stats_enc options for an alternative way to show encoding statistics.
6 Examples
[Link] 61/65
2/16/26, 2:47 PM ffmpeg Documentation
Note that you must activate the right video source and channel before launching ffmpeg with any TV
viewer such as xawtv ([Link] by Gerd Knorr. You also have to set the audio
recording levels correctly with a standard mixer.
0.0 is [Link] number of your X11 server, same as the DISPLAY environment variable.
0.0 is [Link] number of your X11 server, same as the DISPLAY environment variable. 10 is the
x-offset and 20 the y-offset for the grabbing.
Examples:
The Y files use twice the resolution of the U and V files. They are raw files, without header. They
can be generated by all decent video decoders. You must specify the size of the image with the -s
option if ffmpeg cannot guess it.
[Link] is a file containing raw YUV planar data. Each frame is composed of the Y plane followed
by the U and V planes at half vertical and horizontal resolution.
You can output to a raw YUV420P file:
Converts the audio file [Link] and the raw YUV video file [Link] to MPEG file [Link].
You can also do audio and video conversions at the same time:
You can encode to several formats at the same time and define a mapping from input stream to
output streams:
ffmpeg -i /tmp/[Link] -map 0:a -b:a 64k /tmp/a.mp2 -map 0:a -b:a 128k /tmp/b.mp2
Converts [Link] to a.mp2 at 64 kbits and to b.mp2 at 128 kbits. ’-map file:index’ specifies which
input stream is used for each output stream, in the order of the definition of output streams.
ffmpeg -i snatch_1.vob -f avi -c:v mpeg4 -b:v 800k -g 300 -bf 2 -c:a libmp3lame -b:a 128k sna
This is a typical DVD ripping example; the input is a VOB file, the output an AVI file with MPEG-4
video and MP3 audio. Note that in this command we use B-frames so the MPEG-4 stream is DivX5
compatible, and GOP size is 300 which means one intra frame every 10 seconds for 29.97fps
input video. Furthermore, the audio stream is MP3-encoded so you need to enable LAME support
by passing --enable-libmp3lame to configure. The mapping is particularly useful for DVD
transcoding to get the desired audio language.
You can extract images from a video, or create a video from many images:
For extracting images from a video:
This will extract one video frame per second from the video and will output them in files named
[Link], [Link], etc. Images will be rescaled to fit the new WxH values.
[Link] 63/65
2/16/26, 2:47 PM ffmpeg Documentation
If you want to extract just a limited number of frames, you can use the above command in
combination with the -frames:v or -t option, or in combination with -ss to start extracting
from a certain point in time.
The syntax foo-%[Link] specifies to use a decimal number composed of three digits padded
with zeroes to express the sequence number. It is the same syntax supported by the C printf
function, but only formats accepting a normal integer are suitable.
When importing an image sequence, -i also supports expanding shell-like wildcard patterns
(globbing) internally, by selecting the image2-specific -pattern_type glob option.
For example, for creating a video from filenames matching the glob pattern foo-*.jpeg :
You can put many streams of the same type in the output:
ffmpeg -i [Link] -i [Link] -map 1:1 -map 1:0 -map 0:1 -map 0:0 -c copy -y [Link]
The resulting output file [Link] will contain the first four streams from the input files in
reverse order.
ffmpeg -i [Link] -b 4000k -minrate 4000k -maxrate 4000k -bufsize 1835k out.m2v
The four options lmin, lmax, mblmin and mblmax use ’lambda’ units, but you may use the
QP2LAMBDA constant to easily convert from ’q’ units:
7 See Also
ffmpeg-all ([Link]), ffplay ([Link]), ffprobe ([Link]), ffmpeg-utils (ffmpeg-
[Link]), ffmpeg-scaler ([Link]), ffmpeg-resampler ([Link]), ffmpeg-
codecs ([Link]), ffmpeg-bitstream-filters ([Link]), ffmpeg-
formats ([Link]), ffmpeg-devices ([Link]), ffmpeg-protocols (ffmpeg-
[Link]), ffmpeg-filters ([Link])
8 Authors
The FFmpeg developers.
[Link] 64/65
2/16/26, 2:47 PM ffmpeg Documentation
For details about the authorship, see the Git history of the project ([Link] e.g.
by typing the command git log in the FFmpeg source directory, or browsing the online repository at
[Link] ([Link]
Maintainers for the specific components are listed in the file MAINTAINERS in the source code tree.
[Link] 65/65