Feature request: use ffmpeg's `-ac` flag for downmixing

imax · November 29, 2020, 12:00pm

At least as of 1.20.1.3252, Plex server is using aresample with ocl=stereo to downmix 5.1 audio when playing on chromecast. But resulting audio has much lower volume of speech compared to sound effects. ffmpeg has -ac flag specifically for downmixing, which works differently from aresample filter.

To further illustrate the difference, below is a screenshot of waveforms of 3 tracks downmixed from the same source:

ffmpeg -filter_complex '[0:0] aresample=async=1:ocl='stereo':rematrix_maxval=0.000000dB:osr=48000'
ffmpeg -ac 2
ffmpeg-normalize -e '-ac 2' (which also applies loudnorm filter)

As you can see, output of aresample is much more quiet than the other two methods.

Volts · November 29, 2020, 6:10pm

Plex’s downmix being too quiet is a common complaint. And there are other feature requests for louder audio. But I think you’ve hit the nail on the head with this one.

A really good argument for -ac is that it obeys the NTSC recommendations for downmixing.

I’ve had this bookmarked for years, and it has analysis similar to yours.

I really like the “Nightmode Dialogue” answer for listening in noisy places, Planes, Trains and Automobiles.

I wonder if Plex’s method predates -ac working well, or if it provides other functionality/generality. (Or if it plays well with EasyAudioEncoder.)

OttoKerner · November 29, 2020, 7:01pm

As much as I agree that there is much to be desired with the loudness of downmixed audio, I somehow doubt that the loudnorm filter can be applied while streaming the file. From what I gathered working with ffmpeg, this filter requires a separate “analysis” run, prior to the actual conversion.

imax · November 29, 2020, 8:28pm

Yeah, loudnorm works better with two passes, but it can also work in one pass.

However, that’s beside the point: using -ac already would be a huge improvement. I think it even might be as simple as removing ocl=2 and adding -ac 2 in the transcoder command line.

ChuckPa · November 29, 2020, 8:50pm

I chatted with the transcoder team.

-ac injects the aresample filter but does not inject the rematrix_maxval .
They suggest increasing the "multichannel audio boost" setting

imax · November 29, 2020, 9:01pm

“Multi-channel audio boost” did very much close to nothing on any video and player combination I tried

Volts · November 29, 2020, 10:58pm

Now I’m wondering why rematrix_maxval is set. The default is 1, which would already avoid clipping - possibly at the expense of loudness. If it’s being set to 0, I’m confused that things aren’t louder.

Every time I think I understand some parts of Plex I learn that I’m missing an entire dimension.

imax · November 30, 2020, 9:44am

I made a simple wrapper around Plex Transcoder to do exactly that: remove ocl and insert -ac 2 right after that. It seems to work so far, but I’ll keep an eye out for breakages.

wrapper

package main

import (
	"os"
	"regexp"
	"syscall"
)

var oclRe = regexp.MustCompile(":ocl=(2|stereo|'stereo'):")

func rewriteArgs(args []string) []string {
	r := []string{args[0]}
	for i := 1; i < len(args); i++ {
		switch {
		case args[i] == "-filter_complex":
			r = append(r, args[i])
			i++
			m := oclRe.FindStringSubmatchIndex(args[i])
			if len(m) >= 2 {
				r = append(r, args[i][0:m[0]]+args[i][m[1]-1:], "-ac", "2")
			} else {
				r = append(r, args[i])
			}
		default:
			r = append(r, args[i])
		}
	}
	return r
}

func main() {
	args := rewriteArgs(os.Args)
	syscall.Exec(args[0]+"_org", args, os.Environ())
}

imax · December 2, 2020, 11:36pm

Ugh, just adding -ac right after -filter_complex doesn’t actually fix the relative volume issue. I’ll need to tinker with flags a bit more.

Volts · December 2, 2020, 11:43pm

I’d done the same style wrapper for Plex Transcoder before, but yours is nicer than what I did. TY for sharing.

imax · December 4, 2020, 12:54pm

It took a bit of effort, but here’s a rundown of various flag combinations: https://github.com/gelraen/ffmpeg-downmix#results

 0.495810 source
 0.465214 aresample-no_rematrix_maxval-ac
 0.465214 aresample-no_rematrix_maxval
 0.465214 ac-2
 0.455529 loudnorm-aresample-no_rematrix_maxval_ac
 0.455529 loudnorm-aresample-no_rematrix_maxval
 0.455529 aresample-loudnorm-aresample
 0.189032 aresample-ac
 0.189032 aresample
 0.179328 loudnorm-aresample-ac
 0.179328 loudnorm-aresample

(number in the first column is max level in each output)

It seems that it’s rematrix_maxval that is messing up the volume. loudnorm can bring it back to normal, but only if placed after aresample (and then another aresample instance is needed to get to the final desired sample rate from 192kHz).

I think I’ll just filter out rematrix_maxval from ffmpeg invocation and see if anything breaks.

Volts · December 4, 2020, 8:30pm

It makes intuitive sense that setting maxval would have an impact on the maximum value.

I haven’t found that intuition is always appropriate with ffmpeg.

Eager to hear how you get on with it.

imax · December 18, 2020, 12:51pm

Yup, removing rematrix_maxval helped with volume when downmixing, and I haven’t seen any issues.

Can we please have at least a checkbox to disable it?

Volts · December 18, 2020, 1:29pm

What does your wrapper script currently look like?

imax · December 18, 2020, 4:16pm

package main

import (
        "os"
        "regexp"
        "syscall"
)

var oclRe = regexp.MustCompile(":rematrix_maxval=[^:]+:")

func rewriteArgs(args []string) []string {
        r := []string{args[0]}
        for i := 1; i < len(args); i++ {
                switch {
                case args[i] == "-filter_complex":
                        i++
                        m := oclRe.FindStringSubmatchIndex(args[i])
                        if len(m) >= 2 {
                                r = append(r, args[i-1], args[i][0:m[0]]+args[i][m[1]-1:])
                        } else {
                                r = append(r, args[i-1], args[i])
                        }
                default:
                        r = append(r, args[i])
                }
        }
        return r
}

func main() {
        args := rewriteArgs(os.Args)
        syscall.Exec(args[0]+"_org", args, os.Environ())
}

Volts · December 19, 2020, 1:13am

Wrapper installed, testing. TYVM. Will report back too.

system · March 19, 2021, 1:13am

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Force ac3 5.1 downmix Apps & Creations	34	619	December 21, 2019
Sharing some ffmpeg scripts I made to down convert audio in batch as well a 'night mode / DRC' General Discussions	14	4261	January 24, 2019
Possible ffmpeg solution: low volume when transcoding surround to stereo Plex Media Server server-linux-arm	1	102	November 25, 2022
Weirdly fluctuating volume/normalization when transcoding TrueHD to EAC3 General Discussions server-linux , lg-webos	8	301	June 27, 2021
PMP audio downmix General Discussions	4	130	January 8, 2020

Feature request: use ffmpeg's `-ac` flag for downmixing

Related topics