[WIP] Replace FFmpeg with GStreamer #161

flibitijibibo · 2019-10-21T20:47:38Z

I've written a new WMA decoder that uses GStreamer instead of FFmpeg directly. There are a few reasons for this:

Support on Windows and macOS is surprisingly good compared to FFmpeg, particularly for developers
GStreamer is available on pretty much every Linux distribution, while FFmpeg might not be (for legal reasons). This allows us to be more robust; we check for any known WMA plugins, then return an error code when no plugin is found. Distributions that don't want to package said plugins for legal reasons can still build against GStreamer and the user can install the plugin if they want.
GStreamer is the best way to access official (and more importantly, legal) WMA support via the Fluendo Codec Pack, a set of plugins that work with existing GStreamer installations and are in rare cases bundled in certain products (Ubuntu comes with Fluendo mp3, for example).

The current revision mostly works, but similar to #95 it seems like GStreamer (or possibly the decoder?) is withholding samples, even though each packet should be entirely self-sufficient without the need for information from neighboring packets. You can see the difference by setting REMOVE_ME to 1 in FAudio_gstreamer.c; the intended size should always be exactly the same as the actual size, and yet it isn't most of the time! If there was a way to force a full decode of the block and flush the entire decoded stream every time a packet was sent, it would most likely fix this problem.

This can be tested by setting GSTREAMER=ON BUILD_UTILS=ON in the CMake configuration and running testxwma with this test file. GStreamer debugging information can be found here.

flibitijibibo · 2019-10-25T19:37:35Z

Did a bit more work and it actually seems like it might be the decoder's fault. For laughs I went and submitted the entire buffer at once and attempted to pull samples from that, and it continued to send blocks incorrectly in the exact same way, so the way we submit data has absolutely no relevance, the decoder simply refuses to present us with the right number of samples. This is a problem because the dpds table actually matters a lot, we can't just decode linearly and expect everything to be safe, in rare cases we have to go backward in time and we also have to concern ourselves with play regions, so lazily decoding is not an option.

sdroege · 2019-10-25T21:18:41Z

OOC, which decoder are you actually using here? The Fluendo WMA decoder or the ffmpeg one? The latter should literally give you the same amount of samples per packet as the previous ffmpeg code did (unless your conversion actually causes resampling).

flibitijibibo · 2019-10-26T01:26:45Z

I’ve tested both Fluendo and gst-libav, both produce different results. We know the latter will be wrong per #95, but I would expect a more official codec to be accurate... I’ll see what their team has to say.

The existing FFmpeg decoder ignores the block size problem to sound mostly correct, at the expense of numerous issues that are potentially unsafe to the user should the decoder veer off too far from the intended result.

sdroege · 2019-10-27T08:57:24Z

I’ve tested both Fluendo and gst-libav, both produce different results. We know the latter will be wrong per #95

Not that it's going to help you, but with gst-libav you should get exactly the same results in this regard as with using ffmpeg directly. There's no additional padding, buffering, clipping, etc. between the calls to ffmpeg and the GStreamer layer here.

I can't speak for the Fluendo decoder but I would assume it to be based on the official Microsoft SDK.

The existing FFmpeg decoder ignores the block size problem to sound mostly correct, at the expense of numerous issues that are potentially unsafe to the user should the decoder veer off too far from the intended result.

So if I understand you correctly, what you must be able to do is:

Read the block size, number of samples per WMA packet on your side
Pass one WMA packet at a time to the decoder and get back exactly that many samples before passing any further packets to the decoder

And I assume it's not enough for you to only get the correct number of samples overall for the whole stream.

I think in that case your only option is to directly use a single WMA decoder library (and no abstraction like ffmpeg or GStreamer on top of it) directly, as only then you can actually guarantee these points. Otherwise you have no control over the actual decoder implementation and it can behave different as it has different requirements than you.

flibitijibibo · 2019-10-27T15:29:08Z

A direct decoder would be great, but I’m unaware of anything like that other than the MS SDK, which is private. I definitely would have gone for that from the start, similar to how we use stb_vorbis for that format.

FFmpeg comes really close to doing what we want, it just isn’t accurate enough. If it was, we would be able to get the right size since we only submit one packet at a time and it doesn’t keep samples from us.

soredake · 2020-01-15T08:53:45Z

Any progress on this?

Can you tell that there are zero examples for bytes -> decoder -> bytes?

soredake · 2020-03-19T13:51:36Z

If someone wants to test this pr on arch, i made packages (rename to .tar.xz):
faudio-tkg-git-20.01.r16.ge3a36c8-1-x86_64.pkg.zip
lib32-faudio-tkg-git-20.01.r16.ge3a36c8-1-x86_64.pkg.zip

soredake · 2020-03-19T15:49:37Z

Sound in fallout 4 not working, log with gst_debug=4
f4.log

flibitijibibo · 2020-03-19T16:12:53Z

Do not use this patch unless you are using a custom GStreamer codec that does not exist. The existing decoders will not work correctly with this system.

soredake · 2020-03-19T16:16:16Z

Ok, get it.

flibitijibibo · 2020-05-21T14:30:37Z

Closing in favor of #192, which includes this patch.

flibitijibibo force-pushed the gstreamer branch 2 times, most recently from a0664f9 to 8954444 Compare October 25, 2019 17:07

flibitijibibo force-pushed the gstreamer branch from 5ee7a3a to 0a05b65 Compare October 27, 2019 05:38

Initial work to replace FFmpeg with GStreamer.

e3a36c8

Can you tell that there are zero examples for bytes -> decoder -> bytes?

flibitijibibo force-pushed the gstreamer branch from 0a05b65 to e3a36c8 Compare January 28, 2020 22:09

flibitijibibo closed this May 21, 2020

flibitijibibo deleted the gstreamer branch May 21, 2020 14:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Replace FFmpeg with GStreamer #161

[WIP] Replace FFmpeg with GStreamer #161

flibitijibibo commented Oct 21, 2019 •

edited

Loading

flibitijibibo commented Oct 25, 2019

sdroege commented Oct 25, 2019

flibitijibibo commented Oct 26, 2019

sdroege commented Oct 27, 2019

flibitijibibo commented Oct 27, 2019

soredake commented Jan 15, 2020

soredake commented Mar 19, 2020

soredake commented Mar 19, 2020

flibitijibibo commented Mar 19, 2020

soredake commented Mar 19, 2020

flibitijibibo commented May 21, 2020

[WIP] Replace FFmpeg with GStreamer #161

[WIP] Replace FFmpeg with GStreamer #161

Conversation

flibitijibibo commented Oct 21, 2019 • edited Loading

flibitijibibo commented Oct 25, 2019

sdroege commented Oct 25, 2019

flibitijibibo commented Oct 26, 2019

sdroege commented Oct 27, 2019

flibitijibibo commented Oct 27, 2019

soredake commented Jan 15, 2020

soredake commented Mar 19, 2020

soredake commented Mar 19, 2020

flibitijibibo commented Mar 19, 2020

soredake commented Mar 19, 2020

flibitijibibo commented May 21, 2020

flibitijibibo commented Oct 21, 2019 •

edited

Loading