Keep samples in 64 bit #773

roderickvd · 2021-05-27T22:22:26Z

Perhaps counter-intuitively, the main reason for doing this is higher performance. Until now we are storing samples in f32 but casting them to f64 every time we manipulate them (normalisation, volume control), then cast them back again. The idea is that we can save CPU cycles and RAM copies by just keeping them in f64.

Added benefits are:

Support for F64 audio format
Even lower quantization error (bragging rights)

To re-iterate, 64 bit sample processing is already in, this just two places of casting back and forth while introducing one new one (reading lewton samples). So compared to the current situation, this should result in the following cases:

Volume control	Normalisation	Performance	Sound quality
Softvol	Disabled	Equal	Better
Softvol	Enabled	Faster	Better
Softvol @ MAX	Disabled	Slower	Equal
Softvol @ MAX	Enabled	Equal	Better
Alsa mixer	Disabled	Slower	Equal
Alsa mixer	Enabled	Equal	Better

Perhaps counter-intuitively, the main reason for doing this is higher performance. Until now we are storing samples in `f32` but casting them to `f64` every time we manipulate them (normalisation, volume control), then cast them back again. The idea is that we can save CPU cycles and RAM copies by just keeping them in `f64`. Added benefits are: - Support for F64 audio format - Even lower quantization error (bragging rights) To re-iterate, 64 bit sample processing is already in, this just two places of casting back and forth while introducing one new one (reading `lewton` samples). So compared to the current situation, this should result in the following cases: Volume control | Normalisation | Performance | Sound quality -------------- | ------------- | ----------- | ------------- Softvol | Disabled | Equal | Better Softvol | Enabled | Faster | Better Softvol @ MAX | Disabled | Slower | Equal Softvol @ MAX | Enabled | Equal | Better Alsa mixer | Disabled | Slower | Equal Alsa mixer | Enabled | Equal | Better This is a work in progress, currently in is support for Alsa, Rodio, pipe and subprocess. The other backends are on the todo list.

roderickvd · 2021-05-27T22:22:40Z

Contrary to what I had thought, cursory testing on Alsa has shown no significant change in CPU or RAM usage. In which case we "might as well".

JasonLG1979 · 2021-05-29T03:16:27Z

I am by no means an expert but in my mind, like you've said it makes sense to just to keep everything in 64 bit once it's converted even if it's not a tangible performance difference it's more straightforward.

As a side-note, the Gstreamer backend may still "just work" since it just uses parse_launch?

JasonLG1979 · 2021-05-29T04:20:34Z

One nitpick I have is the magic numbers in convert.rs The "factors". They would be much less magical if they were consts.

It could be something like (pseudo code, but you get the idea):

const i32_factor = 2147483648.
const i24_factor = 8388608.
const i16_factor = 32768.

roderickvd · 2021-05-29T08:05:35Z

As a side-note, the Gstreamer backend may still "just work" since it just uses parse_launch?

Yes but due to a different mechanic. The default GStreamer sets ! audioconvert and that's where the magic happens. parse_launch is really just to parse and launch that pipeline.

Also note that the pipeline sets format={} which is aligned with the samples that librespot puts out, so there won't be a format conversion in GStreamer anyway (at least not on that account).

One nitpick I have is the magic numbers in convert.rs The "factors". They would be much less magical if they were consts.

Agreed that's why this is already in this PR 😉 (look at the code)

JasonLG1979 · 2021-05-29T08:21:59Z

Agreed that's why this is already in this PR (look at the code)

Where? In this PR you use for example:

            .map(|sample| self.scale(*sample, 2147483648.) as i32)

Just because it's not hex doesn't make it less magical,lol!!!

roderickvd · 2021-05-29T08:37:20Z

Oh I see you're asking to make them named constants instead of just dropping them as parameters. Sure could do that.

JasonLG1979 · 2021-05-29T08:39:13Z

Oh I see you're asking to make them named constants instead of just dropping them as parameters. Sure could do that.

I'm not a rust person but to me it at least kinda makes it more clear what they are. I just hate naked numbers in code in general also,lol!!!

JasonLG1979 · 2021-05-29T09:01:58Z

Yes but due to a different mechanic. The default GStreamer sets ! audioconvert and that's where the magic happens. parse_launch is really just to parse and launch that pipeline.

Also note that the pipeline sets format={} which is aligned with the samples that librespot puts out, so there won't be a format conversion in GStreamer anyway (at least not on that account).

Gstreamer supports 64 bit float, so the question really becomes why bother to do any conversion? Why not just always give Gstreamer 64 bit float and let it worry about converting it? Having a static config would make things much easier I'd imagine?

roderickvd · 2021-05-29T10:51:55Z

I guess you could, however:

I don’t fancy magic black-box approaches because they are not guaranteed to work and somehow always produce more issue reports than just pulling the strings yourself.
It would mean that GStreamer does dithering downstream, and it would be a burden to map librespot dithering to control what GStreamer does. One of the big review points of the dithering PR was too not cause too much coupling in the back ends. We’ve now nicely abstracted that away.
It’s not very idiomatic in the way librespot tackles the other backends.

So yes we could but let’s not change a winning team.

roderickvd · 2021-05-29T19:18:32Z

I am by no means an expert but in my mind, like you've said it makes sense to just to keep everything in 64 bit once it's converted even if it's not a tangible performance difference it's more straightforward.

Could you verify on your RPi Zero?

Indeed if it doesn't really matter CPU-wise and no-one raises any other concerns, I plan on merging this.

JasonLG1979 · 2021-05-29T19:49:12Z

I guess you could, however:

I don’t fancy magic black-box approaches because they are not guaranteed to work and somehow always produce more issue reports than just pulling the strings yourself.

It would mean that GStreamer does dithering downstream, and it would be a burden to map librespot dithering to control what GStreamer does. One of the big review points of the dithering PR was too not cause too much coupling in the back ends. We’ve now nicely abstracted that away.

It’s not very idiomatic in the way librespot tackles the other backends.

So yes we could but let’s not change a winning team.

I guess for that matter Gstreamer also supports volume control and ReplayGain. It was just a thought.

Could you verify on your RPi Zero?

Yes of course. I test just about all of the audio related PR's on my zero 😉

Indeed if it doesn't really matter CPU-wise and no-one raises any other concerns, I plan on merging this.

In your 1st comment you mention that only doing the conversions at the edges in theory should use less CPU cycles and then in a follow up comment you said it really didn't make much of any difference. What I mean is that it makes more sense and is more straightforward to have "one format to rule them all" as far as internally and convert only at the edges even if there is no performance benefit. Regardless of anything else it's easier for a future dev to know that internally audio is always 64 bit float.

roderickvd · 2021-05-29T19:55:30Z

Could you verify on your RPi Zero?

Yes of course. I test just about all of the audio related PR's on my zero wink

Great, thanks! Buying a Zero myself wouldn't be too much trouble but at the same time, this is great peer review, so very much appreciated.

Indeed if it doesn't really matter CPU-wise and no-one raises any other concerns, I plan on merging this.

In your 1st comment you mention that only doing the conversions at the edges in theory should use less CPU cycles and then in a follow up comment you said it really didn't make much of any difference. What I mean is that it makes more sense and is more straightforward to have "one format to rule them all" as far as internally and convert only at the edges even if there is no performance benefit. Regardless of anything else it's easier for a future dev to know that internally audio is always 64 bit float.

To double-check. You mean you agree with the approach in these commits? Internally it's now f32 to f64 from lewton to librespot AudioPacket, then stays there in f64 all through volume control and normalisation, then is converted down to the chosen audio format for the backend. So this is like you said: converting at the edges, same format internally everywhere.

JasonLG1979 · 2021-05-29T20:02:53Z

To double-check. You mean you agree with the approach in these commits? Internally it's now f32 to f64 from lewton to librespot AudioPacket, then stays there in f64 all through volume control and normalisation, then is converted down to the chosen audio format for the backend. So this is like you said: converting at the edges, same format internally everywhere.

Yes I agree that only converting twice is obviously more efficient technically and as I said it's easier to understand if the audio is 64 bit pretty much all the time internally even if it doesn't make for a tangible performance boost.

JasonLG1979 · 2021-05-29T20:30:55Z

Great, thanks! Buying a Zero myself wouldn't be too much trouble but at the same time, this is great peer review, so very much appreciated.

I'm not so sure if I would refer to myself as a "peer" as I clearly don't know rust,lol!!! Testing things and asking disruptive and often stupid questions are my way of contributing,lol!!! Contribution is the currency in an open source project. Pure consumers are of very little, to no use in open source. Basically if you don't contribute you can't complain,lol!!!

JasonLG1979 · 2021-05-29T20:56:58Z

Testing on my Pi Zero shows no noticeable difference performance or resource usage wise.

JasonLG1979 · 2021-05-29T22:09:23Z

@roderickvd as soon as you merge this I'll start my work on the wiki. It'll be nice from a "bragging rights" stand point to say that all internal audio processing is done in "64 bit Double-precision floating-point format to push requantization noise well below the physical noise floor of even the most state of the art DACs in production today."... Hows that for marketing speak,lol!!!

roderickvd · 2021-05-29T22:16:54Z

🦾 heck yes. Roon and JRiver do it all in 64-bit and now we're here with them.
I'll leave this up here for one more day for others to comment.

JasonLG1979 · 2021-05-29T22:30:04Z

heck yes. Roon and JRiver do it all in 64-bit and now we're here with them.
I'll leave this up here for one more day for others to comment.

I think pretty much every modern media player/framework does stuff in 64 bit. Just not all of them brag about it,lol!!! The audio enthusiast community does love their superlatives and grand statements though. I promise to keep away from snake oil salesman territory though. I will keep things factual but there's no reason facts can't sound poetic,lol!!!

JasonLG1979 · 2021-05-30T21:21:10Z

@roderickvd again sorry to bug you, but to go from f64 to f32 it looks like you're casting a f64 as a f32 what does the conversion look like in that situation? Does it round, truncate?

roderickvd · 2021-05-30T21:46:06Z

It drops 32 bits and what happens then is determined by the way floating point are encoded. It's not as easy to say if that's rounding or not, that's thinking in an integer paradigm when this is not.

You can play around with https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=8fdaea3e5d7d51fda1c922f286c5fb5a and see what happens.

Remember that samples values are normalized between -1.0..=1.0.

This PR is closed now so as much as I enjoy the discussion, let's do that on Gitter.

roderickvd added enhancement audio labels May 27, 2021

roderickvd self-assigned this May 27, 2021

Add f32 and f64 sample sizes

d65dfd3

roderickvd added 2 commits May 29, 2021 20:34

Move scale factors into named constants

4c7cc64

Handle 64-bit samples on other backends

66dbd8a

Update changelog

7039cf1

roderickvd marked this pull request as ready for review May 29, 2021 19:47

Merge branch 'dev' into keep-samples-in-64-bit

fa21aa7

Merge branch 'dev' into keep-samples-in-64-bit

11f8c70

roderickvd mentioned this pull request May 30, 2021

Improve sample rounding and clean up noise shaping leftovers #771

Merged

roderickvd merged commit fe2d5ca into librespot-org:dev May 30, 2021

roderickvd deleted the keep-samples-in-64-bit branch May 30, 2021 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep samples in 64 bit #773

Keep samples in 64 bit #773

roderickvd commented May 27, 2021 •

edited

Loading

roderickvd commented May 27, 2021

JasonLG1979 commented May 29, 2021 •

edited

Loading

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021 •

edited

Loading

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021 •

edited

Loading

JasonLG1979 commented May 30, 2021

roderickvd commented May 30, 2021

Keep samples in 64 bit #773

Keep samples in 64 bit #773

Conversation

roderickvd commented May 27, 2021 • edited Loading

roderickvd commented May 27, 2021

JasonLG1979 commented May 29, 2021 • edited Loading

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021 • edited Loading

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

JasonLG1979 commented May 29, 2021

roderickvd commented May 29, 2021

JasonLG1979 commented May 29, 2021 • edited Loading

JasonLG1979 commented May 30, 2021

roderickvd commented May 30, 2021

roderickvd commented May 27, 2021 •

edited

Loading

JasonLG1979 commented May 29, 2021 •

edited

Loading

JasonLG1979 commented May 29, 2021 •

edited

Loading

JasonLG1979 commented May 29, 2021 •

edited

Loading