Updating some readonly static data in JpegEncoderCore to take advantage of compiler functionality. #855

tannergooding · 2019-03-21T18:55:43Z

Prerequisites

I have written a descriptive pull-request title
I have verified that there are no overlapping pull-requests open
I have verified that I am following the existing coding patterns and practices as demonstrated in the repository. These follow strict Stylecop rules 👮.
I have provided test coverage for my change (where applicable)

Description

This is a partial fix of #854 and shows how to take advantage of the underlying compiler functionality.
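For readers unfamiliar with the compiler functionality in question, here is a minimal sketch of the general pattern. It assumes a recent C# compiler; the class name, member names, and byte values are hypothetical and are not taken from JpegEncoderCore. When a property returns `ReadOnlySpan<byte>` over a `new byte[]` of constants, the compiler can emit the bytes into the assembly's static data and wrap them directly, rather than allocating and filling an array at type initialization.

```csharp
using System;

// Hypothetical illustration only; names and values are not from the PR.
internal static class StaticDataSketch
{
    // Before: a byte[] is allocated on the heap and filled when the
    // type is initialized.
    private static readonly byte[] TableAsArray = { 0x00, 0x01, 0x02, 0x03 };

    // After: for constant byte data returned as ReadOnlySpan<byte>, the
    // compiler can point the span at data embedded in the assembly,
    // so no array is allocated and no copy is made.
    private static ReadOnlySpan<byte> TableAsSpan => new byte[] { 0x00, 0x01, 0x02, 0x03 };

    // Sums the span-backed table to show it is read like any other span.
    public static int SumSpan()
    {
        int sum = 0;
        foreach (byte b in TableAsSpan)
        {
            sum += b;
        }

        return sum;
    }

    // Sums the array-backed table for comparison.
    public static int SumArray()
    {
        int sum = 0;
        foreach (byte b in TableAsArray)
        {
            sum += b;
        }

        return sum;
    }
}
```

Both forms expose the same data to callers; the difference is only in how the data is materialized.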

CLAassistant · 2019-03-21T18:55:50Z

All committers have signed the CLA.

src/ImageSharp/Common/Extensions/StreamExtensions.cs

tannergooding · 2019-03-21T20:21:43Z

BenchmarkDotNet=v0.11.3, OS=Windows 10.0.17763.379 (1809/October2018Update/Redstone5)
Intel Core i7-7700 CPU 3.60GHz (Kaby Lake), 1 CPU, 8 logical and 4 physical cores
.NET Core SDK=3.0.100-preview4-010901
  [Host]     : .NET Core 2.1.9 (CoreCLR 4.6.27414.06, CoreFX 4.6.27415.01), 64bit RyuJIT
  DefaultJob : .NET Core 2.1.9 (CoreCLR 4.6.27414.06, CoreFX 4.6.27415.01), 64bit RyuJIT

Before:

| Method | TestImage | Mean | Error | StdDev | Median | Ratio | RatioSD |
|---|---|---|---|---|---|---|---|
| 'System.Drawing Jpeg' | Bmp/Car.bmp | 4.817 ms | 0.1838 ms | 0.5419 ms | 4.985 ms | 1.00 | 0.00 |
| 'ImageSharp Jpeg' | Bmp/Car.bmp | 7.079 ms | 0.4893 ms | 1.4428 ms | 6.105 ms | 1.51 | 0.44 |

After:

| Method | TestImage | Mean | Error | StdDev | Median | Ratio | RatioSD |
|---|---|---|---|---|---|---|---|
| 'System.Drawing Jpeg' | Bmp/Car.bmp | 4.857 ms | 0.1933 ms | 0.5698 ms | 5.036 ms | 1.00 | 0.00 |
| 'ImageSharp Jpeg' | Bmp/Car.bmp | 5.716 ms | 0.0762 ms | 0.0595 ms | 5.716 ms | 1.29 | 0.15 |

codecov · 2019-03-21T20:29:04Z

Codecov Report

Merging #855 into master will not change coverage.
The diff coverage is 100%.

@@           Coverage Diff           @@
##           master     #855   +/-   ##
=======================================
  Coverage   88.89%   88.89%           
=======================================
  Files        1014     1014           
  Lines       44295    44295           
  Branches     3208     3209    +1     
=======================================
  Hits        39376    39376           
  Misses       4198     4198           
  Partials      721      721

| Impacted Files | Coverage Δ | |
|---|---|---|
| ...c/ImageSharp/Common/Extensions/StreamExtensions.cs | `88.88% <ø> (ø)` | ⬆️ |
| src/ImageSharp/Formats/Jpeg/JpegEncoderCore.cs | `94.71% <100%> (ø)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 47e8f2c...1e2c32d.

antonfirsov · 2019-03-21T20:41:48Z

src/ImageSharp/Common/Extensions/StreamExtensions.cs

+        // This is a port of the CoreFX implementation and is MIT Licensed: https://github.com/dotnet/coreclr/blob/c4dca1072d15bdda64c754ad1ea474b1580fa554/src/System.Private.CoreLib/shared/System/IO/Stream.cs#L770
+        public static void Write(this Stream stream, ReadOnlySpan<byte> buffer)
+        {
+            byte[] sharedBuffer = ArrayPool<byte>.Shared.Rent(buffer.Length);


We usually prefer using our own memory management primitives like MemoryAllocator.AllocateManagedByteBuffer() instead of ArrayPool.Shared.

This is matching a signature and implementation available in .NET Core. ArrayPool<byte>.Shared should generally be allocation-less for something like this (not necessarily for other T) since most of the framework is fairly dependent on it.

I'll run benchmarks on net472 as well though, in order to see what the overhead here is, if any, since it isn't applicable to netcoreapp2.1 (where the above benchmark was run).

Our "allocator" is typically also an allocation-free pool, but we definitely have an overhead because of nested virtual calls and other infrastructure stuff.
Everything depends on the size of the array. It's not uncommon to have very large (> 1 MB) buffers in image processing, which works better with our allocator/pool in my experience.

However, for those buffers we should avoid invoking this extension method anyways. @JimBobSquarePants we can probably make an exception here, but we need to make sure the purpose is documented, or at least we remember it 😄

Definitely. I don't recall whether BenchmarkDotNet isolates each iteration (I know it does for separate tests), so the benchmark could be "lying" for full framework by hiding the allocation cost, in which case it might not be representative of real-world scenarios.

If there are concerns about this regressing full framework, there are a few options: take a MemoryAllocator as an optional parameter; not have SosHeaderYCbCr take advantage of this compiler optimization (the single copy on class instantiation isn't terrible, but losing out on permanent pinning is a bit unfortunate); or skip the optimization only on full framework.

IIRC, the ArrayPool<byte>.Shared is optimized for up to 2MB, since we use it for things like the JSON and XML readers.

Let's add a comment explaining why we use that pool (signature matching) and move on.

Fixed. Also rebased onto the current HEAD.

antonfirsov · 2019-03-21T20:45:45Z

src/ImageSharp/Common/Extensions/StreamExtensions.cs

+            byte[] sharedBuffer = ArrayPool<byte>.Shared.Rent(buffer.Length);
+            try
+            {
+                buffer.CopyTo(sharedBuffer);


To me it looks like we are losing all the benefits of the copy-free initialization at this line...

But the benchmarks show different results ... I wonder why? I guess this method is not a hot path.

In the case of netcoreapp, the method is actually virtual and various stream types can override it to avoid the copy. I'll double check the numbers on full framework to see if it hurts anything (and if so, will look at what can be done).
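For context, the rent-copy-write-return shape under discussion in this thread can be sketched roughly as below. This is an approximation pieced together from the two diff hunks quoted above, not the merged code; the class name and the method name `WriteSpan` are hypothetical (chosen so the sketch does not collide with the built-in `Stream.Write(ReadOnlySpan<byte>)` on .NET Core).

```csharp
using System;
using System.Buffers;
using System.IO;

// Hypothetical sketch; not the code merged in this PR.
internal static class StreamExtensionsSketch
{
    // Writes a span to a stream that only exposes Write(byte[], int, int),
    // using a pooled array as the intermediate buffer.
    public static void WriteSpan(this Stream stream, ReadOnlySpan<byte> buffer)
    {
        // Rent may return an array larger than requested.
        byte[] sharedBuffer = ArrayPool<byte>.Shared.Rent(buffer.Length);
        try
        {
            // This is the copy questioned in the review comment above.
            buffer.CopyTo(sharedBuffer);

            // Write only the bytes that were copied, not the whole rented array.
            stream.Write(sharedBuffer, 0, buffer.Length);
        }
        finally
        {
            // Always hand the array back, even if the write throws.
            ArrayPool<byte>.Shared.Return(sharedBuffer);
        }
    }
}
```

On runtimes where `Stream` has a virtual span-based `Write`, derived streams can override it and skip the intermediate copy, which is the point made in the reply above.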

tannergooding · 2019-03-21T21:40:15Z

BenchmarkDotNet=v0.11.3, OS=Windows 10.0.17763.379 (1809/October2018Update/Redstone5)
Intel Core i7-7700 CPU 3.60GHz (Kaby Lake), 1 CPU, 8 logical and 4 physical cores
.NET Core SDK=3.0.100-preview4-010901
  [Host] : .NET Core 2.1.9 (CoreCLR 4.6.27414.06, CoreFX 4.6.27415.01), 64bit RyuJIT
  Clr    : .NET Framework 4.7.2 (CLR 4.0.30319.42000), 64bit RyuJIT-v4.7.3362.0

Before:

| Method | TestImage | Mean | Error | StdDev | Median | Ratio | RatioSD | Gen 0/1k Op | Gen 1/1k Op | Gen 2/1k Op | Allocated Memory/Op |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 'System.Drawing Jpeg' | Bmp/Car.bmp | 4.856 ms | 0.1641 ms | 0.4838 ms | 5.064 ms | 1.00 | 0.00 | 210.9375 | - | - | 873.48 KB |
| 'ImageSharp Jpeg' | Bmp/Car.bmp | 6.041 ms | 0.0559 ms | 0.0467 ms | 6.022 ms | 1.24 | 0.15 | 23.4375 | - | - | 129.19 KB |

After:

| Method | TestImage | Mean | Error | StdDev | Median | Ratio | RatioSD | Gen 0/1k Op | Gen 1/1k Op | Gen 2/1k Op | Allocated Memory/Op |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 'System.Drawing Jpeg' | Bmp/Car.bmp | 4.909 ms | 0.1418 ms | 0.4180 ms | 5.046 ms | 1.00 | 0.00 | 210.9375 | - | - | 873.48 KB |
| 'ImageSharp Jpeg' | Bmp/Car.bmp | 6.071 ms | 0.1181 ms | 0.1406 ms | 5.995 ms | 1.28 | 0.11 | 23.4375 | - | - | 129.19 KB |

JimBobSquarePants

Amazing to see a change like this can yield such performance benefits. I'll need to learn more tricks like this!

JimBobSquarePants · 2019-03-26T01:30:18Z

Merging this in. Thanks @tannergooding

JimBobSquarePants added area:performance formats:jpeg labels Mar 21, 2019

antonfirsov reviewed Mar 21, 2019

Commit 1e2c32d: Updating some readonly static data in JpegEncoderCore to take advantage of compiler functionality.

JimBobSquarePants approved these changes Mar 26, 2019

JimBobSquarePants merged commit 5eb0122 into SixLabors:master Mar 26, 2019

JimBobSquarePants added this to the 1.0.0-rc1 milestone Mar 26, 2019
