Add a reference to MessagePack implementation #14

AArnott · 2024-10-24T00:26:13Z

This pull request includes updates to the README.md file, enhancements to the SpanDictionary class, and improvements to the equality members of the BaseClass and DerivedClass in the test types.

Documentation Updates:

README.md: Added a new section listing known libraries based on TypeShape, including Nerdbank.MessagePack.

Code Enhancements:

src/TypeShape.Examples/Utilities/SpanDictionary.cs: Added a new overload for the ToSpanDictionary method to support mapping with a value selector.

Test Improvements:

tests/TypeShape.Tests/TestTypes.cs: Implemented IEquatable<T> for BaseClass and DerivedClass, and added appropriate Equals and GetHashCode methods.

It's very rough and limited. And it assumes eiriktsarpalis#12 is addressed.

AArnott · 2024-10-24T01:18:29Z

Incidentally, upgrading the baseline from MessagePack v2 to v3 produces this:

Serialization

Method	Mean	Error	StdDev	Ratio	Allocated	Alloc Ratio
Serialize_TypeShape	130.9 ns	0.37 ns	0.33 ns	0.95	-	NA
Serialize_Library	137.6 ns	0.23 ns	0.21 ns	1.00	-	NA

This improves serialization slightly, making TypeShape fare slightly worse by comparison.

Deserialization

Method	Mean	Error	StdDev	Ratio	Allocated	Alloc Ratio
Deserialize_TypeShape	148.1 ns	0.63 ns	0.56 ns	1.21	32 B	1.00
Deserialize_Library	122.2 ns	0.74 ns	0.61 ns	1.00	32 B	1.00

Deserialization improves significantly, making TypeShape look quite a bit worse. The gap comes from the TypeShape sample using a dictionary to look up property names, which the AOT formatter avoids.

src/TypeShape.Examples/MsgPackSerializer/MsgPackSerializer.cs

AArnott · 2024-10-24T12:45:13Z

Serialization is somehow faster than the Ref.Emit specialized code produced by the MessagePack library. I don't know how that's possible.

I suspect it's because in this version, we call MessagePackWriter.Write(string) directly rather than including a level of indirection that allows for some other serializer to hijack that operation at runtime to serialize something else (e.g. so that a string is only serialized once).

eiriktsarpalis · 2024-10-24T19:38:08Z

src/TypeShape.Examples/MsgPackSerializer/MsgPackSerializer.cs

+    private delegate T? FormatDeserialize<T>(ref MessagePackReader reader, MessagePackSerializerOptions options);
+    private delegate void FormatSerialize<T>(ref MessagePackWriter writer, T? value, MessagePackSerializerOptions options);
+
+    private sealed class Formatter<T>(FormatDeserialize<T> deserialize, FormatSerialize<T> serialize) : IMessagePackFormatter<T?>


I can see why you went for a delegate adapter, but it does result in two layers of indirection for every call, which adds up as the object graph is being traversed.

Fixed.
Can you review all the places where formatters.GetOrAdd appears in my code and tell me if I'm doing it right? I don't understand why this pattern is required, so I'm not sure...

I was suggesting that you could make Formatter<T> an abstract class and have individual types and shapes inherit from that. The downside is that you need to factor those into individual classes as opposed to inlining lambda expressions, but on the other hand, splitting your code into a folder of Formatters and then keeping a small visitor that just folds them together should make things more manageable.

So far, I'm not sure what I'd do with more than the two implementations I have.
Maybe I'll need more as I get to supporting non-default constructors, etc. though.
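
To illustrate the abstract-class suggestion above, here is a minimal sketch. It does not use the MessagePack types: `Formatter<T>`, `Int32Formatter`, `ListFormatter<T>`, and `FormatterDemo` are hypothetical stand-ins, and a `StringBuilder` plays the role of the writer. The point is only that a composed formatter reaches its element formatter through a single virtual call instead of an interface call followed by a delegate invocation.

```csharp
#nullable enable
using System.Collections.Generic;
using System.Text;

// Hypothetical base type for illustration; the real sample would implement
// IMessagePackFormatter<T> rather than write to a StringBuilder.
public abstract class Formatter<T>
{
    public abstract void Serialize(StringBuilder writer, T value);
}

// One small class per primitive or shape, instead of an inlined lambda.
public sealed class Int32Formatter : Formatter<int>
{
    public override void Serialize(StringBuilder writer, int value) => writer.Append(value);
}

public sealed class ListFormatter<T> : Formatter<List<T>>
{
    private readonly Formatter<T> _element;

    public ListFormatter(Formatter<T> element) => _element = element;

    public override void Serialize(StringBuilder writer, List<T> value)
    {
        writer.Append('[');
        for (int i = 0; i < value.Count; i++)
        {
            if (i > 0) writer.Append(',');
            // A single virtual call per element; no delegate hop in between.
            _element.Serialize(writer, value[i]);
        }
        writer.Append(']');
    }
}

public static class FormatterDemo
{
    public static string Run()
    {
        var formatter = new ListFormatter<int>(new Int32Formatter());
        var writer = new StringBuilder();
        formatter.Serialize(writer, new List<int> { 1, 2, 3 });
        return writer.ToString();
    }
}
```

In this layout the visitor only has to pick the right formatter class for each shape and pass the child formatters into its constructor.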

eiriktsarpalis · 2024-10-24T21:56:21Z

src/TypeShape.Examples/MsgPackSerializer/MsgPackSerializer.cs

        };

        internal IMessagePackFormatter<T?> GetFormatter<T>(ITypeShape<T> typeShape)
        {
-            return formatters.GetOrAdd<IMessagePackFormatter<T?>>(typeShape, this, box => new Formatter<T?>(box.Result.Deserialize, box.Result.Serialize));
+            return formatters.GetOrAdd<IMessagePackFormatter<T?>>(typeShape, this, box => box.Result switch


The delayed value factory is a late binding mechanism used when a recursive type is encountered. The idea is that it's creating a wrapper for box.Result which will eventually contain a reference to the final computed value once building has completed.

For this to work, evaluating box.Result needs to be delayed until an actual serialization operation is run. Here's how it's done for CBOR:

https://github.com/eiriktsarpalis/typeshape-csharp/blob/c8c2c397beb472e716b447bb460dea626811ba18/src/TypeShape.Examples/CborSerializer/Converters/DelayedCborConverter.cs#L6-L13

And here's how the object cloner does it:

https://github.com/eiriktsarpalis/typeshape-csharp/blob/ba848263d45d05975fa7cc3a0e601b1bcdca608b/src/TypeShape.Examples/Cloner/Cloner.cs#L62
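
The late-binding idea described above can be sketched roughly as follows. This is not the code behind either link: `Converter<T>`, `ResultBox<T>`, `DelayedConverter<T>`, `Node`, `NodeConverter`, and `DelayedDemo` are all hypothetical names, and strings stand in for a real wire format. What the sketch tries to show is that the wrapper stores the box and reads `Result` only inside `Write`, so constructing it while the recursive converter is still being built is safe.

```csharp
#nullable enable
using System;

// Hypothetical converter contract; not the MessagePack or TypeShape API.
public abstract class Converter<T>
{
    public abstract string Write(T value);
}

// Hypothetical box whose Result is assigned once building has completed.
public sealed class ResultBox<T> where T : class
{
    private T? _result;

    public T Result => _result ?? throw new InvalidOperationException("Result is not available yet.");

    public void Complete(T result) => _result = result;
}

// The delayed wrapper: it keeps the box and dereferences it per call,
// never in its constructor.
public sealed class DelayedConverter<T> : Converter<T>
{
    private readonly ResultBox<Converter<T>> _box;

    public DelayedConverter(ResultBox<Converter<T>> box) => _box = box;

    public override string Write(T value) => _box.Result.Write(value);
}

// A recursive type: a node that may point at another node.
public sealed class Node
{
    public int Value { get; set; }
    public Node? Next { get; set; }
}

public sealed class NodeConverter : Converter<Node>
{
    private readonly Converter<Node> _next;

    public NodeConverter(Converter<Node> next) => _next = next;

    public override string Write(Node value) =>
        value.Next is null
            ? $"[{value.Value}]"
            : $"[{value.Value},{_next.Write(value.Next)}]";
}

public static class DelayedDemo
{
    public static Converter<Node> Build()
    {
        var box = new ResultBox<Converter<Node>>();
        // The recursive reference receives the wrapper because the real
        // converter does not exist yet at this point.
        var converter = new NodeConverter(new DelayedConverter<Node>(box));
        box.Complete(converter);
        return converter;
    }
}
```

If the factory read `box.Result` eagerly instead, it would observe the box before `Complete` had run, which is the situation the delayed wrapper is meant to avoid.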

AArnott · 2024-10-24T23:59:35Z

@neuecc this should interest you.

…till deserialization

eiriktsarpalis · 2024-11-01T16:23:19Z

tests/TypeShape.Tests/TestTypes.cs

@@ -592,15 +592,27 @@ public StructWithDefaultCtor()
 }

 [GenerateShape]
-public partial class BaseClass
+public partial class BaseClass : IEquatable<BaseClass>


Would it make sense to factor the IEquatable implementation out into a separate set of types so that we have coverage for both cases?

I added this because I had been testing round-tripping. I'm not sure what case we're not covering when this interface isn't present.
But I did just discover IsEquatable on test cases, which was probably false for this type until now. That would explain why no one else hit the fact that this class didn't know how to do a by-value comparison of itself.
I can remove this change if you'd like. I don't need it any more.
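
For readers unfamiliar with the pattern under discussion, by-value equality across a base and derived class might look like the sketch below. The members of the real BaseClass and DerivedClass are not shown in this thread, so `SampleBase`, `SampleDerived`, `X`, and `Y` are hypothetical. The runtime type check keeps the comparison symmetric between the two types.

```csharp
#nullable enable
using System;

// Hypothetical types for illustration only.
public class SampleBase : IEquatable<SampleBase>
{
    public int X { get; set; }

    // Virtual so that a derived instance also compares its own members.
    public virtual bool Equals(SampleBase? other) =>
        other is not null && other.GetType() == GetType() && X == other.X;

    public override bool Equals(object? obj) => Equals(obj as SampleBase);

    public override int GetHashCode() => X.GetHashCode();
}

public class SampleDerived : SampleBase, IEquatable<SampleDerived>
{
    public int Y { get; set; }

    // base.Equals rejects null and mismatched types before Y is compared.
    public bool Equals(SampleDerived? other) => base.Equals(other) && Y == other!.Y;

    public override bool Equals(SampleBase? other) => Equals(other as SampleDerived);

    public override int GetHashCode() => HashCode.Combine(base.GetHashCode(), Y);
}
```

With members like these, a round-trip test can compare the deserialized instance to the original with `Equals` rather than by reference.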

Add MsgPackSerializer sample

711251b


AArnott mentioned this pull request Oct 24, 2024

NativeAOT assertion for v3 MessagePack-CSharp/MessagePack-CSharp#1997

Closed

Add msgpack benchmarks

3a1badc

AArnott added 2 commits October 23, 2024 22:54

Perf improvements

fae9fba

More perf improvements

ebdbdc2

eiriktsarpalis reviewed Oct 24, 2024


src/TypeShape.Examples/MsgPackSerializer/MsgPackSerializer.cs