CA1838 Avoid 'StringBuilder' parameters for P/Invokes #7186

elachlan · 2021-12-30T03:36:07Z

Relates to #7174
https://docs.microsoft.com/en-us/dotnet/fundamentals/code-analysis/quality-rules/ca1838

elachlan · 2022-01-08T05:26:17Z

There are 21 violations of this rule. The fixes seem complex and I don't think I can sort them out.

Helpful resources:
dotnet/runtime#47735
https://docs.microsoft.com/en-us/dotnet/fundamentals/code-analysis/quality-rules/ca1838

Forgind · 2022-01-10T23:42:47Z

There are 21 violations of this rule. The fixes seem complex and I don't think I can sort them out.

Helpful resources: dotnet/runtime#47735 https://docs.microsoft.com/en-us/dotnet/fundamentals/code-analysis/quality-rules/ca1838

Do you want me to mark this up-for-grabs? I'm not sure if someone else will have a chance to sort through it, but I wouldn't have high expectations from maintainers.

elachlan · 2022-01-10T23:56:19Z

As far as I can tell instead of using stringbuilder you are supposed to use a char buffer when making calls via P/Invoke?

I will give it a go and someone can review it and tell me if I am doing it wrong.

… a character buffer instead.

src/Tasks/LockCheck.cs

src/Tasks/NativeMethods.cs

elachlan · 2022-01-12T23:04:17Z

@Forgind in my manic googling to understand how this all works I stumbled into these:

They might be useful in replacing the msbuild maintained pinvokes.

ladipro

A bunch of issues here caused the semantic difference between StringBuilder and char[] marshaling. StringBuilder automatically sets its Length on the way out by scanning the unmanaged buffer for \0. With char array you haver to do it manually.

src/Tasks/NativeMethods.cs

src/Tasks/LockCheck.cs

src/Tasks/ComReference.cs

src/Tasks/AssemblyDependency/AssemblyInformation.cs

src/Tasks/AssemblyDependency/GlobalAssemblyCache.cs

src/Framework/NativeMethods.cs

src/Tasks/AssemblyDependency/GlobalAssemblyCache.cs

src/Tasks/LockCheck.cs

src/Tasks/NativeMethods.cs

src/Tasks/AssemblyDependency/AssemblyInformation.cs

src/Tasks/ComReference.cs

stephentoub · 2022-02-01T01:35:12Z

I tested it with Span and the P/Invoke threw an exception.

You can't pass a span directly to a DllImport method (you'll be able to with the new GeneratedDllImport support coming in .NET 7 that builds out the marshaling stubs at compile time). But you can pass either a ref or a pointer. So, for example, if the DllImport signature is:

internal static extern int GetLongPathName(string path, ref char fullpath, int length);

you can pass ref MemoryMarshal.GetReference(span) as that fullPath argument.

…ing back to char array instead of stackalloc Remove Explicit zero initialization for RmStartSession

src/Framework/NativeMethods.cs

src/Tasks/AssemblyDependency/AssemblyInformation.cs

src/Tasks/ComReference.cs

ladipro · 2022-02-02T13:27:29Z

[curious] @stephentoub do you think the runtime can at some point introduce zero-copy marshaling of output string buffers? Something like:

extern static int GetStringAndYesIKnowTheExactLength([MarshalAs(UnmanagedType.CreateAndPinNewString, LengthIsPassedInParameterNumber=2)] out string s, int length);

Where the stub would create a new string object of size length, pin it, and pass the pointer to unmanaged code. Technically it mutates an existing string object but the object is freshly created so if you squint you could say this is just a fancy string constructor.

stephentoub · 2022-02-02T14:05:35Z

Do you have an example Win32 API that looks like that, where the exact length of the output is known in advance? Typically it's the API that tells the caller how much it wrote to the caller-supplied buffer. (And I say Win32 because on Unix the lingua-franca is UTF8 which couldn't write directly into the string buffer anyway.)

That said, you can already do that if you really want to. Just make the call inside of a string.Create callback and hand the span for the string buffer off to the native call (either pinning it and passing a pointer or using the new source gen marshaling support for spans).

ladipro · 2022-02-02T20:38:59Z

This very PR has three occurrences of the pattern:

Call an API with null buffer to get the length (kernel32!GetShortPathName, kernel32!GetLongPathName, fusion!GetCachePath) or call it with a reasonably sized buffer and hope it will fit (mscoree!GetFileVersion).
Allocate a buffer of the actual returned length (only if the reasonably sized buffer was not enough in case of GetFileVersion).
Call the API again passing the allocated buffer to obtain the string.

The allocation and copying in steps 2 and 3 could be avoided.

I didn't know about string.Create, that's an awesome API!

stephentoub · 2022-02-02T20:50:17Z

Call the API again passing the allocated buffer to obtain the string.

I don't see how the runtime could depend on that, though. This pattern involves trusting that the API will succeed in filling the whole space because you gave it what it previously told you was required.

ladipro · 2022-02-04T08:28:04Z

The second call still takes the buffer size so it doesn't overrun, and it returns the actual size so the caller has to make sure that it filled the whole space and the string length is correct. I see how convoluted it is and runtime support is probably not a good idea. Especially now that I know string.Create exists.

ladipro

Thank you!

src/Tasks/ComReference.cs

Forgind

I haven't read all the conversation, but I will come back to this. There's a lot to learn here!

Forgind · 2022-02-04T16:11:09Z

src/Tasks/AssemblyDependency/AssemblyInformation.cs

+                    if (hresult == NativeMethodsShared.ERROR_INSUFFICIENT_BUFFER)
+                    {
+                        // Allocate new buffer based on the returned length.
+                        char* runtimeVersion2 = stackalloc char[dwLength];


Is there an unstackalloc? And maybe check what dwLength is?

If dwLength is big, it would be good not to overrun the stack. I imagine we'd have a little more space if we can un-allocate the first stack before allocating the second.

No Its scoped the the current method.

stack allocated memory block created during the method execution is automatically discarded when that method returns. You cannot explicitly free the memory allocated with stackalloc.

https://docs.microsoft.com/en-us/dotnet/csharp/language-reference/operators/stackalloc

Forgind · 2022-02-04T16:12:05Z

src/Tasks/AssemblyDependency/AssemblyInformation.cs

-                do
-                {
-                    runtimeVersion = new StringBuilder(bufferLength);
-                    hresult = NativeMethods.GetFileVersion(path, runtimeVersion, bufferLength, out _);


That said, I'm not a fan of this code, so I'm glad it's gone 😁

Forgind · 2022-02-04T16:17:12Z

src/Tasks/AssemblyDependency/GlobalAssemblyCache.cs

+        /// <summary>
+        /// Lazy loaded cached root path of the GAC.
+        /// </summary>
+        private static readonly Lazy<string> _gacPath = new(() => GetGacPath());


Is it worth making this lazy vs. just leaving it as a static call? It looks like it's only used once per ResolveComReference call, which seems like not very much to me.

My thought process was to hold a cached static value for it so we only have to call once per global run. I am unsure if that is how it works in practice.

Forgind · 2022-02-04T17:29:30Z

src/Tasks/ComReference.cs


            // Try increased buffer sizes if on longpath-enabled Windows
-            for (int bufferSize = NativeMethodsShared.MAX_PATH; !success && bufferSize <= NativeMethodsShared.MaxPath; bufferSize *= 2)
+            for (int bufferSize = NativeMethodsShared.MAX_PATH; bufferSize <= NativeMethodsShared.MaxPath; bufferSize *= 2)


I don't think it's relevant for this PR, but MaxPath can be as large as int.MaxValue; since that isn't exactly a power of 2, doesn't that mean it could theoretically (if we keep getting ERROR_INSUFFICIENT_BUFFER or pathLength is 0) reach the top, overflow, and throw an exception?

There are 23 doublings going from MAX_PATH (260) to int.MaxValue. So its not a trivial risk for long path enabled windows. Interestingly NTFS has a 65,535 character limit and The Windows API has many functions that also have Unicode versions to permit an extended-length path for a maximum total path length of 32,767 characters.

So I think we would run into file system/WinAPI limitations before hitting overflows.

I think this would work.
for (int bufferSize = NativeMethodsShared.MAX_PATH; bufferSize <= NativeMethodsShared.MaxPath && bufferSize <= int.MaxValue/2; bufferSize *= 2)

@Forgind let me know if you think the additional check is helpful or not.

I think that would work. I'd be mildly in favor of adding it, but I don't care too much. It hasn't been an important case up to this point, so I doubt it'll be an important case in the future.

It isn't really the point of this PR, but a VerifyThrow at the end of the loop might be a nicer solution? It's almost certainly a bug if we get close to int.MaxValue, and it would be good to make the bug as visible as possible.

Forgind

Looks good!

Forgind · 2022-02-15T17:57:49Z

Thanks @elachlan!

CA1838 Avoid 'StringBuilder' parameters for P/Invokes

994b696

Forgind approved these changes Dec 30, 2021

View reviewed changes

elachlan mentioned this pull request Jan 1, 2022

Change CodeAnalysis rules from 'Info' to 'Warning' after fixing all instances of the violations #7174

Open

sharwell approved these changes Jan 3, 2022

View reviewed changes

elachlan added 3 commits January 8, 2022 09:56

revert ruleset change

4b519a9

rebase

3a88ed9

Enable warning on CA1838

638775f

elachlan marked this pull request as draft January 8, 2022 00:09

elachlan added 2 commits January 12, 2022 07:25

Merge branch 'main' into CA1838

751fea3

CA1838 Avoid 'StringBuilder' parameters for P/Invokes. Consider using…

661c2c8

… a character buffer instead.

elachlan commented Jan 11, 2022

View reviewed changes

src/Tasks/LockCheck.cs Outdated Show resolved Hide resolved

elachlan commented Jan 11, 2022

View reviewed changes

src/Tasks/NativeMethods.cs Outdated Show resolved Hide resolved

elachlan commented Jan 11, 2022

View reviewed changes

src/Tasks/NativeMethods.cs Outdated Show resolved Hide resolved

trying again

4f0424c

ladipro suggested changes Jan 27, 2022

View reviewed changes

Changes from review

e042e66

elachlan commented Jan 27, 2022

View reviewed changes

src/Framework/NativeMethods.cs Outdated Show resolved Hide resolved

elachlan commented Jan 27, 2022

View reviewed changes

src/Framework/NativeMethods.cs Outdated Show resolved Hide resolved

elachlan added 2 commits January 28, 2022 09:05

GetGacPath changes

4ddf661

Making sure we are form strings properly

8f0c31c

elachlan marked this pull request as ready for review January 28, 2022 02:56

danmoseley reviewed Jan 28, 2022

View reviewed changes

changes from review

04e4a1b

elachlan commented Jan 28, 2022

View reviewed changes

src/Tasks/AssemblyDependency/AssemblyInformation.cs Outdated Show resolved Hide resolved

Remove unused field

903f35d

elachlan commented Jan 28, 2022

View reviewed changes

src/Tasks/ComReference.cs Show resolved Hide resolved

elachlan added 3 commits February 2, 2022 09:44

Fix possible StackOverflow in GetShortPathName/GetLongPathName by mov…

54878ac

…ing back to char array instead of stackalloc Remove Explicit zero initialization for RmStartSession

Merge branch 'CA1838' of github.com:elachlan/msbuild into CA1838

567cc08

remove unneeded unsafe from pinvoke definitions

8af24e5

elachlan commented Feb 1, 2022

View reviewed changes

src/Framework/NativeMethods.cs Show resolved Hide resolved

ladipro approved these changes Feb 2, 2022

View reviewed changes

src/Tasks/AssemblyDependency/AssemblyInformation.cs Outdated Show resolved Hide resolved

src/Tasks/AssemblyDependency/AssemblyInformation.cs Outdated Show resolved Hide resolved

src/Tasks/ComReference.cs Outdated Show resolved Hide resolved

Added cached GAC Path and changes from review

031af51

elachlan requested a review from ladipro February 2, 2022 21:57

Add documentation to Pinvoke

b4f6bf7

ladipro approved these changes Feb 4, 2022

View reviewed changes

src/Tasks/ComReference.cs Outdated Show resolved Hide resolved

ladipro requested a review from Forgind February 4, 2022 09:18

Changes from review

b5f60db

Forgind approved these changes Feb 4, 2022

View reviewed changes

Forgind added the merge-when-branch-open PRs that are approved, except that there is a problem that means we are not merging stuff right now. label Feb 8, 2022

elachlan added 2 commits February 9, 2022 07:48

Change from review

53d3da4

Add VerifyThrow instead of loop condition

0a97bfd

elachlan requested a review from Forgind February 8, 2022 21:56

Forgind approved these changes Feb 8, 2022

View reviewed changes

Forgind merged commit b8d493a into dotnet:main Feb 15, 2022

elachlan deleted the CA1838 branch February 16, 2022 01:11

elachlan mentioned this pull request Jun 22, 2022

Removed CA1838 warning suppressions on p/invokes for StringBuilders. dotnet/winforms#4751

Closed

elachlan mentioned this pull request Sep 25, 2022

Convert Shlwapi interop code to CSWin32 dotnet/winforms#7839

Merged

elachlan mentioned this pull request Nov 4, 2022

CA1838 Avoid 'StringBuilder' parameters for P/Invokes dotnet/winforms#8113

Merged

ladipro mentioned this pull request Feb 13, 2023

Quick post-mortem of recently implemented RAR optimizations #8432

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CA1838 Avoid 'StringBuilder' parameters for P/Invokes #7186

CA1838 Avoid 'StringBuilder' parameters for P/Invokes #7186

elachlan commented Dec 30, 2021 •

edited

Loading

elachlan commented Jan 8, 2022

Forgind commented Jan 10, 2022

elachlan commented Jan 10, 2022

elachlan commented Jan 12, 2022

ladipro left a comment

stephentoub commented Feb 1, 2022

ladipro commented Feb 2, 2022

stephentoub commented Feb 2, 2022 •

edited

Loading

ladipro commented Feb 2, 2022 •

edited

Loading

stephentoub commented Feb 2, 2022

ladipro commented Feb 4, 2022

ladipro left a comment

Forgind left a comment

Forgind Feb 4, 2022

elachlan Feb 4, 2022

Forgind Feb 4, 2022

Forgind Feb 4, 2022

elachlan Feb 4, 2022

Forgind Feb 4, 2022

elachlan Feb 4, 2022 •

edited

Loading

elachlan Feb 7, 2022

Forgind Feb 7, 2022

elachlan Feb 8, 2022

Forgind left a comment

Forgind commented Feb 15, 2022

CA1838 Avoid 'StringBuilder' parameters for P/Invokes #7186

CA1838 Avoid 'StringBuilder' parameters for P/Invokes #7186

Conversation

elachlan commented Dec 30, 2021 • edited Loading

elachlan commented Jan 8, 2022

Forgind commented Jan 10, 2022

elachlan commented Jan 10, 2022

elachlan commented Jan 12, 2022

ladipro left a comment

Choose a reason for hiding this comment

stephentoub commented Feb 1, 2022

ladipro commented Feb 2, 2022

stephentoub commented Feb 2, 2022 • edited Loading

ladipro commented Feb 2, 2022 • edited Loading

stephentoub commented Feb 2, 2022

ladipro commented Feb 4, 2022

ladipro left a comment

Choose a reason for hiding this comment

Forgind left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elachlan Feb 4, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Forgind left a comment

Choose a reason for hiding this comment

Forgind commented Feb 15, 2022

elachlan commented Dec 30, 2021 •

edited

Loading

stephentoub commented Feb 2, 2022 •

edited

Loading

ladipro commented Feb 2, 2022 •

edited

Loading

elachlan Feb 4, 2022 •

edited

Loading