-
-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize project file decoding #365
Conversation
I should have a chance to test this in the next few days, our pbxproj file is 34 mbs so it would be great to see an improvement. I'll have to locally work this into an XcodeGen build though |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @CognitiveDisson. This is awesome ❤️ and very appreciated. Thanks for taking your time to bring those improvements to the codebase.
I left some minor comments but overall looks good. It's been a while without working with the codebase so I needed to get some context beforehand to understand all the pieces that you touched.
It'd be great @keith if you could give us some numbers from that project. |
Looks like this saves ~4 seconds or ~10% on the largest version of our project 🎉 (I didn't test a ton) |
@keith Strange, I expected a better result. Can you provide more information about the project file? What objects are there most? @pepibumur I corrected most of the comments. Can I merge? I implemented using a custom decoder to work with a dictionary. This should take away the extra work. According to the results received a slight but still improved. (from 3 seconds to 2) |
I can check in more detail but we have ~700 targets and like 15k files |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some last comments and we can merge afterwards.
dab2f3d
Awesome job @CognitiveDisson 🎉 . I'm merging it. |
Hi.
I was going to use your framework to implement a remote cache. But I ran into a blocking problem - it takes my project file about 12 seconds to read from the disk and about 30 seconds to write back. The project is really big - about 13 megabytes, but even for this size such performance is extremely bad.
I tried to find a issue about this, but did not find it. After that, I looked at the source code and found a lot of performance issues. Some of them I tried to fix with minimal changes. As a result, the project opening time was reduced from 12 to 3 seconds. There are still a few problems left in the code, but the main problems have been fixed. In general, I would recommend to refuse Codable and use dictionaries and Marshal - benchmarks.
Problem research
I used Xcode Instruments and my utilitarian class to look for problems.
I made a demo project to check how long it takes to serialize a project file using a simple PropertyListSerialization.
It turned out that it takes less than a second (very quickly). I understood that the problem is in decoding and serialization.
Steps to optimize
Concurrent decoding
First of all, I added concurrent decoding for objects in PBXObjects and implemented synchronization mechanisms for it.
Before:
After:
I chose the standard method
enumerateKeysAndObjects
, but I also tried to manually select the number of threads. There was no significant difference. This gave about 4 seconds. ( 8 seconds instead of 12 ). Instruments showed excellent work parallelization.Blind decoding order
I also added very simple optimizations based on the order of parsing - since there are usually more
PBXBuildFile
entries in the project thanPBXTarget
ones, so we should start our blind decoding from them, thus reducing the number of checks. Similar with switches by type. This gave another 1-2 seconds. (6 seconds instead of 12)Eliminating unnecessary serialization
After that I found a time consuming decoding here:
I realized that in this case you can convert this project file to serialize into a dictionary and just take a value from it. To make this more accurate, I added a new property to the
ProjectDecodingContext
and read from it directly in theKeyedDecodingContainer
extension. Currenly this optimization takes effect only duringPBXProj
decoding. For other objects there will be no key provided, but this extra check does not affect their decoding times. ( In the future, it is possible to improve this for nested objects. )Result
Finally, I got the cherished 3 seconds. Ideally, I would like about a 1-2 seconds, but this requires global changes (like replacing
Codable
withMarshal
).All tests pass (checked many times), and the project parsing is also correct (checked it in my demo project).
I also plan to speed up the project saving performance in the next PR, and I already know what to do.