-
-
Notifications
You must be signed in to change notification settings - Fork 516
[GSoC] Optimizing the VOC Compiler #772
Comments
@patiences This is a really solid first draft - great work! A couple of suggestions:
Of course, another approach would be to modify VOC to use Java strings in place of a separate string class - which require much less messing around with type conversion, but would mess up the internals a lot. However... I suspect it would also be a huge performance bump. Given that you've found conflicting opinions about whether StackMapFrames are worth the effort, our time might be better served looking to see if modifying VOC internals to use Java primitives rather than classes for Python primitives might be worthwhile. Even a speculative experiment that doesn't end up with production code might be a better use of GSoC time than implementing a bytecode structure that doesn't actually benefit performance. |
Ah, right!
Do you mean that python methods that operate on strings (Dict#getitem, for example?) should take java.lang.String strings instead?
Yeah, unfortunately I've seen multiple times that the performance gain was more of a theoretical advantage and has not been realized. :-( But wouldn't implementing StackMapFrames be a good idea since they are no longer optional in Java 8 -- is that a good enough reason? |
Regarding strings: I was thinking that the potential optimisation is to completely remove any concept of an org.python.types.String, and replace them with java.lang.String (and similar with Integer, Boolean, and so on). Essentially, we'd be making a special case of Python primitive types, making them fall back to Java primitives. The catch is that we'd need to provide all the utility methods that Python provides on it's primitive types (e.g., It wouldn't be a trivial change, though - at the moment, we can assume every object is an org.python.Object; if we start using Java primitives, that distinction would be lost. As for whether SMF's are worth adding... that's a good question. It's possibly a good move to still add them; at the moment, we can only generate Java 6 bytecode, and while Java VMs are completely backwards compatible, and J6 bytecode will continue to run, it means that if Java ever adds anything interesting (and they already have in J7 and J8), we're might start to hit problems if our Java source code (for the Python standard library) starts to use those features. |
Does this mean that users that write Java extensions for their Python code also have to use Java 6 (I don't know how often this happens)? That seems a bit outdated, I personally can't imagine writing java code without J8 lambdas. If this is the case StackMapFrames would probably be a very good idea. On the other hand, I think the special-casing Python primitive types would be more interesting work for me. :-) |
Combining Java6 and Java8 classes in a single executable shouldn't be a problem... but those are always famous last words :-) And, we don't know what is going to be added in Java9 that might cause compatibility problems with Java6. Pinning ourselves to a known-deprecated format isn't a good idea in general. Honestly, either StackMapFrames or trying to make better use of primitives would be a viable project; StackMapFrames is better defined, but more detailed; primitive use is a lot more exploratory. |
@freakboy3742 Thanks for all the insights! :-) I've updated my proposal to address your comment about getting the benchmarking down in the first week. I've also replaced the work on implementing StackMapFrames with the primitives types optimizations we've chatted about in the comments here, as it is more interesting work for me and will likely have a huge performance impact. I've also added an extra week to the variable lookup work period -- I've been thinking about how to update the jump addresses and I'm still not sure so I've factored in some more time for that. Let me know what you think! |
This is looking pretty good - certainly good enough to submit as a final proposal. |
This project was selected for the 2018 GSoC. |
At the end of week 1, here's what's been happening...
|
Week 2 finished! I've been working on:
No blockers, and plenty of work to do coming into week 3! :-) |
End of week 3 update:
Hoping to wrap up these 3 PRs this coming week, and starting work optimizing global loads! |
End of week 4 update:
Thanks @freakboy3742 for spending extra time with me this week, even finding time to pair program! :-) |
End of week 5 update (sorry late!):
|
End of week 6!
Next week I'll look at getting pystone working as well, as it would be super beneficial, and perhaps doing some profiling. On another note, I've written a blog post about my first month working on VOC, check it out if interested :-) |
End of week 7 update:
|
End of week 8 update (late because I was away Jul 5-13):
I've written another blog post about the module optimization from this past month (#839), if anyone wants to know more about that. :-) |
End of week 9:
Next week, I'll continue working through some potential optimizations based on results I've gotten from JProfile, so there's plenty more work to do! |
End of week 10 update:
|
End of week 11 update:
This coming Tuesday is the official cutoff for GSoC, but I will continue working for about a week after. I'm currently working on a blog post for the website, and wrapping up a few more PRs this coming week. |
End of week 12 update:
I'm also currently working on hooking up a few more benchmarks from http://speed.pypy.org/comparison/ (which is where I'm finding all these new bugs!). I will try to wrap these up early this week if I can as I am planning on working as part of GSoC until Wednesday (but will be around to finish up anything that needs a little more time of course). |
This proposal intends to improve the performance of VOC-generated bytecode by adding piecemeal optimizations.
Table of Contents
Motivation
The current implementation of Python AST to Java bytecode transpilation in many cases takes a naive approach, resulting in redundant bytecode instructions being produced and class files being longer than necessary. Not only does this make the code run slower than it should, this causes problems in some cases because the JVM enforces a size limit on class files, in particular on method sizes: each method must be less than 64KB.
This proposal explores optimizations that cut down the number of generated bytecode instructions.
Goals
This is a list of criteria to use as an evaluation of the success of this project.
Overview of Changes
Reducing Instances of Variable Creation
VOC-generated bytecode contains a lot of Object creation and initialization because everything in Python is an
Object
. Being frugal when creating string and integer objects, in particular, will result in huge savings.Primitive Types
The Python
Object
abstraction is preserved in BeeWare's java implementation of the built-in types. However, this means that every time one of these objects is used instructions forObject
creation must be generated. Working directly with the java primitives rather than wrapping them inObject
would give a significant performance boost. The standard python typesint
,float
,complex
,Str
,bool
andNone
are considered here as the first candidates for refactoring.Integers
CPython keeps an array of integer objects for all integers between -5 and 256. When an integer is created in that range a reference to the existing object is returned. It should be possible for VOC to do the same, that is, instead of generating bytecode that creates a new
Integer
object, any integer in [-5, 256] will refer to the preallocated integer.Improving Variable Lookup
When a variable is read, instructions for loading global or local variables are generated. However, if the variable is looked up again, VOC has to generate the same loading instructions again. This results in a lot of duplicated bytecode.
VOC should cache values after the first lookup of global variables, modules, and functions whenever possible.
Considerations
Proposed Timeline
Preparation
Until April 23 (Student Proposal Acceptance)
April 23 - May 14 (Community Bonding Period)
Integer Pre-allocation
Week 1: May 14 (Official Coding Start) - May 20
small_ints
.small_ints
array instead of creating integer objects.Variable Lookup Optimization
Week 2: May 21 - May 27
Week 3: May 28 - June 3
Week 4: June 4 - June 10
Week 5: June 11 (Phase 1 Evaluation due 15th) - June 17
Week 6: June 18 - June 24
Week 7: June 25 - July 1
Primitive Types Optimization
Week 8: July 2 - July 8
Week 9: July 9 (Phase 2 Evaluation due 13th) - July 15
Week 10: July 16 - July 22
Week 11: July 23 - July 29
Week 12: July 30 - August 5
Note: The official coding period is May 14 - August 14, which is a 13 week period. I've accounted for 12 weeks of work here.
Risk Analysis
There are many gaps in my technical knowledge:
I will try to mitigate this by keeping my mentor updated of my progress, updating this timeline as needed, and asking for help early.
The text was updated successfully, but these errors were encountered: