-
Notifications
You must be signed in to change notification settings - Fork 184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VZV on FZV #2035
VZV on FZV #2035
Conversation
Assembly code of VZVComponents::get_unchecked: Before:
After:
Needless to say, it's a bit slower. |
I got it to go a little bit faster by special-casing width=1 and width=2. It's still a slowdown unfortunately though.
|
Trying to narrow down the root cause. Some more info:
|
Additional findings:
So, here's the summary of my findings:
|
Possible next step: change the VarZeroVec index array to store the endpoints instead of the startpoints of each range. For example, right now we store |
Assembly code across 76c6f24:
to
and across 36fdb8d to
and across daec8eb
and across bfd450f
and then across c548800
|
No description provided.