Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ARM: Fix signed/unsigned simd mismatch in vbool4::load #3722

Merged
merged 1 commit into from
Dec 23, 2022

Conversation

lgritz
Copy link
Collaborator

@lgritz lgritz commented Dec 22, 2022

Fixes #3721

Please read the comments in #3721. This is the "local" fix for the build break due to the type mismatch. It is a band-aid. And it may be the best solution for the 2.4 branch if we don't want to potentially break ABIs by changing the definition of any public types.

Still pending is to examine the issue of whether it was a mistake to define vbool4 storage for neon as uint32x4_t or if we should change it to int32x4_t to better match the non-simd reference implementation. After debating that (and identifying somebody with access to an ARM-based machine to test the solution for us), we may return to tackle this more fundamental change.

Fixes 3721

Please read the comments in 3721. This is the "local" fix for the
build break due to the type mismatch. It is a band-aid. And it may be
the best solution for the 2.4 branch if we don't want to potentially
break ABIs by changing the definition of any public types.

Still pending is to examine the issue of whether it was a mistake to
define vbool4 storage for neon as uint32x4_t or if we should change it
to int32x4_t to better match the non-simd reference implementation.
After debating that (and identifying somebody with access to an
ARM-based machine to test the solution for us), we may return to
tackle this more fundamental change.
@hobbes1069
Copy link
Contributor

Looks like that got it:
https://koji.fedoraproject.org/koji/taskinfo?taskID=95618167

@lgritz lgritz merged commit 39120c7 into AcademySoftwareFoundation:dev-2.4 Dec 23, 2022
lgritz added a commit to lgritz/OpenImageIO that referenced this pull request Dec 23, 2022
…reFoundation#3722)

Fixes AcademySoftwareFoundation#3721

Please read the comments in AcademySoftwareFoundation#3721. This is the "local" fix for the
build break due to the type mismatch. It is a band-aid. And it may be
the best solution for the 2.4 branch if we don't want to potentially
break ABIs by changing the definition of any public types.

Still pending is to examine the issue of whether it was a mistake to
define vbool4 storage for neon as uint32x4_t or if we should change it
to int32x4_t to better match the non-simd reference implementation.
After debating that (and identifying somebody with access to an
ARM-based machine to test the solution for us), we may return to
tackle this more fundamental change.
@lgritz lgritz deleted the lg-simd branch December 23, 2022 01:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants