Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minor performance fix for NEON RAID-Z. #9725

Merged
merged 1 commit into from
Dec 18, 2019
Merged

Conversation

rdolbeau
Copy link
Contributor

The NEON code replicates too closely the SSE code, including
a masked 16-bits shift. But NEON, like AltiVec (#9539), has
unsigned 8-bits shift, so use that instead and drop the masking.

Signed-off-by: Romain Dolbeau romain.dolbeau@european-processor-initiative.eu

Motivation and Context

Performance.

Description

See commit message - removes two 'and' by using the proper shift.

How Has This Been Tested?

raidz_test

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • [ X] Performance enhancement (non-breaking change which improves efficiency)
  • Code cleanup (non-breaking change which makes code smaller or more readable)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (a change to man pages or other documentation)

Checklist:

  • [X ] My code follows the ZFS on Linux code style requirements.
  • [ N/A] I have updated the documentation accordingly.
  • [ X] I have read the contributing document.
  • I have added tests to cover my changes.
  • [ X] All new and existing tests passed.
  • [ X] All commit messages are properly formatted and contain Signed-off-by.

The NEON code replicates too closely the SSE code, including
a masked 16-bits shift. But NEON, like AltiVec (openzfs#9539), has
unsigned 8-bits shift, so use that instead and drop the masking.

Signed-off-by: Romain Dolbeau <romain.dolbeau@european-processor-initiative.eu>
@codecov
Copy link

codecov bot commented Dec 15, 2019

Codecov Report

Merging #9725 into master will decrease coverage by <1%.
The diff coverage is n/a.

Impacted file tree graph

@@           Coverage Diff            @@
##           master    #9725    +/-   ##
========================================
- Coverage      79%      79%   -<1%     
========================================
  Files         420      420            
  Lines      123654   123654            
========================================
- Hits        98069    98041    -28     
- Misses      25585    25613    +28
Flag Coverage Δ
#kernel 80% <ø> (ø) ⬆️
#user 67% <ø> (ø) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ddb4e69...fc898b1. Read the comment docs.

@behlendorf behlendorf added the Status: Code Review Needed Ready for review and testing label Dec 16, 2019
@behlendorf
Copy link
Contributor

This looks right to me after consulting the ARM reference manual.

@rdolbeau
Copy link
Contributor Author

... and sorry about that, this inefficiency should not have made it in the original submission for NEON :-(

@behlendorf behlendorf added Status: Accepted Ready to integrate (reviewed, tested) and removed Status: Code Review Needed Ready for review and testing labels Dec 18, 2019
@behlendorf behlendorf merged commit 118fc3e into openzfs:master Dec 18, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Status: Accepted Ready to integrate (reviewed, tested)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants