[GSoC] Add more universal intrinsic implementations for RVV. #22353

hanliutong · 2022-08-08T02:33:53Z

This is a patch of my GSoC project that the goal is to make the existing Universal Intrinsic compatible with scalable (variable-length) backends.

In #22179, we have already introduce a new framework of universal intrinsic for RISC-V Vector backend and few implementations and test cases are also added.

In this patch, we are going to add more universal intrinsic implementations for RVV.

Tested with QEMU for RVV backend in various VLEN:

qemu-riscv64 -cpu rv64,x-v=true,vlen=128 ./bin/opencv_test_core --gtest_filter="hal*"
qemu-riscv64 -cpu rv64,x-v=true,vlen=512 ./bin/opencv_test_core --gtest_filter="hal*"

Also tested for AVX and SSE backend on Linux.

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

hanliutong · 2022-08-14T08:13:37Z

Update:

There are some differences between the implementation and the documentation, the following test case is modified to match the documentation description:

v_min v_max for v_uint64 and v_int64 are removed.
test_abs for v_float32 and v_float64 are enabled.

Since we introduce some new intrinsic functions, testcase for v_mul(v0,v1, ..., vn) and v_add(v0,v1, ..., vn) are added (in test_mul() and test_addsub()).

And we may also need to update the documentations: test_mul_expand is enabled for v_unit8 and v_int8, but the doc said it is only for 16- and unsigned 32-bit source types

hanliutong · 2022-08-17T14:47:55Z

Hello @vpisarev, I have added some implementations as discussed in our meeting today (zip, transpose and interleave), there are also 2 sets of functions submitted together (combine and reverse).

And this PR is frozen for review, any new implementations will commit to another new PR.

vpisarev · 2022-08-18T12:21:34Z

looks good to me. @asmorkalov, since it's very unobtrusive patch, I suggest to merge it

asmorkalov · 2022-08-23T09:49:24Z

👍 Tested manually with Qemu!

Add more universal intrinsic implementations for RVV.

f0d29cd

asmorkalov added optimization feature GSoC platform: riscv labels Aug 8, 2022

hanliutong added 4 commits August 12, 2022 01:44

Add testcase for continuous mul and add.

2fb652c

Update implementations on arithmetics.

80c82e1

Remove redundant intrinsics.

e65ad44

add missing test cases(v_abs)

f572ae3

hanliutong added 2 commits August 17, 2022 14:38

Add implementation for zip, transpose, interleave, reverse and combine.

189f647

Add testcases for interleave_p&q and enable others testcases.

8dc3327

Remove the test log in test_interleave_pq.

b9a1039

vpisarev self-requested a review August 18, 2022 12:20

vpisarev approved these changes Aug 18, 2022

View reviewed changes

asmorkalov merged commit d108320 into opencv:4.x Aug 23, 2022

hanliutong mentioned this pull request Aug 26, 2022

[GSoC] Add remaining universal intrinsic implementations for RVV. #22429

Merged

6 tasks

asmorkalov mentioned this pull request Sep 16, 2022

!!! NOT FOR REVIEW !!! [SVE] Example HAL-compatible SVE code for Linear Resize. #20640

Closed

6 tasks

alalek mentioned this pull request Jan 8, 2023

(5.x) Merge 4.x #23113

Merged

asmorkalov added this to the 4.7.0 milestone Jan 23, 2023

asmorkalov mentioned this pull request Oct 31, 2023

About the performance of opencv of the sizeless instruction(Riscv vector, SVE .etc) #21780

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[GSoC] Add more universal intrinsic implementations for RVV. #22353

[GSoC] Add more universal intrinsic implementations for RVV. #22353

Uh oh!

hanliutong commented Aug 8, 2022 •

edited

Loading

Uh oh!

hanliutong commented Aug 14, 2022 •

edited

Loading

Uh oh!

hanliutong commented Aug 17, 2022

Uh oh!

vpisarev commented Aug 18, 2022

Uh oh!

asmorkalov commented Aug 23, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[GSoC] Add more universal intrinsic implementations for RVV. #22353

[GSoC] Add more universal intrinsic implementations for RVV. #22353

Uh oh!

Conversation

hanliutong commented Aug 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

hanliutong commented Aug 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hanliutong commented Aug 17, 2022

Uh oh!

vpisarev commented Aug 18, 2022

Uh oh!

asmorkalov commented Aug 23, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hanliutong commented Aug 8, 2022 •

edited

Loading

hanliutong commented Aug 14, 2022 •

edited

Loading