cudev: fix 1D error introduced in PR 3378 #3418

cudawarped · 2023-01-10T16:18:44Z

This fixes issue #3412 returning incorrect results for single GpuMat's with a single row, which was a result of an error in #3378.

Unfortunatley because GpuMat's with a single row or column are allocated with cudaMalloc and not cudaMallocPitch and 2D texture objects require pitched memory this PR introduces extra staging memory, when the row/col dimension is 1. This is a quick fix for the issue which only introduces extra overhead for single dimension GpuMat's who's use should not be that common.

A "better" fix would be to rework the TexturePtr class so that

__device__ __forceinline__ R operator ()(index_type y, index_type x) const;

staticly dispatches a call to tex1Dfetch when either x or y is 1, but this will be a more significant change.

Dependant on opencv/opencv#23126

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

force_builders=Custom
buildworker:Custom=linux-1
docker_image:Custom=ubuntu-cuda:16.04

cudawarped marked this pull request as draft January 11, 2023 07:14

cudawarped force-pushed the fix_issue_3412 branch from 3b55b52 to 7a21c73 Compare January 11, 2023 12:54

cudawarped marked this pull request as ready for review January 11, 2023 14:52

cudev: fix 1D error introduced in PR 3378

abd3ca8

cudawarped force-pushed the fix_issue_3412 branch from 7a21c73 to abd3ca8 Compare January 12, 2023 19:16

alalek approved these changes Jan 13, 2023

View reviewed changes

opencv-pushbot merged commit 325e6ab into opencv:4.x Jan 13, 2023

cudawarped mentioned this pull request Jan 13, 2023

CUDA Remap gives incorrect result or crashes for 1 row/column src Mats #3412

Closed

alalek mentioned this pull request Jan 18, 2023

(5.x) Merge 4.x #3425

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

cudev: fix 1D error introduced in PR 3378 #3418

cudev: fix 1D error introduced in PR 3378 #3418

Uh oh!

cudawarped commented Jan 10, 2023 •

edited by alalek

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cudev: fix 1D error introduced in PR 3378 #3418

cudev: fix 1D error introduced in PR 3378 #3418

Uh oh!

Conversation

cudawarped commented Jan 10, 2023 • edited by alalek Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cudawarped commented Jan 10, 2023 •

edited by alalek

Loading