5.x merge 4.x #24981

asmorkalov · 2024-02-08T12:55:23Z

OpenCV Contrib: opencv/opencv_contrib#3634
OpenCV Extra: opencv/opencv_extra#1147

#24548 from dkurt:qrcode_struct_append_decode
#24768 from Haosonn:pre-pr-2
#24779 from MaximSmolskiy:fix-bug-in-ChessBoardDetector-findQuadNeighbor
#24832 from AryanNanda17:Aryan#22177
#24845 from TolyaTalamanov:at/concurrent-executor
#24892 from opencv-pushbot:gitee/alalek/dnn_avoid_16s_usage
#24898 from Abdurrahheem:ash/yolo_ducumentation
#24910 from alexlyulkov:al/android-tests
#24913 from usyntest:optical-flow-sample-raft
#24918 from opencv-pushbot:gitee/alalek/core_convertfp16_replacement
#24919 from asmorkalov:as/python_Rect2f_Point3i
#24925 from fengyuentau:loongarch_handle_warnings
#24929 from asmorkalov:as/imdecode_user_buffer
#24931 from mshabunin:fix-rvv07-mul
#24934 from GengGode:fix
#24936 from mshabunin:fix-rvv07-scale64f
#24942 from asmorkalov:as/android_warning_fix
#24945 from asmorkalov:as/android_sample_warning_fix
#24947 from asmorkalov:as/android_test_with_phone
#24949 from hoodmane:emscripten-enable-file-system
#24956 from asmorkalov:as/android_build_offline
#24968 from fengyuentau:fix_nary_ocl
#24969 from asmorkalov:as/android_offline
#24973 from asmorkalov:as/fix_weigths_proto_mess

Previous "Merge 4.x": #24912

force_builders=Linux OpenCL,Win64 OpenCL

Resolved issue number opencv#22177

…_convertfp16_replacement core(OpenCL): optimize convertTo() with CV_16F (convertFp16() replacement) opencv#24918 relates opencv#24909 relates opencv#24917 relates opencv#24892 Performance changes: - [x] 12700K (1 thread) + Intel iGPU |Name of Test|noOCL|convertFp16|convertTo BASE|convertTo PATCH| |---|:-:|:-:|:-:|:-:| |ConvertFP16FP32MatMat::OCL_Core|3.130|3.152|3.127|3.136| |ConvertFP16FP32MatUMat::OCL_Core|3.030|3.996|3.007|2.671| |ConvertFP16FP32UMatMat::OCL_Core|3.010|3.101|3.056|2.854| |ConvertFP16FP32UMatUMat::OCL_Core|3.016|3.298|2.072|2.061| |ConvertFP32FP16MatMat::OCL_Core|2.697|2.652|2.723|2.721| |ConvertFP32FP16MatUMat::OCL_Core|2.752|4.268|2.662|2.947| |ConvertFP32FP16UMatMat::OCL_Core|2.706|2.601|2.603|2.528| |ConvertFP32FP16UMatUMat::OCL_Core|2.704|3.215|1.999|1.988| Patched version is not worse than convertFp16 and convertTo baseline (except MatUMat 32->16, baseline uses CPU code+dst buffer map). There are still gaps against noOpenCL(CPU only) mode due to T-API implementation issues (unnecessary synchronization). - [x] 12700K + AMD dGPU |Name of Test|noOCL|convertFp16 dGPU|convertTo BASE dGPU|convertTo PATCH dGPU| |---|:-:|:-:|:-:|:-:| |ConvertFP16FP32MatMat::OCL_Core|3.130|3.133|3.172|3.087| |ConvertFP16FP32MatUMat::OCL_Core|3.030|1.713|9.559|1.729| |ConvertFP16FP32UMatMat::OCL_Core|3.010|6.515|6.309|4.452| |ConvertFP16FP32UMatUMat::OCL_Core|3.016|0.242|23.597|0.170| |ConvertFP32FP16MatMat::OCL_Core|2.697|2.641|2.713|2.689| |ConvertFP32FP16MatUMat::OCL_Core|2.752|4.076|6.483|4.191| |ConvertFP32FP16UMatMat::OCL_Core|2.706|9.042|16.481|1.834| |ConvertFP32FP16UMatUMat::OCL_Core|2.704|0.229|15.730|0.176| convertTo-baseline can't compile OpenCL kernel for FP16 properly - FIXED. dGPU has much more power, so results are x16-17 better than single cpu core. Patched version is not worse than convertFp16 and convertTo baseline. There are still gaps against noOpenCL(CPU only) mode due to T-API implementation issues (unnecessary synchronization) and required memory transfers. Co-authored-by: Alexander Alekhin <[email protected]>

…nings Handle warnings in loongson-related code opencv#24925 See https://github.com/fengyuentau/opencv/actions/runs/7665377694/job/20891162958#step:14:16 Warnings needs to be handled before we add the loongson server to our CI. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

…avoid_16s_usage DNN: avoid CV_16S usage for FP16 opencv#24892 **Merge after**: opencv#24918 TODO: - [x] measure performance changes - [x] optimize convertTo for OpenCL: opencv#24918 12700K iGPU: |Name of Test|0|1|1 vs 0 (x-factor)| |---|:-:|:-:|:-:| |AlexNet::DNNTestNetwork::OCV/OCL_FP16|7.441|7.480|0.99| |CRNN::DNNTestNetwork::OCV/OCL_FP16|10.776|10.736|1.00| |DenseNet_121::DNNTestNetwork::OCV/OCL_FP16|52.762|52.833|1.00| |EAST_text_detection::DNNTestNetwork::OCV/OCL_FP16|60.694|60.721|1.00| |EfficientNet::DNNTestNetwork::OCV/OCL_FP16|33.373|33.173|1.01| |FastNeuralStyle_eccv16::DNNTestNetwork::OCV/OCL_FP16|81.840|81.724|1.00| |GoogLeNet::DNNTestNetwork::OCV/OCL_FP16|20.965|20.927|1.00| |Inception_5h::DNNTestNetwork::OCV/OCL_FP16|22.204|22.173|1.00| |Inception_v2_SSD_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|47.115|47.460|0.99| |MPHand::DNNTestNetwork::OCV/OCL_FP16|6.760|6.670|1.01| |MPPalm::DNNTestNetwork::OCV/OCL_FP16|10.188|10.171|1.00| |MPPose::DNNTestNetwork::OCV/OCL_FP16|12.510|12.561|1.00| |MobileNet_SSD_Caffe::DNNTestNetwork::OCV/OCL_FP16|17.290|17.072|1.01| |MobileNet_SSD_v1_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|19.473|19.306|1.01| |MobileNet_SSD_v2_TensorFlow::DNNTestNetwork::OCV/OCL_FP16|22.874|23.404|0.98| |OpenFace::DNNTestNetwork::OCV/OCL_FP16|9.568|9.517|1.01| |OpenPose_pose_mpi_faster_4_stages::DNNTestNetwork::OCV/OCL_FP16|539.899|539.845|1.00| |PPHumanSeg::DNNTestNetwork::OCV/OCL_FP16|18.015|18.769|0.96| |PPOCRv3::DNNTestNetwork::OCV/OCL_FP16|63.122|63.540|0.99| |ResNet_50::DNNTestNetwork::OCV/OCL_FP16|34.947|34.925|1.00| |SFace::DNNTestNetwork::OCV/OCL_FP16|10.249|10.206|1.00| |SSD::DNNTestNetwork::OCV/OCL_FP16|213.068|213.108|1.00| |SqueezeNet_v1_1::DNNTestNetwork::OCV/OCL_FP16|4.867|4.878|1.00| |VIT_B_32::DNNTestNetwork::OCV/OCL_FP16|200.563|190.788|1.05| |VitTrack::DNNTestNetwork::OCV/OCL_FP16|7.528|7.173|1.05| |YOLOX::DNNTestNetwork::OCV/OCL_FP16|132.858|132.701|1.00| |YOLOv3::DNNTestNetwork::OCV/OCL_FP16|209.559|208.809|1.00| |YOLOv4::DNNTestNetwork::OCV/OCL_FP16|221.357|220.924|1.00| |YOLOv4_tiny::DNNTestNetwork::OCV/OCL_FP16|24.446|24.382|1.00| |YOLOv5::DNNTestNetwork::OCV/OCL_FP16|43.922|44.080|1.00| |YOLOv8::DNNTestNetwork::OCV/OCL_FP16|64.159|63.842|1.00| |YuNet::DNNTestNetwork::OCV/OCL_FP16|10.177|10.231|0.99| |opencv_face_detector::DNNTestNetwork::OCV/OCL_FP16|15.121|15.445|0.98| Co-authored-by: Alexander Alekhin <[email protected]>

RISC-V: fix mul 8/16 bit for RVV 0.7

RISC-V: fix scale64f performance for RVV 0.7

Do not release user-provided buffer, if image decoder failed

Add python bindings for Rect2f and Point3i

Raft support added in this sample code opencv#24913 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake fix: opencv#24424 Update DNN Optical Flow sample with RAFT model I implemented both RAFT and FlowNet v2 leaving it to the user which one he wants to use to estimate the optical flow. Co-authored-by: Uday Sharma <[email protected]>

Vulkan backend for NaryEltwiseLayer in DNN module opencv#24768 We improve Vulkan backend for ``NaryEltwiseLayer`` in DNN module by: - add a basic framework for Vulkan backend in ``NaryEltwiseLayer`` - add a compute shader for binary forwarding (an imitation of what has been done in native OpenCV backend including broadcasting and eltwise-operation) - typo fixed: - Wrong info output in ``context.cpp`` Currently, our implementation (or all layers supporting Vulkan backend) runs pretty slow on discrete GPUs basically due to IO cost in function ``copyToHost``, and we are going to fix that by - find out the best ``VkMemoryProperty`` for various discrete GPUs - prevent ``copyToHost`` in middle layers during forwarding, (i.e keep data in GPU memory) ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake Co-authored-by: IskXCr <[email protected]>

…cutor G-API: Implement concurrent executor opencv#24845 ## Overview This PR introduces the new G-API executor called `GThreadedExecutor` which can be selected when the `GComputation` is compiled in `serial` mode (a.k.a `GComputation::compile(...)`) ### ThreadPool `cv::gapi::own::ThreadPool` has been introduced in order to abstract usage of threads in `GThreadedExecutor`. `ThreadPool` is implemented by using `own::concurrent_bounded_queue` `ThreadPool` has only as single method `schedule` that will push task into the queue for the further execution. The **important** notice is that if `Task` executed in `ThreadPool` throws exception - this is `UB`. ### GThreadedExecutor The `GThreadedExecutor` is mostly copy-paste of `GExecutor`, should we extend `GExecutor` instead? #### Implementation details 1. Build the dependency graph for `Island` nodes. 2. Store the tasks that don't have dependencies into separate `vector` in order to run them first. 3. at the `GThreadedExecutor::run()` schedule the tasks that don't have dependencies that will schedule their dependents and wait for the completion. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [ ] I agree to contribute to the project under Apache 2 License. - [ ] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [ ] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

Documentation for Yolo usage in Opencv opencv#24898 This PR introduces documentation for the usage of yolo detection model family in open CV. This is not to be merge before opencv#24691, as the sample will need to be changed. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

Build warning fix for Charuco tests

Modified Java tests to run on Android opencv#24910 To run the tests you need to: 1. Build OpenCV using Android pipeline. For example: `cmake -DBUILD_TEST=ON -DANDROID=ON -DANDROID_ABI=arm64-v8a -DCMAKE_TOOLCHAIN_FILE=/usr/lib/android-sdk/ndk/25.1.8937393/build/cmake/android.toolchain.cmake -DANDROID_NDK=/usr/lib/android-sdk/ndk/25.1.8937393 -DANDROID_SDK=/usr/lib/android-sdk ../opencv` `make` 2. Connect Android Phone 3. Run tests: `cd android_tests` `./gradlew tests_module:connectedAndroidTest` Related CI pipeline: opencv/ci-gha-workflow#138 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

…phone Added job to test with real hardware

…ning_fix Build warning fix in Tutorial4-OpenCL.

QR codes Structured Append decoding mode opencv#24548 ### Pull Request Readiness Checklist resolves opencv#23245 Merge after opencv#24299 Current proposal is to use `detectAndDecodeMulti` or `decodeMulti` for structured append mode decoding. 0-th QR code in a sequence gets a full message while the rest of codes will correspond to empty strings. See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

Add CMake policy CMP0071 for AUTOMOC and AUTOUIC

…system Enable file system on Emscripten

Added offline option for Android builds opencv#24956 ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

…ardDetector-findQuadNeighbor Fix bug in ChessBoardDetector::findQuadNeighbors opencv#24779 ### Pull Request Readiness Checklist `corners` and `neighbors` indices means not filling order, but relative position. So, for example if `quad->count = 2`, it doesn't mean that `quad->neighbors[0]` and `quad->neighbors[1]` are filled. And we should should iterate over all four `neighbors`. See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [ ] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

…mess Fix proto and weights mess in dnn performance tests

Allow multiple flags with OPENCV_GRADLE_VERBOSE_OPTIONS opencv#24969 ### Pull Request Readiness Checklist Merge with opencv/ci-gha-workflow#144 See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [ ] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [ ] The feature is well documented and sample code can be built with the project CMake

asmorkalov · 2024-02-08T13:01:40Z

@zihaomu @fengyuentau Could you take a look on Vulkan NaryEltwiseLayer part. I I'm not sure, if merged all things correctly.

fengyuentau

I dont see problems in the naryeltwise vulkan backend part. Also tests are passing. So it should be alright.

AryanNanda17 and others added 30 commits January 9, 2024 01:23

Resolved issue number opencv#22177

9b402cf

Add python bindings for Rect2f and Point3i

fefc7e3

Test for Rect2f in Python.

cb92974

Merge pull request opencv#24832 from AryanNanda17:Aryan#22177

ae21368

Resolved issue number opencv#22177

Do not release user-provided buffer, if decoder failed.

c9671da

RISC-V: fix mul 8/16 bit for RVV 0.7

2ea2483

Add CMake policy CMP0071 for AUTOMOC and AUTOUIC

a97e66e

RISC-V: fix scale64f for RVV 0.7

65784dd

Merge pull request opencv#24931 from mshabunin:fix-rvv07-mul

8ed0319

RISC-V: fix mul 8/16 bit for RVV 0.7

Merge pull request opencv#24936 from mshabunin:fix-rvv07-scale64f

54b7caf

RISC-V: fix scale64f performance for RVV 0.7

Merge pull request opencv#24929 from asmorkalov:as/imdecode_user_buffer

8ea939f

Do not release user-provided buffer, if image decoder failed

Merge pull request opencv#24919 from asmorkalov:as/python_Rect2f_Point3i

73acf08

Add python bindings for Rect2f and Point3i

Build warning fix for Charuco tests.

ba8915c

Merge pull request opencv#24942 from asmorkalov:as/android_warning_fix

e48b96b

Build warning fix for Charuco tests

Build warning fix in Tutorial4-OpenCL.

145981c

Added job to test with real hardware.

1b4c1ff

Merge pull request opencv#24947 from asmorkalov:as/android_test_with_…

0c6ff36

…phone Added job to test with real hardware

Merge pull request opencv#24945 from asmorkalov:as/android_sample_war…

ea94f7e

…ning_fix Build warning fix in Tutorial4-OpenCL.

Enable file system on Emscripten

422d519

Merge pull request opencv#24934 from GengGode:fix

c7021f0

Add CMake policy CMP0071 for AUTOMOC and AUTOUIC

Merge pull request opencv#24949 from hoodmane:emscripten-enable-file-…

250cfe8

…system Enable file system on Emscripten

asmorkalov and others added 7 commits February 5, 2024 11:57

fix incorrect steps and elemsize when dtype changes

fcaa8ce

Merge pull request opencv#24968 from fengyuentau:fix_nary_ocl

5abb065

Fix proto and weights mess in dnn performance tests.

77af137

Merge pull request opencv#24973 from asmorkalov:as/fix_weigths_proto_…

4b35b2f

…mess Fix proto and weights mess in dnn performance tests

asmorkalov mentioned this pull request Feb 8, 2024

Force OPENCV_TEST_REQUIRE_DATA in CI environments in 5.x opencv/ci-gha-workflow#146

Merged

asmorkalov mentioned this pull request Feb 8, 2024

Use Gradle offline mode in 5.x opencv/ci-gha-workflow#147

Merged

fengyuentau approved these changes Feb 8, 2024

View reviewed changes

asmorkalov requested a review from opencv-alalek February 8, 2024 17:22

opencv-alalek approved these changes Feb 8, 2024

View reviewed changes

opencv-alalek mentioned this pull request Feb 11, 2024

Extended several core functions to support new types #24962

Merged

6 tasks

This was referenced Feb 12, 2024

5.x merge 4.x opencv/opencv_contrib#3634

Merged

5.x merge 4.x opencv/opencv_extra#1147

Merged

Merge branch 4.x

3a55f50

asmorkalov force-pushed the 5.x-merge-4.x branch from cb6507b to 3a55f50 Compare February 12, 2024 11:23

asmorkalov merged commit 3a55f50 into opencv:5.x Feb 12, 2024

asmorkalov mentioned this pull request Feb 16, 2024

(5.x) Merge 4.x #25041

Merged

dkurt added this to the 5.0 milestone Apr 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

5.x merge 4.x #24981

5.x merge 4.x #24981

Uh oh!

asmorkalov commented Feb 8, 2024 •

edited

Loading

Uh oh!

asmorkalov commented Feb 8, 2024

Uh oh!

fengyuentau left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

16 participants

Uh oh!

5.x merge 4.x #24981

5.x merge 4.x #24981

Uh oh!

Conversation

asmorkalov commented Feb 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asmorkalov commented Feb 8, 2024

Uh oh!

fengyuentau left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

16 participants

asmorkalov commented Feb 8, 2024 •

edited

Loading