OCR scan of screenshot #603

hum4nizer · 2021-03-08T13:19:41Z

Is your feature request related to a problem? Please describe.
No. It is not related to a problem.

Describe the solution you'd like
I would like the feature to OCR scan the screenshot when a screenshot is taken to extract text from the picture.

Additional context
Thanks a bunch for a awesome piece of software!

DamirPorobic · 2021-03-10T10:03:08Z

I've been thinking about this feature for quite some time but haven't found any simple solution. We need to figure out what external library can be used for this and hot to interact with it. But definitely would a nice addition to ksnip.

DamirPorobic · 2021-03-10T10:20:50Z

https://github.com/tesseract-ocr/ might be something that could work.

hum4nizer · 2021-03-11T09:33:58Z

Exactly! ShareX uses the tesseract OCR engine. And it works for them.

DamirPorobic · 2021-03-11T09:46:17Z

@fnkabit what do you think about this feature? Would be nice to have something like this in the application but I personally haven't worked with OCR libraries yet. Do you have any experience?

fnkabit · 2021-03-11T10:04:52Z

@DamirPorobic . Sure, would like to work on this.
Except the tesseract integration, how do you see this working ? ie. do we add a tool for making a selection, perform OCR, and then display a dialog with the transcription ?

fnkabit · 2021-03-11T10:05:37Z

@DamirPorobic To answer your question, I don't have any experience working with OCR.

DamirPorobic · 2021-03-11T10:11:23Z

Maybe @hum4nizer can describe how it works in ShareX but for beginners I had in mind to have a button in the file menu that triggers OCR and displays a dialog with all text it has found, something in that direction, later on we can get more fancy.

Regarding the code, I think it would be nice to have a nice separation, hide the OCR stuff behind an adapter and an interface. Also, maybe we should consider that Tesseract might not be always available when building ksnip so cmake should check for it when building and when not found the option in the filemenu should be grayed out, something like that.

This is probably a larger feature, maybe start small and see how it works.

fnkabit · 2021-03-11T10:12:46Z

@DamirPorobic Sounds good.
Did a YT search to see how this works in ShareX: https://www.youtube.com/watch?v=t629fruq1Z0

DamirPorobic · 2021-03-11T10:16:08Z

Not far from what I had in mind. Would be cool to be able to do something like this.
ShareX is open source as far as I know, maybe you can have a look into their code for hints

fnkabit · 2021-03-11T10:18:22Z

@DamirPorobic Thanks ! I will implement this next week.

hum4nizer · 2021-03-16T17:22:32Z

Hi! Im really excited about this new feature. Good luck with the implementation.

fnkabit · 2021-03-29T13:38:42Z

@hum4nizer Thank you !

Sorry guys, didn't have time to start working on this; have been really busy the last two weeks.
Next week would be better, I hope.

DamirPorobic · 2021-09-20T09:04:51Z

I'm hoping to look into this for the next minor release. An issue that I'm still thinking about is what way to go, either implement it in ksnip or call an external tool from ksnip. What worries me with implementing in knsip directly is the size of such OCR software, it seems to be much larger then ksnip.

DamirPorobic · 2021-09-20T09:11:43Z

Maybe a plugin approach could be doable, something like done here https://doc.qt.io/qt-5/qtwidgets-tools-echoplugin-example.html

raphaelh · 2021-09-20T09:40:58Z

There's https://ocr.space/OCRAPI

Each user could register her/his own API key

DamirPorobic · 2021-09-20T09:48:52Z

I had more of a local version in mind, without an API. You trigger OCR and get a dialog window with all the text that you can then copy or whatever.
@raphaelh Have you used this API? In what form do you get the return value?

raphaelh · 2021-09-23T07:45:56Z

I've contributed the API option because you said:

What worries me with implementing in knsip directly is the size of such OCR software, it seems to be much larger then ksnip.

When installing the tesseract package under Ubuntu it takes 16,3 MB (tesseract-ocr, tesseract-ocr-eng, tesseract-ocr-osd). If I add tesseract-ocr-fra (French language), it takes 1 145 KB

ksnip-1.9.1.deb is 710 KB

There's also https://github.com/PaddlePaddle/PaddleOCR, it says on the github page:

Ultra lightweight PP-OCRv2 series models: detection (3.1M) + direction classifier (1.4M) + recognition 8.5M) = 13.0M

I haven't used https://ocr.space/OCRAPI directly. I know about it because I'm using the Copyfish browser extension (https://addons.mozilla.org/fr/firefox/addon/copyfish-ocr-software/) which works well for my needs (copy text from images or PDF files while I'm browsing).

DamirPorobic · 2021-11-27T20:40:00Z

I'm working on the OCR support and I must say that I'm bit surprised by Tesseract's weak performance:

I thought the OCR development had achieved more by now.

DamirPorobic · 2021-11-28T08:12:37Z

Same image triggered via command line looks better

Maybe my API call requires some improvement.

hum4nizer · 2021-11-28T09:45:21Z

It looks way better in the command line test for sure! Good luck with the development. I'm really looking forward for this feature. Thanks!

…but not compatible #603

DamirPorobic · 2022-03-19T09:54:24Z

This is implemented now, I have to write some tests but in general can be tested now. Let me know what you think.

SM-26 · 2022-05-30T06:41:10Z

This is implemented now, I have to write some tests but in general can be tested now. Let me know what you think.

This look really amazing, how can I test it?

When I click the OCR button on Options
a new window opens up, a blank text window
what is the next step?

sm26@sm26-Latitude-3420:~$ ksnip -v
Debug: SingleInstance mode detected, we are the client.
Debug: X11ImageGrabber selected.
Version: 1.10.1-continuous
Build: 1-2009073

OCR used:ksnip-plugin-ocr-0.1.0-continuous.deb
Build Time: Sat, 19 Mar 2022 09:36:14

DamirPorobic · 2022-05-30T06:46:49Z

That should be actually working. Do you see any message text there saying that the text is being processed?

SM-26 · 2022-05-30T08:59:29Z

@DamirPorobic nope, I don't see any message. where should I look?

the windows of OCR is a text box I can edit, but it doesn't have anything in there ATM.

DamirPorobic · 2022-05-30T09:03:50Z

There, before the inner text box comes where you write, there should be a label saying something like "Processing text..." and when done, the label is hidden and the text box comes up. Strange, doesn't look right.

One more thing, can you try some black text on white background? Maybe with few sentences so the process takes a few seconds more.

SM-26 · 2022-05-30T09:08:39Z

Nope, sorry.
I don't see any label, and the OCR windows pops up instantly.

black text and white background test seems the same

DamirPorobic · 2022-05-30T09:18:30Z

Ok, thanks, must have a look into the code, it seems to have a bug

MichelDiz · 2022-12-04T17:26:04Z

hey all, how do I build the ksnip-plugin-ocr? Theres no make file. And no instructions to do so in Windows. Thanks!

DamirPorobic · 2022-12-05T08:40:17Z

There are prebuild binaries, also for windows https://github.com/ksnip/ksnip-plugin-ocr/releases

Building it locally is quit cumbersome due to a lot of dependencies of OCR. If you still want to build it locally, you can see how the pipeline builds it https://github.com/ksnip/ksnip-plugin-ocr/blob/master/.github/workflows/windows.yml

MichelDiz · 2022-12-06T02:28:00Z

Okay thanks! that's good enough. For some reason I hadn't seen it. I thought it was TAR ball or something.

hum4nizer added the feature_request label Mar 8, 2021

DamirPorobic assigned fnkabit Mar 11, 2021

DamirPorobic mentioned this issue Jun 15, 2021

Set config file location on command line #667

Open

DamirPorobic assigned DamirPorobic and unassigned fnkabit Nov 22, 2021

DamirPorobic added a commit that referenced this issue Dec 10, 2021

Add setting for loading plugins #603

0fc131f

DamirPorobic added a commit that referenced this issue Dec 11, 2021

Load plugin version #603

7494326

DamirPorobic added a commit that referenced this issue Dec 12, 2021

Clear plugin list when no plugin was found and log when plugin found …

ffe1e2b

…but not compatible #603

DamirPorobic added a commit that referenced this issue Mar 6, 2022

Add default plugin search paths #603

92164dd

DamirPorobic added a commit that referenced this issue Mar 6, 2022

Fix windows plugin path #603

921fd86

DamirPorobic added a commit that referenced this issue Mar 8, 2022

Store selection of plugin search path #603

547ea5b

DamirPorobic added a commit that referenced this issue Mar 19, 2022

Notify plugin manager when new plugins found #603

84bbb9e

DamirPorobic closed this as completed Mar 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR scan of screenshot #603

OCR scan of screenshot #603

hum4nizer commented Mar 8, 2021 •

edited

Loading

DamirPorobic commented Mar 10, 2021

DamirPorobic commented Mar 10, 2021

hum4nizer commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

fnkabit commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

hum4nizer commented Mar 16, 2021 •

edited

Loading

fnkabit commented Mar 29, 2021

DamirPorobic commented Sep 20, 2021

DamirPorobic commented Sep 20, 2021

raphaelh commented Sep 20, 2021

DamirPorobic commented Sep 20, 2021

raphaelh commented Sep 23, 2021

DamirPorobic commented Nov 27, 2021 •

edited

Loading

DamirPorobic commented Nov 28, 2021

hum4nizer commented Nov 28, 2021

DamirPorobic commented Mar 19, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

MichelDiz commented Dec 4, 2022

DamirPorobic commented Dec 5, 2022

MichelDiz commented Dec 6, 2022

OCR scan of screenshot #603

OCR scan of screenshot #603

Comments

hum4nizer commented Mar 8, 2021 • edited Loading

DamirPorobic commented Mar 10, 2021

DamirPorobic commented Mar 10, 2021

hum4nizer commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

fnkabit commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

DamirPorobic commented Mar 11, 2021

fnkabit commented Mar 11, 2021

hum4nizer commented Mar 16, 2021 • edited Loading

fnkabit commented Mar 29, 2021

DamirPorobic commented Sep 20, 2021

DamirPorobic commented Sep 20, 2021

raphaelh commented Sep 20, 2021

DamirPorobic commented Sep 20, 2021

raphaelh commented Sep 23, 2021

DamirPorobic commented Nov 27, 2021 • edited Loading

DamirPorobic commented Nov 28, 2021

hum4nizer commented Nov 28, 2021

DamirPorobic commented Mar 19, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

SM-26 commented May 30, 2022

DamirPorobic commented May 30, 2022

MichelDiz commented Dec 4, 2022

DamirPorobic commented Dec 5, 2022

MichelDiz commented Dec 6, 2022

hum4nizer commented Mar 8, 2021 •

edited

Loading

hum4nizer commented Mar 16, 2021 •

edited

Loading

DamirPorobic commented Nov 27, 2021 •

edited

Loading