Enable OCR for Material Text Extraction #6423

jrjohnson · 2025-08-13T04:50:56Z

Apache provides a Tika image with full OCR capabilities through Tesseract. All we have to do is install it and we get OCR text extraction for materials that contain images of text instead of actual text.

Refs #6421
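
For context, here is a minimal sketch of how a client could hand a scanned material to a Tika server for text extraction. It assumes the server runs from one of the `-full` tags of the `apache/tika` image (the ones that bundle Tesseract) and listens on Tika's default port 9998. The base URL and the helper names `build_extract_request` and `extract_text` are hypothetical and not taken from this PR.

```python
import urllib.request


def build_extract_request(
    data: bytes, base_url: str = "http://localhost:9998"
) -> urllib.request.Request:
    """Build a PUT /tika request that asks the server for plain text."""
    return urllib.request.Request(
        url=f"{base_url}/tika",
        data=data,
        method="PUT",
        headers={"Accept": "text/plain"},
    )


def extract_text(data: bytes, base_url: str = "http://localhost:9998") -> str:
    """Send the document bytes to the Tika server and return the extracted text."""
    request = build_extract_request(data, base_url)
    with urllib.request.urlopen(request, timeout=60) as response:
        return response.read().decode("utf-8")
```

The client side stays the same whether or not OCR is available; the difference is in what the server can return for image-only materials.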

ucsf-sonarqube-cloud-public-repo · 2025-08-13T19:03:48Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
No data about Duplication

See analysis details on SonarQube

stopfstedt

let's do it. LGTM.

Enable OCR for Material Text Extraction

92e6534

jrjohnson force-pushed the add-orc-to-tika branch from 7e8e68f to 92e6534 August 13, 2025 18:46

jrjohnson marked this pull request as ready for review August 13, 2025 19:46

jrjohnson requested a review from stopfstedt as a code owner August 13, 2025 19:46

stopfstedt approved these changes Aug 13, 2025

stopfstedt merged commit 3fabaaa into ilios:master Aug 13, 2025
35 checks passed

jrjohnson deleted the add-orc-to-tika branch August 13, 2025 21:41
