Releases: OCR-D/core
Releases · OCR-D/core
v3.9.1
v3.9.0
Changed:
- Support multiple output file groups for processors, #1344
OcrdPageResult: replace by proxy classOcrdPageResultVariadicListWrapperwith list semantics and variadic constructor (with the original class now underSingleOcrdPageResult)Processor.process_page_file: handle results fromprocess_page_pcgts
as lists:- split
output_file_grpwith commas (just asinput_file_grp) - iterate over output file groups and
OcrdPageResult - log error if there are more results than output file groups (that will get lost)
- raise
FileExistsError(in order to skip actual computation) iff output file exists for all output file groups - make output files (and file IDs), and save images etc for each output independently
- split
- PAGE API:
get_AllRegionsavailable for all region types, not just PAGE root, #1344 ocrd_network: Update RabbitMQ from 3.12 to (latest) 4.2, #1348ocrd_network: Fix and improve logging for network integration tests, #1348
Fixed:
Added:
- test combinations of
OCRD_*config variables and multi-output, #1344
v3.8.1
v3.8.0
Added:
ocrd-commandprocessor to run arbitrary PAGE transformation CLIs, #1343- various parameter presets for ocrd-command, #1343
ocrd-mergeprocessor to join multiple PAGE inputs by concatenation, #1343- test coverage for ocrd-filter, ocrd-command, and ocrd-merge, #1343
- Resource Manager Server as
ocrd_networkanalogon ofocrd.cli.resmgr, #1309ocrd network resmgr-serverfor triggering Resource Manager Server (RMS) in the background- Processing Server also deploys RMS on each processing host
Fixed:
Page.get_ReadingOrderGroups: sort by index, useOrderedDictas resultOcrdAgent.notes: convert to dict to accommodate pydantic 2 with older lxmlocrd.resource_manager: ensure necessary + reduce unnecessary updates of user databaseocrd.resource_manager: deduplicate entries (newer wins) before updating user databaseocrd resmgr download: extract archives independent of whether they are URLs or local pathsocrd resmgr download: if--overwrite, ensure the old res gets removedocrd resmgr download: default todatalocation instead of first in list of allowedocrd_utils.list_all_resources: filter module non-resource files w/ more anti-patternsocrd_utils.list_all_resources: no subpaths except forcwdlocation, OCR-D/spec#263, #1315ocrd_utils.list_all_resources: filter resources via media (MIME) type, if specified, #1315
Merged PR
- drop ocrd-distributed resource_list.yml for good by @bertsky in #1322
- resmgr download: implement git clone by @bertsky in #1340
- Continuation of #1309: Implementation of the resource manager server (issue #1294) by @MehmedGIT in #1319
- add builtin processors ocrd-command and ocrd-merge by @bertsky in #1343
Full Changelog: v3.7.0...v3.8.0
v3.7.0
v3.6.0
Changed:
ocrd network client workflow run: Allow passing workflow asocrd processtasks as alternative to a workflow file, #1264, #1335- Processing Server: Store page ranges as comma-separated page IDs instead of relying on
..numeric range operator, #1333, #1334 ocrd network: More verbose error messages instead of plain assertions, #1337
Removed:
- 🔥 Remove Processor Server in favor of Processing Server + Processing Workers only, #1338, OCR-D/spec#265
Merged PR:
v3.5.1
v3.5.0
Added:
- Extended syntax for filtering pages with
--page-idandOcrdMets.find_files, #1328, #1328- negation operator
~, e.g.~PHYS_0001..PHYS_0004to not match the first 4 pages - filtering by attributes of
mets:divfor both physical (prefixphysical:) andlogical(prefixlogical:) elements, e.g.~logical:type:bindingto skip all pages that are book bindings
- negation operator
v3.4.1
Fixed:
OCRD_MISSING_INPUTconfig variable only checked existence ofmets:filenot the actual file on disk even when set toABORT, #1330- Fix processor CLI subcommand
worker|serverbroken due to changedEnumhandling in click 8.2, #1331 - Typo: proper log message if media type could not be determined, #1330, bertsky/workflow-configuration#35
Processor.zip_input_files: always sort pages in physical order, #1330
v3.4.0
Added:
make test-cuda-{torch,tf2,tf1}: ensure CUDA is available and shapely#1598 does not trigger, #1326ocrd ocrd-tool dump-module-dirsto dumpmoduledirof every tool in anocrd-tool.json, #1326
Fixed:
ocrd workspace find --undo-download: Unsetlocal_filenameonly after unlinking, #1324, #1325- METS Server: do not convert
Noneto the string"None", #1324, #1325
Changed:
- Ensure logging files and directories are writeable for non-root users, #1214