
murali-kri5hna/icdar23

Experimenting with the incorporation of RL into Unsupervised Writer Retrieval for Historical Datasets (ICDAR 2023) using NetVLAD

Additions to Base Repository

This repository extends the base implementation with the idea of optimizing the writer retrieval model towards task risk, using rewards computed from the mean average precision (mAP) of a ranked document list.

Training with reward tuning is implemented in reward_tune.py, and additional losses and samplers are added to the utils folder:

  • m_n_sampler: sampling with restricted writer and page requirements
  • ranked_list_reward: Ranked List Reward implementation, combined with the mean average precision loss from FastAP (a minimal sketch of the reward idea follows this list)
  • Smooth_AP_loss_Brown: Smooth-AP loss following Brown et al.
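
As an illustration of the reward, below is a minimal sketch of computing batch mAP from ranked lists in PyTorch. Names and shapes are illustrative rather than the actual implementation in utils; it assumes L2-normalized embeddings and one integer writer label per sample.

import torch

def average_precision(ranked_relevance):
    # ranked_relevance: 0/1 vector ordered by descending similarity to the query
    hits = torch.cumsum(ranked_relevance, dim=0)
    ranks = torch.arange(1, ranked_relevance.numel() + 1, dtype=ranked_relevance.dtype)
    precision_at_hits = (hits / ranks) * ranked_relevance
    return precision_at_hits.sum() / ranked_relevance.sum().clamp(min=1)

def batch_map_reward(embeddings, writers):
    # embeddings: (B, D) L2-normalized descriptors; writers: (B,) integer labels
    sims = embeddings @ embeddings.t()
    sims.fill_diagonal_(float('-inf'))           # a query never retrieves itself
    order = sims.argsort(dim=1, descending=True)
    relevant = (writers.unsqueeze(0) == writers.unsqueeze(1)).float()
    relevant.fill_diagonal_(0.0)
    aps = [average_precision(rel[idx]) for rel, idx in zip(relevant, order)]
    return torch.stack(aps).mean()               # batch mAP, usable as a reward

Since the ranking produced by argsort is non-differentiable, a quantity like this serves as a reward to tune towards rather than a loss to backpropagate through directly.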

The steps from the base repository, listed below, can be followed to set up training and testing.



This repository contains the official code implementation of our paper

Marco Peer, Florian Kleber and Robert Sablatnig: Towards Writer Retrieval for Historical Datasets,

an unsupervised approach using NetRVLAD and Similarity Graph Reranking, presented at ICDAR 2023. It is currently state of the art on the ICDAR17 (80.6% mAP) and ICDAR19 (93.2% mAP) datasets for writer retrieval. Paper

Installation

Install the packages via

pip install -r requirements.txt

The repository uses wandb for logging.
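
If you have not used wandb before, log in once before starting a run (this is the standard wandb CLI command, not specific to this repository):

wandb login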

Patch extraction

We provide four scripts (the two below, each with a color version for RGB images) to extract patches from the documents:

  • extract_patches_only: only extracts patches, without clustering (mainly used for test sets)
  • extract_patches: extracts patches and clusters their descriptors (mainly used for train sets)

The respective configs for the scripts to reproduce our results are located in the config directory (config/config_patches.yml).
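
For example, extracting training patches with the provided config might look like the following; this assumes the extraction scripts accept the config path via --config, analogous to reward_tune.py, so check each script's arguments:

python extract_patches.py --config=config/config_patches.yml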

Defining your dataset

If you use the patch extraction scripts provided, edit the paths of the respective dataset in writer_zoo.py:

'icdar2017': {
    'basepath': BASEPATH,
    'set': {
        'test':  {'path': YOUR_TEST_DIRECTORY,
                  'regex': {'writer': r'(\d+)', 'page': r'\d+-IMG_MAX_(\d+)'}},

        'train': {'path': YOUR_TRAIN_DIRECTORY,
                  'regex': {'cluster': r'(\d+)', 'writer': r'\d+_(\d+)', 'page': r'\d+_\d+-\d+-IMG_MAX_(\d+)'}},
    }
}

The labels used by our repository are extracted via regular expressions (supported: ICDAR2013, ICDAR2017, and ICDAR2019 datasets; refer to our paper for more details).
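
For illustration, the test-set regexes above would label a hypothetical file named 0123-IMG_MAX_0456.jpg as writer 0123 and page 0456:

import re

fname = '0123-IMG_MAX_0456.jpg'  # hypothetical name following the scheme above
writer = re.search(r'(\d+)', fname).group(1)            # '0123'
page = re.search(r'\d+-IMG_MAX_(\d+)', fname).group(1)  # '0456'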

Training and testing

Run

python reward_tune.py --gpuid=GPU_ID --config=config/icdar2017.yml

to run training on ICDAR2017. Afterwards, testing is executed if a test set is specified in the config file. Refer to main.py for further commands.

Rerank

For the reranking part, you are expected to provide an embedding file (.npy); then run

python rerank.py --algorithm='sgr'

which creates the reranked descriptors and saves them. Refer to rerank.py for further commands and functionality.
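
If you need to produce such an embedding file yourself, a minimal sketch (array contents, shape, and file name are placeholders; check rerank.py for the format it expects):

import numpy as np

# one row per document, e.g. the global descriptors computed at test time
descriptors = np.random.randn(3600, 512).astype(np.float32)  # placeholder data
np.save('descriptors.npy', descriptors)  # the .npy file consumed by rerank.py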

Citing

Please consider citing our paper if you find our resources helpful:

@inproceedings{10.1007/978-3-031-41676-7_24,
author = {Peer, Marco and Kleber, Florian and Sablatnig, Robert},
title = {Towards Writer Retrieval for Historical Datasets},
year = {2023},
booktitle = {Document Analysis and Recognition - ICDAR 2023: 17th International Conference, San Jos\'{e}, CA, USA, August 21--26, 2023, Proceedings, Part I},
pages = {411--427},
numpages = {17},
location = {San Jos\'{e}, CA, USA}
}

Feel free to reach out to us via mpeer(at)cvl.tuwien.ac.at in case you find errors or have questions about our paper.
