Results - ICDAR 2024 Competition on Historical Map Text Detection, Recognition, and Linking

method: dino_mvit (2024-04-29)

Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt

Affiliation: American Express

Description: We trained MViTv2 for the Rumsey maps and Mask DINO for the IGN maps. To further improve performance, we crop each image into 4 equal parts, run prediction on the original image and on all 4 crops, and combine the outputs.

@misc{li2022mask, title={Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation}, author={Feng Li and Hao Zhang and Huaizhe Xu and Shilong Liu and Lei Zhang and Lionel M. Ni and Heung-Yeung Shum}, year={2022}, eprint={2206.02777}, archivePrefix={arXiv}, primaryClass={cs.CV} }

@inproceedings{li2021improved, title={MViTv2: Improved multiscale vision transformers for classification and detection}, author={Li, Yanghao and Wu, Chao-Yuan and Fan, Haoqi and Mangalam, Karttikeya and Xiong, Bo and Malik, Jitendra and Feichtenhofer, Christoph}, booktitle={CVPR}, year={2022} }

method: dino_map (2024-04-29)

Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt

Affiliation: American Express

Email: rajatks@outlook.com

Description: We trained Mask DINO for both map collections. To further improve performance, we crop each image into 4 parts with some overlap, run prediction on the original image and on all 4 crops, and combine the outputs.
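The crop-and-combine step described above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `detect` callback, the `overlap` and `iou_thr` parameters, the use of axis-aligned boxes, and the merge by greedy non-maximum suppression are all hypothetical choices, since the description does not say how the outputs were combined.

```python
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]   # (x0, y0, x1, y1)
Detection = Tuple[Box, float]             # (box, confidence score)
Window = Tuple[int, int, int, int]        # crop window in full-image pixels


def quadrant_tiles(width: int, height: int, overlap: int = 0) -> List[Window]:
    """Return four crop windows; each extends `overlap` pixels past the midlines."""
    mx, my = width // 2, height // 2
    left_x1, right_x0 = min(width, mx + overlap), max(0, mx - overlap)
    top_y1, bottom_y0 = min(height, my + overlap), max(0, my - overlap)
    return [
        (0, 0, left_x1, top_y1),               # top-left
        (right_x0, 0, width, top_y1),          # top-right
        (0, bottom_y0, left_x1, height),       # bottom-left
        (right_x0, bottom_y0, width, height),  # bottom-right
    ]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def predict_with_tiles(
    width: int,
    height: int,
    detect: Callable[[Window], List[Detection]],  # hypothetical detector wrapper
    overlap: int = 0,
    iou_thr: float = 0.5,
) -> List[Detection]:
    """Run `detect` on the full image and on four crops, then merge the results.

    `detect(window)` is assumed to return boxes in window-local coordinates.
    """
    candidates = list(detect((0, 0, width, height)))
    for tile in quadrant_tiles(width, height, overlap):
        tx, ty = tile[0], tile[1]
        for (x0, y0, x1, y1), score in detect(tile):
            # Shift crop-local coordinates back into the full-image frame.
            candidates.append(((x0 + tx, y0 + ty, x1 + tx, y1 + ty), score))

    # Assumed merge strategy: greedy NMS, keeping the highest-scoring duplicate.
    kept: List[Detection] = []
    for box, score in sorted(candidates, key=lambda d: d[1], reverse=True):
        if all(iou(box, k[0]) < iou_thr for k in kept):
            kept.append((box, score))
    return kept
```

Setting `overlap=0` gives the four equal parts used by dino_mvit, while a positive `overlap` corresponds to the overlapping crops used by dino_map. Overlap reduces the chance that a word lying on a midline is split across crops, at the cost of more duplicate detections to merge.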

method: Baseline TESTR Checkpoint (2024-03-26)

Authors: Organizers

Affiliation: ICDAR'24 RRC-MapText

Description: The TESTR checkpoint is used without any additional modification or fine-tuning. The checkpoint version with the polygon prediction head, fine-tuned on TotalText, was used.

Xiang Zhang, Yongwen Su, Subarna Tripathi, Zhuowen Tu. Text Spotting Transformers. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 9519-9528.


Ranking Table


Date	Method	Quality	F-score	Tightness	Precision	Recall
2024-04-29	dino_mvit	64.75%	89.73%	72.16%	88.68%	90.81%
2024-04-29	dino_map	64.75%	89.73%	72.16%	88.68%	90.81%
2024-03-26	Baseline TESTR Checkpoint	20.56%	29.19%	70.46%	86.38%	17.56%
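The columns of the ranking table appear to be related: F-score matches the harmonic mean of Precision and Recall, and Quality matches the product of F-score and Tightness. The short check below recomputes both from the rounded table values. The formulas are inferred from the numbers shown here and are an assumption, not a statement of the official evaluation protocol; the function names are hypothetical.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given as fractions)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def quality(f: float, tightness: float) -> float:
    """Assumed relation read off the table: Quality = F-score * Tightness."""
    return f * tightness


# Rounded values copied from the ranking table, as fractions.
rows = {
    "dino_mvit": {"precision": 0.8868, "recall": 0.9081, "tightness": 0.7216},
    "Baseline TESTR Checkpoint": {"precision": 0.8638, "recall": 0.1756, "tightness": 0.7046},
}

for name, r in rows.items():
    f = f_score(r["precision"], r["recall"])
    q = quality(f, r["tightness"])
    print(f"{name}: F-score {f:.2%}, Quality {q:.2%}")
```

Because the inputs are already rounded to two decimals, the recomputed values can differ from the reported ones in the last digit.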
