- Task 1 - Detection
- Task 2 - Detection-Linking
- Task 3 - Detection-Recognition
- Task 4 - Detection-Recognition-Linking
method: dino_mvit2024-04-29
Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt
Affiliation: American Express
Email: rajatks@outlook.com
Description: We trained MViTv2 for Rumsey Map and Mask DINO for IGN map. To further improve the performance, we crop the image into 4 equal parts, we predict on original image and all 4 cropped images and combine the output.
method: dino_map2024-04-29
Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt
Affiliation: American Express
Email: rajatks@outlook.com
Description: We trained Mask DINO for both the maps. To further improve the performance, we crop the image into 4 parts with some overlap, we predict on original image and all 4 cropped images and combine the output.
method: Baseline TESTR Checkpoint2024-03-26
Authors: Organizers
Affiliation: ICDAR'24 RRC-MapText
Description: TESTR checkpoint is used without any additional modifications or finetuning. The model checkpoint version with polygon prediction head and fine-tuned on TotalText was used.
Date | Method | Quality | F-score | Tightness | Precision | Recall | |||
---|---|---|---|---|---|---|---|---|---|
2024-04-29 | dino_mvit | 64.75% | 89.73% | 72.16% | 88.68% | 90.81% | |||
2024-04-29 | dino_map | 64.75% | 89.73% | 72.16% | 88.68% | 90.81% | |||
2024-03-26 | Baseline TESTR Checkpoint | 20.56% | 29.19% | 70.46% | 86.38% | 17.56% |