- Task 1 - Detection
- Task 2 - Detection-Linking
- Task 3 - Detection-Recognition
- Task 4 - Detection-Recognition-Linking
method: dino_map2024-04-29
Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt
Affiliation: American Express
Email: rajatks@outlook.com
Description: We trained Mask DINO for both the maps. To further improve the performance, we crop the image into 4 parts with some overlap, we predict on original image and all 4 cropped images and combine the output.
method: dino_mvit2024-04-29
Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt
Affiliation: American Express
Email: rajatks@outlook.com
Description: We trained MViTv2 for Rumsey Map and Mask DINO for IGN map. To further improve the performance, we crop the image into 4 equal parts, we predict on original image and all 4 cropped images and combine the output.
method: Baseline TESTR Checkpoint2024-03-26
Authors: Organizers
Affiliation: ICDAR'24 RRC-MapText
Description: TESTR checkpoint is used without any additional modifications or finetuning. The model checkpoint version with polygon prediction head and fine-tuned on TotalText was used.
Date | Method | Quality | F-score | Tightness | Precision | Recall | |||
---|---|---|---|---|---|---|---|---|---|
2024-04-29 | dino_map | 73.38% | 87.34% | 84.02% | 87.21% | 87.47% | |||
2024-04-29 | dino_mvit | 72.41% | 86.66% | 83.56% | 89.21% | 84.25% | |||
2024-03-26 | Baseline TESTR Checkpoint | 55.13% | 69.29% | 79.57% | 71.85% | 66.90% | |||
2024-05-04 | MapText Using EasyOCR | 42.67% | 58.33% | 73.16% | 69.29% | 50.36% |