method: dino_map (2024-04-29)

Authors: Rajat Kumar Singh, Himani Shrotriya, Shivshankar Reddy, Himanshu Bhatt

Affiliation: American Express

Email: rajatks@outlook.com

Description: We trained Mask DINO for both maps. To further improve performance, we crop each image into four overlapping parts, run prediction on the original image and on all four crops, and combine the outputs.
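
The crop-and-combine step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the 2x2 layout, the `overlap_px` parameter, and the `predict_fn` callback (a stand-in for a Mask DINO inference call) are hypothetical, and the description does not say how duplicates in the overlap regions are resolved, so the sketch only gathers all predictions in full-image coordinates.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]      # (left, top, right, bottom) in pixels
Polygon = List[Tuple[float, float]]  # list of (x, y) vertices


def four_overlapping_crops(width: int, height: int, overlap_px: int) -> List[Box]:
    """Return a 2x2 grid of crop boxes whose neighbours overlap by about overlap_px."""
    crop_w = (width + overlap_px + 1) // 2
    crop_h = (height + overlap_px + 1) // 2
    lefts = [0, width - crop_w]
    tops = [0, height - crop_h]
    return [(l, t, l + crop_w, t + crop_h) for t in tops for l in lefts]


def shift_to_full_image(polygon: Polygon, box: Box) -> Polygon:
    """Translate a polygon from crop-local to full-image coordinates."""
    left, top = box[0], box[1]
    return [(x + left, y + top) for x, y in polygon]


def predict_with_crops(
    width: int,
    height: int,
    overlap_px: int,
    predict_fn: Callable[[Box], List[Polygon]],
) -> List[Polygon]:
    """Collect predictions from the full image and from the four crops.

    predict_fn is a hypothetical callback: given a box, it returns polygons
    in coordinates local to that box. Deduplication of detections that
    appear in several views is left out because the source does not
    describe it.
    """
    combined = list(predict_fn((0, 0, width, height)))
    for box in four_overlapping_crops(width, height, overlap_px):
        for polygon in predict_fn(box):
            combined.append(shift_to_full_image(polygon, box))
    return combined
```

In practice the combined list would still need a merging step (for example, some form of non-maximum suppression) before scoring; which one the authors used is not stated.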

method: DS-LP (2024-03-26)

Authors: hsy

Affiliation: BUPT

Description: Unified submission for all four tasks.
DeepSolo, Multi-Polygon NMS (word detection and recognition) -> LayoutPointer[LayoutLMv3] (word linking)

Ranking Table

Date        Method                             Quality  F-score  Tightness  Precision  Recall
2024-04-29  dino_mvit                          64.75%   89.73%   72.16%     88.68%     90.81%
2024-04-29  dino_map                           64.75%   89.73%   72.16%     88.68%     90.81%
2024-03-26  DS-LP                              44.08%   67.78%   65.03%     64.85%     71.00%
2024-05-06  MapText Detection Strong Pipeline  42.33%   61.10%   69.29%     82.51%     48.51%
2024-04-29  MapDet                             35.73%   54.70%   65.33%     70.13%     44.84%
2024-03-26  Baseline TESTR Checkpoint          20.56%   29.19%   70.46%     86.38%     17.56%
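
The page does not define the table's columns, but the reported values are numerically consistent with F-score being the harmonic mean of Precision and Recall, and Quality being the product of F-score and Tightness. The sketch below checks that reading against two rows; the relationships are inferred from the numbers, and the function names are illustrative.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both as fractions in [0, 1])."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def quality(f: float, tightness: float) -> float:
    """Assumed relationship: Quality = F-score x Tightness."""
    return f * tightness


# (precision, recall, tightness, reported F-score, reported Quality), as fractions
rows = {
    "dino_map": (0.8868, 0.9081, 0.7216, 0.8973, 0.6475),
    "DS-LP": (0.6485, 0.7100, 0.6503, 0.6778, 0.4408),
}

for name, (p, r, t, reported_f, reported_q) in rows.items():
    f = f_score(p, r)
    q = quality(reported_f, t)
    # Differences stay within rounding error of the two-decimal percentages.
    print(f"{name}: F diff {abs(f - reported_f):.5f}, Quality diff {abs(q - reported_q):.5f}")
```

Under this reading, a method with high Precision but low Recall (such as the baseline checkpoint) scores poorly on Quality even when its Tightness is competitive.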

Ranking Graphic
