Replies: 6 comments 1 reply
-
(Baselines is not the right term here, so I am using the term "textlines") dinglehopper matches the extracted text in order (e.g. ReadingOrder for PAGE documents, or in document order in ALTO, text files etc.). It uses the shortest alignment as per the Levenshtein algorithm. If you have lines that only appear on one side, there will be no match (or a bad match). An example (from the README): https://raw.githubusercontent.com/qurator-spk/dinglehopper/master/.screenshots/dinglehopper.png If you look at the second line, there's no match on the left (GT) side and the extra text on the right side (OCR) is counted as an error. |
Beta Was this translation helpful? Give feedback.
-
Does that mean that the levenshtein distance is calculated from every textline of the ground truth to every textline of the hypothesis and based on the smallest distance the allignment is made? |
Beta Was this translation helpful? Give feedback.
-
No, the alignment/calculating the distance works on the full concatenated text, not on single textlines. |
Beta Was this translation helpful? Give feedback.
-
Can you elaborate on that? I don´t really understand how calculating the levenshtein distance on the whole transcription would be helpful, escpecially since it sounds like the alignment of the textlines has been set beforehand. Thank you so much again! |
Beta Was this translation helpful? Give feedback.
-
Calculating the Levenshtein distance involves aligning the two texts, so you don't get "a distance", but "the distance" = the shortest series of operations to get from one text to the other. |
Beta Was this translation helpful? Give feedback.
-
@icarl-ad mailed me about this. There seems to be a fundamental misunderstanding: There are no textlines matched. dinglehopper does not work with lines.
|
Beta Was this translation helpful? Give feedback.
-
Hi,
I was wondering how the baselines (or the transcription of baselines) of the groundtruth and hypothesis are matched? This is obviosly important for the calculation of KPIs.
If e.g. the levenshtein distance is applied, how does this work with an extremly bad transcription or baselines that occur only in groundtruth or hypothesis?
Thank you for your help!
Beta Was this translation helpful? Give feedback.
All reactions