OCR Training preferentially identifying upside-down text

This is not urgent, as I think the disparate formats I have, AppSheet's limit of ten training images, and the lack of identifiable anchor words make trained OCR unsuitable for this app.

It looks like AppSheet OCR Training preferentially recognises inverted text.

AppSheet OCR Training showed the image, and others, with the correct orientation. However, in every case where there was an upside-down copy of the target text, the inverted text appeared to be preferentially identified.

Based on the anchor-word warning message, I understood that if no anchor words were identified, the default was to match from the top left.

I have searched generally for information on Tesseract training and text inversion and found nothing helpful.

Is this expected behaviour?

In the end I went for untrained OCR and didn't worry about the training aspects.

Any additional knowledge you discovered that you can add?


I discovered nothing else of value. I suspect, but have not confirmed, that it is an issue with Tesseract (the underlying OCR engine). I was not able to confirm whether it is a defect or a feature.

Untrained OCR was good enough to demonstrate to the client, but in the end, against our advice, the client went with on-device OCR in a native Kotlin Android app using the ML Kit and CameraX APIs.
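For anyone considering the same route, here's a minimal sketch of what that kind of setup typically looks like: a CameraX `ImageAnalysis.Analyzer` that feeds camera frames to ML Kit's on-device Latin text recognizer. The class name `TextOcrAnalyzer` and the `onText` callback are just illustrative assumptions for this example, not the client's actual implementation.

```kotlin
import android.media.Image
import androidx.annotation.OptIn
import androidx.camera.core.ExperimentalGetImage
import androidx.camera.core.ImageAnalysis
import androidx.camera.core.ImageProxy
import com.google.mlkit.vision.common.InputImage
import com.google.mlkit.vision.text.TextRecognition
import com.google.mlkit.vision.text.latin.TextRecognizerOptions

// Illustrative CameraX analyzer that runs ML Kit on-device text recognition
// on each frame. Class name and callback are assumptions for this sketch.
class TextOcrAnalyzer(
    private val onText: (String) -> Unit // called with the recognized text
) : ImageAnalysis.Analyzer {

    // Default (Latin-script) recognizer; runs entirely on device.
    private val recognizer =
        TextRecognition.getClient(TextRecognizerOptions.DEFAULT_OPTIONS)

    @OptIn(ExperimentalGetImage::class)
    override fun analyze(imageProxy: ImageProxy) {
        val mediaImage: Image? = imageProxy.image
        if (mediaImage == null) {
            imageProxy.close()
            return
        }
        // Pass the frame's rotation so ML Kit analyses it the right way up.
        val input = InputImage.fromMediaImage(
            mediaImage,
            imageProxy.imageInfo.rotationDegrees
        )
        recognizer.process(input)
            .addOnSuccessListener { result -> onText(result.text) }
            .addOnCompleteListener { imageProxy.close() } // always release the frame
    }
}
```

You would bind this to an `ImageAnalysis` use case with `setAnalyzer()` alongside your preview use case in the usual CameraX way.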

Sorry I can't be of more help.