We are delivering a platform to a customer based on Document AI. The use case is to send a lottery ticket image via the API and return the structured information using Document AI. We tried several hundred images and the Document AI OCR worked great (it captured the right string 95%+ of the time; the only errors were stray line feeds, Q turning into O, and similar issues that we could resolve with a post-processor). But for one set of images (from DC), the OCR fails miserably. This is a corner case that seems to throw the Document AI engine off the mark. I would greatly appreciate it if anyone could help explain it.
See one particular image, which is the most problematic.
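A minimal sketch of the kind of post-processing mentioned above (collapsing line feeds and recovering a Q that was read as O) might look like the following. It is only an illustration under stated assumptions: the vocabulary `EXPECTED_TOKENS` and the function names are hypothetical and not part of Document AI or of the original setup.

```python
import re

# Hypothetical vocabulary of tokens expected on a ticket (an assumption for illustration).
EXPECTED_TOKENS = {"QP", "QUICK", "PICK"}


def normalize_whitespace(text: str) -> str:
    """Collapse line feeds and runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text).strip()


def fix_q_as_o(token: str, vocabulary=EXPECTED_TOKENS) -> str:
    """Return a known token if swapping one 'O' for 'Q' produces it; otherwise return the token unchanged."""
    if token in vocabulary:
        return token
    for i, ch in enumerate(token):
        if ch == "O":
            candidate = token[:i] + "Q" + token[i + 1:]
            if candidate in vocabulary:
                return candidate
    return token


def postprocess(raw: str) -> str:
    """Apply whitespace normalization, then the Q/O correction token by token."""
    tokens = normalize_whitespace(raw).split(" ")
    return " ".join(fix_q_as_o(t) for t in tokens)
```

Restricting the Q/O correction to a known vocabulary keeps it from rewriting legitimate words that contain the letter O. Note that this kind of clean-up only helps when the OCR output is close to correct; it does not address the case where the engine misreads the whole image.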
I found this guide, which might be helpful for your case. If not, please give me more time so I can provide a proper answer to the issue you are facing.
Sorry, but you missed the entire point. The issue is that the core OCR engine is failing to process the image properly. If the product team takes a look at the image and the result, it may give them a clue. Hopefully they will be able to identify a corner case that will improve the OCR results.
We are quite familiar with the documents and with how to parse the Document AI results.
Hi Anil, sorry it took me so long to answer. I couldn't find any information on why the OCR is failing for the image you specified. My best recommendation is to file an issue in the issue tracker or open a support ticket, since this seems to be an issue specific to your case.