Document AI Form Processor Parse Table Structure Incorrectly

anonymous · August 2, 2022, 3:17am

Hi All,

We are currently using Document AI for form parsing some PDF document and half of times the default former processor either missing a col or messed up some col structure.

Let’s say the expected file header

Sales | Dollar Volume | Average Price

For example, I saw cases like

Missing Header

Sales|Average Price

Wrong structure

SalesDollar|Volume|Average Price

The content of first two cols are messed up as well. The cell could be missing value or incomplete value.

Any recommendation to improve this? If no easy way, any guidance with examples to train or deploy one’s own form processor? PS: the document has the same structure.

josegutierrez · August 4, 2022, 5:57pm

You can improve the data results by using Document AI Parser with AI Platform Notebooks. Also you can use Vision AI to create your own Parser.

Topic		Replies	Views
Parsing single digit in table with Form parser AI Solutions document-ai	2	21	April 20, 2023
Document AI is not parsing the PDF(having combination of multiple tables) data correctly AI Solutions document-ai	0	41	August 4, 2025
Document AI missing some line-items in similar documents AI Solutions document-ai	3	74	October 4, 2025

Document AI Form Processor Parse Table Structure Incorrectly

AI Suggested topics