I am testing the Layout Parser (v1beta3) with the latest processor version (updated Jan 2026).
Goal: To extract individual illustrations (e.g., chairs in a catalog) along with their specific coordinates.
Current Problem: Although the processor returns a boundingPoly with 4 vertices, the polygon covers the entire page or a very large section of it. The individual elements inside are merged into one large block.
- Processor: Layout Parser (v1beta3)
- Attempted: Toggling enableLlmLayoutParsing (True/False)
- Result: blocks_count is very low (e.g., 2 blocks for a page with 10+ items).
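For anyone who wants to reproduce the numbers, here is a minimal sketch of how the block count and box coverage can be checked on the JSON response. It works on a plain dict, so it needs no client library. The key names (documentLayout, blocks, textBlock, blockId, boundingBox, normalizedVertices) and the sample data are assumptions about the response shape, not confirmed output; adjust box_key if your response names the polygon field differently.

```python
from typing import Any, Dict, Iterator, List, Optional

Block = Dict[str, Any]


def iter_blocks(blocks: List[Block]) -> Iterator[Block]:
    """Yield every block, descending into nested textBlock.blocks (assumed nesting)."""
    for block in blocks:
        yield block
        nested = block.get("textBlock", {}).get("blocks", [])
        yield from iter_blocks(nested)


def box_area_fraction(block: Block, box_key: str = "boundingBox") -> Optional[float]:
    """Return the page fraction covered by the block's axis-aligned extent.

    Assumes normalized vertices in the 0..1 range. Returns None when the
    block carries no polygon.
    """
    vertices = block.get(box_key, {}).get("normalizedVertices", [])
    if not vertices:
        return None
    # JSON serialization may omit zero-valued coordinates, so default to 0.0.
    xs = [v.get("x", 0.0) for v in vertices]
    ys = [v.get("y", 0.0) for v in vertices]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


# Hypothetical response fragment, invented for illustration only.
sample_layout = {
    "documentLayout": {
        "blocks": [
            {
                "blockId": "1",
                "boundingBox": {
                    "normalizedVertices": [
                        {}, {"x": 1.0}, {"x": 1.0, "y": 1.0}, {"y": 1.0}
                    ]
                },
                "textBlock": {"blocks": [{"blockId": "2"}]},
            },
            {
                "blockId": "3",
                "boundingBox": {
                    "normalizedVertices": [
                        {"x": 0.05, "y": 0.1},
                        {"x": 0.95, "y": 0.1},
                        {"x": 0.95, "y": 0.9},
                        {"x": 0.05, "y": 0.9},
                    ]
                },
            },
        ]
    }
}

all_blocks = list(iter_blocks(sample_layout["documentLayout"]["blocks"]))
print("blocks_count:", len(all_blocks))
for b in all_blocks:
    print(b.get("blockId"), box_area_fraction(b))
```

A fraction close to 1.0 for a block that should contain many small illustrations is the symptom described above.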
Is this a known limitation of the current preview version, or is there a specific configuration to improve the segmentation of visual elements? Any advice on how to get individual bounding boxes for small elements would be appreciated.