レイアウトパーサーに関する問題:個々の画像が別々のブロックとして検出されず、バウンディングボックスが欠落しています

I am testing the Layout Parser (v1beta3) with the latest processor version (updated Jan 2026).

Goal: To extract individual illustrations (e.g., chairs in a catalog) along with their specific coordinates.

Current Problem: Although the processor returns a boundingPoly with 4 coordinates, it only covers the entire page or a very large section. The individual elements inside are merged into one large Block.

  • Processor: Layout Parser (v1beta3)

  • Attempted: Toggle enableLlmLayoutParsing (True/False)

  • Result: blocks_count is very low (e.g., 2 blocks for a page with 10+ items).

Is this a known limitation of the current preview version, or is there a specific configuration to improve the segmentation of visual elements? Any advice on how to get individual bounding boxes for small elements would be appreciated.

1 Like