Hello!
We’ve been trying Document AI for some time, previously we could train a custom model with no problem using around 10 documents and it worked pretty well.
But since friday we’ve been trying to train another custom model using around 50 labelled documents, but after 10 minutes or so the following error appears:
{
"name": "projects/[REDACTED]/locations/[REDACTED]/operations/[REDACTED]",
"done": true,
"result": "error",
"response": {},
"metadata": {
"@type": "type.googleapis.com/google.cloud.documentai.v1.TrainProcessorVersionMetadata",
"commonMetadata": {
"state": "FAILED",
"createTime": "2024-09-15T20:29:29.927297Z",
"updateTime": "2024-09-15T20:38:02.040543Z",
"resource": "projects/[REDACTED]/locations/[REDACTED]/processors/[REDACTED]/processorVersions/[REDACTED]"
},
"trainingDatasetValidation": {},
"testDatasetValidation": {}
},
"error": {
"code": 13,
"message": "Internal error encountered.",
"details": []
}
}
This model also made use of the parent label feature, which is in preview, so we tried again without the use of that feature (making each child of the parent label its own label), but the error persists.
The only way we found to train successfully is to disable the labels that were on the parent label (before and after removing the use of the parent label), but with that solution we miss a lot of labels we need to continue.
We checked the logs of when the error happens and it shows this:
The replica workerpool0-0 exited with a non-zero status of 1. Termination reason: Error. To find out more about why your job exited please check the logs: [REDACTED]
The URL links to a premade query from Logs Explorer from an unknown project we don’t have access to, so we can’t see more details.
We also have other dataset of documents having the same issue with around 91 documents.
What could be causing this problem or how could we see more details about the error?