**Hi @NorieRam **
Thank you for your response and recommendations. We have conducted several tests and identified an interesting behavior that led us to pinpoint an issue with scheduled executions in Dataform.
Context and Initial Configuration
We have configured Dataform to work with BigQuery Omni on AWS (aws-us-east-1), setting the following parameters in workflow_settings.yaml:
.yaml
defaultProject: augusta-bavv-dev-activo
defaultLocation: aws-us-east-1
defaultDataset: dataform
defaultAssertionDataset: dataform_assertions
dataformCoreVersion: 3.0.0
Additionally, when executing queries from Workspace, external tables in BigQuery Omni are successfully created, and we were able to deploy some views by adjusting the YAML configuration with location=aws-us-east-1.
Here’s an example of a view that we tested and confirmed to be working correctly:
prueba.sqlx
config {
type: "operations"
}
CREATE OR REPLACE VIEW
`aws_omni_view.prueba` AS
SELECT
REGEXP_EXTRACT(_FILE_NAME, r'/([^/]+)/[^/]+\.parquet$') AS subfolder,
PARSE_DATE('%Y%m%d', REGEXP_EXTRACT(_FILE_NAME, r'\d{8}')) AS partition_date
FROM
`aws_omni_campanas.prueba`
We also confirmed that executions from the BigQuery Console run correctly and that data is created in the expected region (aws-us-east-1).
Furthermore, when reviewing BigQuery Job History, we observed that manual executions from Workspace are indeed running in the configured region (aws-us-east-1), indicating that the YAML settings are being applied correctly within the Workspace environment.
Identified Issue
Even though executions from Workspace behave as expected, when we schedule a Dataform execution, we receive an error.
Root Causes Identified
The workflow_settings.yaml file only affects the Workspace but does not control the actual execution of scheduled workflows.
- While queries run correctly in aws-us-east-1 from the Workspace, when scheduled, Dataform follows the configuration in release_config.
The release_config was set to us-east1 with target: bigquery.
- This caused Dataform to execute workflows on GCP instead of AWS Omni.
There was no properly configured release_config for aws-us-east-1.
- Since no specific release_config existed for AWS Omni, executions defaulted to us-east1 on GCP.
Evidence from Logs:
By analyzing the execution logs, we found that the releaseConfigId is “omni” and it is executing in “location”: “us-east1”, confirming that Dataform jobs are being executed in the wrong region.
json
{
"insertId": "geq081ch6v",
"jsonPayload": {
"@type": "type.googleapis.com/google.cloud.dataform.logging.v1.WorkflowInvocationCompletionLogEntry",
"releaseConfigId": "omni",
"workflowInvocationId": "1741637475-a4f89d69-7cf4-476b-82cd-b5c7a2ebd314",
"terminalState": "FAILED",
"workflowConfigId": "naruto"
},
"resource": {
"type": "dataform.googleapis.com/Repository",
"labels": {
"location": "us-east1", <--- ⚠️ EXECUTING ON GCP INSTEAD OF AWS Omni
"resource_container": "626537586202",
"repository_id": "augusta-bavv-bigquery-omni-aws"
}
},
"timestamp": "2025-03-10T20:11:15.529842342Z",
"severity": "ERROR",
"logName": "projects/augusta-bavv-dev-activo/logs/dataform.googleapis.com%2Fworkflow_invocation_completion",
"receiveTimestamp": "2025-03-10T20:11:16.071909743Z"
}
Next Steps
To resolve this issue, we believe that:
A release_config must be created and properly configured for aws-us-east-1 so that scheduled executions in Dataform use the correct region instead of defaulting to GCP.
Could you provide guidance on the best approach to configure the release_config in this scenario to ensure that scheduled executions respect the AWS Omni configuration?
We appreciate any additional recommendations.