BigQuery external table using JSON files

How large are your JSON files (in MBytes and rows)?
How do they change? (Updates to the files, new files, appends to existing files)
How do you plan to query the data that is given to you in JSON format? (real-time queries, batch processing, BI)
How much latency can you accept between when new JSON files are available and when the data will show up in BigQuery queries?

Some immediate thoughts that come to mind:

  1. Google doesn’t charge for ingestion of new data into BigQuery
  2. The size of data in JSON format is usually larger than the size of data stored in BigQuery internal tabular format
  3. External tables are slower to query than internal BigQuery tables.
1 Like