Hi all,
I just started to play with Vartex AI. I am working with “Tabular” - “Forecasting” and currently struggling with few things and i hope you can help me in order i can continue. I tried to organize my questions to three categories:
1) Datasets for training:
a) “series identifier” define to which time series data are belonging … lets assume that i have two series identifier - one is called “A” and one is called “B”. Does this means that AI treats them as completely separate and noncorrelated - this means any data that belongs to series A don’t have any correlation to B, right? This give me possibility to train different dataset with one shot right? Otherwise i would need to make (in my case) two trainings - one for A and one for B.
2) Training new model → Model details
a) Is possible to predict more then one target column?
b) lets assume that my dataset data granularity is 1 day. Can i use data granularity of “5min” for Forecast configuration or can this setup decrease quality of my forecast? Should it be more correct to use already at beginning lets say dataset granularity of 5 minute and afterwards it could be more flexible when setting data granularity for forecast configuration without influencing forecast quality?
c) If I set Forecast horizon of 7 and context window 30, does this means that this setting limit my forecast to maximum 7 time steps and requesting always exactly 30 time steps of historical data as input when forecasting on existing trained model?
3) Batch predictions
a) Batch Source file
Lets assume that i have data with 15 columns from which one is “serial identifier” , one is “time step” - actually date and one of those columns is target column. Rest of 12 columns are used as influencer and used to train my model. I know that i need to have same structure for batch source file - i read that i can use same file as i used for training, but i just need to add in my case a 2x7 new rows with adding 7 dates and serial identification (in my case 7x A and 7xB) and target column need to be empty. But what should i do with data of rest 12 columns? Do i need manually to enter data for those new rows (2x7) of those 12 columns which values should be for future? But what if i don’t have those data? Does this means i cannot do prediction?
I tried to make prediction without those future data and i got following message:
“There are rows with non-empty target values after this row. The time series has been excluded from predictions.”
I hope you can help me with above questions.
Tnx in advance!
Regards,
Arny