Question: I ' m working on a time series prediction problem using LSTM and have several interconnected questions: 1 . When using differencing in time series
Im working on a time series prediction problem using LSTM and have several interconnected questions:
When using differencing in time series with LSTM:
a Should I calculate RMSE on the differenced values or convert back to original values first?
b What is the proper scaling approach for differenced values?
For multivariate time series analysis:
I've added features like moving averages and seasonal components sin cos
Original data has only month and target value
How should scaling be applied in this case? Should I scale all features or only specific ones?
Data preprocessing concerns:
a When adding derived features moving averages, etc. I get NaN values
b What's the best practice for handling these NaN values in time series context dropna or fillna?
c How does this choice impact model performance?
Model evaluation:
What's the proper methodology for evaluating LSTM performance in both univariate and multivariate cases?
How do different preprocessing choices affect the final evaluation metrics?
Please provide a detailed, stepbystep standard pipeline for both cases:
a Univariate time series:
Data preprocessing steps
Feature engineering
Model building and evaluation
Best practices for each step
b Multivariate time series:
Proper sequence of preprocessing steps
Feature selection and engineering
Handling multiple variables
Model architecture considerations
Evaluation methodology
Please provide comprehensive guidelines with theoretical explanations and practical considerations for each scenario.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
