Dataset looks like:
- all features
x
are numerical and scaled except forname
(which is currently the indexed alongside year)
[name, year, x1, x2, x3, x4, ...]
josh 2001 ... #the various values for the x_features, for that name, at that time
josh 2002 ...
josh 2003 ...
bill 2001 ...
bill 2002 ...
bill 2003 ...
I have already applied StandardScaler
to my entire time series dataset.
I now am about to use
PCA
, but I stopped to wonder if it can/should be applied to an entire time series dataset like the one above.
- I have just finished researching PCA quite heavily, but could not think of a reason why using it on a time series would be any different.
- Am I forgetting something critical about PCA in respect to time series??
I found some older mentions of Functional PCA, but is this still relevant/needed? Or has SciKit.learn made this obsolete?