Keras: LSTM for time series prediction

Created on 31 May 2016 · 11 comments · Source: keras-team/keras

This is more of a conceptual clarification.

I have the following architecture:

from keras.models import Sequential
from keras.layers import LSTM

model = Sequential()
model.add(LSTM(batch_size,
               batch_input_shape=(batch_size, tsteps, 1)))
model.compile(loss='mse', optimizer='rmsprop', metrics=['accuracy'])

The LSTM is able to make pretty accurate predictions as of now.

Training data:
[image: plot of the training data]

Note that the training data for each of the 3 weeks is very similar in shape.

LSTM predictions on unseen testing data:
[image: plot of LSTM predictions on the test data]

As you can see, the LSTM is still able to trace the unexpected peak in the graph, which was never seen in the training data.

Is there any explanation for how the LSTM is able to do this?


Most helpful comment

@ashwin123 I am sure there is a one-step lag between the actual time series and the predicted one. This is the most common "trap" when doing time series prediction like this: the NN will always mimic the previous input of the time series.

All 11 comments

@ashwin123
What are the input and output data for this model?

@ashwin123 I am sure there is a one-step lag between the actual time series and the predicted one. This is the most common "trap" when doing time series prediction like this: the NN will always mimic the previous input of the time series.
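
To see why a lagged prediction can still look accurate, here is a numpy-only sketch (the series and noise level are made up for illustration, not taken from the thread): a "persistence" predictor that simply repeats the previous observation already achieves a low MSE on a smooth series, which is exactly how a network caught in this trap behaves.

```python
import numpy as np

# Illustrative data: a smooth periodic series with a little noise.
rng = np.random.RandomState(0)
t = np.arange(1000)
series = np.sin(2 * np.pi * t / 100) + 0.05 * rng.randn(1000)

# A "persistence" predictor: the prediction for step t is the value at t-1.
# A network stuck in the lag trap behaves almost exactly like this.
persistence_pred = series[:-1]
actual = series[1:]

mse = np.mean((persistence_pred - actual) ** 2)
print(mse)  # small, even though the "model" has learned nothing about dynamics
```

Plotted against the actual series, such predictions trace every peak, just one step late.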

@hijoe320 Yup, you are right: there was a one-time-step lag between the predicted and actual values. The NN was mimicking the previous input of the time series.
Thanks!
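
One way to confirm such a lag (a sketch, not from the thread; the `best_lag` helper and the simulated data are illustrative) is to check which shift of the predictions minimizes the error against the actual series:

```python
import numpy as np

# Simulated data: predictions that echo the actual series one step late.
rng = np.random.RandomState(1)
actual = np.cumsum(rng.randn(500))
predicted = np.roll(actual, 1)
predicted[0] = actual[0]

def best_lag(actual, predicted, max_lag=5):
    """Return the shift (0..max_lag) that best aligns predictions
    with the actual series; a nonzero result indicates a lag."""
    errors = {lag: np.mean((np.roll(predicted, -lag)[:-max_lag] -
                            actual[:-max_lag]) ** 2)
              for lag in range(max_lag + 1)}
    return min(errors, key=errors.get)

print(best_lag(actual, predicted))  # 1 -> predictions trail reality by one step
```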

@hijoe320 OK, so are we doing anything wrong to end up with this?

The main problem we have is that the testing data arrives in real time, one time step at a time. So predictions are made with batch_size = 1 and tsteps = 1.

Any way to overcome this "trap"? :grin:


More information

batch_size = 1
tsteps = 1

We have trained each time step as a sample.

The LSTM requires input of the form nb_samples x tsteps x attributes,

where a, b, c, d, ... is time series data:

10:00 am -> X = 1 x 1 x a and y = b goes into the LSTM
10:01 am -> X = 1 x 1 x b and y = c goes into the LSTM
10:02 am -> X = 1 x 1 x c and y = d goes into the LSTM, and so on.
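
The feeding scheme above can be sketched as a small helper (`make_supervised` is an illustrative name, not a Keras API):

```python
import numpy as np

def make_supervised(series, tsteps=1):
    """Frame a 1-D series as (nb_samples, tsteps, 1) inputs X
    and next-value targets y, matching the scheme above."""
    X, y = [], []
    for i in range(len(series) - tsteps):
        X.append(series[i:i + tsteps])
        y.append(series[i + tsteps])
    return np.array(X).reshape(-1, tsteps, 1), np.array(y)

# With tsteps=1 this reproduces the a->b, b->c, c->d pairing above.
series = np.array([1.0, 2.0, 3.0, 4.0])   # stands in for a, b, c, d
X, y = make_supervised(series, tsteps=1)
print(X.shape, y.shape)  # (3, 1, 1) (3,)
```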

@sjayakum you can try:

  1. randomize the training samples in each batch, making sure consecutive samples are not adjacent in time
  2. choose or design a better loss function than MSE
  3. extract some features from the input time series
  4. manually limit the weights on x_{t-1}, x_{t-2}
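
Suggestion 1 can be sketched as follows (numpy-only, with illustrative data; note this only applies to a stateless LSTM, since a stateful model depends on sample order):

```python
import numpy as np

rng = np.random.RandomState(42)

# 100 windowed samples of shape (tsteps=1, 1) with next-step targets.
X = np.arange(100.0).reshape(100, 1, 1)
y = np.arange(1.0, 101.0)

# Shuffle the samples so consecutive batch entries are not adjacent in time.
perm = rng.permutation(len(X))
X_shuffled, y_shuffled = X[perm], y[perm]

# Input/target pairs stay aligned; only their order changes.
print(np.all(y_shuffled == X_shuffled[:, 0, 0] + 1))  # True
```

In Keras, passing `shuffle=True` to `model.fit` does this per epoch for stateless models; for stateful models it must remain `False`.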

Hi there. May I know how to use an LSTM for multi-step-ahead forecasting? Suppose I would like to predict the outcome 7 days ahead. I've been trying for a few days but seem to be stuck: how should I prepare the data, and what should I set for batch size and time steps? Thanks in advance!
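
One common approach (a sketch under assumed window sizes, not a definitive recipe) is direct multi-step forecasting: each input is a window of past values and each target is the vector of the next 7 values, with the model ending in a 7-unit output layer. The data preparation might look like:

```python
import numpy as np

def make_multistep(series, n_in=14, n_out=7):
    """Frame a series for direct multi-step forecasting:
    X = n_in past values, y = the next n_out values."""
    X, y = [], []
    for i in range(len(series) - n_in - n_out + 1):
        X.append(series[i:i + n_in])
        y.append(series[i + n_in:i + n_in + n_out])
    return np.array(X).reshape(-1, n_in, 1), np.array(y)

series = np.arange(30.0)          # toy daily series
X, y = make_multistep(series)     # predict 7 days from the previous 14
print(X.shape, y.shape)           # (10, 14, 1) (10, 7)
```

The alternative is recursive one-step forecasting (feeding each prediction back in as input), but that compounds the lag problem discussed above.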

Please, did you find something to solve your problem? I have the same problem here.

Hello, I'm using an LSTM to predict time series and I'm meeting the same problem. It seems that the LSTM outputs its input as the output. Have you solved this problem? Would you give me some advice on how to solve it?

I have the same problem while using an LSTM to predict a complex time series. There is a delay between the true value and the predicted value, and they have a similar shape. It seems that the network copies the last input value to generate the output value. Are there any solutions to this problem? I would be very grateful.

In my opinion, the loss function reaches its minimum when the network mimics the actual value, so that is what the network learns to do. I think you would get the same result even without an LSTM, even with just one fully connected layer.
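
That intuition can be checked with a numpy-only sketch (illustrative random-walk data, not from the thread): the MSE-optimal linear map from x_{t-1} to x_t has a coefficient close to 1, i.e. the best linear predictor essentially copies its input.

```python
import numpy as np

# On a random walk, the least-squares fit of x_t on x_{t-1}
# recovers a coefficient near 1: "copy the last value".
rng = np.random.RandomState(7)
x = np.cumsum(rng.randn(5000))

X = x[:-1]
y = x[1:]
slope = (X @ y) / (X @ X)   # least squares through the origin
print(round(slope, 3))      # close to 1.0
```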

Same problem. Any idea how to overcome this? I have a detailed question on Stack Overflow.
