LSTNet

This repository is a Tensorflow / Keras implementation of Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks paper https://arxiv.org/pdf/1703.07015.pdf

This implementation has been inspired by the following Pytorch implementation https://github.com/laiguokun/LSTNet

Installation

Clone this prerequisite repository:

git clone https://github.com/fbadine/util.git

Clone this repository:

git clone https://github.com/fbadine/LSTNet.git
cd LSTNet
mkdir log/ save/ data/

Download the dataset from https://github.com/laiguokun/multivariate-time-series-data and copy the text files into LSTNet/data/

Usage

Training

There are 4 different script samples to train, validate and test the model on the different datasets:

electricity.sh
exchange_rate.sh
solar.sh
traffic.sh

Predict

In order to predict and plot traffic you will need to run main.py as follows (example for the electricity traffic)

python main.py --data="data/electricity.txt" --no-train --load="save/electricity/electricity" --predict=all --plot --series-to-plot=0

Running Options

The following are the parameters that the python script takes along with their description:

| Input Parameters | Default | :-----------------| | --data | | --normalize |2 | --trainpercent |0.6 | --validpercent |0.2 | --window |24 * 7 | --horizon |12 | --CNNFilters |100 | --CNNKernel |6 | --GRUUnits |100 | --SkipGRUUnits |5 | --skip |24 | --dropout |0.2 | --highway |24 | --initializer | --loss | --optimizer |Adam | --lr |0.001 | --batchsize |128 | --epochs |100 | --tensorboard |None | --no-train | | --no-validation | | --test | | --load |None | --save |None | --no-saveresults | | --savehistory | | --predict |None | --plot | | --series-to-plot |0 | --autocorrelation |None | --save-plot | None | --no-log | | --logfilename |log/lstnet | --debuglevel |20 | Description | :------------------| :-----------| |Full Path of the data file. (REQUIRED)| |Type of data normalisation: - 0: No Normalisation - 1: Normalise all timeseries together - 2: Normalise each timeseries alone| |Percentage of the given data to use for training| |Percentage of the given data to use for validation| |Number of time values to consider in each input X| |How far is the predicted value Y. It is horizon values away from the last value of X (into the future)| |Number of output filters in the CNN layer A value of 0 will remove this layer| |CNN filter size that will be (CNNKernel, number of multivariate timeseries) A value of 0 will remove this layer| |Number of hidden states in the GRU layer| |Number of hidden states in the SkipGRU layer| |Number of timeslots to skip. A value of 0 will remove this layer| |Dropout frequency| |Number of timeslots values to consider for the linear layer (AR layer)| |glorot_uniform |The weights initialiser to use| |mean_absolute_error |The loss function to use for optimisation| |The optimiser to use Accepted values: - SGD - RMSprop - Adam| |Learning rate| |Training batchsize| |Number of training epochs| |Set to the folder where to put the tensorboard file If set to None => no tensorboard| |Do not train the model| |Do not validate the model| |Evaluate the model on the test data| |Location and Name of the file to load a pre-trained model from as follows: - Model in filename.json - Weights in filename.h5| |Full path of the file to save the model in as follows: - Model in filename.json - Weights in filename.h5 This location is also used to save results and history as follows: - Results in filename.txt - History in filename_history.csv if --savehistory is passed| |Do not save results| |Save training / validation history in file as described in parameter --save above| |Predict timeseries using the trained model It takes one of the following values: - trainingdata: predict the training data only - validationdata: predict the validation data only - testingdata: predict the testing data only - all: all of the above - None: none of the above| |Generate plots| |Series to plot Format: series,start,end - series: the number of the series you wish to plot - start: start timeslot (default is the start of the timeseries) - end: end timeslot (default is the end of the timeseries)| |Autocorrelation plotting Format: series,start,end - series: the number of random timeseries you wish to plot the autocorrelation for - start: start timeslot (default is the start of the timeseries) - end: end timeslot (default is the end of the timeseries)| | Location and name of the file to save the plotted images to - Autocorrelation in filename_autocorrelation.png - Training history in filename_training.png - Prediction in filename_prediction.png| |Do not create logfiles However error and critical messages will still appear| |Full path of the logging file| |Logging debug level|

Results

The followinng are the results that were obtained:

| Dataset | Width | Horizon | Correlation | RSE | | :-------------| :-----------| :-----------| :-----------| :-----------| | Solar | 28 hours | 2 hours | 0.9548 | 0.3060 | | Traffic | 7 days | 12 hours | 0.8932 | 0.4089 | | Electricity | 7 days | 24 hours | 0.8856 | 0.3746 | | Exchange Rate | 168 days | 12 days | 0.9731 | 0.1540 |

Dataset

As described in the paper the data is composed of 4 publicly available datasets downloadable from https://github.com/laiguokun/multivariate-time-series-data:

Traffic: A collection of 48 months (2015-2016) hourly data from the California Department of Transportation
Solar Energy: The solar power production records in 2006, sampled every 10 minutes from 137 PV plants in the state of Alabama
Electricity: Electricity consumption for 321 clients recorded every 15 minutes from 2012 to 2014
Exchange Rate: A collection of daily average rates of 8 currencies from 1990 to 2016

Environment

Primary environment

The results were obtained on a system with the following versions:

Python 3.6.8
Tensorflow 1.11.0
Keras 2.1.6-tf

TensorFlow 2.0 Ready

The model has also been tested on TF 2.0 alpha version:

Python 3.6.7
Tensorflow 2.0.0-alpha0
Keras 2.2.4-tf

LSTNet

Install / Use

README

LSTNet

Installation

Usage

Training

Predict

Running Options

Results

Dataset

Environment

Primary environment

TensorFlow 2.0 Ready