You can download the data here: exampleco_data
Main objective: to develop an automated method to pinpoint the times of fault and failure in this machine.
- Load in Data and necessary libraries
- Data exploration
- Noise Removal using Moving Window Average (see the sketch after this list)
- Data Splitting
- Data preparation
- Implementation of LSTM Autoencoder
- Data visualization to pinpoint the times of fault and failure (i.e., to detect anomalies).
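The noise-removal step in the pipeline above can be done with a simple centered rolling mean. A minimal sketch, assuming the raw signals live in a pandas DataFrame; the name `df` and the 50-sample window are placeholders, not the values actually used in this project:

```python
# Minimal sketch of the moving-window-average smoothing step.
# Assumption: raw sensor signals are columns of a pandas DataFrame.
import pandas as pd

def smooth(df: pd.DataFrame, window: int = 50) -> pd.DataFrame:
    """Smooth each signal with a centered rolling mean to suppress noise."""
    return df.rolling(window=window, center=True, min_periods=1).mean()

# Example usage (hypothetical): df_smoothed = smooth(df_raw, window=50)
```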
training.py
-- trains the LSTM Autoencoder on data with a normal pattern (a data-preparation sketch follows this list)
anomaly_detection.py
-- detects anomalies whose reconstruction error exceeds the minimal (normal) reconstruction error
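Before training, the smoothed normal-regime series is cut into fixed-length windows shaped (samples, timesteps, features), as an LSTM expects. A rough sketch of such a data-preparation helper in the spirit of training.py; the function name, `timesteps`, and `stride` are illustrative assumptions:

```python
# Rough sketch: slice a multivariate time series into overlapping windows
# shaped (samples, timesteps, features) for LSTM training.
import numpy as np

def make_windows(values: np.ndarray, timesteps: int = 30, stride: int = 1) -> np.ndarray:
    """values: 2-D array (time, features) -> 3-D array (samples, timesteps, features)."""
    windows = [values[i:i + timesteps]
               for i in range(0, len(values) - timesteps + 1, stride)]
    return np.stack(windows)

# Example usage (hypothetical): X_train = make_windows(normal_values, timesteps=30)
```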
Autoencoders are neural networks that aim to reconstruct their input. They consist of two parts: an encoder and a decoder. The encoder maps input data to a latent space (or hidden representation), and the decoder maps back from the latent space to the input space.
Typically the latent space has a lower dimensionality than the input space, so autoencoders are forced to learn compressed representations of the input data, which enables them to capture the correlations and interactions within it. The autoencoder is trained to reconstruct data with a normal pattern (e.g., normal time series) by minimizing a loss function that measures the quality of the reconstructions. After training, the model can reconstruct normal data with minimal reconstruction error.
Once training is completed, the reconstruction error is used as an anomaly score to detect anomalies at future time instances: if the model is given an anomalous (NOT NORMAL) sequence, it will not be able to reconstruct it well, leading to a higher reconstruction error than for normal sequences. This is why we have to assume that the training data is in a NORMAL STATE.
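A hedged sketch of this scoring step, in the spirit of anomaly_detection.py: the reconstruction error is computed per window and compared against a threshold derived from the errors on the (assumed normal) training windows. The percentile-based threshold shown here is an assumption, not necessarily the rule used in this project.

```python
# Sketch of reconstruction-error scoring and threshold-based anomaly flagging.
import numpy as np

def reconstruction_errors(model, X: np.ndarray) -> np.ndarray:
    """Mean absolute error per window between the input and its reconstruction."""
    X_hat = model.predict(X)
    return np.mean(np.abs(X - X_hat), axis=(1, 2))

# Hypothetical usage:
# train_errors = reconstruction_errors(model, X_train)
# threshold = np.percentile(train_errors, 99)   # assumed thresholding rule
# anomalies = reconstruction_errors(X_test_errors := X_test, X_test) > threshold  # flag windows above threshold
```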
LSTM (Long Short-Term Memory) is an improved Recurrent Neural Network (RNN): a powerful sequence learner with a memory cell and gates that control which information is added to, removed from, and output from the memory cell. The major attribute of the LSTM in comparison to a plain RNN is the memory cell, which stores short- and long-term information about the input sequence across timesteps.
In our case, the LSTM Autoencoder is used for sequence-to-sequence (seq2seq) learning: the encoder reads a variable-length input sequence and converts it into a fixed-length vector representation (of reduced dimension), and the decoder takes this vector representation and converts it back into a variable-length sequence.
In general, the learned vector representation corresponds to the final hidden state of the encoder network, which acts as a summary of the whole sequence. Our LSTM Autoencoder is an example of a seq2seq autoencoder in which the input and output sequences are aligned in time (x = y) and thus have equal lengths (Tx = Ty).
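As an illustration, a seq2seq LSTM Autoencoder like the one described above can be built in a few lines of Keras. This is a minimal sketch assuming a Keras implementation; the layer sizes, optimizer, and loss are illustrative choices, not the exact configuration used here.

```python
# Minimal sketch of a seq2seq LSTM Autoencoder for windows of shape (timesteps, n_features).
from tensorflow import keras
from tensorflow.keras import layers

def build_lstm_autoencoder(timesteps: int, n_features: int, latent_dim: int = 16) -> keras.Model:
    inputs = keras.Input(shape=(timesteps, n_features))
    # Encoder: compress the whole sequence into a fixed-length vector.
    encoded = layers.LSTM(latent_dim)(inputs)
    # Decoder: repeat the vector across time and reconstruct the sequence.
    repeated = layers.RepeatVector(timesteps)(encoded)
    decoded = layers.LSTM(latent_dim, return_sequences=True)(repeated)
    outputs = layers.TimeDistributed(layers.Dense(n_features))(decoded)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam", loss="mae")
    return model

# Hypothetical usage: the model is trained to reproduce its own (normal) input.
# model = build_lstm_autoencoder(timesteps=30, n_features=4)
# model.fit(X_train, X_train, epochs=50, batch_size=32, validation_split=0.1)
```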
Why we used an LSTM Autoencoder:
We used an LSTM Autoencoder because our time series is sequential data, and the LSTM captures the temporal dependencies of the data by introducing memory. Specifically, the LSTM can capture long-term temporal interactions and correlations between the variables in the input sequence, which is essential in this scenario since these relationships are time dependent and determine the state of the machine.
I believe the predictions are good because the model takes into account the behaviour of the signals over time, and the error decreases consistently with increasing epochs.
Based on our results, the LSTM Autoencoder is a robust model for detecting anomalies in time-series data because it takes the temporal dependencies of the input sequence into account.
More data is required for each individual machine to justify these findings: the more data available for a single machine, the more robust the model becomes at predicting the times of anomalies for that machine.