Detecting temporal anomalies in time series data utilizing the matrix profile
Beattie, Alexander (2022)
Diplomityö
Beattie, Alexander
2022
School of Energy Systems, Konetekniikka
Kaikki oikeudet pidätetään.
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022051134353
https://urn.fi/URN:NBN:fi-fe2022051134353
Tiivistelmä
This work presents a review of anomaly detection algorithms and libraries and the strengths and weaknesses of the most commonly used benchmarking datasets. With this information, the experimental datasets are selected and a practical implementation of an anomaly detector is created.
The matrix profile algorithm is selected for implementation because of its generalizeable approach for detecting real-time anomalies in streaming time series data. The STUMPY python library implementation of the iterative matrix profile is used for the creation of the detector. A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
Three experiments with significantly different conditions are presented to demonstrate the generalizability and performance of the detector. The experimential datasets used in this study include a hydraulic simulation, power electronic converter, and cyber-security intrusion detection dataset. With simple parameter tuning, the detector provides high accuracy and performance in a variety of difficult circumstances. In the future, the detector can be improved to further increase precision and accuracy in more complex circumstances. Additionally, the detector can be developed for use in a real-time, production system for monitoring and decision making.
The matrix profile algorithm is selected for implementation because of its generalizeable approach for detecting real-time anomalies in streaming time series data. The STUMPY python library implementation of the iterative matrix profile is used for the creation of the detector. A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
Three experiments with significantly different conditions are presented to demonstrate the generalizability and performance of the detector. The experimential datasets used in this study include a hydraulic simulation, power electronic converter, and cyber-security intrusion detection dataset. With simple parameter tuning, the detector provides high accuracy and performance in a variety of difficult circumstances. In the future, the detector can be improved to further increase precision and accuracy in more complex circumstances. Additionally, the detector can be developed for use in a real-time, production system for monitoring and decision making.