Inhalt des Dokuments
Publications
Citation key | ZiehnCHM19 |
---|---|
Author | Ariane Ziehn, Marcela Charfuelan, Holmer Hemsen, Volker Markl |
Year | 2019 |
Address | An Erratum is available below under "link to publication". |
Journal | DARLI_AP Workshop, co-located with EDBT/ICDT 2019 |
Abstract | In this paper we propose a practical study and demonstration of time series similarity search in modern distributed data processing platforms for stream data. After an intensive literature review, we implement a flexible similarity search application in Apache Flink, which includes the most commonly used distance measurements: Euclidean distance and Dynamic Time Warping. For efficient and accurate similarity search we evaluate normalization and pruning techniques developed for single machine processing and demonstrate that they can be adapted and leveraged for those distributed platforms. Our final implementation is capable of monitoring many time series in real-time and parallel. Further, we demonstrate that the number of required parameters can be reduced and optimally derived from data properties. We evaluate our application by comparing its performance with electrocardiogram data on a cluster with several nodes. We reach average response times of less than 0,1 ms for windows of 2 s of data, which allow fast reactions on matching sequences. |
Back [4]
ublikation/Papers/Ziehn_Time_Series_Similarity_Search_D
arli-2019_crv_Erratum.pdf
blications/parameter/maxhilfe/?no_cache=1&tx_sibibt
ex_pi1%5Bdownload_bibtex_uid%5D=2171730&tx_sibibtex
_pi1%5Bcontentelement%5D=tt_content%3A126920
blications/parameter/maxhilfe/
g_data_management_report/parameter/maxhilfe/
Zusatzinformationen / Extras
Quick Access:
Schnellnavigation zur Seite über Nummerneingabe
Auxiliary Functions
Copyright TU Berlin 2008