Abstract
Diverse applications generate a high volume of data around the clock, drawn from log records, sensors, monitoring systems, manufacturing processes, call detail records, blogs, emails, and social media streams. As the volume of data grows rapidly, detecting anomalies in high-volume big data becomes a critical and difficult task because of theoretical (research) and practical (technical) limitations. This paper investigates anomaly detection and provides a global understanding of anomaly concepts from the big data mining perspective. We demonstrate how existing anomaly detection methods can be applied to high volumes of data, with an in-depth treatment of the anomaly concept in streaming data. The key contribution of this study is an attempt to answer the following questions: 1) What is the concept of big data, and what are the big data analytic approaches? 2) What is the relationship between big data and anomaly detection? 3) What are the main characteristics of anomalies in big batch and streaming data? 4) What is the appropriate state-of-the-art infrastructure for processing large-scale batch and streaming data and detecting anomalies in it?
Original language | English |
---|---|
Title of host publication | Proceedings - 20th International Conference on High Performance Computing and Communications, 16th International Conference on Smart City and 4th International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2018 |
Editors | Juan E. Guerrero |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1177-1182 |
Number of pages | 6 |
ISBN (Electronic) | 9781538666142 |
ISBN (Print) | 9781538666159 |
Publication status | Published - 24 Jan 2019 |
Event | 20th International Conference on High Performance Computing and Communications, 16th IEEE International Conference on Smart City and 4th IEEE International Conference on Data Science and Systems - Exeter, United Kingdom. Duration: 28 Jun 2018 → 30 Jun 2018. Conference number: 20/16/4 |
Conference
Conference | 20th International Conference on High Performance Computing and Communications, 16th IEEE International Conference on Smart City and 4th IEEE International Conference on Data Science and Systems |
---|---|
Abbreviated title | HPCC/SmartCity/DSS 2018 |
Country/Territory | United Kingdom |
City | Exeter |
Period | 28/06/18 → 30/06/18 |