LogMine: Fast Pattern Recognition for Log Analytics Hossein Hamooni Biplob Debnath Jianwu Xu University of New Mexico NEC Laboratories America NEC Laboratories America 1 University Blvd 4 Independence Way 4 Independence Way Albuquerque, NM 87131 Princeton, NJ 08540 Princeton, NJ 08540
[email protected] [email protected] [email protected] Hui Zhang Guofei Jiang Abdullah Mueen NEC Laboratories America NEC Laboratories America University of New Mexico 4 Independence Way 4 Independence Way 1 University Blvd Princeton, NJ 08540 Princeton, NJ 08540 Albuquerque, NM 87131
[email protected] [email protected] [email protected] ABSTRACT Keywords Modern engineering incorporates smart technologies in all Log analysis; Pattern recognition; Map-reduce aspects of our lives. Smart technologies are generating ter- abytes of log messages every day to report their status. It 1. INTRODUCTION is crucial to analyze these log messages and present usable The Internet of Things (IoT) enables advanced connec- information (e.g. patterns) to administrators, so that they tivity of computing and embedded devices through internet can manage and monitor these technologies. Patterns mini- infrastructure. Although computers and smartphones are mally represent large groups of log messages and enable the the most common devices in IoT, the number of \things" administrators to do further analysis, such as anomaly de- is expected to grow to 50 billion by 2020 [5]. IoT involves tection and event prediction. Although patterns exist com- machine-to-machine communications (M2M), where it is im- monly in automated log messages, recognizing them in mas- portant to continuously monitor connected machines to de- sive set of log messages from heterogeneous sources without tect any anomaly or bug, and resolve them quickly to mini- any prior information is a significant undertaking.