Change Detection and Notification of Web Pages: a Survey
Change Detection and Notification of Web Pages: A Survey VIJINI MALLAWAARACHCHI, The Australian National University, Australia LAKMAL MEEGAHAPOLA, Idiap Research Institute & École Polytechnique Fédérale de Lausanne (EPFL), Switzerland ROSHAN MADHUSHANKA, University of Moratuwa, Sri Lanka ERANGA HESHAN, University of Moratuwa, Sri Lanka DULANI MEEDENIYA, University of Moratuwa, Sri Lanka SAMPATH JAYARATHNA, Old Dominion University, USA The majority of currently available webpages are dynamic in nature and are changing frequently. New content gets added to webpages, and existing content gets updated or deleted. Hence, people find it useful to be alert for changes in webpages that contain information that is of value to them. In the current context, keeping track of these webpages and getting alerts about different changes have become significantly challenging. Change Detection and Notification (CDN) systems were introduced to automate this monitoring process, and to notify users when changes occur in webpages. This survey classifies and analyzes different aspects of CDN systems and different techniques used for each aspect. Furthermore, the survey highlights the current challenges and areas of improvement present within the field of research. CCS Concepts: • General and reference → Surveys and overviews; • Information systems → Web searching and information discovery; Web applications; Document structure; Similarity measures; Informa- tion extraction; Relevance assessment. Additional Key Words and Phrases: distributed digital collections, scheduling, change detection, change notification, webpages, websites, web search, search engine ACM Reference Format: Vijini Mallawaarachchi, Lakmal Meegahapola, Roshan Madhushanka, Eranga Heshan, Dulani Meedeniya, and Sampath Jayarathna. 2019. Change Detection and Notification of Web Pages: A Survey. ACM Comput. Surv. 53, 1, Article 15 (December 2019), 35 pages.
[Show full text]