Rewind: Leveraging Everyday Mobile Phones for Targeted Assisted Recall
Total Page:16
File Type:pdf, Size:1020Kb
Rewind: Leveraging Everyday Mobile Phones for Targeted Assisted Recall Donnie H. Kim, Nicolai M. Petersen, Mohammad Rahimi, Jeff Burke, Deborh Estrin Center for Embedded Networked Sensing, UCLA {dhjkim, nmp}@cs.ucla.edu, [email protected], [email protected], [email protected] Lenore Arab David Geffen School of Medicine, UCLA [email protected] This paper presents an assisted recall system, Rewind, that employs novative applications that leverage the use of image and location automatic image capture on mobile phones and filtering of images data captured by mobile phones as they move around in the envi- for end-user viewing. The usability of image-based assisted recall ronment with their users. In this paper we document and evaluate systems is limited by the large number of images through which one such application type - assisted recall. the user must navigate. Rewind is a scalable system of everyday mobile phones and supporting web services that we developed to Assisted recall systems support individuals by recording aspects of explore how client- and server-side image processing can be used the environment around them for later playback. They can assist to both lower bandwidth needs and streamline user navigation. It people with healthy memories who want to track additional details, has been in use since August 2007 as part of a pilot study super- those with memory impairments or perceptual problems due to in- vised by an epidemiologist to evaluate its utility for improving re- jury or illness, or in self-improvement where an individual wants to call of dietary intake, as well as other shorter and more exploratory become more conscious of their habits. Additionally, scientists can trials. While the system is designed to accommodate a range of use them as tools to study the impact of individual behaviors for image processing algorithms, in this first prototype we rely on sim- which there are no direct measures suitable for longitudinal study, ple filtering techniques to evaluate our system concept. We present such as foods consumed or numbers of people interacted with [5, performance results for a configuration in which the processing oc- 15, 21]. curs on the server, and then compare this to processing on the mo- bile phones. Simple image processing on the phone can address In contrast to lifeblogging systems such as MyLifeBits [9, 10] and the narrow-band and intermittent upload channels that characterize SenseCam [13], which attempt to collect as much data as possi- cellular infrastructure, while more sophisticated processing tech- ble for future mining, Rewind, our system for image-based assisted niques can be implemented on the server to further reduce the num- recall, supports data collection on specific behaviors or situations. ber of images displayed to users. By targeting recall, we can take advantage of the application’s con- straints in capture, presentation, processing, and upload. In partic- Categories and Subject Descriptors ular, our system (i) reduces burden on the end user by employing J.3 [Life and Medical Sciences]: Health application-specific signal processing to decrease the amount of in- formation presented, (ii) lowers bandwidth through prioritized or selective upload based on this processing (iii) pursues a conserva- General Terms tive, application-specific approach to personal data collection and Design, Experimentation, Human Factor retention, thus reducing privacy risks. Keywords To evaluate its utility in improving dietary recall, Rewind has been Urban Sensing, Urban Computing, Mobile phones, Automatic im- in use since August 2007 as a supplemental tool in an NIH-funded age capture, Image processing, Prioritized upload study on dietary intake. We have also designed it to explore other image-based targeted recall applications, such as the PosterSession- 1. INTRODUCTION Capture Pilot described in Section 4.3. Our use of everyday mobile People increasingly carry mobile phones with them everywhere and phones and web services is motivated by low cost and easy avail- all the time, across cultures, countries, and demographics. These ability for the populations in our specific studies, as well as a desire mobile phones increasingly contain imaging and location capabili- for extensibility to a broad range of applications in the future. ties, (i.e., high resolution cameras and GPS). We are exploring in- The use model is simple. For example, in the dietary intake pilot, study participants launch the Rewind application on their mobile phone before eating, place the phone around their neck on a lan- yard with the camera facing out and down, and then turn it off at the end of the meal. When they visit the study facility on the next day, images captured during their meals will already have been au- tomatically uploaded and tagged with image-quality metrics. Par- ticipants log-in to a secure image viewer to browse the best of their own food images while taking a dietary survey. To be effective for this application, Rewind must sample images at 2. RELATED WORK a high enough sampling rate to capture the “dynamics” of eating, There is significant mobile and pervasive computing research that such as the participant taking additional servings, leaving items on explores wearable image and audio recording devices. Several in- plates that are returned, etc., and often run for an hour or more at vestigate the use of wearable image capture devices for creating a a time several times a day. It must automatically upload and then multimedia diary of one’s life. Typically, there is no a priori focus process the images to select a small representative set for the users on particular situations or behaviors to capture; Rather the goal is to to view during their visit. At the rapid sampling rate needed, a archive the data and, thus, life, in its entirety. The many challenge single user captures too many images for manual investigation. In include: meaningful organization and indexing of the huge image the DietaryRecall Pilot, each user generated an average of more corpus, user interface design, and effective navigation of the infor- than 100 images per meal. mation [4, 8, 11]. Notably, SenseCam [13, 10] explores usage of a wearable device worn on a pendant hung from ones neck and has Based on this and other similar scenarios, we designed Rewind to: a combination of image, accelerometer, infrared and temperature sensors. It captures images either continuously or based on trigger- ing events from its other sensors and images can be subsequently • Automatically capture and cache images at a rate that would transferred to a host PC for archival and visualization [12, 14, 10, not be tolerable if the user had to do it manually, 19]. Kapur et. al. tested the SenseCam to aid autobiographical • Perform both handset- and server-side processing of images memory in patients with severe memory impairment [5]. Subse- to generate simple classifications and quality metrics selected quent research developed tools to navigate through the SenseCam and tuned for the specific application target, images and effectively visualize the captured information [6]. • Prioritize images for upload based on mobile-phone-based Many factors distinguish our work from other assisted recall re- annotations to mitigate the impact of channel congestion on search. First is the usage of the phone itself. By using mobile the rapid availability of representative images to the end user, phones that are ubiquitously available, we enable future deploy- • Support rapid experimentation with different image process- ments and larger scale experimentation. Therefore we have de- ing algorithms as part of an iterative design process with ap- signed our system to support a large number of phones and users. plication experiences. A second distinction is the usage of the information in our system. In our research, the objective is capturing images for improving a • Display images for the user in a private on-line gallery, where specific, targeted, application. For instance, in the dietary experi- they are clustered in time and filtered based on the application- mentation, the objective is to improve the ability of the participants specific annotations, and, to recall their daily food intake. This targeted capture also lends • Enable the user to explicitly select images to be stored or itself to the third party usage of the data such as by a medical re- shared within the target applications’ privacy constraints, while searcher once the subject has reviewed and released data. The se- assuring that all images not marked for sharing are deleted. lective sharing of the image data is critical to individual privacy protection and has implications for both system design and evalua- tion. Contributions. To the best of our knowledge, Rewind is the first targeted assisted recall system using everyday mobile phones. Our Several works in image processing and computer vision literature contributions include: are relevant to the assessment of application-specific quality of the captured images. This includes characterization of both quality • Conceiving, building, and deploying in a “live” medical study and relevance of the content of the images. The main difficulty this type of platform, with configurabality and extensibility in assessment of the image quality is a consistent metric indepen- enabling other pilots that we will also describe. dent of illumination changes. Many of the image processing tech- niques rely on holistic illumination-independent features such as a • The use of web services and image processing techniques for histogram of coefficients of Discrete Cosine Transform (DCT) or server-side operations. Harr wavelet transform to assess the quality of the images [18, 23]. • The use of a password protected, user-data workspace model There is also a multi-decade research thread in content-based as- for staging of potentially-private data until it is reviewed and sessment of image information. These research efforts have roots released or deleted by the user.