Contributions to Lifelogging Protection in Streaming Environments
Total Page:16
File Type:pdf, Size:1020Kb
CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems ADVERTIMENT. L'accés als continguts d'aquesta tesi doctoral i la seva utilització ha de respectar els drets de la persona autora. Pot ser utilitzada per a consulta o estudi personal, així com en activitats o materials d'investigació i docència en els termes establerts a l'art. 32 del Text Refós de la Llei de Propietat Intel·lectual (RDL 1/1996). Per altres utilitzacions es requereix l'autorització prèvia i expressa de la persona autora. En qualsevol cas, en la utilització dels seus continguts caldrà indicar de forma clara el nom i cognoms de la persona autora i el títol de la tesi doctoral. No s'autoritza la seva reproducció o altres formes d'explotació efectuades amb finalitats de lucre ni la seva comunicació pública des d'un lloc aliè al servei TDX. Tampoc s'autoritza la presentació del seu contingut en una finestra o marc aliè a TDX (framing). Aquesta reserva de drets afecta tant als continguts de la tesi com als seus resums i índexs. ADVERTENCIA. El acceso a los contenidos de esta tesis doctoral y su utilización debe respetar los derechos de la persona autora. Puede ser utilizada para consulta o estudio personal, así como en actividades o materiales de investigación y docencia en los términos establecidos en el art. 32 del Texto Refundido de la Ley de Propiedad Intelectual (RDL 1/1996). Para otros usos se requiere la autorización previa y expresa de la persona autora. En cualquier caso, en la utilización de sus contenidos se deberá indicar de forma clara el nombre y apellidos de la persona autora y el título de la tesis doctoral. No se autoriza su reproducción u otras formas de explotación efectuadas con fines lucrativos ni su comunicación pública desde un sitio ajeno al servicio TDR. Tampoco se autoriza la presentación de su contenido en una ventana o marco ajeno a TDR (framing). Esta reserva de derechos afecta tanto al contenido de la tesis como a sus resúmenes e índices. WARNING. Access to the contents of this doctoral thesis and its use must respect the rights of the author. It can be used for reference or private study, as well as research and learning activities or materials in the terms established by the 32nd article of the Spanish Consolidated Copyright Act (RDL 1/1996). Express and previous authorization of the author is required for any other uses. In any case, when using its content, full name of the author and title of the thesis must be clearly indicated. Reproduction or other forms of for profit use or public communication from outside TDX service is not allowed. Presentation of its content in a window or frame external to TDX (framing) is not authorized either. These rights affect both the content of the thesis and its abstracts and indexes. UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems Contributions to Lifelogging Protection In Streaming Environments DAVID PÀMIES-ESTREMS DOCTORAL THESIS 2020 UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems David P`amies-Estrems CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS DOCTORAL THESIS supervised by Dr. Jordi Castell`a-Roca and Dr. Joaquin Garcia-Alfaro Departament d'Enginyeria Inform`aticai Matem`atiques Tarragona, Catalonia, Spain May 2020 UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems FEM CONSTAR que aquest treball, titulat "Contributions to Lifelog- ging Protection In Streaming Environments", que presenta en David P`amies-Estremsper a l’obtenci´odel t´ıtol de Doctor, ha estat real- itzat sota la nostra direcci´oal Departament d'Enginyeria Inform`atica i Matem`atiquesd'aquesta universitat. HACEMOS CONSTAR que el presente trabajo, titulado "Contribu- tions to Lifelogging Protection In Streaming Environments", que pre- senta David P`amies-Estrems para la obtenci´ondel t´ıtulode Doctor, ha sido realizado bajo nuestra direcci´onen el Departamento de Ingenier´ıa Inform´aticay Matem´aticasde esta universidad. WE STATE that the present study, entitled "Contributions to Life- logging Protection In Streaming Environments", presented by David P`amies-Estremsfor the award of the degree of Doctor, has been carried out under our supervision at the Department of Computer Engineering and Mathematics of this university. Tarragona, May, 2020 Doctoral Thesis Supervisor Doctoral Thesis Supervisor Jordi Castell`a-Roca Joaquin Garcia-Alfaro UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems \What are the roots that clutch, what branches grow Out of this stony rubbish? Son of man, You cannot say, or guess, for you know only A heap of broken images, where the sun beats, And the dead tree gives no shelter, the cricket no relief, And the dry stone no sound of water. Only There is shadow under this red rock, (Come in under the shadow of this red rock), And I will show you something different from either Your shadow at morning striding behind you Or your shadow at evening rising to meet you; I will show you fear in a handful of dust." T.S. Eliot UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems Acknowledgements To begin, I would especially like to thank the work done by my two advisors, Jordi Castell`a-Roca and Joaquin Garcia-Alfaro. As can be deduced from their career, they are top-level experts in this field of knowledge. Without them, this experience would not have been so interesting. I can not help but mention the members of CRISES research group of the Uni- versitat Rovira i Virgili and his head Josep Domingo-Ferrer. The contributions of Alexandre Viejo and David S´anchez. The communtication skills and the goodwill of Carles Angl´es.The diligence of Jes´usManj´onand the support of Jordi Ribes. In other URV departments I have also found people who have helped me during those years. In particular I would like to thank the time and efforts that A¨ıda Valls has dedicated to me on several occasions, as well as the advice of Jordi Duch, Pedro Garcia, Carles Barber`aand Agust´ıSolanas along this path. Je voudrais ´egalement exprimer ma gratitude `aT´el´ecom SudParis, partie de l'Institut Polytechnique de Paris, en particulier au D´epartement R´eseauxet Ser- vices de T´el´ecomet `ason responsable, Herv´eDebar, qui m'ont tr`esgentiment accueilli pour des mois passionnants. Je voudrais souligner en particulier le sou- tien de Maryline Laurent et Nesrine Kaaniche pour leur collaboration dans la recherche. Antoine Bernard et Houda Ibn-Khedher pour leur hospitalit´einfinie. Les conseils pr´ecieuxet les discussions fructueuses avec Gr´egoryBlanc, Jose Ru- bio, Anna Guinet, Ender Alvarez, Fabien Charmet, Mohammed El Barbori et Mustafizur Shahid. J'aimerais ´egalement remercier Sandra Gchweinder pour son aide dans la gestion. Et surtout pour les bons moments partag´esavec Mariana Segovia, Thomas Tournaire et Vu Thai Duong. C'´etaitun privil`egede faire partie d'une des meilleures ´ecolesd'ing´enieursfran¸caises.Merci beaucoup ! I must also acknowledge the various organizations that partially supported this work, including the Department of Computer Science and Mathematics of the Universitat Rovira i Virgili, the CRISES Research Group, the UNESCO Chair in Data Privacy and CYBERCAT-Center for Cybersecurity Research of Catalonia. Last but not least, I must express my gratitude to my friends and family. Joan, Andrew, Raida, Tere, Ramon, Maite, Arlet and Laura. Your support and ability to collect blackberries were especially useful to survive along the way. vii UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems Copyright c 2020 { David P`amies-Estrems UNIVERSITAT ROVIRA I VIRGILI CONTRIBUTIONS TO LIFELOGGING PROTECTION IN STREAMING ENVIRONMENTS David Pàmies Estrems UNIVERSITAT ROVIRA I VIRGILI Abstract Escola T`ecnicaSuperior d'Enginyeria Departament d'Enginyeria Inform`aticai Matem`atiques Doctor of Philosophy by David P`amies-Estrems Every day, more than five billion people generate some kind of data over the Internet. As a tool for accessing that information, we need to use search services, either in the form of Web Search Engines or through Personal Assistants. On each interaction with them, our record of actions via logs, is used to offer a more useful experience. For companies, logs are also very valuable since they offer a way to monetize the service. Monetization is achieved by selling data to third parties, however query logs could potentially expose sensitive user information: identifiers, sensitive data from users (such as diseases, sexual tendencies, religious beliefs) or be used for what is called "life-logging": a continuous record of one's daily activities. Current regulations oblige companies to protect this personal information. Protection systems for closed data sets have previously been proposed, most of them working with atomic files or structured data. Unfortunately, those systems do not fit when used in the growing real-time unstructured data environment posed by Internet services. This thesis aims to design techniques to protect the user's sensitive information in a non-structured real-time streaming environment, guaranteeing a trade-off between data utility and protection. In this regard, three proposals have been made in effi- cient log protection.