Nucleosome Dynamics and Analysis in Breast Cancer Cells
Total Page:16
File Type:pdf, Size:1020Kb
Nucleosome dynamics and analysis in breast cancer cells Andy Pohl Tesi Doctoral UPF / Any 2014 Director de la tesi Dr. Miguel Beato Department Gene Regulation, Stem Cells, and Cancer Department of the Centre for Genomic Regulation (CRG) Agraïments I am very grateful for all of the help I have received during the PhD. First and foremost, I must acknowledge my supervisor, Miguel Beato. In my time in the lab, he has remained very accessible, even when he had more responsibili- ties as director of the institute. Unlike perhaps many supervisors, he gave me the opportunities to explore almost any idea I had. Not all of my ideas were the best, and he knew that, but he gave me the freedom to try and fail (and fail frequently) on my own. I must acknowledge Roderic Guigó. I perhaps did not take advantage of Roderic’s extensive experience as best as I could have, but being a part of his lab in general helped me stay current with bioin- formatic topics. In addition to that, he funded my final year of study, which has been the most fruitful. In addition to my supervisors, my thesis commit- tee included Eduardo Eyras and Fátima Gebauer (who replaced Raul Méndez in 2011), who gave me very good advice even when discussing a topic outside of their specialty. Beyond my formal supervision, I have to credit all of my labmates who have made possible my transformation from software engineer to capa- ble experimentalist. This starts with Roni Wright. She taught me all the ba- sics of culturing cells, and performing basic assays like Western blots or DNA gels, and also gene cloning techniques and transfection/infection of our cells. Jofre Font, Michael Wierer, Diana Reyes, Marija Kundakovic, Guille Vicent, Flopy Ogara, Priyanka Sharma, Alessandra Ciociola, Alejandro Lagreca, Silvina Nacht, and François Le Dily also helped me a lot in that respect. I learned extensive details about how to perform chromatin immunoprecipita- tions (ChIPs) from Cecilia Ballare. Other people that have benefitted my lab education through informal discussion (in no particular order) include João Tavanez, Bernhard Pätzold, Anna Corrionero, Camilla Ianonne, Gaetano Verde, Christos Gekas, Marina Garcia, Laura Gaveglia, Martin Lange, Payal Jain, Malte Beringer, Paula Pisano, Rory Johnson, Sophie Bonnal, Bill Keyes, Cristina Militti, Ramón Tejedor, Lluis Morey, Alessio Bava, Sergio Barberan, Luisa Vigevani, Luciano Di Croce, Juan Valcarcel, Karthik Arumugam, João Frade, Esteban Rozen, Adam Klosin, Alex Santanach, Elena Martin, Jordi iii Hernández, Bruno Di Stefano, Laure Weill, Timo Zimmermann, Ben Leh- ner, Thomas Graf, Matteo Pecoraro, and Elias Bechara. Also I must thank people that contributed a wealth of information towards furthering my bio- informatic skills. This includes Giancarlo Castellano, Daniel Soronellas, João Curado, Christoforos Nikolaou, Debayan Datta, Hagen Tilgner, Colin Kingswood, Sonja Althammer, Julien Lagarde, Sarah Bonnin, Sarah Djebali, Emilio Palumbo, Brian Raney, Jim Kent, Anna Vlasova, Tobias Warnecke, David Haussler, Guillaume Filion, Alessandra Breschi, Barbara Uszczynska, Marco Mariotti, Dmitri Pervouchine, Raik Grünberg, Almer van der Sloot, Peter Vanhee, Gireesh Bogu, Panagiotis Papasaikas, Micha Sammeth, Cedrik Magis, Paolo Ribeca, Francisco Câmara, Ivan Junier, Davide Baù, Arnau Bria, Rodny Hernandez, Gabriel Gonzalez, Oscar Gonzalez, Judith Flo, Cedric Notredamme, Didac Santesmasses. I will also acknowledge Aitor Busquets, Isma De Mingo, Marc Gonzalez, Núria Janè, Romina Garrido, Imma Fale- ro, Isabel Jurado, Diego Mellibovsky, Zoe Barbarà, Sharon Bel, Rafa Bayona, Carla Senozaín, Lucia Marucci, Lucia Russo, Giacoma Simonte, Francesco Sottile, Francesco Aulicino, Daniela Sanges, Pia Cosma, Livia Caizzi, Fran- cesca Rapino, Mekayla Popoff, Valeria Di Giacomo, Valeria Giangarra, Alba Mas, Kadri Reis, Jia-Ming Chang, Anne Camapgna, Eric Verschueren, Ma- ria Aurelia Ricci, Marc Friedländer, Tony Ferrar, Javier Delgado, Jae-Seong Yang, Solip Park, Mirko Francesconi, Krisztina Arató, Ilda Theka, Esther Lizano, Verónica Llorens, Emilia Szostak, Maria Lluch, Sophia Teichmann, Marcos Perez, Roxana Tovar, Chiara Di Vona, Luca Cozzuto, Jean-François Taly, Magalí Bartomeus, Eli Mateu, Elena Miñones, Carolina Gallo, Carolina Segura, Jo Aigner, Kiana Toufighi, Amy Curwin, James Cotterell, Alexan- dra Grippa, Luigi Aloia, Luigi Ombrato, Karl Wotton, Umberto Di Vicino, Eric Kallin, Julia Röwenstrunk, Marcus Buschbeck, Olivera Vujatovic, Sergi Repullo, Tian Tian, Valentina Schiavone, Silvina Catuara, Carla Bello, Tom Starke, Villy Michaki, Wassim Altarche, and Salvatore Cappadona in general for what I feel is a positive impact they each make on the CRG and why I have enjoyed working here. Many of those that have helped me professionally, have also been good friends to me as well. The CRG and Barcelona can take credit for pro- iv viding a wonderful environment, where it is easy to make friends and enjoy life. To that extent I will give special credit to my yoga instructor Alessa Ben- naton and all of my spanish teachers in the past, most recently Alexandra Albuera. The city of Barcelona has given me back my love for running and I imagine I will find myself here again for more marathons. My family is very important to me and they have supported all my life decisions, including moving to Europe and going back to school. This thesis is dedicated to the memory of my great-uncle Garland Petefish (1917-2008). Thank you everyone. Gracias a todos. Gràcies a tothom. v vi Abstract Genome-wide analysis of the nucleosome positioning and histone H1 iso- form content of the T47D breast cancer cell line has found a number of observations, namely that with a gentle digestion of microccocal nuclease (MNase), a nucleosome is visible just upstream of the transcription start site, in the region known as the “nucleosome-free region” (NFR). H1 isoforms bind to chromatin mainly in a redundant manner, but H1.2 and H1.3 show some specificity while H1.5 increases its binding dramatically after a proges- terone stimulus. In the course of these studies, a general-purpose software package was developed for the manipulation and analysis of bigWig files, a data format for storing continuous signal data assigned to genome coordi- nates. Resum En el meu estudi genòmic sobre el posicionament de nucleosomes i sobre el- contingut de les isoformes de la histona H1 en cèl·lules de càncer de mama T47D he dut a terme una sèrie d'observacions. Específicament he trobat que amb una digestió suau amb nucleasa micrococcal, es pot identificar un nucleosoma just abans del lloc d'inici de transcripció, en la regió coneguda com a "regió lliure de nucleosomes". També he vist que les diferents isoformes somàtiques de la histona H1 (H1.0-H1.5, H1x) s'uneixen a la cromatina de manera redundant, però que la H1.2 i la H1.3 presenten certa especifici- tat, mentre que la H1.5 mostra un augment de la unió generalitzat després d'estimular les cèl·lules amb progesterona. En el decurs de la meva recerca, he desenvolupat un programari general per la manipulació i l'anàlisi d'arxius amb format bigWig, un format per a l'emmagetzematge de dades de senyals continus al llarg de les coordenades del genoma. vii viii Prefaci This thesis touches on several different aspects of chromatin biology, in three chapters of results. After an introduction to chromatin biology and gene regulation and the kind of experiments that are done, I describe our work positioning nucleosomes in T47D breast cancer cells. This work is important because it draws attention to nucleosomes found in the badly named “nucle- osome free regions” near gene transcription start sites. The second results chapter describes our work studying the genome-wide binding of linker his- tone H1 variants to chromatin. This study is notable because seven somatic variants of H1 exist in humans, and it is not understood why. Our study is also the first that analyzes the redistribution of H1 after a hormone stimulus to the cells. A side effect of preparing these results was the creation of a utility for genomic continuous-valued signal data, whose function was originally split across a number of smaller more specific-use tools. This software: bwtool, has been released as open source to the bioinformatics community and its article recently published is attached as the third results chapter. Finally, I close with some additional discussion about how emerging trends and technologies may impact future directions of the field. ix x Sumari Acknowledgements .................................................................................................. iii Abstract ...................................................................................................................... vii Preface ..........................................................................................................................ix Introduction ................................................................................................................. 1 Objectives ................................................................................................................... 33 Results I: ..................................................................................................................... 35 Results II:................................................................................................................... 67 Results III: ................................................................................................................103 Conclusions ............................................................................................................