bioRxiv preprint doi: https://doi.org/10.1101/2021.05.01.442102; this version posted May 1, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. 1 INfrastructure for a PHAge REference Database: Identification of large-scale biases in the 2 current collection of phage genomes 3 4 Ryan Cook1, Nathan Brown2, Tamsin Redgwell3, Branko Rihtman4, Megan Barnes2, Martha 5 Clokie2, Dov J. Stekel5, Jon Hobman5, Michael A. Jones1, Andrew Millard2* 6 7 1 School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington 8 Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK 9 2 Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, 10 Leicestershire, LE1 7RH, UK 11 3 COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte 12 Hospital, University of Copenhagen, Copenhagen, Denmark 13 4 University of Warwick, School of Life Sciences, Coventry, UK 14 5 School of Biosciences, University of Nottingham, Sutton Bonington Campus, College Road, 15 Loughborough, Leicestershire, LE12 5RD, 16 17 Corresponding author:
[email protected] bioRxiv preprint doi: https://doi.org/10.1101/2021.05.01.442102; this version posted May 1, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license.