bioRxiv preprint doi: https://doi.org/10.1101/677575; this version posted June 25, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Genome-wide characterization of satellite DNA arrays in a complex plant genome using nanopore reads by Tihana Vondrak1,2, Laura Ávila Robledillo1,2, Petr Novák1, Andrea Koblížková1, Pavel Neumann1 and Jiří Macas1,* (1) Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ-37005, Czech Republic (2) University of South Bohemia, Faculty of Science, České Budějovice, Czech Republic * Corresponding author e-mail:
[email protected] phone: +420 387775516 1 bioRxiv preprint doi: https://doi.org/10.1101/677575; this version posted June 25, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Abstract 2 Background: Amplification of monomer sequences into long contiguous arrays is the main 3 feature distinguishing satellite DNA from other tandem repeats, yet it is also the main obstacle in 4 its investigation because these arrays are in principle difficult to assemble. Here we explore an 5 alternative, assembly-free approach that utilizes ultra-long Oxford Nanopore reads to infer the 6 length distribution of satellite repeat arrays, their association with other repeats and the 7 prevailing sequence periodicities.