Molecular Evolution of the non-coding Eosinophil Granule Ontogeny Transcript EGOT – Supplement –

Dominic Rose a,h and Peter F. Stadler b,c,d,e,f,g aBioinformatics Group, Department of Computer Science, University of Freiburg, Georges-K¨ohler-Allee 106, D-79110 Freiburg, Germany. bBioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, H¨artelstraße 16-18, D-04107 Leipzig, Germany. cMax Planck Institute for Mathematics in the Sciences, Inselstrasse 22, D-04103 Leipzig, Germany dFraunhofer Institut f¨urZelltherapie und Immunologie – IZI Perlickstraße 1, D-04103 Leipzig, Germany eDepartment of Theoretical Chemistry, University of Vienna, W¨ahringerstraße 17, A-1090 Wien, Austria f Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønneg˚ardsvej 3, DK-1870 Frederiksberg C, Denmark gSanta Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501, USA hCorresponding author: [email protected]

Supplement 25 August 2011 Table 1 Approximate genomic locations of EGO-B orthologs. The coordinates refer to the unspliced genomic regions of EGO-B. Recall that some entries are based on draft assemblies (GeneScaffolds). These genomes contain the EGO-B but the respective coordinates are preliminary. In case of assembly problems (e.g. the gene is covered by different scaffolds), no genomic coordinates are given.

Species Assembly Chr. 5’ EGO-B 3’ EGO-B ± size [nt]

Homo sapiens hg19 chr3 4790878 4793274 - 2397 Pan troglodytes panTro2 chr3 4878075 4880473 - 2399 Gorilla gorilla gorGor3 chr3 4902164 4904567 - 2404 Pongo pygmaeus ponAbe2 chr3 65374639 65377039 - 2401 Macaca mulatta rheMac2 chr2 56276017 56278426 + 2410 Papio hamadryas Pham 1.0 Contig1259 Contig623173 26345 28758 + 2413 Callithrix jacchus calJac3 chr15 56602911 56605332 - 2422 Tarsius syrichta tarSyr1 GeneScaffold 4896 162358 164326 - 1969 Microcebus murinus micMur1 - - - - - Otolemur garnettii BUSHBABY1 GeneScaffold 2768 553545 555837 - 2293 Tupaia belangeri tupBel1 scaffold 127316 516 2657 + 2142 Mus musculus mm9 chr6 108404678 108407558 - 2881 Rattus norvegicus rn4 chr4 143936406 143939404 - 2999 Dipodomys ordii dipOrd1 GeneScaffold 6600 155406 158566 - 3160 Cavia porcellus cavPor3 scaffold 16 32052549 32055063 - 2514 Spermophilus tridecemlineatus SQUIRREL GeneScaffold 3331 244508 246869 - 2361 Oryctolagus cuniculus oryCun2 GL018703 3454423 3456525 2102 Tursiops truncatus turTru1 GeneScaffold 1935 210524 212948 - 2425 Bos taurus bosTau4 chr22 22291950 22294301 + 2352 Equus caballus equCab2 chr16 11378820 11381084 + 2265 Felis catus felCat4 A2 55998823 56001116 - 2294 Ailuropoda melanoleuca ailMel1 GL192717.1 421344 423702 - 2359 Canis familiaris canFam2 chr20 15833826 15836114 + 2289 Pteropus vampyrus pteVam1 GeneScaffold 2203 226110 228253 - 2143 Loxodonta africana loxAfr3 chr12 42811986 42814765 - 2780 Procavia capensis proCap1 GeneScaffold 4371 187873 197725 + 9853 Echinops telfairi TENREC GeneScaffold 5028 354037 357269 + 3233 Dasypus novemcinctus dasNov2 GeneScaffold 4264 285144 287278 - 2135 Choloepus hoffmanni choHof1 GeneScaffold 4676 145093 147373 + 2281 Monodelphis domestica monDom5 chr6 236476850 236479951 - 3102 Ornithorhynchus anatinus ornAna1 X1 44628568 44640839 - 12271 Gallus gallus galGal3 chr12 19134732 19138187 - 3455

2 5'exon 3'exon

1.0 muscle 1.0 muscle multiz multiz clustalw clustalw 0.8 0.8 0.6 0.6 0.4 0.4 phastCons score phastCons score 0.2 0.2 0.0 0.0

0 100 200 300 400 500 600 0 200 400 600 800

genomic position, 5'exon genomic position, 3'exon

Fig. 1. Sequence conservation of EGO-B as indicated by the phastCons program. We have com- puted the phastCons scores for three different alignment approaches using the vertebrate model available at the UCSC Genome Browser: (1) our own muscle alignment, (2) the pre-computed 46-way vertebrate multiz alignment from the UCSC browser, and (3) a clustalw alignment based on our set of orthologs. The peaks are fairly similar, only clustalw slightly differs due to a higher alignment error rate resulting from the lack of consistency transformation or similar alignment refinement/improvement steps. Applied phastCons parameters: –transitions 0.01,0.01 –rho 0.4.

length of 5'exon

manual 800 multiz

600

400 nucleotides 200

0 Rat Cat Dog Cow Sloth K.Rat Hyrax Horse Panda Rabbit Chimp Gorilla Tarsier Tenrec Mouse Elefant Rhesus Dolphin Squirrel Baboon Chicken Platypus Megabat M.Lemur Armadillo Opossum Bushbaby Marmoset GuineaPig Orangutan TreeShrew

Fig. 2. Differences in aligned sequence data between our approach and pre-computed UCSC multiz alignments. We plot the sequence portion of the 5’exon of EGO-B for different species and align- ment approaches. There is a large overlap but also substantial differences between the two alignment ap- proaches. An obvious drawback of pre-computed alignments is missing data. For example, the UCSC align- ments do not contain the ortholog of panda because the assembly was not ready at the time the alignments were generated. More strikingly, non-human orthologs are only partially included in the UCSC alignments, since it is a reference (human) based approach. Regions that would cause larger gaps in human, such as mouse or rat which exhibit various exonic insertions, are only partially included in the final alignment. However, partial sequences are crucial for any subsequent analysis relying on valid alignments, i.e. RNA secondary structure prediction.

3 external external external internal

2 kb Scale 4771000 4770000 4769000 4768000 4767000 4766000 :chr3 ENCODE Enhancer- and Promoter-Associated Histone Marks HUVEC (H3K27Ac)

HUVEC (H3K4Me1) RefSeq EGOT ENCODE Digital DNaseI Hypersensitivity Clusters DNase Clusters ENCODE UW Digital DNaseI Raw Signal - 1st (in GM12878 cells) GM12878 Raw

ENCODE UW Digital DNaseI Raw Signal - 1st (in K562 cells) K562 Raw 1

ENCODE UW Digital DNaseI Raw Signal - 1st (in HEEpiC cells)

HEEpiC Raw 1

ENCODE UW Digital DNaseI Raw Signal - 1st (in HRE cells)

HRE Raw 1

Fig. 3. EGOT promoter regions. ENCODE data suggest four possible promoter regions for EGOT (marked by the four arrows). Digital DNase1 hypersensitivity clusters indicate three promoter candidate regions upstream of EGOT. On the other hand, histone marks suggest an internal promoter at the 5’exon of EGO-B. The figure contains only exemplary cell lines. However, the depicted signal peaks, especially the the DNase1 hypersensitivity peaks, are consistently present in numerous cell lines.

1 2 3 4 5 6

RNAz1 Horse

A U

U RNAz2 chimp

C

U U A

U

G

G

U A

C

A A U C

U G

A U

A U

U

G A

C G A RNAz2 human C

C U

G C G

C C G U U

G U U G RNAz2 orangutan

A

A G U

U A U A G

A A A G

U U

A C

U G A

C U G U C U

A C A

C UU A G A A G

G A G

U U UU A C U G A U C A U

A G C

C C G U A A G A C A C A U U U C A

C U GU G U G G C

A G C C G U A G G A C A

G A A

G C A G U

G U A A

A A G C C U

C C G G C U A G G A C U G G

U A U G A G

C C A A C C A C A U U A C A

U C C A A C

G A U A G A C C A A A C G A G

U U G G U

A A A G U

G A

G C G U A A C C A

U A A U A G C

A U A C C A U A U G G

C U C A G C A U A A A A C A

G A A

C G U A A A A G U A A

A A U

A U G A U U

U A

U A U C A A U U

G U U C A A

G A U G

A C

C C A A

A A

C G U U C U U G

U A U G C

A C C

A G A A G C A

A A U U A

CC U

U U G A C C U

C A

A G G U G A A C

U U A C A

A C C

G G A A A U G G U C C A A G G

G G A C

G C C G G G G

U C A A A G

U G G G G C A U

C U G C G A G U

G C A

C U C C U A A U G A A U C C U U U U C U U U A G

U A G A G G A U A G

A U C A G C A G

C G G

U G

G A C U G G

C U A

A G A A G C C C

C C U U G

G A A G G U A C

G A A G A A G C

U A A G A G G

G G U A A G G A A U C U C A C GCA A A A

U A G C A

C A C

C U G G C A G

C C G A

U

C G A C

C U G U U G G C A

G U U A G G A

G C

U C A G U C A

C U C U U A A U U G

C A U G A A

G G A U

U A U U U

A C A A A A

G G G A U C A G G U U U G U G G C

A C U A A U A U A A U A

G G G

A U C

C A A U

U G A C A C C G G A A G U A G U U G

G G A G G G

C U A G A U U A

G A A U A A A C A

U U A G U A C G G Human G A A U G U C C C G C U C G U A

G AAG G U

U C C A G U

A G U G G U G G C C G A A G RNAz2 rhesus A G A A G C A G U A

G U G C A U C A C

C G C G A

G G A

A A C U G RNAz1 Cat

C A A

A G A U A C

G C G U G G C U G U A U

C A A G U C

U U G A

A G U C U A

G C A U C A G A A

G A G

G

G U G A G

G G U C A RNAz1 Panda

U U C A U C A

C U G A C

C C G U G U G

A CU C C A G U U G U G U G C

C U A C

C G U

A G A U A C U G C A U A U C G G G A G A

A U U U A U G C

U A U C A A G G U A U G C U C U G

U A U G A A C G RNAz1 Dog G

U C C A G C A

A C A A U A A A U

G A G G

G A A A A A G A

U C A G A A RNAz2 marmoset G A A A C C C

G C A U U C

C A

G U A A G A

G

A A

A

A

U

U A C U

U A C A C

A

U A A C A

C A RNAz1 Elefant

C G C

G

G

U A A

U

C G C

G

A U C G

A

A U

G C U

U U

C G A

C A A U A G C

A C G C A C U G G C U RNAz1 Marmoset U C A

G U U U

C A G U G

A C U A U U

G A U U G A

U G C G U A C A U A A C U C G C G U U A A A U G A U C G U A A C G C G G U U C C U G G A A G U A G G C U G U A C A G U A A A RNAz2 elefant A C A A G A U G G A G RNAz1 Rhesus U U A U A U C G C G U A A G G C G U U G C A A U U A U U A C U C C G C G A G U U U G A A A A A U U G U G G G U U A G A A C U A C G A C U A U G G A U C A U C U A C C A C C G U G C C C U C U C G U G A U A A A G U A A G G A U A G A G G G G A A U U U G A G A C

U C C A C G

U A C A A A C

A C A C C C C A A A A A C A A A C A A A U A C

U G G G A U U A

A A G

C U G G A A

C A U A

A G U

C U A A G A

A C U U A U

A U A U

U A A

A A C U G G A C

G U C C A U A A A C A A G A G A G

G U C C A A

A G U

U G

G A A U A

C C A G G U G U A

U U C U A C A U

G C A A C U

C G U C U A

C C U G C A

A AA G U G U

A

A U G G U

G A A C

U C C C C

A A G

C C A A A A C C

G C A AG G G G U C C A C A

A U G U G G

A U C G U C

C A C G C A A A G G G

C U G A A G C C U U A

U U

G U U C U U A C U G C G

G G G C U G A U

A U

A G

G U G

G A A G C G G C G A G A U U G

A C A U G C G C U U G G A U U A A G A G U

A G A A G C

G A G

U G

C U

A C G C

A A U

U A C A G A G RNAz1 Orangutan G U GC A

C A A C C C GC A C A U U A U G G C

G A A A G A C U A A A G A A G A A C U A G A

U C U G A A

U C A

G U

C U A

C G A U G A A A A G C

U U

C C U A A

A C U C G C A A A C C G U G G A

C A A A A A G G U U U A G

A U C G

C A U C A

C U C A A C

U U G

A G A C

A A C A G C A G G A G C U U G G U C A A A U C C A G C U U C A G G U G C U U U G G A U G C U C A G G A G C G C G G G G C U A A G U U A C U G G C Chimp G G A A G G A G U U G C A C A A C U G A C A C C C U G A C G U C U U U G C U A C A A G G C G U C A G A U G C A G G G A U A G G U C C C G C C U G G U G A A G U G U U G G A G A U A U A A A C C C G A U A C G C A G G G A G G G C U C U G U GG C A U U U RNAz2 dog G C A G G U C C A G U C A A G C G A A U U A A A C A C G U U A A G C A U A A A G G G U A U A G C U G U U C C C G

C C A G A A U C U A C A G A C G A U A A A G RNAz1 Human U U U U A A U U G A A C A G U C U A U G G A C U G A C A G A U C G G U G U A G G C G C U

A G U G C A

U A U A A A G C

U G U

A A A U G U

C G A U U U A A A A A G

C G A A U

C C

A U C A

G C G U

C G A U

A U U

A

A U

G C A RNAz1 Chimp

U C RNAz2 cat G C

U C G

G U

G U A C

C G G

A C

C U U G U C U A

U G U U U

C C

U A C

G U

U G A

C A G G A

G U C A A

A

G A A A A A C C A

U A U G G C U U A

A A C U C G U A C U A G G U U C U U G C G G A G A G G U A U G U C C A C A

G A U A A A C G U C G

G G A A A A U G U U C G G A A C C A C U A C A

U A U C U A G G G G A U G C G C G C A U U A U G C U U G A A G A U C A A G A C

U C A G A U U A A C C A C RNAz6 Marmoset A G U

U C A U A U U C G G

C U A U U A

C C

A G

U A A U A U A A C C C U U UU A C U A A U A C A C G G A A A

C G U A U G G A U G A G A U C G

U A C U A G A C C A A G C U G U C G A A U

C G U G U G C C A A G U A A U C G A G G G C A C A A

G A A C G U A A A

G C A U A A

A A C G C A

U C

U A A

G G A A A

C G U C G G A G A A

U U

C A A G C U G U G

U G U G A U U

C U A C

C C G U A G

C U C C

A A A A G A A C

A A A U A

U C C U A A G A A U U U U G

A U A C G A

G U G U A

A A C C U C A A U

U U A C G G

U U A

U

U C U G A G G

G G U U G G G A U A G G U

G C G A A A C A A G A

U A A C G G

C A U

A C G U G G U U G G A G C U U G

C U A C A A A C C A C C G U C U A A U

G U U A A U A C A

C G A A A A C A A A C

A C U G

A A G C C

A U G G A G AGA U A A C A U A C A A U U U G A A G G A A A U G A G A A U C U

G A A G U U G A U G A A C G A U C C U C C U C U A C G G C C G U C C U C G A U G G U A C

U A G G A C G A C A G G

A A G U A C U U C U A G C C U U U U C U G C C U C U A A C A U C A A G A G G C U A C A U G G G A U A G

C A G G A G C U G A C A G A

A G U U A A C G G C U A A C U U G G A A G A G A G A G U A A A G A

C Orangutan G A G G

G G A U A C GCA A G G U A C G U G A U C A

G G A C G A A U U A

G A G A A U U A

A A

U U G G C A GG U G G A

G U C C C A C U A A C

U C A A G G A G A C A G C G G A A

G C U G A A G U U G U G U C C U G A G C U G U C A A G U U C C U U A A U G A U C A U A G A G

G A G C C A G G C A G G G C U C G

G U G A U U A U C G G U

G A U U C C A C G

A G U A U A A A G A A A G A A G C G C C A A A A A A C C G G G G U G RNAz6 Rhesus G U GG C G U U G C C A C C A A U A G C C G A G U A G C A A U C G C G U U U A G U U C C C G A G C A C A A C U U U G U U A U A U U U U A U U A U G U U G U G C G U U A U A C G U A

C G U

U A U U RNAz5 Human

A

G C G

C U

C C G C A RNAz6 Orangutan A U G A G C U A

G C U U U

A C A

U G U U C C G A G G A A C A

A G U U A A U C

C U U C U G C C G A G A G A C U A G U U U

C U

G A G G U A U U G C A A A U C G G U C C G C U A A C A G A A G U G A A A A A A U U G C G A U A U G

A C C C G A G C

A C U A AA C A A C U A A C

G C C U A U U A A G G U A A A U A G U A G U A U A

G A U A A G A A A

G A C G

G C A

U A G

G C U A A U A G G G G C U G G A C A A U U U U A U A U U A C U C

G A A U A A

A A U

C U G C A G U G G

A

C G C U C

C C G G C A A U

C A G G U G C

A U A C U

G C C

G C U A G G G A G G G A C

C C A

G G

G C U C C G A G

G A C A C U

C G G A G

G G A U C A A G G A U C

G C A A U

A G A U U U U U G C

C G A G U A C RNAz6 Chimp

A U C A A

C A C U A A A A C A

C G A G U A

A U C U A U G U U C G G G A A

A C C U U U U U C A

C C G C U U U U A G U A A G A G G U

G C A G G

A G U U A C U G U U A

G U C U AG A A C G G A A G A G C A A

C G A A G A A

U A A A

U U C A C

G U G C U C A G G A A A A

G G A G U RNAz5 Chimp C U

AG C A U A A

G C U U A C G G C U G

G A G

C C A A G U G U U U

A U

G U C G G G G G A G U

U UC U U U A A C G C A G G C U C

G U C C U C A G A A A A A C C

CC AA G U U A G U

A U A C C C

U G A

C CA A C C A G G C C A

U G A A A G G A G A G A C C U

U A G G G G

G C C A U U

U U G G U C A C

U A

C G G G A G

C

G C U A G U G

C A C U C C U

G A A C

C A C C U C A A A

A U C A G

A U C U A U A U

C A G

U C U U A C A

C G

A C

A A

A A U U

G U G C U

G C C C A

G G G

U U

A U A

C C

G U C A U

C C G A U A

A G A U A

G U

U C A A G G A

G C U

A

C A

A U

G

A U

A A C

A A A A A

C U C A Rhesus U G A

U

U A

G G A A C C

G G C A C G U A

A C

G CA C U G G U A G

G C A U

A A C G U A

C A G A G C

C G C C G A G A A A A U A A

A U U C C A A

U A A U U G G U U U

U G C

G

C G U G U C C

G A G U A U U A G U

C G U A U G A U G A A

U G C U

U C A A U G

U A G C U

U U U G A

U U

A C U U C G U G C A G U

A G U C G U U C

U A G A A A U U

U U G C G U

G U U U A U A

C G A G U

G A G A U G A G A

C G C C A

G A C G G U A G U G

U G U G C A U U G A G A A

A C G A G RNAz6 Human

U A A UA

G A C G

A A U

A A A U

C G A C A A A C A

U G A A

U A G A A C

A G C G U A U A A A C C G A A G UA G A C A

A G C G U C A C A A A

G C U

G C U G A U G A A C U G U

C G A G A U A A A U A A U C C U G A A RNAz5 Orangutan

A U

G U C

C G C

A U U C A A

G A G A

C A A U G U C U C A A G U U A

U A G U A C A U U A A A G C G A A U A

A A U A A A A A C U A U

U C

G U A

A A U U

U A U

C G U U A G C C U G

U U C

C G U G C U A A U U

G U A A U A U G A

C C G G U

A G

A G A G A A A

C U U G G C U A A A G C A U U C A C U U A G G A A A A A C G U A A A U A G C A U G U

G A G G A A CU G A G A A RNAz4 Elefant

C U A U C A G A C G C C U C C A U G A G G G A U U G C G U

G G G G

C U U A

C U G U A A C A U U A

C G A U G C C A A G C U G U A U C

C C C G U

G G C C U A U A C G C U AG C A C A A

U A C G C C A U A U U C G A CUG G U U U A U U C U G A

G G G C U C C U G C G C U G G U A G G U C G U C G C C A G C G C A A G U U A A G U C U U C G

A G A G U

G A U U G A A U G G A A U A G A C G A UU U U A A A A A C U C U C C A G A G A G G A C G G G G A A C A U A G G A A C A U G A U C U C U U C U G A U G G A U A C C C C G A C

C C U U U U G G C G A A C U A G U A U C U U A U U G A U A G G G U G A G G G C A G A G G A A C G G G G U C C U G C U A G U U A G G RNAz5 Rhesus A G G A A C G U A C U G U U C C A A U U U U C A A U C A A A U A G

G U C G A A C C G GA G G U C C A G G A G G C

U A

G A A C G G G G U A A G A C A G G A U U C A G G C U A U A U A A C A C G G G U U G A U C A C

A G A

U A C U A A C C A A U

C A A A C G A A U U A A U A G

U C A C

A C C A A A A

Marmoset G A U A U A G GA A C G G U A U U GG A A

U G U U A A U G U A GA A A A

C A C G A C

C A C G CAA A C U CG A G A U A G G A G C C A G A

A U C

C U G G A U A C G G U G G U G G U C C C G A A C C A G A U U C A G U G U G G C A A U C C G A C U U A A G A G G U G A U A A A G A A G A A C G G C U A C C C G C U A G G C U

G RNAz4 Cat

A A A U G

G A C U A A C G U U G C U C U U C C A G A

C G G A C A AA C U A A G C G U G G U C A G A G AG G A C G A G G A C U A U G U A C C C G C G A C G G A A U G A C A C U U G A A G G C C G U U C G C A A A U G A G A U G A U A A

A U A G U G G G G A U U U G U U G

A C A A C A A A A G U C

A A U

C C U G A C U A U U G C A A G U G A C U G U U U A A C A U C A A C A U A G C U U C A A A U U A G A

U A

G A G U A U U A A A A

U G C A G U A U U A A G A A G A A C A

G G G U U A C U A U A A A A U

C G U U A

C A U A

C G G U U A U G C U U G C G U C A G A U G G C U G C AU U U C G U A A U U RNAz5 Marmoset C

C U U

A U U A

A G U G C A U G

G U U A RNAz4 Dog

A C U G U A C A A C

G U A U U

A U G C G A U

U C A U G G G U U U G G

C A A A C A U U

A G G C A A A C G C

C A G G G G G U

U C A A G G G G G G

A G G A A C G A C C U

C G U G A G A A

C U G U A G G C A

C U U A A C U U G GUA C U

C G G C

G A A

C A A G U

U G U G A C U

G G A

A A A U G

C U C G C G

A

A A

C A U A A U A A A

G A G C

G U A G C G G C

C A

G G A C G

U G

A

A U U A

U A G C A A G

A C C U C G U C

U G G A

G G

U C C U

G

A G A A U U C

C

G A G

C C

G G G U

G G A C C G G U U U G A G U

U U U G

G A U G

G A A U A

C U C A A G A U A U U A

G C C

G U

U U

G G A C

A C A G

C G A U A U

G

C C A

C

U

C

G G

A

A U C A

G U

A C C U U U U U

U G G U C C G U A

U U U A A C C

G

G U A A C G

G

A A A U

G U A

G G C

G U G G A C C

G

A G U C C U A A C A U U

G C C G C A G A A

A A A

A A

A G U

G U A

U A

U A A G C A U U U A

C U U

C A U G G C G

A A C A G

U G A

C G G G G A C C U A

C U A

C G A A A A A G A C A C A C

U G U G A U G G A C A G C U U U

C G U G U U G U A G

U C

G

A G A G U C G G

G U A A U A A G U U C G A G

G A C A G x

C

A A C

U G U U RNAz4 Panda

C A

A C C U G G U A C

U RNAz5 Elefant G C U

A G U C

G C

Cat A C U U G

A A G A C

C

G A U G U G

C U A

C G A

A A C G A C U U U A

C A G G U

U G A G C A U G

CAU G A C G G A G U G

G G U G A A C G A C C

U U C G U A G C U G C A C U A A C U U U

C C U G U A A G

C U C A G G U U C G C A G A C U G G G

C A A A U A A G G G C G U U G A G U A G G C C G U UC G G C C G UU U G A U A U U U G G C A A U G A C G G UG CC G C C G U C U G U C G A A A A U C G C G A C U A U G A C A U U G C G C C G G A A G U U A G C A U G A C G U U G U A U A G U U U A U U G C A A A U U U G A A U C U A A U U A G G C U A C A G G C A C A G AG G A U A RNAz4 Marmoset A U G A A U G RNAz5 Dog C G C G A C U U U A U G A A U U A C A A G U C U G A C U

C A U

C G C U U C G U C G A

G U A A

C U G C U U

U A U A C G A

G C G A

G U A G

A U U A C

A A C A

U C C C A C

G A G

C A U

U A G A

G G A G G G

G

C A C

A U

G A C U

U G G G G

U G G RNAz4 Orangutan

G C U A C G G

U C

A C A G A A G A G C U U U C A G

U U A A U A U A

C U G U C A

C U A G G A U A A U

U U C

C U G G G

G C C C U C

G A C C

C A U G A

U A A C

A G A

C U A

C

A U A G A

G G A C A C

A A

G A U C

G A RNAz5 Cat

G A A

C C U G

C G

G G U G

A

U C

A U

A U

C U A C U

U C A C G G G A G A G A C U A A

Panda C G C C A A G G C U C A

U

G C C C A G C G A

A C A

A U U

A C

U U C

U C G A

A A C U

C G G

G G G U A G

U C A

C G A

C

C C

C U

A C U A

G G G U

U C U G C G C U C

G A U G C U U A

G G A C G U G G G U C C G G C

G G C A A

A C G C G G

A C G G C U G U G G U C A G A A U A G A A C U U C U A A G C A G U G A C G U C C G G C G U C A

G U G G U A U A U A

U U C G A

C C C U G U U G G A

G A C A G C U U U U A G U A U U U U A U A A G A C U U A G C U C A C C U C A U A C A U A C C U A A U A AG AU G C U U G U A A C C U x G U C C A G A G C G A U A G A C C A C U G U A RNAz4 Rhesus A U G C U G U G C A A G G U C G G A G A A A G A G C A C U C G C U C U C U C G A G G C U U C A C A A U A U A G G G C G C C G A U U A C C A A C U A C G C C C U A C C A U G C A U C U C RNAz5 Panda RNAz4 Chimp

G G G

G G

U C G C A U

U A

U U A G

C G A U U C U C U C A C G G U

G C C U

U A C U C A G

A G AU A A

C U C GU GAA

U C G U AG A A C G U U G A G A C

C U U A A

U G C U G A

U G A U U

G A G A

G C A

U A U G A A A A

A U A RNAz4 Human

A C

A U C U A U U U G

G A A A

G A U

U C G G A A G

G A U G A G C

G G A A C

C A A

G G U U G A A A

G C G

C G G A U

G A

U A C

A G A A U

A U A G G G G U

A C C C G A U C A A G C A U A A U A A G U C U A RNAz6 Panda C G C

U A U A A

A U U C U U G U U

U A

U

C A A C

A A A A C A C G G A

U C A

A G A G U G C

A A C

A C G C

C A G A

U A

G U

G A G G C U

A A U

C A C G

G G C A G U G G G

U U A A A

A

U U

U A U A

G C A U C

U

C C C A C G

A G RNAz3 Cat

C U G G G G

C A C U G U

C C U G C U A G

G

Dog G

G

C C U U G G U C A G G G A

U G G A

A G U

A C G A

G C G A G A

G C C G

C

C C A C A G

G G C A U

G A A

G A U

G C A G

A A

A G U G C

G C C A A G C

G G G A A

G G G C

G C

U G C U C C C C U C U C C U A

U

G A A

U G G G A C A A U U A G

U G G G A C

G G U C U G G G G G G G C A G

G U UG G G A G G G U

C G G G U U A U A A A C C A A G

A U U C G A G C U C U A U U G G A

A U C G G G A G A C U C C A A A

G U G A C U A A G G U G G A

G U C U A A U C U A G A U G U A G

A G C G C

A A U U A G A C A G G U U A A A

A U U A C A C U C A

U C C G U G A

G U G C G U C G

C A A U U G A A

C A U A U G G C G A G A A G U U A G A A U U U A

U A A G C C U RNAz3 Dog

A U A A G C G A G A U A A A A A G A C U A A A RNAz6 Dog A G A A G G A C U A A G U C G U A A G U U G U A G G G U A G A G U G C G A A C A U C G U C C A C C A C U A G U C C C U G A U G G A G G U C U G A U G C G C A U G A G A G G A U G U A U C C U A G C G A G A G C C G G C U A G A U G U G U U G A G A A G G G RNAz3 Panda A G A A G C C U G C G U A A U G G A G A A G C U C U A C G C C A C U G U C G C A G U U U U U G A G U A A RNAz3 Elefant G A C G C G U A G A G U A A G A U A C C C G G G G U U A C A A A A C C A C A G C U G C A A C A U G U U A C A C U C U C A A C C G A U A A U C U U C

A A A A A A A A G A A A A A U G G U C C A A C C G C G G U U U A C C G C A C U A U U C U A RNAz3 Marmoset U A C C A U A A A G G U U C C A C C G U U C A C A C U U A G C G C A G G U C C C C U U G U G U G U U C A U A A A G U CC U C A G C A A A A U GU U AU C C A GA C C C U G A U RNAz3 Human U A A U G A G A U A A A A A A A U G G G A C G U C C A U U A A A G C G U A U C A C C C G C U G A U U A C C G G C A C A U G A U C C A U G C A U C G A A U C A G U C A U U G A U A C C G U U A G C A G GC C A U A A C A A A A C G U C U U A C G C A C A U A A G A A A U U U U C G A A U C C G A A G C C C A G G G C G U G G G A A U A G A A A C G C A CA A A C U RNAz3 Chimp G G U A G C U U A U U A U G U A C G G A C G G C A C G U U C A G U A C C C U U G G U C C U U A C G A A G C U G A U C A A A G G C A U A G C C G G A A A A G G A A C G U U A G U G U G U C A U A U U G U G G G C C C G A U G A U C U G G G U C A A C C G x G A G A C G U U C A G A U U G A A A A A G A G G U C G G A A U C U G U C G Elefant C C U C A U A U C U U U U A A A G A U C G RNAz3 Rhesus A C G A A U A U A A G A G U U C G U A G G U A G UC C U A A C A C U G C A G A G G U U A A C C A C A C U G C U U A G U A G G A G U A A A C G A G C C A C G G A C A A U C C C A G U A C A U C G U C U A G A A G U A G G U G C G U U G G A G U A A C A G A G C G A G A U C C UA G A U RNAz3 Orangutan G C A U A U C C A CCACA U A C U G U G A G C U A U G C A U G U G C G A G C A C U A C G A G U G A A G G G A A G U C A G A G U A G A A A G G A C C C A C U U A G C U C A A C C A U G U A G A G U A U G A A A A A A U C G A G C A C U U G U C C G A A U A G A G A A G U CA G A G CCU G A G A C U C U C U G G G A U G C C C U C G U U A C G C U G A C A G U C C G C U A U C G U G G U C A U C A C C A A C U U U A G A A A C AU G U A C A A G A C C C U C G C C G A A C C U C U G A C A G U U C U C A C G G G G G U

Fig. 4. Traces of evolutionary conserved secondary structures. The minimum free energy struc- tures of the six RNAz-predicted regions are at least partially conserved throughout higher eukaryotes. A sequence/structure-based clustering using LocARNA (Will et al., 2007) visualizes the similarities between the predicted structures in more detail. As expected, the structures nearly perfectly cluster into the six groups.

1 kb chr6 108407500 108407000 108406500 108406000 108405500 108405000 HTS data from GSE20370

Mouse EGOT ortholog

Placental Mammal Basewise Conservation by PhyloP

Fig. 5. Non-coding RNA profiling by high throughput sequencing reveals extragenic Pol-II sites at the mouse EGOT ortholog. The deep sequencing data from (De Santa et al., 2010) available in the Gene Expression Omnibus (under GEO accession number GSE20370) confirm tran- scription of the intronic highly conserved element (HCE) and parts of the 3’end of the mouse EGOT ortholog. Although the data do not validate the full mouse ortholog, we benefit twice from the depicted transcribed regions. On the one hand the two independently transcribed regions at the intronic HCE sup- port our findings that the HCE consists of two independent non-coding as well as protein-coding domains. Next, since it was previously postulated that EGOT may act via siRNAs to repress its targets MBP and EDN (Wagner et al., 2007), the signals at the 3’ end on the other hand might in deed indicate small RNAs that are hosted by EGOT.

4 References

De Santa, F., Barozzi, I., Mietton, F., Ghisletti, S., Polletti, S., Tusi, B., Muller, H., Ragoussis, J., Wei, C., and Natoli, G., 2010. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol 8: e1000384. doi: 10.1371/journal.pbio.1000384. Wagner, L., Christensen, C., Dunn, D., Spangrude, G., Georgelas, A., Kelley, L., Esplin, M., Weiss, R., and Gleich, G., 2007. EGO, a novel, noncoding RNA gene, regulates eosinophil granule protein transcript expression. Blood 109: 5191–8. Will, S., Reiche, K., Hofacker, I., Stadler, P., and Backofen, R., 2007. Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering. PLoS Comput Biol 3: e65. doi:10.1371/journal.pcbi.0030065.

5