SUPPLEMENTAL MATERIAL

Human intron-encoded Alu RNAs are processed and packaged into Wdr79-associated nucleoplasmic box H/ACA RNPs

Beáta E. Jády, Amandine Ketele and Tamás Kiss

Supplemental Table 1. Alignment of the 3’-terminal sequences of 348 human AluACA RNAs. The 5’-terminal sequences, which correspond to the structurally highly variable 5’-hairpin regions of AluACA RNAs, are omitted. Numbers on the left identify the individual AluACA RNAs. Multiple sequence alignment was performed with the ClustalW2 program (European Bioinformatics Institute). The ACA box motifs, located three nucleotides upstream of the RNA 3’ end, are in red. The proximal and distal CAB boxes (pCAB and dCAB) are in blue, and the predicted H boxes are underlined. The lengths of the indicated 3’-terminal AluACA RNA sequences are shown on the right.
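The two numeric conventions in this legend (the sequence length on the right, and the ACA box three nucleotides upstream of the 3’ end) can be checked for any row with a short script. The following is only an illustrative sketch, not part of the original analysis: the helper names `ungap` and `has_aca_box` are hypothetical, and it assumes that gaps are written as `-` and that the length counts ungapped nucleotides. The example row is AluACA RNA 3 from the alignment.

```python
def ungap(aligned: str) -> str:
    """Remove alignment gap characters ('-') from one aligned row."""
    return aligned.replace("-", "")


def has_aca_box(seq: str, box: str = "ACA", tail: int = 3) -> bool:
    """Return True if `box` sits immediately upstream of the last `tail` nucleotides."""
    if len(seq) < len(box) + tail:
        return False
    return seq[-(tail + len(box)):-tail] == box


# Aligned 3'-terminal sequence of AluACA RNA 3, copied from the table (listed length: 76).
ROW_3 = (
    "------AG-ATCAC-T-TG---AGCCC----AGGAG----ATT------GAGGCTG-TAGTG--AGCC-AAG"
    "---AT-CAC------ATCACT------GCACT-CCAA------TCTG--GGTG-ACAAAG"
)

seq = ungap(ROW_3)
print(len(seq), has_aca_box(seq))
```

Rows whose last six nucleotides do not begin with ACA (for example, sequences ending in ATAGAA) would return `False` under this strict check, so a real scan might need to allow ACA-box variants.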

H box   pCAB   dCAB box   ACA box
3    ------AG-ATCAC-T-TG---AGCCC----AGGAG----ATT------GAGGCTG-TAGTG--AGCC-AAG---AT-CAC------ATCACT------GCACT-CCAA------TCTG--GGTG-ACAAAG 76
268  ------AG-ATCAG-T-TG---AGCCC----AGGAG----ATG------GAGGCTG-CAGTG--AGCC-AAG---AT-CAC------ACCACT------GCACT-CCAT------CCTG--GGCA-ACAGAG 76
209  ------AG-ATCATGC-TG---GGCCC----AAGAG----GTG------GAAGTTG-CAGTG--AGCC-AAG---AT-CAC------GTCACT------GCACT-TCAG------CCTG--GGCA-ACAGAG 77
105  ------AG-ATCAC-T-TG---AGCCT----AGGAG----GTA------CAGGTTA-CAGTG--AGCC-AAG---AT-CAC------GCCACT------GTACT-CCAG------CCTG--GGCA-ACAGAG 76
282  ------AG-ATCGC-T-TG---AGCCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCACT------GCACT-CAAG------CTTG--GGCA-ATAGAA 76
63   ----GGAGTATCAC-T-TG---AGCTC----AGGAG----GTT------GAGGCTG-CAGTG--AGCC-AAG---AT-TGT------TCCACT------GCACT-CTAG------CCTG--AGCA-ACAGAG 79
90   ------AGAATCAC-C-TG---AGCCC----AGGAG----GTC------GAGGCTG-CAGTG--GGCC-AAG---AT-CAT------GCCACT------GCATT-CCAG------TCTG--GGCAAACAGAG 78
1    ------AGATCA-CT-TG---ACCCC----AGGAG----GTC------AAGACTG-CAGTG--AGCT-ACA---AT-TGC------ACAACT------GCACTAC------AACCT---GGGTGACATGG 76
319  ------AGATCA-CT-TG---AGCCC----AGGAG----GTC------AAGGTTG-CCGTG--AGCC-CAA---AT-AGC------ACTACT------GCACTACTCCACTCC-AGCCT---GGGCGACAGAG 84
328  ------AGATCA-CT-TG---AGCCC----AGGAC----ATC------AAGGCTG-CAGTG--AGCC-AAG---AC-TGC------ACCACT------GCATTTT------AGCCT---GGGCAACAGAG 76
109  ------AGATCA-TC-TG---AGCCC----AGGAG----GTC------AAGGTTG-CGGTG--AGCC-GAG---AT-TGC------ACCACT------GCACTCC------AGCCT---GGGTAACAGAG 76
174  ------AGATCA-CT-TG---AGCCC----AGGAG----GTG------GGGGTAG-CAGTA--AGCC-AAG---AT-TGC------ACCACT------GCACTCC------AGCCT---GGGTGACAGAG 76
284  ------AGATCA-CC-TG---AGCCC----AGGAG----GCA------G---TTG-CAGTG--AGCC-AAG---AT-TGC------ACCACT------GCACTCC------AGCCT---GGGTGACAGAG 73
149  ------AGATCA-CT-TG---AGTCC----AGGAG----GCG------GAGGCTG-CAGTG--AGCC-AAG---AT-TGC------ACCACT------GTACTCC------AGCCT---GGGCAACAAAG 76
88   ------AGATGG-GA-AG---AGCCC----AGGAG----GTT------GATACTG-CAGTG--AGCC-AAG---GT-AGC------ACTACT------GCACTCC------AGCCT---GGGCCACAGAA 76
345  ------AGATGG-GA-AG---AGCCC----AGGAG----GTT------GATACTG-CAGTG--AGCC-AAG---GT-AGC------ACTACT------GCACTCC------AGCCT---GGGCCACAGAA 76
204  ------AGATCA-CA-AG---ACCCC----AGGAG----ACT------GAGAGTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------ACACTCC------AGCCT---GGGCGACAGAG 76
103  ------AGATCA-TT-TG---AACTC----AGGAG----ACG------GAGGTTG-CAGTG--AGCC-AAG---AT-CGC------ACCACT------GCACTCC------AGCCT---GGGCGACAGTG 76
5    ----AGAGAATCG-CT-TG---AAGCC----AAGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AA-CGT------GTCACT------GCATA-TCAG------CCT---GGGCAACAGAG 79
157  ----AGAGAATCG-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CGT------GTCACT------GAACT-CTAG------CCT---GGGCAACAGAG 79
177  ------AAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-GGT------GTCACT------CTACT-CCAG------CTT---GGGCAACAGTG 76
208  ------AAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CGC------ATCACT------GCATT-CTAG------CCT---GGGCAACAGAG 76
327  ----AGAGAATCG-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCACT------GTATT-GCAG------CCT---GGGCAACAGAG 79
34   ----AGAGAATCG-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CGGTG--AGCC-GAG---AT-GGT------GCCATT------GCATT-CCAG------CCT---GGGCAACAGAG 79

79   ----AGAGAATCATCT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCACT------GCTCT-CCAG------CCT---GGGCGACAGAG 80
139  ----AGAGAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAA---GT-TGC------ACGACT------GCACT-CCAG------CCT---GGGCGACAGAG 79
35   ----AGAGAATCA-CT-TG---AACCC----AGGAG----GCA------GACGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGCAACAGCG 79
87   ----AGAGAATCA-CT-TA---AACCC----AGGAG----ACG------GAGGTTG-CAGTG--AGCT-GAG---AT-CCC------GCCACT------GCATT-CCAG------CCT---GGGCAACAGAG 79
188  ----AGAGAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CCA------CCACT------GCATT-CCAG------CCT---GGGCAACAGAA 78
36   ----AGAGAATCA-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCAGT-CCAG------CCT---GGGCAACAGAG 79
347  ----AGAAAATCC-CT-TG---AACCC----AGGAG----GCA------GAGTTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 79
266  ----AGAGAATCA-CT-TG---AACCC----AGGAG----GTA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 79
206  ------AAATCA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AG-TAC------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 76
258  ------AAATCA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCACT------GCCCT-CCAG------CCT---GGGCAACAGAG 76
275  ------AAATCG-CT-TG---AACCC----GGGAG----GCG------GAGCTTG-CTGTG--AGCC-GAG---AT-TGC------ACCACT------GCACC-CCCG------CCT---GGGCAACAGAG 76
341  ----AGAGAATCA-CT-TG---AACCC----GGGAG----GCG------GAGCTTG-CAGTG--AGCC-GAG---AT-TCT------GCCACG------GCACT-CCAG------CCT---GGGCAACAGAG 79
111  ------AAATCA-CT-TG---AACCT----AGGAG----GCG------GAGGTTG-CGGTG--AGCC-AAG---AT-CAT------GCCATT------GCACT-CCAG------CCT---GGGTAACAAGA 76
317  ---ACAAAAATCA-CT-TG---AACCC----GGGAG----GTG------GAGGCTG-CGGTG--AGCC-AAG---AT-CAT------GCCATT------GCACC-CCAG------CCT---GGGCAACAAGA 80
53   ------AAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTA-CAGTG--AGCC-AAG---AT-CAC------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 76
8    ----AGAGAATCA-CT-TG---AGCCC----AGGGA----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAC------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
135  ---AG-ATAATTG-CT-TG---AACCC----AGGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-CGC------ACCATT------GCACT-CAAA------CCT---GGGCAACAAGA 79
346  ---AGAAAAATTG-CT-TG---AAACC----AGGGG----GCG------GAGATTG-CAGTG--AGCC-AAG---AT-CTC------TCCACT------GCACT-CCAA------CCT---GAGCAACAAGA 81
43   ------AAATCA-CT-TG---ACCCC----GGGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCACT------GCATT-CCAG------CCT---GGGCAACAAGA 76
158  ----AGAGAATCA-CT-TG---AACCC----GGGAG----GCG------GAGGTTG-TAGTG--AGCG-AGG---AT-CGT------GCCACT------GCATT-CTAG------CCT---GGGCAACAAGA 79
24   ----AGAGAATCA-CA-TG---AACCC----AGGAC----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCATT------GCAGT-CGAT------CCT---AGGCAACAAGA 79
329  ------AAATCG-CT-TG---AACCC----AGGAG----TTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCATT------GCATT-CCAG------CCT---GGGCAACAAGA 76
211  ------AGAATCG-CT-TG---AACCC----AGGAG----GCGGAAGGTGAGGTGG-CAGTG--AGCC-GAG---AT-TGT------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 83
269  ------AGAATCA-CT-TG---AACCC----AGGAG----GC------TGAGGTTG-CAGTG--AGCCTGAG---AT-TGC------GCCACT------GCTTT-CCAG------CCT---GGGCAACAAGA 78
46   ----AGAGAATTG-CT-TG---AACCC----AGGAG----GCG------AAGGTTG-CATTG--AGCT-GAG---AT-CAC------ACCATT------GCACT-CCAT------CCT---GGGCGACAAGA 79
334  ----AGAGAATTG-CT-TG---AACCC----AGGAG----GCG------AAGGTTG-CGGTG--AGCA-GAG---AT-CAC------ACCGTT------GCACT-CCAG------CCT---GGGCAACAAGA 79
229  ----AGAGAATCA-CT-TG---AACC-----AGGAG----ATG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------CTCATT------GCACT-CCCG------CCT---GGGCAACAAGA 78
307  ------AAATCA-CT-TG---AACCC----AGGAG----CCG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCATT------GCACT-CCAG------CCT---GGGCAACAAGA 76
326  ----AGAGAATCG-CT-TG---AACCC----AGGAG----GCG------GAGGTTG-CAGTG--AGCC-GAG---AT-CAC------ACCATT------GTACT-CCGG------CCT---GGGCAACAAGA 79
224  ----AGAGAATCA-CT-TG---AACCC----A--AG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACTATT------GCACT-CCAG------CCT---GGGGAATAAGA 77
261  ----AGAGAATCA-CT-TG---AACCC----AGGAG----GCA------GAGGTTG-TGGTG--AGCT-GAG---AT-CAC------GCCGTT------GCACT-CCAG------CCT---GGGCAACAAGA 79
94   ----AGAGAATCT-CT-TG---AACCC----AGGAG----GCA------GAGGTGG-TGGTG--AGTC-GAG---AT-AGC------ACCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
180  ----AGAGAATAG-CT-TG---AACCC----AGGAG----GCA------GAGGTTA-TGGTG--AGCC-GAG---AT-TGT------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
223  ----AGAGAATCC-CT-TG---AGCCC----GGGAA----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CGC------ACCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
235  ----AGAGAATCG-CT-TG---AGCCC----GGGAA----GCG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
58   ------AAATTG-CT-TG---AACCC----AGGAG----GTG------GAAGTTG-CGGTG--AGCC-GAG---AT-CGT------GTCATT------GTACT-CCAT------CCT---GGGCAACAAGA 76
192  ----AGAGAATCA-CT-TG---AACCC----GGGAG----GTG------GAAGTTG-CGGTG--AGCC-GAG---AT-CAT------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 79
68   ------AAATCG-CT-TG---AACC-----GAGAG----GTG------GAGGTTG-CGGTG--AGCT-GAG---AT-CGC------GCCGTT------GCACT-CCAG------CTT---GGCCAACAAGA 75
221  -ACAGTAGAATCG-CC-TG---AACCT----GGGAG----GTG------GACTTTG-CAGTG--AGCC-GAG---AT-CAC------ACCATT------GCCCT-CCAG------CTT---GGGCAACAAGA 82
100  ---AG-AGAATTG-CT-T----ACCC-----GGGAG----GTG------GAGATTG-TGGTA--AGCT-GAG---AT-CGC------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 77
185  ---AG-AGAATCG-CT-TG---AACC-----TGGAG----GTG------GAGATTG-CCGTG--AGCC-GAG---AT-CGC------GCCATT------GCACT-CCAG------CCT---GGGCAACAAGA 78
309  ---AG-AGAATTG-CT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------ACCATT------GCACT-CCAG------CCT---GGGCAACATGA 79
344  ---AG-AGAAATG-CT-TG---AATCT----GGGAG----GAG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---GGGCAACAAGA 79
14   ---AG----ATCG-CT-TG---AACCC----GGGAG----ACA------GATGTTG-CAGTG--AGCT-GAG---AT-CGC------ACCATT------GCACT-CCAG------CCT---GGGCGACAAGA 76
37   ---AGGAAAATCG-CC-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CGT------GCCACT------GCACT-CCAT------CCT---GGGCGACAAGA 80
81   AGAGA----ATCG-CT-TG---AACCC----GGGAG----GCG------GAGGTCG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GCACT-CCAG------CCT---GGGCGACAAGA 79
276  ---AA----ATCA-CT-TG---AACCC----GGAAG----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GTACT-CCAG------GTT---GGGCCACAAGA 76
250  -AGAG---AATCG-CT-TG---AACCC----GGGAG----GCG------GAGGTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCACT-GCAG------CCT---GGGCTACAAGA 79
82   ----AGAGAATCA-CT-TG---AACC---T-GGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CAC------ACCACT------GTACT-GCAG------CCT---GGGCAACAAGT 79
152  ----AGAGAATCA-CA-TG---AACC---T-GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCATT------GCACT-CCAG------CCC---AGGCGACAAGT 79
301  ----AGAGAATCG-CT-TG---AACC---T-GGGAG----ACG------GAGGTTG-TAGTG--AGCC-GAG---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGCGACAAGA 79
162  ----AGAGAATTG-CT-TG---AACC---T-GGGAG----GCA------GAAGTTG-CAGTG--AGCC-AAG---AT-CGC------ACCATT------GCCCT-CCAG------CCT---GGGTGACAAGA 79

193  ------AAATTG-CT-TG---AACC---T-GGGAG-----CA------GAGGTTG-CAGTG--AGCC-AAG---AT-CGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAAGA 75
194  ----AGAGAATTG-CT-TG---AACCGCCT-GGGAG----GCA------GAGGTTG-CAGTG--GGCC-AAG---AA-CGC------GCCACT------GCACT-CCAG------CCT---GGGCAACAAGA 82
181  ----AGAGAATCA-CT-TG---AACC---T-GGGAG----GCG------GAGGTTG-CAGTG--AGCT-AAG---AT-TGC------GCCACT------GCACG-CCAG------CCT---GGGTGACAAGA 79
199  ----AGAGAATCA-CT-TG---AACC---T-GGGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGC------CCCACT------GCACT-CCAG------CCT---GGGCAACAAGA 79
165  ----AGAGAATCG-CT-TG---AACCC----AGGAA----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CGC------GCCATT------GCACT-CCAG------CCT---GGGGGACAAGA 79
296  ----ACAGAATCG-CT-TG---AACCC----AAGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------ATCATT------GCACT-CCAG------CCT---GGGGGACAAGA 79
11   ------AGAATCA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CGT------GCCATT------GCACT-CCAG------CCT---GGGTGACAAGA 77
45   ----AGAGAATTG-CT-TG---AACCC----GGGAG----GCG------GAGTTTG-CAGTG--AGCC-AAG---AT-CAC------GCCATT------GCACT-CCAA------CCT---TGGTGACAAGA 79
48   ----AGAGAATTG-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCATT------GCACT-CCAG------CCT---GGGTGACAAGA 79
19   ------AGAATAG-CC-TG---AACCC----GGGAG----GCG------GAGTTTG-CAGTG--AGCG-GAG---AT-CGT------GCCATT------GCACT-ACGG------CCT---GGGCGACAAGA 77
125  ---AG-AGAATCA-CT-TG---AACCT----GGGAG----ACG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------ACCATT------GCACT-CCAG------CCT---GGGGGACAAGA 79
256  ---AGAAGAATCA-TT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCATT------GCACT-CCAG------ACT---GGGGGACAAGA 80
10   ---AG-AGAATCA-CT-TG---ATCCT----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGCGACAAGA 79
182  ---AG-AGAATCA-CC-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACCACT------GCACT-CCAAG------CCT---GGGTGACAAGA 80
342  ---AG-AGAATCA-CT-TG---AACCC----AGGAG----GTG------GAGATTT-CAGTG--AGCC-GAG---AC-TGT------ACCATT------GCACT-CCAG------CCT---GGGTGACAAGA 79
16   ---AG-AGAATTG-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--CTCC-GAG---AT-TGT------GCCACT------GCACT-GCAG------TCT---GGGTGACAAGA 79
230  ---AG-AGAGTCA-CT-TG---AACCC----AGGAG----GCG------GAGGTTG-CAGTGAATTGA-GAG---AT-TGT------GCCACT------GCACA-CCAC------CCT---GGGCAACAAGA 81
102  ---AG-AGAATTG-GT-TG---AACCC----GGGGG----TCG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGC------GCCACT------TCACT-CCAG------CCT---GGGCAACAAAA 79
285  ---AG-AGAATTG-CT-TG---AACCC----GGGAG----TTG------GAGGTTG-CAGTG--AGCT-AAG---AT-TGC------GCCACT------GCACT-CCAG------CCT---GGGCAACAAGA 79
69   ---AGGAGAATCC-TT-TG---AACCC----AGGAG----GCT------GAGGTTG-CAGTG--AGCC-AAG---GT-TTT------GCCATT------GCACT-CCAG------CCT---GGGCGACAGGG 80
121  ----GGAGAATCC-TT-TG---AACCC----AGGAG----GCT------GAGGTTG-CAGTG--AGCC-AAG---GT-TTT------GCCATT------GCACT-CCAG------CCT---GGGCGACAGGG 79
248  ------AGAGTCA-CT-TG---GGCCC----CAGAG----GTG------GAGGTTG-CAGCG--AGCC-AGG---TT-CTT------GCCATT------GCACT-CCAG------CCT---GGGCGACAAGG 77
54   -ACAGGAAAATCA-CT-TG---AACCC----GGGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGT------GCCATT------GCACT-CCAG------CCT---GGGCAACAGGG 82
86   ----AGAGAATCG-TT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-CGT------GCCATT------GTACT-CCAA------CCT---GGGCGACAGGG 79
313  ----AGAGAATCA-TT-TG---AACCT----GGGAG----GTG------GAGGTTG-CAGTG--AACT-GAG---AT-TGC------GCCATT------GCACT-CCAG------CCT---GGGCAACAGGG 79
7    ------AG-ATTG-CT-TG---AGCCT----GGGAG----GTG------GAGGTTG-CAGTG--AGCT-GAG---AT-AGT------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAC 76
330  ---AGAGG-ATCG-CT-TG---AGCCT----GGGAG----GAG------GAAGGTG-TAGTG--AGCT-GAG---AT-CAT------GCCACT------GCACT-CCAGCACTCCATCCT---GGGCGACAGAG 87
315  ---AGAAGAATCG-CT-TG---AGCCT----GGGAG----GCG------GATGTTG-TGGTG--AACT-GAG---AT-TGT------GCCACT------GCACT-CCAC------CCT---GGGGCACAGAG 80
232  ----AGAGAATCG-CT-TG---AGCCT----GGGAG----GCA------GAGGTTG-CAGTC--AGCG-GAG---AT-CGT------GCCACT------GCGCT-CCAG------CCT---TGGCGACACAG 79
26   ----AGAGAATTG-CT-TG---AACCC----TGGAG----ATG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGT------GCCATT------GCATT-CCAG------CCT---AGGTGACAGAG 79
281  ------ATATTG-TT-TG---AACCT----GGGAG----ATG------GGGGTTG-CAGTG--AGCC-AAG---AT-CTT------GCCAGT------GCACT-CCAG------CCT---AGGGGACAGAG 76
107  ----AGAGAATCA-CT-GG---AACCT----GGGAG----GTG------GAACTTG-CAGTG--AGCC-AAG---AT-CAT------GCCATT------GCCCT-CCAG------CCT---GGGTGACAGAG 79
331  ----AGAGAATTC-CT-TG---AACCT----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGT------GCCATTGCACTCCAGCCCT-CCAG------CCT---GGGTGACAGAG 87
183  ----AGAGAATTG-CT-TG---AACCT----GGGAG----GCG------GAGGTTG-CTGTG--AGCC-GAG---AT-CAT------GCCATG------GCACT-CCGG------CCT---GGGTGACAGTG 79
146  ----AGAGAATCA-CT-TG---AACCT----GGGAG----GTG------TAGGTTG-CAGTG--AGCC-AAG---AT-TGT------GCCACT------ACACA-CCAA------CCT---GTGCAACAGAG 79
336  ----AGAGAATGG-CA-TG---AACCT----GGGAG----GTG------GAGCTTG-CAGTG--AGCC-AAG---AT-TGT------ACCACT------ACACT-CCAG------CCT---GGGCAACAGAA 79
335  ----AGAGAATGG-CT-TG---AACCT----GGGAG----GTG------GAGATTG-CAGTG--AGCC-AAG---AT-CGT------GCCACT------GAACT-CCAG------CCT---GGGCGACAGAG 79
33   ----AGAGAATCC-CT-TA---AACCT----GGGAG----GTG------GATGTTG-CAGCG--AGCT-GAG---AT-CAT------GCCACT------GCACT-TCAG------CCT---GGGTTACAGAG 79
170  ----AGAGAATCG-CT-TG---AACCT----GGGAG----GTG------GACGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCGCT------GCACT-CCAG------CCT---GGGTGACAGAA 79
41   ----AGAGAATCG-CT-TG---AACCT----GGGAG----GTG------GAGGTTG-CGGTG--AGCT-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAA 79
279  ----AGAGAATCG-CT-TG---AACCT----GGGTG----GTG------GAGGTTG-CGGTG--AGCT-GAG---AT-TGT------GCCACT------GTACT-CCTG------CCT---GGGTGACAGAA 79
225  ----AGAGAATTG-CT-TG---AACCT----GGGAG----GTG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------ACCACT------GCACT-ACAG------CCT---GGGTGACAGAC 79
259  ----AGAGAATTG-CT-TG---AAACT----GGAAG----GTA------GAGTTTG-CAGTG--AGCT-GAG---AT-TGT------ATCACT------GCACT-CCAG------CCT---GGGTGACAGAA 79
95   ----AGAGAATCA-CT-TG---AACCT----GAGAG----GCG------GAGGTTG-CAGTG--AGCC-GAG---AT-CGT------GCCTCT------GCACT-CCAG------CCT---ACAGGACAGAG 79
320  ----GGAGGATCA-CT-TG---AACCT----GTGAG----GTG------GAGGCTG-CAGTG--AGTC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---AGGTGACAGAA 79
31   ---AG-AGAATTG-CT-TG---AGCCC----GGGAG----GTG------GAGGTTG-CAATG--AGCA-GAG---AT-CAC------ACCATT------GCAGT-CCAG------CCT---GGGCAACAGAG 79
172  ---AG-AGAACTG-CT-TG---AGGCT----GGGAG----GTC------AAGGCTG-CAGTG--TGCT-GAG---AT-CAT------ACCACT------GTATA-CCAG------CCT---GGGCAACAGAG 79
154  ---AG-AGAATTG-CT-TG---AACCC----GGGAG----GCA------GAGGCTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---GGGCTACAGGG 79
148  ---AG-AGAATCG-CT-TG---AACCT----GGGAG----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCATT------GCACT-CCAG------CCT---GGGCGACAGAG 79
305  ---AGAAGAATGG-CT-TG---AACCT----GG-AG----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---GGGCGACAGAA 79
176  ---AG-AGAATCG-CT-TG---AACCT----AGGAG----GCA------GAGATTG-CAGTG--AGCT-GAG---AT-CAC------ATCACT------GCACT-CCAG------CCT---AGGCAACAGAG 79
59   --AGG-AGAATTG-CT-TG---AACCC----AGAAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CAT------GCCACT------ACACT-GCAG------CCT---GGGCAACAGAG 80

127  ---AG-AGAATTG-CT-TG---AACCC----AAGAG----GCA------GAGGTTG-CAGTG--GGCC-GAG---AT-CGC------ACCACT------ACACT-ACAG------CCTGCTGGGCAACAGAG 82
62   ---AG-AGAATTT-CT-TG---AACCT----GGGAG----GCG------GAGGTTA-CAGTG--AGCT-GAG---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 79
254  ---AG-AGAATTT-CT-TG---AACCC----GGGAG----GCG------GAGATTG-CAGTG--AGCC-GAG---AT-CAT------GCCACT------GCACT-CCAG------CTT---GGGCAACAGAG 79
286  ---AG-AGAATTG-CT-TG---AACCC----GGGAG----GCG------TAGGTTG-CAGTG--AGTT-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 79
215  ---AG-AGAATTG-TT-TG---AACCC----GGGAA----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------ACCACT------GCACT-CCAG------CTT---GGGGAACAGAG 79
66   ------AGAATCG-CT-TG---AGCCT----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAC 77
196  --ACA-AGAATCG-CT-TG---AACCT----GGGAG----GCA------GAGGTTG-CATTG--GGCC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CTT---GGGCAACAGAG 80
207  ---AG-AGAATCG-CT-TG---AGCCT----GGGAG----GCA------GAGGTTG-CAGTG--CGCC-AAG---AT-TGT------ACCACT------GCATT-CCAG------TCT---GGGCAACAGAG 79
42   ---AC-AGAATTG-CT-TG---AACCT----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CGT------GCCACT------GCACG-CGAG------CCT---GGGCAACAGAG 79
99   ---AG-AGAATTG-CT-TG---AACCT----GGGAG----GCA------GAAGTTG-CAGTG--AGCT-GAG---AT-TGC------GCCACT------GCACT-CTAG------CCC---AGGCAACAGTG 79
115  ---AG-AGAATCG-CT-TG---AACCT----GGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCACT-CTAG------CCC---AGGCAACAGTA 79
151  ---AC-AGAATTG-CT-TG---AACCT----GGGAG----AGA------GAGGTTG-CAGTG--AACC-GAG---AT-TGC------ACCACT------GCACT-CCAG------ACT---GGGCGACAGAG 81
214  ---AG-AGAATTG-CT-TG---AACCT----GGGAG----GAA------GAGGTTG-CAGTG--AGCC-GAG---ATCTGC------ACTACT------GCACT-CCAG------CCT---GGGCAACAGAA 80
198  -AAAGGAGAATTT-CT-TG---AACCC----GGGAG----ACA------GAGGTTG-CAATG--AGCT-GAG---AT-CGT------GCCATT------GCAGT-CCAG------CCT---GGGGGACAGAG 82
242  ---AGGAAAATTG-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------ACCATT------GCACT-CCAG------CCT---GGGTGACAGTG 80
343  ----GGAGAACTC-CT-TG---AACCT----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CGC------GCCATT------GCACT-CCAG------CCT---GGGCGACAGAG 79
4    ----AGAGGATCA-CT-TA---AGCCC----AGGAG----GCA------AAGTTTG-CAGTG--AGCC-GAG---AT-TGT------GTCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
12   ----AGAGGATCG-CA-TA---AGCCC----AGGAG----GCA------GAGGTTG-CAGTG--AGTC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
163  ----AGAGAATGG-TA-TA---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
143  ----AGAGAATCG-CT-TG---AACCC----AGGAG----GTA------GAAGTTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCATC-CTAG------CCT---GGGTGACAGAG 79
253  ------AGAATAG-CG-TG---AACCC----AGGAG----GTA------GAGCTTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCACCTCCAG------CCT---GGGTGACAGAA 78
137  ---ACAAGAATCG-CT-TA---AACCC----GGGAG----GCA------GAGTCTG-CAGTG--AGCC-AAG---AC-TGT------GCCACT------GTACT-CCAG------CCT---GGGTGACAGAG 80
255  ------AGAATCG-CT-TG---AACCC----CGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GTATT-CCAG------CCT---GGGTGACACAC 77
32   ---AGGAAAATCG-CT-TG---GACCC----GGGAG----GCA------GAGGCTG-CAGTG--AGCC-AAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGCAACAAGA 80
244  ---AG----ATTG-CT-TG---GACCC----AGGAG----GTT------GAGGCTG-CAGTG--AGCC-AAG---AT-TGC------ACCACT------GCACT-CCAG------CTT---GGGTGACAAAG 76
15   ---ACAAGAACCG-CC-TG---AACCC----AGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-TGC------ACCACT------GCATT-CCAG------CCT---GGGTGACAAAG 80
50   ----AGAGAATCG-CT-TG---AACCT----GGGAG----GCC------GAGTTTG-CTGTG--AGCC-AAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
311  ------AGAATGA-CG-TG---AACCT----GGGAG----GCA------GAGCTTG-CAGTG--AGCC-AAG---AT-TGC------TCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 77
243  ------AGAGTCG-CT-TG---AGCCT----GGGAAA---GCA------GAGGTTG-CAGTG--AGCC-AAG---AC-TGC------GCCACT------GCACT-TCAG------CCT---GGGCGACAGAA 78
278  ---AGAAGAATCG-CT-TG---AGCCT----GGGAG----GCA------GAGGTTG-TAGTG--AGCC-AAG---AT-TGT------ACCACT------GCACT-CCAG------CCT---GGGTGACAGGA 80
299  ----AGAGAATGG-CT-TG---TGCCC----GGGAG----GCA------GAGGTTG-CAGTG--AGTC-AAG---AT-TGA------ACCACT------CCACT-CCAG------CCT---GGGTGACAGAA 79
167  ----AGAGAATCG-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-TGA------CCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
213  ----AGAAAATCG-CT-TG---AACCC----GAGAG----GCA------GAGGTTG-CAGTG--ACCT-GAG---AT-TGC------GCCACT------GCACT-CCAA------CCT---GGGTGACAGAA 79
324  ----AGAGAATCG-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCG-GAG---AT-TGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAT 79
168  ----AGAGAATCG-CT-TG---AACCC----GGGAG----GCA------GAGGTTA-CAGTG--AGCT-GAG---AT-TGC------ACCACT------GTACT-CCAG------CCT---GGGCGACAGAG 79
29   ----AGAGAATCA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCT-AAG---AT-TGT------GCCACTGCACACT-GCACT-CTAG------CCT---GGGTGACAGAG 86
252  ----AGAGAATCA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-TGC------GCCACT------GCGCT-CCAG------CCT---GGGCGACAGAG 79
96   ------AGATCA-CT-TG---AACCC----AGGAA----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 76
136  ------AAATCA-TT-TG---AACCC----AGGAG----GCA------GAGGCTG-CAGTG--AGCC-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGAGACAGAA 76
144  ------AGAATCA-CT-TG---AACCC----CGGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CGT------GCCACT------GCACT-CCAT------CCT---GGGTGACAGCA 77
303  ---AGAAGAATCA-CT-TG---AACAC----CGGAA----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAT 80
101  ----AGAGAATTA-CT-TG---AACCC----AGAAG----GCA------GGGGCTG-CAGTG--AGCT-GAG---AC-TGT------GCCACT------GCACT-GCTG------CCT---GGGTGACAGAG 79
195  ----AGAGGATCA-CT-TG---AACCC----GGGAG----GCA------GGAGCTG-CAGTG--AGCT-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
120  ------AGAATAA-CT-TG---AACCT----GGAAA----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-GGT------GCCGCT------GCACT-CCAG------CCT---GGGTGACAGAG 77
297  ------AGAACAA-CC-TG---AACCT----TGGAG----GCA------GAAGTTG-CAGTG--AACT-GAG---AA-TGT------GCCTCT------GCACT-CCAG------CCT---GGGTGACAGAG 77
226  ----AGAGAATTA-CT-TG---AACCC----GGGAG----GCA------GAGGTTG-CAGTG--AGCT-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
55   ---AG-AGAATCA-CT-TG---AACCC----AGGAT----GGG------GAAGTTG-CAGTA--AGCC-AAG---AT-TGT------GCCACT------GTACT-CCAG------CCT---GGGTGACAGAG 79
202  ---AG-AGAATCA-CT-TG---AACCC----AGAAG----GTG------GAGGTTG-CAGTC--ACCC-AAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGTG 79
116  ---AG-AGCATCA-CT-TG---AACCC----AGGAG----GGG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
186  ---AGAAGAATCA-CT-TG---GACCC----AAGAG----GTG------GAGGTTG-CGGTG--AGCC-GAG---AT-TGC------TCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 80
245  ---AG-AGAATCA-CT-TG---AACCC----GAGAG----GTG------GAGGTTG-CGG-G--AGCC-AAG---AT-TGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGCG 78
233  ---AG-AGAATCA-CT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCACC------GCACT-CCAG------CCT---GGGTGACAGAG 79

76   ---AT-AGAATCC-CT-TG---AACCCT---GGGTG----GCG------GAGGTTG-CAGTG--AACC-GAG---AT-TGT------GCCACT------GCATT-CTAG------CCT---GGGTGACAGAG 80
134  ---AG-AGAATCA-CT-TG---AACCC----GGGAG----GCG------GAGGTTG-CAGTG--AGCC-GAG---AC-TGT------GTCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
140  ------AGATCG-CT-TG---AACCT----GGGAG----GTG------AAGGTTG-CAGAG--AGCC-AAG---CT-TGC------GCCACT------GCACT-CTAG------CCT---GGGTGACAGAG 76
262  ------AGATCG-CT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGT------GCCACT------GCACT-CCAG------CTT---GGGTGACAGAG 76
80   ----GGAGGATGG-CT-TG---AACCC----AGGAG----GCG------AAGGTTG-CAGTG--AGCT-AAG---AT-TGG------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
118  ------AAATTG-CT-TG---AACCT----GGGAG----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCGCT-CCAG------CCT---GGGTGACAGAG 76
241  ------AAATCA-CT-CG---AACCC----CGGAG----GTG------GAGGTTG-CAGTG--AGCT-GGG---AT-TGT------GCCACT------GTGCT-CCAG------CCT---GGGTGACAGAG 76
6    ----AGAGAATCGCA--TG---AA-CCCG---GGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCAGCT------GCATT-CCAG------CCT---GGGCAACAGAG 80
114  ----AGAGAATCACT--TG---AA-CCCG---GGAG----GTG------GAGGTTG-CAGAG--AGCC-GAG---AT-CAC------TCCA-TT------GCACT-CCAG------CCT---GGGCAACAGAG 79
78   ----AGAGAATCGCC--TG---AA-CCCG---GCAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAC------GCCA-CT------GCACT-CCAG------CCT---GGGTAACAGAG 79
295  ----AGAGAATCGCT--TG---AA-CCCA---GGAG----GTG------GAGGTTG-TAGTG--AGTC-AGG---AT-CA------GCCA-CT------GCACT-CCTG------CCT---GGGCAACAGAG 78
91   ----AGAGAATCACT--TG---AG-CCCG---GGAG----GTG------GATATTG-CAGTG--AGCC-AAG---AT-CAC------GCCA-CT------GCACT-CCAG------CCT---GG-CAACAGAG 78
113  ----AGAGAATCTCT--TG---AA-CCCG---GGAG----GCA------GATATTG-CAGTG--AGCC-AGG---AT-CAT------GCCA-CT------GCACT-CCAG------CCT---GGGCAACAGAG 79
240  ----AGAGAATCACT--TG---AG-CCCG---GGAG----GTG------GAGATTG-CAGTG--AACC-AAG---AT-CAT------GCCA-CT------GCACT-CCAC------CCT---GGGTGACAGAT 79
316  ----AGAGAATCACT--TG---AACCCCG---GGTG----GTG------GATGTTG-CAGTG--AACC-GAG---AT-CAT------GTCA-CC------GCACT-CCAA------CCT---GGGTGACAGAG 80
18   ------AAATCACT--TG---GA-CCAG---GAAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCA-CT------GCACT-CCAG------CCT---GGGCGACAGAG 76
159  ----AAAGAATCGCT--TG---CA-CCCG---GGTG----GTG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCA-CT------GCACT-CCAG------CCT---GGGGGACAGAG 79
28   ----AGAGAATCGCT--TG---AA-CCCT---GGAG----GCG------GAGATTG-CAGTG--AGCC-AAG---AT-CAT------GCCA-CT------GCACT-CTAG------CCT---GGGTAACAGAA 79
325  ---AAAAGAATCGCT--TG---AA-CCCG---GGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CAT------GCCA-CT------GCACT-CCAG------CCT---AGGTAACAGAA 80
130  ----AGAGAATCGCT--TG---AA-CCCG---GGAG----GCA------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCA-CT------GCACT-CCAG------CCT---GGGCAACAGAG 79
9    ----AGAGAATCGCT--TG---AA-CCCG---GGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT-CAC------GTCA-CT------GTACT-CCAG------CCT---GGGTGACAGAT 79
89   ----AGAGAATCTCT--TG---AA-CCTG---GGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AT--AT------GTCA-CT------GCACT-CCAG------CCT---GGGTGACAGAG 78
61   ----AGAGAATCGAT--TG---AA-CCCC---GGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGC------GCCA-TT------GCACT-CCAG------CCT---GGGC-ACAGGC 78
265  ----AGAGAATCGCT--TG---AA-CCCA---GGAG----ATG------GAGGTTG-CAGTG--AGCC-AAG---AT-CGC------GCCA-CT------GCACT-GCAA------CCT---GGGCGACAGAC 79
332  ----AGAGTATCGCT--TG---AA-CCCA---GGAG----GCA------GAGGTTG-CAGTG--AGCC-AAG---AC-TGC------GCCA-TT------GCACT-GCAG------CCT---GGGTGACAGAG 79
129  ----AGAGAATCGCT--TG---AA-CCCA---GCAG----ACG------GAAGTTG-CATTG--AGCC-AAG---AT-CGT------GTCA-TA------GCACT-CCAG------CCT---GGGTGACAGAG 79
348  ----AGAGAATCGCT--TG---AA-CCCG---GGAG----GCG------GAAGTTG-CAGTG--AGCC-GAG---AT-CGT------GTCA-TT------GCACT-CTAG------CCT---GGGCGACAAAG 79
312  ----AGAGAATCGCT--TG---AA-CCCG---GGAG----GCA------GAAGTTG-CAGTG--AGCC-AAG---AT-CGC------GCCA-TT------GCACT-CCAG------CCT---GGGGGACAGAG 79
71   ----AGAGAACTGCT---T---GAACCCT---GGAG----GCG------GAAGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCA-CT------GCACT-CCAG------CCT---GGGCGACAGAG 79
189  ----AGAGAATTGCT---T---GAACCCT---GGAG----GCG------GAAGTTG-CAGTG--AGCC-AAG---AT-CGT------GCCA-CT------GCACT-CCAG------CCT---GGGCGACAGAG 79
171  ----AGAGAATTGCT---A---AACCTGG---AGGG----GCG------ACGGTTG-CAGTG--AGCT-GAG---AT-CGT------GCCA-CT------GCACT-CCAG------CCT---GGGCGACAGAG 79
184  ----AGAGAATTGCTT-AA---AAACCCT---AGAG----GTG------GAGCTGG-CAGTG--AGCC-AAG---AT-CGC------ACCA-CT------GCACT-CCAG------CCT---GGGTGACAGAG 81
219  ----AGAGAATTTCT---T---AAACCTG---GGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGT------GCCA-CT------GCACT-CCAT------CCT---GGGGGACAGAG 79
257  ----AGAGAATTGATT-GA---ACCCCC----AGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-TGC------GCCA-CT------GCACT-CCAG------CCT---GGGAGACAAAG 80
322  ----AGAGAATCGCTT-G----ACCCCG----GGAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---GT-TGC------GCCA-CT------GCATT-CCAG------CCT---GGGTGACAAAG 79
17   ------AGAATGG-CG-TG---AACCT----GGGAG----G------TGGAACTTG-CAGTG--AGCT-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 77
119  ------AGAATGG-CG-TG---AACCC----AGGAG----G------TGGAACTTG-CAGTG--AGCT-GAG---AT-TGC------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAA 77
263  ------AGAGTGG-CG-TG---AACCC----GGGAA----G------TGGAGCTTG-CAGTG--AGCT-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 77
212  ---AGAAGAATGG-CA-TG---AACCC----GGGAG----G------TGGAGCTTGGCAGTG--AGCT-GAG---AT-CAC------GCCACT------GGACG-CCAG------CCT---GGGCGACAGCG 81
264  ---AG-AGAATGG-CA-TG---AACCT----GGGAG----G------TGGAGCTTG-CAGTG--AGCT-GAG---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 79
77   ----AGAGAATGG-TG-TG---AACCC----AGGAACCTGGGAG--GCGGAGCTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GTACT-CCAG------CCT---GGGTGACAGAG 87
321  ----AGAGAATGG-CG-TG---AACCC----GGGAA---GG------GGAGCTTG-CAGTG--AGCA-GAG---AT-TGT------GCCACT------GGACT-CCAG------CCT---GGGCGACAGAG 79
273  ------AGAATGG-CG-TG---ATCCC----GGGAG----G------CGGAGCTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAA 77
306  ------AGAATGG-CG-TG---AACCC----GGGAG----G------TGGAGCTTG-CAATG--AGCT-GAG---AT-GGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 77
164  ------AGAATGG-CG-TG---AACCC----TGGAG----G------TGGAGCTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCATT-CCAG------CCT---GGGCGACAGAG 77
220  ----AGAGAATGG-CA-TG---AACCC----TGGAG----G------TGGAGCTTG-CAGTG--AGTT-GAG---AT-CGC------GCCACT------GCATT-GCAG------CCT---GGGTGACAGAG 79
310  ----AGAGAATGG-CG-TG---GACCC----GGGAG----G------TGGAGCTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGCGACAGGT 79
49   ----AGAGAATGA-TG-TG---AACCC----GGGAG----G------CGGAGCTTG-CAGTG--AGCC-GAG---AA-AGC------CACT------GAATT-CCAG------CCT---GGGCAACACAG 77
231  ----AGAGAATGG-TG-TG---GACCC----GGGAG----G------TGGAGCTTG-CAGTG--AGCC-AAG---AT-TGC------GCCATT------GCATT-CCAG------CCT---GGGCGACATAG 79
201  ----AGAGAATGG-TG-TG---AACCC----GGGAG----G------TGGAGCTTG-CAGTG--AGCC-GAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79
70   ------AGAATGG-CG-TG---AACCC----GGGAG----G------CGGATCTTG-CAGTG--AGCC-GAG---AT-TGC------GCCACT------GCGCT-CCAG------CCT---GGGCGACAAAG 77
156  ----AGAGAATGG-CG-TG---AACCC----GGGAG----G------CAGAGCTTG-CAGTG--AGCC-GAG---AT-CTC------ACCACT------GCACT-CCAG------CCT---GGGCGACAAAA 79

52 ----AGATAATGG-CG-TG---AACCC----GGGAG----A------CACA-CTTG-CAGTG--AGCC-AAG---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAA 78 277 ----AGAGAATGG-CG-TG---AACCC----GGGAG----G------CGGAGCTTG-CAGTG--AGCA-AAG---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGAGACAGAG 79 124 ----AGAGAATGG-CG-TG---AACCC----AGGAG----G------TGGAGCTTG-CAGTG--AGAC-AAG---AT-CAT------GTCACT------GCACC-CCAG------CCT---GGGCGACAGAG 79 287 ----AGAGAATGG-TG-TG---ACCCC----GGGAG----G------TGGAGCTTG-CAGTG--AGCC-AAG---AT-CGT------GCCACT------GCACC-CCAG------CCT---GGGTGACAGAG 79 57 ----AGAGAATGG-CG-CG---AACCC----AGGTG----G------CAGAGCTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGGG 79 247 ----AGAGGATGG-CG-TG---AACCC----AGGAG----G------CAGAGCTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCATT-CCAG------CCT---GGGTGACAGAG 79 47 ----AGAGAATGA-CG-AG---AACCC----GGGAG----G------CAGAGCTTG-CAGTG--AGCC-GAG---AT-CGC------GTTGCT------GCACT-CCAG------CCT---GGGCGACAGAG 79 112 ------AGAATGG-CG-TG---AACCC----GGGAG----G------CAGCGCTTG-CAGTG--AGCT-GAG---AT-AGC------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 77 288 ------AGAATGG-CG-TG---AACCT----GGGAG----G------CAGAGCTTG-CAGTG--AGCG-GAG---AT-AGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 77 117 ----AGAGAATGG-CA-TG---AACCT----GGGAG----G------CAGAGCTTG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GCACT-GCAG------CCT---GGGCGACAGAG 79 300 ----AGAGAATGG-CA-TG---AACCT----GGGAG----G------CAGAGCTTG-CAGTG--AGCA-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 79 39 ----AGAGAATGG-CC-TG---AGCCC----AGGAC------ACAGAGGTTG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GTACT-TTAG------CCT---GGGCGACAGAG 79 260 ------AGAATGG-CG-TG---AGCCC----AGGAG------ACGGAGCTTG-CAGTG--AGCT-GAG---AT-CAC------GCCACT------GCACT-CCAG------CCT---GGGCGACAGAG 77 169 ----AGAGAATGG-CG-TG---AACCC----AGGG------GCAGAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCACT------GCACT-CTAG------CCT---GGGCGACAGAG 78 249 
----AGAGAATGG-CA-TG---AACCC----GGGAG----G------CGGAACTTG-CAGTG--AGCT-GAG---AT-CAC------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79 292 ------AGAATAG-CA-TG---AACCC----GGGAG----G------CGGAGCTTG-CAGTG--AGCT-GAG---AT-CGC------GCCACT------GCACT-TCAC------TCT---GGGTGACAGAG 77 274 ----AGAGAATCG-TA-TG---AACCC----GGGAG----G------CTGAGGTTG-CAGTG--AGCT-GAA---AT-CGC------ACCACT------GCACT-GCAG------CCT---GGGTGACACAG 79 210 ----AGAGAATCG-TT-TG---AACCC----AGGAG----G------CGGAGGTTG-CAGTG--AGCT-GAG---AT-CGC------GCCACG------GCACC-CCAG------CCT---GGGTGACAGAG 79 64 ----AGAGAATCG-CT-TG---AACCC----AGGAG----G------CAGAGGTTG-CAGTG--AGCT-GAG---AT-CGT------GCCACT------GCACT-CCAA------CCT---GGGTGACAGAG 79 74 ----AGAGAATTG-CT-TG---AACCC----AGGAG----G------CAGAGGTTG-CCGTG--AGCT-GAG---AT-CCT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79 141 ----AGAGAATCA-CT-TG---AACCC----AGGAG----G------CAGAGGTTG-TAGTG--AGCC-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GG-TGACAGAG 78 190 ----AGAGAATCG-CT-TG---AACCC----AGGAG----G------CGGAAGTTG-CAGTG--AGCC-GAG---AT-CAC------GCCACT------GTGCT-CCAG------CCT---AGGTGACAGAG 79 205 ----AGAGAATTG-CT-TG---AACCC----AGGAG----G------CAGAGGTTT-CAGTG--AGCC-GAG---AT-TGC------ACCACT------GTGCT-CCAG------CCT----GGTGACATAG 78 339 ----AGAGAATCG-CT-TA---AACCC----AGAAG----G------CAGAGGTTG-CAGTG--AGCC-GAG---AT-CAC------GCCACT------GCACT-CCAG------CCT---AGGTGACAGAG 79 131 ----AGAGAATAG-GT-TG---AACCC----GGGAG----G------CAGAGGTTG-CAGTG--AGCC-AAG---AT-CGC------GCCACT------GCACT-CCAG------CTT---GGGTGACAGAG 79 203 ----AGAGAATCG-CT-TG---AATCC----GGAAG----G------CAGAGGTTG-CAGTG--AGCC-ACG---AT-CGC------TCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79 126 ----AGAGAACTG-CT-TG---AACCC----AGGAG----G------CAGAGGTTG-CAGTG--AGCC-AAG---AT-CGC------GCCACT------GCACT-CCAA------CCT---GGGTGACAGAG 79 150 ------AGAATAG-CT-TG---AACCC----TGGAG----G------CAGAGGTTG-CAGTG--AGCC-GAG---AT-CGC------GCCACT------GCACT-GTAG------CCT---GGGTGACAGGG 77 236 
----AGAGAATCG-CT-TG---AGCCC----AGGAG----GCG------AAGGTTG-CAGTG--AGCC-GAG---AT-TGC------GCCATT------GCACT-CCAG------CCT---GGGGGACAGAG 79 239 ----AGAGAATCG-CT-TG---AATCC----AGGAG----GCG------CAGGTTG-CAGTG--AGCC-GAG---AT-TGC------CCCACT------GCACT-CCAA------CCT---GGGGGACAGAG 79 13 ----AGAGAATCG-CT-TG---AACCC----AGGAG----GCG------AAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACTACT------GCACT-CCAG------CCT---GGGAGACAGAG 79 73 ----AGAGAATTG-CT-TT---AACCC----AGGAG----TCG------GATGTTG-AAGTG--GGCT-GAG---AT-TGC------ACCATT------GCATT-CCAG------CCT---GGGTGACAGAG 79 333 ----AGAGAATTG-CT-TG---AACCC----AGGAG----GCG------GAGGTTG-CAGTG--AGCC-GAG---AT-TGC------ACGATT------GCACT-CCAG------CCT---GGGCGACAGAG 79 147 ------AAATCG-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--ATCC-GAG---AT-TAC------ACCACT------GCACT-CCAG------CCT---GGGCGACAGAG 76 270 ------AAATCG-CT-TG---AACCC----AGGAG----TCG------GAGGTTG-CAGTG--AGCC-GAG---AT-TAC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 76 337 -ACAGGAGAATCG-TT-TG---AACCC----AGGAG----GTG------GAGGTTG-CTGTG--AGCC-GAG---AT-TGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 82 217 ----AGAGAATCG-CT-TG---AACCC----AGGAG----GTG------GAGATTG-CAGTG--AGCT-GAG---AT-AGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 79 290 ----AGAGAATCG-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTA--AGCT-GAG---AT-TGC------ACCACT------GCACT-CCAG------TCT---GGGTGACAGAG 79 289 ----AGAGAATCG-CT-TG---AACCT----AGGAG----GTG------GAGGTTG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GCACT-CCAG------CCT---TGGTGACAGAG 79 92 ----AGAGAATCG-CT-TG---AACCC----GGGAG----CTG------GAGGTTG-CAGTG--AGCT-GAG---AT-CGC------ACCATT------GTACA-CCAG------CCT---GGGGGACAGAG 79 227 ------AGAATAG-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AC-CAC------ACCACT------GCATA-CCAG------CCT---GGGGGACAGAG 77 153 ----GGAGAATCG-CT-CG---AACCC----GGGAG----GCG------GAGGTTG-CGGTG--AGCTTGAG---AT-CGC------ACCACT------GCATT-CCAG------CCT---GGGCGACAAAG 80 251 
----AGAGAATCG-CC-TG---AACCT----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-CGC------GTCACT------GCATT-CCAG------CCT---GGGTGACAGAT 79 314 ----AGAGAATCA-CT-TG---AACCT----GGGAG----ACG------GAGATGG-CAGTG--AGCC-GAG---AT-CGC------ACTACT------GCATT-CCAG------CCT---GGGTGACAGAG 79 323 ----AGAGAATCG-CT-TG---AACCC----GGGAG----GCG------GAGGTTG-CAATG--AGCC-GAG---AT-CGC------ACCACT------GCACT-CCAG------CCT---CGGTGACAGAG 79 110 ----AGAGAATCG-CT-TG---AACCC----AGGCG----GTA------GAGGTTGGCAGTG--AGCC-GAG---AC-AGC------TCCACT------GCACT-CCAG------CCT---AGGCGACAGAG 80 216 ----AGAGAATTG-CT-TG---AACCC----AGGAG----GTT------GAGGTTG-CAGCG--AGCA-GAG---AC-TGC------ATTACT------GCACT-CCAG------CCT---GGGCGACAAAG 79 338 ----AGAGAATCG-CT-TG---GACCC----AGGAG----GTG------GATGTTG-CAGTG--AGCA-GAG---AT-CGC------GTCACT------GTACT-CCAG------CCT---GGGCAACAGAG 79 60 ------AGATTG-CT-TG---AGCCC----AGGAG----GCAGG-----AGGTTG-CAGTG--AGCT-GAG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAAAG 77 75 ------ATATTG-TT-TA---AGCCC----GGGAG----GCAG------AGGTTG-CAGTG--AACT-GAG---AT-CGT------GCCACT------GCACT-CCAG------CCT---GGGCAACAAAG 76 200 ----AAAGAATTG-CT-TA---AGCCC----AGGAG----TTCA------AGGTTA-CAGTG--AGCT-ATG---AT-TGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAAGA 79 298 ----AGAGGATCG-CT-TG---AGCCC----AGGAG----GCAG------AGGTTA-TAGTG--AGCT-GAG---AC-TGC------GCCACT------GTACT-CCAG------CCT---GGGTGACAACA 79

106 ----AGAGAATCA-CT-TG---AGCCC----AGGAG----GTCA------AGGCTG-CAGTG--AGCT-GAG---AT-CAC------TCCACT------GCACT-CCAG------CTT---GGGTAACAAAA 79 340 ----AGAGAATCA-CT-TG---AGCCC----AGGAG----GTTG------AGGCTG-CGGTG--AGCT-GAG---GT-CGC------ACCACT------GCACT-CCAG------CCT---GGGTGACAAAG 79 85 ----AGAGAATCG-CT-TG---AACCC----AGGAG----G-CG------AGGCTA-CAGTG--AGCC-GAG---AT-AGC------GCCACT------GCACT-CCAG------CCT---GAGTGACAAAG 78 84 ------AAATCA-CT-TG---AACCC----CAGAG----ATG------GAGGCTG-TAGTG--AGCA-GAG---AT-CAT------GCCACT------TCACT-CCAG------CCT---GGGTGACAGAG 76 155 ----AGAGAATTG-CT-TG---AACCC----GGGAG----GTG------GAGGCCG-AGGTG--ACCA-GAG---AT-CAT------GCCACTGCATT---GCACT-CCAA------CCT---GGGTGACAGAG 84 302 ----AGAGAATCG-CT-TG---AACCC----AAGAG----GCG------GAGACTG-CAGTG--AGCC-AAG---AT-CAT------GCCATT------GCACT-CCAG------CCT---GGGCGACAGAG 79 2 ----AGAGAATCA-CT-TG---AACCC----AGGAG----GCG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAT------GCCATT------GCACT-ACAG------CCT---GGGTGACAGAG 79 22 ----AGAGAATTG-TT-TG---AACCC----AGAAG----GCG------GAGGTTG-CAGTG--AGCC-AAG---AT-CAC------ACCGTT------ATACT-CTGG------CCT---GGGTGACAGAG 79 67 ----AGAGAATTG-GT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCC-AAG---CT-CAT------GCCACT------GTACT-CCAG------CCT---GGGTGACAGAG 79 122 ----AGAGGATTG-CT-TG---AACCC----AGAAG----GTT------GAGGCTG-CAGTG--AGCC-GTG---AT-CAT------GCCACT------GTACT-CCAG------CCT---GGGTGACAGAG 79 272 ----AGAGGATCC-AT-TG---AGCCC----AGGAG----GTC------GAGGCTG-CTGTG--AGTC-GTG---AT-CGT------GCCACT------GTACT-TCAG------CCT---GGGCAACAGAG 79 128 ------AAACCA-CT-TG---AACCC----GGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---AT-CAT------GCCACT------GTACT-CTAG------CCT---GGGCAACAGAA 76 291 ---ACAAGAATCG-CT-TG---AACCC----AGGAG----GTG------GAAGTTG-CAGTG--AGCC-GAG---AT-CTT------GCCACT------GCACT-CCAG------CGT---GGGGAACAGAA 80 161 ----AAAGGATCA-CT-TG---AACCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCC-GAG---GT-CAC------GCCACT------GCACT-CCAG------CCT---GGGAGACAGAA 79 51 
------AGATCA-CT-TG---AACCC----AGGAG----TTG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 76 145 -ACAGGAAAATTG-TT-TG---AGCCC----AGGAG----GTG------GAGGTTG-CAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---AGGTGACAGAG 82 56 ---AGAGGAATTG-CT-TG---AACCC----AGAAG----GTA------GAGGTTG-CAGTG--TGCT-GAG---AT-CAC------ACCACT------GCACT-CCAG------CCT---GGGTGACAGAG 80 318 ---AGAAAAATTG-CT-TG---AACCC----AGTAG----GCA------GAG-TTG-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GCACT-CCAG------CAT---GGGTGACAGAA 79 175 ------AAATTA-CT-TG---AACCC----AGGAG----GCG------GAGGCTT-CAGTG--AGCT-GAG---AT-CGC------ACCACT------GCACT-CCAG------CCT---GGGCAACAGAT 76 65 ------AGATCACCT-----GAGCCCG----GGAG----CTG------GAGGTT-TCAGTG--AGCT-CAG---AT-GTG------CCACT------GCATT-CCAG------CCT---GGGCTACAGAG 75 237 ------AGATCGCTT-GATTGAGCCCG----GGAG----GCA------GAGGTT-GAAGTG--AGCT-TTG---ATTGGG------CCACT------GCACT-CCAG------CCT---GGGCAACAGAG 80 108 ------AGATCACTT-G----AGTCTG----GGAG----GTT------GAGGCTTGCTGTG--AGCC-AAG---ATCACT------CCAGT------GCACT-CTAG------CCT---GGGCAACAAAA 77 294 ------AGATCAATT-G----AACCTG----GGAG----GTT------GAGGCT-GTAGTG--AGCT-ATG---ATCATG------CCACT------GCAGT-CCAG------CCT---GGGGAACAGAA 76 293 ------AGATCATCT-G----AGCCTG----GGAG----GTT------GAGGCT-GCAGTG--AACC-ATG---ATCGTG------TCACT------GCACT-CCAG------CCC---AGGCAACAAGA 76 304 ------AAATCATCT-GG---ACCCAG----GGAG----GTC------AAGGCA-GCAGTG--AGCC-TTG---ATTGTG------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAA 78 83 ------AGATCACTT-G----AACCTG----GGAA----GTT------GAGGCT-GCAGTG--AGCT-GAG---AGCGTG------CCACT------ACACT-CCAA------CCT---GGGCAACAAAG 76 166 ------AGATCAATT-G----AGCCTG----GGAG----GTT------GAGGTT-GCAGTG--AGCT-GTG---ATTGCA------CCACT------GTACT-CCAG------CCT---GGGCCACAGAG 76 27 ------AG-ATCG-CT-TG---AGCCTAGCTAGGAGT---TTG------AGGCTG-AAGTG--AGCTAT-----GGTCTC------ACCAGT------GTACT-CCAG------CCT---GGGCAACAGAG 80 238 
------AG-ATCG-CT-TC---AGCCCA----GGAGC---TTG------AGGCCG-CAGTG--AGCTAT-----GATTGC------ACCTCT------GAACT-CCAG------CCT---GGGCAACAGAG 76 21 ------AG-ATCA-CT-TG---AGCCCA----GGAGT---GTG------AGGTTG-CAGTG--AGCTAT-----GAATGC------ACCACT------ACACT-CCAG------CGT---GGGCAACAGAG 76 132 ------AG-ATCAGTT-TG---AGCCCA----GGAGT---TTG------AGGTTG-CAGTG--AGCCAT-----GATAGC------GCCACT------GCACT-GCAG------CCT---GGGCAACAAAG 77 30 ------AG-ATCA-CT-TG---AGCCCA----GAAGG---TCA------AGGCTG-CAGTG--AGCTGT-----GATTAT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 76 283 ------AG-ATCA-CT-TG---AGCCCA----GAAGT---TCA------AGGCTG-CAGTG--AGCCGT-----GATCAT------GCCACT------GTACT-CCAG------CCT---GGGCAACAGAC 76 97 ------AG-ATCA-C---G---AGCCCA----ACAGC---TCC------AAGTTA-CAGTG--AGCTAT-----TATTCT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 74 173 ------AG-ATCA-CT-TG---AGCCCA----GGAGT---TCA------AGGCTA-CAGTG--AACTAT-----GATATT------GCCACC------ATACT-CCAG------CCT---GGGCAACAGAG 76 187 ------AG-ATCA-AT-TG---AGCCTG----GGAGT---TCA------AGGCTA-CAGTG--AGCTGT-----GATTGT------GGCACT------GCACT-CCAT------TCT---AGGCAACAGAG 76 98 ------AG-ATCA-CT-TG---AGCCCA----GGAGT---TTG------AGGCTG-CAGTG--AGCTAG-----GATTAT------GCCACT------GCACT-CTAG------CCT---GGGCAACAGAG 76 123 ACAGGAGG-ATCA-CT-T----AGCCCA----GGAGT---TTG------AGGCTA-TAGTG--AGCAAT-----GATTATGATCGTGCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 87 228 ------AG-ATCA-CT-TG---AGCCCA----GGAGT---TAG------AGGCTG-CAGTG--AGCTAT-----GATCAT------GCCACC------GCAAT-CCAG------CTT---GGGCAACAGAA 76 138 ------AG-ATCA-TC-TG---AGCCCA----GGAGG---TCG------AGGCTG-CAGTG--AGCCAT-----GATCTAGCTA--GCCATT------GTACT-TCAG------CCT---GGGTGACAGTG 80 267 ------AG-ATCA-TT-TA---AGCCCA----GGAGT---TCA------AGGCTG-CAGTG--AGCCAT-----GATT--GC----GCCACT------GTACT-CCAG------CCT---GGGTAACAGAG 76 38 ------AG-ATTG-TG-TG---AGTCCA----GGAAT---TGG------AGGCTG-CAGTG--AGCTAT-----AATTGTGC-----CAACT------GTACT-CCAG------CCT---GGGCAACAGTT 77 234 
------AG-ATCA--C-TA---AGCCCA----AGAGG---TGG------AGGCTG-CAGTG--AGCCGT-----AATTGTGC------CACT------GGGCT-CCAG------CAT---GGGCAACAGAG 75 72 ------AG-ATCACTT--G---AGCCCA----GGAGG---TTG------AGGCTG-CAGTG--AGCTGA-----GATTGC------ACCAT------T-CCAG------CCT---GGGCAACAGAG 71 104 ------AG-ATCACTT--G---AGCCCA----GGAGT---TTG------AGGCTG-CAGTG--AGCCGT-----GATCAC------ACAATG------GCAGT-GCAG------CCT---GGGCAACAGAG 76 218 ------AG-ATCACTT--G---AGCCCA----GGAGG---TTG------AGGCTG-TGGTA--AACTGT-----GAGTGT------GCCACC------ACACT--CAG------CCT---GGGTGACAGAG 75 280 ------AG-ATCACTTGAG---AGCCCA----GGAGG---TGG------AGGTTG-GGGTG--AGCTGA-----GATCGC------GCTACC------GCATT-CCAC------CCT---GGGTGACAGAG 78 178 ------AG-ATCA-CT-TG---AACCCA----GGAGT---TCA------AGGCTG-CAGTG--AGCTAA-----GACTGT------GCCACT------GCACT-CCAG------CCT---GGGTGACAGAG 76 191 ------AG-ATCG-CT-TG---AGCCCA----GGTGG---TCA------AGGCTG-CAGTG--AGCTAT-----GATTGT------GCTACT------GCACT-CCAG------CCT---GGGCGACAGAG 76 133 ------AG-ATCA-CT-TG---AGCCCG----GGAAG---TTG------AGGCTG-TAGTG--AGCCAT-----GCTCGT------GCCGCT------GCGCT-CCAA------CCT---GGGTGACAGAC 76 197 ------AG-ATGA-CT-TG---AGCCTG----GGATT---TCG------AGGCTG-CAGTG--AGTTGT-----GTTCTT------GCCACT------GCATT-CTAA------CCT---GGGTGACAGAG 76 142 ------AG-ATTG-CT-TG---AGCCCA----GGAAG---TTGG------TGGCTG-CAGTG--AGCTGT-----GGTTGC------GCCACT------GCACT-CTAG------TCT---GGGTGACAGAG 77

222 ----GGAGAATCC-CT-TG---AGCCCA----GCAGT---TCA------AGGCTA-CAGTG--AGCTAT-----AATTGC------ACCCCT------GCACT-CCTG------CCT---GGGTGACAGAG 79 271 ----GGAGGATCC-CT-TG---AGCCCA----GGGG------G------AGGCTA-CAGTG--AGCTATGGTCTGTTTGC------ACCACT------GTACT-CCCG------CTT---GGGCAACAGTG 81 179 ----AGAGGATCG-TT-TG---AGCCCA----GGAGT---TTG------AGGCTG-CAGTG--AGCTAT-----GATCGC------ACCACT------GCACT-CCAG------CCTT--GGGGGACAGAG 80 246 ----GGAGGATTA-CT-TG---TGCCTA----GGAGT---TGG------AGAGTG-CACTG--AGCTAT-----GATTGC------ACCACT------GCACT-CCAG------CCT---GGCTGACAGAA 79 93 ----GGAGGATCA-CT-TG---AGCCCA----GGAGG---TAG------AGGCTG-CAGTG--AGCCAT-----GATCGT------ACCACT------GCACT-CCAG------CCA---GGGTGACAGAG 79 20 ------AG-ATTACT--TG---AGCCC----AGGAG----TTG------AATACAG-CAGTG--AGCT-GAG---AC-CAT------GCCACT------GCATT-CCAG------GCT---GGGCAACAGAG 76 308 ---GGATG-ATTGCT--TG---AGCCC----AGGAG----TTC------AAGACTG-CAGTG--AGCT-GTG---AT-CAC------GCCCCT------GCGCT-CCAC------CCT---GGGCAACAAGA 79 40 ------AG-ATCACT--TG---AGCCC----AGGAG----CTG------GAGATTG-TAGTG--AGCT-GAG---AT-CAC------ACCACT------GCACT-CCAC------CCT---GAGCAACAGCA 76 160 ---AGAGG-ATCGCT--TG---AGCCC----AGGAA----CTG------GAGGCTG-CAGTG--AGTT-GTG---AT-CAT------ACAACT------GCACT-CCAG------GCT---GGGCAACAGAA 79 25 ------AGAATCACT--TG---AGCCC----AGGAA----TTC------AGGGTTG-CAGTG--AGAT-GTG---AT-CAC------ACCTCT------GCATG-TCAA------CCT---GGGCGACAGAG 80 23 ----ACAG-AACG----AG---ACCCT----GATAG----TTT------GAGGATG-CAGTG--AGCT-GTA---AT-CAT------GCCACT------GCACT-CCAG------CCT---GGGCAACAGAG 76 44 ----AGAGGATTGTT--TG---AGCCC----AGGAG----TTT------GAGGTCA-CAGTG--AGCT-GTA---GT-CAT------GCCACT------GCACT-CCAG------CCT---AGGCAACAGAA 79
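To illustrate how the rows of the alignment above can be read, the following minimal Python sketch strips the gap characters from a row, compares the ungapped length with the value printed on the right, and tests whether an ACA triplet is followed by exactly three nucleotides at the 3' end. It is not part of the original supplement: the function names are hypothetical, and it assumes that each row consists of an AluACA RNA number, a gapped sequence and a length, separated by whitespace. The two sample rows (AluACA20 and AluACA44) are copied from the alignment.

```python
def parse_row(row):
    """Split one alignment row into (rna_number, ungapped_sequence, stated_length).

    Assumes the row has exactly three whitespace-separated fields.
    """
    rna_number, gapped, stated_length = row.split()
    return int(rna_number), gapped.replace("-", ""), int(stated_length)


def has_aca_box(sequence):
    """Return True if ACA is followed by exactly three nucleotides at the 3' end."""
    return sequence[-6:-3] == "ACA"


# Two rows copied from the alignment (AluACA20 and AluACA44).
ROWS = [
    "20 ------AG-ATTACT--TG---AGCCC----AGGAG----TTG------AATACAG-CAGTG--AGCT-GAG---AC-CAT------GCCACT------GCATT-CCAG------GCT---GGGCAACAGAG 76",
    "44 ----AGAGGATTGTT--TG---AGCCC----AGGAG----TTT------GAGGTCA-CAGTG--AGCT-GTA---GT-CAT------GCCACT------GCACT-CCAG------CCT---AGGCAACAGAA 79",
]

for row in ROWS:
    number, sequence, stated = parse_row(row)
    # Report whether the ungapped length agrees with the stated length
    # and whether the ACA box sits at the expected position.
    print(number, len(sequence) == stated, has_aca_box(sequence))
```

A disagreement between the counted and the stated length for a given row would point to a transcription problem in that row rather than to a property of the RNA itself.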


Jady_Supplemental_Table 2

Family (No) AluACA RNA
AluSx (45): 2, 6, 26, 33, 42, 50, 54, 55, 64, 78, 79, 81, 84, 85, 101, 107, 110, 116, 129, 130, 139, 141, 148, 151, 168, 176, 181, 198, 213, 223, 225, 233, 251, 257, 265, 274, 301, 318, 323, 325, 327, 332, 333, 338, 339.
AluSx1 (45): 5, 29, 31, 34, 41, 56, 66, 67, 69, 95, 113, 114, 118, 120, 121, 131, 144, 145, 147, 150, 159, 175, 182, 195, 202, 236, 239, 245, 252, 254, 255, 266, 275, 289, 290, 295, 297, 302, 314, 315, 320, 324, 331, 343, 347.
AluJb (37): 4, 7, 30, 44, 51, 63, 72, 75, 83, 105, 106, 108, 109, 149, 166, 172, 174, 178, 185, 191, 197, 204, 218, 234, 237, 244, 268, 272, 280, 282, 284, 293, 294, 304, 319, 330, 340.
AluY (37): 17, 39, 47, 49, 52, 57, 70, 77, 112, 117, 119, 124, 156, 163, 164, 169, 201, 212, 220, 231, 247, 249, 253, 260, 263, 264, 273, 277, 287, 288, 292, 300, 306, 310, 311, 321, 336.
AluSz (36): 9, 18, 36, 62, 71, 73, 74, 76, 89, 92, 96, 126, 128, 137, 154, 155, 157, 161, 170, 186, 207, 227, 232, 241, 242, 258, 259, 270, 278, 279, 281, 299, 312, 335, 337, 341.
AluSq2 (30): 8, 10, 11, 14, 16, 19, 37, 45, 46, 53, 58, 61, 125, 152, 161, 165, 180, 193, 211, 221, 229, 248, 256, 261, 296, 307, 309, 326, 329, 342.
AluJo (20): 1, 3, 20, 23, 38, 40, 60, 93, 104, 123, 135, 142, 160, 173, 179, 187, 209, 222, 246, 298.
AluJr (18): 21, 25, 27, 65, 88, 90, 98, 122, 132, 133, 138, 228, 238, 267, 283, 308, 328, 345.
AluSx3 (13): 12, 13, 22, 28, 87, 99, 103, 115, 134, 183, 184, 188, 190.
AluSz6 (13): 15, 59, 80, 136, 146, 167, 196, 216, 240, 243, 262, 291, 316.
AluSq (11): 32, 43, 102, 158, 199, 230, 235, 250, 269, 276, 344.
AluSp (10): 24, 68, 94, 100, 111, 192, 224, 285, 317, 334.
AluSg (9): 48, 82, 86, 153, 206, 305, 313, 322, 348.
AluSg7 (6): 171, 214, 215, 219, 226, 286.
AluSc8 (6): 35, 143, 177, 189, 203, 210.
AluSc (5): 91, 127, 194, 205, 217.
AluSg4 (3): 140, 208, 346.
AluJr4 (3): 97, 200, 271.
AluSx4 (1): 303.
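As a consistency check on the family sizes stated in parentheses above, the following minimal Python sketch sums them and groups them by major Alu subfamily. It is not part of the original supplement. The dictionary holds only the stated "(No)" values, not a recount of the listed RNA numbers, and the grouping rule (the first four characters of the family name, giving AluJ, AluS and AluY) and all identifiers are assumptions made for illustration.

```python
from collections import Counter

# Family sizes as stated in parentheses in the table above.
FAMILY_COUNTS = {
    "AluSx": 45, "AluSx1": 45, "AluJb": 37, "AluY": 37, "AluSz": 36,
    "AluSq2": 30, "AluJo": 20, "AluJr": 18, "AluSx3": 13, "AluSz6": 13,
    "AluSq": 11, "AluSp": 10, "AluSg": 9, "AluSg7": 6, "AluSc8": 6,
    "AluSc": 5, "AluSg4": 3, "AluJr4": 3, "AluSx4": 1,
}


def major_subfamily(family):
    """Map a family name such as 'AluSx1' to its major subfamily ('AluS')."""
    return family[:4]


def tally_by_major(counts):
    """Sum the stated family sizes within each major subfamily."""
    totals = Counter()
    for family, size in counts.items():
        totals[major_subfamily(family)] += size
    return dict(totals)


print(len(FAMILY_COUNTS), "families,", sum(FAMILY_COUNTS.values()), "RNAs in total")
print(tally_by_major(FAMILY_COUNTS))
```

Under these assumptions the stated sizes cover 19 families and add up to 348 RNAs, distributed as 233 AluS, 78 AluJ and 37 AluY.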

Supplemental Table 2. The 348 novel intron-encoded human AluACA RNAs fall into 19 different subgroups of the three major Alu subfamilies.

Jady_Supplemental_Table 3

Supplemental Table 3. List of the host genes of human intron-encoded AluACA RNAs.

AluACA1, SEC24A, SEC24 related family, member A
AluACA2, HINT1, cDNA: FLJ22904 fis
AluACA3, FBXO10, F-box 10
AluACA4, TRIM24, transcriptional intermediary factor 1 alpha
AluACA5, DNM2, 2 isoform 4
AluACA6, TGS1, trimethylguanosine synthase homolog
AluACA7, PARP1, poly (ADP-ribose) polymerase family, member 1
AluACA8, NUTF2, moderately similar to Nuclear transport factor 2
AluACA9, RIMKLB, ribosomal modification protein rimK-like family
AluACA10, ITFG1, integrin alpha FG-GAP repeat containing 1
AluACA11, No apparent host gene
AluACA12, methyltransferase like 10
AluACA13, Human 19 open reading frame 70 (C19orf70)
AluACA14, HscB iron-sulfur cluster co-chaperone homolog (E. coli)
AluACA15, ERLIN1, Endoplasmic reticulum lipid raft-associated protein 1
AluACA16, TMEM11, Homo sapiens cDNA, FLJ92237
AluACA17, BCOR, Putative uncharacterized protein BCOR
AluACA18, SMURF1, Smad ubiquitination regulatory factor 1 isoform
AluACA19, PDHA1, pyruvate dehydrogenase E1 alpha 1 precursor
AluACA20, PKN2, protein kinase N2
AluACA21, THOC2, THO complex 2
AluACA22, LRRFIP1, leucine rich repeat (in FLII) interacting
AluACA23, SRGAP2, SLIT-ROBO Rho GTPase activating protein 2
AluACA24, ATP6V1A, ATPase, H+ transporting, lysosomal 70kDa, V1 subunit A
AluACA25, SMG-1, PI-3-kinase-related kinase
AluACA26, PIK3R3, phosphoinositide-3-kinase, regulatory subunit 3
AluACA27, ACLY, ATP citrate isoform 1
AluACA28, TMEM87A, transmembrane protein 87A isoform 1
AluACA29, C9orf11, Acr formation associated factor isoform 1
AluACA30, TTBK, tau kinase 2
AluACA31, MTHFD1L, methylenetetrahydrofolate dehydrogenase (NADP+)
AluACA32, SPSB1, splA/ryanodine receptor domain and SOCS box containing 1
AluACA33, DAP3, death-associated protein 3
AluACA34, EXOC3, Sec6 protein
AluACA35, ZNF778, zinc finger protein 778
AluACA36, LEPRE1, leprecan 1 isoform 1
AluACA37, HECTD1, HECT domain containing 1
AluACA38, CABIN1, calcineurin binding protein 1
AluACA39, DPP9, dipeptidylpeptidase 9
AluACA40, C2orf86, clone 24963 mRNA sequence
AluACA41, CCDC104, coiled-coil domain containing 104
AluACA42, CCNC, cyclin C isoform b
AluACA43, No apparent host gene
AluACA44, HTT, huntingtin
AluACA45, MED13, mediator complex subunit 13
AluACA46, PTPRG, protein tyrosine phosphatase, receptor type, G precursor variant protein
AluACA47, RNF170, ring finger protein 170 isoform b
AluACA48, PDIA5, protein disulfide A5 precursor
AluACA49, RASSF8, Ras association (RalGDS/AF-6) domain family
AluACA50, BTBD7, BTB/POZ domain-containing protein 7
AluACA51, NR2C1, nuclear receptor subfamily 2, group C, member 1
AluACA52, NCKAP1, NCK-associated protein 1
AluACA53, ANKRD13C, ankyrin repeat domain 13C
AluACA54, HEATR5A, HEAT repeat containing 5A
AluACA55, MBOAT, membrane bound O-acyltransferase domain
AluACA56, MTHFD1L, methylenetetrahydrofolate dehydrogenase (NADP+)
AluACA57, DHX33, DEAH (Asp-Glu-Ala-His) box polypeptide 33
AluACA58, ADSL, adenylosuccinate lyase isoform a
AluACA59, TTC1, tetratricopeptide repeat domain 1
AluACA60, CEP290, centrosomal protein 290kDa
AluACA61, ZCCHC10, zinc finger, CCHC domain containing 10
AluACA62, BAT3, HLA-B associated transcript-3 isoform b
AluACA63, No apparent host gene
AluACA64, ME1, cytosolic malic 1
AluACA65, KIAA0146, putative protein
AluACA66, TPRKB, TP53RK binding protein
AluACA67, ATXN3, ataxin 3, transcript variant am, non-coding RNA
AluACA68, KPRD2, Regulation of nuclear pre-mRNA domain-containing protein 2
AluACA69, NBPF, hypothetical protein LOC400818
AluACA70, PPA2, (inorganic) 2

AluACA71, NFE2L2, nuclear factor (erythroid-derived 2)-like 2
AluACA72, SNUPN, snurportin 1
AluACA73, UBAP2L, associated protein 2-like
AluACA74, SESTD1, SEC14 and spectrin domains 1
AluACA75, MRPL48, mitochondrial ribosomal protein L48
AluACA76, ZNF266, zinc finger protein 266
AluACA77, DDX46, DEAD (Asp-Glu-Ala-Asp) box polypeptide 46
AluACA78, SMG5, Smg-5 homolog, nonsense mediated mRNA decay factor
AluACA79, FOXN2, Forkhead box protein N2
AluACA80, SUPT16H, Facilitates chromatin transcription complex subunit SPT16
AluACA81, TSHZ2, teashirt zinc finger homeobox 2
AluACA82, No apparent host gene
AluACA83, No apparent host gene
AluACA84, CHRM3, N10 NTera2D1 teratocarcinoma, m3 muscarinic acetylcholine receptor
AluACA85, TBC1D23, TBC1 domain family, member 23
AluACA86, LRP6, low density lipoprotein receptor-related protein
AluACA87, UEVLD, ubiquitin-conjugating enzyme E2-like isoform b
AluACA88, RBCK1, RanBP-type and C3HC4-type zinc finger containing
AluACA89, IGF1R, insulin-like growth factor 1 receptor precursor
AluACA90, TTLL11, tubulin tyrosine -like family, member 11
AluACA91, GAPVD1, GTPase activating protein and VPS9 domains 1
AluACA92, ZNF346, zinc finger protein 346
AluACA93, CAST, calpastatin isoform i
AluACA94, USP9X, ubiquitin specific protease 9, X-linked isoform
AluACA95, EPHA4, Ephrin type-A receptor 4
AluACA96, APC, adenomatous polyposis coli
AluACA97, DCP1A, DCP1 decapping enzyme homolog A
AluACA98, AK000470, cDNA FLJ20463 fis
AluACA99, MAP2K1, mitogen-activated protein kinase kinase 1
AluACA100, No apparent host gene
AluACA101, ULK4, Serine/threonine-protein kinase ULK4
AluACA102, MAP4K5, mitogen-activated protein kinase kinase kinase
AluACA103, KHDRBS1, KH domain containing, RNA binding, signal
AluACA104, AKAP8L, kinase (PRKA) anchor protein 8-like
AluACA105, CREBBP, CREB binding protein isoform a
AluACA106, C5orf32, hypothetical protein LOC84418
AluACA107, HGS, hepatocyte growth factor-regulated tyrosine
AluACA108, C12orf48, UPF0419 protein
AluACA109, LAT, linker for activation of T cells isoform b
AluACA110, CREBBP, CREB binding protein isoform a
AluACA111, No apparent host gene
AluACA112, PTPN12, protein tyrosine phosphatase, non-receptor type
AluACA113, GRLF1, glucocorticoid receptor DNA binding factor 1
AluACA114, KDM2A, lysine (K)-specific demethylase 2A
AluACA115, KDM2B, F-box and leucine-rich repeat protein 10 isoform
AluACA116, ARHGAP5, Rho GTPase activating protein 5 isoform b
AluACA117, ERICH1, glutamate-rich 1
AluACA118, RPA1, replication protein A1
AluACA119, HSPA4L, heat shock 70kDa protein 4-like
AluACA120, ZER1, zyg-11 homolog B
AluACA121, NBPF20, hypothetical protein LOC400818
AluACA122, IFT52, intraflagellar transport 52 homolog
AluACA123, C20orf4, hypothetical protein LOC25980
AluACA124, CDC2L1, cell division cycle 2-like 1 (PITSLRE )
AluACA125, NIPBL, Nipped-B-like protein
AluACA126, RPS10, ribosomal protein S10
AluACA127, BTF3L4, basic 3-like 4 isoform 2
AluACA128, PSMB2, beta 2 subunit
AluACA129, TFB1M, transcription factor B1, mitochondrial
AluACA130, KDM6A, ubiquitously transcribed tetratricopeptide
AluACA131, PUS7L, pseudouridylate synthase 7 homolog
AluACA132, FNTB, farnesyltransferase, CAAX box, beta
AluACA133, FOXK2, Forkhead box protein K2
AluACA134, No apparent host gene
AluACA135, MNAT1, menage a trois 1 (CAK assembly factor)
AluACA136, AP3B1, adaptor-related protein complex 3, beta 1
AluACA137, CMAR, cell matrix adhesion regulator variant (CMAR) mRNA
AluACA138, EFNA5, ephrin-A5 precursor
AluACA139, UST, uronyl-2-sulfotransferase
AluACA140, OTUD4, OTU domain containing 4 protein isoform 3
AluACA141, MORN1, MORN repeat containing 1
AluACA142, GDI2, GDP dissociation inhibitor 2 isoform 2
AluACA143, TRIM54, ring finger protein 30 isoform 1
AluACA144, MYO19, XIX isoform 3
AluACA145, RNF130, ring finger protein 130 precursor
AluACA146, PPP3CC, protein phosphatase 3, catalytic subunit, gamma
AluACA147, UBR2, ubiquitin protein ligase E3 component n-recognin

AluACA148, TNRC6B, trinucleotide repeat containing 6B isoform 3
AluACA149, DOPEY2, dopey-2 protein
AluACA150, VKORC1L1, vitamin K epoxide reductase complex, subunit 1-like 1
AluACA151, POLR3E, RNA polymerase III polypeptide E
AluACA152, ZNF81, zinc finger protein 81
AluACA153, HERC2, hect domain and RLD 2 (HERC2)
AluACA154, FBXL3, F-box and leucine-rich repeat protein 3
AluACA155, TSHZ2, teashirt zinc finger homeobox 2
AluACA156, PIGQ, phosphatidylinositol glycan anchor biosynthesis
AluACA157, ARFGEF2, ADP-ribosylation factor guanine
AluACA158, UBE4B, ubiquitination factor E4B isoform 2
AluACA159, KIAA0196, strumpellin
AluACA160, RHPN2, rhophilin, Rho GTPase binding protein 2
AluACA161, MLL3, myeloid/lymphoid or mixed-lineage leukemia 3
AluACA162, LAMA3, LAMA3 protein
AluACA163, ZNF681, zinc finger protein 681
AluACA164, SNRK, SNF related kinase
AluACA165, FBXO25, F-box only protein 25 isoform 2
AluACA166, LRRC20, leucine rich repeat containing 20 isoform 1
AluACA167, PDS5B, regulator of cohesion maintenance, homolog
AluACA168, GABRR2, gamma-aminobutyric acid (GABA) receptor, rho 2
AluACA169, BMS1, BMS1-like, ribosome assembly protein
AluACA170, TBCE, beta-tubulin cofactor E
AluACA171, HNRPLL, heterogeneous nuclear ribonucleoprotein L-like
AluACA172, MTMR15, myotubularin related protein 15 isoform a
AluACA173, WNK1, lysine deficient protein kinase 1
AluACA174, NF1, neurofibromin isoform 2
AluACA175, WIPI1, WD repeat domain phosphoinositide-interacting protein 1
AluACA176, KIAA1009, Protein QN1 homolog
AluACA177, DAAM1, dishevelled-associated activator of morphogenesis 1
AluACA178, AP1G1, adaptor-related protein complex 1
AluACA179, OATL1, Ornithine aminotransferase-like 1
AluACA180, NUFIP2, Nuclear fragile X mental retardation-interacting protein 2
AluACA181, MAN1A2, mannosidase, alpha, class 1A, member 2
AluACA182, RNF103, ring finger protein 103
AluACA183, UBAP2, Ubiquitin-associated protein 2
AluACA184, C1orf159, hypothetical protein LOC54991
AluACA185, USP34, ubiquitin specific protease 34
AluACA186, UBR2, Ubiquitin-protein ligase E3-alpha-II
AluACA187, ASAP1, Development and differentiation enhancing factor 1
AluACA188, DAGLB, diacylglycerol lipase, beta isoform 1
AluACA189, SUGT1P, suppressor of G2 allele of SKP1 pseudogene (S. cerevisiae) (SUGT1P), non-coding RNA
AluACA190, FRYL, furry-like protein
AluACA191, ERC1, RAB6-interacting protein 2 isoform delta
AluACA192, AMDHD1, highly similar to Homo sapiens amidohydrolase domain containing 1
AluACA193, BCAT1, branched chain aminotransferase 1
AluACA194, TAOK3, TAO kinase 3
AluACA195, CA5BP, Putative carbonic anhydrase 5B-like protein
AluACA196, MAP7, microtubule-associated protein 7
AluACA197, MCC, mutated in colorectal isoform 1
AluACA198, MAP2K6, mitogen-activated protein kinase kinase 6
AluACA199, WHSC2, Wolf-Hirschhorn syndrome candidate 2 protein
AluACA200, SH3RF3, SH3 domain containing ring finger 3
AluACA201, ANKS3, Ankyrin repeat and SAM domain-containing protein 3
AluACA202, EHBP1, EH domain binding protein 1 isoform 3
AluACA203, PCGF6, polycomb group ring finger 6 isoform a
AluACA204, MPP7, palmitoylated membrane protein 7
AluACA205, HAUS2, centrosomal protein of 27 kDa
AluACA206, SH3D19, SH3 domain containing 19 isoform b
AluACA207, DNAJC16, DnaJ (Hsp40) homolog, subfamily C, member 16
AluACA208, ASH1L, absent, small, or homeotic 1-like
AluACA209, DENND1B, DENN/MADD domain containing 1B
AluACA210, C12orf5, putative ORF
AluACA211, DGCR8, DiGeorge syndrome critical region 8
AluACA212, DENND5B, DENN/MADD domain containing 5B
AluACA213, NRF1, nuclear respiratory factor 1
AluACA214, ACADS, short-chain acyl-CoA dehydrogenase precursor
AluACA215, BID, BH3 interacting domain death agonist isoform 1
AluACA216, PTPRK, protein tyrosine phosphatase, receptor type, K
AluACA217, EZH1, enhancer of zeste homolog 1
AluACA218, TRIM9, tripartite motif protein 9
AluACA219, hypothetical protein LOC129293 precursor
AluACA220, COL4A3BP, alpha 3 type IV collagen binding protein
AluACA221, WDHD1, similar to WD repeat and HMG-box DNA-binding protein 1
AluACA222, ARL2BP, binder of ADP-ribosylation factor (ARF)-like proteins
AluACA223, APPL2, adaptor protein, phosphotyrosine interaction, PH

AluACA224, SPPL2A, signal peptide peptidase-like 2A
AluACA225, PI4KA, phosphatidylinositol 4-kinase type 3 alpha
AluACA226, COPA, coatomer protein complex, subunit alpha isoform
AluACA227, CLIP1, restin isoform a
AluACA228, AK055993, putative protein
AluACA229, MYST3, MYST histone acetyltransferase
AluACA230, WD424381, EST
AluACA231, BANP, BTG3 associated nuclear protein isoform a
AluACA232, CCT2, T-complex protein 1 subunit beta
AluACA233, TAOK1, TAO kinase 1
AluACA234, ATRIP, ATR-interacting protein
AluACA235, RAB20, member RAS family
AluACA236, ALS2CR12, amyotrophic lateral sclerosis 2
AluACA237, DDX39, DEAD (Asp-Glu-Ala-Asp) box polypeptide 39
AluACA238, ZCCHC2, zinc finger, CCHC domain containing 2
AluACA239, GNA12, guanine binding protein ()
AluACA240, ZC3H7A, zinc finger CCCH-type containing 7A
AluACA241, WDYHV1, WDYHV motif containing 1
AluACA242, TUBGCP5, tubulin, gamma complex associated protein 5
AluACA243, ARID1B, AT rich interactive domain 1B
AluACA244, NFATC2, nuclear factor of activated T-cells
AluACA245, DHDDS, dehydrodolichyl diphosphate synthase isoform a
AluACA246, PLCL2, phospholipase C-like 2 isoform 1
AluACA247, PPP2CB, protein phosphatase 2, catalytic subunit, beta
AluACA248, ASPM, asp (abnormal spindle)-like, microcephaly
AluACA249, AKAP2, A kinase (PRKA) anchor protein 2 isoform 2
AluACA250, C5orf42, hypothetical protein LOC65250
AluACA251, EXT1, exostosin 1
AluACA252, DENND4B, DENN/MADD domain containing 4B
AluACA253, PKD1L1, polycystin-1L1
AluACA254, LHFPL2, lipoma HMGIC fusion partner-like 2
AluACA255, TCTEX1D2, Tctex1 domain containing 2
AluACA256, FBXO21, F-box only protein 21 isoform 2
AluACA257, GGNBP2, Gametogenetin-binding protein 2
AluACA258, GPR156, G protein-coupled receptor 156
AluACA259, KIAA0922, Hypothetical protein
AluACA260, ADSL, lyase isoform b
AluACA261, WDR70, WD repeat domain 70
AluACA262, AP2A1, adaptor-related protein complex 2, alpha 1
AluACA263, RCOR3, REST corepressor 3 isoform b
AluACA264, SDHC, succinate dehydrogenase complex, subunit C
AluACA265, ATP6V0A1, ATPase, H+ transporting, lysosomal V0 subunit a1
AluACA266, CF529189, Spliced EST
AluACA267, HSDL2, hydroxysteroid dehydrogenase like 2
AluACA268, ZC3H7B, zinc finger CCCH-type containing 7B
AluACA269, UBA3, ubiquitin-activating enzyme 3 isoform 2
AluACA270, SEC24A, similar to yeast protein transport protein Sec24A
AluACA271, MEMO1, mediator of cell motility 1 isoform 2
AluACA272, AGPS, alkyldihydroxyacetone phosphate synthase
AluACA273, TTC28, tetratricopeptide repeat domain 28
AluACA274, DDX50, nucleolar protein GU2
AluACA275, LAMA5, laminin alpha 5 precursor
AluACA276, EPB41L4A, erythrocyte protein band 4.1-like 4
AluACA277, RAP2A, member of RAS oncogene family precursor
AluACA278, PNPLA3, patatin-like phospholipase domain containing 3
AluACA279, PPP2R5E, epsilon isoform of regulatory subunit B56
AluACA280, SEC14, SEC14 and spectrin domains 1
AluACA281, TNPO1, transportin 1 isoform 2
AluACA282, GPBP1, GC-rich promoter binding protein 1 isoform 1
AluACA283, PTPN14, protein tyrosine phosphatase, non-receptor type
AluACA284, LONP2, peroxisomal LON protease-like
AluACA285, CBFB, core-binding factor, beta subunit isoform 2
AluACA286, ZP3, zona pellucida glycoprotein 3 isoform 2
AluACA287, C3orf37, hypothetical protein LOC56941
AluACA288, CEP78, centrosomal protein 78kDa isoform b
AluACA289, NRP1, neuropilin 1 isoform a
AluACA290, ALG8, dolichyl pyrophosphate Glc1Man9GlcNAc2
AluACA291, RREB1, ras responsive element binding protein 1 isoform
AluACA292, GPCPD1, glycerophosphocholine phosphodiesterase GDE1 homolog (S. cerevisiae)
AluACA293, MAP3K3, mitogen-activated protein kinase kinase kinase 3
AluACA294, NFATC3, nuclear factor of activated T-cells
AluACA295, POLR1A, DNA-directed RNA polymerase I A
AluACA296, KPNA6, Karyopherin subunit alpha-5
AluACA297, TTF1, transcription termination factor, RNA polymerase I
AluACA298, DLG5, discs, large homolog 5 (Drosophila)
AluACA299, CBLB, Cas-Br-M (murine) ecotropic retroviral
AluACA300, C21orf63, C21orf63 isoform A protein


AluACA301, NUDT3, nudix-type motif 3
AluACA302, METTL6, methyltransferase like 6
AluACA303, ZNF138, zinc finger protein 138 isoform 1
AluACA304, PLCB4, phospholipase C beta 4 isoform b
AluACA305, NPHP4, nephroretinin
AluACA306, SEMA3C, semaphorin 3C precursor
AluACA307, SYNE1, spectrin repeat containing, nuclear envelope 1
AluACA308, LARP4, c-Mpl binding protein isoform a
AluACA309, ATR, ataxia telangiectasia and Rad3 related protein
AluACA310, COL25A1, collagen, type XXV, alpha 1 isoform 2
AluACA311, TMCO1, similar to Transmembrane and coiled-coil domains protein 1
AluACA312, TP73, tumor protein p73 isoform d
AluACA313, DZIP1, DAZ interacting protein 1 isoform 2
AluACA314, MLLT3, myeloid/lymphoid or mixed-lineage leukemia
AluACA315, C16orf73, hypothetical protein LOC254528 isoform 2
AluACA316, C5orf36, hypothetical protein LOC285600 isoform 1
AluACA317, RPS19, ribosomal protein S19
AluACA318, NACC2, BTB (POZ) domain containing 14A
AluACA319, PDXDC1, pyridoxal-dependent decarboxylase domain
AluACA320, CNOT8, CCR4-NOT transcription complex, subunit 8
AluACA321, MFSD11, major facilitator superfamily domain containing
AluACA322, SNX6, sorting nexin 6 isoform a
AluACA323, KIAA1035, highly similar to Homo sapiens ATP/GTP binding protein 1
AluACA324, No apparent host gene
AluACA325, CADPS2, Ca2+-dependent activator protein for secretion 2
AluACA326, DTNA, dystrobrevin alpha isoform 9
AluACA327, KIAA0556, hypothetical protein LOC23247
AluACA328, No apparent host gene
AluACA329, GRHL1, Grainyhead-like protein 1 homolog
AluACA330, SYNE2, spectrin repeat containing, nuclear envelope 2
AluACA331, CARD4, highly similar to Caspase recruitment domain-containing protein 4
AluACA332, ATP8B1, ATPase, class I, type 8B, member 1
AluACA333, PALB2, partner and localizer of BRCA2
AluACA334, PBRM1, polybromo 1 isoform 1
AluACA335, TXNRD2, thioredoxin reductase 2 precursor
AluACA336, GPN3, GPN-loop GTPase 3 isoform 1
AluACA337, SENP5, SUMO1/sentrin specific peptidase 5
AluACA338, PSMB6, proteasome beta 6 subunit precursor
AluACA339, TRNAU1AP, tRNA selenocysteine associated protein 1
AluACA340, RASA2, RAS p21 protein activator 2
AluACA341, VANGL1, vang-like 1
AluACA342, No apparent host gene
AluACA343, LATS1, LATS homolog 1
AluACA344, EDC3, enhancer of mRNA decapping 3
AluACA345, RBCK1, RanBP-type and C3HC4-type zinc finger containing
AluACA346, CCT3, chaperonin containing TCP1, subunit 3 isoform b
AluACA347, TGS1, trimethylguanosine synthase homolog
AluACA348, No apparent host gene
