Eccb2010 010101010111110100101001010101000101011110000101101001010111001101011

Eccb2010 010101010111110100101001010101000101011110000101101001010111001101011

ECCB2010 010101010111110100101001010101000101011110000101101001010111001101011 CandidateCandidate genegene prioritizationprioritization basedbased onon spatiallyspatially mappedmapped genegene expression:expression: anan applicationapplication toto XLMRXLMR Rosario M. Piro, Ivan Molineris, Ugo Ala, Paolo Provero, and Ferdinando Di Cunto [email protected] Molecular Biotechnology Center and Dep. Genetics, Biology and Biochemistry University of Torino, Italy ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Motivation 010101010111110100101001010101000101011110000101101001010111001101011 ● Linkage analysis can help to identify disease- associated loci, but: ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Motivation 010101010111110100101001010101000101011110000101101001010111001101011 ● Linkage analysis can help to identify disease- associated loci, but: ● often hundreds of positional candidate genes ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Motivation 010101010111110100101001010101000101011110000101101001010111001101011 ● Linkage analysis can help to identify disease- associated loci, but: ● often hundreds of positional candidate genes ● Online Mendelian Inheritance in Man (OMIM): currently over 1000 “orphan” disease loci for phenotypes with unknown molecular basis but mapped locus ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Motivation 010101010111110100101001010101000101011110000101101001010111001101011 ● Linkage analysis can help to identify disease- associated loci, but: ● often hundreds of positional candidate genes ● Online Mendelian Inheritance in Man (OMIM): currently over 1000 “orphan” disease loci for phenotypes with unknown molecular basis but mapped locus ● next generation sequencing techniques? – large-scale re-sequencing of Chr. X in 208 XLMR families indicated 3 novel (and several known) disease genes, but in most cases the genetic cause could not be reliably determined, although many mutations (even truncating) were found. (Tarpey et al., Nat. Genet., 2009) ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Motivation 010101010111110100101001010101000101011110000101101001010111001101011 ● Linkage analysis can help to identify disease- associated loci, but: ● often hundreds of positional candidate genes ● Online Mendelian Inheritance in Man (OMIM): currently over 1000 “orphan” disease loci for phenotypes with unknown molecular basis but mapped locus ● next generation sequencing techniques? – large-scale re-sequencing of Chr. X in 208 XLMR families indicated 3 novel (and several known) disease genes, but in most cases the genetic cause could not be reliably determined, although many mutations (even truncating) were found. (Tarpey et al., Nat. Genet., 2009) => Computational disease gene prioritization ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X gene Y ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X gene Y ● carry limited spatial information, no precise 3D localization ● have a sparse coverage (arbitrary, not-consecutive positions) ● problem for tissues/organs with a high degree of spatial organization, e.g. the central nervous system (CNS) ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X gene Y ● carry limited spatial information, no precise 3D localization ● have a sparse coverage (arbitrary, not-consecutive positions) ● problem for tissues/organs with a high degree of spatial organization, e.g. the central nervous system (CNS) © Allen Institute gene X ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X gene Y ● carry limited spatial information, no precise 3D localization ● have a sparse coverage (arbitrary, not-consecutive positions) ● problem for tissues/organs with a high degree of spatial organization, e.g. the central nervous system (CNS) © Allen Institute gene Y gene X ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA 3D expression data 010101010111110100101001010101000101011110000101101001010111001101011 ● “Traditional” high-throughput expression data sample 1 2 3 4 5 6 7 8 9 10 11 12 13 gene X gene Y ● carry limited spatial information, no precise 3D localization ● have a sparse coverage (arbitrary, not-consecutive positions) ● problem for tissues/organs with a high degree of spatial organization, e.g. the central nervous system (CNS) ● Allen (Mouse) Brain Atlas (Lein et al., Nature, 2007) ● http://www.brain-map.org/ © Allen Institute ● ~20k genes (~18k with Entrez ID) ● in situ hybridization (ISH) ● covers the entire mouse brain ● expression levels for “voxels” gene Y (cubes) of 200 μm side length gene X ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Mapping to human genes 010101010111110100101001010101000101011110000101101001010111001101011 ● We use mouse expression data also for human Mendelian disorders gene X (human) ~15k genes gene Y (human) ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Mapping to human genes 010101010111110100101001010101000101011110000101101001010111001101011 ● We use mouse expression data also for human Mendelian disorders gene X gene X (human) (mouse) HomoloGene 1-to-1 mapping ~15k ~18k genes genes gene Y gene Y (human) (mouse) HomoloGene 1-to-1 mapping ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Mapping to human genes 010101010111110100101001010101000101011110000101101001010111001101011 ● We use mouse expression data also for human Mendelian disorders gene X gene X (human) (mouse) HomoloGene 1-to-1 mapping ~15k ~18k genes © Allen Institute genes gene Y gene Y (human) (mouse) HomoloGene 1-to-1 mapping ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Prioritization approach (1) 010101010111110100101001010101000101011110000101101001010111001101011 ● Step 1 ̵̶̵ select “reference genes” ● genes already known to be involved in the given phenotype or in similar phenotypes (MimMiner; van Driel et al., Eur. J. Hum. Genet., 2006) ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Prioritization approach (1) 010101010111110100101001010101000101011110000101101001010111001101011 ● Step 1 ̵̶̵ select “reference genes” ● genes already known to be involved in the given phenotype or in similar phenotypes (MimMiner; van Driel et al., Eur. J. Hum. Genet., 2006) ● the method can be applied also to OMIM phenotypes of so far unknown molecular basis ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Prioritization approach (1) 010101010111110100101001010101000101011110000101101001010111001101011 ● Step 1 ̵̶̵ select “reference genes” ● genes already known to be involved in the given phenotype or in similar phenotypes (MimMiner; van Driel et al., Eur. J. Hum. Genet., 2006) ● the method can be applied also to OMIM phenotypes of so far unknown molecular basis ● Step 2 ̵̶̵ for each reference gene g: ● build ranked co-expression lists by ranking all other genes according to their (Pearson) correlation with g ACGTCAGCTAGCTAGCTAGTCAGTCGATCGATGTGTACATGCAATCGTAGCTAGCTAGCTAGCTA TGCATGCAACTGATGCATGCATGCTGATCGATGCATGCT [email protected] CTAGCA Prioritization approach (1) 010101010111110100101001010101000101011110000101101001010111001101011 ● Step 1 ̵̶̵ select “reference genes” ● genes already known to be involved in the given phenotype or in similar phenotypes (MimMiner; van Driel et al., Eur. J. Hum. Genet., 2006)

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    45 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us