Genetic Modifiers of B-Thalassemia
Total Page:16
File Type:pdf, Size:1020Kb
Thalassemia Syndrome • Molecular Basis of Disease Genetic modifiers of b-thalassemia Swee Lay Thein As the defective genes for more and more genetic disorders become unravelled, it is clear that patients with apparently identical genotypes can have many different clinical conditions even in simple monogenic disorders. b thalassemia occurs when there is a deficiency in the synthesis of b globin chains. The clinical manifestations of b tha- lassemia are extremely diverse, spanning a broad spectrum from severe anemia and transfusion-dependency to the asymptomatic state of thalassemia trait. The remark- able phenotypic diversity of the b thalassemias is prototypical of how a wide spectrum of disease severity can be generated in single gene disorders. The most reliable and predictive factor of disease phenotype is the nature of the mutation at the b globin locus itself. However, relating phenotype to genotype is complicated by the complex interaction of the environment and other genetic factors at the secondary and tertiary levels, some implicated from family studies, and others, as yet unidentified. This arti- cle reviews the clinical and hematologic diversity encountered in b thalassemia with an overview of the modifier genes that moderate their disease expression. Key words: b thalassemia, modifier genes. Haematologica 2005; 90:649-660 ©2005 Ferrata Storti Foundation From the Department of Haematological Medicine, he thalassemias refer to a diverse Mediterranean region suggests that the dis- Guy’s, King’s & St Thomas’ School group of hemoglobin disorders char- ease was present as far back as the Neolithic of Medicine, King’s College London, acterized by a reduced synthesis of period.3 More than 200 mutations affecting UK. T one or more of the globin chains (a, b, g, db, the b globin gene are now known to result 1 Correspondence: gdb, d and egdb). b thalassemia occurs in a phenotype of b thalassemia. The most Professor Swee Lay Thein, when there is a deficiency of b globin; typ- common forms are those that are prevalent Department of Haematological ically, it is caused by a direct down-regula- in the malarial tropical and sub-tropical Medicine, Guy’s, King’s & tion in the synthesis of structurally normal regions where a few mutations have St. Thomas’ School of Medicine King’s College London, Denmark Hill b chains. However, a thalassemia pheno- reached high gene frequencies because of Campus, Bessemer Road type can also arise from structural b chain the protection they provide against malaria. London SE5 9PJ. variants if they are synthesized at a reduced In these countries in which b thalassemia is E-mail: [email protected] rate, e.g. Hb E (b26GluÆLys). Alternatively, prevalent, a limited number of alleles (4 to 5) the variants are produced at a normal rate account for 90% or more of the b tha- but are so unstable that they are rapidly lassemias, such that a focused molecular destroyed giving rise to a functional defi- diagnostic approach can be undertaken.4 ciency. The former group of variants is also The epidemiology of the disease, however, referred to as b thalassemic hemoglo- is changing due to a fall in total birth rate, binopathies. The hyperunstable b chain prevention programs and recent population variants act in a dominant negative fashion, movements. In Northern Europe, including causing a disease phenotype even when the UK, the US and Australasia, due to present in a single copy, and hence, have recent population migration, thalassemia been referred to as dominantly inherited b has become an important part of clinical 2 thalassemia. In contrast, typical b tha- practice, and in these regions a much wider lassemia is inherited as a haploinsufficient spectrum of b thalassemia mutations can be Mendelian recessive disease. The tha- encountered. Furthermore, with improved lassemias are the commonest monogenic clinical care and increasing survival of disorders in the world and globally it is esti- affected individuals, complications such as mated that there are 270 million carriers, of chronic liver disease, thrombosis and pul- which 80 million are carriers of b tha- monary hypertension are being increasingly lassemia. Archeological evidence in the recognized in the older patients. haematologica/the hematology journal | 2005; 90(5) | 649 | S.L. Thein et al. cluster A bLCR b 3’HSI 5 4321 e GgAg ybdb 5’ 3’ 10 kb Scale B CCAAT CACCC CACCC 1 30 30 104 105 146 5’ UTR 5’ UTR 5' IVS 1 IVS 2 3' 1 2 3 C ATG GT AG GT AG TAA ATAAA AATAA Promoter 5’ and 3’ UTR RNA + 0 processing b ,b ATG Nonsense Codons Frameshifts G A Figure 1. A. The b globin gene cluster and its flanking regions on chromosome 11p. The e, g, g, d and b globin genes are indicated as gray boxes. The 5' hypersensitive sites (1 to 5) which comprise the bLCR and the 3' hypersensitive sites (3'HS1) are shown as vertical arrows. Hatched boxes represent the olfactory receptor genes. B. General structure of the b globin gene with the 3 exons (gray boxes) and the 2 intervening sequences (IVS1 and IVS2). Conserved sequences (detailed in text) are indicated. The different classes of point mutations causing b thalassemia are shown below the b globin genes. The general structure of the b globin gene is typical 6,7 The b globin gene of the other globin loci (Figure 1B). The genomic sequence spans 1600 bp and codes for 146 amino acids; b globin is encoded by a structural gene found in a the transcribed region is contained in three exons sep- cluster with the other b-like genes spanning 70 Kb on arated by two introns or intervening sequences the short arm of chromosome 11 (11p15.4) (Figure 1). (IVS).The first exon encodes amino acids 1 to 29 G A The cluster contains five functional genes, 5e g- g-yb‚ together with the first two bases for codon 30, exon 2 d-b3’, which are arranged in the order of their develop- encodes part of residue 30 together with amino acids mental expression. Upstream of the entire b globin 31 to 104, and exon 3, amino acids 105 to 146. Exon 2 complex is the locus control region (LCR), which is encodes the residues involved in heme binding and ab essential for the expression of all the genes in the com- dimer formation, while exons 1 and 3 encode for the plex. This region consists of five DNase 1 hypersensi- non-heme-binding regions of the b globin chain. tive (HS) sites (designated HS1 5) distributed between Conserved sequences important for gene function are 6 and 20 Kb 5’ of the e gene. There is one other hyper- found in the 5’ promoter region, at the exon-intron sensitive site ~20 Kb 3’ to the b gene. The two extreme junctions, and in the 3’ untranslated region (3’-UTR) at HS sites flanking the b complex have been suggested to the end of the mRNA sequences. The b globin gene mark the boundaries of the b globin gene domain. The promoter includes 3 positive cis-acting elements: TATA b globin complex is embedded in a cluster of olfactory box (positions –28 to –31), a CCAAT box (positions receptor genes (ORG), part of the family of ~1000 –72 to –76) and duplicated CACCC motifs (proximal genes that are widely distributed throughout the at positions –86 to –90, and distal at position –101 to genome, and expressed in the olfactory epithelium.5 –105). While the CCAAT and TATA elements are | 650 | haematologica/the hematology journal | 2005; 90(5) Genetic modifiers of b-thalassemia found in many eukaryotic promoters, the CACCC silencing and gene competition, mediated by the dif- sequence is found predominantly in erythroid cell-spe- ferent transcription factors in embryonic, fetal and cific promoters. Binding of the Erythroid Krüppel Like adult cells. While the e- and g globin genes are auto- Factor (EKLF) to the CACCC motif appears to be cru- nomously silenced at the appropriate developmental 8,9 cial for normal adult b globin expression. In addition stage, expression of the adult b globin gene depends on to these motifs, the region upstream of the b globin lack of competition from the g gene for the LCR promoter contains two binding motifs for the ery- sequences. This is supported by the down-regulation throid transcription factor GATA 1. The importance of of the cis b gene when g gene is up regulated by point these various 5’ flanking sequences for normal gene mutations in their promoters as illustrated by the non- expression is underscored by b thalassemia arising deletional hereditary persistence of fetal hemoglobin 13 from point mutations in these sequences specifically in (HPFH). Mutations which affect the b promoter, and around the TATA box and the CACCC motifs in which remove competition for the bLCR also, tend to the –80 to –100 region. An enhancer is also found in be associated with variable increases in g and d gene intron 2 and 3’ of the globin gene, 600 to 900 bp down- expression (see later). stream of the poly (A) site.10 The 5’ untranslated region (UTR) occupies a region Molecular basis of b thalassemia of 50 nucleotides between the CAP site, the start of There are two main varieties of b thalassemia alleles; 0 + transcription, and the initiation (ATG) codon. There b thalassemia in which no b globin is produced, and b are two prominently conserved sequences in the 5’ thalassemia in which some b globin is produced, but UTR of the various globin genes (both a and b). One is less than normal. Less severe forms are sometimes des- ++ the CTTCTG hexanucleotide found 8 to 13 ignated b to reflect the minimal deficit in b chain pro- nucleotides downstream from the CAP site, i.e.