IDENTIFICATION AND FUNCTIONAL VALIDATION OF NOVEL FOXP2 REGULATORY TRANSCRIPTION FACTORS IN HUMAN CELL LINES

Quimbaya, Mauricio1 and España, Jina Marcela1 1Pontificia Universidad Javeriana Cali, Colombia

ABSTRACT Different types of genetic analyzes based on individuals with different hereditary forms of speech disorders, such as verbal dyspraxia and some types of autism, have allowed the identification of specific closely associated with the disorders mentioned above.

In humans, the FOXP2 has been widely studied. Mutations in FOXP2 are directly associated with inherited verbal dyspraxia and speech disorders. However, there is little experimental evidence on which genes or transcription factors directly regulate FOXP2, from this point of view, it is essential to determine the transcriptional regulators that act upstream of FOXP2.

The identification of new transcription factors associated with FOXP2 regulation will allow the construction of more holistic and inclusive networks, providing an important advance in the identification of molecular pathways involved in the establishment of neural circuit associated with the evolution of vertebrate communication skills. In the same way, the identification of specific regulatory elements that control FOXP2 expression could be of great importance for the appearance of new therapies to prevent or control nervous system development pathologies, specifically those related to speech and language. Here, four transcription factors were chose to evaluate their regulation function with FOXP2. POU3F2 and TBR1 showed great significance in the study since both confirmed a positive FOXP2 regulation and have special roles in the nervous system development.

INTRODUCTION Language and communication are inherent characteristics of the human brain with great importance in the establishment of social relations. Complications during language and communication development relay on specific dysfunctions at neuronal and nervous system scales. These inabilities have not been extensively analyzed from a genetics perspective.

During the 80s a case of study was followed in three generations within a single family showing verbal disorders. The genetic analysis identified a mutation in the FOXP2 , awakening the interest for many neuroscientists to study it thoroughly, trying to understand how only one particular gene might contribute to human communication. This approach opened a new era for language and communication understanding from a genetics perspective. Known as the communication gene, the transcription factor FOXP2 has been extensively studied in the nervous system development. FOXP2 expression is confined to specific brain regions and its expression is highly conserved in vertebrates. Numerous disruptions and alterations in this gene have been associated with deficits in human language development. Similarly, the understanding of FOXP2 control and signaling, has allowed the understanding of synaptic plasticity, the motor ability of mice and the mechanism of songbirds learning.

The disorders associated with FOXP2 are due to 1) gene deletion (52% of the affected individuals), 2) specific variant sequences within the gene (~ 29%), 3) maternal uniparental disomy of 7 that reduces the expression of FOXP2 (~ 11%) and 4) structural variants of the gene (translocations and inversion) (~ 8%) [1].

One of the most common disorders associated with FOXP2 alterations is speech apraxia, characterized by disturbances in speech planning, affecting the production, pronunciation and timing of sounds, syllables and words [2]; apraxia that hinders the transformation of sounds into syllables, syllables into words and words into matching phrases, affects 10% of the population [3]. Other FOXP2 disorders include oral motor dyspraxia, which obstructs the planning of oral movements [4]; dysarthria, which interferes with the quality of the voice and affect the neuromuscular side of speech [5]; the difficulty in receiving and expressing the language (Watkins et al., 2002), deficiency in reading and writing [6]; the difficulty of developing fine motor skills [7] among others.

The aim of the present research project is to propose a new genetic approach to identify and validate novel FOXP2 regulatory transcription factors. The identification of these regulatory elements will contribute to the generation of new information for the scientific community, being of high significance in the elucidation of new therapies to prevent pathologies associated with the nervous system development.

THEORIC FRAME The 114076877-positional substitution in FOXP2 eighth intron, significantly affects the expression of the gene since it is located in a conserved sequence fundamental for POU3F2 binding, an important neural transcription factor [8]. The direct interaction between POU3F2 and FOXP2 is also supported by their expression pattern since both are exclusively expressed in postmitotic neurons, so it is possible to assume that FOXP2 regulation by POU3F2 occurs precisely in neuronal tissue [8]. Similarly, Bao and collaborators [9] reported, that the CTCF binds upstream and downstream to the FOXP2 gene, moreover, Sakai and colleagues [10] and Deriziotis teamwork [11] reported in a protein interaction study based on autism disorders, that the transcription factor TBR1 interacts physically with FOXP2. It is important to highlight the importance of TBR1 in the development of the nervous system, since it is a neuron-specific transcription factor with key roles in nervous system patterning, regulating neuronal identity in cortical development, with numerous implications in the autism disorder [12].

Morelli and colleagues [13] reported a young woman case with severe language disorders who presented a chromosomal rearrangement involving an inversion of chromosome 7 that was translocated to chromosome 11. The karyotype of the young woman, together with the clinical history that included sever complications in neurodevelopment and language abilities, indicated that the different defects were probably associated with the FOXP2 gene. The breaking point was located at 7q31 position, more specifically 200 Kb downstream to the 3'UTR of FOXP2. In this region, numerous miRNA bind, influencing not only FOXP2 expression, but also, other genes such as ROBO2, a gene associated with pathologies of the nervous system, such as autism and dyslexia [14]. Additionally, it was possible to identify and characterize a localized regulatory element 205kb downstream FOXP2, which includes an open chromatin region with specific histone modifications, typical from an enhancer region, suggesting that the 7q31 breakpoint could interfere with FOXP2 expression. The same region has been also documented as the genomic binding site of transcription factors such as PPP1R3A, MDFIC and TFEC [15].

METHODS AND RESULTS 1. Identification of novel transcription factors, involved in nervous system development that might interact with FOXP2.

For the selection of FOXP2 potential interactors, different filters were applied using four bioinformatics databases: MotifMap, Uniprot, COXPRESdb and Cytoscape. From the four, two of them were relevant in the study.

Table 1. Transcription factors potentially associated with FOXP2 according to MotifMap database.

Name Protein name Name Protein name Name Protein name

Pancreas/duodenum Forkhead box protein TBR1 T-Box, Brain 1 ipf1 FOXF2 protein 1 F2 POU domain, class Homeobox protein -2 FOXD1 Forkhead box protein D1 POU3F2 3, transcription CDX-2 factor 2 ARP- Homeobox protein Pituitary homeobox HOXB9 FOXO1 1(COUP- Hox-B9 2 TF2) Hepatocyte nuclear factor Homeobox protein HOXA9 Homeobox A9 HNF3beta NKX25 3-beta Nkx-2.5 TAL BHLH Homeobox protein TAL1 Transcription Factor Freac-2 Forkhead box protein F2 NKX22 Nkx-2.2 1 Forkhead box Homeobox protein FOXM1 Freac-4 Forkhead box protein D1 Nkx3-2 protein M1 Nkx-3.2 Homeobox protein Forkhead box protein Barx1 Meis1 Homeobox protein Meis1 Freac-3 BarH-like 1 C1

vasoconstriction gliogenesis muscle organ development generation of neurons vascular smooth muscle contraction nervous system development regulation of blood vessel size smooth muscle contraction skeletal system development vascular process in circulatory neurogenesis system organ development negative regulation of locomotion vasculature development blood circulation muscle hypertrophy neuron differentiation tissue development muscle contraction system development muscle structure development muscle tissue development circulatory system processmuscle system process blood vessel development

regulation of locomotion cell differentiation blood vessel remodeling anatomical structure development system process

cell development multicellular organismal process

angiogenesis positive regulation of locomotion

neuron development blood vessel morphogenesis neuron projection morphogenesis regulation of cell migration negative regulation of biological process cellular developmental process positive regulation of biological cell projection morphogenesis anatomical structure morphogenesis process positive regulation of cell migration multicellular organismal neuron projection development regulation of biological process development negative regulation of angiogenesis cell projection organization regulation of multicellular organismal process

developmental process regulation of cell morphogenesis cellular process regulation of anatomical structure regulation of developmental process morphogenesis activation of immune response biological_process regulation of cell shape positive regulation of immune immune system process regulation of angiogenesis response

Figure 1. BinGO analysis of the common coexpression neighborhood of FOXP2, CDX-2, Freac- 2, FOXF2, POU3F2 and Nkx 3-2 genes (group 1). The blue color intensity represents the significance level (p<0.05) according to the hypergeometric distribution and the size of the circle corresponds to the number of genes involved in each GO category.

negative regulation of coagulation regulation of fibrinolysis

biological regulation negative regulation of fibrinolysis regulation of sterol transport regulation of coagulation regulation of wound healing activation of immune responseregulation of biological process homeostatic process positive regulation of blood regulation of biological quality coagulation regulation of immune system regulation of blood coagulation process regulation of protein stability regulation of response to external regulation of response to stimulus stimulus regulation of body fluid levels immunecomplement system process activation, alternative pathwayregulation of immune response hemostasis regulation of response to stress complement activation positive regulation of response to multicellular organismal process phospholipid efflux stimulus positive regulation of developmental process phospholipid transport positive regulation of immune sterol transport digestive system process response blood coagulation lipid transport cholesterol transport digestion cholesterol efflux coagulation reverse cholesterol transport localization response to stimulusplatelet activation intestinal cholesterol absorption positive regulation of immune lipid digestionmacromolecule localization system process monocarboxylic acid transport intestinal absorption protein transport wound healing carboxylic acid transport lipid localization lipoprotein transport bile acid and bile salt transport lipoprotein particle clearance establishment of localization

high-density lipoprotein particle organic acid transport clearance chylomicron remnant clearance transport

Figure 2. BinGO analysis of the common coexpression neighborhood of TBR1, TAL1, FOXM1, ipf1, FOXD1, FOXO1, ARP-1, Barx-1, Meis1 and HOXA9 genes. The blue color intensity represents the significance level (p<0.05) according to the hypergeometric distribution and the size of the circle corresponds to the number of genes involved in each GO category. 2. Real-time PCR (qPCR) HeLa and HEK293 cell culture were used for specific siRNAs transfections, once transfection is achieved, real-time PCR was reached. Conventional protocols were used for the experimentation. FOXP2 Expression HeLa cells 1.2 1 0.8 0.6 0.4 0.2 0

FOXP2 FOXP2 Relative Expression No Target POU3F2-KD TBR1-KD CDX2-KD (Control -)

Figure 3. FOXP2 expression reduction upon specific gene silencing. Upon POU3F2 knock-down there is a reduction of 34% for FOXP2. Similarly when TBR1 and CDX2 are knocked-down a reduction in expression for FOXP2 of 33% and 20% respectively is achieved.

1. TBR1 qPCR results demonstrated that with a 50% TBR1 knock-down, FOXP2 expression was reduced a 33%, which is in agreement with a specific perspective that relates TBR1 and FOXP2. TBR1 encodes a T-box family neuron-specific transcription factor, a family of highly involved in important biological roles such as vertebrate body plan definition, differentiation and organogenesis [16]. A study of Deriziotis and collaborators [11] demonstrated that TBR1 mutations results in sporadic autism. A specific reported mutation over TBR1 gene, affects the interactions between TBR1 and FOXP2, possibly, loss of TBR1–FOXP2 interaction may contribute to speech and language deficits. In the same way Becker and colleagues [15]. Therefore, it was of great importance for the presence study to evaluate TBR1 inhibition over FOXP2 expression.

2. POU3F2 qPCR results demonstrated that when knocked-down, POU3F2 induces a 34% reduction over FOXP2 expression, supporting the findings described by Maricic and collaborators [8]. This study identified specific nucleotide differences between Neandertal FOXP2 and human FOXP2, affecting an important binding site for the transcription factor POU3F2, such nucleotide change modifies the way in which POU3F2 interacts with FOXP2, therefore, it modifies the way in which FOXP2 acts in the cells. Interestingly, the nucleotide change occurred in a highly conserved sequence. POU3F2 is expressed in postmitotic neurons and glia [17] as well as FOXP2, and it is exclusive from the central nervous system [18] turning into mandatory, to evaluate POU3F2 as a potential regulator of FOXP2 expression.

3. CDX2 AND HOXA9 All the FOXP genes have the capacity of auto-regulation, and regulation among them. FOXP2- FOXP1 interaction is notably in the thalamus, hypothalamus and basal ganglia in many vertebrates [19]. Given that FOXP1 and FOXP2 are coexpressed, FOXP1 regulators may contribute also to the regulation of FOXP2 expression.

With a 10% knock-down of CDX2, it was possible to reduce 20% of FOXP2 expression. CDX2 is a homeobox member, a group of master genes with key roles in embryogenesis, regulating HOXA9 [20]. HOX transcription factors have key roles in motor neurons and their target muscles, by converting all the neural signals into patterns of connectivity, and by taking FOXP1 as an accessory-helping factor to make this happens. Dasen and coworkers [21] determined FOXP1 as a crucial element for motor neurons connectivity and diversification. Similarly, Li S., and collaborators [22] discussed about a combined repression between FOXP1/2/4 and HOXA9-13 and PAX2/8/9 in the endoderm at early stages.

Even though it was not possible to determine a regulatory function for HOXA9 since transfection was not successfully achieved, it would be interesting to evaluate HOXA9 expression with CDX2 knock-down.

It is of great importance to understand the upstream regulatory pathways that control FOXP2 expression, by this way, it would be much clear to understand human language and related frequent disorders, and, it would be the missing piece to complete the molecular FOXP2 regulatory network.

REFERENCES [1] Morgan AT, Liégeois F, Vargha-Khadem F. 2010. Motor speech outcome as a function of the site of brain pathology: A developmental perspective. Chap 6. In: Maassen B, van Lieshout P, eds. Speech Motor Control: New Developments in Basic and Applied Research. Oxford, UK: Oxford University Press; 95-115. [2] Fisher SE, Scharff C. 2009. FOXP2 as a molecular window into speech and language. Trends in genet. 25(4):166-77. [3] Gibbs J, Appleton J, Appleton R. 2007. Dyspraxia or developmental coordination disorder? Unravelling the enigma. Arch Dis Child. 92(6): 534–539. [4] Alcock KJ, Passingham RE, Watkins KE, Vargha-Khadem F. 2000. Oral dyspraxia in inherited speech and language impairment and acquired dysphasia. Brain Lang. 75:17–33. [5] Turner SJ, Hildebrand MS, Block S, Damiano J, Fahey M, Reilly S, Bahlo M, Scheffer IE, Morgan AT. 2013. Small intragenic deletion in FOXP2 associated with childhood apraxia of speech and dysarthria. Am J Med Genet A. 161A:2321–6. Doi: 10.1002/ajmg.a.36055 [6] Vargha-Khadem F, Gadian D, Copp A, Mortimer Mishkin M. 2005. FOXP2 AND THE NEUROANATOMY OF SPEECH AND LANGUAGE. Nat. Rev. Neurosci. 6, 131–138. [7] Lai CS, Fisher SE, Hurst JA, Vargha-Khadem F, Monaco AP. 2001. A forkhead-domain gene is mutated in a severe speech and language disorder. Nature. 413:519–23. [8] Maricic T, Günther V, Georgiev O, Gehre S, Curlin M, Schreiweis C, Naumann R, Burbano HA, Meyer M, Lalueza-Fox C, de la Rasilla M, Rosas A, Gajovic S, Kelso J, Enard W, Schaffner W, Pääbo S. 2013. A recent evolutionary change affects a regulatory element in the human FOXP2 gene. Mol Bio and Evol. 30:844–852. [9] Bao L, Zhou M, Cui Y. 2008. CTCFBSDB: a CTCF-binding site database for characterization of vertebrate genomic insulators. Nucleic Acids Res. 36: D83–D87. [10] Sakai Y, Shaw C, Dawson B, Dugas D, Al-Mohtaseb Z, Hill D, et al. 2011. Protein interactome reveals converging molecular pathways among autism disorders. Sci Transl Med. 3:86ra49 [11] Deriziotis P, O'Roak BJ, Graham SA, Estruch SB, Dimitropoulou D, Bernier RA, Gerdts J, Shendure J, Eichler E, Fisher SE. 2014. De novo TBR1 mutations in sporadic autism disrupt protein functions. Nat Commun. 5, 4954 [12] McKenna WL, et al. 2011. Tbr1 and Fezf2 regulate alternate corticofugal neuronal identities during neocortical development. J Neurosci. 31:549–564 [13] Morelli D, Nudel R, Chan MT, Green CM, Volpi EV, Benitez-Burraco A, et al. 2015. Language impairment in a case of a complex chromosomal rearrangement with a breakpoint downstream of FOXP2. Mol Cytogenet. 8:36. [14] Anitha A, Nakamura K, Yamada K, Suda S, Thanseem I, Tsujii M, Iwayama Y, Hattori E, Toyota T, Miyachi T, Iwata Y, Suzuki K, Matsuzaki H, et al. 2008. Genetic analyses of roundabout (ROBO) axon guidance receptors in autism. Am J Med Genet B Neuropsychiatr Genet. 147B(7):1019-27. [15] Becker M, Devanna P, Fisher S, Vernes S. 2015. A chromosomal rearrangement in a child with severe speech and language disorder separates FOXP2 from a functional enhancer. Mol Cytogenet. 8:69. [16] Wilson V & Conlon FL. 2002. The T-box family. Genome biology, 3(6), reviews3008-1. [17] Hagino-Yamagishi K, Saijoh Y, Ikeda M, Ichikawa M, Minamikawa-Tachino R, Hamada H. 1997. Predominant expression of Brn-2 in the postmitotic neurons of the developing mouse neocortex. Brain research, 752(1-2), 261-268. [18] Schreiber E, Tobler A, Malipiero U, Schaffner W & Fontana A. 1993. cDNA cloning of human N-Oct 3, a nervous-system specific POU domain transcription factor binding to the octamer DNA motif. Nucleic acids research, 21(2), 253-258 [19] Takahashi K, Liu FC, Oishi T, Mori T, Higo N, Hayashi M. 2008. Expression of FOXP2 in the developing monkey forebrain: comparison with the expression of the genes FOXP1, PBX3, and MEIS2. J. Comp. Neurol. 509, 180–189. [20] Rawat VP, Cusan M, Deshpande A, Hiddemann W, Quintanilla-Martinez L, Humphries RK & Buske C. 2004. Ectopic expression of the homeobox gene Cdx2 is the transforming event in a mouse model of t (12; 13)(p13; q12) acute myeloid leukemia. Procee Nat Acad Sci, 101(3), 817- 822 [21] Dasen JS, De Camilli A, Wang B, Tucker PW & Jessell TM. 2008. Hox repertoires for motor neuron diversity and connectivity gated by a single accessory factor, FoxP1. Cell, 134(2), 304-316. [22] Li S, Morley M, Lu M, Zhou S, Stewart K, French CA & Morrisey EE. 2016. Foxp transcription factors suppress a non-pulmonary gene expression program to permit proper lung development. Dev biology, 416(2), 338-346.