View Regulatory Gene Expression in Mafb Deficient Macrophages

Morita et al. BMC Systems Biology 2011, 5(Suppl 2):S7 http://www.biomedcentral.com/1752-0509/5/S2/S7 PROCEEDINGS Open Access Combinatorial motif analysis of regulatory gene expression in Mafb deficient macrophages Mariko Morita*, Megumi Nakamura, Michito Hamada, Satoru Takahashi From 22nd International Conference on Genome Informatics Busan, Korea. 5-7 December 2011 Abstract Background: Deficiency of the transcription factor MafB, which is normally expressed in macrophages, can underlie cellular dysfunction associated with a range of autoimmune diseases and arteriosclerosis. MafB has important roles in cell differentiation and regulation of target gene expression; however, the mechanisms of this regulation and the identities of other transcription factors with which MafB interacts remain uncertain. Bioinformatics methods provide a valuable approach for elucidating the nature of these interactions with transcriptional regulatory elements from a large number of DNA sequences. In particular, identification of patterns of co-occurrence of regulatory cis-elements (motifs) offers a robust approach. Results: Here, the directional relationships among several functional motifs were evaluated using the Log-linear Graphical Model (LGM) after extraction and search for evolutionarily conserved motifs. This analysis highlighted GATA-1 motifs and 5’AT-rich half Maf recognition elements (MAREs) in promoter regions of 18 genes that were down-regulated in Mafb deficient macrophages. GATA-1 motifs and MafB motifs could regulate expression of these genes in both a negative and positive manner, respectively. The validity of this conclusion was tested with data from a luciferase assay that used a C1qa promoter construct carrying both the GATA-1 motifs and MAREs. GATA-1 was found to inhibit the activity of the C1qa promoter with the GATA-1 motifs and MafB motifs. Conclusions: These observations suggest that both the GATA-1 motifs and MafB motifs are important for lineage specific expression of C1qa. In addition, these findings show that analysis of combinations of evolutionarily conserved motifs can be successfully used to identify patterns of gene regulation. Introduction provide insights into variation in cell fate decision, such In recent years, genomic analyses have identified many as (re)differentiation or (dys)function. The genomic ele- short DNA sequences that function as transcriptional ments that control variations in gene expression, and regulatory elements and also show evolutionary conser- hence cell fate, are those associated with transcription vation. These signature sequences are usually referred to factors. As a consequence, considerable efforts are being as ”motifs”. There is considerable interest in these made to detect and characterize these motifs. motifs because variations in gene expression play crucial Three procedures are widely employed to identify roles in many biological functions and are also of motifs: sequence alignment, motif extraction and motif importance in disease etiology. Much of the information search. Sequence alignments make it possible to identify on the roles of these motifs has been obtained using biologically meaningful regions [1][2]. In order to expe- microarray or qRT-PCR analyses to investigate the dite investigation of long and complex mammalian gen- dynamics of gene expression. These technologies also omes, it was necessary to develop computer science methods that permitted analyses to be performed in real-time with high-sensitivity and high-precision. One * Correspondence: [email protected] Department of Anatomy and Embryology, Institute of Basic Medical such approach is the Smith-Waterman algorithm [3], Sciences, Graduate School of Comprehensive Human Sciences, University of which permits sequence ambiguity, but also provides Tsukuba, 1-1-1, Tennodai, Tsukuba, 305-8575, Ibaraki, Japan © 2011 Morita et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Morita et al. BMC Systems Biology 2011, 5(Suppl 2):S7 Page 2 of 10 http://www.biomedcentral.com/1752-0509/5/S2/S7 high accuracy and flexibility, although not with high genes, or of the cooperation of MafB with other tran- processing speed. Subsequent development of this algo- scription factors. This uncertainty could be surmounted rithm led to the Smith-Waterman-Gotoh (SWG) inatimelyandcost-effectivemannerbyuseofmotif method, which overcame these problems in efficiency detection protocols to sift through large amounts of [4]. The SWG method is now the standard algorithm DNA sequence data. for optimal local alignment, which is a representative In this study, we showed bioinformatics methods to method in sequence analysis. The PRRN algorithm, identify regulatory motifs and transcription factors which provides one of the highest-precision approaches, which interact with MafB. Through use of the LGM uses a doubly nested randomized iterative (DNR) after multiple alignments for finding evolutionarily con- method for efficient production of multiple alignments served motifs from a large number of DNA sequences, [5-7]. Once the multiple alignments have been pro- the relationships of several functional motifs were inves- duced, it is then necessary to undertake motif extraction tigated. In an attempt to understand the motif informa- from conserved common patterns in the set of consen- tion, we focused on our analysis of MafB, which is sus sequences. Two main methods can be used for this bound to the MARE in the promoter region of genes step, namely, the numeration method and the probabil- that are down-regulated in Mafb deficient macrophages. istic method, based on the Weeder [8] and MEME [9] The relationships among several motifs were elucidated algorithms, respectively. Motif research is also per- using data from prior biochemical experiments and formed to find already-known motifs, for example, using from a new study. The observations also suggest that the MAST [10], TFSEARCH [11] and TFBIND [12] combinations of evolutionarily conserved motifs can be algorithms. used to predict gene regulation. These techniques for Many approaches have been employed to investigate investigating combinations of regulatory motifs in evolu- motif interaction, such as regression methods [13]. How- tionarily conserved regions should help to accelerate ever, very few studies have focused on regulatory interac- development of applications in medical sciences and tions in a large-scale combination of motifs. One such lead to elucidation of causes of diseases. study made use of the Log-linear Graphical Model (LGM) [14] for statistical tests and estimations of the Methods causal relationships among motifs [15]. The LGM is a Three approaches for the input sequences multivariate analysis and probabilistic model presented Three approaches were employed for input sequences to as a graph model with probabilistic conditional indepen- discover motifs among multiple sequences [20]. Multiple dency. However, volume of many motifs may be beyond genes, single species: this is based on the supposition that the scaling limits of real-time calculation. It is important regulatory motifs are conserved among co-regulated to remember, therefore, that current methods require genes within a species, and that the level of gene expres- selection of motifs. Additionally, it is important that con- sion is constant under the chosen experimental condi- clusions derived from computer analyses of interactions tions. Different transcription factors might have an among motifs are confirmed by practical methodologies. indirect influence on the same function. Single gene, With regard to methods for confirmation of interac- multiple species: the rate of mutation of a regulatory tions, the use of knockout (KO) mice or induced over- motif is presumed to be slowed by selective pressure. In expression of transcription factors can provide evidence a single gene group, therefore, universally conserved with a high confidence level. However, these methods regions contain regulatory motifs among cross-species. generally focus on only one transcription factor at a Conserved regions among closely related species may time. Interactions involving two or more motifs are diffi- contain less-functional motifs as noise. On the other cult to identify due to complications arising from motif hand, alignment can be problematic due to changes in ambiguity. In our laboratory, we have generated Mafb function during evolution across species with large evo- KO mice [16]. MafB is a transcription factor and is lutionary distances. Multiple genes, multiple species: known to regulate genes that are expressed in macro- alignment of orthologous sequences is used to identify phages, a type of leukocyte, in almost all species. MafB conserved regions; these regions are then analyzed as is a DNA binding protein with an acidic domain, a basic above in Multiple genes and single species. Since poten- region and a leucine zipper structure (b-zip structure); it tial scores for motif predictions can be improved by can form a homodimer or a heterodimer with a b-zip alignment of multiple genomes [21], this approach here structure protein. The protein binds to the Maf recogni- was adopted to detect regulatory motifs. tion element [17,18] and 5’AT-rich half-MARE [19] (MAREs) in regulatory promoter

View Regulatory Gene Expression in Mafb Deficient Macrophages

UNIVERSITY of CALIFORNIA, SAN DIEGO Towards an Understanding of Inflammation in Macrophages a Dissertation Submitted in Partial

Transcriptional and Post-Transcriptional Regulation of ATP-Binding Cassette Transporter Expression

The Viral Oncoproteins Tax and HBZ Reprogram the Cellular Mrna Splicing Landscape

Characterization of the Macrophage Transcriptome in Glomerulonephritis-Susceptible and -Resistant Rat Strains

Combined DNA Methylation and Transcriptomic Assessments to Determine a Prognostic Model for PD-1-Negative Hepatocellular Carcinoma

Bioinformatics Analysis to Screen Key Genes in Papillary Thyroid Carcinoma

Identification of Novel Diagnostic Biomarkers for Thyroid Carcinoma

Rank Aggregation Via the Cross Entropy Algorithm

Characterizing Genomic Duplication in Autism Spectrum Disorder by Edward James Higginbotham a Thesis Submitted in Conformity

Apoptosis Inhibitor of Macrophage (AIM) Is Required for Obesity-Associated Recruitment of Inﬂammatory Macrophages Into Adipose Tissue

Content Based Search in Gene Expression Databases and a Meta-Analysis of Host Responses to Infection

Peripheral Nerve Single-Cell Analysis Identifies Mesenchymal Ligands That Promote Axonal Growth