Supporting Information Appendix
Total Page:16
File Type:pdf, Size:1020Kb
Supporting Information Appendix SI Materials and Methods Plant materials. Bread wheat (Triticum aestivum L.), diploid ancestral species of T. urartu and A. tauschii and allotetraploid T. turgidum were used in this study. The mutants ms1d.1 and ms1e were obtained from the Wheat Genetics Resource Center at Kansas State University. We screened ms1d.2 and ms1h-q from an EMS-mutagenized population of bread wheat (variety ‘Ningchun 4’). The plants used for map-based cloning were progeny segregated from heterozygous ms1e plants. Triticum turgidum L. accession Langdon (AABB), T. urartu accession G1812 (AA) and Ae. tauschii accession AL8/78 were provided by Professor Hongqing Ling (Institute of Genetics and Developmental Biology, Chinese Academy of Sciences). All plants were grown in a greenhouse under long-day conditions (16 h of light at 22–25°C/8 h of dark at 15–20°C) at a white light intensity of 250 mmol/m2 s. The preparation of EMS-mutagenized population and isolation of ms1 alelles in ‘Ningchun 4’ variety. The wheat variety ‘Ningchun 4’ was used for preparation of EMS-mutagenized population. In brief, about 200 kilograms of seeds were soaked in 0.5% (v/v) ethyl methane sulfonate (EMS, Sigma-Aldrich, St Louis, MO, USA) for 12 hours at room temperature (about 25°C), and were planted in a field at Yunnan, China; 990 kilograms of M2 seeds were harvested. One hundred and thirty-five male sterile mutants were screened from a population of 50 kilograms of M2 seeds (about 1 million seeds). After allelism test, 11 of the 135 male sterile mutants were confirmed 1 to be ms1 alleles. RT-PCR, qRT-PCR and in situ hybridization. Total RNA was isolated using TRI reagent (Takara Bio Inc.); genomic DNA was removed with DNase I (Promega). Two micrograms of RNA per sample were used to synthesize cDNA using a First-Strand cDNA Synthesis Kit (Thermo Fisher). RT-PCR was performed with LA Taq (Takara Bio Inc.). qRT-PCR was performed on a cycler apparatus (Bio-Rad) using SYBR Premix Ex Taq GC (Takara Bio Inc.) according to the manufacturer’s instructions. Amplification was conducted as follows: 95°C for 2 min, followed by 40 cycles of 95°C for 5 s and 60°C for 35 s. ACTIN was used as an internal control. Three biological replicates with three technique repeats per replicate were conducted. The primers used for RT-PCR and qRT-PCR are provided in SI Appendix Table S9. In situ hybridization was performed according to Shitsukawa et al.(1) with minor modifications. Tissues were cut into 10-μm-thick sections and hybridization was performed overnight at 50°C. The probe (a 971-bp Ms1 fragment) was amplified using primers Ms1-ISH-F/R (SI Appendix Table S10) and inserted into pEASY-T1 Simple Cloning Vector (TransGen Biotech) in both forward and reverse orientations. The vectors were linearized by digestion with HindIII and EcoRI and used as a template to generate anti-sense and sense probes with T7 RNA polymerase. Histological analysis. For paraffin sections, tissues were prepared as for RNA in situ hybridization. Transverse sections (10 μm thick) were cut and stained with 0.25% 2 toluidine blue. Each section was observed under an Axio Imager M2 microscope (Zeiss). RNA-seq, resequencing and bioinformatics processing of the sequence data. To map Ms1, we first collected the ms1e allele (from an EMS-mutagenized line of wild-type Chris) for further analysis (2). Male-sterile and wild-type plants produced from heterozygous ms1e plants were collected for MutMap-based cloning. To obtain RNA-seq data, RNA was extracted using plant RNA extraction kits (Qiagen) from microspore-stage anthers of segregated wild-type plants, ms1e plants and their heterozygous progeny. Paired-end libraries were prepared from 10 µg of cDNA reverse-transcribed from the RNA samples (mean insert size: 250 bp). The libraries were sequenced using the Illumina HiSeq 2000 system to produce 101-bp paired-end DNA reads. Library preparation and sequencing were performed at the sequencing center of Peking University (BIOPIC sequencing platform). We obtained 25.0 Gb of data for wild type, 47.6 Gb of data for ms1e and 12.2 Gb of data for the heterozygous plants (SI Appendix Table S1). The reads were trimmed for quality control and to remove adapter sequences with Trimmomatic (3) and then aligned to the wheat genome (4) using TopHat2 (5) (parameter: --b2 -mp 40). After filtering for sequence repeats and reads with multiple mapping regions in the genome, the clean reads were extracted with SAMtools (6) and then further processed using perl scripts. To obtain resequencing data, DNA was extracted using plant DNA extraction kits (Qiagen) from 10-day-seedlings of segregated homozygous progeny of wild-type and 3 ms1e plants. Paired-end libraries were prepared from 1 µg of DNA (mean insert size: 350 bp). The libraries were sequenced using the Illumina HiSeq 2500 system to produce 150-bp paired-end reads. Library preparation and sequencing were performed by Novogene Co. DNA resequencing generated 524.7 Gb of data for wild type and 522.1 Gb of data for ms1e (SI Appendix Table S3). The reads were trimmed for quality control and to remove adapter sequences with Trimmomatic (3), and then aligned to the available wheat genomes, including IWGSC (4), TGAC (7) and W7984 (8), using Bowtie 2 (9) (parameter: --mp 40). Sequence repeats and reads with multiple mapping locations were filtered. The RNA-seq and resequencing data have been deposited in the NCBI’s SRA database (accession no. SRP113349). All data will be publicly available after the publication of this work. Identification of candidate SNPs between wild type and ms1e from the RNA-seq and resequencing data by MutMap analysis. We applied the MutMap method (10) to our RNA-seq and resequencing data to map Ms1. After filtering SNPs with low read coverage (<6), index values for the SNPs were computed as follows: index = Nmutant/(Nreference + Nmutant), where N represents the number of accumulated reads with corresponding genotypes. The SNPs were mapped to the wheat chromosome (8), and peaks with high indexMU/indexWT ratios were identified as candidate chromosomal regions containing Ms1. To exclude bias due to index values caused by homologous sequences outside the candidate chromosomal region, two steps were performed. First, SNPs from homologous genes in the wheat genome were identified by comparing 4 sequences from the candidate region with the whole genome sequence using BLASTn, and SNPs obtained from the BLAST analysis were filtered during index calculation. Second, haplotypes for 200-bp regions around each candidate SNP were generated using Haploview (11), and SNPs from the different haplotypes were removed so that only the index ratios of the same haplotypes between wild type and ms1e were calculated. When indexWT =0, we define that the indexMU/indexWT ratio is 15 in the RNA-seq analysis and is 30 in the re-sequencing analysis. Thus, the highest indexMU/indexWT ratio is 15 and 30 for the RNA-seq analysis and the re-sequencing analysis, respectively, in our analysis. As the loci with indexMU/indexWT ratios lower than 2 in RNA-seq analysis and lower than 5 in re-sequencing analysis represent the low possibilities for the candidate genes, we only included the loci with indexMU/indexWT ratios higher than 2 in RNA-seq analysis and higher than 5 in re-sequencing analysis in Fig. 1F and G, respectively. Molecular cloning of Ms1 A traditional map-based cloning approach was adopted to clone Ms1 using SNPs between wild-type and ms1e as markers. Using 112 male-sterile plants segregated from ms1e heterozygotes, we initially mapped Ms1 to interval YZ5–YZ2 with the SNP markers derived from our RNA-seq data. High-resolution markers were developed using DNA-seq data; Ms1 was initially mapped between DYZ18 and YZ2, and then to a 198-kb region between DYZ23 and DYZ19. 5 Southern blotting. Genomic DNA was extracted from young leaves of T. aestivum L. (Ms1/Ms1 and ms1g/ms1g), T. turgidum L. accession Langdon (AABB), T. urartu accession G1812 (AA) and Ae. tauschii accession AL8/78 (DD) by the cetyl trimethylammonium bromide (CTAB) method. The concentration of the purified DNA was quantified with a nucleic acid analyzer (NanoDrop 2000; Thermo Scientific). Forty micrograms of each DNA sample was digested overnight at 37°C with HindIII (Takara Bio Inc.), then purified and separated on a 0.8% agarose gel overnight at 4°C and 35 V. The separated genomic DNA was transferred to Amersham Hybond-N+ nylon membranes (GE Healthcare) and immobilized by UV crosslinking. The probe DNA was labeled with digoxigenin according to the manufacturer’s guidelines (DIG Probe Synthesis Kit; Roche); the primers for the probe are listed in SI Appendix Table S10. We used a 469-bp fragments from the first intron of Ms1 as probe to get the result included in the manuscript. The identities between Ms1 and Ms-A1 and Ms-D1 of the 469-bp probe sequence are 71% and 76%, respectively. The membranes were probed and then analyzed using a chemiluminescence kit (RPN2106; GE Healthcare). Sequence alignment and phylogenetic tree analysis. Sequences were aligned with Clustal X 2.1 and a phylogenetic tree was constructed using Molecular Evolutionary Genetics Analysis (MEGA) 5.2.1 software by the neighbour-joining method. A bootstrap analysis of 1000 replicates was performed to provide confidence estimates for the tree topologies. SI Appendix, Fig. S5 shows the amino acid sequences used for tree construction. 6 DNA methylation analysis. Genomic DNA was isolated from spikes at meiosis in each sample by the CTAB method. DNA samples (30 μg) were treated with proteinase K (AMRESCO Inc.) at 45°C for 1 h.