The Board of Trustees of the Center for Excellence in Education
Honorary Members The Honorable Jimmy Carter The Honorable Tom Davis The Honorable William H. Frist The Honorable Joseph I. Lieberman The Honorable Ellen O. Tauscher
Board of Trustees Mr. Mel Chaskin, Chairman Ms. Joann P. DiGennaro, President
Mr. Bill Andresen Mr. Michael H. Jordan The Honorable Caroline E. Baker Mr. Raymond C. Kubacki Mr. John D. Bergen Dr. David Lee The Honorable Frank C. Carlucci Admiral W.A. Owens, USN (Ret.) Mr. Red Cavaney Mr. James C. Paul The Honorable Anne Collins The Honorable Thomas R. Pickering Dr. Mary DeLong Mr. David K. Rensin Mrs. Betty Dranow Ms. Cynthia Pickett-Stevenson Mr. Nicholas S. Gouletas Ms. Diane Sawyer H.E. Bahia El Hariri Mr. Fred Weinert Mr. Douglas Heimburger Mrs. Gayle Wilson
Center for Excellence in Education 8201 Greensboro Drive,Suite 215 McLean,VA 22102
A Nonprofit Organization Under U.S. Code 501 (c) (3)
Research Science Institute 2004 Faculty and Staff
RSI Directors Mr. Matthew Paschke (MIT) RSI ’91 North Canton,OH Dr. Carey Inouye (Caltech) Honolulu,HI Vice President of Administration Ms. Maite P. Ballestero Bethesda,MD Assistant Directors Mr. James Skelley (MIT) RSI ’99 Joplin,MO Ms. Dagny Looper (Caltech) RSI ’99 Bryant,AR Professors — MIT Dr. Amy Bany RSI ’90E Yale University Mr. Adrian Campbell RSI ’92 Bayer Corporation Dr. Leonardo Hsu RSI ’87 University of Minnesota Mr. Lance Rhoades University of Washington Dr. Christopher Skinner RSI ’88 University of Michigan Professors — Caltech Dr. Charles Doran RSI ’87 Columbia University Dr. Parth Patwari RSI ’90E Brigham & Women’s Hospital Mr. Lance Rhoades University of Washington Dr. Karn Sorasaenee California Institute of Technology Dr. Dennis Ugolini RSI ’90W Trinity College Academic Staff— MIT Ms. Shaudi Hosseini RSI ’99 Kingston,PA Mr. Christopher Mihelich RSI ’97 Princeton,NJ Ms. Viviana Risca RSI ’99 Port Washington,NY Dr. Jenny Sendova Sofia,Bulgaria Mr. Balint Veto RSI ’98 Cambridge,MA Academic Staff— Caltech Ms. Jacqueline Choi RSI ’97 Leonia,NJ Dr. John Rickert Terre Haute,IN Mr. David Zhang RSI ’99 Overland Park,KS Teaching Assistants — MIT Mr. Allen Chao RSI ’02 Beaumont,TX Mr. Anatoly Preygel RSI ’02 Germantown,MD Ms. Michelle Rengarajan RSI ’02 Pasadena,CA Mr. Prasanna Tambe RSI ’91 Philadelphia,PA Mr. Vivek Venkatachalam RSI ’01 Berkeley Heights,NJ Mr. Kartik Venkatram RSI ’00 Morristown,NJ Teaching Assistants — Caltech Mr. Edward Boas RSI ’94 San Diego,CA Mr. Bryant Mathews RSI ’97 Grand Rapids,MI Ms. Emma Schmidgall RSI ’02 Golden Valley,MN Ms. LeeAnn Tzeng RSI ’90E Hanover,NH Technical Assistants Mr. Ho Seung Paul Ryu (MIT) RSI ’03 Pacific Grove,CA Mr. David Jake Wildstrom (Caltech) RSI ’97 Kensington,MD Student Counselors — MIT Ms. Cynthia Chi RSI ’03 Sugar Land,TX Mr. Logan Dean RSI ’03 Quantico,VA Ms. Julie Finkelstein RSI ’03 Woodland Hills,CA Ms. Aidan Parker RSI ’02 Cambridge,UK Mr. Arup Sarma RSI ’03 Shawnee Mission,KS Mr. Jacob Temme RSI ’02 Casper,WY Student Counselors — Caltech Mr. Steve Byrnes RSI ’02 Lexington,MA Mr. Jon Hanover RSI ’03 Chesterland,OH Ms. Corinna Zygourakis RSI ’01 Houston,TX Preface
The Research Science Institute 2004 Compendium displays the work of 109 academically talented high- school students who participated in the 2004 Research Science Institute (RSI) administered by the Center for Excellence in Education with the sponsorship of the Massachusetts Institute of Technology in Cambridge, Massachusetts, and (in its inaugural year of sponsorship) the California Institute of Technology in Pasadena, California. This unique six-week summer program combines theoretical classroom instruction in mathematics and science with one-on-one research experience with scientists from the sponsoring institutions and various nearby universities, private corporations, and research organizations in and around Cambridge and Pasadena. It is the only U.S. program of its kind offered at no cost to participants. The scientists and their respective organizations are acknowledged in the headings of the reports. These reports are the culmination of students’ research in mathematics, physics, chemistry, biology, computer science, and engineering. The faculty of the Institute, together with a team of outside experts, selected five outstanding written reports from each program site for special recognition. Panels of visiting educators and scientists also designated five projects from each site as examples of outstanding oral presentation. The chosen written reports are printed first in their entirety,† followed by the abstracts of the commended oral presentations, and finally by the remaining student abstracts. The Institute has drawn students from all over the U.S. and from 45 other countries. In 2004 the 109 students represented 39 states of the U.S. and the District of Columbia, as well as the countries of Bulgaria, China, Germany, Greece, India, Israel, Jordan, Kuwait, Lebanon, Morocco, Pakistan, Poland, Saudi Arabia, Singapore, Turkey, and the United Kingdom. The students were all selected having completed coursework comparable to or exceeding that of the third year of high school. The U.S. students averaged 79 of 80 points on the mathematics section and 74 of 80 points on the verbal sections of the Preliminary Scholastic Aptitude Test (PSAT), representing the 99th percentile of all high-school students taking the PSAT. The inestimable academic and social value of the Institute to scientifically talented students is evident from the following praises from participants of RSI 2004:
I have never felt better than the moment I realized I was a “Rickoid.” RSI assured me that I am on the right path . . . that I will want to major in math or science and choose a career in math and science. I learned what real research is like. It made me realize that research requires a lot of patience and perseverance. This summer has only increased my love for the sciences. I had the best experience in my life; I have improved my leadership skills and made lots of friends. The students are incredible!
The editors are honored to have taken part in the Institute as both students and staff and are pleased to present to you the excellent work of their successors.
Chris Mihelich and Jake Wildstrom, RSI 1997, Editors
† But some appendices containing computer code or other supplementary material have been suppressed to conserve space.
Table of Contents
Outstanding Written Reports — Papers 1
Albert Chen,“Synthesis of 3-amino-2-alkenoate” 1
Jerry Guo,“Regulation of Late-Stage Flower Development by Downstream Genes of the Homeotic Protein AGAMOUS” 9 Wenxian Hong,“Design and Characterization of a Littrow Configuration External Cavity Diode Laser” 21
Samuel Lederer,“The Effect of Chemical Additives on Ice-Spike Formation” 31
John Lesieutre,“On a Generalization of the Collatz Conjecture” 37
Da Lin,“Evaluation of cis-cis HOONO Production by Laser Photolysis through Modeling of Reaction Kinetics” 47
Po-Ling Loh,“Closure Properties of D2p in Finite Groups” 57
Renee Park,“The Effects of Atmospheric Turbulence on Simulated Starlight in a Low-Pressure Telescope Environment” 63
Antoni Rangachev,“On the Solvability of p-adic Diagonal Equations” 71
Jacob Sanders,“Glucocorticoid Regulation of Cell Survival in Normal and Cytokine-Treated Gastric Chief Cells” 77
Outstanding Oral Reports — Abstracts 89
Szymon Acedanski´ ,“Using Algebraic Coding for Transferring Files in Peer- to-Peer Networks”
Khaled Al-Rabe,“Characterization of Co(II) Binding to PAMAM G4-NH 2 Dendrimer in Aqueous Solutions Using UV and EPR Spectroscopy”
Hannah Chung,“Investigation of Graphs Having Identical Path Layer Matrices”
Wenxian Hong,“Design and Characterization of a Littrow-Configuration External-Cavity Diode Laser” Andrew P. Klein,“Improved Copper Resistance in E. coli Using a Heterol- ogous Copper Chaperone from Schizosaccharomyces pombe”
Samuel Lederer,“The Effect of Chemical Additives on Ice-Spike Formation”
John Lesieutre,“On a Generalization of the Collatz Conjecture”
Grace Lu,“The Effect of Wind on the Emission of Grass Pollen”
Jacob Sanders,“Glucocorticoid Regulation of Cell Survival in Normal and Cytokine-Treated Gastric Chief Cells”
David Yang,“Pointed Isoperimetric Inequalities”
Abstracts from Other Papers 93 Synthesis of 3-amino-2-alkenoate
Albert CheÒ
under the direction of Dr. Mohammad Movassaghi and Robert Sindelar Massachusetts Institute of Technology
Abstract form other useful molecules. For example, β-amino- α-hydroxy acids can be used to produce taxol, a The main aim of the project is to synthesize an iso- treatment for tumors; bestatin, an immune system merically pure sample of 3-amino-2-alkenoate, a use- modifier; and oligopeptides that can be used as seda- ful β-amino acid derivative that is a precursor to tives [3]. Also, β-amino acids are used to produce molecules with medicinal properties. We success- β-lactams, a class of antibiotics that includes peni- fully made the product from the reaction of ethyl cillin [1]. acetoacetate and ammonium acetate dissolved in methanol. We also attempted to synthesize 3-pyrrole- 2-alkenoate from the reaction of our product and 2,5-dimethoxytetrahydrofuran. The products of both reactions were chracterized by means of 1Hnuclear magnetic resonance spectroscopy.
1 Introduction Figure 1: 3-amino-2-alkenoate Organic chemistry research has revealed the promis- ing applications of β-amino acids and their deriva- tives, encouraging further study of their properties In this paper we explore an approach to the syn- and methods of synthesis. β-amino acids rarely oc- thesis of 3-amino-2-alkenoate, a useful β-amino acid cur in nature, but they do appear in free form or in derivative (Figure 1). Using ethyl acetoacetate, a β- active peptides [9]. β-amino acids have one additional keto ester, as starting material, we seek a pure sample carbon in the parent chain compared to the more containing only the shown isomer. We also present a common α-amino acids used to make most polypep- reaction mechanism that is plausible given the chem- tides. Therefore, they have different properties and istry of the reaction. We also share our work on a can be used to create new classes of organic molecules. method for the formation of the vinyl pyrrole of the For example, peptides composed of β-amino acids 3-amino-2-alkenoate. are resistant to decomposition by enzymatic activity Thin-layer chromatography (TLC) and gas chro- [6], whereas α-peptides are not. There is also evi- matography (GC) were used to track the progress dence that β-peptides may fold in similar ways to α- of the reaction and verify that it had gone to com- peptides under certain conditions, specifically when pletion. Flash chromatography (column chromatog- nearby monomers have weak intermolecular forces be- raphy) was used to purify the sample and to sepa- tween them [5]. They have also been shown to form rate out components. Nuclear magnetic resonance secondary structures similar to those of α-peptides (NMR) spectroscopy was conducted to ensure the fi- [10]. Since β-peptides may be useful to the human delity of the product. 1H NMR signals vary based on body but are not broken down by digestion, they are the chemical environments of protons in the molecule potentially a novel class of medicinal molecules. In andthereforecanbeusedtoinferastructure. addition to peptides, β-amino acid derivatives can Goals of synthesis include removal of impurities, 2 2004 RSI Compendium
optimization of yield, separation of isomers, and ver- ammonium acetate were purchased from Fisher and ification of product identity. β-amino acid derivative Alfa Aesar respectively. 2,5-dimethoxytetrahydro- samples with isomeric purity are important in form- furan, 10-camphorsulfonic acid, and all organic sol- ing arrangement-dependent molecules because alter- vents were purchased from Aldrich. native isomers may not react identically [6]. Here, we are concerned with the cis-trans isomeric dispar- ity that occurs with double bonds. We seek the cis isomer, with the amino group interacting with the carbonyl group. Cis-trans isomers often either are in equilibrium with each other or are produced coex- isting in the same sample. One efficient method for Figure 2: Ethyl acetoacetate and 3-amino-2- isomeric isolation is to determine a convenient step of alkenoate the synthesis process where the isomers can be sep- arated via chromatography, because the isomers of some molecules have greater differences in physical properties than those of others. 2.4 3-amino-2-alkenoate The reaction to produce the β-amino acid deriva- 2 Experimental Section tive (Figure 2) was conducted as follows. A 250 mL round-bottom flask (RBF) with a magnetic stir bar 2.1 General Procedures was flame-dried using a propane torch. Methanol (150. mL) was poured into the RBFalong with solid All reactions were performed in flame-dried round ammonium acetate (46.2 g). The flask was then cov- bottom flasks fitted with rubber septa under a pos- ered with a septum and evacuated and flushed with itive pressure of argon. Air- and moisture-sensitive argon three times and then put under positive argon liquids and solutions were transferred via sterile sy- pressure. 15.3 mL of ethyl acetoacetate was syringed ringe. Analytical thin-layer chromatography was per- into the flask. The flask was placed over a magnetic formed using glass plates precoated with 0.25 mm spinner and stirred. Theoretical yield was calculated 230–400 mesh silica gel impregnated with a fluores- to be 15.5 grams. cent indicator (254 nm). Thin-layer chromatography Within an hour after the reaction had been begun, plates were visualized by exposure to ultraviolet light a 0.1 mL sample was taken from the flask and diluted and to an indicator solution of p-anisaldehyde fol- with dichloromethane in order to execute a TLC (sol- lowed by heating on a hot plate. Organic solutions vent: 25% EtOAc/hexanes) and a GC (maximum re- were concentrated by rotary evaporation at reduced tention time: 15 minutes). Both demonstrated that pressures and high temperatures. the reaction was proceeding rapidly. The reaction was continued for another 15 hours before another 2.2 Instrumentation sample determined that the reaction was complete. The flask was then placed on a rotary evapora- Proton NMR spectra were recorded with a Mercury tor to evaporate excess solvent. Water was observed Varian (300 MHz or 500 MHz). Chemical shifts for on the edges of the flask. The product was then di- protons are reported in parts per million downfield luted with 300. mL of dichloromethane, which pro- from tetramethylsilane and are referenced to residual duced a solid white precipitate. The precipitate was protium in deuterated chloroform or benzene. NMR vacuum filtered and then rinsed with 2 × 300. mL data are organized as follows: chemical shift, mul- of dichloromethane. The filtrate was poured into a tiplicity (s = singlet, d = doublet, t = triplet, q = separatory funnel and washed first with distilled wa- quartet, m = multiplet), integration, coupling con- ter and then with brine. Because dichloromethane is stants in Hertz, and assignment. denser than water, the organic layer and therefore the product sank to the bottom. The organic layer was 2.3 Materials collected in an Erlenmeyer flask and dried over anhy- drous sodium sulfate. The product was then vacuum- Ethyl acetoacetate was purchased from Aldrich and filtered again and then concentrated in a 1000 mL distilled within the laboratory. The methanol and RBFusing a rotary evaporator. After the solvent 2004 RSI Compendium 3 had evaporated, the yield of the crude 3-amino-2- 0.1 M. Then 0.500 mL of 2,5-dimethoxytetrahydro- alkenoate was 15.0 grams. furan was syringed to start the reaction. Theoretical A 5 cm-wide chromatography column was pre- yield was calculated to be 694 mg. pared as described by McMurry [8]. Approxi- Within one hour, the reaction mixture was sam- mately 1500 mL of solvent was made from 10% pled for TLC and GC. Both indicated the reaction EtOAc/hexanes. Anhydrous sodium sulfate was was complete. Two main compounds seem to have poured into the bottom followed by 8 cm of com- been formed. We hypothesized that they were the cis pressed silica gel. After more sodium sulfate was and trans isomers of the vinyl pyrrole. The molecules added to the top, the product was loaded onto the stained and separated in the same way as the TLC column and was allowed to submerge into the silica standard, but the retention times were drastically dif- before the rest of the column was filled with solvent. ferent from the GC standard for the vinyl pyrrole. 30 mL fractions were taken and a TLC was done on The reaction mixture was worked up by washing each. Fractions containing the material that matched with 25 mL water and 50 mL ethyl acetate and sepa- the characteristics of the product were pooled into a ration in a separatory funnel. Any extra product re- preweighed RBF. Fractions containing other mate- maining in the aqueous layer was extracted by wash- rial were pooled separately. The RBFwas placed on ing with 3 × 25 mL ethyl acetate in the separatory a rotary evaporator to obtain the product. The final funnel. All organic layers were then combined and yield of the pure 3-amino-2-alkenoate was 12.0 grams dried over sodium sulfate. The product was then fil- (77.4%). tered by vacuum to remove sodium sulfate and other NMR spectroscopies were taken of the 3-amino- solid impurities. Solvent was then evaporated under 2-alkenoate. The placement of the hydrogens in the reduced pressure. molecule showed that both the cis and trans forms Flash chromatography was performed on the were present, most likely in rapid equilibrium with product using a 3 cm-wide column and 10 mL frac- each other. Because of the fast cis-trans isomer- tions. The solvent system was 10% EtOAc/hexanes. ization of 3-amino-2-alkenoate, it was unlikely that One pool provided one of the major products, an- the isomers would be separable. Therefore, the com- other provided a product not previously seen and bined isomers were reacted to synthesize 3-pyrrole-2- that appeared to be starting material, and no pool alkenoate considering its slower isomerization. provided the second major product. Impure frac- tions were pooled together to form a final pool, which was subjected to a second flash chromatography after being concentrated. The second column utilized 3% EtOAc/hexanes as solvent in order to induce greater separation. 482 mg of product was recovered. NMR spectroscopy indicated that neither of the primary products of the reaction were the correct Figure 3: 3-amino-2-alkenoate and 3-pyrrole-2- product, implying that unwanted hydrolysis had oc- alkenoate curred. A second reaction, using acetic acid instead of 10-camphorsulfonic acid, gave the same GC results; presumably the same compounds were formed.
2.5 The Vinyl Pyrrole 3Results The synthesis of 3-pyrrole-2-alkenoate (Figure 3) was attempted. 500.2 mg of the 3-amino-2-alkenoate 3.1 3-amino-2-alkenoate product of the previous reaction was weighed out and placed in a 100 mL round-bottom flask. 90.02 mg of The ethyl acetoacetate and the product were both solid 10-camphorsulfonic acid was added, and then UV-active and had virtually the same Rf (retention the flask was covered with a septum. It was sub- factor) values. In 25% EtOAc/hexanes, the Rf value sequently placed under vacuum and flushed with ar- was 0.31. Anisaldehyde caused the starting mate- gon three times before being put under positive argon rial to stain white surrounded by red. The prod- pressure. 38.7 mL of THFwas syringed into the flask uct stained red. These agreed with chromatographic in order to make the molarity of 3-amino-2-alkenoate standards, so TLC indicated that the product was 4 2004 RSI Compendium
3-amino-2-alkenoate. The retention times of the products of the at- The retention times of ethyl acetoacetate were tempted synthesis of 3-pyrrole-2-alkenoate had re- 1.624 and 2.269 minutes. Ethyl acetoacetate does tention times of 2.307 and 2.451 minutes, respec- not behave well in GC procedures, changing between tively. These disagreed with the chromatographic the enol and ketone forms and creating two reten- standards of approximately 5 minutes. These prod- tion times. The retention time of the product was ucts were much more volatile than the desired prod- 4.330 minutes. This agreed with chromatographic ucts, demonstrating that the incorrect products had standards, so GC indicated that the correct product been formed. had been formed. NMR results showed that the incorrect products NMR results are as follows. had formed.
4TheMechanism
A reaction mechanism for the formation of 3-amino- 2-alkenoate was proposed (Figure 4). Ammonium ac- etate is in equilibrium with ammonia and acetic acid. Even though the salt is favored in the equilibrium, the ammonia and acetic acid can still react in the mech- 1 ◦ H NMR (500 MHz, CDCl3,20 C): 7.93 (bs, 1H, anism of the formation of 3-amino-2-alkenoate. The NH), 4.54 (bs, 1H, NH), 4.52 (s, 1H, C=CH), 4.11 acetic acid protonates 1 because of the high density (q, 2H, J =7Hz,CH3CH2), 1.91 (s, 3H, CH3C=C), of electrons around the electronegative oxygen. The 1.26 (t, 3H, J= 7 Hz, CH3CH2). extra proton further increases the tendency for elec- trons to be drawn towards the oxygen, making the carbon nearest to it electron-deficient. This gives the nucleophilic ammonia the ability to donate electrons to the carbon to form 3. The acetate ion, attracted by charge to the amino group, then pulls off a hydro- gen to regenerate acetic acid. Once again, the acetic acid protonates the oxygen due to the oxygen’s effec- tive negative charge. Once 5 is achieved, the proto- 1 ◦ H NMR (500 MHz, C6D6,20 C): 5.00 (s, 1H, nated oxygen makes up a very likely leaving group, C=CH), 4.21 (q, 2H, J =7Hz,CH3CH2), 2.27 (s, and is released as water, taking the carbon-oxygen 3H, CH3C=C), 1.28 (t, 3H, J =7Hz,CH3CH2), bond electrons with it. The lone-pair electrons on NH2 protons not observed. the amino group are forced to bond with the carbon to stabilize the compound, thus forming 6. Because NMR showed that both cis-trans isomers were the amino group is positively charged, it attracts the present in the sample. This was inferred from the two acetate anion and gives up its proton to form 7.In different peaks for the protons in the methyl group the final step, tautomerization occurs as a hydro- and two different peaks for the protons in the amino gen is transferred from within the molecule to the group. The hydrogen bonding of the amino group to amino group, causing a rearrangement of bonds. Be- the nearby carbonyl group produced the noticeable cause of the stability of the hexagonal ring, 3-amino- difference. 2-alkenoate is unlikely to participate in the reverse reaction. 3.2 3-pyrrole-2-alkenoate Overall, the acetic acid is not consumed and is therefore a catalyst. The ammonia, however, reacts The products of the attempted synthesis of the vinyl with the ethyl acetoacetate. Water is formed as a pyrrole were not UV-active and had Rf values of 0.31 product, which was observed in the reaction. The and 0.41, respectively. These agreed with chromato- reaction does not go to completion quickly; it takes graphic standards for the two cis-trans isomers of 3- several hours to finish, which is perhaps due to the re- pyrrole-2-alkenoate, indicating that the correct prod- versibility of each step of the mechanism. The mech- ucts had been formed. anism fits the observations we made. 2004 RSI Compendium 5
Figure 4: Proposed mechanism for the production of 3-amino-2-alkenoate
5 Future Studies 6Conclusion
Further investigation into the viability of the method We have found that the synthesis of 3-amino-2- of synthesizing 3-pyrrole-2-alkenoate from 3-amino- alkenoate from ethyl acetoacetate is a viable and 2-alkenoate is needed. Research into the production moderately efficient method and presented a mech- of β-amino acids from the molecules synthesized here anism explaning the reaction’s success. This prod- could provide this work with numerous medicinal ap- uct was purified and has been isolated with 77.4% plications. Other pathways from ethyl acetoacetate yield. We have also found, however, that one method to 3-pyrrole-2-alkenoate should also be studied (Fig- of producing 3-pyrrole-2-alkenoate from 3-amino-2- ure 5). alkenoate is not viable. Our findings build onto the growing knowledge pool of how to best synthesize β- amino acid derivatives for eventual commercial or in- dustrial use. 6 2004 RSI Compendium
Figure 5: Other pathways currently being explored
7 Acknowledgments I thank Chris Mihelich for editing my paper and giving me advice on my mentorship and my presen- I would like to thank my mentor Professor Moham- tation. I acknowledge Paul Ryu for helping me with mad Movassaghi for teaching me the concepts of or- technical and computing issues. I acknowledge Alan ganic chemistry and overseeing the direction of my Chao and Kartik Venkatram for teaching me how to project. I would also like to thank my laboratory use Athena and LATEX to write my paper. mentor Robert Sindelar for demonstrating, teaching, Finally, I would like to thank the Research Science and supervising the lab techniques of my research Institute and the Center for Excellence in Education and for getting me the materials and information I for providing me with the opportunity to perform this needed. My fellow lab workers deserve credit as well, research and immersing me in a stimulating intellec- especially Allison, Matt, Giovanni, Diana, and Mike. tual environment. 2004 RSI Compendium 7 References
[1] C. Gomersall. Beta-lactams. Available at http://www.aic.cuhk.edu.hk/web8/ beta-lactams.htm (2004/07/23). [2] D.L. Pavia, G.M. Lampman, and G.S. Kriz. Or- ganic Laboratory Techniques. 3rd ed. Saunders College Publishing, Philadelphia, PA (1998). [3] G. Cardillo and C. Tomasini. Asymmetric syn- thesis of β-amino acids and α-substituted β- amino acids. Chemical Society Reviews (1996), 117–128. [4] G.M. Loudon. Organic Chemistry. 3rd ed. The Benjamin/Cummings Publishing Company, Inc., Redwood, CA (1995). [5] G.P. Dado and S.H. Gellman. Intramolecular hydrogen bonding in derivatives of β-alanine and γ-amino butyric acid: Model studies for the fold- ing of unnatural polypeptide backbones. Journal of the American Chemical Society 116 (1994), 1054–1062. [6] G. Zhu, A. Chen, and X. Zhang. Highly efficient asymmetric synthesis of β-amino acid deriva- tives via rhodium-catalyzed hydrogenation of β- (Acylamino)acrylates. Journal of Organic Chem- istry 64 (1999), 6907–6910. [7] J. Kling. Beta-peptide antimicrobials. Available at http://www.chemistry. org/portal/a/c/s/1/feature_pro.html? id=5ef69da2d21111d5e8e54fd8fe800100 (2004/07/03). [8] J. McMurry. Organic Chemistry. 4th ed. Brookes/Cole Publishing Company, Pacific Grove, CA (1996).
[9] N. Sewald. Synthetic routes towards enantiomer- ically pure β-amino acids. Angewandte Chemie International Edition 42 (2003), 5794–5795. [10] S. Krauthauser, L.A. Christianson, D.R. Pow- ell, and S.H. Gellman. Antiparallel sheet forma- tion in β-peptide foldamers: effects of β-amino acid substitution on conformational preference. Journal of the American Chemical Society 119 (1997), 11719–11720. 8 2004 RSI Compendium Regulation of Late-Stage Flower Development by Downstream Genes of the Homeotic Protein AGAMOUS
Jerry GÙÓ
under the direction of Dr. Toshiro Ito and Professor Elliot Meyerowitz Division of Biology California Institute of Technology
Abstract duces a large number of seeds in a short generational period, but its entire genome has also been sequenced The genes and pathways AG regulates in late-stage and mutant lines can be established [2]. flower development are largely unknown. Several pu- tative downstream genes of AG involved in anther 1.1 Floral Organ Structure dehiscence were identified, including DAD1, MYB26, OPR3, COI1, and RBR1, through a bioinformatics Angiosperm eudicots develop flowers to produce ga- approach. A 35S::AG-GR inducible line was con- metophytes. The flower consists of four concentric structed for timed-induction expression analysis of whorls of organs: (1) sepals, (2) petals, (3) stamens, anther dehiscence versus indehiscence, which revealed and (4) carpels, that sequentially form inward from the likely direct induction of DAD1 by AG. Mutated the floral meristem. Stamens, the male reproduc- AG binding sites of DAD1 and MYB26 were linked to tive organ, and carpels, the female counterpart that the β-glucuronidase reporter gene; the plasmid con- holds the ovules, produce sporogenous cells around structs were transformed into plants for GUS staining floral stage 5. The stamen consists of an anther, the to test in vivo site activity. Results support the hy- site of pollen development, and a stalk-like filament pothesis that AG is continually functional and con- that aids in pollen dispersal. At floral stage 14, the trols late-stage flower development by regulation of flower opens and anther dehisces to release the ma- downstream genes. ture pollen [3].
1 Introduction 1.2 Developmental Model The unifying principle in the flower-development field Floral organ development is a phenomenon that spa- is the ABC model, which is applicable to a wide range tially and temporally involves the specific expression of species. Three classes of homeotic selector genes of a considerable number of genes in a variety of cas- that encode transcription factors (termed A, B, and cades and networks. Since the rise of the molecular C) determine floral organ identity combinatorially. age, much basic research has been devoted to elu- Class-A genes alone specify sepals, A and B together cidating the mechanisms and functions of and re- specify petals, B and C together specify stamens, and lationships between these homeotic genes [1]. For- C alone specifies carpels. In addition, classes A and tunately, because the evolutionary process provides C are mutually repressive. In general, gene products relative physiological and biochemical similarity in are expressed in floral regions that exhibit defects in plants, fundamental discoveries on the model organ- mutants. The resulting transcription factors regu- ism Arabidopsis thaliana can be widely applied to all late floral development through a hierarchy of down- 25,000 flowering species. Arabidopsis not only pro- stream targets [4]. 10 2004 RSI Compendium
Figure 1: Arabidopsis plants ready for tis- Figure 2: Four components of a flower [5] sue sampling or mutant selection [5]
Figure 3: Flower structure of wild-type and AG mutant, as explained by the ABC model [5]
AGAMOUS, a class-C gene that encodes a tran- stage, programmed cell death ruptures the stomium; scription factor of the MADS-box family, plays a cen- the tapetum and middle layer degenerates; the en- tral role in flower development. The MADS-box fam- dothecial layer expands, and connective cells form ily transcription factors bind to a target DNA se- fibrous bands. A lipid-derived signaling compound quence, the CArG box, as dimers. Since AG is re- found in many plant species, jasmonic acid, was re- sponsible for reproductive organ development, mu- cently discovered to be actively involved in the control tants have a whorl pattern of sepal-petal-petal-sepal. of anther dehiscence and male gametophyte develop- The processes, mechanisms, and genes involved in dif- ment [3]. ferentiating whorl 3 and 4 primordia by AG into sta- mens and carpels are largely unknown [5]. Three genes directly regulate this pathway and hence late-stage floral development. The DEFEC- TIVE IN ANTHER DEHISCENCE1 (DAD1) gene 1.3 Late-stage Regulatory Genes in catalyzes the first step of JA biosynthesis by encoding Stamen Development a phospholipase A1 [6]. OPR3 encodes an isozyme of 12-oxophytodienoate reductase that reduces OPDA Like any other developmental process, a hierarchy of for JA biosynthesis [7]. Mutants of both genes are genes regulates anther dehiscence. During this late sterile but can be rescued by exogenous JA. The JA 2004 RSI Compendium 11
signal transduction gene COI1 encodes an F-box pro- components and only 50% of the genetic potential has tein insensitive to JA treatment [8,9]. been achieved, the research holds significant applica- On the other hand, MYB26 controls the mechan- tions to improving food yield, genetically engineered ical rather than the biochemical aspect of anther de- crops, and breeding programs. At the basic-science hiscence. Mutants produce fertile pollen when me- level, the findings will also help elucidate the general chanically released from the anthers. Therefore, not behavior of plant homeotic proteins and further the only does the gene directly regulate late-stage devel- floral developmental model [2]. opment, it is also a valuable tool for manipulating male sterility. However, the endothecial layer lacks wall fortifications and does not shrink, which is nec- 2 Materials and Methods essary for retraction of the anther wall. The gene en- codes a putative R2R3-type MYB transcription fac- 2.1 AG-Inducible System tor that activates the phenylpropanoid pathway to To reveal downstream processes of AG, a strain provide lignin residues for wall thickening [10]. with AG-inducible activity was constructed. This strain was homozygous for the ag-1 -null mutation 1.4 Purpose and Significance and transgenic for 35S::AG-GR, a promoter-driven The genes and developmental processes that AG reg- constitutive AG gene with a fusion at the carboxy- ulates throughout reproductive-organ development terminal to the steroid-binding domain of the rat glu- are in large part still left to be discovered. A key cocorticoid receptor. This genotype of ag-1 mutants question to be answered is: does AG, which repre- results in flowers with an indefinite sepal-petal-petal sents a typical plant homeotic protein, act as a switch whorl pattern [13]. that activates various developmental processes which However, continual treatment with dexametha- then proceed without its expression, or does AG con- sone (DEX), a steroid hormone, leads to a transloca- tinually need to be expressed? In animals, the latter tion from the cytoplasm to the nucleus of the fusion tends to be true, as illustrated by the Drosophilia protein. The resulting flower has functional stamens Ultrabithorax (UBX) homeotic gene [11]. and carpels and resembles the 35S::AG plants, which The question leads to two extreme hypotheses on implies that the AG-GR fusion protein can function how AG functions. One is that AG triggers a few as a normal AG protein. On the other hand, a sin- genes at the top of a cascade during floral develop- gle treatment with DEX only partially rescues the ment, and in turn these AG targets regulate other phenotype as flowers contain petals with stamenoid genes. However, other than SHATTERPROOF2, no structures. putative target genes of AG are known. The second possibility that is explored in this study is that AG 2.2 Plant Materials and Treatments directly and continually regulates the expression of ◦ many genes with different functions that act through- All plants were grown at 22 C under constant il- out reproductive-organ development. lumination in the Landsberg erecta background. Since AG has been recently implicated in mi- Transgenic plants were generated by Agrobacterium- crosporogenesis, an early-stage developmental pro- mediated infiltration. The 35S::AG-GR construct cess [12], it is essential to determine the expression was transformed into wild-type plants and the pri- and regulation of downstream AG genes in late-stage mary T1 transformants were screened by basta selec- development in order to validate the second hypothe- tion. Phenotypic analyses in the ag-1 were performed sis. Therefore, this research will identify and explore in the T1 generation, as well as in the T2 generation the relationships of downstream genes of AG that that was produced by selfing ag-1 /+ T1 plants. regulate anther dehiscence, a late-stage developmen- tal process. In particular, this study will determine 2.3 AG Downstream Genes and Bind- whether the downstream genes are direct or indirect ing Site Identification targets of AG. AG has homologs in a variety of plant species, A review of previous research identified several genes many of which are of great economic interest, and directly implicated in anther dehiscence. To deter- also in humans as oncogenes involved in the onset of mine if these genes are downstream of AG requires the cancer. Because 80% of plant food consists of flower identification of putative AG binding sites to the gene 12 2004 RSI Compendium
promoter. The genomic region of each identified gene was examined for the 16–base-pair perfect and weak consensus binding sequences of AG, which contains the 10-bp ‘CArG-box core’ (5’-CCNN(A/T)4GG-3’) [14].
2.4 Timed-Induction Gene Expression Assay Ag-1 35S::AG-GR plants were DEX treated (10 µM DEX, 0.015% Silwet L-77) either four times to in- duce AG and anther dehiscence or three times to ob- serve indehiscence, in one-day intervals starting at stage 3. Inflorescence containing floral buds were col- lected from stage 8 to stage 13. RNA was isolated and purified from tissue samples of two independent sets by the RNeasy Mini Kit (Qiagen). Total RNA was reverse-transcribed to synthesize cDNA for RT-PCR by the ThermoScript RT-PCR system (Invitrogen). The induction of the identified genes DAD1, MYB26, OPR3, COI1, RBR1, and a lipase con- trol was tested through transcriptional expression levels in semiquantitative polymerase chain reaction with reverse transcription (RT-PCR) using primers DAD1-5’, DAD1-3’, MYB26-5’, MYB26-3’, OPR3- 5’, OPR3-3’, COI1-5’, COI1-3’, RBR1-5’, RBR-4’, LIPASE-5’, and LIPASE-3’ after 42, 26, 28, 28, 28, and 24 cycles, respectively. Electrophoresis gel was run on the PCR products to measure the level of gene expression.
2.5 35S::AG-GR Construct
Figure 4: Cross-section of anthers at different stages. The coding region of AG was amplified from inflores- 1C, connective; E, epidermis; En, endothecium; ML, cence cDNA and cloned into a blunt-ended EcoRI site middle layer; S, septum; St, stomium; StR, stomium of pBluescript SK (Stratagene) to produce pSK-AG. region; T, tapetum; Td, tetrads; TPG, tricellular To produce pSK-AG-GR, pSK-AG was digested with pollen grains; V, vascular bundle [3] EcoRI, filled in, and ligated with a DNA fragment containing a rat glucocorticoid hormone binding do- main, which was excised with BamHI and XbaI from pRS020 (pDeltaGRBX) and filled in. The AG-Gr fragment was released from pSK-AG-GR by digestion with XbaI and ClaI and ligated into the correspond- ing sites of the binary vector pMAT137 that contains tandem cauliflower mosaic virus 35S enhancers and a terminator.
2.6 Plasmid Reporter-Gene Construct and Mutagenesis Figure 5: Phenotypic effects of 35S::AG-GR induc- tion The GUS-DAD1-5’ construct was produced by first amplifying the coding region of the 3.5 kb genomic 2004 RSI Compendium 13
AGAMOUS
Transcription Factors
SPOROCYTELESS/NOZZLE Regulatory Genes Regulator Genes Regulatory cascade?
Microsporogenesis Filament Elongation Anther Morphogenesis Anther Dehiscence
Figure 6: The “switch” hypothesis with late-stage functions unknown
sequence comprising DAD1 and the flanking in- formed according to previously published protocols tergenic sequences from inflorescence cDNA using (Ito, 1997). About 50 µg of crude nuclear extra was primers DAD1-5’A20 and DAD1-5’B. UltraPfu High- loaded onto the 7.5% Ready polyacrylamide gel (Bio- Fidelity DNA polymerase (Stratagene) and wild-type rad) and subsequently blotted onto the PVDFnylon Columbia genomic DNA were used through PCR membrane (Biograd). The AG-GR fusion protein was at an extension time of 4 minutes and then cloned detected with AG-specific antibody using SuperSig- into the pENTR vector and subsequently into pBG- nal west dura extended duration substrate (Pierce). WFS7 for basta selection. An additional construct The band intensity was normalized with band inten- was further mutagenized by reversing the CArG-box sities of small nuclear proteins by Memcode reversible core sequence CC...GG with the primers DAD1-5’MA protein stain kit for PVDFmembranes (Pierce). and DAD1-5’MB and corrected to the perfect con- sensus with DAD1-5MCA and DAD1-5MCB using the QuikChange II XL Site-Directed Mutagenesis Kit 2.8 GUS Staining (Stratagene). Stage 8 to 13 inflorescences were rinsed and stained The MYB26 promoter cloning was accomplished with x-gluc to determine β-glucuronidase activity for with amplification of the coding region by primers GUS expression analysis. Chlorophyll was removed MYB26-5A2 and MYB26-5B at an extension time of by an ethanol series. Cleared whole-mount observa- 7 minutes. The fragment was cloned into pENTR, di- tion was performed according to published protocols. gested with Bst98I, blunt-ended with T4 DNA poly- merase, and the GUS fragment was cut out with NarI, then blunt-ended with Klenow fragment, and 3Results finally ligated to produce the MYB26-GUS fusion. Further mutagenized and mutagenized to perfect AG- Figure 7 indicates that near-consensus AG binding consensus sequence constructs were made by MYB26 sites were found on several putative downstream mutagenesis primers. The MYB26-GUS fragment genes. In particular, the conserved core CArG-box was then cloned into the pKGW gateway vector for sequence in the 5’ promoter region of DAD1 is very plant transformation and selection. All plasmid con- close to the coding region, which makes the gene a structs were sequenced using DAD1 primers 1–7 and likely candidate for AG binding. MYB26 also is a MYB26 primers 1–19. likely candidate since it has two near-consensus AG binding sites on each promoter region. 2.7 Western Blot The idealized Graph 1 was based on the RT-PCR products shown in Figure 8, where a lipase deriva- To check the timing of AG function, dynamic accu- tive gene was the control. Four DEX treatments in- mulation of AG-GR protein in the nuclei caused by duced anther dehiscence and caused a peak in DAD1 repeated DEX treatments was examined. The ag-1 expression around day 8, which thereafter gradually 35S::AG-GR transgenic plants were treated once or declined. Three DEX treatments resulted in a much 4 times with DEX, and the inflorescences were har- lower expression level in DAD1, which quickly disap- vested at stage 1-10 after 3, 5, 7, 9, and 12 days. The peared. MYB26 expression, independent of AG in- crude nuclear extraction and purification were per- duction, reached a sharp peak around day 6 but also 14 2004 RSI Compendium
Figure 7: Sequence Schematics for Identified Putative AG Downstream Genes
Graph 1. Timed-Induction Gene Expression Figure 8: RT-PCR Expression Products. The asterisk denotes four DEX treatments, which ultimately cause anther dehiscence.
quickly disappeared. OPR3, RBR1, and COI showed 4 Discussion no sign of change in expression level. Figure 9 traces normal stamen development with 4.1 Timed-Induction Expression of prolonged AG induction. At the molecular level, four Identified Regulatory Genes DEX treatments transferred enough AG-GR fusion protein into the nucleus where it stayed to cause an- The primary results, illustrated in Graph 1, of the ther dehiscence, as shown by Figure 10. Figure 11 AG timed-induction gene-expression assay reveals a focuses on the stage of anther dehiscence and reveals substantial amount of interesting information about that the threshold for successful completion of devel- the relationship and functions of genes in the late- opment is four DEX treatments. Figure 12 illustrates stage developmental pathway. This component of the correct progression of flower development with the study seeks to elucidate the effects of AG in AG induction. late-stage development through an inducible system 2004 RSI Compendium 15 by examining the expression levels of various puta- dency in MYB26, OPR3, RBR1, and COI1 support tive downstream genes when AG is turned on and the working hypothesis that AG specifically regulates off. First, when AG is induced, the SPOROCYTE- certain genes in late-stage development. In this case, LESS/NOZZLE (SPL/NZZ) gene, which is responsi- AG is on top of the anther dehiscence regulatory hi- ble for microsporogenesis, is quickly expressed. The erarchy and likely controls DAD1 as a direct target. early, rapid induction and subsequent dramatic de- cline in SPL/NZZ expression implicates AG in early- 4.2 Phenotypic Duration Analysis stage development. The remaining genes all regulate anther dehis- Phenotypes of AG-induced inflorescences were also cence, a late-stage developmental pathway. A pro- examined in addition to exploring late-stage devel- nounced difference in expression levels of DAD1 is opmental relationships and functions at the molecu- evident between AG-induced anther dehiscence and lar level. Figure 9 reveals that normal reproductive- indehiscence. The difference can be attributed to organ development, especially in the late stages, re- the three versus four DEX treatments. In other quires continued AG induction. Prolonged AG ac- words, AG induction is responsible for the subsequent tivity throughout floral development indicates that induction of DAD1, which supports the hypothesis AG actively functions in and even controls late-stage of prolonged AG regulation in flower development. pathways. Prolonged AG activity can be seen in Fig- When AG induction is not strong enough to regulate ure 10, which shows the continued presence of the anther dehiscence, downstream expression of DAD1 homeotic protein AG-GR fusion in the nucleus when reaches a small peak shortly before the stage when an- induced. During anther indehiscence, there is no AG ther dehiscence occurs and then quickly disappears. activity, as the fusion proteins quickly dissipated in However, when AG does induce anther dehiscence the first few days, which means all downstream path- through regulation of DAD1, DAD1 expression levels ways are shut down, also illustrated by the expression climb to a much higher peak and only drop slightly. assay. Therefore, the phenotypic results logically jus- The continued expression of DAD1 implies the neces- tify the molecular findings that AG controls anther sary prolonged induction of AG for anther dehiscence dehiscence by direct regulation of downstream genes. to occur. Furthermore, Figure 11 and 12 indicate that an- ther dehiscence occurs between the third and fourth On the other hand, the MYB26 gene does not re- DEX treatments. Obviously, no DEX treatment re- spond to a change in AG induction, as indicated by sults in no AG induction and therefore no stamen very similar expression levels between three and four development at all, reflective of the ag-1 mutant. DEX treatments. However, at this point, the results The fact that more than three DEX treatments are reveal earlier expression of MYB26 than DAD1, even needed for anther dehiscence is evidence that AG is though both regulate the anther-dehiscence pathway. deeply involved in that developmental pathway. In The difference between the two genes is evidence addition, since just one treatment difference days af- that they are involved in separate regulatory cas- ter initial induction determines whether development cades. Since the RNA expression assayed inflores- completes successfully, AG is a critical regulator that cences were collected in bulk across various stages of controls a cascade of other regulatory genes in late- development, differences in expression levels could be stage development, and the absence of AG has been overshadowed. Thus, results from the GUS staining, seen to have far-reaching consequences during those which reveals gene activity spatially and temporally stages. for individual inflorescences, must be analyzed before MYB26 can be classified as a direct, indirect, or non– AG-downstream target. 5 Conclusions Additionally, OPR3, RBR1, and COI1 do not seem to be transcriptionally regulated at all due to 5.1 Late-Stage Developmental the uniform levels in expression, which is in line with Hierarchy the negative lipase control, shown in Figure 8. Since the genes are involved in intermediate steps of various This study revealed that AGAMOUS is indeed func- developmental pathways, it is logical that they would tional in late-stage flower development. In particular, be continually expressed unless a homeotic regulator AGAMOUS was shown to directly control anther de- turns off the entire pathway. The lack of AG depen- hiscence through regulation of the downstream gene 16 2004 RSI Compendium
Figure 9: Continued AG Function Throughout Development
Figure 10: Extended Localization of AG-GR Fusion Protein in Nucleus
DAD1. AG putatively binds to the near-consensus genomic analysis, resulting in elevated levels of DAD1 site on the 5’ promoter region of DAD1, identified by expression that ultimately causes anther dehiscence. 2004 RSI Compendium 17
On the other hand, MYB26 has two near-consensus 5.4 Acknowledgments AG binding sites, but the timed-induction expression I would like to thank my mentors, Dr. Toshiro Ito, assay gave no proof that the regulatory gene is a Senior Research Scientist, and Professor Meyerowitz, direct downstream target of AG. However, MYB26 Division of Biology Chair, for their continued sup- was shown to be expressed earlier than DAD1 even though it also directly controls anther dehiscence. port, guidance, and inspiration. I would also like to thank my tutor, Jacqueline Choi, and fellow students, Therefore, it is probable that MYB26 is involved in staff, and lab members for their invaluable help with an AG-dependent late-stage developmental pathway. the research paper and presentation. I greatly ap- Results also provided the evidence that OPR3, COI1, preciate the opportunity provided by the Center for and RBR1 are not transcriptionally regulated due to Excellence in Education to conduct research through uniform levels of expression. the 2004 Research Science Institute. 5.2 Necessity of Prolonged AG Induc- tion for Flower Development References
On a more general level, prolonged AG activity was [1] H. Yu, T. Ito, Y. Zhao, J. Peng, P. Kumar, and found to be necessary for late-stage development of E.M. Meyerowitz. Floral homeotic genes are tar- normal stamens, although endogenous expression lev- gets of gibberellin signaling in flower develop- els did decrease over time, which implies dependence ment. Proceedings of the National Academy of on more complex hierarchies involving indirect tar- Science 101 (2004), no. 20, 7827–7832. gets in late-stage development. AG was also pheno- typically observed through the inducible system to be [2] S. Harris. Arabidopsis—map makers of required for anther dehiscence. Therefore, the molec- the plant kingdom. Available at http: ular and phenotypic evidence illustrated the direct //www.nsf.gov/od/lpa/news/publicat/ control of late-stage development by AG through a nsf0050/arabidopsis/arabidopsis.htm regulatory hierarchy. In turn, the results also sup- (2004/07/30). port the second hypothesis that AG directly and con- tinually regulates the expression of many genes with [3] R.J. Scott, M. Spielman, and H.G. Dickinson. different functions that act throughout reproductive- Stamen structure and function. The Plant Cell organ development. 16 (2004), S46–S60. [4] E. Meyerowitz. The genetics of flower develop- 5.3 Future Work ment. Scientific American (November 1994), 56– Although the research expanded on the developmen- 65. tal model, much more work needs to be devoted to [5] T. Jack. Molecular and genetic mechanisms of revealing the relationship and functions of these reg- floral control. The Plant Cell 16 (2004), S1–S17. ulatory genes. Results from the GUS staining of the plasmid constructs, which should come in within a [6] S. Ishiguro, A. Kawai-Oda, J. Ueda, I. Nishida, couple months, will more clearly indicate whether and K. Okada. The DEFECTIVE IN ANTHER these downstream genes are direct or indirect tar- DEHISCENCE1 gene encodes a novel phospho- gets of AG in vivo, in addition to determining their lipase A1 catalyzing the initial step of jasmonic spatial expression. The downstream genes that were acid biosynthesis, which synchronizes pollen classified as AG-dependent by current results could maturation, anther dehiscence, and flower open- still turn out to be direct targets since samples will ing in Arabidopsis. The Plant Cell 13 (2001), be analyzed individually. Chromatin immunoprecip- 2191–2209. itation will also be conducted to observe the physical binding of AG to the identified binding sites. Finally, [7] A. Stintzi and J. Browse. The Arabidop- genome-wide microarray analysis can be conducted to sis male-sterile mutant, opr3, lacks the 12- identify more downstream genes of AG that regulate oxophytodienoic acid reductase required for jas- late-stage development since the current research has monate synthesis. Proceedings of the National illustrated how a plant homeotic protein functions Academy of Science 97 (2000), no. 19, 10625– throughout development. 10630. 18 2004 RSI Compendium
Figure 11: Timed-Induction Floral Phenotypes
Figure 12: AG-Induced (4 × DEX) Inflorescences in Different Stages
AGAMOUS
SPOROCYTELESS/NOZZLE Regulatory Genes Regulatory Genes DAD1 MYB26
Microsporogenesis Filament Elongation Anther Morphogenesis COI1, OPR3
Anther Dehiscence
Figure 13: The “continued” hypothesis of AG function in late-stage development
[8] B.J.F. Feys, C.E. Benedetti, C.N. Penfold, and and H. Saedler. Disruption of Arabidopsis J.G. Turner. Arabidopsis mutants selected for thaliana MYB26 results in male sterility due resistance to the phytotoxin coronatine are male to non-dehiscent anthers. The Plant Journal 34 sterile, insensitive to methyl jasmonate, and re- (2003), 519–528. sistant to a bacterial pathogen. The Plant Cell 6 (1994), 751–759. [11] J. Riechmann. The Arabidopsis book.TheAmer- ican Society of Plant Biologists, Rockville, MD [9] D. Xie, B.F. Feys, S. James, M. Nieto-Rostro, (2002). and J.G. Turner. COI1:AnArabidopsis gene re- quired for jasmonate-regulated defense and fer- tility. Science 280 (1998), 1091-1094 [12] T. Ito, F. Wellmer, H. Yu, P. Das, N. Ito, M. Alves-Ferreira, J.L. Riechmann, and [10] S. Steiner-Lange, U.S. Unte, L. Eckstein, C. E.M. Meyerowitz. The homeotic protein AGA- Yang,Z.A.Wilson,E.Schmelzer,K.Dekker, MOUS controls microsporogenesis by regulation 2004 RSI Compendium 19
of SPOROCYTELESS. Nature 430 (2004), 356– 360. [13] T. Ito, H. Sakai, and E.M. Meyerowitz. Whorl- specific expression of the SUPERMAN gene of Arabidopsis is mediated by cis elements in the transcribed region. Current Biology 13 (2003), no. 17, 1524–1530. [14] H. Yu, T. Ito, F. Wellmer, and E.M. Meyerowitz. Repression of AGAMOUS-LIKE 24 is a crucial step in promoting flower development. Nature Genetics 36 (2004), no. 2, 157–161. 20 2004 RSI Compendium Design and Characterization of a Littrow Configuration External Cavity Diode Laser
Wenxian ÀÓÒg
under the direction of Professor Oskar J. Painter California Institute of Technology
Abstract beam with respect to the grating, otherwise known as the Littrow angle θ. Thus, by changing the Littrow An enhanced tunable Littrow-configuration external- angle, precise wavelength tuning can be achieved. cavity diode laser (ECDL) is constructed from inex- The Littrow configuration offers the advantage of pensive commercial components and then character- high efficiency and power but also faces problems of ized. The introduction of an additional mirror par- mode-hopping and beam angular displacement as the allel to the diffraction grating allows the laser to be grating angle is adjusted. tuned without changing the direction of the output beam. The 785 nm ECDL has a typical power out- put of 4 mW at 56.0 mA of current input with an effective tuning range of 13 nm and exhibits excellent output-beam directional stability. An upper limit on its linewidth is placed at 63.0 MHz.
1 Introduction Figure 1: Schematic diagram of external cavity in External-cavity diode lasers (ECDL) have widespread Littrow configuration. Adapted from [10] applications in optical and atomic physics. Such lasers make use of inexpensive yet efficient semicon- ductor diode lasers which are pumped by electri- On the other hand, the Littman-Metcalf config- cal energy [4]. The diode is coupled to an exter- uration [8, 9], as depicted in Figure 2, utilizes a nal cavity that incorporates a diffraction grating as diffraction grating at grazing incidence. The first- a wavelength-selective element, which then provides order diffraction beam goes to an additional mirror, frequency-selective optical feedback to the diode laser which then reflects the beam back to the grating via its antireflection-coated output facet. This con- and into the diode laser as optical feedback. Tun- cept of frequency selective feedback allows the laser to ing is achieved by varying the mirror angle instead achieve narrow linewidth and remarkable tunability. of the grating angle, allowing the grating, and thus Typically, such feedback is obtained through diffrac- the zeroth-order output beam, to remain fixed as tion gratings in either the Littrow or Littman-Metcalf wavelength is changed. Relative to the Littrow de- configurations. sign, the Littman-Metcalf configuration overcomes In the more popular Littrow configuration, shown the problems of mode-hopping and beam angular dis- in Figure 1, the grating is aligned such that the first- placement but does not share as high an output ef- order diffraction from the grating is coupled directly ficiency. Furthermore, the additional components in- back into the laser while the zeroth-order diffraction volved and its design complexity prevent it from sur- is reflected as the output beam. The lasing wave- passing the widespread prevalence of the relatively length is dependent on the angle of the incident laser simpler yet adequately effective Littrow arrangement. 22 2004 RSI Compendium
of a modified optical mount (Newport U100-P2K). The modifications on the optical mount include the tapping of three holes into the front plate and the re- moval of a square section of the plate. The diffraction grating used (Thorlabs GR13-1208) is gold-coated and blazed at 28◦ with 1200 lines/mm on a 12.7 mm × 12.7 mm × 6 mm substrate. A grating mount is constructed to hold the diffraction grating and is sub- sequently attached to the front plate of the modified U100-P2K. The grating is vertically and horizontally adjusted by the mirror mount actuators for coarse wavelength tuning. Figure 2: Schematic diagram of an external-cavity diode laser in the Littman-Metcalf configuration [7]
Here, an external-cavity diode laser is constructed based on the Littrow configuration, in close reference to the design of Hawthorn, Weber, and Scholten [6]. Minor modifications have been made to circumvent the problem of beam angular displacement and to produce a fixed-direction output beam. With a fixed output beam, the flexibility of the ECDL is enhanced as the output beam is now able to undergo fiber cou- pling for delivery to distant applications. The power output and linewidth of the modified ECDL are then determined using instruments such as an optical spec- Figure 3: Schematic diagram of the enhanced trum analyzer (OSA), a scanning Fabry-Perot in- Littrow-configuration external-cavity diode laser terferometer (SFP), and a Michelson interferometer. (Promise), as viewed from above This modified design is not only low-cost and simple to build, but also produces single-mode emissions of varying wavelengths sufficient for most applications, The main modification made to ensure a fixed- particularly pertaining to high-resolution, broadband direction output beam is the addition of a single plane spectroscopy. mirror (Newport 05SD520ER.2) parallel to the tun- ing diffraction grating and located rigidly with re- spect to it. This is done by attaching the mirror 2 Design and Construction to a mirror mount that is then secured to the front plate of the U100-P2K. The laser beam reflects from The design of the external-cavity diode laser pre- the grating and then from the mirror, thus replacing sented here takes advantage of a simple modifica- an otherwise large angular beam displacement with tion of standard Littrow-configured ECDLs to pro- a negligible translational beam displacement. When duce a fixed-direction output beam without signif- the grating is rotated by ∆θ, the beam reflected by icant output-power loss. The ECDL, which was the grating is rotated by twice that angle, or 2∆θ, named Promise (Figure 3), consists of a 70 mW, 785 by the law of specular reflection. Since the mirror nm laser diode (Sanyo DL7140-201S) and aspheric rotates by the same angle as the grating, when the collimating lens (Thorlabs C230TM-B, f =4.5mm reflected beam from the grating strikes the mirror, and 0.55 NA) mounted in a collimation tube (Thor- it is again reflected back by 2∆θ, such that the out- labs LT230P-B). The threaded collimation tube al- put beam remains relatively fixed. This concept of lows for accurate focusing of the collimating lens and adding a mirror to prevent beam displacement has also aligns the lens axis with the laser diode axis. The also been employed in the design of Hawthorn, We- diode-tube assembly is then fixed to the back plate ber and Scholten [6] and is an improvement on the 2004 RSI Compendium 23 previous design by Arnold, Wilson, and Boshier [2]. the ECDL design of Prowl is likely to demonstrate However, due to the differences in blaze angle of grat- the strengths of the modified Littrow configuration ings used, minor design changes were made to the to as great an extent as the initially intended design positions of the mirror and grating so that they were of Promise. parallel and did not obstruct the output beam. The compact nature of the Promise ECDL also provides for simple and efficient temperature regula- 3 Methodology tion of the entire system using a Peltier thermoelec- tric cooler (Melcor CP1.4-71-045L). A 10 kΩ ther- Upon construction, the lasers were characterized mistor (Thorlabs TH10K) attached to the collima- based on certain performance indicators, namely out- tion tube assembly provides feedback to the thermo- put power, tuning range, output-beam directional electric cooler via a temperature controller, thus en- stability, and linewidth. A battery of tests was run suring reasonable thermal stability. Thermal paste using various devices to analyze the laser perfor- is applied at each interface to ensure optimal ther- mance, the methodologies of which will be presented mal conductivity and efficient cooling of the setup. in this section. However, due to time constraints, the In addition to the thermistor used for temperature experiments outlined mainly pertain to the perfor- control, another temperature sensor (LM35) on the mance characteristics of Prowl. Nevertheless, similar collimation tube gives independent readouts. Other experiments can be conducted with the Promise de- improvements made to the ECDL system include the sign to evaluate its effectiveness when time permits. use of a massive aluminum metal base/heat sink to provide inertial and thermal damping. The ECDL 3.1 Alignment of Feedback setup is isolated from the optical table by placing a layer of Sorbothane beneath the baseplate as well as The key to the Littrow configuration is the back- by enclosing the laser with an aluminum cover, which coupling of the first-order diffraction beam from the is also isolated from the laser by Sorbothane. This grating into the laser diode. Without this feedback, dampens acoustic noise and shields the laser from air the Littrow laser cannot achieve single-mode emis- currents, thus providing for great stability and preci- sion and will lase at a wavelength set by the gain sion of the instruments. peak of the semiconductor active region. Thus, it is The design of Promise described above calls for critical to first align the feedback beam before carry- the modifications of several components and the con- ing out further experiments. Preliminary alignment struction of other new parts. During the time spent is done visually by rotating the actuators and merg- waiting for the machining of components, a second ing the secondary feedback and output beams into a ECDL, named Prowl, was constructed using exist- single bright spot on a screen at the output of the ing components. The concept of the latter laser re- laser. Good alignment is confirmed by inspecting the mains the same as the former, making use of the laser spectrum using an optical spectrum analyzer or modified Littrow configuration, which introduces the scanning Fabry-Perot interferometer, and modifica- plane mirror that moves simultaneously with the ro- tions are made until single-mode emission is obtained. tation of the diffraction grating. The only visible dif- Aligning the feedback beam is also considerably eas- ference is that instead of containing the entire ECDL ier when the laser is close to its lasing threshold, since system within a single U100-P2K mount, the Prowl the laser is most sensitive to feedback at this point. design makes use of the back plate of one U100-P2K When the feedback beam is well aligned, interference mount to hold the laser diode and another unmodified fringes are expected in the output beam and the laser U100-P2K mount to provide for manual translation remains single-mode with temperature and current of the grating and mirror. All essential aspects of the changes. laser, such as the thermoelectric cooler, have been included, though less important components such as 3.2 Optical Spectrum Analyzer the independent temperature sensor have been omit- ted. These minor modifications will only serve to An optical spectrum analyzer is an instrument that lengthen the cavity length slightly, reducing the mode displays the wavelength-dependent intensity of in- spacing of the external cavity, but should not signif- put light. This is achieved by means of a rotat- icantly hinder the performance of the laser. Thus, ing diffraction grating that deflects light of different 24 2004 RSI Compendium wavelengths at different angles. A fixed detector mea- ter the normalization of waveforms, the linewidth of sures the intensity of incident light at a given angle, a single wave is then determined. The SFP is also thus determining the wavelength-dependent intensity particularly useful in ascertaining the quality align- as the diffraction grating is rotated. Using this instru- ment of the grating feedback, as multi-mode emission ment, we are able to determine the effective tuning lines can be clearly resolved. range of the Prowl ECDL as well as its output power. The output beam of the Prowl ECDL was directed 3.4 Michelson Interferometer to a fiber coupler leading to the OSA located 70 cm from the output mirror of the laser. The ECDL actu- In a Michelson interferometer, a beam splitter divides ators were rotated so that the diffraction grating was the incident waves from a light source into two sam- tilted about an axis parallel to its grooves, thereby ples, which are then directed along separate paths. giving rise to coarse wavelength tuning. The laser Each beam of light travels a different path length wavelength was continuously tuned until a signifi- along each arm of the interferometer, acquiring a cant mode-hop back to the central gain peak was relative phase difference in the process. When the detected, signaling that the wavelength-dependent beams are reflected back to the beam splitter, they gain had overcome the wavelength-selective influence recombine and interfere to produce a interference pat- of the feedback. This demarcated the upper and tern. By examining the interference in the recom- lower limits of the ECDL tuning range. Five digi- bined beam, information on the source spectrum can tal screen shots were taken across the entire tuning be obtained. Here the Michelson interferometer is range, spaced at approximately equal wavelength in- employed to observe the visibility of fringes produced tervals. The linewidth of the laser can also be ap- at the output beam. As the relative path length is proximated from a single normalized digital scan. lengthened by moving one of the arms away from the Throughout all these experiments, the temperature beam splitter, the beam in the arm gradually loses co- was regulated by the thermoelectric cooler at a ther- herence, causing the fringes to oscillate between con- mistor resistance of 11.0 kΩ, corresponding to a rela- structive and destructive interference and leading to tively constant temperature of 23.9◦C, so as to min- an overall decrease in fringe visibility. A photode- imize the adverse effects of thermal drift. tector placed at the recombined beam measures the intensity of the fringes and translates the data to an oscillating voltage output. The envelope of the oscil- 3.3 Scanning Fabry-Perot Interferom- lating output is then displayed on an oscilloscope. eter This decay in fringe visibility occurs as a result of broadening effects, in particular Lorentz broadening. The scanning Fabry-Perot interferometer provides a Lorentz broadening (or lifetime broadening) results greater resolution for the linewidth measurement as from species in excited states having finite lifetimes. compared to the OSA. The SFP comprises a regu- From Heisenberg’s uncertainty principle, the product lar Fabry-Perot etalon with a fixed mirror arm and of uncertainties in energy levels and lifetimes is on a piezoelectric actuator attached on the other par- the order of . Hence, the finite lifetime of an ex- tially transmitting mirror arm. As voltage is applied cited species leads to a spectrum of energy emitted across the piezostack, the Fabry-Perot cavity length when the species returns to ground state, giving rise constantly changes by a small displacement and thus to broadening of the laser waveform. The fringe vis- varies the standing wavelengths within the cavity. A ibility function, V (τ), for a Lorentz broadened wave detector measures the intensity of light that eventu- is described by the exponential curve ally leaves the cavity via the partially transmitting mirror, while an oscilloscope displays the intensity of V (τ)=exp−πτ∆ν/2, (1) light against the voltage applied to the piezostack, which can later be related to frequency. where τ is the time difference between the two paths The SFP used in the characterization of Prowl and ∆ν is the linewidth of the laser mode [3]. is the Tropel Model 240 Spectrum Analyzer, with a The Michelson-interferometer measurements were finesse of 200 at 700 nm and a free spectral range carried out with a current input of 40.1 mA and (FSR) of 1.5 GHz. The output beam from the Prowl a thermistor resistance of 11.0 kΩ. One arm of ECDL is directed through the SFP to obtain a re- the interferometer was lengthened by 45 cm using a peating series of Lorentzian lineshape functions. Af- Unislide carriage, and the fringe visibility curve was 2004 RSI Compendium 25 plotted by the oscilloscope. Three repetitions were Combined Spectra carried out with the same interferometer alignment 1.1 fitted curve configuration and with the Unislide moving at the 1 same velocity. By fitting the empirical data obtained 0.9 from the oscilloscope on a similar exponential, the 0.8 linewidth of the laser can then be calculated. 0.7 0.6
0.5 Power (mW) 4Performance 0.4 0.3 4.1 Tuning range of ECDL 0.2 0.1 The digital screenshots taken from the OSA were 760 765 770 775 780 785 790 compiled using MATLAB to describe the tuning Wavelength (nm) range of the Prowl ECDL. At 56.0 mA, the wave- length of laser output can be tuned over a 11 nm Figure 4: OSA plot of intensity against wavelength at range from 775 nm to 786 nm by rotating the grating 37.8 mA, showing the tuning range of the ECDL. The alone. At lower current inputs of 37.8 mA, wave- fitted curve represents the projected gain spectrum of length tunability can be further extended to a wider the laser. range of 13 nm between 774 nm and 787 nm, as shown in Figure 4. A qualitative explanation of this phe- nomenon stems from the knowledge that the wave- Combined Spectra 0.55 length at which spontaneous emission occurs is de- fitted curve pendent on the gain spectrum, while the lasing wave- 0.5 length is dependent on both the gain spectrum and 0.45 reflectivity of the grating. Thus, at higher currents, 0.4 the broad gain spectrum tends to overwhelm the in- 0.35 fluence of the narrow band grating reflectivity, mak- 0.3 ing it more difficult for the ECDL to lase based on 0.25 Power (mW) frequency-selective feedback. However, when the cur- 0.2 rent input nears the lasing threshold limit at 33.0 mA 0.15
(Figure 5), the tuning range of the ECDL once again 0.1 shrinks slightly to 12 nm (774–786 nm). This is be- 0.05 cause the amplitude of the gain spectrum decreases 760 765 770 775 780 785 790 with lower currents. Thus, the range of wavelengths Wavelength (nm) at which gain exceeds loss (i.e., possible lasing wave- lengths) is reduced, limiting the tuning range of the Figure 5: OSA plot of intensity against wavelength at ECDL. 33.0 mA, showing the tuning range of the ECDL. The It is also worthwhile to note that the gain spec- fitted curve represents the projected gain spectrum of trum of the ECDL, although approximated by a the laser. Gaussian in Figures 4 and 5, is actually slightly asym- metrical. This is because the semiconductor mate- rial in the laser diode absorbs at shorter wavelengths compared to the wavelengths at which it emits. Since range. Nevertheless, the wavelength-tunability values gain is proportional to the difference between emis- obtained for the Prowl ECDL are reasonably com- sion and absorption of photons, higher gain is ob- parable to commercial standards, demonstrating the tained at longer wavelengths, resulting in a steeper feasibility of the design. gradient on the gain spectrum at shorter wavelengths and an asymmetrical gain spectrum shape. 4.2 Output power of ECDL These wavelength measurements do not take into account the effect of temperature variations, which With the laser wavelength tuned to the gain peak, the should allow the laser to tune over an even wider output power of the Prowl ECDL was measured to be 26 2004 RSI Compendium
approximately 1.1 mW at 37.8 mA of current input is likely to be too large as commercial lasers of the and about 4.0 mW with 56.0 mA of current. It may same standard generally have linewidths in the order seem surprising that the power output of the Prowl of kHz or MHz. Hence, it is reasonable to infer that ECDL is a mere fraction of the typical power output the OSA lacks the necessary resolution bandwidth to of the 70 mW laser diode. However, this is expected resolve the narrow linewidth of the laser. since the diffraction grating used has a high efficiency
of 60%–80% at blaze wavelength. Consequently, most 2.5 of the diode power output will be reflected back into
the diode by the grating instead of being directed into 2 the zeroth-order beam, resulting in the observed low power output. However, the small power coupled out 1.5 of the ECDL is sufficient to accurately characterize the laser and should not pose too great a concern. Power (mW) 1 4.3 Output-beam directional stability 0.5 of ECDL
The output-beam directional stability of the Prowl 774.8 774.85 774.9 774.95 775 775.05 775.1 775.15 775.2 775.25 ECDL was demonstrated by monitoring the tuning Wavelength (nm) of wavelength as the grating is rotated using an OSA located 70 cm from the laser. Despite the high sen- Figure 6: Measurement of linewidth using the op- sitivity of the OSA fiber coupler to beam misalign- tical spectrum analyzer. Note that the linewidth ment, the full tuning range of 13 nm was scanned suc- is limited by the resolution bandwidth of the OSA cessfully by adjusting the grating angle, without any ( ∼ 0.08 nm). need for realignment. This wavelength variation will normally displace the output direction of the laser beam by about 0.45◦ and cause a lateral translation Therefore, the SFP was employed to determine of 5.5 mm at the fiber coupler. However, with the ad- the linewidth of Prowl to greater precision. Af- ditional mirror, the lateral displacement of the output ter single-mode emission was obtained, two adjacent beam for a small change ∆θ in grating angle is now peaks were normalized and spaced at nine divisions given by ∆x ∼ 2L∆θ,whereL is the distance the apart by adjusting the x-axis scale of the oscilloscope beam travels between the grating and mirror. (Figure 7a). Since the FSR of the SFP is given as Given that L ( ∼ 3 cm) is far smaller than 70 cm, 1.5 GHz, the voltage separation on the x-axis be- the geometric ingenuity of the Prowl design has not tween the peaks can now be related to frequency. The only enabled the angle of output beam to remain un- FWHM is then measured on the normalized scale. changed, but also made the lateral displacement neg- Figure 7b shows that at 50 mA, the FWHM of the ligible at approximately 0.47 mm. Such minute shifts waveform is measured to be approximately 0.7 of a are insignificant for most applications, indicating the division. Using simple proportion calculations, the success of the Prowl design in maintaining a fixed- FWHM corresponds to about 120 MHz. At 70 mA, direction output beam. the FWHM is exactly 1 division long, or 170 MHz in terms of frequency. 4.4 Linewidth of ECDL Now, it is imperative to note that the FWHM measurements obtained could refer to either the An attempt to measure the linewidth of the Prowl Prowl ECDL linewidth or scanning Fabry-Perot ECDL was first made using the OSA. A single linewidth. In order to distinguish between the two, Lorentzian lineshape waveform obtained at the laser the effect of tilting the SFP on the waveform ob- emission wavelength at 60.0 mA was plotted and nor- tained is examined. By tilting the SFP slightly, malized. By examining the full width half maximum the reflectivity of the mirrors drops sharply, result- (FWHM) of the waveform, the linewidth of Prowl was ing in a significantly lower finesse. Recalling the re- measured to be approximately 0.08 nm, or 39 GHz, lationship ∆ν = νFSR/F , a lower finesse will give at 60.0 mA (Figure 6), which corresponds to the res- rise to a greater Fabry-Perot linewidth. So, if the olution bandwidth of the OSA. Intuitively, this value Fabry-Perot linewidth were much narrower than the 2004 RSI Compendium 27
Solving for ∆ν yields 2 V ∆ν = − ln f . (3) πτ V0
Substituting the relevant values of τ, V0 and Vf into Equation (3), allows the linewidth of the laser to be easily calculated. Table 1 shows the results of the Figure 7: a) Normalization of adjacent peaks on SFP Michelson-interferometer experiments conducted and oscilloscope. b) Measurement of linewidth on nor- the linewidth values obtained. malized scale using the SFP. τ /ns V0 /V Vf /V ∆ν /MHz 3.0 11.6 8.1 76.2 3.0 11.1 7.6 80.4 laser linewidth, any slight increase in the Fabry- 3.0 10.9 7.7 73.8 Perot linewidth should still not alter FWHM mea- surements significantly. However, it is indeed ob- Table 1: Summary of Michelson interferometer ex- served that by changing the angle of incident light on periment. the SFP, the FWHM measurements do not remain constant. Therefore, one can infer that the Fabry- By averaging the three linewidth values obtained, Perot linewidth must be bigger than that of the laser. the average linewidth of the laser is calculated to be As such, even though the SFP measurements have 76.8 MHz. This is a conservative estimate, but nev- narrowed down the laser linewidth by a factor of 300, ertheless further narrows the linewidth range of the this is still insufficient to totally resolve the narrow laser. However, undulations observed in the empirical laser linewidth. decay curve have rendered the linewidth values less This result is not surprising, as the Tropel Model accurate than desired. This problem can be overcome 240 Spectrum Analyzer is designed for wavelengths by only considering part of the decay graph, where ranging from 600 nm to 700 nm, which is not within the progression seems relatively exponential. Given the wavelength range of the ECDL ( ∼ 780 nm). The that the average velocity of the Unislide carriage was dielectric mirrors of the SFP have a narrow band- 0.20 cm · s−1, and that the timeframe considered was width of wavelengths for which they have high reflec- 70 s, a new value of τ was calculated. Table 2 shows tivity, and beyond this bandwidth, the finesse of the the results of the linewidth calculations under these SFP drops drastically (a value of 100 MHz for the new conditions. Fabry-Perot linewidth would actually imply that the finesse has fallen from 200 to 15). In order to provide τ /ns V0 /V Vf /V ∆ν /MHz greater resolution for the laser linewidth, a SFP with 0.93 8.8 8.1 56.7 a higher finesse at the required wavelength should be 0.93 8.4 7.6 68.5 used. However, the measurements of the SFP have 0.93 8.4 7.7 63.9 served a useful purpose in placing an upper bound- ary on the linewidth of the laser, meaning that the linewidth cannot exceed frequencies on the order of Table 2: Linewidth calculations after additional con- 100 MHz. siderations on the exponential nature of the data curve. Note that the distance travelled by the Unis- The linewidth measurements obtained from the lide carriage is no longer 45 cm. Michelson interferometer experiment were slightly more satisfactory. Assuming that the decay curve ob- tained from the oscilloscope follows the exponential This time, by averaging the three linewidth val- in Equation (1), we are able to describe the exponen- ues, a narrower average linewidth of 63.0 MHz is ob- tained. Considering the mechanical noise produced tial curve using the initial voltage V0 and the final by the Unislide carriage while in motion, which may voltage Vf . Equation (1) then changes to disturb the laser, as well as the limited ability of such Michelson interferometer experiments to mea- V0 exp −πτ∆ν/2=Vf . (2) sure linewidth, a linewidth value below 100 MHz 28 2004 RSI Compendium demonstrates the success of this particular experi- the feedback beam. Alternatively, the grating could ment. In conclusion, after using the three different be deliberately misplaced away from its blaze angle. instruments, OSA, SFP, and Michelson interferome- Lastly, smoother tuning and wider wavelength vari- ter, the Michelson interferometer proved to be most ations can be achieved by applying an antireflection precise, and the final upper bound on the linewidth (AR) coating to reduce the reflectivity of the laser was set at 63.0MHz. diode output facet. External-cavity lasers using op- timally AR coated diodes have achieved continuous tuning ranges of several THz [1]. Thus, an appropri- 5 Discussion ate AR coating will appreciably enhance the tunabil- ity of the ECDL. Prowl has clearly proven itself to be feasible for gen- Future studies could begin with the characteri- eral use as a tunable laser, from its wide tuning range, zation of the Promise design, since time constraints mechanical stability and narrow linewidth. One of have prevented the construction and testing of this the biggest advantages of the design is that it primar- laser. A comparison could be made between the de- ily makes use of inexpensive, readily available optical signs of Promise and Prowl to evaluate the strengths components. The total cost of construction was un- and weaknesses of both lasers. Furthermore, three der US$1000, comparatively cheaper than most com- methods of linewidth measurement have been pre- mercial Littrow-configured ECDLs. Moreover, the sented, although these are not as accurate as desired. ready accessibility of the different parts makes the Higher linewidth resolution could be provided by het- laser especially simple to construct. The additional erodyning and photomixing methods, where the beat mirror introduced has also worked superbly to al- frequency of two ECDLs operating at nearly the same low for coarse tuning without alteration of alignment, frequency is measured. As an extension, both lasers greatly enhancing the versatility of the conventional can be locked and stabilized by the Zeeman effect and Littrow ECDL. Both the Promise and Prowl lasers the radiofrequency (rf) beat between them observed will be used subsequently in experiments pertaining with an avalanche photodiode [12]. to atomic spectroscopy and laser cooling and have helped pave the way for other novel designs of com- pact ECDLs. 6Conclusion A significant problem faced in the construction of the Prowl laser, however, was the ease with which The design of a compact tunable Littrow-configured the laser diode could be blown. This occurred despite external-cavity diode laser was described. Empiri- the use of a current controller that protects the laser cal data from performance tests indicated that the diode with a shunting circuit by dissipating current Prowl ECDL has an effective tuning range of 13 nm, transients when the power cable is detached. The a typical output power of 4 mW and a linewidth of blowing of the diode can be attributed to the loose at most 63.0 MHz. The laser also exhibited excep- fit between the diode ends and diode socket. When tional output-beam directional stability due to the the diode socket was accidentally disconnected, there introduction of a plane mirror parallel to the grating. was no protective circuitry to prevent strong current These superior characteristics of Prowl testify to the transients from entering the diode and ruining its in- advantages of the enhanced ECDL design. The sim- trinsic material properties. Thus, future design con- ple modifications presented here can be applied to siderations should incorporate a firm coupling to give existing Littrow-configuration ECDLs, and the de- a tighter fit between the diode and the socket. sign holds exciting prospects for future applications Several other improvements can be made to the in atomic and optical physics. existing Prowl or Promise design to further extend its applications. These include the installation of a piezostack on the front plate of the U100-P2K and 7 Acknowledgments apiezodiskonthediffractiongrating,soastoen- able fine continuous wavelength tuning across the I would like to extend my sincerest thanks to my men- cavity mode spacing. Secondly, to increase the exist- tor, Professor Oskar J. Painter of the California Insti- ing power output, a diffraction grating with lower effi- tute of Technology, and his graduate students Orion ciency at blaze wavelength could be used so that more Crisafulli and Raviv Perahia for the invaluable guid- power is coupled into the output beam instead of ance and assistance they provided in the course of the 2004 RSI Compendium 29 project. Also, I would like to thank my tutor, Dr. [10] O.I. Permyakova, A.V. Yakovlev, and P.L. John Rickert, and the staff of the Research Science Chapovsky. Simple external cavity diode laser. Institute at Caltech for their input and suggestions To be published (2003). as well as help rendered in times of need. Finally, I would like to express my heartfelt gratitude to the [11] B.E.A. Saleh and M.C. Teich. Fundamentals of Center for Excellence in Education and the Ministry Photonics. John Wiley & Sons, Inc., Canada of Education, Singapore, for giving me this unique (1991). opportunity to enrich myself and expand my hori- [12] C.E. Wieman and L. Hollberg. Using diode zons. lasers for atomic physics. Review of Scientific In- struments 62 (1991), no. 1, 1–20. References
[1] A. Andalkar, S.K. Lamoreaux and R.B. War- rington. Improved external cavity design for ce- sium D1 (894 nm) diode laser. Review of Scien- tific Instruments 71 (2000), no. 11, 4029–4031.
[2] A.S. Arnold, J.S. Wilson and M.G. Boshier. A simple extended-cavity diode laser. Review of Scientific Instruments 69 (1998), no. 3, 1236– 1239.
[3] M. Born and E. Wolf. Principles of Optics. 7th ed. Cambridge University Press, Cambridge, United Kingdom (2002).
[4] L.A. Coldren and S.W. Corzine. Diode Lasers and Photonic Integrated Circuits. John Wiley & Sons, Inc., New York, NY (1995).
[5] T.M. Hard. Laser wavelength selection and out- put coupling by a grating. Applied Optics 9 (1970), no. 8, 1825–1830.
[6] C.J. Hawthorn, K.P. Weber and R.E. Scholten. Littrow configuration tunable external cavity diode laser with fixed direction output beam. Re- view of Scientific Instruments 72 (2001), no. 12, 4477–4479.
[7] K.C. Harvey and C.J. Myatt. External-cavity diode laser using a grazing-incidence diffraction grating. Optics Letters 16 (1991), no. 12, 910– 912.
[8] M.G. Littman and H.J. Metcalf. Spectrally nar- row pulsed dye laser without beam expander. Applied Optics 17 (1978), no. 14, 2224–2227.
[9] K. Liu and M.G. Littman. Novel geometry for single-mode scanning of tunable lasers. Optics Letters 6 (1981), no. 3, 117–118. 30 2004 RSI Compendium The Effect of Chemical Additives on Ice-Spike Formation
Samuel ÄedeÖeÖ
under the direction of Dr. Kenneth Libbrecht California Institute of Technology
Abstract
Solutions of sodium bicarbonate, calcium chloride, ethanol, acetic acid, and methanol, in concentra- tions ranging from 10−7 to 0.1 molar, were frozen under controlled conditions to measure the depen- dence of ice-spike formation on chemical impurities in water. The probability of ice-spike formation was plotted against molarity of solution to determine gen- eral trends for different solutes. The varying effects of different solutes at the same concentrations sug- gested that freezing-point depression was unlikely to cause inhibition of ice-spike formation in solutions, and a model based on dendritic sheet formation was proposed.
1 Introduction
Ice spikes, or spicules, are thin, tapered projections of ice occasionally formed during the freezing of stand- ing water, sleet pellets [3], and icicles [11]. They are also found when distilled water is frozen in ordi- nary household ice-cube trays (Figure 1). This phe- nomenon has been observed and recorded for nearly a century [5], but its causes remain poorly understood. A qualitative model of the phenomenon was put forth independently by Bally and Dorsey in the early 20th century and remains the most widely accepted description of this effect [2, 3, 5, 14]. For the case of water confined to a container, the model outlines Figure 1: Examples of typical ice spikes from water the following sequence (Figure 2). Surface water nu- frozen in standard household trays. The spike pro- cleates around irregularities where it meets the con- trudes from the surface at an angle and tapers from tainer wall and freezes inward, eventually leaving a base to tip. Sources: Morris, Libbrecht [15, 13]. small hole in the surface. Meanwhile, water adjacent to the walls of the container begins to freeze, and the temperature of water in the center of the cube falls 32 2004 RSI Compendium
Figure 3: The probability of ice-spike formation ver- sus temperature for pure distilled water in ordinary Figure 2: A depiction of the Bally-Dorsey model. plastic ice-cube trays. A fan inside the freezing cham- Water freezing on the sides of the cube pushes liq- ber promoted evaporative cooling and facilitated the uid water up through the hole in the surface, where growth of ice spikes. Probability of ice-spike forma- − ◦ it forms a drop whose freezing edges create the ice tion is maximized near 7 C. Source: Libbrecht, Lui spike. Source: Libbrecht, Lui [14]. [14].
property of freezing-point depression: below 4◦C, causing it to expand. The expansion of − · ◦ interior water and the reduction of volume in the in- 1.86 kg C ∆Tf = · mi, (1) terior of the cube increase the pressure on the water, mol pushing it upward through the hole. This results in where ∆Tf is the change in freezing point; m is the the formation of a drop of water immediately above molality of the given solution; and i is the van’t Hoff the hole. The water at the edges of this drop freezes, factor, which describes the dissociation of the solute initiating the tube. As volume within the cube is in- into component ions. For ethanol, methanol, sodium creasingly taken up by ice, the remaining liquid water bicarbonate, and calcium chloride, the i factors are is pushed up through this tube and freezes, lengthen- 1, 1, 2, and 3, respectively. For acetic acid, the value ing the tube. The growth of the tube ceases when the of i varies as the inverse square root of concentration, drop at the top of the tube freezes entirely, which is with values between 1 and 2 in this experiment. Note substantially before the rest of the water in the cube that for low concentrations, such as those involved in is frozen. this experiment, which were less than or equal to 0.1 Previous research has shown that the formation molar, molality is numerically equivalent to molar- of ice spikes in pure distilled water is maximized ity, so these measures will be used interchangeably in when the water is frozen in ordinary plastic ice-cube calculations. trays at a temperature near −7◦C in the presence of Eighty percent of tap-water systems in the United adequate air circulation (Figure 3) [14]. Other re- States use groundwater [7], and the maximum lev- ports find that the probability of ice-spike formation els of chemical impurities mandated by the EPA are is drastically reduced by the presence of impurities in negligible in comparison to dissolved minerals [8], the water [11, 13, 14], an effect which is not addressed so a calculation of the freezing-point depression in in the Bally-Dorsey model. In particular, although groundwater may be used to estimate the freezing- ice spikes form readily in most home freezers when point depression in tap water. A common measure distilled water is used, they rarely form from tap wa- of the presence of these mineral impurities in water ter. It is possible that this is due to the colligative is total dissolved solids (TDS), expressed in mg/L. 2004 RSI Compendium 33
The EPA advises, but does not require, that TDS moved and inspected. not exceed 500 mg/L [6]. Estimating TDS as 1000 For analysis purposes, an ice spike was defined to mg/L, the approximate concentration of impurities be a distinct projection from the ice surface of length in the water may be obtained by consulting Table 1. 0.5 cm or greater. The total number of cubes with ice The total molality of groundwater is approximately spikes was counted, and this number was divided by 0.002 mol/kg, which would produce a freezing-point 50 to estimate the probability of ice-spike formation depression of only 0.004◦C, yet ice spikes are ex- for the given conditions. Further trials were some- tremely rare in tap water. However, there are also times necessary and are reflected in reduced error some reports of regular ice-spike formation in tap wa- margins on the data plots. ter. These facts strongly suggest that factors other than freezing-point depression are involved. The pur- pose of this research is to investigate which chemical 3Results impurities facilitate or hinder ice-spike growth and to determine the variation of probability of ice-spike 3.1 Observations formation as a function of chemical-impurity concen- tration. The experimental data are summarized in Figure 4. Pure water exhibited a probability of ice-spike forma- Component %TDS tion of roughly 0.34. The separate addition of both
Silica (SiO2) 3.0 calcium chloride and sodium bicarbonate in concen- −5 Calcium 11.2 trations less than 10 molar resulted in a slightly Sodium 10.0 higher probability of ice-spike formation, although Magnesium 2.5 this may have been purely due to experimental error. − With increasing concentrations, the two salts showed Bicarbonate (HCO3 ) 62.5 2− similar behavior, drastically reducing the probabil- Sulfate (SO4 ) 10.0 Chloride (Cl−) 5.0 ity of ice-spike formation. These results are consis- tent with the results of a previous study using sodium chloride [14]. Both methanol and ethanol, added sep- Table 1: Relative concentrations of impurities in arately to water, significantly facilitated the forma- groundwater, measured by percentage of total dis- tion of ice spikes, only producing an inhibitory effect − solved solids. Source: Ozsvath [16]. at concentrations greater than or equal to 10 3 mo- lar. Acetic acid produced a larger inhibitory effect than other solutes at very low concentrations, but behaved in a similar manner as as the two salts at 2 Materials and Methods concentrations greater than 10−5 molar. In addition to the number of ice spikes, the ap- The freezer for this experiment was an insulated cop- pearance of the frozen cubes was also observed. Two per cylinder cooled by circulating chilled methanol effects were found to occur at high concentrations. through copper tubing attached to its top and The first was a smoothing of the surface of the cube. sides. The air temperature within the freezer was Cubes consisting of pure, or nearly pure, water usu- maintained near −9◦C, and a small electric fan ally exhibit a few millimeter-scale lumps distributed was mounted inside to promote evaporative cooling. randomly over the surface—features which were al- Household plastic ice-cube trays were cut to fit in- most entirely absent in cubes of solutions of impurity side the tank and, when filled, were positioned on a concentration greater than 10−3 molar. Although wire rack approximately 25 cm above the bottom of no quantitative data were recorded, the appearances the tank. Solutions frozen consisted of sodium bicar- of ice lumps and ice spikes seem highly correlated. bonate, ethanol, calcium chloride, acetic acid, and The second effect was a change in the appearance of methanol dissolved in laboratory deionized water. the insides of the ice cubes. High concentrations of Only one solute was present in each trial. Concen- acetic acid, methanol, and ethanol sometimes pro- trations tested ranged from 10−7 to 10−1 molar. For duced cubes that were nearly transparent. Pure wa- each of the trials, 5 trays, each containing 10 cubes, ter cubes exhibited a cloudy appearance due to gases were filled with the appropriate solution, placed in in the water which came out of solution during freez- the freezer for approximately 6 hours, and then re- ing, leaving bubbles. 34 2004 RSI Compendium
cube trays on the wire rack, which varied slightly from trial to trial. Both of these sources of error could have been addressed, but this would have re- quired a prohibitive amount of time and reduced the amount of data collected.
4 Discussion
The data prompt many questions about the ice-spike phenomenon. The presence of impurities can clearly inhibit ice-spike formation, and this effect becomes more prominent at higher concentrations. However, a preliminary model for ice-spike inhibition based on freezing-point depression seems unlikely due to the scale of the concentrations involved. For instance, a 0.001 molar solution of CaCl2 would depress the Figure 4: Consolidated experimental data. Con- ◦ centrations are molar. Data for sodium bicarbon- freezing point of water by only 0.006 C, which is far smaller than the scale of random temperature fluctua- ate (NaHCO3), calcium chloride (CaCl2), and acetic tions within the freezer. Nevertheless, ice spikes were acid (CH3COOH) were similar, producing a large inhibitory effect at relatively low concentrations. seven times less likely to occur in 0.001 molar calcium chloride than in pure water. The highest freezing- Ethanol (CH3CH2OH) and methanol (CH3OH) ex- point depressions in the experiment, those for the 0.1 hibited no inhibitory properties at low concentrations ◦ and required roughly 100 times the concentration of molar alcohol solutions, were only 0.186 C, a negli- the other solutes to have a similar inhibitory effect. gible amount. Moreover, the probability of ice-spike formation at a given concentration varied widely be- tween solutes, suggesting that colligative properties are not at work. The data suggest that freezing-point 3.2 Sources of Error depression could only play a significant role given two rather unlikely conditions. The first is that solutes This experiment was designed to determine general reach concentrations orders of magnitude larger in trends, so minimizing error was secondary to acquir- drops at the tips of ice spikes than in the original ing substantial amounts of data. Error bars shown solution. The second is that the scale of this ampli- in plots are one standard deviation, calculated using fication effect varies widely between solutes. A fu- Bernoulli statistics. In addition to random error due ture experiment comparing concentrations of solutes to vibrations and minute temperature fluctuations, at the tips of ice spikes with concentrations in the there were several sources of systematic error in this rest of the cube could investigate this model. experiment, most of which were considered inconse- Since freezing-point depression is an unlikely quential. The first of these was the inaccuracy of the source of ice-spike inhibition, a more likely model is concentrations of the solutions frozen. Volumes were based on crystal formation. According to the Bally- measured in a 300 mL beaker accurate to about 3 mL Dorsey model, ice spikes form when most of the sur- and in a 2 L beaker accurate to about 10 mL. Masses face of the water is frozen, leaving only a small hole. were measured with an electronic balance accurate to The convergence of three dendritic crystal sheets usu- 0.1 g. These instruments may have contributed an er- ally forms this triangular hole, which becomes the ror of as much as 10% in the concentrations, which base of the ice spike. Inhibition of the formation of was judged harmless due to the logarithmic distribu- these sheets could easily account for reduced proba- tion of trial concentrations. bilities of ice-spike formation at large concentrations. There were two main sources of human error. The Though this model seems promising, the dependence first was in the identification of ice spikes. The crite- of crystal formation on the chemical properties of the ria were subjective, and verification of appropriate substance is not well understood. An experiment di- length was accomplished by eye, without use of a rectly observing crystal growth in water as a function ruler. The second was in the placement of the ice- of the concentration of chemical additives would be 2004 RSI Compendium 35 necessary to investigate this model. Emma Schmidgall and Bryant Mathews, and Molly Qualitative observations of ice lumps and trans- Peeples for their aid in completing this research. I parent cubes present numerous avenues for future re- would like to especially express my gratitude to the search. It is possible that the ice lumps observed Research Science Institute and the Center for Excel- at low concentrations are formed in a manner simi- lence in Education for making this research possible. lar to ice spikes, whereby liquid water is pushed out through a hole in the nearly frozen surface. Lumps may be formed when water is pushed out too quickly, References causing it to flow over the surface, or when the initial hole is too large, in which case the drop will freeze too [1] G. Abrusci. Question #65. What conditions de- slowly to lengthen into a spike. Since the appearance termine crystal growth? American Journal of of ice spikes appears to be highly correlated with the Physics 65 (1997), no. 10, 941. appearance of ice lumps, it is likely that the same fac- [2] O. Bally. Uber¨ eine eigenartige Eiskrystallbil- tors lead to the inhibition of both. The transparency dung. Helvetica Chimica Acta 18 (1933), 475. of some cubes at high concentrations is a more per- plexing phenomenon, as there is no readily appar- [3] D.C. Blanchard. Shorter contribution: a verifi- ent mechanism for the dissipation of dissolved gases. cation of the Bally-Dorsey theory of spicule for- This effect was only observed in the presence of liquid mation on sleet pellets. Journal of Meteorology solvents, perhaps indicating that dissolved solids and 8 (1951), 268–269. liquids may accumulate differently in the ice during freezing. The correlation between the transparency [4] M. Chen. Ice spike formation induced by den- of ice cubes and the inhibition of ice spikes could not dritic ice sheets. Undergraduate research report, even be qualitatively established within the context University of Toronto (1993). of this experiment. A longer study might make more [5] H.G. Dorsey. Peculiar ice formations. American careful inspections of the cubes in addition to testing Physical Society 18 (1921), no. 2, 162–164. more solutes to gain a better understanding of this effect. [6] United States Environmental Protection Agency. Secondary drinking water regulations: guidance for nuisance chemicals. Available at 5Conclusion http://www.epa.gov/safewater/consumer/ 2ndstandards.html (2004/08/01). Chemical additives in water have been found to in- hibit the formation of ice spikes. Metallic salts have [7] United States Environmental Protection been found to inhibit the formation of ice spikes to Agency. Ground water and drinking water: a large extent, even at low concentrations. Alco- frequently asked questions. Available at http: hols have been found to promote the formation of //www.epa.gov/safewater/faq/faq.html ice spikes, inhibiting ice-spike formation only at con- (2004/08/02). centrations greater than or equal to 10−3 molar. This experiment has not supported a model of the inhibi- [8] United States Environmental Protection tion of ice spikes based on freezing-point depression, Agency. List of contaminants and MCLs. Avail- but has suggested one based on the formation of den- able at http://www.epa.gov/safewater/mcl. dritic sheets. html (2004/08/02). [9] L. Evans, R. McLachlan, and S. Morris. The spikes/whiskers phenomenon. The Physics 6 Acknowledgments Teacher 31 (1993), no. 5, 264–265.
I would like to thank my mentor, Dr. Kenneth Lib- [10] J. Hallett. Crystal growth and the formation of brecht, physics professor and department chair at the spikes in the surface of supercooled water. Jour- California Institute of Technology, for his guidance nal of Glaciology 3 (1960), no. 28, 698–704. and direction in completing this project. I would also like to thank my tutor, Dr. John Rickert, RSI tech- [11] L. Hill, E. Lozowski, and R.D. Sampson. Exper- nology assistant Jake Wildstrom, teaching assistants imental Evidence on Ice Spikes. 36 2004 RSI Compendium
[12] V. Jamieson. Freezer teaser. New Scientist 180 (2003), 36. [13] K. G. Libbrecht. Ice spikes. Available at http://www.its.caltech.edu/~atomic/ snowcrystals/icespikes/icespikes.htm (2004/07/02). [14] K.G. Libbrecht and K. Lui. An investigation of laboratory-grown ice spikes. To appear in Jour- nal of Glaciology. [15] S. Morris. Got spikes on your ice cubes? Avail- able at http://www.physics.utoronto.ca/ ~smorris/edl/icespikes/icespikes.html (2004/07/02). [16] D.L. Ozsvath. Groundwater quality. Avail- able at http://www.uwsp.edu/geo/faculty/ ozsvath/lectures/Groundwater_Quality. htm (2004/08/01). [17] H.F. Perry. Ice spikes—can you explain them? The Physics Teacher 31 (1993), 112. [18] H.F. Perry. A last word on ice spikes. The Physics Teacher 33 (1995), 148–149. [19] H.F. Perry. Answer to Question #65. What con- ditions determine crystal growth? American Journal of Physics 69 (2001), no. 2, 106. On a Generalization of the Collatz Conjecture
John Äe×ieÙØÖe
under the direction of Mr. Zuoqin Wang Massachusetts Institute of Technology
Abstract numbers do eventually reach 1, though there still may be an infinite number of exceptions. Other theorems In this paper we analyze a generalized version of the estimate the number of iterations a typical number Collatz conjecture proposed by Zhang Zhongfu and takes to reach 1 or show that the conjecture holds Yang Shiming. We present a heuristic argument in for certain broad classes of integers (see Lagarias’s favor of their conjecture and generalize a number of survey [5]). fundamental theorems from the original problem. We Here we deal with a generalization of the Col- then obtain results related to properties of the gener- latz conjecture proposed by Zhang Zhongfu and Yang alized conjecture which do not arise in the original. Shiming [8]. We examine the mapping Tn : N → N defined by the rule x 1 Introduction where the pi are exactly pi1 pi2 ...pik the primes ≤ pn that Tn(x)= The Collatz conjecture is one of the long-standing divide the numerator, open problems of mathematics. Apparently first pn+1x +1 ifnopi ≤ pn divides x. posed by Lothar Collatz in the 1930s, it has since For example, with n = 3, we multiply by 7, add 1 withstood every attempt at proof. Since the problem and then divide out by 2, 3, and 5. Taking x =37as was passed around orally for many years before any our initial value, the trajectory of x under iteration articles were written about it, it goes by a number of T is 37 → 260 → 26 → 13 → 92 → 46 → 23 → of names, including Kakutani’s problem, Hasse’s al- n 162 → 27 → 9 → 3 → 1. Here we investigate the gorithm, Ulam’s problem, and the Syracuse problem. properties of this function under iteration. We note The problem is related to a wide range of topics in that our general case contains nontrivial cycles, which mathematics, including number theory, computabil- are conjectured not to occur in the original problem, ity theory, and the analysis of dynamical systems. and thus formulate our motivating conjecture: The statement of the Collatz conjecture involves the mapping T : N → N where Conjecture 1. For any pn+1 and x0, repeated itera- (i) tion of Tn(x) on x0 will eventually yield Tn (x0)=1 x/2ifx is even, (i) T (x)= or Tn (x ) will enter one of a finite number of non- 3x +1 ifx is odd. 0 trivial periodic cycles. Let T (i)(x) denote the result of i iterations of T on These nontrivial cycles are discussed more fully in x. We call the sequence of T (i)(x)thetrajectory or x. Section 3.4. A small example is the periodic trajec- The Collatz conjecture states that for all x there is tory 17 → 47 → 37 → 17 →··· for pn+1 = 11. some i such that T (i)(x) = 1. For example, if we start The Collatz conjecture is very similar to the case with x = 7, the iteration goes 7 → 22 → 11 → 34 → n = 1 in our more general statement, except that it 17 → 52 → 26 → 13 → 40 → 20 → 10 → 5 → 16 → also excludes the possibility of nontrivial cyclic tra- 8 → 4 → 2 → 1. The conjecture has been verified jectories. A proof of the general conjecture would not for initial values of x up to 250, but proof remains resolve the Collatz conjecture, as there could still ex- elusive. However, there exist many partial results; ist nontrivial cycles. It is known [5] that if such cycles one theorem shows that asymptotically, “nearly all” exist they must have period at least 275,000. 38 2004 RSI Compendium
While a great body of work pertaining to the orig- Proof. Consider the set of residues of A taken mod inal conjecture (our n = 1) exists, the only paper pi. There are no 0 residues by definition of A,since | # concerning this generalization is the paper by Zhang pi pn . However, the residues must be evenly dis- Zhongfu and Yang Shiming in which the generalized tributed over the other pi − 1 possibilities. Suppose form is suggested. They do not analyze in it much that some residue k1 in A occurs more often than depth, only demonstrating the existence of nontrivial some other residue k2. When we multiply A by a con- # cycles and discussing a slight further generalization. stant relatively prime to the modulus pn ,wegetthe Using a wide range of techniques, mathematicians same residue system. But there is one such number ≡ # have proved numerous results about the Collatz con- C so that k1 Ck2 (mod pn ). Therefore k1 and k2 jecture, from demonstrating that it holds for “almost must occur with equal frequency among the elements all” large x to showing the truth of certain cases to of A taken mod p . So the residue −1 occurs #(A) i pi−1 demonstrating its connections to rational approxima- times in the set A taken mod pi.Now,thesetkA is # tions of log2 3. It has been attacked with graph the- thesameresiduesystemasA for (k, pn )=1.Sothe ory, analysis, elementary number theory, and ergodic residue −1 occurs there this same number of times. theory, with each of these approaches yielding new Now, in the set kA + 1 the residue 0 occurs exactly results. For a more complete overview, the reader this many times, and exactly that many elements of may consult Lagarias’s excellent survey article [5]. kA + 1 are divisible by pi. Lemma 3. We have ∞ 2 Probabilistic Analysis of Tra- −i 2 (n−i)(n ) = n−n/(n−1) jectories i=1 Our conjecture is equivalent to the assertion that for n>1. there are only a finite number of cycles and that the Proof. First, note that mapping contains no divergent trajectories, that is, (i) ∞ ∞ →∞ →∞ −i −i there is no x0 for which Tn (x0) as i .If −i (n ) −i (n ) we assume that for large initial values, the function (n ) =explog (n ) i=1 i=1 acts sufficiently randomly that residues are equally ∞ k # k −i −i distributed modulo pn =(p1p2 ...pn) ,wemay =exp n log n calculate expected trends in the size of the numbers i=1 in our trajectory and then supply some supporting ∞ evidence for part of our conjecture. We specifically =exp− log n in−i . investigate the expected ratio between consecutive i=1 ≤ terms not divisible by primes pn in trajectories. ∞ −i 2 3 ··· So now let k = i=1 in =1/n +2/n +3/n + For a large integer n0, we may estimate the ratio be- # so that tween n0 and n1, the next term coprime to pn . 2 3 4 nk =1+ + + + ··· Heuristic 1. We expect that n n2 n3 1 1 1 2 3 n 2 = 1+ + + ··· + + + + ··· n1 ≈ −pj /(pj −1) n n2 n n2 n3 pn+1 pj . n0 1 j=1 = + k, − 1 1 n Proof. Before proceeding, we must establish two pre- liminary lemmata which are necessary to our argu- whence k = n/(n − 1)2. Thus our product is equal ment. to ∞ 2 −n/(n−1) Lemma 2. If set A is a reduced residue system mod- exp − log n in−i = elog n # ≤ ulo pn = p1p2 ...pn,thenanypi with i n divides i=1 precisely #(A)/(p −1) elements of the set kA+1 for − − 2 i = n n/(n 1) , any k not divisible by any prime less than or equal to pn. as desired. 2004 RSI Compendium 39
Now we return to the heuristic. The first step is that instead this ratio converges: note first that to determine the expected factor by which a given n pj − 2 prime pi will decrease a number when we divide out (pj −1) pn+1 pj by small factors. Consider a prime pi ≤ pn.The j=1
probability that pi will divide pn+1x +1 exactly m n − − 2 times is equal to the probability that it will divide it =exp log p log p pj /(pj 1) − n+1 j once, times the probability that it will divide it m 1 j=1 more times, times the probability that it will divide it n pj no more times after that. By Lemma 2 with k = pn+1 − =exp log pn+1 2 log pj ∈ (pj − 1) and n0 A, the chance that pn+1n0 + 1 is divisible j=1
by pi once is 1/(pi − 1). The chance that pn+1n0 +1 n 1 1 − 2pj will be divisible by pi exactly m timesisthen − =exp log pn+1 + 2 log pj pj pj(pj − 1) j=1 m−1 1 1 pi − 1 1 n n log pj (1 − 2pj)logpj = m , − − pi − 1 pi pi p =exp log pn+1 2 . i pj pj(pj − 1) j=1 j=1 as described above. It may seem irrational that we It is a short exercise in analytic number theory to treat the first division differently than the others, but show that log p =logN + C + o(1) for some to see why this is plausible, consider the 3x +1 map- p≤N p 1 constant C . Some simple calculus shows that the ping of the Collatz conjecture; when we apply 3x +1 1 second sum converges over all the naturals, and there- to an odd number we may always take out a factor of fore its sum over the primes is some constant C . 2 immediately, but after that we treat it as random. 2 Taking N = pn+1 above, we find that our product is Now, when pi divides a number m times, the factor m − − − − − of reduction is 1/pi . We then expect pi to reduce elog pn+1 log pn+1 C1 C2 o(1) = e (C1+C2+o(1)), pn+1x + 1 by an average factor of which converges to some constant C as n →∞.Cal- ∞ “ ” ∞ 1 −m 2 m pm −m (p ) −pi/(pi−1) culating the value over the first 10,000 primes sug- (1/p ) i = (p ) i = p i i i gests that C ≈ 0.520. We also note that the ex- m=1 m=1 pected ratio seems to be less than 1 for all pn+1,and by Lemma 2. The expected total factor of reduction our apparently tight error bound combined with our calculations over the first 10,000 p provides strong due to all primes up to pn is therefore n+1 evidence for this. n 2 −pj /(pj −1) pj . j=1 3 Structure of Trajectories
The expected ratio is then found as It is very difficult to directly analyze the trajectory of a given number, but we may still obtain some results. n 2 To begin, we present generalizations of several basic n1 ≈ −pj /(pj −1) pn+1 pj . results of Terras and others presented in Lagarias’s n0 j=1 paper [5].
3.1 The Encoding Matrix For n = 1, the Collatz conjecture, the expected One of the most important tools in the study of the ratio is 3/4, which suggests that on a large scale, tra- Collatz conjecture has been Terras’s encoding vector jectories tend to decrease and divergence is unlikely. (called the parity vector by Lagarias), which contains √4 5 3 ≈ entries corresponding to the action of the function for Forn = 2, the expected ratio is 12 0.54836, which suggests that trajectories will tend to decrease even each iteration (whether the image of x is 3x +1or faster. We might expect that this ratio would tend to x/2) [7]. In our general case we define Ak(n)tobethe ≤ 0asn increases without bound since we are dividing matrix whose entries mij satisfy 0 mij action of the function on the initial number under and repeated iteration. For example, the iteration of T 1 a(Ak,i+1) 0 on 13 with p =7yields13→ 92 → 46 → 23 → pn+1 pn+1 n+1 ρ (x)= = =1. 1 b(βi,j) 0 162 → 27 → 9 → 3 → 1. As an encoding matrix, we ≤ p i=1 j≤n pj j n j write Then we have T (1)(x)=λ (x)x + ρ (x)inthiscase. 100101111 1 1 Now consider the case when α has some entry equal A (13) = 121200001 . 1 9 to 0. In this case λ has a numerator of 1 since 321322431 1 a(A1, 1) = 0 and a denominator equal to the prod- ≤ rd uct of all primes pn dividing x, while ρ1 is 0 since For example, the 3 column corresponds to 46, which (1) is congruent to 0 mod 2, 1 mod 3, and 1 mod 5. The c(α1)=0.ThenTn (x)isx divided by these primes, th as it should be. So our base case holds. final column represents 1. We denote the n column (k) th So now assume that we have Tn (x)=λk(x)x + by αn(x) and the vector of the n row as βn(x). We ρk(x) for all k.Whenαk+1 has all nonzero entries, may use the encoding matrix to study the trajectory (k+1) (k) (k) we should have Tn = pn+1Tn +1. Wehave of x. We may derive an important formula for Tn (x) λk+1 = pn+1λk because a(Ak+1, 1) − a(Ak, 1) = 1 in terms of Ak(x). th and the denominator remains unchanged. We also Explicitly, letting αi denote the i column of Ak, th get βi the i row of Ak and defining k +1 a(Ak,i+1) a(A ,j)=#{α : j ≤ i ≤ k, α has no zero entry}, c(αi)pn+1 k i i ρk+1 = = pn+1ρk +1 b(βi,j) b(βi,j)=#{mie : j ≤ e ≤ k, mie =0}, i=1 j≤n pj and (k+1) 0 if some entry of αi is 0, from the definitions of a, b,andc.SoTn (x)= c(αi)= 1otherwise, λk+1(x)x + ρk+1(x)= pn+1λk(x)x + pn+1ρk(x)+1= (k) pn+1 λk(x)x+ρk(x) +1 = pn+1Tn +1, as it should. (k) we can decompose Tn (x) as follows. Now consider the case when αk+1 has entries equal to 0. Here λk+1(x)isjustλk(x) divided by the primes (k) (k) Theorem 4. We have Tn (x)=λk(x)x + ρk(x), dividing the most recent iterate Tn (x). Addition- where ally ρk+1(x)isρk(x) divided by these primes, since a(Ak,1) pn+1 ck+1 = 0 and our bottom changes by this factor. So λk(x)= (k+1) (k) b(βj ,1) we have Tn (x)=Tn (x) divided out by small ≤ p j n j primes. This completes our proof. and k a(Ak,i+1) We may prove various other theorems about the c(αi)pn+1 ρk(x)= . densities and structures of trajectories, but first, we b(βj ,i) i=1 j≤n pj require some definitions. Essentially, the quantity λ measures the over- k 3.2 Matrices: Attainable, Admissible, all multiplicative factor, and ρk makes small correc- tions for the +1s. Note that we may sometimes write and Inflating λi(Ak)andρi(Ak), and here Ak is the matrix repre- We call a matrix Bk attainable if there exists some senting the trajectory of x. n0 such that Ak(n0)=Bk. To see why not all matrices are attainable, note Proof. We proceed by induction on k.First,wehan- that no encoding matrix may contain the sequence dle the base case k =1.Ifα1 contains all nonzero entries, so that a(A1, 1) = 1 and x is divisible by no 10 (1) prime ≤ pn, the image of x is Tn (x)=pn+1x +1. 01 Our formula gives since this would correspond to taking an odd number, 1 dividing it by 3, and getting an even number. If two pn+1 λ1(x)= = pn+1 vectors α and α may occur as consecutive α and 1 1 2 i 2004 RSI Compendium 41 αi+1,wesaythatα2 is a successor of α1,andwe 0 0 0 0 1 1 1 denote the number of possible successors of a vector → , 0 0 1 2 0 1 2 α by n(α). A matrix is attainable if and only if αi+1 is a successor of αi for all 0 ≤ i Following Lagarias, we say that a matrix Ak is admis- 0 0 1 → , sible with length k if λk(Ak) < 1, while λi(Ak) > 1for 2 1 1 1 ≤ i call a matrix Ak inflatingwith length m if λi(Ak) > 1 1 0 ≤ ≤ → . for 1 i m. This means that the multiplicative 2 2 factor is always greater than 1 for the first m columns of the matrix. # k It is clear that there are (pn ) different matrices Figure 1: Possible successors of length k for a given pn. But how many of these are attainable? It is difficult to analyze this in the general case but not altogether too difficult to solve the case n =2(pn+1 = 5). We find that the number for k ≥ 1, with the initial condition a1 = ··· = of attainable matrices grows exponentially with k. f1 = 1. These Fibonacci-type recurrences guaran- tee that our sum ai + bi + ci + di + ei + fi grows Theorem 5. The number of attainable matrices of essentially exponentially. To see this, we rewrite the length k in the case n =2is asymptotically propor- system as Xk+1 = AXk,whereXk is the column vec- k tional to λ1 ,whereλ1 is the unique real solution to tor (ak,bk,...,fk)andA is the matrix x5 − x4 − 2x3 − 2x2 − x − 1=0. While this theorem does not apply to the general 11 case, it suggests how it may be solved and also that 11 the frequency of allowed matrices should decrease as 11 1 . k increases. 11 111 Proof. We have noted that a matrix is admissible 11 1 if and only if each α is a successor of its predeces- sor. Therefore, in estimating the number of attain- − Then X = Ak 1X for k ≥ 1, where X is the col- able matrices, we need only consider the last α.We k 1 1 umn vector (1, 1, 1, 1, 1, 1). By machine computation can then easily derive relations between the number one finds that the eigenvalues of A are 1 and the roots of attainable matrices for k and k +1.Tobegin,we of λ5 −λ4 −2λ3 −2λ2 −λ−1 = 0, of which λ ≈ 2.335 consider all possible pairs of successors. These are 1 is the largest in absolute value. Furthermore, the co- given in Figure 1. efficient in X of the eigenvector corresponding to λ Denoting by a ,b ,...,f the number of 0 ’s, 1 1 k k k 0 is nonzero. Consequently the sum of the entries of 0 ’s, ..., and 1 ’s ending attainable matrices of 1 2 Xk,namelyak + ···+ fk, is asymptotic to a constant length k,wederivethesystem times λk. ak+1 = ak + ek bk+1 = ak + ck ck+1 = ak + bk + fk dk+1 = ak + dk In particular, because λ1 < 6, allowed matrices ek+1 = ak + ck + dk become increasingly less frequent (as a fraction of all fk+1 = ak + bk + dk matrices) as the length k increases. 42 2004 RSI Compendium 3.3 Some Fundamental Results on En- In the Collatz problem, a proof that every number has coding Matrices a finite stopping time would amount to a proof of the conjecture since we could inductively show that each The encoding matrix possesses a number of interest- number > 1 must reach a smaller number, which ing properties that make it useful in the study of our in turn must reach 1 by the induction hypothesis. problem. We generalize two important theorems of Also let ω(x), the coefficient stoppingtime function, Terras below [7]. be the smallest k such that λk(x) < 1. Then let Our first result states that the numbers whose en- Hk = {x : σ(x)=k} and Ik = {x : ω(x)=k}.It coding matrices of length k are identical fall in arith- is clear that Hk ⊆ Ik. In fact the inclusion is not far metic sequences. from an equality. { } Theorem 6. Let S(Ak)= x : Ak(x)=Ak for Theorem 7. I \ H is a finite set. { k k an attainable matrix Ak.ThenS(Ak)= x0(Ak)+ # k−1 ≥ } Proof. Our proof closely follows that of Terras’ for ipn j=1 n(αj):i 0 ,wheren(αj) is the number of possible successors to αj,andx0(Ak) is the least the Collatz problem [7]. To begin, note that when ∈ x satisfying Ak(x)=Ak. Ak is an admissible matrix of length k and x S(A), then for 1 ≤ i ≤ k − 1wehave Proof. We proceed by induction on k, the length of (i) ≥ ≥ A. Clearly, when k = 1 our theorem holds. By the Tn (x) λk(Ak)x x Chinese Remainder Theorem, some x0 exists in the # interval [1,pn ] satisfying A1(x0)=A1. Also, it is by the definition of admissibility. Thus, all x with # ≥ clear that A1(x0)=A1(x) if and only if x = x0 +pn i. length-k matrix Ak have a stopping time σ(x) k. − Further, we see that after we take the image of the Now define ,k(Ak)=1 λk(Ak) for admissible Ak. (k) members of this sequence under A1, they must be Then we have Tn (x)=x + ρk(Ak) − ,k(Ak)x . evenly distributed over the possible successors of α1 The number x has stopping time exactly k when- # when taken modulo pn .Toseethis,considertwo ever ρk(Ak) − ,k(Ak)x<0, that is, whenever x> cases: when we divide by some product of primes ρk(Ak)/,k(Ak). Since this depends only on Ak,there ≤ pn, we get a sequence of numbers with some fixed are only finitely many x ∈ S(Ak)forwhichσ(x) = residues modulo each pj by which we did not divide ω(x). So if Ak is admissible then all sufficiently large and with all possible choices for the residue modulo x in S(Ak) have stopping time k. each pi by which we did divide, and these are equally Now suppose that Ak is not admissible. If no distributed. The number of these possibilities is ex- initial segment of Ak is admissible, then Ak is inflat- actly n(α1). And when we map by pn+1x +1,we ing and λk(Ak) > 1, so no members of S(Ak)have # have only one possible residue modulo pn ,whichis stopping time k. If some initial segment of Ak is ad- still the equidistribution we seek. missible, then both ω(Ak)andσ(Ak)arelessthank Now assume the theorem holds for a length for sufficiently large elements of S(Ak), and only a { k. That is, we assume S(Ak)= x0(Ak)+ finite number of elements of S(Ak)mayhavestop- # k−1 ≥ } ipn j=1 n(αj):i 0 . After we take the image of ping time k. This, taken with our first argument and some x under the mapping described by Ak,wemust the observation that there are only a finite number of be left with one of the n(αk) possible residues. None matrices of length k, proves our theorem. of these possible residues occurs more often than any other by an argument analogous to that above. Our Corollary 8. Hk and Ik have the same asymptotic sequence must therefore be split in n(α )newarith- density. Further, this asymptotic density is given by k metic sequences with the same common difference. F (k)= weight(A), Therefore the common difference is multiplied by this A admissible factor n(αk). There must also still exist some x0 for length(A)≤k each Ak. These facts imply our inductive step and thereby our theorem. where k−1 −1 # We generalize two functions of Terras [7]. Let weight(A)= pn n(αj) . σ(x), the stoppingtime function, be the smallest k so j=1 (k) that Tn (x) they have equal asymptotic density. Our weight func- The prime pn+1 = 61 seems to be the only prime less tion is derived from Theorem 6 and makes sense on a than 300 for which more than one cycle exists, but fairly intuitive level; when all possible successor ma- we cannot be sure that there are other cycles that trices to some matrix are still admissible, our weight eluded our computer search. remains unchanged. But when some of these are not, our weight corresponding to that particular branch is reduced by some factor. The sum of the weights over 4 A Further Generalization all admissible vectors of length k must be the density We may consider a more general version of our prob- of numbers with stopping time ≤ k. lem, replacing pn+1x + 1 in the definition of Tn(x) with p x + K for some K with (K, p#)=1.We Conjecture 2. limk→∞ F (k)=1. n+1 n still divide out by primes ≤ pn as in our original prob- We conjecture that the density of numbers with lem. These generalization is also posed and briefly finite stopping time is 1. This is a known result in the discussed by Zhang and Yang [8]. We may derive Collatz conjecture. We have attempted to resolve this several simple theorems based on this map relating conjecture by using combinatorial methods to bound cycles between two different K in certain cases. the sizes of the sets of attainable, admissible, and in- flating vectors of length k and then applying a bound 4.1 Basic Results on the weight function. However, our methods have not achieved strict enough bounds to ensure conver- Theorem 10. A cycle in the mappingusing pn+1x+ gence to 1. k maybetransformedtooneinpn+1x + kq when q is not divisible by any prime ≤ pn. → → ··· → 3.4 Nontrivial Cycles Proof. Consider a trajectory m1 m2 mi →··· that eventually becomes cyclic and is in its One conjecture about the Collatz mapping is that for cycle for i ≥ s.Wehavems+1 =(pn+1ms + k)/Fs+1 our n = 1, there are no nontrivial cycles (i.e., no where the Fi are divisible only by primes ≤ pn.So numbers go into repeating loops that do not include now (qms+1)= pn+1(qms)+kq /Fi and we see 1). However, in our general case, many such cycles that (qms,qms+1,...) is a cycle in the mapping with exist and we may find them fairly easily. As noted in pn+1x + kq. the introduction, the simplest such cycle occurs with pn+1 = 11, when we have the loop 17 → 47 → 37 → We may also obtain the following stronger but less 17 → ···.Agivenpn+1 may lead to multiple such general result: cycles; pn+1 = 61 seems to be the smallest such case, Theorem 11. Suppose we replace pn+1x +1 by a though we have not proven this. We may show that pn+1x + pn+1k in the definition of Tn(x). Then for a agivenpn+1 cannot have too many small nontrivial given k and pn+1, all mappings of this form contain cycles: the same number of cycles. → → Theorem 9. For a given pn+1,thereexistonly Proof. As above, consider a trajectory m0 m1 → ··· → → ··· a finitely many cycles of any given length k ≥ 1. m2 mi so that pn+1mi−1 + pn+1k = Fimi with Fi containing only prime factors ≤ pn and Proof. Our theorem is a consequence of Theorem 7. mi containing no such factors. Now define li and ni li Consider the set C of smallest elements of cycles of so that mi = pn+1ni with pn+1 ni, and consider two length k. All elements of C have some finite ω(x) cases. Suppose first that li ≥ a.Then (k) since Tn (x)=x,sothatλk(Ak(x)) < 1 because a mi+1 =(pn+1mi + pn+1k)/Fi+1 ρk(Ak(x)) must be positive. However, no x ∈ C has li a a finite σ(x), since x is the smallest element of the = pn+1(pn+1ni)+pn+1k /Fi+1; cycle. By Theorem 7, the set C is finite. li+1 a li−a+1 pn+1ni+1 = pn+1(pn+1 + k)/Fi+1,