<<

populations have diverged from this microbial autoimmune,­ inflammatory and metabolic Immunology, Stanford University School signature (Fig. 1). disorders in industrialized populations9,10. of Medicine, Stanford, California 94305, The authors moved beyond focusing on Wibowo and colleagues’ work indicates USA. J.L.S. is also at the Center for species identity: they compared the genes, that there are now two viable alternatives to Microbiome Studies, Stanford University and the predicted functions of the proteins time travel for understanding the composi- School of Medicine. encoded by those genes, for the microbes in tion of ancient microbiomes. Palaeofaeces e-mails: [email protected]; palaeo­faeces with those found in present-day enable the direct investigation of ancient [email protected] samples. Both the industrialized and the microbiomes,­ but the sample age limits the non-industrialized present-day samples had further measurements and experiments that 1. Hooper, L. V., Littman, D. R. & Macpherson, A. J. Science 336, 1268–1273 (2012). a greater prevalence of antibiotic-resistance can be performed. Importantly, this study 2. Karlsson, F., Tremaroli, V., Nielsen, J. & Bäckhed, F. genes than did the palaeofaeces, a finding con- validates that present-day Indigenous popu- Diabetes 62, 3341–3349 (2013). sistent with the ancient microbes being from lations living traditional lifestyles have similar 3. Asnicar, F. et al. mSystems 2, e00164-16 (2017). 4. Moeller, A. H. et al. Science 353, 380–382 (2016). before the era of antibiotic use. Palaeofaeces microbiome compositions to those of ancient 5. Vangay, P. et al. 175, 962–972 (2018). had a high prevalence of genes encoding pro- . It is essential to acknowledge that 6. Dethlefsen, L. & Relman, D. A. Proc. Natl Acad. Sci. USA teins that can degrade the molecule chitin, a most of these present-day populations are 108, 4554–4561 (2011). 7. Wibowo, M. C. et al. Nature 594, 234–239 (2021). component of insect exoskeletons. This find- marginalized, lead a vulnerable existence, and 8. Smits, S. A. et al. Science 357, 802–806 (2017). ing is consistent with human consumption of require exceptional protections to ensure they 9. Blaser, M. J. Cell 172, 1173–1177 (2018). insects, known to be a component of ancestral are not exploited. With ethically conducted 10. Sonnenburg, J. L. & Sonnenburg, E. D. Science 366, eaaw9255 (2019). diets. Insect ingestion was confirmed by the research, these modern populations might 11. Makki, K., Deehan, E. C., Walter, J. & Bäckhed, F. Cell Host authors’ microscopy analysis of material in the open a window on our microbial past. Microbe 23, 705–715 (2018). palaeo­faeces. The authors report many genes 12. Desai, M. S. et al. Cell 167, 1339–1353 (2016).

that were particularly prevalent in industrial- Matthew R. Olm and Justin L. Sonnenburg The authors declare no competing interests. ized samples, including those involved in the are in the Department of Microbiology and This article was published online on 17 May 2021. degradation of mucus in the human gut. Wibowo and colleagues’ study is a remark- Computer engineering able technical achievement. They were able to recover high-quality DNA from microbial organisms that lived thousands of years ago, probably because of the good preservation Superhuman floorplans possible in the dry desert environment in which the samples were located. Multiple inde- for microchips pendent lines of evidence authenticated the sample age and the human origin of the faeces. Andrew B. Kahng Having these ancient DNA sequences available in the public domain will undoubtedly benefit A machine-learning system has been trained to place memory scientists for years to come. blocks in microchip designs. The system beats human experts However, DNA-sequence-based analyses at the task, and offers the promise of better, more-rapidly do have limitations when the results are not paired with validation by other types of lab- produced chip designs than are currently possible. See p.207 oratory experiment. Using computational tools to predict information about proteins encoded by DNA is an imperfect method Success or failure in designing microchips of kilometres of wiring to achieve the designed under ideal conditions, and is particularly depends heavily on steps known as floor­ functionality. Given this staggering complex- tricky when analys­ing gene functions for pre- planning and placement. These steps deter- ity, the chip-design process itself is another viously unknown organisms, such as those mine where memory and logic elements miracle — in which the efforts of engineers, discovered in this study. Moreover, micro­ are located on a chip. The locations, in turn, aided by specialized software tools, keep the biomes are highly variable between individu- strongly affect whether the completed complexity in check. als and between populations. Analyses of more chip design can satisfy operational require- The locations of cells and macro blocks in palaeo­faeces from a wider range of timescales ments such as processing speed and power the chip are crucial to the design outcome. and locations will be needed to better under- efficiency. So far, the floorplanning task, in Their placement determines the distances stand general and population-specific features particular, has defied all attempts at automa- that wires must span, and thus affects whether of ancient human gut microbiomes. tion. It is therefore performed iteratively and the wiring can be successfully routed between The authors found notable differences in painstakingly, over weeks or months, by expert components and how quickly signals can be the composition and function of microbes human engineers. But on page 207, research- transmitted between logic gates. Optimization in palaeo­faeces compared with those of ers from Google (Mirhoseini et al.1) report a of chip placement has been extensively stud- microbes in present-day faeces. The higher machine-learning approach that achieves ied for at least six decades2,3. Seminal inno- prevalence of mucus-degrading species and superior chip floorplanning in hours. vations in the mathematical field of applied genes in industrialized microbiomes than in Modern chips are a miracle of technology optimization, such as a method known as ancient and non-industrialized ones is prob- and economics, with billions of transistors laid simulated annealing4, have been motivated ably driven by Western diets, which often lack out and interconnected on a piece of silicon by the challenge of chip placement. sufficient dietary fibre to support once-nu- the size of a fingernail. Each chip can contain Because macro blocks can be thousands or merous fibre-degrading microbial species11,12. tens of millions of logic gates, called standard even millions of times larger than standard Given the links between the microbiome cells, along with thousands of memory blocks, cells, placing cells and blocks simultaneously and the immune system, these differences known as macro blocks, or macros. The cells is extremely challenging. Modern chip-design might be connected to the rising rates of and macro blocks are interconnected by tens methods therefore place the macro blocks

Nature | Vol 594 | 10 June 2021 | 183 ©2021 Spri nger Nature Li mited. All ri ghts reserved. ©2021 Spri nger Nature Li mited. All ri ghts reserved.

News & views design: with no more than six extra hours of a b fine-tuning steps, the agent can produce floor- Macro block plans that are superior to those developed by human experts for existing chips. Moreover, the agent’s solutions are very different from those of trained human experts (Fig. 1). Arthur C. Clarke famously noted6 that “any sufficiently advanced technology is indistin- guishable from magic”. To long-time practi- tioners in the fields of chip design and design automation, Mirhoseini and colleagues’ results can indeed seem magical. In the past year, experts worldwide have contemplated questions such as, ‘How is it that the agent can initially place each macro block in turn so effectively that the chosen placement is used in the final, manufactured chip design?’ Figure 1 | Microchip floorplans designed by humans differ from those produced by a machine-learning The authors report that the agent places system. An early step in microchip design is floorplanning — the placement of memory components called macro blocks sequentially, in decreasing macro blocks on an empty layout canvas. Floorplanning is immensely complicated because of the vast order of size — which means that a block can number of potential configurations of macro blocks, and it involves multiple iterations as the logic-circuit be placed next even if it has no connections design evolves. Each iteration is produced manually by human engineers, over days or weeks. a, This (physical or functional) to previously placed 8 floorplan for a chip (the Ariane RISC-V processor ) is considered by human designers to be a good one. blocks. When blocks have the same size, the Its 37 macro blocks are close-packed in well-aligned rows and columns, leaving an uncluttered area for agent’s choice of the next block echoes the placement of other components. b, Mirhoseini et al.1 report a machine-learning agent that, in just a few choices made by ‘cluster-growth’ methods7, hours, designs floorplans that outperform those designed by humans. This agent-produced arrangement which were previously developed in efforts is another implementation of the Ariane processor, and is very different from that shown in a. (Image in b adapted from Extended Data Fig. 4 of the paper1.) to automate floorplan design, but were abandoned several decades ago. It will be fascinating to see whether the authors’ use first, in a step called floorplanning. Standard of the current paper to the online arXiv of massive computation and deep learning cells are then placed in the remaining layout repository. It stated that “in under 6 hours, reveal that chip designers took a wrong turn area. Just placing the macro blocks is incred- our method can generate placements that in giving up on sequential and cluster-growth ibly complicated: Mirhoseini et al. estimate are superhuman or comparable” — that is, methods. that the number of possible configurations the method can outperform humans in a Another much-debated question has been, (the state space) of macro blocks in the floor­ startlingly short period of time. Within days, ‘How does the agent’s choice of macro-block planning problems solved in their study is numerous semiconductor-design companies, placements survive subsequent steps in the about 102,500. By comparison, the state space design-tool vendors and academic-research chip-design process?’ As mentioned earlier, of the black and white stones used in the board groups had launched efforts to understand human engineers must iteratively adjust their game Go is just 10360. and replicate the results. floorplans as the logic-circuit design evolves. Viable floorplanning solutions must leave Mirhoseini and colleagues trained a The trained agent’s macro-block placements empty regions on the chip to achieve all of machine-learning ‘agent’ that can successfully somehow evade such landmines in the design the subsequent steps — placement of the place macro blocks, one by one, into a chip lay- process, achieving superhuman outcomes for standard cells, routing of the wiring and out. This agent has a brain-inspired architec- timing (ensuring that signals produced in the maximizing of the chip’s processing speed. ture known as a deep neural network, and is chip arrive at their destinations on time) and However, the optimiz­ations of logic circuitry trained using a paradigm called reinforcement for the feasibility and efficiency with which inherent in these steps can increase the total learning. At any given step of floorplanning, wiring can be routed between components. area taken up by standard cells by 15% or more. the trained agent assesses the ‘state’ of the More­over, Mirhoseini and colleagues’ use of Human engineers must therefore iteratively chip being developed, including the partial simple metrics as proxies for key parameters adjust their macro-block placements as the floorplan that it has constructed so far, and of the chip design works surprisingly well — it logic-circuit design evolves. Each of these then applies its learnt strategy to identify the will be interesting to understand why these iterations is carried out manually, and takes best ‘action’ — that is, where to place the next proxies are so successful. The authors’ inten- days or weeks. macro block. tion to make their code available is invaluable The computer industry has famously been The technical details of this approach, in this light. driven by Moore’s law — the number of compo- such as how to represent the chip-design and The development of methods for auto- nents per chip has roughly doubled every two partial-floorplanning solutions, were devel- mated chip design that are better, faster and years. This rate of advancement corresponds oped with the overarching goal of finding a cheaper than current approaches will help to to an increase in the number of components general, transferable solution to the mac- keep alive the ‘Moore’s law’ trajectory of chip on a chip of about one per cent per week. The ro-placement problem. In other words, the technology. Indeed, for technical leaders failure to automate floorplanning is therefore trained agent should succeed even when con- and decision-makers in the chip industry, the problematic — not only because of the asso- fronted with chip designs that it has not previ- most important revelation in Mirhoseini and ciated time costs, but also because it limits ously encountered, drawn from a wide range of colleagues’ paper might be that the authors’ the number of solutions that can be explored applications and markets. The authors report floorplan solutions have been incorporated within chip-development schedules. that, when their agent is pre-trained on a set of into the chip designs for Google’s next-gen- But everything changed on 22 April 2020. 10,000 chip floorplans, it is already quite suc- eration artificial-intelligence processors. This On that day, Mirhoseini et al. posted a preprint5 cessful when used in a ‘one shot’ mode on a new means that the solutions are good enough for

184 | Nature | Vol 594 | 10 June 2021 ©2021 Spri nger Nature Li mited. All ri ghts reserved. ©2021 Spri nger Nature Li mited. All ri ghts reserved.

millions of copies to be printed on expensive, 1. Mirhoseini, A. et al. Nature 594, 207–212 (2021). integuments are thus clearly very different cutting-edge silicon wafers. We can therefore 2. Markov, I. L., Hu, J. & Kim, M.-C. Proc. IEEE 103, 1985–2003 from each other. (2015). expect the semiconductor industry to 3. Kahng, A. B. Proc. ACM/IEEE Int. Symp. Physical Design Shi and colleagues describe extremely re­­double its interest in replicating the authors’ 15–22 (2021). well-preserved fossils of extinct seed . work, and to pursue a host of similar applica- 4. Kirkpatrick, S., Gelatt, C. D. Jr & Vecchi, M. P. Science 220, They use these remarkable newly described 671–680 (1983). tions throughout the chip-design process. 5. Mirhoseini, A. et al. Preprint at https://arxiv.org/ fossils, as well as other similar-looking fossils abs/2004.10746 (2020). reported earlier2, to build a tree of relation- Andrew B. Kahng is in the Department of 6. Clarke, A. C. Profiles of the : An Inquiry into the ships between plants. Their effort places all Limits of the Possible 41 (Macmillan, 1973). Computer Science and Engineering, and in 7. Sait, S. M. & Youssef, H. VLSI Physical Design Automation: of these fossils on the family tree at a posi- the Department of Electrical and Computer Theory and Practice Ch. 3, 90–95 (McGraw-Hill, 1995). tion close to that of flowering­ plants — they are Engineering, University of California, San Diego, 8. Zaruba, F. & Benini, L. IEEE Trans. Very Large Scale very close relatives of modern angiosperms, Integration 27, 2629–2640 (2019). La Jolla, California 92093, USA. and, as such, might provide clues to their ori- e-mail: [email protected] The author declares no competing interests. gin. The fossils of these ancient relatives of flowering plants show remarkable diversity in the shape of their reproductive structures. Evolution Amazingly, these are not recently discovered fossil specimens; they were collected almost a century ago, deposited in museum collec- Seed for thought tions and only recently unearthed for a second time (this time, from museum drawers by the Douglas E. Soltis authors), with their relevance for reconstruct- ing the plant family tree now finally recognized. The origin and rapid diversification of flowering plants is a These extinct plants had a cup-like struc- long-standing “abominable mystery”, as Charles Darwin put ture, termed a cupule (Fig. 1), that surrounded it. Part of the puzzle – the origin of the protective covering of the developing seed or seeds (each seed itself flowering-plant seeds – is nearing resolution. See p.223 having just one integument, which typifies all gymnosperms, living and extinct). The role of these cupules is unknown, but they might have provided extra protection for the seed, One of the landmark events in evolution is the mystery is how the second (outer) protective or aided its dispersal. appearance of flowering plants, termed angio- layer evolved. The outer integument differs Might the cupule be the precursor to the sperms. On page 223, Shi et al.1 describe fossil from the inner one in its developmental outer protective layer unique to flowering-plant­ evidence that sheds light on a long-standing pathway and in the genetic control govern- seeds? There is another twist (pun intended) mystery about seed evolution. ing tissue-layer formation5,6, and the two to solving the mystery of angiosperm seeds. A number of botanical innovations accom- panied the appearance of flowering plants a Extinct b Present-day and contributed to the organisms’ rapid rise gymnosperm angiosperm to worldwide dominance of terrestrial and many aquatic . Such innovations include floral organs and a special nutritive tissue for the plant embryo — endosperm. In Cupule addition, flowering plants and another plant group, gymnosperms, produce seeds, which provide a protective layer around the develop- Integument ing embryo. This enabled seed plants to truly Embryo conquer the terrestrial environment, over­

taking other land plants such as mosses and Outer Carpel ferns, which do not produce seeds. Seeds have integument also had a central role in enabling human sur- Inner vival by providing an important food source. integument Flowering plants arose from an ancestor in Funiculus the gymnosperms2,3. The fossil record contains many clades of now-extinct gymnosperms, and it is not clear which group gave rise to the angiosperms, although it is definitely not one of the living groups of gymnosperms — plants Figure 1 | Seed evolution. Shi et al.1 present fossil evidence that sheds light on a long-standing mystery such as conifers, ginkgo (Ginkgo biloba) or about the origin of the protective layer, termed the outer integument (or second integument), that 2,3 cycads . surrounds the seeds of flowering plants (angiosperms). It is thought that flowering plants arose from All gymnosperms have just one protective now-extinct plants belonging to a group called the gymnosperms (living members of which include layer, termed an integument, that surrounds conifers). a, Gymnosperms have one integument that surrounds the embryo of a seed. A structure called their seeds, whereas flowering plants have the cupule, which might have provided protection or aided seed dispersal, formed the outer layer of ancient two such layers. In a letter4 to the botanist gymnosperm seeds. b, Flowering plants have inner and outer integuments. Shi and colleagues find that the J. D. Hooker, Charles Darwin described the cupule probably evolved to form the outer integument of flowering plants. Angiosperm seeds are connected origin of flowering plants as an “abomina- to the carpel structure that surrounds them through a stalk called the funiculus. Shi and colleagues suggest ble mystery”. An enigma embedded in this that the funiculus evolved from the stalk of the gymnosperm cupule.

Nature | Vol 594 | 10 June 2021 | 185 ©2021 Spri nger Nature Li mited. All ri ghts reserved. ©2021 Spri nger Nature Li mited. All ri ghts reserved.