Multi-Hop Question Generation with Graph Convolutional Network

Total Page:16

File Type:pdf, Size:1020Kb

Multi-Hop Question Generation with Graph Convolutional Network Multi-hop Question Generation with Graph Convolutional Network Dan Su, Yan Xu, Wenliang Dai, Ziwei Ji, Tiezheng Yu, Pascale Fung Center for Artificial Intelligence Research (CAiRE) Department of Electronic and Computer Engineering The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong fdsu, yxucb, wdaiai, zjiad, [email protected], [email protected] Abstract Paragraph A: Marine Tactical Air Command Squadron 28 (Location T) is a United States Marine Multi-hop Question Generation (QG) aims to Corps aviation command and control unit based generate answer-related questions by aggre- at Marine Corps Air Station Cherry Point (Location gating and reasoning over multiple scattered evidence from different paragraphs. It is a C) ... more challenging yet under-explored task com- Paragraph B: Marine Corps Air Station Cherry pared to conventional single-hop QG, where Point (Location C) ... is a United States Marine the questions are generated from the sentence Corps airfield located in Havelock, North Car- containing the answer or nearby sentences in olina (Location H), USA ... the same paragraph without complex reason- Answer: Havelock, North Carolina (Location H) ing. To address the additional challenges Question: What city is the Marine Air Control in multi-hop QG, we propose Multi-Hop En- coding Fusion Network for Question Genera- Group 28 (Location T) located in? tion (MulQG), which does context encoding in multiple hops with Graph Convolutional Table 1: An example of multi-hop QG in the Hot- Network and encoding fusion via an Encoder potQA (Yang et al., 2018) dataset. Given the answer Reasoning Gate. To the best of our knowl- is Location H, to ask where is T located, the model edge, we are the first to tackle the challenge needs a bridging evidence to know that T is located in of multi-hop reasoning over paragraphs with- C, and C is located in H (T ! C ! H). This is done out any sentence-level information. Empiri- by multi-hop reasoning. cal results on HotpotQA dataset demonstrate the effectiveness of our method, in comparison with baselines on automatic evaluation metrics. single-hop reasoning (Zhou et al., 2017; Zhao et al., Moreover, from the human evaluation, our pro- 2018). Little effort has been put in multi-hop QG, posed model is able to generate fluent ques- which is a more challenging task. Multi-hop QG re- tions with high completeness and outperforms quires aggregating several scattered evidence spans the strongest baseline by 20.8% in the multi- from multiple paragraphs, and reasoning over them hop evaluation. The code is publicly available to generate answer-related, factual-coherent ques- at https://github.com/HLTCHKUST/MulQG. tions. It can serve as an essential component in 1 Introduction education systems (Heilman and Smith, 2010; Lind- berg et al., 2013; Yao et al., 2018), or be applied Question Generation (QG) is a task to automati- in intelligent virtual assistant systems (Shum et al., cally generate a question from a given context and, 2018; Pan et al., 2019). It can also combine with optionally, an answer. Recently, we have observed question answering (QA) models as dual tasks to an increasing interest in text-based QG (Du et al., boost QA systems with reasoning ability (Tang 2017; Zhao et al., 2018; Scialom et al., 2019; Nema et al., 2017). et al., 2019; Zhang and Bansal, 2019). Intuitively, there are two main additional chal- Most of the existing works on text-based QG lenges needed to be addressed for multi-hop QG. focus on generating SQuAD-style (Rajpurkar et al., The first challenge is how to effectively iden- 2016; Puri et al., 2020) questions, which are gen- tify scattered pieces of evidence that can con- erated from the sentence containing the answer nect the reasoning path of the answer and ques- or nearby sentences in the same paragraph, via tion (Chauhan et al., 2020). As the example shown 4636 Findings of the Association for Computational Linguistics: EMNLP 2020, pages 4636–4647 November 16 - 20, 2020. c 2020 Association for Computational Linguistics in Table1, to generate a question asking about “Ma- • To the best of our knowledge, we are the first rine Air Control Group 28” given only the answer to tackle the challenge of multi-hop reasoning “Havelock, North Carolina”, we need the bridging over paragraphs without any sentence-level evidence like “Marine Corps Air Station Cherry information in QG tasks. Point”. The second challenge is how to reason over • We propose a new and effective framework for multiple pieces of scattered evidence to generate Multi-hop QG, to do context encoding in mul- factual-coherent questions. tiple hops(steps) with Graph Convolutional Previous works mainly focus on single-hop QG, Network (GCN). which use neural network based approaches with • We show the effectiveness of our method on the sequence-to-sequence (Seq2Seq) framework. both automatic evaluation and human evalu- Different architectures of encoder and decoder have ation, and we make the first step to evaluate been designed (Nema et al., 2019; Zhao et al., 2018) the model performance in multi-hop aspect. to incorporate the information of answer and con- text to do single-hop reasoning. To the best of 2 Methodology our knowledge, none of the previous works ad- dress the two challenges we mentioned above for The intuition is drawn from human’s multi-hop multi-hop QG task. The only work on multi-hop question generation process (Davey and McBride, QG (Chauhan et al., 2020) uses multi-task learning 1986). Firstly, given the answer and context, we with an auxiliary loss for sentence-level supporting skim to establish a general understanding of the fact prediction, requiring supporting fact sentences texts. Then, we find the mentions of entities in in different paragraphs being labeled in the training or correlated to the answer from the context, and data. While labeling those supporting facts requires analyse nearby sentences to extract useful evidence. heavy human labor and is time-consuming, their Besides, we may also search for linked information method cannot be applied to general multi-hop QG in other paragraphs to gain a further understanding cases without supporting facts. of the entities. Finally, we coherently fuse our knowledge learned from the previous steps and In this paper, we propose a novel architecture start to generate questions. named Multi-Hop Encoding Fusion Network for Question Generation (MulQG) to address the afore- To mimic this process, we develop our MulQG mentioned challenges for multi-hop QG. First of framework. The encoding stage is achieved by a all, it extends the Seq2Seq QG framework from novel Multi-hop Encoder. At the decoding stage, sing-hop to multi-hop for context encoding. Ad- we use maxout pointer decoder as proposed in Zhao ditionally, it leverages a Graph Convolutional Net- et al.(2018). The overview of the framework is work (GCN) on an answer-aware dynamic entity shown in Figure1. graph, which is constructed from entity mentions in answer and input paragraphs, to aggregate the po- 2.1 Multi-hop Encoder tential evidence related to the questions. Moreover, Our Multi-hop Encoder includes three modules: we use different attention mechanisms to imitate (1) Answer-aware context encoder (2) GCN-based the reasoning procedures of human beings in multi- entity-aware answer encoder (3) Gated encoder hop generation process, the details are explained in reasoning layer. Section2. The context and answer are split into word-level We conduct the experiments on the multi-hop tokens and denoted as c = fc1; c2; :::; cng and QA dataset HotpotQA (Yang et al., 2018) with a = fa1; a2; :::; amg, respectively. Each word our model and the baselines. The proposed model is represented by the pre-trained GloVe embed- outperforms the baselines with a significant im- ding (Pennington et al., 2014). Furthermore, for the provement on automatic evaluation results, such as words in context, we also append the answer tag- BLEU (Papineni et al., 2002). The human evalua- ging embeddings as described in Zhao et al.(2018). tion results further validate that our proposed model The context and answer embeddings are fed into is more likely to generate multi-hop questions with two bidirectional LSTM-RNNs separately to obtain Fluency Answerability d×n high quality in terms of , and their initial contextual representations C0 2 R Completeness d×m scores. and A0 2 R , in which d is the hidden state Our contributions are summarized as follows: dimension in LSTM. 4637 Multi-hop Encoder Decoder Encoder Reasoning Gate Answer-aware Context Encoder Attention Layer Entity-aware Answer Encoder residual Bi-Attn Maxout Pointer Generator Entity Graph Answer-aware Context Encoder LSTM LSTM LSTM Encoder Encoder Decoder Word Embeddings + Answer Tag Answer Context Figure 1: Overview of our MulQG framework. In the encoding stage, we pass the initial context encoding C0 and answer encoding A0 to the Answer-aware Context Encoder to obtain the first context encoding C1, then C1 and A0 will be used to update a multi-hop answer encoding A1 via the GCN-based Entity-aware Answer Encoder, and y we use A1 and C1 back to the Answer-aware Context Encoder to obtain C2. The final context encoding Cfinal are obtained from the Encoder Reasoning Gate which operates over C1 and C2, and will be used in the max-out based decoding stage. mask d×n C1 = BiLSTM([C~1; C0]) 2 R (6) GCN Firstly, we compute an alignment matrix S (Eq.1), and normalize it column-wise and row- wise to get two attention matrices S0 (Eq.2) and S00 ... (Eq.3). S0 represents the relevance of each answer token over the context, and S00 represents the rele- vance of each context token over the answer.
Recommended publications
  • Machine Learning Is Driving an Innovation Wave in Saas Software
    Machine Learning is Driving an Innovation Wave in SaaS Software By Jeff Houston, CFA [email protected] | 512.364.2258 SaaS software vendors are enhancing their soluti ons with machine learning (ML) algorithms. ML is the ability of a computer to either automate or recommend appropriate actions by applying probability to data with a feedback loop that enables learning. We view ML as a subset of artificial intelligence (AI), which represents a broad collection of tools that include/leverage ML, such as natural language processing (NLP), self-driving cars, and robotics as well as some that are tangent to ML, such as logical rule-based algorithms. Although there have been multiple “AI Winters” since the 1950s where hype cycles were followed by a dearth of funding, ML is now an enduring innovation driver, in our opinion, due in part to the increasing ubiquity of affordable cloud-based processing power, data storage—we expect the pace of breakthroughs to accelerate. automation, customer support, marketing, and human resources) through both organic and M&A These innovations will make it easier for companies initiatives. As for vertical-specific solutions, we think to benefit beyond what is available with traditional that a new crop of vendors will embrace ML and business intelligence (BI) solutions, like IBM/ achieve billion-dollar valuations. VCs are making big Cognos and Tableau, that simply describe what bets that these hypotheses will come to fruition as happened in the past. New ML solutions are demonstrated by the $5B they poured into 550 amplifying intelligence while enabling consistent and startups using AI as a core part of their solution in better-informed reasoning, behaving as thought 20162.
    [Show full text]
  • Market Update January 2020 a Machine Learning & Artificial Intelligence Market Update
    PRIVATE & CONFIDENTIAL Market Update January 2020 A Machine Learning & Artificial Intelligence Market Update B Canaccord Genuity Overview / Update Artificial Intelligence Machine Learning f (x) Deep Learning Page 1 Driven by your success. Machine Learning (“ML”) and Artificial Intelligence (“AI”) continue to generate strong levels of attention and excitement in the marketplace, based on the promise of self-correcting algorithms driving increased intelligence and automation across a number of mission critical applications and use cases When ML & AI were first introduced as concepts that would impact the IT landscape, most companies in the sector were limited to collections of data scientists or technologies in search of use cases – today there are defined categories emerging and companies with real traction in ML/AI, as well as a growing set of tangible use cases While there is real innovation and traction occurring in ML/AI, in some cases it is still difficult to understand where certain companies truly play in the ML ecosystem and the unique value that each brings to the table – this presentation aims to provide a framework to understand the ML landscape First we look to define and better understand ML/AI technology, both the underlying algorithms as well as data science platforms, operational frameworks and advanced analytics solutions which leverage and / or optimize core ML/AI technologies – these are also described as “AI Infrastructure” From a category perspective, we focus primarily on horizontal platforms which can provide data
    [Show full text]
  • Shikhar-Cv.Pdf
    SHIKHAR SHARMA [email protected] Senior Research SDE, Microsoft Research http://www.shikharsharma.com/ RESEARCH INTERESTS • Deep Learning • Machine Learning • Recurrent Neural Networks • Adversarial Learning • Computer Vision • Task-oriented Dialogue RESEARCH STATEMENT SUMMARY My long-term research interest lies in advancing the capabilities of current artificial intelligence systems. I am highly interested in neural network architectures augmented with memory and attention mechanisms. My recent research deals with image synthesis and visually-grounded multi-modal dialogue systems, which stand to gain significant advantage from using memory and attention similar to how we as humans do. Prior to this, I have worked on visual attention models for action recognition and video description. I have broad interests in deep learning research in natural language processing and computer vision, which I believe are important research areas on the path to true artificial intelligence. EDUCATION University of Toronto Aug'14 - Feb'16 M.Sc. in Computer Science with the Deep Learning and Machine Learning group GPA: 3.83/4 • Thesis title: Action Recognition and Video Description using Visual Attention • Thesis supervisor: Prof. Ruslan Salakhutdinov Indian Institute of Technology (IIT), Kanpur Jul'10 - Jun'14 B.Tech. in Computer Science GPA: 9.3/10 • Thesis title: Speech Recognition using Deep Belief Networks and Hidden Markov Models • Thesis supervisor: Prof. Harish Karnick PUBLICATIONS yequal contribution Theses - Shikhar Sharma. 2016. \Action Recognition and Video Description using Visual Attention." Masters Thesis, University of Toronto Journal papers - Ashesh Jain, Shikhar Sharma, Thorsten Joachims, Ashutosh Saxena. 2015. \Learning preferences for ma- nipulation tasks from online coactive feedback." The International Journal of Robotics Research (IJRR), 34, 1296-1313 Conference and Workshop papers - Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W.
    [Show full text]
  • Profiles in Innovation: Artificial Intelligence
    EQUITY RESEARCH | November 14, 2016 Artificial intelligence is the apex technology of the information era. In the latest in our Profiles in Innovation Heath P. Terry, CFA series, we examine how (212) 357-1849 advances in machine [email protected] learning and deep learning Goldman, Sachs & Co. have combined with more Jesse Hulsing powerful computing and an (415) 249-7464 ever-expanding pool of data [email protected] to bring AI within reach for Goldman, Sachs & Co. companies across Mark Grant industries. The development (212) 357-4475 [email protected] of AI-as-a-service has the Goldman, Sachs & Co. potential to open new markets and disrupt the Daniel Powell (917) 343-4120 playing field in cloud [email protected] computing. We believe the Goldman, Sachs & Co. ability to leverage AI will Piyush Mubayi become a defining attribute (852) 2978-1677 of competitive advantage [email protected] for companies in coming Goldman Sachs (Asia) L.L.C. years and will usher in a Waqar Syed resurgence in productivity. (212) 357-1804 [email protected] Goldman, Sachs & Co. PROFILESIN INNOVATION Artificial Intelligence AI, Machine Learning and Data Fuel the Future of Productivity Goldman Sachs does and seeks to do business with companies covered in its research reports. As a result, investors should be aware that the firm may have a conflict of interest that could affect the objectivity of this report. Investors should consider this report as only a single factor in making their investment decision. For Reg AC certification and other important disclosures, see the Disclosure Appendix, or go to www.gs.com/research/hedge.html.
    [Show full text]
  • Microsoft Corporation
    A Progressive Digital Media business COMPANY PROFILE Microsoft Corporation REFERENCE CODE: 8ABE78BB-0732-4ACA-A41D-3012EBB1334D PUBLICATION DATE: 25 Jul 2017 www.marketline.com COPYRIGHT MARKETLINE. THIS CONTENT IS A LICENSED PRODUCT AND IS NOT TO BE PHOTOCOPIED OR DISTRIBUTED Microsoft Corporation TABLE OF CONTENTS TABLE OF CONTENTS Company Overview ........................................................................................................3 Key Facts.........................................................................................................................3 Business Description .....................................................................................................4 History .............................................................................................................................5 Key Employees .............................................................................................................26 Key Employee Biographies .........................................................................................28 Major Products & Services ..........................................................................................35 SWOT Analysis .............................................................................................................36 Top Competitors ...........................................................................................................44 Company View ..............................................................................................................45
    [Show full text]
  • 2. Ai in Enterprise Software
    1. CONTENTS 1. INDUSTRY SNAPSHOT ...................................................................................................................... 3 Introduction .......................................................................................................................................... 3 Market scope ....................................................................................................................................... 4 Geographical analysis of the AI landscape.......................................................................................... 5 VC Funding Trends .............................................................................................................................. 6 Growth drivers influencing the AI demand ........................................................................................... 8 Key AI vendors ..................................................................................................................................... 9 The AI application ecosystem ............................................................................................................ 10 Challenges ......................................................................................................................................... 11 2. AI IN ENTERPRISE SOFTWARE ...................................................................................................... 12 Introduction .......................................................................................................................................
    [Show full text]
  • Proceedings of the Second Storytelling Workshop, Pages 1–10 Florence, Italy, August 1, 2019
    ACL 2019 Storytelling Proceedings of the Second Workshop August 1, 2019 Florence, Italy c 2019 The Association for Computational Linguistics Order copies of this and other ACL proceedings from: Association for Computational Linguistics (ACL) 209 N. Eighth Street Stroudsburg, PA 18360 USA Tel: +1-570-476-8006 Fax: +1-570-476-0860 [email protected] ISBN 978-1-950737-44-4 ii Introduction Welcome to the second *ACL workshop on Storytelling! Human storytelling has existed for as far back as we can trace, and is thought to be fundamental to being human. Stories help share ideas, histories, and common ground. This workshop examines what storytelling is, its structure and components, and how it is expressed, with respect to state of the art in language and computation. Part of grounding artificial intelligence work in human experience can involve the generation, understanding, and sharing of stories. This workshop highlights the diverse work being done in storytelling and AI across different fields. Papers at this workshop are multi-disciplinary, including work on neural and linguistic approaches to understanding and generating stories in narrative texts, social media, and visual narratives. The second workshop of Storytelling received 22 submissions. We accepted 14 submissions into the proceedings, 6 of which were presented as oral presentations, and 8 as posters. We accepted 3 of the submitted papers as non-archival works to be presented during the poster session. We are pleased to host an invited talk from Melissa Roemmele. We hope you enjoy the workshop! The Storytelling Workshop Organizers iii Organizers: Francis Ferraro, University of Maryland Baltimore County Ting-Hao (Kenneth) Huang, Pennsylvania State University Stephanie M.
    [Show full text]
  • Mergers in the Digital Economy
    2020/01 DP Axel Gautier and Joe Lamesch Mergers in the digital economy CORE Voie du Roman Pays 34, L1.03.01 B-1348 Louvain-la-Neuve Tel (32 10) 47 43 04 Email: [email protected] https://uclouvain.be/en/research-institutes/ lidam/core/discussion-papers.html Mergers in the Digital Economy∗ Axel Gautier y& Joe Lamesch z January 13, 2020 Abstract Over the period 2015-2017, the five giant technologically leading firms, Google, Amazon, Facebook, Amazon and Microsoft (GAFAM) acquired 175 companies, from small start-ups to billion dollar deals. By investigating this intense M&A, this paper ambitions a better understanding of the Big Five's strategies. To do so, we identify 6 different user groups gravitating around these multi-sided companies along with each company's most important market segments. We then track their mergers and acquisitions and match them with the segments. This exercise shows that these five firms use M&A activity mostly to strengthen their core market segments but rarely to expand their activities into new ones. Furthermore, most of the acquired products are shut down post acquisition, which suggests that GAFAM mainly acquire firm’s assets (functionality, technology, talent or IP) to integrate them in their ecosystem rather than the products and users themselves. For these tech giants, therefore, acquisition appears to be a substitute for in-house R&D. Finally, from our check for possible "killer acquisitions", it appears that just a single one in our sample could potentially be qualified as such. Keywords: Mergers, GAFAM, platform, digital markets, competition policy, killer acquisition JEL Codes: D43, K21, L40, L86, G34 ∗The authors would like to thank M.
    [Show full text]
  • The Democratization of Artificial Intelligence
    Deep Learning’s Governmentality The Other Black Box Jonathan Roberge/Kevin Morin/Marius Senneville Introduction Frank Pasquale’s 2016 book The Black Box Society is now considered a landmark study in law-related disciplines, in the social sciences and beyond. The topic in itself, digitalization and the invasiveness of the internet, is of the utmost impor- tance. The book reveals the inversion of operational secrecy by digital platforms and the extensive access to users’ private data. Facebook, Google and the like collect and aggregate bits and bytes of information to create massive profiling schemes, the modus operandi of which little is known. Problematically, as private actors, they acquire massive data and “knowledge” about the society and our in- dividual behaviors that we don’t. To that complex and most certainly unpleasant reality, Pasquale’s analytical rigor and finesse contribute valuable insights on po- tential regulations and on the possibility of developing a smarter citizenry. How- ever, we want to argue that there is another, broader and maybe more cultural reason why The Black Box Society draw so much attention—thus further explaining the success of the book. The image of a concealed, networked entity is evocative of some common fears. It captures a sense of “loss of control” vis-à-vis the latest au- tomation processes. Such an algorithmic black box, in other words, taps into a dif- fused anxiety regarding what is to be called a “known-unknown”, i.e. something we recognize to be a mostly hidden form of knowledge production. The image of a black box is a disenchanted one; here lies its strength as well as its weakness.
    [Show full text]
  • GEM: a General Evaluation Benchmark for Multimodal Tasks
    GEM:AGeneralEvaluation Benchmark forMultimodal Tasks Lin Su1 Nan Duan2 Edward Cui1 Lei Ji2 Chenfei Wu2 Huaishao Luo3 Yongfei Liu4 Ming Zhong1 Taroon Bharti1 Arun Sacheti5 1 Bing Multimedia Team, Microsoft, China 2 Natural Language Computing, Microsoft Research Asia, China 3 Southwest Jiaotong University, China 4 ShanghaiTech University, China 5 Bing Multimedia Team, Microsoft, United States flins, nanduan, edwac, leiji, chewu, minzhon, tbharti, [email protected] [email protected], [email protected] Abstract to vision-language scenarios (Lu et al., 2019; Chen et al., 2019; Li et al., 2020a,b; Ni et al., 2021; Sun 1 In this paper, we present GEM as a General et al., 2019b,a; Luo et al., 2020) to handle multi- Evaluation benchmark for Multimodal tasks. modal tasks such as image(or video)-text retrieval Different from existing datasets such as GLUE (Wang et al., 2018), SuperGLUE (Wang et al., and image (or video) captioning. However, there is 2019), XGLUE (Liang et al., 2020) and still no comprehensive benchmark dataset for evalu- XTREME (Hu et al., 2020) that mainly fo- ating such multimodal pre-trained models. Besides, cus on natural language tasks, GEM is a large- most existing vision-language datasets are labeled scale vision-language benchmark, which con- in English only, which cannot be used to evaluate sists of GEM-I for image-language tasks and the qualities of such models on other languages. GEM-V for video-language tasks. Compar- Motivated by this, we present GEM, a General ing with existing multimodal datasets such as E M MSCOCO (Chen et al., 2015) and Flicker30K valuation benchmark for ultimodal tasks.
    [Show full text]
  • World Tech M&A Report 2019
    WORLD TECH M&A REPORT 2019 www.corumgroup.com 2019 – FEW LIMITS ON TECH M&A We’ve reached an age LUMINARY PANEL where distance, size Our thanks to our luminary panelists who took part in “Forecast 2019: and technology have Global Tech M&A.” For more of their thoughts on the technology few limits, fueling a landscape, view the original webcasts at corumgroup.com. record pace in tech M&A. A major private “When we try to go forward from where we are equity firm recently now, instead of working backwards from the future, told us that they never we fall into traps of incremental improvement, and imagined that they of doing an old thing better until after it’s become would go to the other the wrong thing entirely.” side of the world to make an acquisition for a platform company, but PETER COFFEE, SALESFORCE they did! Even the smallest niche companies are “Technologies are tools, not an end goal. The end getting interest—with small company specialist goal is to use the tools to solve the problems and add Constellation the leading strategic buyer. Even value to others. Technology trends reflect the market seemingly mundane applications are being demand and supply for such tools. What drives acquired—Corum added Blue Collar software market demand, though, are real-world problems.” as a Top Ten Disruptive Trend. HENRY HU, IBM Private Equity continues to dominate deal volume due to bolt-ons to their platform “Designing for a global market is challenging. One technology companies, with Vista Equity leading approach is to use a spotted cow strategy: design for the financial buyers.
    [Show full text]
  • Cesifo Working Paper No. 8056
    8056 2020 January 2020 Mergers in the Digital Economy Axel Gautier, Joe Lamesch Impressum: CESifo Working Papers ISSN 2364-1428 (electronic version) Publisher and distributor: Munich Society for the Promotion of Economic Research - CESifo GmbH The international platform of Ludwigs-Maximilians University’s Center for Economic Studies and the ifo Institute Poschingerstr. 5, 81679 Munich, Germany Telephone +49 (0)89 2180-2740, Telefax +49 (0)89 2180-17845, email [email protected] Editor: Clemens Fuest www.cesifo-group.org/wp An electronic version of the paper may be downloaded · from the SSRN website: www.SSRN.com · from the RePEc website: www.RePEc.org · from the CESifo website: www.CESifo-group.org/wp CESifo Working Paper No. 8056 Mergers in the Digital Economy Abstract Over the period 2015-2017, the five giant technologically leading firms, Google, Amazon, Facebook, Amazon and Microsoft (GAFAM) acquired 175 companies, from small start-ups to billion dollar deals. By investigating this intense M&A, this paper ambitions a better understanding of the Big Five’s strategies. To do so, we identify 6 different user groups gravitating around these multi-sided companies along with each company’s most important market segments. We then track their mergers and acquisitions and match them with the segments. This exercise shows that these five firms use M&A activity mostly to strengthen their core market segments but rarely to expand their activities into new ones. Furthermore, most of the acquired products are shut down post acquisition, which suggests that GAFAM mainly acquire firm’s assets (functionality, technology, talent or IP) to integrate them in their ecosystem rather than the products and users themselves.
    [Show full text]