Interpretable Machine Learning for Natural Language Generation

Total Page:16

File Type:pdf, Size:1020Kb

Interpretable Machine Learning for Natural Language Generation NeurIPS 2020 Meetup Beijing Controllable and Interpretable Machine Learning for Natural Language Generation Lei Li ByteDance AI Lab 12/6/2020 Revolution in Information Creation and Sharing • New media platforms • Tremendous improvement in the efficiency and quality of content creation • Massive distribution of personalized information 2 AI for Information Creation and Sharing Information Creation & AI Sharing Technology 3 AI for Information Creation and Sharing Information Creation & AI Sharing Technology Automated Natural Lang. news writing Generation Sharing Content Machine Globally Translation Filtering Classification/ Misinformation Graph Neural Nets/ GANs 4 Why is NLG important? Machine Writing Question Answering ChatBOT Machine Translation 5 Machine Translation 6 AI to Improve Writing Text generation to rescue! Gmail smart compose, smart reply. 7 8 Automated News Writing Xiaomingbot is deployed and constantly producing news on social media platforms (Toutiao & TopBuzz). 9 human written GPT3, edited by human 10 A New Working Style for Authors Human-AI Co-authoring 11 Outline 1. Motivation and Basics 2. Deep Latent Variable Models 3. Multimodal machine writing: show case 4. Summary 12 Modeling a Sequence The quick brown fox jumps over the lazy dog . x = ( x1 , x2 , x3 , x4, x5 , x6 , x7, x8, x9, x10) The central problem of language modeling is to find the joint probability distribution: pθ(x) = pθ(x1, ⋯, xL) There are many ways to represent and learn the joint probability model. 13 DGM Taxonomy pθ(x) ⟷ pdata(x) Maximum Likelihood Estimation Adversarial Learning GAN Explicit Density Implicit Density GSN Tractable Density Intractable Density Energy-based Auto- Markov Parallel Latent Conditional Constrained Regressive Factorization Factorization Variable Model EBM PM Factorization RNN, LSTM Markov GLAT VAE CGMH Transformer Transformer NAT VTM MHA TSMH Transformer Output Softmax Multi-Head Scaled Dot-Product Attention Linear Linear MatMul Add & Norm Concat Feed Forward SoftMax Add & Norm Scaled Dot-Product Mask (opt.) Add & Norm h Multi-Head Attention Attention Feed Forward Scale Add & Norm LinearLinear LinearLinear LinearLinear Add & Norm MatMul Masked Multi-Head Multi-Head Q K V Attention Attention Q K V Input Output Embedding Embedding Inputs Outputs Figure 1: The Transformer - model architecture.Figure 1: The Transformer - model architecture. 15 Decoder: The decoder is also composedDecoder: of a stackThe of decoderN =6identical is also composed layers. In of addition a stack to of theN =6 two identical layers. In addition to the two sub-layers in each encoder layer, the decodersub-layers inserts in each a third encoder sub-layer, layer, whichthe decoder performs inserts multi-head a third sub-layer, which performs multi-head attention over the output of the encoderattention stack. Similar over the to outputthe encoder, of the we encoder employ stack. residual Similar connections to the encoder, we employ residual connections around each of the sub-layers, followedaround by layer each normalization. of the sub-layers, We followed also modify by layer the self-attention normalization. We also modify the self-attention sub-layer in the decoder stack to preventsub-layer positions in the from decoder attending stack to to subsequent prevent positions positions. from This attending to subsequent positions. This masking, combined with fact that the outputmasking, embeddings combined are with offset fact bythat one the position, output embeddings ensures that are the offset by one position, ensures that the predictions for position i can depend onlypredictions on the known for position outputsi can at positions depend only less on than thei. known outputs at positions less than i. 3.2 Attention 3.2 Attention An attention function can be describedAn as attention mapping function a query and can a be set described of key-value as mapping pairs to a an query output, and a set of key-value pairs to an output, where the query, keys, values, and outputwhere are the all query,vectors. keys, The values, output andis computed output are as all a weighted vectors. The sum output is computed as a weighted sum of the values, where the weight assignedof tothe each values, value where is computed the weight by assigned a compatibility to each function value is ofcomputed the by a compatibility function of the query with the corresponding key. query with the corresponding key. 3.2.1 Scaled Dot-Product Attention3.2.1 Scaled Dot-Product Attention We call our particular attention "ScaledWe Dot-Product call our particular Attention" attention (Figure "Scaled 2). The Dot-Product input consists Attention" of (Figure 2). The input consists of queries and keys of dimension dk, andqueries values and of dimension keys of dimensiondv. We computedk, and thevalues dot of products dimension of thedv. We compute the dot products of the query with all keys, divide each by pdqueryk, and with apply all a keys, softmax divide function each by top obtaindk, and the apply weights a softmax on the function to obtain the weights on the values. values. 3 3 Deep Latent Variable Models for Text • Disentangled Representation Learning for Text Generation [ICLR 20b, ACL 19c] • Interpretable Deep Latent Representation from Raw Text [ICML 20] • Mirror Generative Model for Neural Machine Translation [ICLR 20a] 16 Natural Language Descriptions name Sukiyaki eatType pub Sukiyaki is a Japanese food Japanese restaurant. It is a pub and it has a price average average cost and rating good good rating. It is area seattle based in seattle. 17 Data to Text Generation Data Table Sentence <key, value> Medical The blood pressure is higher than Reports normal and may expose to the risk of hypertension Style long dress Made of poplin, this long dress has Painting bamboo ink Fashion Product an ink painting of bamboo and Texture poplin Description feels fresh and smooth. Feel smooth Sia Kate Isobelle Furler (born Name: Sia Kate Isobelle Furler 18 December 1975) is an DoB: 12/18/1975 Person Australian singer, songwriter, Nationality: Australia Biography Occupation: Singer, voice actress and music Songwriter video director. 18 [1] The E2E Dataset: New Challenges For End-to-End Generation. https://github.com/tuetschek/e2e-dataset [2] Can Neural Generators for Dialogue Learn Sentence Planning and Discourse Structuring?. https://nlds.soe.ucsc.edu/sentence-planning- NLG Previous Idea: Templates [name] is a [food] restaurant. It is a [eatType] and it has a [price] cost and [rating] Sukiyaki is a Japanese rating. It is in [area]. restaurant. It is a pub and it has a name Sukiyaki average cost and eatType pub good rating. It is in food Japanese seattle. price average But manually creation of rating good templates are tedious area seattle 19 Our Motivation for Variational Template Machine Motivation 1: Continuous and Template disentangled representation for Sentence template and content Content Table Motivation 2: Incorporate raw text corpus to learn good q (template, Raw text content | representation. sentence) 20 VTM [R. Ye, W. Shi, H. Zhou, Z. Wei, Lei Li, ICLR20b] Variational Template Machine Input: triples of <field_name, table position, value> data x f p v K template {xk, xk , xk }k=1 variable 1. p(c|x) ∼ Neural Net content k k k variable c z maxpool(tanh(W ⋅ [xf , xp, xv ] + b)) 2. Sample , e.g. z ∼ p0(z) Gaussian text y 3. Decode y from [c, z] using another NN (e.g. Transformer) 21 VTM [R. Ye, W. Shi, H. Zhou, Z. Wei, Lei Li, ICLR20b] Learning with Raw Corpus • Semi-supervised learning: “Back-translate” corpus to obtain pseudo-parallel pairs <table, text>, to enrich the learning Table Text name Sukiyaki eatType pub Sukiyaki is a Japanese restaurant. It is food Japanese price average a pub and it has a average cost and rating good good rating. It is in seattle. area seattle ? Known for its creative flavours, Holycrab's signatures are the Hokkien crab. q(<c,z>|y) 22 VTM Produces High-quality and Diverse Text Ideal WIKI Ideal SPNLG 0.3 0.5 VTM T2S-beam T2S-beam 0.256 0.41 VTM T2S- T2S- 0.212 pretrain 0.32 pretrain BLEU 0.23 BLEU 0.168 0.14 0.124 Temp-KN 0.05 Temp-KN 0.08 0.3 0.48 0.66 0.84 1.02 0.6 0.705 0.81 0.915 1.02 Self-BLEU Self-BLEU VTM uses beam-search decoding. 23 VTM [Ye, …, Lei Li, ICLR20b] VTM Generates Diverse Text Input Data Table Generated Text 1: John Ryder (8 August 1889 – 4 April 1977) was an Australian cricketer. 2: Jack Ryder (born August 9, 1889 in Victoria, Australia) was an Australian cricketer. 3: John Ryder, also known as the king of Collingwood (8 August 1889 – 4 April 1977) was an Australian cricketer. 24 Learning Disentangled Representation of Syntax and Semantics DSSVAE enables learning and syntactic semantic transferring sentence-writing styles style content Syntax provider Semantic content zsyn zsem There is an apple The dog is on the table behind the door DSSVAE x sentence There is a dog behind the door 25 DSS-VAE [Y. Bao, H. Zhou, S. Huang, Lei Li, L. Mou, O. Vechtomova, X. Dai, J. Chen, ACL19c] Interpretable Text Generation Latent structure dialog actions GENERATOR Sampling “Remind me about x1 x2 x3 the football game.” [action=remind] “Will it be overcast tomorrow?” [action=request] x0 x1 x2 …… Generate Sentences with interpretable factors 26 How to Interpret Latent Variables in VAEs? Variational Auto-encoder (VAE) interpretable structure z x (Kingma & Welling, 2013) z: difficult to continuou interpret s latent variables discrete factors Will it be humid in New York today? Remind me about my meeting. 27 Discrete Variables Could Enhance Interpretability - but one has to do it right! Gaussian Mixture Variational Auto- encoder (GM-VAE) interpretable structure c z x (Dilokthanakul et al., 2016; Jiang et al., 2017) : discrete c Why? mode- component How to fix it? collapse z: continuous Will it be overcast Remind me latent variable tomorrow? about the football game.
Recommended publications
  • China in 50 Dishes
    C H I N A I N 5 0 D I S H E S CHINA IN 50 DISHES Brought to you by CHINA IN 50 DISHES A 5,000 year-old food culture To declare a love of ‘Chinese food’ is a bit like remarking Chinese food Imported spices are generously used in the western areas you enjoy European cuisine. What does the latter mean? It experts have of Xinjiang and Gansu that sit on China’s ancient trade encompasses the pickle and rye diet of Scandinavia, the identified four routes with Europe, while yak fat and iron-rich offal are sauce-driven indulgences of French cuisine, the pastas of main schools of favoured by the nomadic farmers facing harsh climes on Italy, the pork heavy dishes of Bavaria as well as Irish stew Chinese cooking the Tibetan plains. and Spanish paella. Chinese cuisine is every bit as diverse termed the Four For a more handy simplification, Chinese food experts as the list above. “Great” Cuisines have identified four main schools of Chinese cooking of China – China, with its 1.4 billion people, has a topography as termed the Four “Great” Cuisines of China. They are Shandong, varied as the entire European continent and a comparable delineated by geographical location and comprise Sichuan, Jiangsu geographical scale. Its provinces and other administrative and Cantonese Shandong cuisine or lu cai , to represent northern cooking areas (together totalling more than 30) rival the European styles; Sichuan cuisine or chuan cai for the western Union’s membership in numerical terms. regions; Huaiyang cuisine to represent China’s eastern China’s current ‘continental’ scale was slowly pieced coast; and Cantonese cuisine or yue cai to represent the together through more than 5,000 years of feudal culinary traditions of the south.
    [Show full text]
  • Joint Civil Society Report Submitted to the Committee on the Elimination of Racial Discrimination
    Joint Civil Society Report Submitted to The Committee on the Elimination of Racial Discrimination for its Review at the 96th Session of the combined fourteenth to seventeenth periodic report of the People’s Republic of China (CERD/C/CHN/14-17) on its Implementation of the Convention on the Elimination of All Forms of Racial Discrimination Submitters: Network of Chinese Human Rights Defenders (CHRD) is a coalition of Chinese and international human rights non-governmental organizations. The network is dedicated to the promotion of human rights through peaceful efforts to push for democratic and rule of law reforms and to strengthen grassroots activism in China. [email protected] https://www.nchrd.org/ Equal Rights Initiative is a China-based NGO monitoring rights development in Western China. For the protection and security of its staff, specific identification information has been withheld. Date of Submission: July 16, 2018 Table of Contents I. Executive Summary Paras. 1-2 II. Recommendations Para. 3 III. Thematic Issues & Findings A. Legislation underpinning discriminatory counter-terrorism policies Paras. 4-7 [Articles 2 (c) and 4; List of Themes para. 8] B. Militarized policing, invasive surveillance, and constant monitoring Paras. 8-21 [Articles 3, 4, and 5 (a-b); LOT para. 22] C. Extrajudicial detention, forced disappearances, torture, and other abuses Paras. 22-28 in “Re-education” camps [Article 5 (a)(b)(d); LOT para. 21] D. Counter-terrorism used to justify arbitrary detention and discriminatory Paras. 29-34 punishment of ethnic minorities [Articles 4 and 5 (a)(b)(d); LOT paras. 6 and 21] E. Discrimination and restrictions on religious freedom Paras.
    [Show full text]
  • The Rural Video Influencers in China: on the New Edge of Urbanization
    THE RURAL VIDEO INFLUENCERS IN CHINA: ON THE NEW EDGE OF URBANIZATION A Thesis Presented to the Faculty of the Graduate School of Cornell University In Partial Fulfillment of the Requirements for the Degree of Master of Arts by Xinwen Zhang May 2020 © 2020 Xinwen Zhang ABSTRACT On the new media platforms in China, especially the video platforms, some rural content has become quite influential, which is, to some extent, inconsistent with people's impression of the vulnerable position of rural areas the urban-rural inequality. This thesis studied some of the most popular the rural content producers with the close reading of their videos, explaining how they frame themselves and their artworks and what kind of "rurality" is performed to the audiences. While the audience might assign them as rural figures, these people were on the edge of the urban and the rural as they had a shared history as some people who were from the rural areas, spent a period of their lives as migrant workers, and finally became video influencers performing some kind of rural lifestyle. BIOGRAPHICAL SKETCH Xinwen Zhang was born in Shenyang, China, and spent the first 18 years of her life there. Then, she went to Sun Yat-sen University and entered Boya college, which sets no determined major and emphasizes on text close reading to explore potential for students. She finally chose anthropology as the major and researched the breakfast stalls in New Phoenix Village. She went to the village and had close contact with the migrant workers running breakfast stalls and people at breakfast, mainly focusing on the population identification, identity cognition, and community construction of the migrant population as well as their dilemma.
    [Show full text]
  • Analysis of Bytedance with a Close Look on Douyin / Tiktok ——————————————————
    —————————————————— Analysis of ByteDance with a close look on Douyin / TikTok —————————————————— Author: Xiaoye SHI Supervisor: Prof. Dr. Didier SORNETTE A thesis submitted to the Department of Management, Technology and Economics (D-MTEC) of the Swiss Federal Institute of Technology Zurich (ETHZ) in partial fulfilment of the requirements for the degree of Master of Science in Management, Technology and Economics June 2019 Acknowledgements I would like to thank my supervisor Prof. Dr. Didier Sornette, who has kindly given me countless guidance and advice throughout the Master thesis. I appreciate the opportunity to work on this interesting topic under the Chair of Entrepreneurial Risks and is deeply grateful for all the help received along the process. I also want to express my deepest gratitude to my parents, who have been supportive and encouraging under all circumstances. Without them, I would not be able to become the person I am today. - 2 - Abstract In 2018, ByteDance, a young Internet company with only 6 years of history, broke out on various news headlines as the highest valued unicorn. With the acquisitions of musical.ly and Flipgram, the company’s flagship product Douyin strikes to develop its global presence under the name TikTok. This thesis analyzed Douyin’s historical growth and revenue model. As a main revenue driver, future user growth is predicted and calibrated by extending the methodology proposed in earlier studies by Cauwels and Sornette. We considered three growth scenarios – base, high and extreme, and estimated Douyin as well as ByteDance’s value based on comparable company analysis. ByteDance’s key performance metrics and multiples were compared with four other firms in the similar industry, Facebook, Weibo, Momo and iQIYI.
    [Show full text]
  • China (Includes Tibet, Hong Kong, and Macau) 2018 Human Rights Report
    CHINA (INCLUDES TIBET, HONG KONG, AND MACAU) 2018 HUMAN RIGHTS REPORT EXECUTIVE SUMMARY The People’s Republic of China (PRC) is an authoritarian state in which the Chinese Communist Party (CCP) is the paramount authority. CCP members hold almost all top government and security apparatus positions. Ultimate authority rests with the CCP Central Committee’s 25-member Political Bureau (Politburo) and its seven-member Standing Committee. Xi Jinping continued to hold the three most powerful positions as CCP general secretary, state president, and chairman of the Central Military Commission. Civilian authorities maintained control of security forces. During the year the government significantly intensified its campaign of mass detention of members of Muslim minority groups in the Xinjiang Uighur Autonomous Region (Xinjiang). Authorities were reported to have arbitrarily detained 800,000 to possibly more than two million Uighurs, ethnic Kazakhs, and other Muslims in internment camps designed to erase religious and ethnic identities. Government officials claimed the camps were needed to combat terrorism, separatism, and extremism. International media, human rights organizations, and former detainees reported security officials in the camps abused, tortured, and killed some detainees. Human rights issues included arbitrary or unlawful killings by the government; forced disappearances by the government; torture by the government; arbitrary detention by the government; harsh and life-threatening prison and detention conditions; political prisoners;
    [Show full text]
  • China's Propensity for Innovation in the 21St Century
    C O R P O R A T I O N STEVEN W. POPPER, MARJORY S. BLUMENTHAL, EUGENIU HAN, SALE LILLY, LYLE J. MORRIS, CAROLINE S. WAGNER, CHRISTOPHER A. EUSEBI, BRIAN CARLSON, ALICE SHIH China's Propensity for Innovation in the 21st Century Identifying Indicators of Future Outcomes For more information on this publication, visit www.rand.org/t/RRA208-1 Library of Congress Cataloging-in-Publication Data is available for this publication. ISBN: 978-1-9774-0596-8 Published by the RAND Corporation, Santa Monica, Calif. © Copyright 2020 RAND Corporation R® is a registered trademark. Cover: Blackboard/Adobe Stock; RomoloTavani/Getty Images Limited Print and Electronic Distribution Rights This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited. Permission is given to duplicate this document for personal use only, as long as it is unaltered and complete. Permission is required from RAND to reproduce, or reuse in another form, any of its research documents for commercial use. For information on reprint and linking permissions, please visit www.rand.org/pubs/permissions. The RAND Corporation is a research organization that develops solutions to public policy challenges to help make communities throughout the world safer and more secure, healthier and more prosperous. RAND is nonprofit, nonpartisan, and committed to the public interest. RAND’s publications do not necessarily reflect the opinions
    [Show full text]
  • Learning Deep Latent Models for Text Sequences Lei Li Bytedance AI Lab
    2020 Learning Deep Latent Models for Text Sequences Lei Li ByteDance AI Lab 4/29/2020 The Rise of New Media Platforms Toutiao Helo Douyin/Tiktok 2 Huge Demand for Automatic Content Generation Technologies • Automatic News Writing • Author writing assist tools – Title generation and text summarization • Automatic Creative Advertisement Design • Dialog Robots w/ response generation • Translation of content across multiple languages • Story Generation 3 4 Automated News Writing Xiaomingbot is deployed and constantly producing news on social media platforms (TopBuzz & Toutiao). 5 AI to Improve Writing Text generation to rescue! 6 Outline 1. Overview 2. Learning disentangled latent representation for text 3. Mirror-Generative NMT 4. Multimodal machine writing 5. Summary 7 Disentangled Latent Representation for Text VTM [R. Ye, W. Shi, H. Zhou, Z. Wei, Lei Li, ICLR20b] DSS-VAE [Y. Bao, H. Zhou, S. Huang, Lei Li, L. Mou, O. Vechtomova, X. Dai, J. Chen, ACL19c] Natural Language Descriptions name Sukiyaki eatType pub Sukiyaki is a Japanese food Japanese restaurant. It is a pub and it has a price average average cost and rating good good rating. It is area seattle based in seattle. 9 Data to Text Generation Data Table Sentence <key, value> Medical The blood pressure is higher than Reports normal and may expose to the risk of hypertension Style long dress Made of poplin, this long dress has Painting bamboo ink Fashion Product an ink painting of bamboo and Texture poplin Description feels fresh and smooth. Feel smooth Sia Kate Isobelle Furler (born Name: Sia Kate Isobelle Furler 18 December 1975) is an DoB: 12/18/1975 Person Australian singer, songwriter, Nationality: Australia Biography Occupation: Singer, voice actress and music Songwriter video director.
    [Show full text]
  • Social Media
    中国媒体概览 China Media Overview Social Media © 2016 Beyond Summits 3 Summary: • 90s generation has become the main users of Chinese social media. 90s generation prefer to watch internet videos, while 80s and 70s prefer to shop online and read news respectively. • Chinese netizens open and use 6-10 media APPs on average every day. The most frequently used social media platforms are WeChat, QQ and Weibo. When they use social media platforms, they outweigh ‘media’ nature of the platforms to obtain news and information as well as share their lives instead of being ‘social’. • WeChat users are younger and more frequently browse the platform. The users not only acquire common socialization by WeChat, but obtain information and life convenience as well. • Most of the Weibo users are male whose ages are relatively older than that of WeChat. Geographically, the Weibo users concentrate in such areas as Shanghai, Zhejiang Province, Jiangsu Province and the Zhujiang Delta. © 2016 Beyond Summits 4 Summary: • Currently, various APPs developed by such Internet tycoons as Baidu, Alibaba and Tencent are taking leading positions in China’s mobile Internet market. • Mobile Social Platforms: Tencent’s two APPs outperformed their competitive APPs. • Mobile Entertainment Platforms: The top3 mobile video APPs are Tencent, Youku and IQIYI. And the top3 mobile music APPS are KUGOU music, QQ music and KUWO music. • Mobile News Platforms: The top5 APPs are Tencent, TOUTIAO, 163 news, SOHU news and IFENG news. • Mobile Tourism: In this October, a merger between QUNAR and CTRIP produced the greatest mobile tourism APP in China. • Mobile E-commerce: TAOBAO outperformed other APPs.
    [Show full text]
  • Chinese Tech Landscape Overview NSCAI Presentation
    Chinese Tech Landscape Overview NSCAI Presentation epic.org EPIC-19-09-11-NSCAI-FOIA-20200331-3rd-Production-pt9 000534 EP,c-,,,,_,,,_,,,,,, May 2019 "Core tech" vs. "tech enabled" businesses • Being regarded as a core-tech business is glamorous -- everyone wants to believe and talk about their technological capabilities as a moat. But there are few industries where that 's actually the case. o e.g. mass deployment of machine vision for medical diagnosis is not blocked by the tech. o There are relatively few "core tech businesses" that compete in markets where cutting edge technology is the primary axis of competition and barrier to entry (e.g. Intel, Nvidia, Waymo, ) • It is more useful to understand most of these companies as "tech-enabled businesses". o e.g. Facebook, Uber, Linkedin, and Airbnb derive their power from network effects. Amazon's e-commerce platform derives its power from heavy capex. epic.org EPIC-19-09-11-NSCAI-FOIA-20200331-3rd-Production-pt9 000535 EPIC-2019-001-000603 epic.org EPIC-19-09-11-NSCAI-FOIA-20200331-3rd-Production-pt9 000536 WeChat(6 ) 0 BAT (Baidu, Alibaba, Tencent) - The Big 3 Q. Search .. J.:11llll~ fJ ~ You have added ;}:JJ as your WeChat c. • Tencent ($504B Valuation): Social and gaming. Best known for Subscriptions , • -. El 1559]jl1,i,r fft.!!ff\25!11:itfi,P~OIJM8 creating WeChat. Also the largest gaming company in the world. lilLR:::lo &f GGV 996 .,,"'ttH_o+ ,, DJ> (34 mossagos] lW' Cloo nera ara 60% of all mobile time in China is spent on Tencent properties.
    [Show full text]
  • Rise and Fall: Audience Participation and Market Economy in Traditional
    Title Page Rise and Fall: Audience Participation and Market Economy in Traditional Chinese Opera by Yuanziyi Zhang Bachelor of Science, Illinois Wesleyan University, 2018 Submitted to the Graduate Faculty of the Dietrich School of Arts and Sciences in partial fulfillment of the requirements for the degree of Master of Arts University of Pittsburgh 2021 Committee Membership Page UNIVERSITY OF PITTSBURGH DIETRICH SCHOOL OF ARTS AND SCIENCES This thesis was presented by Yuanziyi Zhang It was defended on April 8, 2021 and approved by Dr. Kun Qian, Associate Professor, Department of East Asian Languages and Literatures Dr. Charles Exley, Associate Professor, Department of East Asian Languages and Literatures Dr. Song Shi, Visiting Assistant Professor, School of Computing and Information Thesis Advisor: Dr. Kun Qian, Associate Professor, Department of East Asian Languages and Literatures ii Copyright © by Yuanziyi Zhang 2021 iii Abstract Rise and Fall: Audience Participation and Market Economy in Traditional Chinese Opera Yuanziyi Zhang, MA University of Pittsburgh, 2021 Throughout Chinese history, traditional opera has encountered peaks and valleys. Although the status of the art and performers has been long controversial, the most significant factor that determines the destiny of traditional Chinese opera is actually the vitality of its market. My thesis seeks to tackle the role of market and audience participation in different stages of the development of Peking Opera, delineating the tripartite relationship between political interference, audience participation, and technological development. Within Henry Jenkins’ theoretical framework of media convergence, I investigate collective intelligence, participatory culture, and contemporary fan culture vis-à-vis digital media that has been contextualized in traditional Chinese opera and its economic impact.
    [Show full text]
  • Comparing Chinese Mobile Applications' Data and User Privacy
    INTERNET POLICY REVIEW Journal on internet regulation Volume 9 | Issue 3 Going global: Comparing Chinese mobile applications’ data and user privacy governance at home and abroad Lianrui Jia University of Toronto, Canada, [email protected] Lotus Ruan Citizen Lab, University of Toronto, Canada, [email protected] Published on 16 Sep 2020 | DOI: 10.14763/2020.3.1502 Abstract: We examine and compare data and privacy governance by four China-based mobile applications and their international versions: Baidu, Toutiao and its international version TopBuzz, Douyin and its international version TikTok, and WeChat. Together, these four applications represent popular Chinese apps branching into diverse overseas markets such as Europe, Brazil, North America, and Southeast Asia. We first present an overview of the ownership, functions, business models and strategies of the reviewed apps. To study the app’s interface design, we employ the walkthrough method to examine privacy features during the account registration and deletion stages in app usage. Lastly, we conducted content analysis of the terms of service and privacy policies to establish the app’s data collection, storage, transfer, use, and disclosure measures. Our analysis showed variations across apps and within the Chinese and international-facing versions in their data and privacy governance in app design and policies. Baidu has the most unsatisfactory data and privacy protection measures, while ByteDance’s TikTok/Douyin and TopBuzz/Toutiao offer more comprehensive user protection from different jurisdictions. Moreover, this paper highlights the role of platform owners (e.g., Google and Apple) in gatekeeping mobile app privacy standards and the role of the state in imposing a data protection framework on overseas versions of China-based mobile apps.
    [Show full text]
  • Artificial Intelligence Factory, Data Risk, and Vcs' Mediation: the Case of Bytedance, an AI-Powered Startup
    Journal of Risk and Financial Management Article Artificial Intelligence Factory, Data Risk, and VCs’ Mediation: The Case of ByteDance, an AI-Powered Startup Peiyi Jia 1 and Ciprian Stan 2,* 1 Robert J. Manning School of Business, University of Massachusetts Lowell, One University Avenue, Pulichino-Tong Business Center Suite 228, Lowell, MA 01854, USA; [email protected] 2 Rinker School of Business, Palm Beach Atlantic University, 901 S Flagler Drive, West Palm Beach, FL 33401, USA * Correspondence: [email protected] Abstract: The AI factory is an effective way of managing artificial intelligence (AI) processes, enabling broad AI deployment in a firm. The purpose of this study is to explore the role of the AI factory in an entrepreneurship context. How do AI-powered startups leverage AI to grow, and manage data risks? What is the role of venture capitalists in this process? We answer these research questions by conducting an in-depth study of an AI-powered startup: ByteDance. Our study extends both AI and entrepreneurship literature by showing that AI-powered startups adopt the AI factory approach to optimize scale, scope, and learning. Our discussion also emphasizes the critical role played by venture capitalists in assisting AI-powered startups in building AI factories and in reducing data risk. Keywords: artificial intelligence; scaling growth; AI factory; AI-powered startup; data risk Citation: Jia, Peiyi, and Ciprian Stan. 2021. Artificial Intelligence Factory, 1. Introduction Data Risk, and VCs’ Mediation: The Artificial intelligence can process, analyze, and interpret data much faster and at Case of ByteDance, an AI-Powered a scale unachievable by human capabilities, potentially leading to increased prosperity, Startup.
    [Show full text]