Artificial Intelligence Standardization White Paper (2018 Edition) 2018

Total Page:16

File Type:pdf, Size:1020Kb

Artificial Intelligence Standardization White Paper (2018 Edition) �����������2018 Translation The following government-issued white paper describes China's approach to standards-setting for artificial intelligence. Appendices list all of China's current (as of January 2018) and planned AI standardization protocols, and provide examples of applications of AI by China's leading tech companies. Title Artificial Intelligence Standardization White Paper (2018 Edition) 2018 Author China Electronics Standardization Institute (CESI; ; ) is the "compiling unit" () for this white paper. The 2nd Industrial Department () of the Standardization Administration of China (SAC; ) is the "guidance unit" () for this white paper. Source CESI website, January 24, 2018. CESI is a think tank subordinate to the PRC Ministry of Industry and Information Technology (MIIT; ); CESI is also known as the 4th Electronics Research Institute (; ) of MIIT. SAC is a component of the PRC State Administration for Market Regulation (), a ministry-level agency under China's cabinet, the State Council. The Chinese source text is available online at: http://www.cesi.cn/images/editor/20180124/20180124135528742.pdf Translation Date Translator Editor May 12, 2020 Etcetera Language Group, Inc. Ben Murphy, CSET Translation Lead Contributing institutions (in no particular order) China Electronics Standardization Institute Shanghai Development Center of Computer (CESI) Software Technology Institute of Automation, Chinese Academy Shanghai Xiao-i Robot Technology Co., Ltd. of Sciences Beijing Institute of Technology Beijing iQIYI Technology Co., Ltd. Tsinghua University Beijing Yousheng Zhiguang Technology Co., Ltd. (mavsyin) Peking University Extreme Element (Beijing) Intelligent Technology Co., Ltd. Renmin University of China Beijing ByteDance Technology Co., Ltd. Beihang University Beijing Sensetime Technology Development Co., Ltd. iFLYTEK Co., Ltd. Zhejiang Ant Small and Micro Financial Services Group Co., Ltd. Huawei Technologies Co., Ltd. Baidu Netcom Science and Technology Co., Ltd. IBM China Co. Ltd. Intel (China) Co., Ltd. Alibaba Cloud Computing Ltd. IBM China Company Limited Panasonic Corporation of China Chongqing Kaize Technology Co., Ltd. (t jydin) China Telecom Corporation Limited Haier Industry Intelligence Research Institute Co., Ltd. Tencent Internet Plus (Shenzhen) Co., Ltd. Chongqing CloudWalk Technology Co. Ltd. Alibaba Network Technology Co., Ltd. Beijing DeepGlint Technology Limited Contents REFACE ESEARCHBACKROUN ESEARCHOBJECTIVESANSINIFICANCE UTINEOF/RTIFICIA5NTEIENCE/5 ( 4ISTORANCONCEPTOF/5( RIINSANHISTOROF/5( 1ONCEPTOF/5 3EATURESOF/5 ( /5REFERENCEARCHITECTURE, ( 1URRENT/5SITUATIONANTRENS ( 7E/5TECHNOOIES 8ACHINEEARNIN 7NOEERAPHS 9ATURAANUAEPROCESSIN( 4UMANCOMPUTERINTERACTION) 1OMPUTERVISION 0IOMETRICFEATURERECONITION- IRTUAREAITAUMENTEREAIT. RENSIN/5TECHNOOEVEOPMENT ( 1URRENTSITUATIONANTRENSOFTHE/5INUSTR 5NTEIENTINFRASTRUCTURE 5NTEIENTINFORMATIONANATA( 5NTEIENTTECHNOOSERVICES( 5NTEIENTPROUCTS) 5NUSTRIAAPPICATIONSOF/5 /5INUSTRTRENS. (( AFETETHICAANPRIVACISSUES( /5SAFETISSUES( /5ETHICAISSUES( /5PRIVACISSUES( () HEIMPORTANTROEOF/5STANARIATION(( ) 1URRENTSTATEOF/5STANARIATION( ) 1URRENTSTATEOFSTANARIATIONINTERNATIONA( 552161( 5(, 521(- 5(- ) 1URRENTSTATEOFSTANARIATIONOVERSEAS(- 5222(- 95(. THER(. )( 1URRENTSTATEOFSTANARIATIONIN1HINA(. I 9ATIONA5NFORMATIONECHNOOTANARIATIONECHNICA 1OMMITTEE(. 1HINA9ATIONAECHNICA1OMMITTEEFOR/UTOMATIONSTEMS AN5NTERATIONTANARIATION) 9ATIONA/UIOIEOAN8UTIMEIATANARIATION ECHNICA1OMMITTEE) 9ATIONA5NFORMATIONECURITTANARIATIONECHNICA 1OMMITTEE) 9ATIONAECHNICA1OMMITTEE-ON5NTEIENTRANSPORT STEMS) )) ROBEMSANCHAENESFACIN/5STANARIATION) ) /NASISOF/5STANARIATIONREUIREMENTS) ) 1ONSTRUCTIONOFORANIATIONAMECHANISMSFOR/5STANARIATION)( /5STANARIATIONSSTEMS)) TRUCTUREOFTHE/5STANARIATIONSSTEM)) TANARSSSTEMFRAMEORK) 3OUNATIONASTANARS), ATFORMSUPPORTSTANARS), 7ETECHNICASTANARS), ROUCTSANSERVICESSTANARS). /PPICATIONSSTANARS). ECURITETHICSSTANARS ( TANARSTHATURENTNEETOBEFORMUATEINTHENEARTERM 7ERECOMMENATIONSFOR/5STANARIATIONORK) II 1 Preface 1.1 Research background The concept of artificial intelligence (AI) was born in 1956. Over the course of more than a half century of development, AI technology and applications have experienced many ups and downs due to the influence of intelligent algorithms, computing speeds, data storage levels, and other factors. Since 2006, tremendous success has been achieved in fields like machine vision and speech recognition using machine learning algorithms represented by deep learning. The great improvements in recognition accuracy have once again made AI the object of widespread attention across academia and industry. At the same time that technologies like cloud computing and big data are boosting calculation speeds and lowering computing costs, they are also providing rich data resources for AI development, helping to train more intelligentized (l) algorithm models. The AI development model has also undergone a gradual transformation from seeking to "use computers to simulate artificial intelligence" to enhanced hybrid intelligent systems combining machines and humans. These use combinations of machines, humans, and networks to form collective intelligence systems, and use combinations of machines, humans, networks, and things to form more complex intelligent systems. As the core driving force behind a new round of industrial transformation, AI is spawning new technologies and new products. At the same time, AI is also playing a powerful empowering role for traditional industries. It has the potential to induce major changes in the industrial structure and to bring about an overall leap in social productive forces. AI can free humans from monotonous labor, as more and more simple, repetitive and dangerous tasks are being completed by AI systems. While reducing labor inputs and increasing work efficiency, AI systems can also do things faster and more accurately than humans. AI can find broad applications in such fields as education, medical treatment, eldercare, environmental protection, city operation, and justice services, and it is able to greatly improve the precision level of public services and increase the quality of people's lives across the board. AI can also help humanity accurately perceive, predict, and warn of major situations in the operation of infrastructure and public security, promptly grasp changes in collective consciousness and psychology, take the initiative to make decisions in reaction to events, and significantly improve social governance capabilities while safeguarding public security. Since AI is a future-shaping strategic technology, the world's developed nations are all striving for dominance in a new round of international competition, and issuing plans and policies centered around AI. They are making deployments for core AI technologies, top AI talents, and AI standards and norms, and are accelerating the development of AI technologies and industries. The major technology companies are constantly enlarging their investments of money and manpower to seize the high ground in AI development. In 2017, China released the New Generation Artificial Intelligence Development Plan (Guo Fa [2017] No. 35), the Three Year Action Plan to Promote the Development of the New Generation Artificial Intelligence Industry, 2018-2020 (No. 231 [2018] of the Ministry of Industry and Information Technology) and other policy documents to promote the R&D and industrialized development of AI technology. Currently, there is a certain technological and industrial foundation for AI development in China, with an array of AI companies concentrated in the microchip, data, platform, and applications fields. They have achieved good initial results in some directions 1 and moved towards commercialized development. For example, applications have been achieved in such fields as the financial, security, and service industries; and in specific tasks, the degree of accuracy and effectiveness of semantic recognition, speech recognition, facial recognition, image recognition technologies already far exceed those of humans. Standardization work plays fundamental, supportive, and guiding roles for AI and its industrial development. It represents both a key lever for promoting the development of industry innovation, as well as the commanding heights of industry competition. At present, while AI-related products and services in China are growing increasingly abundant, problems with inadequate degrees of standardization are also surfacing. AI involves numerous fields. Although some fields already have a certain foundation of standardization, these scattered standardization efforts are insufficient to fully support the entire AI field. In addition, AI is an emerging field that is just beginning to take off. On a global scale, standardization work is still in its infancy, and a complete system of standards has yet to take shape. With China and the rest of the world basically on the same starting line, a window of opportunity exists for breakthroughs. As long as we take aim at that opportunity and deploy rapidly, it will be entirely possible to seize the commanding heights of standards innovation. Otherwise, a good opportunity may be lost. Consequently, there is an urgent need to take advantage of the opportunity at hand, accelerate research on AI technology and industry development, systematically sort through and develop systems of standards for various AI fields, clarify the relationships of dependency and constraint between standards, establish unified and complete systems of standards, and use
Recommended publications
  • Standards Analysis Ict Sector Luxembourg
    Version 6.0 – June 2016 ISSN 2354-483X STANDARDS ANALYSIS ICT SECTOR LUXEMBOURG · V5.0/09.2015 LUXEMBOURG LUXEMBOURG ICT SECTOR ICT CONTACT : ILNAS & ANEC STANDARDS ANALYSIS ANALYSIS STANDARDS Southlane Tower I · 1, avenue du Swing · L-4367 Belvaux Tel. : (+352) 24 77 43 -70 · Fax : (+352) 24 79 43 -70 E-mail : [email protected] ANS/AN03 www.portail-qualite.lu 2015 · © ILNAS/ANEC September Executive summary In 2012 the “Institut Luxembourgeois de la Normalisation, de l’Accréditation, de la Sécurité et qualité des produits et services” (ILNAS) initiated an analysis of European and international standards in the Information and Communication Technology (ICT) sector. The aim of this analysis is to develop an information and exchange network for ICT standardization knowledge in the Grand Duchy of Luxembourg. Since 2013, this analysis has been carried out in the frame of the implementation of the “Luxembourg’s policy on ICT technical standardization” (which was last updated in 2015)1. The ICT sector is already an active sector at the national standardization level with 65 national delegates currently registered by ILNAS2. These delegates are involved in standardization technical committees to participate actively and follow closely the work performed at international level and have the task of ensuring that the views and positions of Luxembourg are understood and known by the technical committees. ILNAS is convinced that this sector could be even more active, especially since some ICT subsectors do not yet benefit from a sufficient representation of national delegates (e.g.: Internet of Things, Cloud Computing, Big Data, Smart Cities). Thus, the purposes of this analysis are firstly, to provide useful information to national stakeholders regarding standardization activities in the field of ICT and secondly, to involve them into an integrated and innovative approach of standardization.
    [Show full text]
  • China's Changes Through Reform on Documentary
    18 | Thursday, November 21, 2019 LIFE CHINA DAILY HONG KONG EDITION Israel tastes the majesty of the Mogao Grottoes JERUSALEM — An exhibition centered around the Mogao Left: A scene from the hit series The Mystic Nine, which features Grottoes, a renowned UNESCO actress Zhao Liying and actor William Chan. Above: A still image World Heritage Site in North­ from The Golden Eyes, starring actor Zhang Yixing (center) as the west China’s Gansu province, protagonist. PHOTOS PROVIDED TO CHINA DAILY kicked off last week at the China Cultural Center in Tel Aviv, Israel. The exhibition, which has been met with fascination by local attendees, includes two parts — the digital Dunhuang exhibition, titled The Pearl of the Silk Road Living in Digitized Eternity and Streaming toward success the Chinese painting exhibition, titled Pilgrimage Dunhuang. Wearing virtual reality glasses, visitors are able to appreciate the A new wave of hen producer Cai Jia excellent dramas as possible, and majesty of the famous caves, was a young child, the read popular review sites like Dou­ sculptures and paintings from domestic drama little “secret” that ban to find out what genres audien­ the Mogao Grottoes, and interact producers are pleased her most was ces want to see. with them. to slip out of bed, sneak into the liv­W Her entry into the industry was Dozens of Chinese paintings picking up ing room and turn on the TV set marked by something of a coinci­ depicting the Mogao Grottoes when her parents were sleeping. dence, since 2015 was the year and their art, created by Gansu awards across Approaching midnight, The Jour­ regarded as a turning point, and one Art Institute, proved to be a star Asia, and raising ney to the West — perhaps the most that reshaped China’s film and TV attraction.
    [Show full text]
  • Read the Report
    23 Dec 2020 CMB International Securities | Equity Research | Company Initiation iQIYI (IQ US) BUY (Initiation) China’s online Disney in the making Target Price US$23.2 Up/Downside +34.8% Current Price US$17.2 We keep positive on IQIYI’s long-term subs trend and margin improvement , backed by its sizable users, unique original content and valuable IP. We forecast iQIYI to deliver 11% revenue CAGR during FY20-22E, with subs price hike and China Internet Sector ads recovery. Given fundraising partly priced in, we think iQIYI’s valuation is attractive. Initiate with BUY with DCF-based TP US$23.2. Sophie Huang An online video leader: Not only “Netflix”, but more. iQIYI is a leading (852) 3900 0889 online video platform in China, with 105mn subs. Backed by its sizable users, [email protected] vibrant original content and IP reserve, we forecast iQIYI to deliver 11% rev CAGR in FY20-22E, with subs price hike, ads recovery, and user expansion. Miriam Lu (852) 3761 8728 Original content makes the difference. With online video industry growth [email protected] tapering off, we believe content matters more to users than traffic or subsidies. iQIYI exceled itself with exclusive original content in both popular dramas and variety shows. Moreover, it bore initial fruits from short-episode dramas (e.g. Stock Data Mkt Cap (US$ mn) 13,499 the popularity of The Bad Kids <隐秘的角落>), and would further enrich Light Avg 3 mths t/o (US$ mn) 180.41 on Series (迷雾剧场) with stronger pipeline. We expect short-episode dramas 52w High/Low (US$) 28.03/ 14.51 series to stimulate its subs & brand ads momentum with higher ROI, such as Total Issued Shares (mn) 373 Who is Murderer<谁是凶手> in 2021 pipeline.
    [Show full text]
  • Standards Analysis Ict Sector Luxembourg
    Version 5.0 – September 2015 ISSN 2354-483X STANDARDS ANALYSIS ICT SECTOR LUXEMBOURG · V5.0/09.2015 LUXEMBOURG LUXEMBOURG ICT SECTOR ICT CONTACT : ILNAS & ANEC STANDARDS ANALYSIS ANALYSIS STANDARDS Southlane Tower I · 1, avenue du Swing · L-4367 Belvaux Tel. : (+352) 24 77 43 -70 · Fax : (+352) 24 79 43 -70 E-mail : [email protected] ANS/AN03 www.portail-qualite.lu 2015 · © ILNAS/ANEC September Executive summary The “Institut Luxembourgeois de la Normalisation, de l’Accréditation, de la Sécurité et qualité des produits et services” (ILNAS) has inititated in 2012 an analysis of European and international standards in the Information and Communication Technology (ICT) sector. This analysis intends to develop an information and exchange network for ICT standardization knowledge in the Grand Duchy of Luxembourg. Since 2013, this analysis has been carried out in the frame of the implementation of the “Luxembourg’s policy on ICT technical standardization” (which was last updated in 2015)1. The ICT sector is already an active sector at the national standardization level with 53 national delegates currently registered by ILNAS. These delegates are involved in standardization technical committees to participate actively and follow closely the work performed at international level and have the task of ensuring that the views and positions of the country are understood and known by the technical committees. ILNAS is convinced that this sector could be even more active, especially since some ICT subsectors do not yet have national delegates (e.g.: Sensor Networks, Internet of Things). Thus, the purposes of this analysis are firstly, to provide useful information to national stakeholders regarding standardization activities in the field of ICT and secondly, to involve them into an integrated and innovative approach of standardization.
    [Show full text]
  • Industrial Automation
    ISO Focus The Magazine of the International Organization for Standardization Volume 4, No. 12, December 2007, ISSN 1729-8709 Industrial automation • Volvo’s use of ISO standards • A new generation of watches Contents 1 Comment Alain Digeon, Chair of ISO/TC 184, Industrial automation systems and integration, starting January 2008 2 World Scene Highlights of events from around the world 3 ISO Scene Highlights of news and developments from ISO members 4 Guest View ISO Focus is published 11 times Katarina Lindström, Senior Vice-President, a year (single issue : July-August). It is available in English. Head of Manufacturing in Volvo Powertrain and Chairman of the Manufacturing, Key Technology Committee Annual subscription 158 Swiss Francs Individual copies 16 Swiss Francs 8 Main Focus Publisher • Product data – ISO Central Secretariat Managing (International Organization for information through Standardization) the lifecycle 1, ch. de la Voie-Creuse CH-1211 Genève 20 • Practical business Switzerland solutions for ontology Telephone + 41 22 749 01 11 data exchange Fax + 41 22 733 34 30 • Modelling the E-mail [email protected] manufacturing enterprise Web www.iso.org • Improving productivity Manager : Roger Frost with interoperability Editor : Elizabeth Gasiorowski-Denis • Towards integrated Assistant Editor : Maria Lazarte manufacturing solutions Artwork : Pascal Krieger and • A new model for machine data transfer Pierre Granier • The revolution in engineering drawings – Product definition ISO Update : Dominique Chevaux data sets Subscription enquiries : Sonia Rosas Friot • A new era for cutting tools ISO Central Secretariat • Robots – In industry and beyond Telephone + 41 22 749 03 36 Fax + 41 22 749 09 47 37 Developments and Initiatives E-mail [email protected] • A new generation of watches to meet consumer expectations © ISO, 2007.
    [Show full text]
  • Stephan Goericke Editor the Future of Software Quality Assurance the Future of Software Quality Assurance Stephan Goericke Editor
    Stephan Goericke Editor The Future of Software Quality Assurance The Future of Software Quality Assurance Stephan Goericke Editor The Future of Software Quality Assurance Editor Stephan Goericke iSQI GmbH Potsdam Germany Translated from the Dutch Original book: ‘AGILE’, © 2018, Rini van Solingen & Manage- ment Impact – translation by tolingo GmbH, © 2019, Rini van Solingen ISBN 978-3-030-29508-0 ISBN 978-3-030-29509-7 (eBook) https://doi.org/10.1007/978-3-030-29509-7 This book is an open access publication. © The Editor(s) (if applicable) and the Author(s) 2020 Open Access This book is licensed under the terms of the Creative Commons Attribution 4.0 Inter- national License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence and indicate if changes were made. The images or other third party material in this book are included in the book’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the book’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
    [Show full text]
  • What Do Chinese-Language Microblog Users Do with Baidu Baike And
    What do Chinese-language microblog users do with Baidu Baike and Chinese Wikipedia? A case study of information engagement Han-Teng Liao Oxford Internet Institute University of Oxford Oxford, United Kingdom [email protected] ABSTRACT than in the West [3]. By the first half year of 2010, the content This paper presents a case study of information engagement based produced by amateur Chinese Internet users surpassed that on microblog posts gathered from Sina Weibo and Twitter that produced by professional websites [25]. According to a survey mentioned the two major Chinese-language user-generated report based on 2012 data, around 66.1% of mainland Chinese encyclopaedias. The content analysis shows that microblog users Internet users used a blog/personal space, 54.7% used microblogs, not only engaged in public discussions by using and citing both and 48.8% used social networking websites [5]. One US market encyclopaedias, but also shared their perceptions and experiences report found that 47% of Chinese broadband users had more generally with various online platforms and China’s contributed to UGC in China, including online review sites, filtering/censorship regime to which user-generated content and forums, blogs, etc., and that this UGC content had influenced activities are subjected. This exploratory study thus raises several 58% of purchase decisions in China. Both numbers are much research and practice questions on the links between public higher than the corresponding statistics in the US [23]. discussions and information engagement on user-generated The growth of UGC in mainland China may have contributed to platforms. the creation of a so-called “public opinion monitoring sector” (舆 情监测服務市場), where managers (mostly from the public Categories and Subject Descriptors sector) buy up-to-date information about the latest online public [Human-centered computing]: Collaborative and social opinion in mainland China.
    [Show full text]
  • Cross-Border E-Commerce
    How to Enter the Chinese Market via Cross Border Ecommerce Speaker: John Piao & Winnie Xu We value your success www.digiantglobal.com www.ChemLinked.com SPEAKERS John Piao Winnie Xu Co-founder and Head of Digiant Global E-commerce ChemLinked Cosmetic Regulatory Editor & Analyst Department Winnie is the cosmetic regulatory editor and analyst from John has 13 years of E-commerce Operations and ChemLinked with considerable expertise in cosmetic Digital Marketing experience. He has led several cross- regulations of China, Japan, ASEAN and Australia. Having border transactions between Europe and Asia while at attended various industry conferences in China, she has BetterLife Group, Parkland Group and Lenovo. In the extensive experience in China cosmetic market entry, past 8 years, John has promoted consumers' F&B, especially in CBEC, cosmetic pre-market approval Personal Care, and Fashion brands from Germany and compliance requirements. Italy in the Chinese Market. Email: [email protected] Email: [email protected] How to Enter the Chinese Market via Cross Border Ecommerce Speaker: John Piao & Winnie Xu We value your success www.digiantglobal.com www.ChemLinked.com Agenda 1. Overview of Chinese Cosmetic/Personal Care Products in CBEC Market 1.1 Growing Trends of Cosmetic Products 1.2 China CBEC Market Size 2. Deeply Decoding China CBEC Channel 2.1 What is CBEC? 2.2 Two Major Modes 2.3 Interpretation of 2018 CBEC policy 2.4 Comparison 3. Winning Strategy for Chinese EC Market 3.1 E-commerce Law 3.2 Factors that Influence Chinese Users’ Online Shopping 4. Preparation of Starting EC Operation in China 4.1 Are You Ready to start EC Operation? 4.2 Consumer’s Journey and Preparation from Brand 4.3.
    [Show full text]
  • Navigating China's Search Engine Market
    Navigating China’s Search Engine Market October 19, 2017 Tom Garzilli Chief Marketing Officer Brand USA 2 Brand USA Webinars All Brand USA Webinars can be viewed and downloaded from the Brand USA website at: http://www.thebrandusa.com/media-events/webinars 3 Michael Horvitz Director, Strategic Partnerships Hylink Digital Solutions 4 Who is Baidu? - Search, Data, Technology In a market without Google, Baidu dominates the Desktop and Mobile search market in China. How to maximize the use of Baidu, bringing market value to your brand? 5 Agenda • Introduction of Baidu • Search for U.S. travel on Baidu • Market your destination on Baidu • Case study- creative AR campaigns on Baidu • Solutions for Brand USA 6 INTRODUCTION OF BAIDU Baidu is the Google of China = Baidu Mobile Search Page Baidu Desktop Search Page 8 Search on Baidu Travel in the USA Search 9 Baidu: Key Numbers to Highlight ● 100+ Million Daily Active Users ● 6+ Billion Total Daily Search Volume ● 83% Market Share in China’s Search Market 10 Source: Baidu Baidu is More Than a Search Engine Baidu Baike Baidu Zhidao Has Solved million+ queries is equal to 5 0.23 billion+ questions times of “The British 5 Encyclopedia” If 1 person needs 1 minute to answer 1 question, it would take 4 centuries to answer all questions 11 Source: Baidu Travel Industry Search Volume Trend Travel industry search volume on mobile has surpassed desktop and continues to increase. 300,000,000 200,000,000 100,000,000 2014 2015 2016 2017 Total Desktop Mobile Desktop YoY: -26.9% Mobile YoY: +11.6% Total YoY:
    [Show full text]
  • Downloaded and Used in Any Application in That Wikipedia Has Been Blocked Intermittently Ever Since It Was First Language.1 Established in 2001
    Censorship of Online Encyclopedias: Implications for NLP Models Eddie Yang∗ Margaret E. Roberts∗ [email protected] [email protected] University of California, San Diego University of California, San Diego La Jolla, California La Jolla, California ABSTRACT use daily. NLP impacts how firms provide products to users, con- While artificial intelligence provides the backbone for many tools tent individuals receive through search and social media, and how people use around the world, recent work has brought to attention individuals interact with news and emails. Despite the growing that the algorithms powering AI are not free of politics, stereotypes, importance of NLP algorithms in shaping our lives, recently schol- and bias. While most work in this area has focused on the ways ars, policymakers, and the business community have raised the in which AI can exacerbate existing inequalities and discrimina- alarm of how gender and racial biases may be baked into these al- tion, very little work has studied how governments actively shape gorithms. Because they are trained on human data, the algorithms training data. We describe how censorship has affected the devel- themselves can replicate implicit and explicit human biases and opment of Wikipedia corpuses, text data which are regularly used aggravate discrimination [6, 8, 39]. Additionally, training data that for pre-trained inputs into NLP algorithms. We show that word em- over-represents a subset of the population may do a worse job beddings trained on Baidu Baike, an online Chinese encyclopedia, at predicting outcomes for other groups in the population [13]. have very different associations between adjectives and a range of When these algorithms are used in real world applications, they concepts about democracy, freedom, collective action, equality, and can perpetuate inequalities and cause real harm.
    [Show full text]
  • The Copycat of Wikipedia in China
    The Copycat of Wikipedia in China Gehao Zhang COPYCATS OFWIKIPEDIA Wikipedia, an online project operated by ordinary people rather than professionals, is often considered a perfect example of human collaboration. Some researchers ap- plaud Wikipedia as one of the few examples of nonmarket peer production in an overwhelmingly corporate ecosystem.1 Some praise it as a kind of democratization of information2 and even call it a type of revolution.3 On the other hand, some researchers worry about the dynamics and consequences of conflicts4 in Wikipedia. However, all of these researchers have only analyzed the original and most well- known Wikipedia—the English Wikipedia. Instead, this chapter will provide a story from another perspective: the copycat of Wikipedia in China. These copycats imitate almost every feature of Wikipedia from the website layout to the core codes. Even their names are the Chinese equivalent of Pedia. With the understanding of Wiki- pedia’s counterpart we can enhance our sociotechnical understanding of Wikipedia from a different angle. Wikipedia is considered an ideal example of digital commons: volunteers gener- ate content in a repository of knowledge. Nevertheless, this mode of knowledge production could also be used as social factory,5, 6 and user-generated content may be taken by commercial companies, like other social media. The digital labor and overture work of the volunteers might be exploited without payment. Subsequently, the covert censorship and surveillance system behind the curtain can also mislead the public’s understanding of the content. In this sense, the Chinese copycats of Wiki- pedia provide an extreme example. This chapter focuses on how Chinese copycats of Wikipedia worked as a so-called social factory to pursue commercial profits from volunteers’ labor.
    [Show full text]
  • Flywheel Models + Iqiyi (NASDAQ: IQ) an Investment Case Study by Hayden Capital Valuex Vail | June 27-29, 2018
    Flywheel Models + iQiyi (NASDAQ: IQ) An Investment Case Study By Hayden Capital ValueX Vail | June 27-29, 2018 Hayden Capital 79 Madison Ave, 3rd Floor New York, NY. 10016 Office: (646) 883-8805 Mobile: (513) 304-3313 Email: [email protected] Disclaimer These materials shall not constitute an offer to sell or the solicitation of an offer to buy any interests in any fund or account managed by Hayden Capital LLC (“Hayden Capital”) or any of its affiliates. Such an offer to sell or solicitation of an offer to buy will only be made pursuant to definitive subscription documents between a fund and an investor. The fees and expenses charged in connection with the investment may be higher than the fees and expenses of other investment alternatives and may offset profits. No assurance can be given that the investment objective will be achieved or that an investor will receive a return of all or part of his or her investment. Investment results may vary substantially over any given time period. Reference and comparisons are made to the performance of other indices (together the “Comparative Indexes”) for informational purposes only. Hayden Capital’s investment program does not mirror any of the Comparative Indexes and the volatility of Hayden Capital’s investment strategy may be materially different than that of the Comparative Indexes. The securities or other instruments included in the Comparative Indexes are not necessarily included in Hayden Capital’s investment program and criteria for inclusion in the Comparative Indexes are different than those for investment by Hayden Capital. The performance of the Comparative Indexes was obtained from published sources believed to be reliable, but which are not warranted as to accuracy or completeness.
    [Show full text]