Enabling Large-Scale Collaboration in Language Conservation
Total Page:16
File Type:pdf, Size:1020Kb
Enabling Large-Scale Collaboration in Language Conservation Mat Bettinson Submitted in total fulfilment of the requirements of the degree of Doctor of Philosophy School of Languages and Linguistics T U M January 2019 Copyright © 2019 Mat Bettinson All rights reserved. No part of the publication may be reproduced in any form by print, photoprint, microfilm or any other means without written permission from the author. Abstract present scale of conservation activity falls short of what is required to address T the rate of language loss worldwide. Current practice is framed as expert activity which limits the audience of would-be conservators. However decentralised collabora- tive action is commonplace in the post Web 2.0 digital landscape, opening the door for self-determined acts of conservation. This thesis shifts the emphasis towards large-scale collaboration inspired by crowdsourcing collective intelligence in other domains. I engage in a programme of applied research to formulate an architecture for a collaborative mobile app/services suited for the constrained resource environments of language activism. This is achieved by means of an inductive-iterative method centred on the development of three significant software artefacts positioned as practical solutions for existing challenges in language conservation. Each of these projects was developed in conjunction with speakers of an endangered language in rural North Western Taiwan. I describe a pattern for a for large-scale language conservation app/service and present a complete implementation. I conclude this thesis by discussing the opportunities and remaining challenges for achiev- ing large-scale collaborative language conservation. iii Declaration This is to certify that 1. the thesis comprises only my original work towards the PhD, 2. due acknowledgement has been made in the text to all other material used, 3. the thesis is less than 100,000 words in length, exclusive of tables, maps, bibliogra- phies and appendices. Mat Bettinson, January 2019 v Acknowledgements This work, and indeed all that I do, is dedicated to my wife Jeannine. Her unwavering support through thick and thin, in times of plenty and times of little, was crucial for this journey, and the ultimate destination of finding meaning and happiness. Particular thanks go to Steven Bird who inspired me every step, and offered the guid- ance and support to explore this subject, particularly when it seemed unknowable. Onya boss. Thanks also to Nick Thieberger who was always on hand to guide me as my work took another unexpected turn. Thanks to Elizabeth Zeitoun for her advice and introductions which lead to my con- nection with the Saisiyat people. I’m grateful to so many Saisiyat people for their endur- ing patience and hospitality, but particularly to my brother bo:ong ba:bai and his family who hosted me in DongHe village and went out of their way to allow me to observe and participate in the spirit of the Saisiyat people. I’m grateful to ya’aw kaybaybaw that intro- duced me to the beautiful children of DongHe junior school. Ma’alo! Thanks also to the extended Penglai clan for their hospitality and friendship and particularly thanks to my friend ’oemaw for not getting angry when I sprayed him with petrol and little sa:wan for not destroying all of my equipment. Ma’alo! I’d like to acknowledge Liu YuYang for his incredible work in digital language preser- vation for all Formosan languages, and thank him for setting an example of the good work ethno-technologists can do. A few Taipei-based laoweis provided another layer of support and friendship in Taiwan, including Tobie Openshaw, Sean Kaiteri and my lefty besty Monti. vii Preface This thesis contains and extends some of the work presented in the following publications: • Bettinson, M. & Bird, S. (2018) Image-Gesture-Voice: A Web Component for Elic- iting Speech, Proceedings of the LREC 2018 Workshop “CCURL2018 –Sustaining Knowledge Diversity in the Digital Age” • Bettinson, M. & Bird, S. (2017) Developing a Suite of Mobile Applications for Col- laborative Language Documentation, Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages. ix To Jeannine. xi Contents 1 Introduction 1 1.0.1 The post-2.0 digital age ......................... 5 1.0.2 Language Conservation 2.0 ....................... 7 1.1 Statement of the problem ............................ 8 1.2 Aim and scope .................................. 10 1.3 Research objectives ............................... 10 1.3.1 A versatile scalable method for language documentation ...... 11 1.3.2 Collaborative workflow management for mobile devices ....... 11 1.3.3 The collaborative language app architecture ............. 12 1.4 Overview of the study .............................. 13 2 Background 17 2.1 Language Endangerment ............................ 18 2.1.1 Languages ................................ 18 2.1.2 Linguistic and cultural diversity .................... 19 2.1.3 Causes of language loss ......................... 20 2.1.4 Process of language loss ......................... 21 2.2 The scale challenge ................................ 22 2.3 Responses to language endangerment ..................... 23 2.3.1 Language Documentation ........................ 24 2.3.2 Challenges for language documentation ................ 26 2.3.3 Language Revitalisation ......................... 27 2.4 The digital landscape .............................. 29 2.4.1 Global digital access ........................... 29 2.4.2 The Web 2.0 ............................... 30 2.4.3 Mobile phones .............................. 31 2.4.4 Web Technologies ............................ 32 2.5 Digital tools for language conservation .................... 33 2.5.1 Metadata tools .............................. 34 2.5.2 Elicitation tools ............................. 35 2.5.3 Annotation tools ............................. 35 2.5.4 Conservation groupware ........................ 37 2.5.5 Mobile apps ............................... 38 2.5.6 Sustainability of Digital Tools ..................... 40 2.6 Workflow Management ............................. 41 xiii 2.6.1 From Crowdsourcing to peer production ............... 42 2.6.2 Computer Supported Language Documentation ........... 43 2.7 Saisiyat ...................................... 45 2.7.1 Community based language revitalization ............... 47 3 First steps in collaborative tool design 51 3.1 Pilot study .................................... 51 3.1.1 Motivation ................................ 52 3.1.2 Method .................................. 53 3.1.3 Results .................................. 54 3.1.4 ‘Fixing’ Aikuma ............................. 55 3.1.5 Discussion ................................ 58 3.2 Field study with existing tools ......................... 60 3.2.1 Documentary method .......................... 60 3.2.2 Fieldwork ................................. 63 3.2.3 Findings ................................. 66 3.2.4 Discussion ................................ 68 3.3 Aikuma-NG .................................... 70 3.3.1 Motivation ................................ 71 3.3.2 Requirements .............................. 72 3.3.3 Developing in the field ......................... 73 3.3.4 Implementation ............................. 75 3.3.5 Discussion ................................ 81 4 Aikuma-Link: a tool for network collaboration 87 4.1 Low resource software development ...................... 88 4.2 The ‘serverless’ architecture ........................... 90 4.3 In consideration of prototyping ......................... 91 4.4 A minimal collaborative workflow management scheme ........... 92 4.5 Aikuma-Link ................................... 95 4.5.1 Design .................................. 95 4.5.2 Implementation ............................. 106 4.6 Discussion .................................... 111 4.6.1 The ‘serverless’ paradigm ........................ 112 4.6.2 Mobile design and responsive ergonomics .............. 114 4.6.3 Aikuma-Link in retrospect ....................... 115 4.6.4 Future work ............................... 115 xiv 5 Bringing it all together 117 5.1 Offline-first design ................................ 118 5.1.1 Mobile access data models ....................... 118 5.1.2 Conflict-free collaborative data ..................... 119 5.1.3 Synchronisation tactics ......................... 120 5.1.4 Supporting discovery offline ...................... 122 5.2 Mobile workflow management ......................... 124 5.2.1 Overview ................................. 124 5.2.2 The local component .......................... 126 5.2.3 The social component .......................... 128 5.3 Image Gesture Voice ............................... 133 5.3.1 Beyond BOLD .............................. 134 5.3.2 IGV record ................................ 135 5.3.3 IGV annotate ............................... 136 5.3.4 Mobile view layout ............................ 137 5.3.5 Images and knowledge sharing ..................... 138 5.3.6 Recording and representing touch ................... 139 5.4 Zahwa ....................................... 140 5.4.1 Overview ................................. 141 5.5 Initial design ................................... 143 5.5.1 App structure .............................. 144 5.5.2 Zahwa’s menu .............................. 145 5.5.3 Recipe discovery ............................. 147 5.5.4 Recipe authoring ............................. 148 5.5.5 Translating recipes ..........................