Language Archive Records: Interoperability of Referencing Practices and Metadata Models
University of North Dakota
UND Scholarly Commons
Theses and Dissertations. Theses, Dissertations, and Senior Projects
January 2021

Hugh J. Paterson III

Recommended Citation
Paterson III, Hugh J., "Language Archive Records: Interoperability of Referencing Practices and Metadata Models" (2021). Theses and Dissertations. 3937. https://commons.und.edu/theses/3937

This Thesis is brought to you for free and open access by the Theses, Dissertations, and Senior Projects at UND Scholarly Commons. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of UND Scholarly Commons.

LANGUAGE ARCHIVE RECORDS: INTEROPERABILITY OF REFERENCING PRACTICES AND METADATA MODELS

by

Hugh Joseph Paterson III
Bachelor of Arts, Southern Illinois University Edwardsville, 2007

A Thesis Submitted to the Graduate Faculty of the University of North Dakota in partial fulfillment of the requirements for the degree of Master of Arts

Grand Forks, North Dakota
May 2021

Copyright 2021 Hugh Joseph Paterson III

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

This thesis, submitted by Hugh Joseph Paterson III in partial fulfillment of the requirements for the Degree of Master of Arts from the University of North Dakota, has been read by the Faculty Advisory Committee under whom the work has been done and is hereby approved.

J. Albert Bickford, Chair
Douglas M. Fraiser
Robert Fried

This thesis is being submitted by the appointed advisory committee as having met all of the requirements of the School of Graduate Studies at the University of North Dakota and is hereby approved.
Chris Nelson
Dean of the School of Graduate Studies

PERMISSION

Title: Language Archive Records: Interoperability of Referencing Practices and Metadata Models
Department: Linguistics
Degree: Master of Arts

In presenting this thesis in partial fulfillment of the requirements for a graduate degree from the University of North Dakota, I agree that the library of this University shall make it freely available for inspection. I further agree that permission for extensive copying for scholarly purposes may be granted by the professor who supervised my thesis work or, in his absence, by the Chairperson of the department or the dean of the School of Graduate Studies. It is understood that any copying or publication or other use of this thesis or part thereof for financial gain shall not be allowed without my written permission. It is also understood that due recognition shall be given to me and to the University of North Dakota in any scholarly use which may be made of any material in my thesis.

Hugh Joseph Paterson III
4/21/2021

CONTENTS

List of Figures
List of Tables
Acknowledgments
Abbreviations
Abstract

CHAPTER
1. Background
2. Importing bibliographic data
    2.1. DOI import
    2.2. Embedded metadata in HTML import
    2.3. File based import
3. Using bibliographic data
    3.1. Export files
    3.2. Citation Style Language based export
    3.3. Crafting references
4. Archives
    4.1. Archive organization & record management
        4.1.1. Tiered structure of archives
        4.1.2. Bibliographic records and archives
        4.1.3. Metadata standards and archives
        4.1.4. Referencing archival materials
    4.2. PARADISEC
        4.2.1. Technology infrastructure
        4.2.2. Collections structure
        4.2.3. Collections and artifacts reviewed
        4.2.4. DOI import
            4.2.4.1. DOI import from Collection
            4.2.4.2. DOI import from Item
            4.2.4.3. DOI Discussion
        4.2.5. Embedded metadata in HTML for collection and item levels
        4.2.6. File download
        4.2.7. Complete or sufficient
            4.2.7.1. Collection
            4.2.7.2. Item
    4.3. Pangloss
        4.3.1. Technology infrastructure
        4.3.2. Collections structure
        4.3.3. Collections and artifacts reviewed
        4.3.4. DOI import
            4.3.4.1. DOI Collection import
            4.3.4.2. DOI Artifact import
        4.3.5. Embedded metadata
            4.3.5.1. HTML
            4.3.5.2. unAPI
        4.3.6. File download
        4.3.7. Complete or sufficient
            4.3.7.1. APA Collection—Cocoon
            4.3.7.2. APA Collection—Villejuif
            4.3.7.3. Chicago Collection—Cocoon
            4.3.7.4. Chicago Collection—Villejuif
            4.3.7.5. APA artifact—Cocoon
            4.3.7.6. APA artifact—Villejuif
            4.3.7.7. Chicago artifact—Cocoon
            4.3.7.8. Chicago artifact—Villejuif
            4.3.7.9. Pangloss summary
    4.4. SIL Language & Culture Archives
        4.4.1. Technology infrastructure
        4.4.2. Collections structure
        4.4.3. Collections and artifacts reviewed
        4.4.4. DOI import
        4.4.5. Embedded metadata in HTML
        4.4.6. File download
        4.4.7. Complete or sufficient
            4.4.7.1. Collection
            4.4.7.2. Item
    4.5. Endangered Languages Archive
        4.5.1. Technology infrastructure
        4.5.2. Collections structure
        4.5.3. Collections and artifacts reviewed
        4.5.4. DOI import
        4.5.5. Embedded metadata in HTML
        4.5.6. File download
        4.5.7. Complete or sufficient
    4.6. Kaipuleohone
        4.6.1. Technology infrastructure
        4.6.2. Collections structure
        4.6.3. Collections and artifacts reviewed
        4.6.4. DOI import
        4.6.5. Embedded metadata in HTML
        4.6.6. File download
        4.6.7. Complete or sufficient
            4.6.7.1. Collection
            4.6.7.2. Item
    4.7. Review summary
5. Example references
6. Theory and applications
    6.1. Value exchange between archives and language users
    6.2. Value exchange between archives and linguists in academia
    6.3. Archives: maintaining the ability to deliver value
        6.3.1. Arrangement
        6.3.2. Description
            6.3.2.1. Where have all the collections gone?
            6.3.2.2. The item type identification crisis
            6.3.2.3. Identifying the item for referencing
7. Forward movement
References

LIST OF FIGURES

Figure
1. The linking cycle between scholarly works and archives
2. Zotero’s ISBN and DOI lookup tool
3. Zotero’s browser plugin demonstrating the detection of multiple input sources
4. Hodges & McClurkin (2011:3)’s typical archival structures
5. Basic FRBR model
6. Top portion of the Roger Blench 5 collection display in PARADISEC
7. Full view of the Kamuku wordlist, an item in the RB5 collection at PARADISEC
8. Top portion of the Kamuku wordlist, an item in the RB5 collection at PARADISEC
9. DataCite API supplied JSON response for the RB5 Collection
10. Zotero’s interpretation of DataCite API supplied JSON for the RB5 collection
11. DataCite API supplied JSON response for the Kamuku Wordlist
12. Zotero’s interpretation of DataCite API supplied JSON for the Kamuku Wordlist
13. HTML header code from the RB5 collection.
14. Zotero record for a collection using Zotero’s Extra field.
15. Pangloss Collection-level page in the Cocoon interface
16. Top of the AuCo Collection-level page in the Cocoon interface
17. Top of the Vietnamese (Hanoï dialect) Collection-level page in the Villejuif interface
18. Item-level page in the Cocoon interface
19. Item-level page in the Villejuif interface
20. JSON response for the Cocoon collection AuCo from the DataCite API
21. Zotero’s interpretation of DataCite API supplied JSON for the Cocoon collection
22. JSON response for the Cocoon artifact from the DataCite API