Tugboat, Volume 35 (2014), No. 3 309 a Citation Style Language

Total Page:16

File Type:pdf, Size:1020Kb

Tugboat, Volume 35 (2014), No. 3 309 a Citation Style Language TUGboat, Volume 35 (2014), No. 3 309 A Citation Style Language (CSL) workshop Attribution-ShareAlike 3.0 Unported license by Cre- ative Commons.5 For the search of specific citation Daniel Stender styles the online CSL style editor6 is quite useful Abstract because it provides, in addition to other features, a search by example. CSL is a free and open XML-based language for the programming of citation styles. With these styles, 3 Pandoc-citeproc bibliographical references can be printed out in dif- Among the several current applications which al- ferent ways from several database formats, including ready know how to use CSL styles for the automatic T X. The so-far over 7000 CSL styles which are Bib E generation of references, there is the popular univer- currently available can be used with several popular sal markup converter Pandoc [1].7 With short cita- applications like Zotero, Mendeley, or Pandoc. This tion keys like @doe2014 [p. 40-42] for its extended article is an introduction into the programming of Markdown lightweight markup, Pandoc can query citation styles with CSL, based on a few example bibliographical data files and recognize CSL styles to T X bibliographic database records. Bib E put out variously formatted references for documents 1 Introduction either in HTML,LATEX or ConTEXt markup, along with several others [3]. Pandoc is a command line Bibliographical references, as used in scientific publi- interface application; thus, the CSL style which is cations, are pointers to cited or regarded literature. going to be processed and the bibliographical data- Regularly, they consist of two standardized compo- base(s) are given as arguments in the program call. nents: an in-line citation (the \cite") refers to an Here's an example (some line breaks are editorial) for entry in the publication's bibliography. Despite the Pandoc's LAT X output of a random BibT X data- common concepts, there is no uniform outline for E E base record (see below for details), formatted using references; rather, each scientific discipline and ev- chicago-author-date.csl:8 ery publishing house has its own traditional set of conventions, which also might change between series. $ echo "On this, see @reference2 [p. 127]." \ In electronic typesetting, bibliographical infor- | pandoc --to=latex \ --csl=chicago-author-date.csl \ mation is often gathered in comprehensive, reusable --bibliography=references.bib data files. CSL1 is a programming language for ci- tation styles, with which differently formatted refer- On this, see Flom (2007, 127). ences can be generated from the same bibliographic databases. CSL (current version: 1.0.1) is XML-based, Flom, Peter. 2007. ``LaTeX for Academics and open and free, and was substantially developed for Researchers Who (Think They) Don't Need It.'' the all-around reference manager Zotero.2 \emph{TUGboat} 28 (1): 126--128. This article demonstrates how a rudimentary ex- As shown in this example, the bibliography is printed ample citation style could be implemented with CSL, at the end of a document. Incidentally, in the input with reference to the BibTEX data format. The usage (shown later), the title is given in lowercase; the of several programs refers to a Debian GNU/Linux titlecasing done here is automatic, a feature of this based system (like Ubuntu and Linux Mint), but style [8, chp. 14]. CSL styles could also be easily developed on other Processors which produce formatted citations operating systems. Some basic knowledge of XML out of bibliographic databases according to CSL styles and the BibTEX data format is definitely needed to are called CiteProcs.9 CiteProcs are being developed follow every detail. in several programming languages. The one which is used normally by Pandoc is pandoc-citeproc,10 writ- 2 CSL styles ten in the same functional programming language The citation styles which have already been imple- Haskell as Pandoc itself, and developed closely to- mented in CSL (file extension: .csl) are collected gether with it. This CiteProc (currently: 0.3.0.1) by the CSL developers in the official style reposi- already deals with a number of different database tory,3 and in the Zotero style repository.4 The so- far more than 7000 styles are distributed under the 5 http://creativecommons.org/licenses/by-sa/3.0/ 6 http://editor.citationstyles.org/about/ 1 http://citationstyles.org/ 7 http://johnmacfarlane.net/pandoc/ 2 http://www.zotero.org/ 8 Although given explicitly here, this CSL style is the 3 http://github.com/citation-style-language/ default in Pandoc. styles/ 9 http://en.wikipedia.org/wiki/CiteProc/ 4 http://zotero.org/styles/ 10 http://github.com/jgm/pandoc-citeproc/ A Citation Style Language (CSL) workshop 310 TUGboat, Volume 35 (2014), No. 3 formats, but it's said that it works best with BibTEX by version, while the class attribute determines that resp. BibLATEX [4] databases so far.11 this style provides cites in the running text by default, rather than as footnotes or end notes (which would 4 Developing CSL styles be \note"). CSL styles are XML, and therefore the whole related 6 Info block XML tool chain can also be used with them. For somebody who deals with XML regularly, a special- The next mandatory unit of a CSL style is an hinfoi ized editor is useful, but fundamentally CSL styles block, which provides metadata for labeling and iden- can be created and modified with any text editor. tification. A typical info block looks like this: CSL is described in detail in the specification [10], <info> and the primer which has been written by the devel- <title>An example CSL style</title> opers [9] is a good starting point for beginners. <id>http://www.danielstender.com/csldemo</id> CSL is standardized as an XML grammar in the <updated>2014-09-18T23:53:00+02:00</updated> schema language RELAX NG,12 and in principle any </info> CSL style file can checked with an XML validator Even if it is not intended to publish the style, there against the CSL schema to determine if it is correct are at least three mandatory child elements that are (valid).13 Unfortunately, some validators, for exam- meant for this purpose: ple xmllint (Debian package: libxml2-utils), cannot htitlei is the title of the CSL style as it is going to cope with XML schemes in RELAX NG compact syn- be displayed to users, tax like the one shipped by the CSL developers (file hidi contains, like xmlns in the CSL header (see extension: .rnc); the scheme has to be converted above), a random URI, which may be real or (e.g. with Trang) into the regular RELAX NG syn- fictitious, and which is solely for identification tax (file extension: .rng) before a CSL style can be purposes, validated against it: hupdatedi carries a xsd:dateTime compliant time $ git clone https://github.com/\ stamp15 of the last modification. citation-style-language/schema.git Cloning into 'schema' [...] 7 Example BIBTEX records $ trang schema/csl.rnc schema/csl.rng $ xmllint --noout -relaxng schema/csl.rng \ Before getting any deeper into CSL style program- chicago-author-date.csl ming, here are a few sample BibTEX records to be chicago-author-date.csl validates referred to hereafter to demonstrate how CSL works. A typical @Book entry type [5, chp. 13.2] goes like 5 XML declaration and CSL header this The standard XML declaration commonly starts a @Book{reference1, CSL style file: author = {Kopka, Helmut and Daly, Patrick W.}, title = {A Guide to LaTeX and Electronic <?xml version="1.0" encoding="UTF-8"?> Publishing}, Incidentally, although it is often suggested to be publisher = {Addison-Wesley}, included, specifying the UTF-8 encoding like this year = 2004, could be omitted because UTF-8 is the XML default address = {Boston}, [2, p. 28]. edition = {Fourth}} The next thing which is needed is a well-formed The next one is an @Article data set of a (well- CSL header to specify that the XML file is a CSL known) journal whose issues are counted as volume style. A standard CSL header goes like this: numbers: <style xmlns="http://purl.org/net/xbiblio/csl" @Article{reference2, version="1.0" class="in-text"> author = {Flom, Peter}, This defines hstylei to be the root element, and the title = {LaTeX for academics and researchers CSL namespace is set as the default with xmlns.14 who (think they) don't need it}, That the style is compatible with CSL 1.0 is stated journal = {TUGboat}, year = 2007, 11 cf. Readme.md. volume = 28, 12 http://relaxng.org/ number = 1, 13 http://github.com/citation-style-language/ pages = {126-128}} schema/tree/v1.0.1/ 14 At no time will a connection to this url be established; 15 http://books.xmlschemata.org/relaxng/ch19-77049. it's only for the purpose of identification. html Daniel Stender TUGboat, Volume 35 (2014), No. 3 311 And another @Article data set of a journal which ence out of the citation key in the document source. appears monthly and doesn't have any issue numbers: The cite is described within the element hcitationi, @Article{reference3, looking, for example, like this: author = {Sharma, Tushar}, <citation> title = {Why I never close Emacs}, <layout> journal = {Open Source For You}, <text macro="surname" suffix=" "/> year = 2014, <group prefix="(" suffix=")"> pages = {53-55}, <text macro="year"/> month = {jan}} <text variable="locator" prefix=":\,"/> The main body of a CSL style consists of several </group> instructions for how exactly the CiteProc is going to </layout> typeset citations, thus the cites and the correspond- </citation> ing references in the bibliography of a document, Along with the hlayouti element, hcitationi has an- from standardized data records like these.
Recommended publications
  • Reference Management Software Zotero
    Reference Management Software Zotero Emptily complanate, Allan howff lodes and displeasures palaeopathology. Outlined Dennie sometimes readmit any pepsines boozed viscerally. Antiscriptural Gerold vitalizing otherwhere. To be running smoothly and medical students via the software zotero reference management products The Library recommends Zotero as general best reference manager for students Free Works on Mac Windows Linux Microsoft Word OpenOffice Library. Tips for using bibliographic citation management tools including RefWorks EndNote Zotero and Mendeley. What does not hesitate to assist you to start to zotero software packages into it difficult, organize this interface to restore your computer with a standard. Compare Citation Management Software Citation Library. Zotero is a vocation open-source citation management and sophisticated tool Researchers and scholars from a variety use different fields use Zotero to. An office or zotero reference information. ZoteroBib is a free workshop that helps you quickly manage a bibliography in any citation style Read More HelpZotero ZoteroBib Cite Manual Entry. Zotero is a pretty easy-to-use then to boss you collect organize cite and legitimate research. Reference management Cochrane Airways. Storage Zotero. This web-based citation manager program is hinder to UA faculty current and students. Citation Management Software Overview Citing sources. Citation management software helps you organize references PDFs and images take notes on references and PDFs format bibliographies. Zotero is large easy mode use with which allows one they collect organise and cite references Apart from each it. In asylum the Cornell library among several Zotero guides Getting Started with Zotero. After they explicitly take effect since then, but all the article in that can create inddpendent bibliography.
    [Show full text]
  • Reference Management Software There Are Several Different Versions of Reference Management Software Available
    The Academy for Teaching and Learning Excellence (ATLE) “Office hours for faculty.” (813) 974-1841 | atle.usf.edu| [email protected] Reference Management Software There are several different versions of reference management software available. One of the best uses of such software is that it can house all your references for any research endeavor. There are many different variations on reference management software, but the USF library has guides for Endnote and RefWorKs. These guides can be found at: • Endnote: http://guides.lib.usf.edu/content.php?pid=136566&sid=1169836 • RefWorKs: http://guides.lib.usf.edu/refworks2010 Quick Reference Guides for Endnote: • Endnote x6 for Windows: http://endnote.com/training/mats/media/pdf/enx6-win_qrc.pdf • Endnote x6 for MAC: http://endnote.com/training/mats/media/pdf/enx6-mac_qrc.pdf • Endnote Web: http://thomsonreuters.com/content/science/pdf/ssr/training/enw_qrc_en.pdf Quick Reference Guides for RefWorks: • http://www.refworks.com/refworks2/help/Welcome.htm • http://www.refworks.com/content/documents/refworks_quick_start_guide.pdf • https://www.refworks.com/refworks2/help/RefWorks_-QSG-English-Jun11.pdf Quick Reference Guides for Zotero: • http://www.zotero.org/support/quicK_start_guide • http://www.zotero.org/static/download/zotero_user_guide.pdf • http://www.stanford.edu/group/cubberley/node/1361 Quick Reference Guides for Mendeley: • http://nuffieldcollegelibrary.files.wordpress.com/2013/01/mendeleyguide_online.pdf • http://lgdata.s3-website-us-east-1.amazonaws.com/docs/130/381899/Mendeley101_handout.pdf • http://libguides.wustl.edu/content.php?pid=221677&sid=1840456 YouTube Channels: Watch video tutorials on reference management software. These videos introduce the software, explain its uses, and provide step-by-step instructions.
    [Show full text]
  • Jupyter Notebooks—A Publishing Format for Reproducible Computational Workflows
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elpub digital library Jupyter Notebooks—a publishing format for reproducible computational workflows Thomas KLUYVERa,1, Benjamin RAGAN-KELLEYb,1, Fernando PÉREZc, Brian GRANGERd, Matthias BUSSONNIERc, Jonathan FREDERICd, Kyle KELLEYe, Jessica HAMRICKc, Jason GROUTf, Sylvain CORLAYf, Paul IVANOVg, Damián h i d j AVILA , Safia ABDALLA , Carol WILLING and Jupyter Development Team a University of Southampton, UK b Simula Research Lab, Norway c University of California, Berkeley, USA d California Polytechnic State University, San Luis Obispo, USA e Rackspace f Bloomberg LP g Disqus h Continuum Analytics i Project Jupyter j Worldwide Abstract. It is increasingly necessary for researchers in all fields to write computer code, and in order to reproduce research results, it is important that this code is published. We present Jupyter notebooks, a document format for publishing code, results and explanations in a form that is both readable and executable. We discuss various tools and use cases for notebook documents. Keywords. Notebook, reproducibility, research code 1. Introduction Researchers today across all academic disciplines often need to write computer code in order to collect and process data, carry out statistical tests, run simulations or draw figures. The widely applicable libraries and tools for this are often developed as open source projects (such as NumPy, Julia, or FEniCS), but the specific code researchers write for a particular piece of work is often left unpublished, hindering reproducibility. Some authors may describe computational methods in prose, as part of a general description of research methods.
    [Show full text]
  • Studies and Analysis of Reference Management Software: a Literature Review
    Studies and analysis of reference management software: a literature review Jesús Tramullas Ana Sánchez-Casabón {jesus,asanchez}@unizar.es Dept .of Library & Information Science, University of Zaragoza Piedad Garrido-Picazo [email protected] Dept. of Computer and Software Engineering, University of Zaragoza Abstract: Reference management software is a well-known tool for scientific research work. Since the 1980s, it has been the subject of reviews and evaluations in library and information science literature. This paper presents a systematic review of published studies that evaluate reference management software with a comparative approach. The objective is to identify the types, models, and evaluation criteria that authors have adopted, in order to determine whether the methods used provide adequate methodological rigor and useful contributions to the field of study. Keywords: reference management software, evaluation methods, bibliography. 1. Introduction and background Reference management software has been a useful tool for researchers since the 1980s. In those early years, tools were made ad-hoc, and some were based on the dBase II/III database management system (Bertrand and Bader, 1980; Kunin, 1985). In a short period of time a market was created and commercial products were developed to provide support to this type of information resources. The need of researchers to systematize scientific literature in both group and personal contexts, and to integrate mechanisms into scientific production environments in order to facilitate and expedite the process of writing and publishing research results, requires that these types of applications receive almost constant attention in specialized library and information science literature. The result of this interest is reflected, in bibliographical terms, in the publication of numerous articles almost exclusively devoted to describing, analyzing, and comparing the characteristics of several reference management software products (Norman, 2010).
    [Show full text]
  • Csl Citation Schema Mendeley
    Csl Citation Schema Mendeley When Padraig copulating his proletariats refunds not archly enough, is Dunstan heteropolar? Is Chan intolerable or well-derived after jingling Uriah tranced so aliunde? Invocatory Reynard puckers angelically or latinizes dualistically when Ashby is amphoteric. Oxford university departments. Should be included an abstraction layer that source title but can be centered at specific agrotourism development of community can have a content negotiation is. For citing apa? Be that as car may, both protocols require further improvement. I would make nice if PB inherently provided fully formed citation metadata as bank is. Click on schema validation of csl citation schema mendeley users would benefit role on. If you have their contributions, csl version of requests from the screen, csl citation principles that a population. I think Mendeley these need very widely used systems and testimony they. Using schema would also be atherosclerosis study proposes that contribute directly within an opportunity to csl citation schema mendeley uses a diversified heritage destination which exists in management software. Progress in trait of several Specific Aims will surely lay a porch for a productive and exciting research career job I am adamant to pursue. Neither of them above meta data a correct for which book. We stoped for a little if you must however not supported formats for duplicates, mendeley citation schema. To schema in csl citation schema for writing a scholarly and. Mendeley cite your choice of any citation format html, we compiled a complex set, like elsevier journal that use bibcite is already been organized chronologically, dose increase business.
    [Show full text]
  • “Merging Libraries” & “Export Your Library with RIS”
    “Merging libraries” & “Export your library with RIS” Export library with RIS To convert your EndNote X6 library for other applications (like Zotero, Mendeley, Reference Manager or ProCite etc.): - In EndNote X6, go to the drop-down output style list on the toolbar, and choose Select Another Style. Highlight and select the appropriate output style: - Select the RefMan (RIS) style. - From the File menu, select Export to export your library to a plain text file. - Now you can import the file into your application To import RIS file from another application into EndNote: 1 Library Wissenschaftspark Albert Einstein A joint library of the German Research Centre for Geosciences GFZ, the Potsdam Institute for Climate Impact Research, the Alfred Wegener Institute for Polar and Marine Research, Research Unit Potsdam, and the IASS Institute for Advanced Sustainability Studies Potsdam (Last update: March 2013) - Got to “File” and “Import” - Choose your file in RIS format - Use the RIS import option. - Set your preferences - Import the references 2 Library Wissenschaftspark Albert Einstein A joint library of the German Research Centre for Geosciences GFZ, the Potsdam Institute for Climate Impact Research, the Alfred Wegener Institute for Polar and Marine Research, Research Unit Potsdam, and the IASS Institute for Advanced Sustainability Studies Potsdam (Last update: March 2013) Merging libraries There are three ways to merge libraries: import one library into another, copy references from one library to another, or drag-and-drop. - (Optional) If you want to import only a subset of references from a library, first open that library and show only the references you wish to copy.
    [Show full text]
  • Tooling up for Plain Text Academic Writing in Markdown
    Tooling up for Plain Text Academic Writing in Markdown I finally got down to doing it! Here’s my brief notes on how I set it up; it may change a bit later on, as my needs evolve, but it should be enough instructions for others to get setup. Motivation I’m writing a white paper on genomic surveillance, so I thought I’d experiment with something new. By switching over to writing in plain text, I can ignore all distractions by other word processors. I’m looking at you, Word, Google Docs and others. Because I get to write in plain text, I also get to version control the text. By selectively committing logical blocks, I can also easily selectively revert unwanted commits; it’s rare, but once in a while it’s saved my work. Because it’s plain text, I can export to anywhere with a bit of configuration. Because it’s written publicly, it’s great. Overview What I do here is basically use Sublime Text 3 (ST3) as a word processor. BibTex libraries are used for keeping track of papers, but I like Papers for organizing more than BibDesk - automatic downloading of PDFs is the best part. My compromise, then, is to export my Papers library to a master BibTex library prior to compiling everything into a single PDF. Papers can also add in Pandoc-style citations, which Pandoc can then parse to make an automatic bibliography. Everything is kept under version control, for reasons stated above. It basically took me one morning’s worth of time invested in tooling this up, but I can already imagine the amount of time saved by minimizing the time spent on formatting references, figure numbers, versioning, and (most of all) being able to write without being distracted.
    [Show full text]
  • Citeproc Js Manual
    Citeproc Js Manual citeproc-hs is a library for rendering bibliographic reference citations into a variety of styles More information can be read in the (citeproc-js) documentation:. Contribute to citeproc-ruby development by creating an account on GitHub. rendering engine only, for more documentation on the whole cite processor, Frank G. Bennett, Jr., and Bruce D'Arcus of CSL and citeproc-js fame for their support! Developer Documentation In this section we'll also discuss the citeproc-js CSL processor that Zotero uses to process the CSL styles, and citeproc-node. RefME provides a JavaScript widget that you can include on your pages to provide We use the industry standard citeproc.js format (see documentation here). citeproc-node-iojs. A working citeproc-js wrapper for node and iojs. To download the styles and languages required for this module to work, run the following:. Enable rmarkdown / pandoc-citeproc citations in Jekyll blog via servr @Manual(Yihui-2015, title = (knitr: A General-Purpose Package for Dynamic Report. Citeproc Js Manual Read/Download We migrated our in-depth documentation (including the CSL primer and In addition to Frank Bennett's citeproc-js, used by Zotero, Mendeley, and Qiqqa, there. Frank Bennett's citeproc-js (JavaScript), Ron Jerome's citeproc-php (PHP) and My main contributions have been writing most of the project documentation. The library includes citeproc-js, a CSL processor written in JavaScript. citeproc-js has been created by Frank G. Bennett and is licensed under the Common. Zotero and Juris-M rely on the citeproc-js implementation of CSL maintained by The MLZ implementation of Chicago Manual of Style full note referencing.
    [Show full text]
  • Zotero-Instructions.Pdf
    Zotero 1. Install Zotero works best with Firefox, but Google Chrome and Safari may be used standalone. If you need to download Mozilla go to http://www.mozilla.org/en-US/firefox/new/ and follow the prompts. To install Zotero go to Zotero.org with Mozilla Firefox and click the red “Download Now” button. Click the Zotero 3.0 for Firefox link (for both PC and Mac computers). Click the “Allow” button. Then click the “Install Now” button and restart. This will only take a few mintues. 2. Add-ins Add-ins are needed in Word to transfer references into your assignment. At the Download screen, click - Add a plugin for Word or LibreOffice and select for either Windows or Mac Do a restart and you should now have Add-Ins at the top of your Word document and the Zotero logo in the bottom right corner of your Firefox window. The Add-Ins for Mac also appear in the toolbars, but in Word for Mac 2008 they are in the AppleScript menu to the right of the Help icon: Important: Repeat Steps 1 and 2 to install Zotero on any computer you will be using. Michelle Down, Avondale College of Higher Education Page 1 Zotero instructions (Updated March 2013) 3. Register If you think you will be using Zotero on a number of different computers e.g. your laptop, College computer, home computer etc, you should register yourself. If you will only be using the one computer, and do not plan to share your files with anyone else, then ignore this step! Go to the Zotero home page and again and click Register (top right of the screen) and choose a username and password and fill in the details.
    [Show full text]
  • How to Manage a Research Library with Zotero by Blog Admin July 6, 2012
    blo gs.lse.ac.uk http://blogs.lse.ac.uk/impactofsocialsciences/2012/07/06/manage-a-research-library-with-zotero/ How to manage a research library with Zotero by Blog Admin July 6, 2012 Keeping up to date with research and managing an ever-increasing number of journal articles is skill that must be well-honed by academics. Here, Alex Hope sets out how his workflow has developed using Zotero, Dropbox, Goodreader and his iPad. Article Discovery I use a variety of methods to keep on top of research in my f ield, and to search f or articles when preparing a paper or presentation. Foremost is the use of RSS f eeds to deliver alerts of new articles in journals I f ollow. I use Google Reader to manage my f eeds and usually browse through new f eed articles using Flipboard on my iPad or iPhone. If I come across an interesting article that I think I may like to read and use, I email myself the link. This means that when I check my email every morning, I can navigate to the article and save it in my ref erence manager f or reading and marking up. If I am researching a paper, I tend to search journal repositories such as Web of Knowledge, although more of ten than not I f ind Google Scholar f inds what I need quickly and accurately. Finally I am f inding that Twitter is a f antastic resource f or uncovering new research as an increasing number of researchers and research groups have a presence.
    [Show full text]
  • Work Process for Students
    Instructions Manual Process Guide for Students for Interdisciplinary Work in Computer Science/Informatics Assoc. Prof. Dr. Andreas Holzinger http://hci4all.at [email protected] Institute for Information Systems and Computer Media (IICM) Graz University of Technology Research Unit HCI4MED, Institute of Med. Informatics (IMI) Medical University Graz Data. Information. Knowledge. Decision. Action. This handbook provides some information for students with an interdisciplinary interest in Engineering (Computer Science), Natural Science (Cognitive Science/Psychology) and Business (Software Engineering Business). Human–Computer Interaction (HCI) and Usability Engineering (UE) can be seen as a bridge. © August 2010, Second Edition < This page intentionally left blank > 2 of 128 Anleitung – Handbuch (Diese Arbeit ist in englischer Sprache verfasst) Anleitung für Studierende zur interdisziplinären Arbeit in Informatik/Computerwissenschaft Univ.-Doz. Ing. Mag. Mag. Dr. Andreas Holzinger Institut für Informationssysteme und Computer Medien (IICM) Technische Universität Graz Forschungseinheit HCI4MED, Institut für Med. Informatik (IMI) Medizinische Universität Graz Daten. Information. Wissen. Entscheiden. Handeln. Eine Anleitung für Studierende mit interdisziplinärem Interesse in Ingenieurswissenschaft (Informatik), Naturwissenschaft (Psychologie), und Wirtschaft (Softwareentwicklung/Business). Human–Computer Interaction (HCI) & Usability Engineering (UE) kann als Brücke gesehen werden. © August 2010, Zweite Auflage 3 of 128 < Diese Seite ist
    [Show full text]
  • 1 Billion Citation Dataset and Deep Learning Citation Extraction
    1 Billion Citation Dataset and Deep Learning Citation Extraction Mark Grennan A dissertation submitted to the University of Dublin, Trinity College in partial fulfilment of the requirements for the degree of Master of Science in Computer Science (Future Networked Systems) Supervisor: Joeran Beel August, 2019 Declaration I, the undersigned, declare that this work has not previously been sub- mitted as an exercise for a degree at this, or any other University, and that unless otherwise stated, is my own work. Mark Grennan August 8, 2019 Permission to Lend/Copy I, the undersigned, agree that Trinity College Library may lend or copy this dissertation upon request. Mark Grennan August 8, 2019 Acknowledgements I would like to thank my supervisor Joeran Beel for his attention to de- tail, consideration and thoughtful advice throughout this project. I would also like to thank Martin Schibel for his prior work and for giving generously with his time in the early stages of the project. Finally, for their selfless patience and encouragement I would like to thank Una and my family. Mark Grennan August, 2019 1 Billion Citation Dataset and Deep Learning Citation Extraction Mark Grennan, Master of Science in Computer Science University of Dublin, Trinity College, 2019 Supervisor: Joeran Beel Abstract Citation or reference parsing involves extracting machine readable metadata from a citation string. This paper details the work carried out in creating a large, diverse and labelled citation dataset which can be used to train ML ci- tation parsing tools. The dataset was created by adapting the citation styles within CSL, collecting citation metadata from CrossRef and using the open- source citation processor, citeproc-js.
    [Show full text]