Unsupervised Lexical Semantic Frame Induction: a Semeval 2019 Task Proposal
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Focusing on Lexis in English Classrooms in Japan: Analyses of Textbook Exercises and Proposals for Consciousness-Raising Activities
Focusing on Lexis in English Classrooms in Japan: Analyses of Textbook Exercises and Proposals for Consciousness-raising Activities by Michiko Kasuya A dissertation submitted to the Faculty of Arts of the University of Birmingham in part fulfilment of the requirements for the degree of Master of Arts in Teaching English as a Foreign or Second Language (TEFL/TESL) This dissertation consists of approximately 15,000 words Supervisor: Dr Susan Hunston Centre for English Language Studies School of English University of Birmingham Edgbaston Birmingham B15 2TT England March 2000 ABSTRACT Recently the importance of focusing on lexis has been widely recognised in language acquisition theories. In particular, it is considered indispensable to deal with fixed expressions, lexical collocations and patterns, in consciousness-raising ways. This dissertation has attempted to reveal the problems of activities in the current English classrooms in Japan, especially regarding lexis teaching, and to propose activities that could develop learners’ competence to use the language. By analysing exercises from authorised textbooks, it has become obvious that English teaching in Japan has two problems. Firstly, the activities need to be more carefully constructed as to what knowledge they aim to develop in learners. There is too much emphasis on features of single words and not enough focus on lexical collocations. Secondly, the ways the activities are conducted need improvement. They merely require learners to memorise and manipulate the lexical items, and do not encourage learners to examine them. This dissertation proposes several activities, such as creating a learner’s concordance, comparing English collocations with Japanese collocations, and connecting patterns and meanings using reference materials. -
Lectures on English Lexicology
МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка Казань 2010 МИНИСТЕРСТВО ОБРАЗОВАНИЯ И НАУКИ РОССИЙСКОЙ ФЕДЕРАЦИИ ГОУ ВПО «Татарский государственный гуманитарно-педагогический университет» LECTURES ON ENGLISH LEXICOLOGY Курс лекций по лексикологии английского языка для студентов факультетов иностранных языков Казань 2010 ББК УДК Л Печатается по решению Методического совета факультета иностранных языков Татарского государственного гуманитарно-педагогического университета в качестве учебного пособия Л Lectures on English Lexicology. Курс лекций по лексикологии английского языка. Учебное пособие для студентов иностранных языков. – Казань: ТГГПУ, 2010 - 92 с. Составитель: к.филол.н., доцент Давлетбаева Д.Н. Научный редактор: д.филол.н., профессор Садыкова А.Г. Рецензенты: д.филол.н., профессор Арсентьева Е.Ф. (КГУ) к.филол.н., доцент Мухаметдинова Р.Г. (ТГГПУ) © Давлетбаева Д.Н. © Татарский государственный гуманитарно-педагогический университет INTRODUCTION The book is intended for English language students at Pedagogical Universities taking the course of English lexicology and fully meets the requirements of the programme in the subject. It may also be of interest to all readers, whose command of English is sufficient to enable them to read texts of average difficulty and who would like to gain some information about the vocabulary resources of Modern English (for example, about synonyms -
Semantic Role Labeling: an Introduction to the Special Issue
Semantic Role Labeling: An Introduction to the Special Issue Llu´ıs Marquez` ∗ Universitat Politecnica` de Catalunya Xavier Carreras∗∗ Massachusetts Institute of Technology Kenneth C. Litkowski† CL Research Suzanne Stevenson‡ University of Toronto Semantic role labeling, the computational identification and labeling of arguments in text, has become a leading task in computational linguistics today. Although the issues for this task have been studied for decades, the availability of large resources and the development of statistical machine learning methods have heightened the amount of effort in this field. This special issue presents selected and representative work in the field. This overview describes linguistic background of the problem, the movement from linguistic theories to computational practice, the major resources that are being used, an overview of steps taken in computational systems, and a description of the key issues and results in semantic role labeling (as revealed in several international evaluations). We assess weaknesses in semantic role labeling and identify important challenges facing the field. Overall, the opportunities and the potential for useful further research in semantic role labeling are considerable. 1. Introduction The sentence-level semantic analysis of text is concerned with the characterization of events, such as determining “who” did “what” to “whom,” “where,” “when,” and “how.” The predicate of a clause (typically a verb) establishes “what” took place, and other sentence constituents express the participants in the event (such as “who” and “where”), as well as further event properties (such as “when” and “how”). The primary task of semantic role labeling (SRL) is to indicate exactly what semantic relations hold among a predicate and its associated participants and properties, with these relations Departament de Llenguatges i Sistemes Informatics,` Universitat Politecnica` de Catalunya, Jordi Girona ∗ Salgado 1–3, 08034 Barcelona, Spain. -
Ten Years of Babelnet: a Survey
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21) Survey Track Ten Years of BabelNet: A Survey Roberto Navigli1 , Michele Bevilacqua1 , Simone Conia1 , Dario Montagnini2 and Francesco Cecconi2 1Sapienza NLP Group, Sapienza University of Rome, Italy 2Babelscape, Italy froberto.navigli, michele.bevilacqua, [email protected] fmontagnini, [email protected] Abstract to integrate symbolic knowledge into neural architectures [d’Avila Garcez and Lamb, 2020]. The rationale is that the The intelligent manipulation of symbolic knowl- use of, and linkage to, symbolic knowledge can not only en- edge has been a long-sought goal of AI. How- able interpretable, explainable and accountable AI systems, ever, when it comes to Natural Language Process- but it can also increase the degree of generalization to rare ing (NLP), symbols have to be mapped to words patterns (e.g., infrequent meanings) and promote better use and phrases, which are not only ambiguous but also of information which is not explicit in the text. language-specific: multilinguality is indeed a de- Symbolic knowledge requires that the link between form sirable property for NLP systems, and one which and meaning be made explicit, connecting strings to repre- enables the generalization of tasks where multiple sentations of concepts, entities and thoughts. Historical re- languages need to be dealt with, without translat- sources such as WordNet [Miller, 1995] are important en- ing text. In this paper we survey BabelNet, a pop- deavors which systematize symbolic knowledge about the ular wide-coverage lexical-semantic knowledge re- words of a language, i.e., lexicographic knowledge, not only source obtained by merging heterogeneous sources in a machine-readable format, but also in structured form, into a unified semantic network that helps to scale thanks to the organization of concepts into a semantic net- tasks and applications to hundreds of languages. -
ACNLP at Semeval-2020 Task 6: a Supervised Approach for Definition
ACNLP at SemEval-2020 Task 6: A Supervised Approach for Definition Extraction Fabien Caspani Pirashanth Ratnamogan Mathis Linger Mhamed Hajaiej BNP Paribas, France ffabien.caspani, pirashanth.ratnamogan, mathis.linger, [email protected] Abstract We describe our contribution to two of the subtasks of SemEval 2020 Task 6, DeftEval: Extracting term-definition pairs in free text. The system for Subtask 1: Sentence Classification is based on a transformer architecture where we use transfer learning to fine-tune a pretrained model on the downstream task, and the one for Subtask 3: Relation Classification uses a Random Forest classifier with handcrafted dedicated features. Our systems respectively achieve 0.830 and 0.994 F1-scores on the official test set, and we believe that the insights derived from our study are potentially relevant to help advance the research on definition extraction. 1 Introduction SemEval 2020 Task 6 (Spala et al., 2020) is a definition extraction problem divided into three different subtasks, in which the objective is to find the term-definition pairs in free text, possibly spanning across sentences boundaries. In Subtask 1 the system classifies a sentence as containing a definition or not. Subtask 2 consists of labeling each token with their corresponding BIO tags. Finally, in Subtask 3 the system pairs tagged entities, and labels the relation existing between them. The Definition Extraction from Texts (DEFT) corpus (Spala et al., 2019) is the dataset used for this task, and consists of human-annotated data from textbooks on a variety of subjects. Definition extraction methods in the research literature fall into one of the following categories. -
Semeval 2020 Task 1: Unsupervised Lexical Semantic Change Detection
SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection November 11, 2020 Dominik Schlechtweg,| Barbara McGillivray,};~ Simon Hengchen,♠ Haim Dubossarsky,~ Nina Tahmasebi♠ [email protected] |University of Stuttgart, }The Alan Turing Institute, ~University of Cambridge ♠University of Gothenburg 1 Introduction I evaluation is currently our most pressing problem in LSC detection 1 I SemEval is competition-style semantic evaluation series I SemEval 2020 Task 1 on Unsupervised Lexical Semantic Change Detection (Schlechtweg, McGillivray, Hengchen, Dubossarsky, & Tahmasebi, 2020)2 I datasets for 4 languages with 100,000 human judgments I 2 subtasks I 33 teams submitted 186 systems 1https://semeval.github.io/ 2 https://languagechange.org/semeval/ 2 Tasks I comparison of two time periods t1 and t2 (i) reduces the number of time periods for which data has to be annotated (ii) reduces the task complexity I two tasks: I Subtask 1 { Binary classification: for a set of target words, decide which words lost or gained senses between t1 and t2, and which ones did not. I Subtask 2 { Ranking: rank a set of target words according to their degree of LSC between t1 and t2. I defined on word sense frequency distributions 3 Sense Frequency Distributions (SFDs) Figure 1: An example of a sense frequency distribution for the word cell in two time periods. 4 Corpora t1 t2 English CCOHA 1810-1860 CCOHA 1960-2010 German DTA 1800-1899 BZ+ND 1946-1990 Latin LatinISE -200{0 LatinISE 0{2000 Swedish Kubhist 1790-1830 Kubhist 1895-1903 Table 1: Time-defined subcorpora for each language. 5 Annotation 1. -
Trends in Second/Foreign Language Teaching and Learning: the Position Assigned to the Learning of Lexis Over the Years
Advances in Language and Literary Studies ISSN: 2203-4714 Vol. 7 No. 6; December 2016 Australian International Academic Centre, Australia Flourishing Creativity & Literacy Trends in Second/Foreign Language Teaching and Learning: The Position Assigned to the Learning of Lexis over the Years Debbita Tan School of Languages, Literacies and Translation, Universiti Sains Malaysia, 11800 Penang, Malaysia E-mail: [email protected] Doi:10.7575/aiac.alls.v.7n.6p.84 Received: 08/08/2016 URL: http://dx.doi.org/10.7575/aiac.alls.v.7n.6p.84 Accepted: 15/10/2016 Abstract There is no doubt that second/foreign language teaching and learning has evolved over the years, distinguishing varied trends in the course of its history. Interestingly, vocabulary has been the dominant focus in the last decades despite it being an undervalued and often overlooked component in the earlier stages. This reorientation is reflected not only in terms of classroom practices and education policies, but also in second/foreign language acquisition (SLA) research. The following seeks to provide an understanding of the position assigned to lexis within the scope of different methods implemented throughout the years, thus allowing us to: 1) trace the evolvement of the significance of vocabulary in SLA, and 2) appreciate the importance of lexical development in SLA. In congruity with these elements, current trends and practices related to second/foreign language vocabulary knowledge development are also discussed. This paper is of significance to scholars, researchers, language educators and curriculum designers. Keywords: SLA, lexis, lexical development, vocabulary learning, historical trends, current approaches 1. Introduction In recent decades, we have witnessed a general consensus among scholars, researchers and teachers that lexical development plays an essential role in mastering a second or foreign language. -
The Art of Lexicography - Niladri Sekhar Dash
LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash THE ART OF LEXICOGRAPHY Niladri Sekhar Dash Linguistic Research Unit, Indian Statistical Institute, Kolkata, India Keywords: Lexicology, linguistics, grammar, encyclopedia, normative, reference, history, etymology, learner’s dictionary, electronic dictionary, planning, data collection, lexical extraction, lexical item, lexical selection, typology, headword, spelling, pronunciation, etymology, morphology, meaning, illustration, example, citation Contents 1. Introduction 2. Definition 3. The History of Lexicography 4. Lexicography and Allied Fields 4.1. Lexicology and Lexicography 4.2. Linguistics and Lexicography 4.3. Grammar and Lexicography 4.4. Encyclopedia and lexicography 5. Typological Classification of Dictionary 5.1. General Dictionary 5.2. Normative Dictionary 5.3. Referential or Descriptive Dictionary 5.4. Historical Dictionary 5.5. Etymological Dictionary 5.6. Dictionary of Loanwords 5.7. Encyclopedic Dictionary 5.8. Learner's Dictionary 5.9. Monolingual Dictionary 5.10. Special Dictionaries 6. Electronic Dictionary 7. Tasks for Dictionary Making 7.1. Panning 7.2. Data Collection 7.3. Extraction of lexical items 7.4. SelectionUNESCO of Lexical Items – EOLSS 7.5. Mode of Lexical Selection 8. Dictionary Making: General Dictionary 8.1. HeadwordsSAMPLE CHAPTERS 8.2. Spelling 8.3. Pronunciation 8.4. Etymology 8.5. Morphology and Grammar 8.6. Meaning 8.7. Illustrative Examples and Citations 9. Conclusion Acknowledgements ©Encyclopedia of Life Support Systems (EOLSS) LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash Glossary Bibliography Biographical Sketch Summary The art of dictionary making is as old as the field of linguistics. People started to cultivate this field from the very early age of our civilization, probably seven to eight hundred years before the Christian era. -
Fuzzy Simulation of Historical Associative Thesaurus
Advances in Science, Technology and Engineering Systems Journal Vol. 4, No. 5, 224-233 (2019) ASTESJ www.astesj.com ISSN: 2415-6698 Special Issue on Advancement in Engineering and Computer Science Fuzzy Simulation of Historical Associative Thesaurus Olena Basalkevych*,1, Olexandr Basalkevych2 1Danylo Halytsky Lviv National Medical University, Department of Medical Informatics, 79005, Ukraine 2Product Engineering and Development Globallogic Inc., 79016, Ukraine A R T I C L E I N F O A B S T R A C T Article history: Research is dedicated to fuzzy reconstructions in the diachronic associative linguistics, and Received: 03 June, 2019 encompasses several simulation practices. The principles of an associative memory model Accepted: 13 September, 2019 and fuzzy computing with words are adopted in the working methodology. The evolved Online: 08 October, 2019 mathematical model reanimates the older patterns of Scottish associative map by means of fuzzy logic. The compiled fuzzy associative fields are connected by epidigmatic relations Keywords: represented by fuzzy associative word structures in a fuzzy associative map constituting the fuzzy associative field grounds of the proposed thesaurus. The simulation of the historical associative thesaurus stimulated fuzzy associative field enables extended associative scrutiny: tracing associative antipodes, reconstructing an fuzzy associative word structure associative experiment for any map associate, finding words’ similarities and distances, as fuzzy associative map well as associative differentiating and stratifying the Older Scottish lexis. The special fuzzy historical associative interest represents the fuzzy associative differential, an electronic tool for estimating the thesaurus lexis through its stable syntagmatic associates. All the simulation processes are fuzzy associative antipodes initiated and quantified. -
Abstract : Grammar , Lexis and Context
1 GRAMMAR , LEXIS AND CONTEXT Rob Batstone University of London , Institute of Education Thesis submitted for degree of PhD ABSTRACT : GRAMMAR , LEXIS AND CONTEXT Language teaching has been strongly influenced over recent years by talk of notions and functions , most notably through Wilkins' (1976) work on Notional Syllabuses . Yet the notional/functional syllabus has been criticized for failing to capture anything more than a superficial correspondence between form and meaning . In this thesis I argue for a framework in which a deeper congruence between form and meaning is developed . I identify regularities in the lexico-syntactic structure of English which express recognizable notional relationships , which in turn reflect deeper conceptualizations of relations between events and participants . These conceptualizations are represented on a semantic continuum of 'contextual distance' . By reference to this continuum , I argue that we can identify a clear congruence between increasing conceptual complexity and increasing lexico- syntactic complexity . This account gives considerable prominence to the role of lexis , and to the interdependence between grammar , lexis and context in the signalling of meaning , something which has not always been adequately considered within linguistics or within applied linguistics . I then consider a possible application of these ideas to pedagogy . In many 'product' approaches to syllabus design and methodology , learners work with language forms whose meanings are to an extent already fixed , with grammar subsuming lexis and with cotext and context already clearly related by the materials designer . In such approaches the interdependence between grammar , lexis and context is sometimes lost sight of , and I argue for a revised approach in which this interdependence is made central . -
Semeval-2013 Task 4: Free Paraphrases of Noun Compounds
SemEval-2013 Task 4: Free Paraphrases of Noun Compounds Iris Hendrickx Zornitsa Kozareva Radboud University Nijmegen & University of Southern California Universidade de Lisboa [email protected] [email protected] Preslav Nakov Diarmuid OS´ eaghdha´ QCRI, Qatar Foundation University of Cambridge [email protected] [email protected] Stan Szpakowicz Tony Veale University of Ottawa & University College Dublin Polish Academy of Sciences [email protected] [email protected] Abstract The frequency spectrum of compound types fol- lows a Zipfian distribution (OS´ eaghdha,´ 2008), so In this paper, we describe SemEval-2013 Task many NC tokens belong to a “long tail” of low- 4: the definition, the data, the evaluation and frequency types. More than half of the two-noun the results. The task is to capture some of the types in the BNC occur exactly once (Kim and Bald- meaning of English noun compounds via para- phrasing. Given a two-word noun compound, win, 2006). Their high frequency and high produc- the participating system is asked to produce tivity make robust NC interpretation an important an explicitly ranked list of its free-form para- goal for broad-coverage semantic processing of En- phrases. The list is automatically compared glish texts. Systems which ignore NCs may give up and evaluated against a similarly ranked list on salient information about the semantic relation- of paraphrases proposed by human annota- ships implicit in a text. Compositional interpretation tors, recruited and managed through Ama- is also the only way to achieve broad NC coverage, zon’s Mechanical Turk. The comparison of because it is not feasible to list in a lexicon all com- raw paraphrases is sensitive to syntactic and morphological variation. -
Detecting Semantic Difference: a New Model Based on Knowledge and Collocational Association
DETECTING SEMANTIC DIFFERENCE: A NEW MODEL BASED ON KNOWLEDGE AND COLLOCATIONAL ASSOCIATION Shiva Taslimipoor1, Gloria Corpas2, and Omid Rohanian1 1Research Group in Computational Linguistics, University of Wolverhampton, UK 2Research Group in Computational Linguistics, University of Wolverhampton, UK and University of Malaga, Spain {shiva.taslimi, omid.rohanian}@wlv.ac.uk [email protected] Abstract Semantic discrimination among concepts is a daily exercise for humans when using natural languages. For example, given the words, airplane and car, the word flying can easily be thought and used as an attribute to differentiate them. In this study, we propose a novel automatic approach to detect whether an attribute word represents the difference between two given words. We exploit a combination of knowledge-based and co- occurrence features (collocations) to capture the semantic difference between two words in relation to an attribute. The features are scores that are defined for each pair of words and an attribute, based on association measures, n-gram counts, word similarity, and Concept-Net relations. Based on these features we designed a system that run several experiments on a SemEval-2018 dataset. The experimental results indicate that the proposed model performs better, or at least comparable with, other systems evaluated on the same data for this task. Keywords: semantic difference · collocation · association measures · n-gram counts · word2vec · Concept-Net relations · semantic modelling. 1. INTRODUCTION Semantic modelling in natural language processing requires attending to both semantic similarity and difference. While similarity is well-researched in the community (Mihalcea & Hassan, 2017), the ability of systems in discriminating between words is an under-explored area (Krebs et al., 2018).