HSK Word List - Level 3 HSK Word List - Level 3

Total Page:16

File Type:pdf, Size:1020Kb

HSK Word List - Level 3 HSK Word List - Level 3 11/5/11 HSK Word List - Level 3 HSK Word List - Level 3 1 阿姨 āyí maternal aunt; step-mother; childcare worker; nursemaid; woman of similar age to one's parents (term of address used by child); CL:個|个[gè] 2 啊 a modal particle ending sentence, showing affirmation, approval, or consent 3 矮 ǎi low; short (in length) 4 ài to love; affection; to be fond of; to like 5 好 àihào to like; to take pleasure in; keen on; fond of; interest; hobby; appetite for; CL:個|个[gè] 6 安静 ānjìng quiet; peaceful; calm 7 八 bā eight; 8 8 把 bǎ to hold; to contain; to grasp; to take hold of; a handle; particle marking the following noun as a direct object; classifier for objects with handle 9 爸爸 bàba (informal) father; CL:個|个[gè],位[wèi] 10 吧 ba (modal particle indicating polite suggestion); ...right?; ...OK? 11 白 bái white; snowy; pure; bright; empty; blank; plain; clear; to make clear; in vain; gratuitous; free of charge; reactionary; anti-communist; funeral; to stare coldly; to write wrong character; to state; to explain; vernacular; spoken lines in opera; surname Bai 12 百 bǎi hundred; numerous; all kinds of; surname Bai 13 班 bān team; class; squad; work shift; classifier for groups; ranking; surname Ban; CL:個|个[gè] 14 搬 bān to move; to shift; to remove; to transport; to apply indiscriminately; to copy mechanically 15 半 bàn half; semi-; incomplete; (after a number) and a half 16 法 bànfǎ means; method; way (of doing sth); CL:條|条[tiáo], 個|个[gè] hewgill.com/hsk/hsk3.html 1/29 11/5/11 HSK Word List - Level 3 17 公室 bàngōngshì an office; business premises; a bureau; CL:間| [jiān] 18 帮忙 bāngmáng to help; to lend a hand; to do a favor; to do a good turn 19 帮助 bāngzhù assistance; aid; to help; to assist 20 包 bāo to cover; to wrap; to hold; to include; to take charge of; package; wrapper; container; bag; to hold or embrace; bundle; packet; to contract (to or for); surname Bao; CL:個|个[gè],隻|只[zhī] 21 bǎo to eat till full; satisfied 22 bàozhǐ newspaper; newsprint; CL:份[fèn],期[qī],張| [zhāng] 23 杯子 bēizi cup; glass; CL:個|个[gè],支[zhī],枝[zhī] 24 北方 běifāng north; the northern part a country; China north of the Yellow River 25 北京 běijīng Beijing, capital of People's Republic of China; Peking; PRC government 26 被 bèi by (indicates passive-voice sentences or clauses); quilt; to cover (literary) 27 本 běn roots or stems of plants; origin; source; this; the current; root; foundation; basis; classifier for books, periodicals, files etc; originally 28 鼻子 bízi nose; CL:個|个[gè],隻|只[zhī] 29 比 bǐ (particle used for comparison and "-er than"); to compare; to contrast; to gesture (with hands); ratio 30 比 bǐjiào compare; contrast; fairly; comparatively; relatively; quite; rather 31 比 bǐsài competition (sports etc); match; CL:場|[chǎng], 次[cì] 32 必 bìxū to have to; must; compulsory; necessarily 33 化 biànhuà change; variation; to change; to vary; CL:個|个[gè] 34 表示 biǎoshì to express; to show; to say; to state; to indicate; to mean hewgill.com/hsk/hsk3.html 2/29 11/5/11 HSK Word List - Level 3 35 表演 biǎoyǎn play; show; performance; exhibition; to perform; to act; to demonstrate; CL:場|[chǎng] 36 别 bié to leave; to depart; to separate; to distinguish; to classify; other; another; do not; must not; to pin 37 别人 biérén other people; others; other person 38 bīnguǎn guesthouse; CL:個|个[gè],家[jiā] 39 冰箱 bīngxiāng icebox; freezer cabinet; refrigerator; CL:臺|台[tái], 個|个[gè] 40 不客气 búkèqi you're welcome; impolite; rude; blunt; don't mention it 41 不 bù (negative prefix); not; no 42 才 cái ability; talent; endowment; gift; an expert; only (then); only if; just 43 菜 cài dish (type of food); vegetables; vegetable; cuisine; CL:盤|[pán],道[dào] 44 菜 càidān menu; CL:份[fèn],張|[zhāng] 45 参加 cānjiā to participate; to take part; to join 46 草 cǎo grass; straw; manuscript; draft (of a document); careless; rough; CL:棵[kē],撮[zuǒ],株[zhū],根[gēn] 47 céng layer; stratum; laminated; floor (of a building); storey; classifier for layers; repeated; sheaf (math.) 48 茶 chá tea; tea plant; CL:杯[bēi],壺|[hú] 49 差 chà differ from; short of; to lack; poor 50 cháng length; long; forever; always; constantly 51 唱歌 chànggē to sing a song 52 超市 chāoshì supermarket (abbr.); CL:家[jiā] 53 衫 chènshān shirt; blouse; CL:件[jiàn] 54 成 chéngjì achievement; performance records; grades; CL:項| [xiàng],個|个[gè] 55 城市 chéngshì city; town; CL:座[zuò] hewgill.com/hsk/hsk3.html 3/29 11/5/11 HSK Word List - Level 3 56 吃 chī to eat; to have one's meal; to eradicate; to destroy; to absorb; to suffer; to exhaust 57 到 chídào to arrive late 58 出 chū to go out; to come out; to occur; to produce; to go beyond; to rise; to put forth; to happen; classifier for dramas, plays, operas etc 59 出 chūxiàn to appear; to arise; to emerge; to show up 60 出租 chūzūchē taxi 61 厨房 chúfáng kitchen; CL:間|[jiān] 62 除了 chúle besides; apart from (... also...); in addition to; except (for) 63 穿 chuān to bore through; pierce; perforate; penetrate; pass through; to dress; to wear; to put on; to thread 64 船 chuán a boat; vessel; ship; CL:條|条[tiáo],艘[sōu],隻|只 [zhī] 65 春 chūn spring (time); gay; joyful; youthful; love; lust; life 66 cíyǔ word (general term including monosyllables through to short phrases); term (e.g. technical term); expression 67 次 cì next in sequence; second; the second (day, time etc); secondary; vice-; sub-; infra-; inferior quality; substandard; order; sequence; hypo- (chemistry); classifier for enumerated events: time 68 明 cōngming acute (of sight and hearing); clever; intelligent; bright; smart 69 从 cóng from; via; passing through; through (a gap); past; ever (followed by negative, meaning never); (formerly pr. zòng and related to 縱|) to follow; to comply with; to obey; to join; to engage in; adopting some mode of action or attitude; follower; retainer; accessory; accomplice; related by common paternal grandfather or earlier ancestor; surname Cong 70 cuò mistake; error; blunder; fault; cross; uneven; wrong; CL:個|个[gè] hewgill.com/hsk/hsk3.html 4/29 11/5/11 HSK Word List - Level 3 71 打 dǎdiànhuà to make a telephone call 72 打球 dǎlánqiú play basketball 73 打 dǎsǎo to clean; to sweep 74 打算 dǎsuàn to plan; to intend; to calculate; plan; intention; calculation; CL:個|个[gè] 75 大 dà big; huge; large; major; great; wide; deep; oldest; eldest 76 大家 dàjiā authority; everyone 77 dài band; belt; girdle; ribbon; tire; area; zone; region; CL:條|条[tiáo]; to wear; to carry; to lead; to bring; to look after; to raise 78 担心 dānxīn anxious; worried; uneasy; to worry; to be anxious 79 蛋糕 dàngāo cake; CL:塊|[kuài],個|个[gè] 80 但是 dànshì but; however 81 当然 dāngrán only natural; as it should be; certainly; of course; without doubt 82 到 dào to (a place); until (a time); up to; to go; to arrive 83 地 de -ly; structural particle: used before a verb or adjective, linking it to preceding modifying adverbial adjunct 84 的 de of; structural particle: used before a noun, linking it to preceding possessive or descriptive attributive 85 得 de structural particle: used after a verb (or adjective as main verb), linking it to following phrase indicating effect, degree, possibility etc 86 灯 dēng lamp; light; lantern; CL:盞|[zhǎn] 87 等 děng to wait for; to await 88 低 dī low; beneath; to lower (one's head); to let droop; to hang down; to incline 89 弟弟 dìdi younger brother; CL:個|个[gè],位[wèi] 90 地方 dìfāng region; regional (away from the central administration) 91 地 dìtiě subway; metro 92 地 dìtú map; CL:張|[zhāng],本[běn] hewgill.com/hsk/hsk3.html 5/29 11/5/11 HSK Word List - Level 3 93 第一 dìyī first; number one 94 点 diǎn drop (of liquid); stain; spot; speck; jot; dot stroke (in Chinese characters); decimal point; point; mark (of degree or level); a place (with certain characteristics); iron bell; o’clock; a little; a bit; some; (point) unit of measurement for type; to touch on briefly; to make clear; to light; to ignite; to kindle; period of time at night (24 minutes) (old); a drip; to dibble; classifier for small indeterminate quantities 95 diànnǎo computer; CL:臺|台[tái] 96 diànshì television; TV; CL:臺|台[tái],個|个[gè] 97 梯 diàntī elevator; CL:臺|台[tái],部[bù] 98 影 diànyǐng movie; film; CL:部[bù],片[piàn],幕[mù],場| [chǎng] 99 子件 diànzǐyóujiàn electronic mail; email; CL:封[fēng] 100 dōng east; host (i.e. sitting on east side of guest); landlord; surname Dong 101 西 dōngxi thing; stuff; person; CL:個|个[gè],件[jiàn] 102 冬 dōng winter 103 懂 dǒng to understand; to know 104 物 dòngwù animal; CL:隻|只[zhī],群[qún],個|个[gè] 105 都 dōu all, both; entirely (due to) each; even; already 106 dú to read; to study; reading of word (i.e. pronunciation), similar to 拼音[pīn yīn] 107 短 duǎn short or brief; to lack; weak point; fault 108 段 duàn paragraph; section; segment; stage (of a process); classifier for stories, periods of time, lengths of thread etc 109 duànliàn to engage in physical exercise; to toughen; to temper 110 duì couple; pair; to be opposite; to oppose; to face; versus; for; to; correct (answer); to answer; to reply; hewgill.com/hsk/hsk3.html 6/29 11/5/11 HSK Word List - Level 3 to direct (towards sth); right 111 不起 duìbuqǐ unworthy; to let down; I'm sorry; excuse me; pardon me; if you please; sorry? (please repeat) 112 多 duō many; much; a lot of; numerous; multi- 113 多么 duōme how (wonderful etc); what (a great idea etc); however (difficult it may be etc) 114 多少 duōshǎo number; amount; somewhat 115 è to be hungry; hungry 116 而且 érqiě (not only ...) but also; moreover; in addition; furthermore 117 儿子 érzi son 118 耳朵 ěrduo ear; CL:隻|只[zhī],個|个[gè],對|[duì] 119 二 èr two; 2; stupid (Beijing dialect) 120 fāshāo have a high temperature (from illness); have a fever 121 fāxiàn to find; to discover 122 fànguǎn restaurant; CL:家[jiā]
Recommended publications
  • POS Tagging for Improving Code-Switching Identification In
    POS Tagging for Improving Code-Switching Identification in Arabic Mohammed Attia1 Younes Samih2 Ali Elkahky1 Hamdy Mubarak2 Ahmed Abdelali2 Kareem Darwish2 1Google LLC, New York City, USA 2Qatar Computing Research Institute, HBKU Research Complex, Doha, Qatar 1{attia,alielkahky}@google.com 2{ysamih,hmubarak,aabdelali,kdarwish}@hbku.edu.qa Abstract and sociolinguistic reasons. Second, existing the- oretical and computational linguistic models are When speakers code-switch between their na- tive language and a second language or lan- based on monolingual data and cannot adequately guage variant, they follow a syntactic pattern explain or deal with the influx of CS data whether where words and phrases from the embedded spoken or written. language are inserted into the matrix language. CS has been studied for over half a century This paper explores the possibility of utiliz- from different perspectives, including theoretical ing this pattern in improving code-switching linguistics (Muysken, 1995; Parkin, 1974), ap- identification between Modern Standard Ara- plied linguistics (Walsh, 1969; Boztepe, 2003; Se- bic (MSA) and Egyptian Arabic (EA). We try to answer the question of how strong is the tati, 1998), socio-linguistics (Barker, 1972; Heller, POS signal in word-level code-switching iden- 2010), psycho-linguistics (Grosjean, 1989; Prior tification. We build a deep learning model en- and Gollan, 2011; Kecskes, 2006), and more re- riched with linguistic features (including POS cently computational linguistics (Solorio and Liu, tags) that outperforms the state-of-the-art re- 2008a; Çetinoglu˘ et al., 2016; Adel et al., 2013b). sults by 1.9% on the development set and 1.0% In this paper, we investigate the possibility of on the test set.
    [Show full text]
  • SELECTING and CREATING a WORD LIST for ENGLISH LANGUAGE TEACHING by Deny A
    Teaching English with Technology, 17(1), 60-72, http://www.tewtjournal.org 60 SELECTING AND CREATING A WORD LIST FOR ENGLISH LANGUAGE TEACHING by Deny A. Kwary and Jurianto Universitas Airlangga Dharmawangsa Dalam Selatan, Surabaya 60286, Indonesia d.a.kwary @ unair.ac.id / [email protected] Abstract Since the introduction of the General Service List (GSL) in 1953, a number of studies have confirmed the significant role of a word list, particularly GSL, in helping ESL students learn English. Given the recent development in technology, several researchers have created word lists, each of them claims to provide a better coverage of a text and a significant role in helping students learn English. This article aims at analyzing the claims made by the existing word lists and proposing a method for selecting words and a creating a word list. The result of this study shows that there are differences in the coverage of the word lists due to the difference in the corpora and the source text analysed. This article also suggests that we should create our own word list, which is both personalized and comprehensive. This means that the word list is not just a list of words. The word list needs to be accompanied with the senses and the patterns of the words, in order to really help ESL students learn English. Keywords: English; GSL; NGSL; vocabulary; word list 1. Introduction A word list has been noted as an essential resource for language teaching, especially for second language teaching. The main purpose for the creation of a word list is to determine the words that the learners need to know, in order to provide more focused learning materials.
    [Show full text]
  • Modeling and Encoding Traditional Wordlists for Machine Applications
    Modeling and Encoding Traditional Wordlists for Machine Applications Shakthi Poornima Jeff Good Department of Linguistics Department of Linguistics University at Buffalo University at Buffalo Buffalo, NY USA Buffalo, NY USA [email protected] [email protected] Abstract Clearly, descriptive linguistic resources can be This paper describes work being done on of potential value not just to traditional linguis- the modeling and encoding of a legacy re- tics, but also to computational linguistics. The source, the traditional descriptive wordlist, difficulty, however, is that the kinds of resources in ways that make its data accessible to produced in the course of linguistic description NLP applications. We describe an abstract are typically not easily exploitable in NLP appli- model for traditional wordlist entries and cations. Nevertheless, in the last decade or so, then provide an instantiation of the model it has become widely recognized that the devel- in RDF/XML which makes clear the re- opment of new digital methods for encoding lan- lationship between our wordlist database guage data can, in principle, not only help descrip- and interlingua approaches aimed towards tive linguists to work more effectively but also al- machine translation, and which also al- low them, with relatively little extra effort, to pro- lows for straightforward interoperation duce resources which can be straightforwardly re- with data from full lexicons. purposed for, among other things, NLP (Simons et al., 2004; Farrar and Lewis, 2007). 1 Introduction Despite this, it has proven difficult to create When looking at the relationship between NLP significant electronic descriptive resources due to and linguistics, it is typical to focus on the dif- the complex and specific problems inevitably as- ferent approaches taken with respect to issues sociated with the conversion of legacy data.
    [Show full text]
  • THE IMPACT of USING WORDLISTS in the LANGUAGE CLASSROOM on STUDENTS’ VOCABULARY ACQUISITION Gülçin Coşgun Ozyegin University
    International Journal of English Language Teaching Vol.4, No.3, pp.49-66, March 2016 ___Published by European Centre for Research Training and Development UK (www.eajournals.org) THE IMPACT OF USING WORDLISTS IN THE LANGUAGE CLASSROOM ON STUDENTS’ VOCABULARY ACQUISITION Gülçin Coşgun Ozyegin University ABSTRACT: Vocabulary has always been an area of interest for many researchers since words represent “the building block upon which knowledge of the second language can be built” and without them people cannot convey the intended meaning (Abrudan, 2010). Nation (1988) emphasised if teachers want to help their learners to deal with unknown words, it would be better to spend more time on vocabulary learning strategies rather than spending time on individual words. However, Schmitt in Schmitt and McCarthy (1997:200) stated that among vocabulary learning strategies only ‘guessing from context’ and ‘key word method’ have been investigated in depth. Therefore, there is need for more research on vocabulary learning whose pedagogical implications may contribute to the field of second language learning. Considering the above- mentioned issues, vocabulary is a worthwhile field to investigate. Hence, this paper aims at proposing a framework for vocabulary teaching strategy in English as a foreign language context. KEYWORDS: Wordlists, Vocabulary Teaching, Word walls, Vocabulary Learning Strategies INTRODUCTION Vocabulary has always been an area of interest and concern for many researchers and teachers alike since words represent “the building block upon which knowledge of the second language can be built” and without them people cannot convey the intended meaning (Abrudan, 2010). Anderson and Freebody (1981) state that a strong correlation exists between vocabulary and academic achievement.
    [Show full text]
  • All-Words Word Sense Disambiguation Using Concept Embeddings
    All-words Word Sense Disambiguation Using Concept Embeddings Rui Suzuki, Kanako Komiya, Masayuki Asahara, Minoru Sasaki, Hiroyuki Shinnou Ibaraki University, 4-12-1 Nakanarusawa, Hitachi, Ibaraki JAPAN, National Institute for Japanese Language and Linguistics, 10-2 Midoricho, Tachikawa, Tokyo, JAPAN, [email protected], kanako.komiya.nlp, minoru.sasaki.01, hiroyuki.shinnou.0828g @vc.ibaraki.ac.jp, [email protected] Abstract All-words word sense disambiguation (all-words WSD) is the task of identifying the senses of all words in a document. Since the sense of a word depends on the context, such as the surrounding words, similar words are believed to have similar sets of surrounding words. We therefore predict the target word senses by calculating the distances between the surrounding word vectors of the target words and their synonyms using word embeddings. In addition, we introduce the new idea of concept embeddings, constructed from concept tag sequences created from the results of previous prediction steps. We predict the target word senses using the distances between surrounding word vectors constructed from word and concept embeddings, via a bootstrapped iterative process. Experimental results show that these concept embeddings were able to improve the performance of Japanese all-words WSD. Keywords: word sense disambiguation, all-words, unsupervised 1. Introduction many words have the same article numbers. Several words Word sense disambiguation (WSD) involves identifying the can have the same precise article number, even when the senses of words in documents. In particular, the WSD task semantic breaks are considered. where the senses of all the words in a document are disam- biguated is referred to as all-words WSD.
    [Show full text]
  • A Web-Based Vocabulary Profiler for Chinese Language Teaching and Research
    DA A web-based vocabulary profiler for Chinese language teaching and research Jun Da, Middle Tennessee State University This paper describes the design and development of a web-based Chinese vocabulary profiler that is intended as a handy tool for Chinese language instruction and research. Given a piece of text, the profiler is capable of generating character and vocabulary profiles with frequency and availability information based on a comparison of the set of characters and words found in the text with a selection of other pre-compiled reference lists such as Da’s Modern Chinese character and N-gram frequency lists, the HSK word list and customized vocabulary list provided by a user. 1. Introduction Vocabulary profiling is a measure of the proportions of low and high frequency vocabulary used in a written text. In addition to frequency information, a profiler designed for lexical analysis of texts often provides other information such as the presence/absence of the set of words from the input text in other specialized word lists and their collocation with other words, etc. While it has been used in stylistic studies of written texts (e.g., Graves’s (2004) profiling of Jane Austen’s novels and letters), vocabulary profiling has also been suggested as a useful instrument in second language acquisition research and pedagogy (Laufer and Nation 1995). For example, Morris and Cobb (2004) examined the potential of using vocabulary profile as a predictor of TESL (Teaching English as a Second Language) trainees’ academic performance and reported that vocabulary profile results correlate significantly with their course grades.
    [Show full text]
  • Simple Features for Statistical Word Sense Disambiguation
    SENSEVAL-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, July 2004 Association for Computational Linguistics Simple Features for Statistical Word Sense Disambiguation Abolfazl K. Lamjiri, Osama El Demerdash, Leila Kosseim CLaC Laboratory Department of Computer Science Concordia University, Montreal, Canada fa keigho,osama el,[email protected] Abstract more than four words around the target are In this paper, we describe our experiments on considered (Ide and V´eronis, 1998). While re- statistical word sense disambiguation (WSD) searchers such as Masterman (1961), Gougen- using two systems based on different ap- heim and Michea (1961), agree with this ob- proaches: Na¨ıve Bayes on word tokens and Max- servation (Ide and V´eronis, 1998), our results imum Entropy on local syntactic and seman- demonstrate that this does not generally ap- tic features. In the first approach, we consider ply to all words. A large context window pro- a context window and a sub-window within it vides domain information which increases the around the word to disambiguate. Within the accuracy for some target words such as bank.n, outside window, only content words are con- but not others like different.a or use.v (see Sec- sidered, but within the sub-window, all words tion 3). This confirms Mihalcea's observations are taken into account. Both window sizes are (Mihalcea, 2002). In our system we allow a tuned by the system for each word to disam- larger context window size and for most of the biguate and accuracies of 75% and 67% were re- words such context window is selected by the spectively obtained for coarse and fine grained system.
    [Show full text]
  • Cross-Linguistic Word Frequency Visualization for PT and EN
    Cross-Linguistic Word Frequency Visualization for PT and EN Mariana Shimabukuro* Christopher Collins† Ontario Tech University Figure 1: Circular heatmap and zoomed rank exploration view. The heatmap shows the distribution of words between Portuguese (PT) and their English (EN) translations in both language ranks. The side view shows words from a chosen cell and their rank location in the bars on the right (PT) and left (EN) side, e.g., the word “that” appears 4 times, indicating PT speakers tend to overuse “that”. ABSTRACT Ex.: “This a strong/powerful tea.”, “I feel very confusing this In this project, we present a visualization for investigating the re- morning.” or “I ate a medicine pill.” lationship between commonly used words in Portuguese and their Acceptability errors can be a source of embarrassment for non- translations in English. This cross-linguistic analysis can help us to native English speakers, and often come from the differences in word understand English word choices made by Portuguese native speak- frequency across languages. For example, the word “polemic” is ers and the influence of language transfer effects. In this paper, considered formal and rarely used in English, whereas its Portuguese we discuss how word frequency is commonly used as a resource translation, “polemico,”ˆ is a very common word. Thus, a Portuguese for both textual and cross-linguistic analysis. Moreover, we briefly speaker may choose to use the word “polemic” in English conversa- explain the data processing pipeline building on machine translation tion, where “controversial” would be a more natural-sounding choice. and word frequencies from large corpora.
    [Show full text]
  • Using Word Lists to Learn Second Language Vocabulary Is Unproductive
    MYTH 2 Using word lists to learn second language vocabulary is unproductive. In the Real World . AT A WORKSHOP IN JAPAN IN 1993, THE NOTED second language acquisi- tion expert Rod Ellis asked the audience, which was composed of per- haps 90 percent Japanese EFL teachers and 10 percent English-speaking teachers, “I assume that most of you teachers here today learned English as a Foreign Language. What methods or techniques did your teachers use?” Most of the responses were murmurs of a few different methods, and then one older woman said in a somewhat shy voice but with impeccable English pronunciation, “Audiolingual.” Dr. Ellis’s next question was addressed to this woman. “Do you think that was a good method? Was it effective?” CopyrightIn (c) typical 2004, Japanese University fashion, of Michiganshe hesitated Press. for a All long rights time. reserved. She finally answered, “Well, I thought it was good,” with emphasis on the word thought to indicate that she may have been wrong. In her voice— as we hear in many of our students’ voices when we pose a difficult question to the entire group—you could hear that she was trying to give the right answer but hedge also. However, she really did think that /35 36 \ Vocabulary Myths audiolingual was a good way to learn English—at least for her. She put emphasis on the word thought to indicate that she expected Dr. Ellis’s next comments to explain why audiolingual was bad since that is what the vast majority of current methods books tell us.
    [Show full text]
  • A Referred Word List Analyzing Tool with Keyword, Concordancing and N-Gram Functions∗∗∗
    VocabAnalyzer: A Referred Word List Analyzing Tool with Keyword, Concordancing and N-gram Functions∗∗∗ a b a Siaw-Fong Chung , F.Y. August Chao , and Yi-Chen Hsieh a Department of English, National Chengchi University, No. 64, ZhiNan Road Section 2, Wenshan District, Taipei City 11605, Taiwan {sfchung, 96551016}@nccu.edu.tw b Department of Management Information Systems, National Chengchi University, No. 64, ZhiNan Road Section 2, Wenshan District, Taipei City 11605, Taiwan [email protected] Abstract. This paper introduces the newly created VocabAnalyzer which is equipped with keyword, concordancing and n-gram functions. The VocabAnalyzer also allows the comparison of the inputted text against Jeng et al. (2002) vocabulary word list. Two case studies will be discussed in this paper. The first study compares two versions of the English Bible and the second study compares word list created by abstracts written by graduate students of various English departments in Taiwan. Both study shows that the VocabAnalyzer is beneficial for applied linguistics studies as well as for teaching and learning of English. Analyses of student writing can be carried out based on the results from this tool. Keywords: VocabAnalyzer, word list, keyword, concordancer, vocabulary 1 Introduction With the increasing expansion of corpus linguistics and its applications to fields such as theoretical linguistics, applied linguistics and language teaching, concordancers are now in greater demand, especially by researchers who do not write programming scripts themselves. Wordsmith (Scott,1999), AntConc (Anthony, 2004) and the Concordancer Software (Nguyen and Munson, 2003) are some of the most frequently used concordancers because these programs allow plain texts to be processed and displayed as KWIC or analyzed according to collocations.
    [Show full text]
  • A New General Service List: the Better Mousetrap We’Ve Been Looking For? Charles Browne Meiji Gakuin University Doi
    Vocabulary Learning and Instruction Volume 3, Issue 1, Month 2014 http://vli-journal.org Article A New General Service List: The Better Mousetrap We’ve Been Looking for? Charles Browne Meiji Gakuin University doi: http://dx.doi.org/10.7820/vli.v03.1.browne Abstract This brief paper introduces the New General Service List (NGSL), a major update of Michael West’s 1953 General Service List (GSL) of core vocabulary for second language learners. After describing the rationale behind the NGSL, the specific steps taken to create it and a discussion of the latest 1.01 version of the list, the paper moves on to comparing the text coverage offered by the NGSL against both the GSL as well as another recent GSL published by Brezina and Gablasova (referred to as Other New General Service List [ONGSL] in this paper). Results indicate that while the original GSL offers slightly better coverage for texts of classic literature (about 0.8% better than the NGSL and 4.5% more than the ONGSL), the NGSL offers 5Á6% more coverage than either list for more modern corpora such as Scientific American or the Economist. In the precomputer era of the early 20th century, Michael West and his colleagues undertook a very ambitious corpus linguistics project that gathered and analyzed more than 2.5 million words of text and culminated in the publication of what is now known as The General Service List (GSL; West, 1953), a list of about 2000 high-frequency words that were deemed important for second language learners. However, as useful and helpful as this list has been
    [Show full text]
  • Annotation of 'Word List by Semantic Principles' Labels for the Balanced
    PACLIC 32 Annotation of ‘Word List by Semantic Principles’ Labels for the Balanced Corpus of Contemporary Written Japanese Sachi Kato Masayuki Asahara Makoto Yamazaki National Institute for Japanese Language and Linguistics, Japan Tokyo, Japan fyasuda-s, masayu-a, yamazakig AT ninjal DOT ac DOT jp Abstract proposed in the benchmark data. A thesaurus-based word-sense tagged corpus has also been proposed. This article presents a word-sense annotation (Bond et al., 2012) developed annotation data for for the Balanced Corpus of Contemporary Japanese WordNet, which is a translation of English Written Japanese: a mashed-up Japanese lexi- data. con based on the ‘Word List by Semantic Prin- ciples’ (WLSP). The WLSP is a large-scale In this study, we develop a new semantic infor- Japanese thesaurus which includes 98,241 en- mation annotation of BCCWJ. The semantic infor- tries with syntactic and hierarchical seman- mation is based on the thesaurus ’Word List by Se- tic categories. We utilized a morpheme-word mantic Principles, Revised and Enlarged Version’ sense alignment table to extract all possible (hereafter WLSP) (Kokuritsu Kokugo Kenkyusho, word sense candidates for each word appear- 2004). This article presents the design of the an- ing in the target corpus. Then, we manually disambiguated the word senses for 182,166 notation and basic statistics of the data. We use a content words in the texts. subset of the core data of BCCWJ as the target cor- pus for word-sense annotations. The data include books (BCCWJ sample ID: PB), magazines (PM), 1 Introduction and newspapers (PN). The BCCWJ has morpheme Semantic information annotation is an important lin- information annotations such as word boundary and guistic resource to explore synonyms by semantic part-of-speech annotations, based on a morphologi- category or figurative expressions by discrepancies cal analyzer dictionary, UniDic.
    [Show full text]