Available corpora ID name part of collection online time variety/ spoken/ no. of words (CD‐ROM) info/manual frame speakers written C LD 1 , format 1 ICAME Collection LINK 1960s American English written 1.000.000 (1st edition) C LD 1 Brown Corpus, format 2 ICAME Collection LINK 1960s American English written 1.000.000 (1st edition) C LD 1 LOB: Lancaster‐Oslo‐Bergen Corpus ICAME Collection LINK 1960s British English written 1.000.000 (1st edition) C LD 1 LOB: Lancaster‐Oslo‐Bergen Corpus, ICAME Collection LINK 1960s British English written 1.000.000 tagged version (1st edition) C LD 1 Kolhapur Corpus of Indian English ICAME Collection LINK 1978 Indian English written 1.000.000 (1st edition) C LD 1 Helsinki Corpus of English Texts, ICAME Collection LINK OE, ME, British English written 1.500.000 Diachronic part (1st edition) EModE C LD 1, 2. ACE: Australian Corpus of English ICAME Collection LINK 1986 Australian written 1.000.000 Aufl. (2nd edition) English C LD 1, 2. Brown Corpus, format 1 ICAME Collection LINK 1960s American English written 1.000.000 Aufl. (2nd edition) C LD 1, 2. Brown Corpus, format 2 ICAME Collection LINK 1960s American English written 1.000.000 Aufl. (2nd edition) C LD 1, 2. Brown Corpus, tagged version ICAME Collection LINK 1960s American English written 1.000.000 Aufl. (2nd edition) C LD 1, 2. CEECS: Corpus of Early English ICAME Collection LINK 1418‐ British English written 450.000 Aufl. Correspondence Sampler (2nd edition) 1680 C LD 1, 2. COLT: Corpus of London Teenage Language ICAME Collection LINK 1993 British English; spoken 500.000 Aufl. (2nd edition) adolescents C LD 1, 2. FLOB: Freiburg‐LOB Corpus of British ICAME Collection LINK 1990s British English written 1.000.000 Aufl. English (2nd edition) C LD 1, 2. FROWN: Freiburg‐Brown Corpus of ICAME Collection LINK 1990s American English written 1.000.000 Aufl. American English (2nd edition)

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 1/7 C LD 1, 2. Helsinki Corpus of English Texts, ICAME Collection LINK OE, ME, British English written 1.500.000 Aufl. Diachronic part (2nd edition) EModE C LD 1, 2. ICE‐EA: International Corpus of English, ICAME Collection LINK 1990‐ Kenyan English, spoken & 1.000.000 Aufl. East‐African component (2nd edition) 1996 Tanzanian written English C LD 1, 2. ICAMET: Innsbruck Computer‐Archive of ICAME Collection LINK ME British English written ? Aufl. Machine‐Readable English Texts (2nd edition) C LD 1, 2. Kolhapur Corpus of Indian English ICAME Collection LINK 1978 Indian English written 1.000.000 Aufl. (2nd edition) C LD 1, 2. Lampeter Corpus of Early Modern English ICAME Collection LINK 1640‐ British English written 1.200.000 Aufl. Tracts (2nd edition) 1740 C LD 1, 2. London‐Lund Corpus ICAME Collection LINK 1953 to British English spoken 500.000 Aufl. (2nd edition) 1987 C LD 1, 2. LOB: Lancaster‐Oslo‐Bergen Corpus ICAME Collection LINK 1960s British English written 1.000.000 Aufl. (2nd edition) C LD 1, 2. LOB: Lancaster‐Oslo‐Bergen Corpus, ICAME Collection LINK 1960s British English written 1.000.000 Aufl. tagged version (2nd edition) C LD 1, 2. Newdigate Newsletters ICAME Collection LINK 1670s‐ British English written 750.000 Aufl. (2nd edition) 1690s C LD 1, 2. Helsinki Corpus of Older Scots ICAME Collection LINK 1450‐ Scots written 870.000 Aufl. (2nd edition) 1700 C LD 1, 2. POW: Polytechnic of Wales Corpus ICAME Collection LINK 1978‐ British English; spoken 65.000 Aufl. (2nd edition) 1984 native speaker children C LD 1, 2. SEC: Lancaster/IBM Spoken English Corpus ICAME Collection LINK 1980s British English spoken 50.000 Aufl. (2nd edition) C LD 1, 2. Wellington Corpus of Written New ICAME Collection LINK 1986‐ New Zealand written 1.000.000 Aufl. Zealand English (2nd edition) 1990 English C LD 1, 2. Wellington Corpus of Spoken New Zealand ICAME Collection LINK 1990s New Zealand spoken 1.000.000 Aufl. English (2nd edition) English C LD 1, 2. Lancaster Parsed Corpus ICAME Collection LINK 1990s British English written 130.000 Aufl. (2nd edition)

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 2/7 C LD 2 International Corpus of Learner English n.a. LINK 1990s‐ advanced/uni‐ written 2.500.000 (ICLE) 2000s versity learners (10 different mother tongues)

C LD 2, 2. International Corpus of Learner English n.a. LINK 1990s‐ advanced/uni‐ written 3.700.000 Aufl. (ICLE), Version 2 (34‐bit version) 2000s versity learners (16 different mother tongues)

CLD 2, 3. International Corpus of Learner English n.a. LINK 1990s‐ advanced/uni‐ written 3.700.000 Aufl. (ICLE), Version 3 (64‐bit version) 2000s versity learners (16 different mother tongues)

C LD 3 ICE‐IND: International Corpus of n.a. LINK 1990+ Indian English spoken & 1.000.000 English,Indian component written C LD 3‐T ICE‐IND: International Corpus of n.a. LINK 1990+ Indian English spoken & 1.000.000 English,Indian component ‐ tagged written C LD 5 ICE‐EA: International Corpus of English, n.a. LINK 1990+ Kenyan English, spoken & 1.000.000 East‐African component Tanzanian written English C LD 6 ICE‐PHI: International Corpus of English, n.a. LINK 1990+ Philippine English spoken & 1.000.000 The Philippines component written C LD 7 ICE‐NZ: International Corpus of English, n.a. LINK 1990+ New Zealand spoken & 1.000.000 The New Zealand component English written C LD 7‐T ICE‐NZ: International Corpus of English, n.a. LINK 1990+ New Zealand spoken & 1.000.000 The New Zealand component ‐ tagged English written

C LD 8 MARSEC ‐Machine Readable Spoken n.a. LINK British English spoken 50.000 English Corpus C LD 9 ICE‐SIN: International Corpus of English, n.a. LINK 1990+ Singapore spoken & 1.000.000 The Singapore component English written

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 3/7 C LD 9‐T ICE‐SIN: International Corpus of English, n.a. LINK 1990+ Singapore spoken & 1.000.000 The Singapore component ‐ tagged English written

C LD 10 ICE‐GB: International Corpus of English, n.a. LINK British English spoken & 1.000.000 The British component written C LD 10, 2. ICE‐GB: International Corpus of English, n.a. LINK 1990+ British English spoken & 1.000.000 Aufl. The British component (Release 2: 2006) written

C LD 10 ‐ S, The International Corpus of English, The n.a. 1990+ British English spoken 600.000 2. Auflage British Component, Sound recordings (11 CDs) C LD 11 BNC World Edition n.a. LINK 1990s British English spoken & 100.000.000 written C LD 12 BNCweb Query System n.a. LINK 1990s British English spoken & 100.000.000 written C LD 13 BNC Sampler BNC Baby Disk V2 LINK 1990s British English spoken & 2.000.000 written C LD 13 A Standard Corpus of Present Day Edited BNC Baby Disk V2 LINK 1990s American English written 1.000.000 American English (the original Brown)

C LD 14 ICE‐HK: International Corpus of English, n.a. LINK 1990+ Hong Kong spoken & 1.000.000 TheHong Kong component English written C LD 14‐T ICE‐HK: International Corpus of English, n.a. LINK 1990+ Hong Kong spoken & 1.000.000 The Hong Kong component ‐ tagged English written

C LD 15 ICNALE V 1.2: International Corpus n.a. LINK learners from 9 written 1.200.000 Network of Asian Learners of English Asian countries C LD 15/2 ICNALE V 2.0: International Corpus n.a. LINK learners from 10 written 1.300.000 Network of Asian Learners of English Asian countries

C LD 16 DCPSE (The Diachronic Corpus of Present‐ n.a. LINK 1950s to British English spoken 800.000 Day Spoken English) 1990s

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 4/7 C LD 17 MEMT (Middle English Medical Texts) n.a. LINK 1375‐ British English written 500.000 1500 C LD 18 LOCNESS (Louvain Corpus of Native English n.a. LINK American written 320.000 Essays) English, British English C LD 19 CED: A Corpus of English Dialogues 1560‐ n.a. LINK 1560‐ British English written (drama 1.200.000 1760 1760 etc.)

C LD 20 ICE‐IR: International Corpus of English, The n.a. LINK 1990+ Irish English spoken & 1.000.000 Irish component written C LD 21‐CD LINDSEI ‐ Louvain International Database n.a. LINK 2004 advanced/uni‐ spoken 1.000.000 of Spoken English Interlanguage (34‐bit‐ versity learners version) C LD 21‐ LINDSEI ‐ Louvain International Database n.a. LINK 2004 advanced/uni‐ spoken 1.000.000 CD, 2. Aufl. of Spoken English Interlanguage, Version 2 versity learners (64‐bit version)

C LD 22 ICE‐SL: The International Corpus of English, n.a. LINK 2003‐ Sri Lankan written 400.000 The Sri Lankan Component ‐ written only 2009 English

C LD 23 ZEN ‐ Zurich English Newspaper Corpus n.a. LINK 1661‐ British English written 1.600.000 1791 C LD 24 ICE‐CAN: International Corpus of English, n.a. LINK 1990+ Canadian English spoken & 1.000.000 The Canadian component written C LD 24‐T ICE‐CAN: International Corpus of English, n.a. LINK 1990+ Canadian English spoken & 1.000.000 The Canadian component ‐ tagged written

C LD 25 ICE‐JA: International Corpus of English, The n.a. LINK 1990+ Jamaican English spoken & 1.000.000 Jamaican component written C LD 25‐T ICE‐JA: International Corpus of English, The n.a. LINK 1990+ Jamaican English spoken & 1.000.000 Jamaican component ‐ tagged written

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 5/7 C LD 26 SAVE: South Asian Varieties of English n.a. LINK 2000‐ South Asian written 18.000.000 2008 Englishes C LD 27 ICE‐USA: International Corpus of English, n.a. LINK 1990+ American English written 400.000 The American component ‐written only

C LD 28 SPICE‐Ireland (ICE‐Ireland with pragmatic n.a. LINK 1990+ Irish English spoken & 1.000.000 annotation) written C LD 29 ICE‐Nigeria: International Corpus of n.a. Link 2000+ Nigerian English spoken & 1.000.000 English, The Nigerian component written C LD 30 WebELF: The Web ‐ English as a Lingua n.a. Link 1990+ English as a written 2.700.000 Franca Lingua Franca on the Internet C LD 31 PPCMBE: Penn Parsed Corpus of Modern Penn Parsed Corpora LINK 1700‐ Modern British written 950.000 British English of Historical English 1914 English

C LD 31 PPCME2: Penn‐Helsinki Parsed Corpus of Penn Parsed Corpora LINK 1150‐ Middle English written 1.155.000 Middle English, 2nd Edition of Historical English 1500

C LD 31 PPCEME: Penn‐Helsinki Corpus of Early Penn Parsed Corpora LINK 1500‐ Early Modern written 1.737.000 Modern English of Historical English 1710 English

C LD 32 ICCI: International Corpus of n.a. n.a. 2007+ Austrian learners written 113.515 Crosslinguistic Interlanguage ‐ Austrian of English aged component 11‐17

C LD 33 CCE: Corpus of Cameroon English n.a. n.a. 1990‐ Cameroon written 821.000 1994 English C LD 34 COCA: Corpus of Contemporary American n.a. n.a. 1990‐ Contemporary written & 450 000 000 English 2015 American English spoken

C LD 35 COHA: Corpus of Historical American n.a. n.a. 1810‐ Historical written 400 000 000 English 2009 American English

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 6/7 C LD 36 GloWbE: Corpus of Global Web‐Based n.a. n.a. 2012‐ Web‐based written 1,900,000,000 English 2013 English from 20 different countries

Corpus Linguistics Help, JLU Gießen last updated: 05 January 2016 7/7