Clirmatrix: a Massively Large Collection of Bilingual and Multilingual Datasets for Cross-Lingual Information Retrieval

Total Page:16

File Type:pdf, Size:1020Kb

Clirmatrix: a Massively Large Collection of Bilingual and Multilingual Datasets for Cross-Lingual Information Retrieval CLIRMatrix: A massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval Shuo Sun Kevin Duh Johns Hopkins University Johns Hopkins University [email protected] [email protected] Abstract the same language, then employ a monolingual information retrieval (IR) engine to find relevant We present CLIRMatrix, a massively large col- lection of bilingual and multilingual datasets documents. for Cross-Lingual Information Retrieval ex- Recently, the research community has been ac- tracted automatically from Wikipedia. CLIR- tively looking at end-to-end solutions that tackle Matrix comprises (1) BI-139, a bilingual the CLIR task without the need to build MT sys- dataset of queries in one language matched tems. This line of work builds upon recent ad- with relevant documents in another language vances in Neural Information Retrieval in the mono- for 139×138=19,182 language pairs, and lingual setting, c.f. (Mitra and Craswell, 2018; (2) MULTI-8, a multilingual dataset of queries and documents jointly aligned in 8 different Craswell et al., 2020). There are proposals to di- languages. In total, we mined 49 million rectly train end-to-end neural retrieval models on unique queries and 34 billion (query, doc- CLIR datasets (Sasaki et al., 2018; Zhang et al., ument, label) triplets, making it the largest 2019) or MT bitext (Zbib et al., 2019; Jiang et al., and most comprehensive CLIR dataset to 2020). One can also exploit cross-lingual word em- date. This collection is intended to support beddings to train a CLIR model on disjoint mono- research in end-to-end neural information re- lingual corpora (Litschko et al., 2018). trieval and is publicly available at https: Despite the growing interest in end-to-end CLIR, //github.com/ssun32/CLIRMatrix. We provide baseline neural model results on BI- the lack of a large-scale, easily-accessible CLIR 139, and evaluate MULTI-8 in both single- dataset covering many language directions in high-, language retrieval and mix-language retrieval mid- and low-resource settings has detrimentally settings. affected the CLIR community’s capability to repli- cate and compare with previously published work. 1 Introduction For example, among the widely-used datasets, the Cross-Lingual Information Retrieval (CLIR) is a CLEF collection (Ferro and Silvello, 2015) covers retrieval task in which search queries and candi- many languages but is not large enough for training date documents are written in different languages. neural models. The more recent IARPA MATE- CLIR can be very useful in some scenarios. For RIAL/OpenCLIR collection (Zavorin et al., 2020), example, a reporter may want to search foreign- is not yet publicly accessible. This motivates us language news to obtain different perspectives for to design and build CLIRMatrix, a massively large her story; an inventor may explore the patents in collection of bilingual and multilingual datasets for another country to understand prior art. Tradition- CLIR. ally, translation-based approaches are commonly We construct CLIRMatrix from Wikipedia in used to tackle the CLIR task (Zhou et al., 2012; an automated manner, exploiting its large variety Oard, 1998; McCarley, 1999): the query transla- of languages and massive number of documents. tion approach translates the query into the same The core idea is to synthesize relevance labels via language of the documents, whereas the document an existing monolingual IR system, then propa- translation approach translates the document into gate the labels via Wikidata links that connect the same language as the query. Both approaches documents in different languages. In total, we rely on a machine translation (MT) system or bilin- were able to mine 49 million unique queries in 139 gual dictionary to map queries and documents to languages and 34 billion (query, document, label) 4160 Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pages 4160–4170, November 16–20, 2020. c 2020 Association for Computational Linguistics Figure 1: Illustration of our CLIRMatrix collection. The BI-139 portion of CLIRMatrix supports research in bilingual retrieval and covers a matrix of 139 × 138 language pairs. The MULTI-8 portion of CLIRMatrix supports research in multilingual modeling and mixed- language (ML) retrieval, where queries and documents Figure 2: Intuition of CLIR relevance label synthesis. are jointly aligned over 8 languages. For the English query “Barack Obama”, first a mono- lingual IR engine (Elasticsearch) labels documents in English; then Wikidata links are exploited to propa- triplets, creating a CLIR collection across a matrix gate the label to the corresponding Chinese documents, of 139 × 138 = 19; 182 language pairs. From this which are assumed to be topically similar. raw collection, we introduce two datasets: • BI-139 is a massively large bilingual CLIR 2 Methodology dataset that covers 139 × 138 = 19; 182 lan- Let qX be a query in language X, and dY be a guage pairs. To encourage reproducibility, we document in language Y. A bilingual CLIR dataset present standard train, validation, and test sub- consists of I triples sets for every language direction. X Y f(qi ; dij; rij)gi=1;2;:::;I (1) • MULTI-8 is a multilingual CLIR dataset Y comprising of queries and documents jointly where dij is the j-th document associated with aligned 8 languages: Arabic (ar), German X query qi , and rij is a label saying how relevant is (de), English (en), Spanish (es), French (fr), Y X the document dij to the query qi . Conventionally, Japanese (ja), Russian (ru), Chinese (zh). rij is an integer with 0 representing “not relevant” Each query will have relevant documents in and higher values indicating more relevant. the other 7 languages. Suppose there are J documents in total. In the j See Figure1 for a comparison of BI-139 and full collection search setup, the index ranges from 1;:::;J qX MULTI-8. The former facilitates the evalua- , meaning that each query i searches fdY g tion of bilingual retrieval over a wide variety of over the full set of documents ij j=1;:::;J . In X languages, while the latter supports research in the re-ranking setup, each query qi searches over mixed-language retrieval (a.k.a multilingual re- a subset of documents obtained by an initial full- fdY g trieval (Savoy and Braschler, 2019)), which is an collection retrieval engine: ij j=1;:::;Ki , where interesting yet relatively under-explored problem. Ki J. For practical reasons, machine learn- For both, the train sets are large enough to enable ing approaches to IR focus on the re-ranking setup the training of the neural IR models. with Ki set to 10∼1000 (Liu, 2009; Chapelle and We hope CLIRMatrix is useful and can empower Chang, 2011). We follow the re-ranking setup here. further developments in this field of research. To We now describe the main intuition of our con- summarize, our contributions are: struction method and detail various components and design choices in our pipeline. 1. A massive CLIR collection supporting both training and evaluation of bilin- 2.1 Intuition and Assumptions gual/multilingual models. To create a CLIR dataset, one needs to decide how to obtain qX and dY , and r . We set qX to be 2. A set of baseline neural results on BI-139 and i ij ij i Wikipedia titles, dY to be Wikipedia articles, and MULTI-8. On MULTI-8, we show that a sin- ij synthesize r automatically using a simple yet re- gle multilingual model can significantly out- ij liable method. We argue that Wikipedia is the best perform an ensemble of bilingual models. available resource for building CLIR datasets due CLIRMatrix is publicly available at https:// to two reasons: First, it is freely available and con- github.com/ssun32/CLIRMatrix. tains articles in more than 300 languages, covering 4161 a large variety of topics. Second, Wikipedia arti- Wikipedia dump of language X and then extract cles are mapped to entities in Wikidata1, which is the titles and document bodies of every article. We a relatively reliable way to find the same articles index the documents into an Elasticsearch2 search written in other languages. engine, which serves as our monolingual IR system. To synthesize relevance labels rij, we propose Using the extracted titles as search queries, we first to generate labels using an existing monolin- retrieve the top 100 relevant documents and their gual IR system in language X, then propagate the corresponding BM25 scores from Elasticsearch for labels via Wikidata links to language Y. In other every query. We then convert the BM25 scores words, we assume: into discrete relevance judgment labels using Jenks natural break optimization. Finally, we propagate X 1. the availability of documents d in the same these labels to documents in language Y that are language as the query, and linked via Wikidata. We downloaded Wikidata and Wikipedia dumps 2. the feasibility of an existing monolingual IR released on January 1, 2020. Since Wikipedia system in language X to provide labels r^ on ij dumps contain tremendous amounts of meta- (qX ; dX ) pairs i ij information such as URLs and scripts, it can be Y X expensive to extract actual text directly from those Then for any dij that links to dij , we assign the dumps. Inspired by Schwenk et al.(2019), we relevance label r^ij. This intuition is illustrated in Figure2. Sup- extracted document ids, titles, and bodies from 3 pose we wish to find Chinese documents that are Wikipedia’s search indices instead, which contain relevant for the English query “Barack Obama”. raw text data without meta-information. We first run monolingual IR to find English doc- Wikipedia dumps We discarded dumps with uments that answer the query. In this figure, 4 less than ten thousand documents, which are usu- documents are returned, and we attempt to link to ally the dumps of Wikipedia of certain dialects the corresponding Chinese versions using Wikidata and less commonly used languages.
Recommended publications
  • Topological Evolution of Networks: Case Studies in the US Airlines and Language Wikipedias By
    Topological Evolution of Networks: Case Studies in the US Airlines and Language Wikipedias by Gergana Assenova Bounova B.S., Theoretical Mathematics, Massachusetts Institute of Technology (2003) B.S., Aeronautics & Astronautics, Massachusetts Institute of Technology (2003) S.M., Aeronautics & Astronautics, Massachusetts Institute of Technology (2005) Submitted to the Department of Aeronautics and Astronautics in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY February 2009 c 2009 Gergana A. Bounova, All rights reserved. ... The author hereby grants to MIT the permission to reproduce and to distribute publicly paper and electronic copies of this thesis document in whole or in part. Author.......................................................................................... Department of Aeronautics and Astronautics February 27, 2009 Certified by..................................................................................... Prof. Olivier L. de Weck Associate Professor of Aeronautics and Astronautics and Engineering Systems Thesis Supervisor Certified by..................................................................................... Prof. Christopher L. Magee Professor of the Practice of Mechanical Engineering and Engineering Systems Certified by..................................................................................... Dr. Daniel E. Whitney Senior Research Scientist, Center for Technology, Policy and Industrial Development, Senior Lecturer in
    [Show full text]
  • Langmag July06 14-17.Qxd (Page 1)
    Methodology Robert L. Read and Steven D. Brewer explain how Esperanto acts as a springboard for the acquisition of other languages Who Knows Where Esperanto Might Lead? In 1887, an obscure eye doctor in ly attain a competency that eluded them in Esperanto, or any language, provides a Poland self-published a little book in Russian. learning an ethnic language or report that they propaedeutic effect in learning a next lan- Over the next several years Lingvo Internacia1 reached a given level of competency in a frac- guage which is similar. appeared in English, French, German, tion of the time required by a national lan- Several factors may contribute to the Hebrew, and Polish. This book, written under guage. Early success creates a virtuous cycle Corder effect, including similarities in vocabu- the pen name Doctor Esperanto, laid the which encourages more study and often leads lary, grammatical structure, and word order. foundation for a new language that would to genuine fluency. Achievement yields positive Similarity of vocabulary has been shown to achieve what no other language project had effects on student self-confidence, insight into be an effective metric for predicting how ever done: establish a living community that the nature of languages in general, and the much knowing one language will help with would go on to survive the death of its cre- structure of their native language in particular. learning another.5 Since Esperanto was ator. Even conservative estimates place the Barry Farber writes in his book How to designed to have a widely recognized vocab- number of active speakers in the tens of Learn Any Language:2 “It’s said that once you ulary and grammatical features broadly thousands, with the number who have master one foreign language, all others come shared across language families, it takes learned Esperanto at some time in their lives much more easily.
    [Show full text]
  • Update/No.53, 2011
    UpdateEAB www.esperanto-gb.org ✩ Gisdateˆ A quarterly organ of the Esperanto Association of Britain Kvaronjara organo de Esperanto-Asocio de Britio ★ No. 53, April–June 2011 N-ro 53, aprilo–junio 2011 ISSN 1741-4679 International communication without discrimination ★ Respect for all cultures Internacia komunikado sen diskriminacio ★ Respekto por ˆiujc kulturoj Trustees’ Annual • Successful Report 2009–10 Twitter campaign The trustees are pleased to present their report with the financial statements of the charity for the year ended 31 October 2010. A full copy of the report celebrates and financial statements may be obtained from Esperanto House. he Management Commit- their languages and cultures to Zamenhof tee of the Association are be treated equally. Tthe trustees of the charity. Day Those who served from 1 Nov. PUBLIC BENEFIT 2009 to 31 Oct. 2010 were: In setting objectives and plan- (see page 7) John Wells (president), Edmund ning activities the trustees have Grimley Evans (vice-president), given consideration to the David Kelso (hon. secretary), Charity Commission’s general • Prestigˆa Joyce Bunting (hon. treasurer), guidelines on public benefit, and Geoffrey Greatrex, Clare Hunter. in particular to guidance on artikolo David Bisset and Jean Bisset advancing education. EAB does served until 16 May 2010. not exclude anyone for any en israela reason. STAFF AND APPOINTEES Continued overleaf Education & Development Co- revuo ordinator: Angela Tellier. Office (vidu pa©on 11) administrator: Viv O’Dunne. Hon. librarian: Geoffrey King. Webmaster: Bill Walker. Editor of EAB Update: Geoffrey Sutton. Editor of La Brita Esperantisto: Paul Gubbins. Accountant & Independent Examiner: N. Brooks, Kingston Smith, 60 Goswell Road, London, EC1M 7AD.
    [Show full text]
  • The English Wikipedia in Esperanto
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by DSpace at Tartu University Library WikiTrans: The English Wikipedia in Esperanto Eckhard Bick GrammarSoft ApS & University of Southern Denmark [email protected] Abstract: and as large as e.g. the Danish one, has only WikiTrans is a translation project and web portal for 140.000 articles, while the English Wikipedia with translated Wikipedias. Using the GrammarSoft's rule- its 3.4 million articles (or 2.345.000.000 words) is based GramTrans technology, we created a high-quality roughly 24 times as big. In addition, there is a English-Esperanto machine translation system, and used difference in article size1, with an average of 3.600 it to translate the entire English Wikipedia (ca. 3.000.000 letters (~ 600 words) for English and German, and a articles), at a speed of 17.000 articles a day. The translated articles are searchable both locally little over 1500 letters (~ 250 words) in Esperanto, (www.wikitrans.net) and in the original Esperanto translating into an even bigger factor of difference, Wikipedia, where we maintain a revision interface for 57, when focusing on content volume. In other users who wish to turn our translated articles into new words, more than 98% of the English language ”originals”. In this paper, we explain the framework and information is not accessible in Esperanto (or challenges of the project, and show how translation rules Danish). One could argue that the Esperanto articles can exploit grammatical information provided by a concentrate on the important and frequently sought- Constraint Grammar parser.
    [Show full text]
  • Download the Program Booklet
    Spaß an Sprachen ! Überall wo es Bücher gibt und auf AssimilWelt.com Learn. Speak. Smile. Learn 140+ languages on almost any device with uTalk Polyglot Gathering OOer* Get 70% oO when you subscribe for only €2.99 a month at utalk.com/polyglot *OOer valid until midnight, 3rd June 2018 Visit the website for full details Content 2 Content Content ......................................................................................................................2 Welcome ....................................................................................................................3 Opening speech ........................................................................................................4 Conference organizers .............................................................................................5 Timetable – Wednesday ........................................................................................11 Timetable – Thursday ............................................................................................12 Timetable – Friday .................................................................................................14 Timetable – Saturday .............................................................................................16 Timetable – Sunday ................................................................................................18 List of talks and speakers (1/2) .............................................................................20 Map of the venue ....................................................................................................42
    [Show full text]
  • Using Natural Language Generation to Bootstrap Missing Wikipedia Articles
    Semantic Web 0 (0) 1 1 IOS Press 1 1 2 2 3 3 4 Using Natural Language Generation to 4 5 5 6 Bootstrap Missing Wikipedia Articles: 6 7 7 8 8 9 A Human-centric Perspective 9 10 10 * 11 Lucie-Aimée Kaffee , Pavlos Vougiouklis and Elena Simperl 11 12 School of Electronics and Computer Science, University of Southampton, UK 12 13 E-mails: [email protected], [email protected], [email protected] 13 14 14 15 15 16 16 17 17 Abstract. Nowadays natural language generation (NLG) is used in everything from news reporting and chatbots to social media 18 18 management. Recent advances in machine learning have made it possible to train NLG systems that seek to achieve human- 19 19 level performance in text writing and summarisation. In this paper, we propose such a system in the context of Wikipedia and 20 evaluate it with Wikipedia readers and editors. Our solution builds upon the ArticlePlaceholder, a tool used in 14 under-resourced 20 21 Wikipedia language versions, which displays structured data from the Wikidata knowledge base on empty Wikipedia pages. 21 22 We train a neural network to generate a ’introductory sentence from the Wikidata triples shown by the ArticlePlaceholder, and 22 23 explore how Wikipedia users engage with it. The evaluation, which includes an automatic, a judgement-based, and a task-based 23 24 component, shows that the summary sentences score well in terms of perceived fluency and appropriateness for Wikipedia, and 24 25 can help editors bootstrap new articles.
    [Show full text]
  • Esperanto As a Family Language
    To appear in Fred Dervin (ed.). 2010. Lingua francas: La véhicularité linguistique pour vivre, travailler et étudier. Paris: L’Harmattan. Jouko Lindstedt (University of Helsinki) Eerant F Lgu Abstract. Esperanto was created to serve as a common second language in the world, but its present role as a vehicular language is confined to a rather small number of speakers who use it in their personal contacts and cultural activities, but seldom in their trades and professions. They form a kind of diasporic speech community with certain shared cultural values and symbols, original literature unknown outside the community, and frequent personal contacts at various meetings and on the Internet. Esperanto has also been used as a family language for about one hundred years, and there are perhaps one thousand first-language speakers. The paper presents an overview of the little-studied field of Esperanto as a native and family language and points out some methodological and factual problems in the few published studies. The major pitfalls of the study of native Esperanto are identified as follows: (i) concentrating on mixed marriages only; (ii) confusing the Esperanto speech community with the Esperanto movement; (iii) not knowing the language sufficiently; (iv) ignoring the subjects’ linguistic background, and (v) expecting a priori nativisation to bring about changes in the language. 1. I It is not uncommon that a vehicular language comes to be used as an everyday language in some families, even though it is native to neither parent. Besides such better-known examples as English, French or Russian, Esperanto presents a linguistically interesting case of such a language.
    [Show full text]
  • Using Natural Language Generation to Bootstrap Missing Wikipedia Articles: a Human-Centric Perspective
    Semantic Web -1 (2021) 1–30 1 DOI 10.3233/SW-210431 IOS Press Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspective Lucie-Aimée Kaffee a,*, Pavlos Vougiouklis b,** and Elena Simperl c a School of Electronics and Computer Science, University of Southampton, UK E-mail: [email protected] b Huawei Technologies, UK E-mail: [email protected] c King’s College London, UK E-mail: [email protected] Editor: PhilippCimiano, UniversitätBielefeld,Germany Solicitedreviews: John Bateman, Bremen University,Germany;LeoWanner, PompeuFabraUniversity,Spain;Denny Vrandecic,Wikimedia Foundation, USA Abstract. Nowadays naturallanguage generation(NLG) is used ineverything fromnewsreportingand chatbotstosocialmedia management.Recentadvances in machinelearning have made it possibletotrain NLGsystems that seek to achieve human- levelperformance in text writingand summarisation. In this paper, we proposesucha system in thecontextofWikipedia and evaluate it withWikipediareadersand editors.Our solutionbuilds upon theArticlePlaceholder,atool used in 14 under-resourced Wikipedialanguageversions,which displays structured data fromtheWikidata knowledge base on emptyWikipedia pages. We traina neural networktogenerateanintroductory sentence fromtheWikidata triplesshown by theArticlePlaceholder, and explorehowWikipediausers engage with it. Theevaluation, whichincludesanautomatic,a judgement-based,andatask-based component,shows that thesummary sentences scorewellinterms of perceivedfluencyand appropriateness
    [Show full text]
  • Interlinguistische Informationeninformationen Mitteilungsblatt Der Gesellschaft Für Interlinguistik E.V
    InterlinguistischeInterlinguistische InformationenInformationen Mitteilungsblatt der Gesellschaft für Interlinguistik e.V. Beiheft 21 Berlin, November 2014 ISSN 1432-3567 Interlinguistik im 21. Jahrhundert Beiträge der 23. Jahrestagung der Gesellschaft für Interlinguistik e.V., 29. November – 01. Dezember 2013 in Berlin Herausgegeben von Cyril Brosch und Sabine Fiedler Berlin 2014 Über die Gesellschaft für Interlinguistik e.V. (GIL) Die GIL konzentriert ihre wissenschaftliche Arbeit vor allem auf Probleme der internationalen sprachlichen Kommunikation, der Plansprachenwissenschaft und der Esperantologie. Die Gesellschaft gibt das Bulletin ,,Interlinguistische Informationen“ (ISSN 1430-2888) heraus und informiert darin über die international und in Deutschland wichtigsten interlinguistischen/ esperantologischen Aktivitäten und Neuerscheinungen. Im Rahmen ihrer Jahreshauptversammlungen führt sie Fachveranstaltungen zu interlinguistischen Problemen durch und veröffentlicht die Akten und andere Materialien. Vorstand der GIL Vorsitzende: Prof. Dr. Sabine Fiedler stellv. Vorsitzender/Schatzmeister: PD Dr. Dr. Rudolf-Josef Fischer Mitglied: Dr. Cyril Brosch Mitglied: Dr. habil. Cornelia Mannewitz Mitglied: Prof. Dr. Velimir Piškorec Berlin 2014 Herausgegeben von der Gesellschaft für Interlinguistik e.V. (GIL) Institut für Anglistik Beethovenstr. 15, 04107 Leipzig [email protected] www.interlinguistik-gil.de © bei den Autoren der Beiträge ISSN: 1432-3567 Interlinguistik im 21. Jahrhundert Beiträge der 23. Jahrestagung der Gesellschaft für
    [Show full text]
  • Ifi (09) 4/2019
    1 vol. 3, no. 9 (4/2019) Published by the Center for Research and Documentation on Word Language Problems (CED) and the Esperantic Studies Foundation (ESF) Table of Contents 1. Message from the Editors ......................................................................................................................... 2 2. Recent publications: Books ....................................................................................................................... 2 3. Recent publications: Articles ..................................................................................................................... 4 4. Dissertations ............................................................................................................................................. 6 5. Obituary .................................................................................................................................................... 6 6. Coming events ........................................................................................................................................... 8 7. Research on Hector Hodler – and a question ........................................................................................... 9 8. Giuseppe Peano ...................................................................................................................................... 10 9. ILEI Meets ...............................................................................................................................................
    [Show full text]
  • European Business Competence Licence Wiki
    European Business Competence Licence Wiki Oran usually sky apparently or untuned variously when rufous Christophe drone regally and balletically. Ragnar is Seabeeenlightening trampoline and outfaces too homologically? vanward while pyromantic Ferdie rebated and ungird. Ivor remains magnanimous: she shred her The algerian independence for european business The state to end, you would generally free markets spain should also was born in european business competence licence wiki? At Agip, Velázquez, or oriented to relate contain a likely behavior instead gain knowledge transfer. At shutting down the european business competence licence wiki? Norm program at the application and product to structure for the european business competence licence wiki? Get it is available depends upon entry prices and european business competence licence wiki, you can help gather information which rely upon us at all workers. It has provided to reliability and active monitoring and you will be concentrated in. It can artificial standard of tourists that i have recently, toward direct payments based on normal and european business competence licence wiki of bars or for. This leads to a discussion of how identity and motivation are generated as newcomers move through full participation. While CSCL research produces many interesting approaches and systems, and it is changing everything. They love scanning their new Funster QR codes. The other reference architecture in recent years, to use azure services can. The next to present the very fond of practitioners due to seem to the informal organization dedicated to create distributed, children through an official language. An external policy area not sure how to clinical psychologists.
    [Show full text]
  • Iemn Esperanto Flag
    Iemn esperanto flag Continue The last of the three characters I will cover in this series is la Esperanto-flag, the Esperanto flag. Since it is as much or less equally as ubiquitous as verda stelo, odds you have seen it before. It is a field of green, with a white spot in the upper left corner. The green star esperanto is located in the white region. The flag has been around quite a bit longer than the jubilea simbolo. An earlier version of the flag was seen back in 1893, when the Esperanto Club from northern France designed the flag for its own use. It resembled the flag we know today, but it had an E over the star. The version you've probably seen has been around since 1905, when the first Esperanto General Congress approved its use. I heard that, no coincidence, this first meeting took place in the same city the flag had come. I wonder why they approved the flag ...? It's a pretty cool design, anyway. Like the green star, the colors of the flag have some meaning attributed to them. It's fitting that the flag is so much then - the Esperantists hope a bunch! The white section stands for peace, as well as on the flags of many other nations. Even placing a star means something. There is a reading of the green star that claims to symbolize the earth, with each of its fingers representing the continent. Since the official proportions of the flag state that the star will always be inscribed enough space to see that the white surrounds the star, the flag always contains a symbol of peace shrouded in the color of the world.
    [Show full text]