Analysing Language-Specific Differences in Multilingual

Total Page:16

File Type:pdf, Size:1020Kb

Analysing Language-Specific Differences in Multilingual ANALYSING LANGUAGE-SPECIFIC DIFFERENCES IN MULTILINGUAL WIKIPEDIA Fakult¨atf¨urElektrotechnik und Informatik der Gottfried Wilhelm Leibniz Universit¨atHannover zur Erlangung des Grades Master of Science M. Sc. Thesis von Simon Gottschalk Erstpr¨ufer:Prof. Dr. techn. Wolfgang Nejdl Zweitpr¨ufer:Prof. Dr. Robert J¨aschke Betreuer: Dr. Elena Demidova 2015 ABSTRACT Wikipedia is a free encyclopedia that has editions in more than 280 languages. While Wikipedia articles referring to the same entity often co-exist in many Wikipedia language editions, such articles evolve independently and often con- tain complementary information or represent community-specific point of view on the entity under consideration. In this thesis we analyse features that en- able to uncover such edition-specific aspects within Wikipedia articles to pro- vide users with an overview of overlapping and complementary information available for an entity in different language editions. In this thesis we compare Wikipedia articles at different levels of granular- ity: First, we identify similar sentences. Then, these sentences are merged to align similar paragraphs. Finally, a similarity score at the article level is com- puted. To align sentences, we employ syntactic and semantic features including cosine similarity, links to other Wikipedia articles and time expressions. We evaluated the sentence alignment function on a dataset containing 1155 sen- tence pairs extracted from 59 articles in German and English Wikipedia that had been annotated during a user study. Our evaluation results demonstrated that the inclusion of semantic features can lead to an improvement of the break-even point from 70:95% to 77:52% in this dataset. Given the sentence alignment function, we developed an algorithm to build similar paragraphs starting from the sentences that have been aligned before. We implemented a visualization of the algorithm results that enables users to obtain an overview of the similarities and differences in the articles by looking at the paragraphs aligned using the proposed algorithm and the other para- graphs, whose contents are unique to an article in a specific language edition. To further support this comparison, we defined an overall article similarity score and applied this score to illustrate temporal differences between article editions. Finally, we created a Web-based application presenting our results and visualising all the aspects described above. In the future work, the algorithms developed in this thesis can be directly applied as a help for Wikipedia authors to provide an overview of the en- tity representation across Wikipedia language editions. These algorithms can also build a basis for cultural research towards better understanding of the language-specific similarities and differences in multilingual Wikipedia. Contents Table of Contents iii List of Figures vii List of Tables ix List of Algorithms xi 1 Introduction1 1.1 Motivation.................................2 1.2 Problem Definition............................3 1.3 Overview..................................5 2 Background on Multilingual Wikipedia7 2.1 Overview..................................8 2.2 Wikipedia Guidelines...........................9 2.2.1 Translations............................9 2.2.2 Neutrality............................. 11 2.3 Linguistic Point of View......................... 11 2.4 Reasons for Multilingual Differences................... 12 2.5 Wikipedia Studies............................. 14 3 Background on Multilingual Text Processing 17 3.1 NLP for Multilingual Text........................ 17 iii iv 3.1.1 Machine Translation....................... 17 3.1.2 Textual Features......................... 19 3.1.3 Topic Extraction......................... 19 3.1.4 Sentence Splitting......................... 21 3.1.5 Other NLP techniques...................... 21 3.2 Aligning Multilingual Text........................ 23 3.2.1 Comporable Corpora....................... 23 3.2.2 Plagiarism Detection in Multilingual Text........... 24 4 Approach Overview 27 5 Feature Selection and Extraction 31 5.1 Syntactic Features............................ 31 5.2 Evaluation on Sentence Similarity of Parallel Corpus......... 33 5.3 Semantic Features............................. 36 5.4 Evaluation of Entity Extraction Tools.................. 39 5.4.1 Aim and NER tools........................ 40 5.4.2 Data................................ 40 5.4.3 Entity Extraction and Comparison............... 41 5.4.4 Comparison............................ 42 5.4.5 Results............................... 43 6 Sentence Alignment and Evaluation 47 6.1 Data.................................... 48 6.2 Pre-Selection of Sentence Pairs..................... 49 6.3 Selection of Sentence Pairs for Evaluation............... 52 6.4 User Study................................ 53 6.5 Judgement of Similarity Measures.................... 54 6.6 Second Dataset.............................. 59 6.7 Pre-Selection and Creation of Similarity Function........... 59 6.8 Results................................... 61 7 Paragraph Alignment and Article Comparison 69 7.1 Finding Similar Paragraphs....................... 69 7.1.1 Aggregation of Neighboured Sentences............. 71 7.1.2 Aggregation of Proximate Sentence Pairs............ 72 7.1.3 Paragraph Aligning Algorithm.................. 74 v 7.2 Similarity on Article Level........................ 75 7.2.1 Text Similarity.......................... 75 7.2.2 Feature Similarity......................... 76 7.2.3 Overall Similarity......................... 78 7.3 Visualisation................................ 79 8 Implementation 85 8.1 Data Model................................ 85 8.2 Comparison Extracting.......................... 87 8.3 Preprocessing Pipeline.......................... 89 8.4 Text Parsing................................ 90 8.5 Resources................................. 91 9 Discussion and Future Work 93 9.1 Discussion................................. 93 9.2 Future Research Directions........................ 94 Bibliography 97 List of Figures 1.1 Text Comparison Example........................4 2.1 English Wikipedia Article ”Großer Wannsee".............8 2.2 Interlanguage links for the English article "Pfaueninsel"........ 10 3.1 First Paragraphs of the Wikipedia Article "Berlin".......... 22 4.1 Process of Article Comparison...................... 27 5.1 Precision Recall Graphs for Textual Features with Break-Even Points 35 5.2 Box Plots for Textual Features...................... 36 6.1 Screenshot of User Study on Similar Sentences............. 54 6.2 Correlation of Syntactic Features for First Data Set.......... 56 6.3 Correlation of Text Length Similarity.................. 57 6.4 Correlation of External Links Similarity................ 57 6.5 Correlation of Time and Entity Similarity for the first Dataset.... 58 6.6 Iteration to Create Similarity Functions................. 60 6.7 Precision-recall Diagram of Sentences with Overlapping Facts.... 64 6.8 Precision-recall Diagram of Sentences with the Same Facts...... 65 6.9 Precision-recall Diagram of Sentences with the Same Facts (Adjusted Similarity Functions)........................... 66 7.1 Paragraph Construction Example (Step 1)............... 70 7.2 Paragraph Construction Example (Steps 2 and 3)........... 70 vii viii LIST OF FIGURES 7.3 Paragraph Construction Example (Steps 4 and 5)........... 71 7.4 Comparison of the English and German article on "Knipp"...... 79 7.5 Website Example: Text.......................... 81 7.6 Website Example: Links......................... 81 7.7 Website Example: Images........................ 82 7.8 Website Example: Authors........................ 82 7.9 Website Example: Overall Similarity.................. 83 8.1 Data Model................................ 86 8.2 Preprocessing Pipeline.......................... 89 List of Tables 2.1 Statistics on Wikipedias in Different Languages............9 3.1 Machine Translation Example...................... 18 5.1 Example Sentence Pairs for Time Similarity.............. 37 5.2 Statistics of the N3 Dataset....................... 41 5.3 Number of Entities Extracted from English Texts........... 42 5.4 Number of Entities Extracted from German Texts........... 42 5.5 Results of Entity Extraction....................... 44 6.1 Wikipedia Articles Used in the User Study............... 49 6.2 Feature Combination Distribution in 14 Wikipedia Articles...... 50 6.3 Weights of Similarity Functions for Pre-Selection............ 51 6.4 Feature combination distribution in pre-selected Sentence Pairs... 52 6.5 Feature Distribution in the Dataset for the First Round of Evaluation 53 6.6 Correlation Coefficients for Similarity Measures............ 55 6.7 Dataset Evaluated in the Second Round................ 62 6.8 Retrieved Sentence Pairs per Article Pair................ 67 7.1 Composition of Overall Similarity.................... 78 7.2 60 Wikipedia Article Pairs Ordered by Overall Similarity....... 84 8.1 Example of Revisions of an Article in Different Languages...... 88 8.2 Example of Revision Triples....................... 89 ix List of Algorithms 5.1 Computation of TP, FP and FN for the Evaluation of Entity Extraction 43 6.1 Identification of Candidates for Similar Sentences........... 51 7.1 Extension of Sentence Pairs with Neighbours.............. 72 7.2 Extension of a Sentence with its Neighbours.............. 72 7.3 Aggregation of Sentence Pairs...................... 73 7.4 Paragraph Alignment........................... 74 xi 1 Introduction Wikipedia1 is a user-generated
Recommended publications
  • Position Description Addenda
    POSITION DESCRIPTION January 2014 Wikimedia Foundation Executive Director - Addenda The Wikimedia Foundation is a radically transparent organization, and much information can be found at www.wikimediafoundation.org . That said, certain information might be particularly useful to nominators and prospective candidates, including: Announcements pertaining to the Wikimedia Foundation Executive Director Search Kicking off the search for our next Executive Director by Former Wikimedia Foundation Board Chair Kat Walsh An announcement from Wikimedia Foundation ED Sue Gardner by Wikimedia Executive Director Sue Gardner Video Interviews on the Wikimedia Community and Foundation and Its History Some of the values and experiences of the Wikimedia Community are best described directly by those who have been intimately involved in the organization’s dramatic expansion. The following interviews are available for viewing though mOppenheim.TV . • 2013 Interview with Former Wikimedia Board Chair Kat Walsh • 2013 Interview with Wikimedia Executive Director Sue Gardner • 2009 Interview with Wikimedia Executive Director Sue Gardner Guiding Principles of the Wikimedia Foundation and the Wikimedia Community The following article by Sue Gardner, the current Executive Director of the Wikimedia Foundation, has received broad distribution and summarizes some of the core cultural values shared by Wikimedia’s staff, board and community. Topics covered include: • Freedom and open source • Serving every human being • Transparency • Accountability • Stewardship • Shared power • Internationalism • Free speech • Independence More information can be found at: https://meta.wikimedia.org/wiki/User:Sue_Gardner/Wikimedia_Foundation_Guiding_Principles Wikimedia Policies The Wikimedia Foundation has an extensive list of policies and procedures available online at: http://wikimediafoundation.org/wiki/Policies Wikimedia Projects All major projects of the Wikimedia Foundation are collaboratively developed by users around the world using the MediaWiki software.
    [Show full text]
  • Arxiv:1212.0018V2 [Cs.SI] 18 Dec 2012 Etn Ucin Ffe-Akteoois[ Economies Free-Market of Functions Setting E.[ Ref
    Evidence for Non-Finite-State Computation in a Human Social System Simon DeDeo∗ Santa Fe Institute, Santa Fe, NM 87501, USA (Dated: December 19, 2012) We investigate the computational structure of a paradigmatic example of distributed so- cial interaction: that of the open-source Wikipedia community. The typical computational approach to modeling such a system is to rely on finite-state machines. However, we find strong evidence in this system for the emergence of processing powers over and above the finite-state. Thus, Wikipedia, understood as an information processing system, must have access to (at least one) effectively unbounded resource. The nature of this resource is such that one observes far longer runs of cooperative behavior than one would expect using finite- state models. We provide evidence that the emergence of this non-finite-state computation is driven by collective interaction effects. Social systems—particularly human social systems—process information. From the price- setting functions of free-market economies [1, 2] to resource management in traditional communi- ties [3], from deliberations in large-scale democracies [4, 5] to the formation of opinions and spread of reputational information in organizations [6] and social groups [7, 8], it has been recognized that such groups can perform functions analogous to (and often better than) engineered systems. Such functional roles are found in groups in addition to their contingent historical aspects and, when described mathematically, may be compared across cultures and times. The computational phenomena implicit in social systems are only now, with the advent of large, high-resolution data-sets, coming under systematic, empirical study at large scales.
    [Show full text]
  • The Importance of Female Voices
    The Importance of Female Voices Goals Collaborate to read and understand a text, and examine and use topic-specific vocabulary; Identify and define individual and collective knowledge, differentiating knowledge of girls and boys. Essential Questions Are women today participating in Canadian public life in percentages equal to that of men? Have efforts to increase women’s participation in public life succeeded in Canada? Does it matter if female voices are not heard in more equal proportions? Enduring Understandings Even after much progress in the past 30 years, women are vastly under-represented in many aspects of Canadian public life. This imbalance even showed up in Wikipedia, the open online resource encyclopedia, where only 15-20% of content contributors are women. Efforts to increase female participation in public life in the Canada have succeeded but growth is slow. The executives of the Wikipedia foundation viewed the small number of female contributors as a problem. Many believe that increasing the percentage of female voices will enrich diversity of opinions, and better reflect the user community. Materials Define Gender Gap? (2011) Making the edit: why we need more women in Wikipedia (2019) Wikipedia is a mirror of the world's gender biases (2018) Vocabulary intractable [ in-trak-tuh-buh ] (adjective) not easily controlled or directed; not docile or manageable; stubborn; obstinate. nuance [ noo-ahns ] (noun) a subtle difference or distinction in expression or meaning. disparity [ di-ˈsper-ə-tee ] (noun) difference; containing or made up of fundamentally different or unequal elements. egalitarian [ ih-gal-i-tair-ee-uh n ] (adjective) characterized by a belief in the equal status of all people in political, economic, or social life.
    [Show full text]
  • Wikimedia Foundation 2010-11 Plan
    Wikimedia Foundation The Year Ahead, and the Year in Review SUE GARDNER WIKIMANIA – WASHINGTON DC – 14 JULY 2012 Purpose of this presentation is two things: Tell you what the WMF did in 2011-12 Tell you what the WMF plans to do in 2012-13 3 34.8 million dollars 4 You wrote the projects. You earned the goodwill. Your work is what donors are supporting. 5 6 7 Recapping 2011-12 8 Recapping 2011-12 Activities Supported 25% increase in readership from 400 to 500 million UVs; Visual Editor – released with save/edit capability & new parser, in restricted namespace on Mediawiki.org; Global Ed work is being done in 12 countries, and in eight the work is driven by volunteers – 19 million characters added to Wikipedia, with 50% female editors; The mobile platform has been redeveloped, Android app released, mobile PVs up 187%. Wikipedia Zero launched with new deals with two companies covering 28 countries, and more underway; Editor recruitment activity is underway in India and Brazil; Wikimedia Labs is launched and exceeding participation targets; Internationalization team has made significant improvements, particularly for Indic languages. It has also created new translation tools; New editor engagement – MoodBar/Feedback Dashboard, AFTv5, New Pages Feed tool, plus many research projects and small experiments (e.g., warnings & welcoming study, lapsed editor e-mail experiment,Teahouse). Nonetheless, editors are down to 85 from 89K; Upload wizard increased image uploads 27% over the year, with WLM causing a visible spike in September 2011. 9 Strategy Plan 2015 Goals (for reference) 1) Increase readership. 2) Increase the quantity of material we offer.
    [Show full text]
  • Emotions and Activity Profiles of Influential Users in Product Reviews Communities
    Research Collection Journal Article Emotions and activity profiles of influential users in product reviews communities Author(s): Tanase, Dorian; Garcia, David; Garas, Antonios; Schweitzer, Frank Publication Date: 2015-11 Permanent Link: https://doi.org/10.3929/ethz-b-000108082 Originally published in: Frontiers in Physics 3, http://doi.org/10.3389/fphy.2015.00087 Rights / License: Creative Commons Attribution 4.0 International This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library ORIGINAL RESEARCH published: 17 November 2015 doi: 10.3389/fphy.2015.00087 Emotions and Activity Profiles of Influential Users in Product Reviews Communities Dorian Tanase, David Garcia *, Antonios Garas and Frank Schweitzer Chair of Systems Design, ETH Zurich, Zurich, Switzerland Viral marketing seeks to maximize the spread of a campaign through an online social network, often targeting influential nodes with high centrality. In this article, we analyze behavioral aspects of influential users in trust-based product reviews communities, quantifying emotional expression, helpfulness, and user activity level. We focus on two independent product review communities, Dooyoo and Epinions, in which users can write product reviews and define trust links to filter product recommendations. Following the patterns of social contagion processes, we measure user social influence by means of the k-shell decomposition of trust networks. For each of these users, we apply sentiment analysis to extract their extent of positive, negative, and neutral emotional expression. In addition, we quantify the level of feedback they received in their reviews, the length of their contributions, and their level of activity over their lifetime in the community.
    [Show full text]
  • Brain & Cognitive Science 2016
    Brain & Cognitive Science 2016 press.princeton.edu Contents 1 general interest 3 psychology New New 6 Phishing for Phools How to Clone a The Economics of Mammoth social science Manipulation and Deception The Science of De-Extinction George A. Akerlof & Beth Shapiro Robert J. Shiller 9 “[A] fascinating book. A great biology & “This fun but serious book tells popular science title, and one neuroscience how the standard story about that makes it clear that a future free markets often gets it wrong. you may have imagined is already 11 Indeed, Akerlof and Shiller suggest underway.” that we should drop the view —Library Journal, starred review philosophy of markets as generally benign “As a researcher who is shaping institutions. The argument is laid this eld, Shapiro is the perfect out with the help of fascinating 12 guide to the ongoing discussion anecdotes, the language is con- best of the backlist about de-extinction. While many versational, and the book is easy news items and conference to read.” presentations have focused on 13 —Dani Rodrik, author of The the technology required to create index | order form Globalization Paradox extinct life, Shapiro carefully “Phishing for Phools is a coherent considers every step along the and highly plausible explanation journey to de-extinction, from of why markets—although usually choosing a species to revive to bene cial—can lead to undesir- making sure they don’t become able outcomes. The book takes extinct all over again. Whether an intriguing approach and gives you’re all for de-extinction or many interesting examples.” against it, Shapiro’s sharp, witty, —Diane Coyle, author of GDP: A and impeccably-argued book is Brief but A ectionate History essential for informing those who Ever since Adam Smith, the central will decide what life will become.” teaching of economics has been —Brian Switek, NationalGeo- that free markets provide us with graphic.com’s Laelaps blog material well-being, as if by an in- “Beth Shapiro.
    [Show full text]
  • Florida State University Libraries
    )ORULGD6WDWH8QLYHUVLW\/LEUDULHV 2020 Wiki-Donna: A Contribution to a More Gender-Balanced History of Italian Literature Online Zoe D'Alessandro Follow this and additional works at DigiNole: FSU's Digital Repository. For more information, please contact [email protected] THE FLORIDA STATE UNIVERSITY COLLEGE OF ARTS & SCIENCES WIKI-DONNA: A CONTRIBUTION TO A MORE GENDER-BALANCED HISTORY OF ITALIAN LITERATURE ONLINE By ZOE D’ALESSANDRO A Thesis submitted to the Department of Modern Languages and Linguistics in partial fulfillment of the requirements for graduation with Honors in the Major Degree Awarded: Spring, 2020 The members of the Defense Committee approve the thesis of Zoe D’Alessandro defended on April 20, 2020. Dr. Silvia Valisa Thesis Director Dr. Celia Caputi Outside Committee Member Dr. Elizabeth Coggeshall Committee Member Introduction Last year I was reading Una donna (1906) by Sibilla Aleramo, one of the most important ​ ​ works in Italian modern literature and among the very first explicitly feminist works in the Italian language. Wanting to know more about it, I looked it up on Wikipedia. Although there exists a full entry in the Italian Wikipedia (consisting of a plot summary, publishing information, and external links), the corresponding page in the English Wikipedia consisted only of a short quote derived from a book devoted to gender studies, but that did not address that specific work in great detail. As in-depth and ubiquitous as Wikipedia usually is, I had never thought a work as important as this wouldn’t have its own page. This discovery prompted the question: if this page hadn’t been translated, what else was missing? And was this true of every entry for books across languages, or more so for women writers? My work in expanding the entry for Una donna was the beginning of my exploration ​ ​ into the presence of Italian women writers in the Italian and English Wikipedias, and how it relates back to canon, Wikipedia, and gender studies.
    [Show full text]
  • Introduction
    Introduction Maibritt Borgen, Nanna Bonde Thylstrup & Kristin Veel Introdution controversy, as the initiative was both lauded and lamented for its potential to disrupt the tradition- Wikipedia is one of the most visited knowledge re- al structures of the knowledge field and industry sources in the world. The Alexa traffic rankings put (Rosenzweig, 2006; Ford, this issue). The discus- it at number 7, well above the New York Times (104), sions revolved in particular around the authority of the BBC (106), the Library of Congress (1,175), experts versus lay people. Today this polarization and the venerable Encyclopedia Britannica (3711) has given way to closer cooperation between Wiki- (Alexa, 2016). As historian Roy Rosenzweig puts it, pedia and other traditional knowledge-producing Wikipedia has become "perhaps the largest work of communities such as libraries and museums.1 Yet, as online historical writing, the most widely read work the dust settles over the expert/layman disputes, new of digital history, and the most important free histori- contours of contestation have become apparent. One cal resource on the World Wide Web" (Rosenzweig is the political ideology and ideological potential of 2006, p. 52). Wikipedia has become so ingrained in Wikipedia (Firer-Blaess and Fuchs, 2014, p. 87-103; our everyday search for information that users rarely Chozik, 2013, June 27).2 Another is its politics of give thought to the mechanisms and agency under- transparency (Tkacz, 2015). A third is its bureaucrat- neath its production of knowledge: who produces its ic structures (Jemielniak, 2014). But the most per- content? And what visible and invisible structures sistent point of contestation remains its gender gap govern this production? Indeed, we have come to problem.
    [Show full text]
  • Selling Sex: What Determines Rates and Popularity? an Analysis of 11,500 Online Profiles, Culture, Health & Sexuality
    This is an authors’ copy of: Alicia Mergenthaler & Taha Yasseri (2021) Selling sex: what determines rates and popularity? An analysis of 11,500 online profiles, Culture, Health & Sexuality. DOI: 10.1080/13691058.2021.1901145 Selling sex: what determines rates and popularity? An analysis of 11,500 online profiles Alicia Mergenthalera, Taha Yasseri*abc aOxford Internet Institute, University of Oxford, Oxford, UK; bSchool of Sociology, University College Dublin, Dublin, Ireland; cGeary Institute for Public Policy, University College Dublin, Dublin, Ireland *Corresponding Author: Taha Yasseri: [email protected] Abstract Sex work, or the exchange of sexual services for money or goods, is ubiquitous across eras and cultures. However, the practice of selling sex is often hidden due to stigma and the varying legal status of sex work. Online platforms that sex workers use to advertise services have become an increasingly important means of studying a market that is largely hidden. Although prior literature has primarily shed light on sex work from a public health or policy perspective (focusing largely on female sex workers), there are few studies that empirically research patterns of service provision in online sex work. This study investigated the determinants of pricing and popularity in the market for commercial sexual services online by using data from the largest UK network of online sexual services, a platform that is the industry-standard for sex workers. While the size of these influences varies across genders, nationality, age and the services provided are shown to be primary drivers of rates and popularity in sex work. Keywords: sex work, popularity dynamics, gender, online marketplace, UK Introduction In this article, we analyse a dataset from AdultWork.com, a UK-based online platform to determine drivers behind pricing and popularity in the market for sex work.
    [Show full text]
  • Interview with Sue Gardner, Executive Director WMF October 1, 2009 510 Years from Now, What Is Your Vision?
    Interview with Sue Gardner, Executive Director WMF October 1, 2009 5-10 years from now, what is your vision? What's different about Wikimedia? Personally, I would like to see Wikimedia in the top five for reach in every country. I want to see a broad, rich, deep encyclopedia that's demonstrably meeting people's needs, and is relevant and useful for people everywhere around the world. In order for that to happen, a lot of things would need to change. The community of editors would need to be healthier, more vibrant, more fun. Today, people get burned out. They get tired of hostility and endless debates. Working on Wikipedia is hard, and it does not offer many rewards. Editors have intrinsic motivation not extrinsic, but even so, not much is done to affirm or thank or recognize them. We need to find ways to foster a community that is rich and diverse and friendly and fun to be a part of. That world would include more women, more newly-retired people, more teachers ± all different kinds of people. There would be more ways to participate, more affirmation, more opportunities to be social and friendly. We need a lot of tools and features to help those people be more effective. Currently, there are tons of hacks and workarounds that experienced editors have developed over time, but which new editors don©t know about, and would probably find difficult to use. We need to make those tools visible and easier to use, and we need to invent new ones where they are lacking.
    [Show full text]
  • Taha Yasseri
    Curriculum Vitae TAHA YASSERI Personal Address Birth: 06. Sep. 1984 (27), Tehran Department of Theoretical Physics Nationality: Iranian Budapest University of Technology and Economics Marital status: single Budafoki ´ut8. http://www.phy.bme.hu/∼yasseri H1111 Budapest, Hungary CURRENT POSITION Post-doctoral fellow in European Commission project ICTeCollective, Research Group of Prof. J´anos Kert´esz. Department of Theoretical Physics, Budapest University of Technology and Economics, Budapest, Hungary. Since September 2010 WORK EXPERIENCES • Assistant Scientist, Institute for theoretical Physics, Georg-August-Universit¨at G¨ottingen,Germany. January 2009 - July 2010 • Researcher, SFB 602, Complex Structures in Condensed Matter from Atomic to Mesoscopic Scales, Georg-August-Universit¨atG¨ottingen,Germany. January 2007 - January 2009 • Member of Scientific Committee of the First Step to Nobel Prize in Physics National Competition, Young Scholars Club, Tehran, Iran. November 2003 - March 2005 • Teacher of Physics, Farzanegan and Allame Helli High Schools under the supervision of National Or- ganization for Developing Exceptional Talents, Tehran, Iran. September 2002 - April 2004 • Teacher of Physics Laboratory, Young Scholars Club, Tehran, Iran. April 2002 - September 2005 EDUCATION PhD, Physics, Faculty of Physics, Georg-August-Universit¨at G¨ottingen,Germany. January 2007 - January 2010 TOPIC: Pattern formation at ion-sputtered surfaces. (Adviser: Prof. Dr. R. Kree, Co-adviser: Prof. Dr. A.K. Hartmann) Secondary topics: Computational Neuroscience and Physics of Polymers. Master of Science, Physics, Department of Physics, Sharif University of Technology Tehran, Iran. September 2005 - September 2006 THESIS - Localization in scale free networks. (Adviser: Prof. Dr. M. Reza Rahimi Tabar) Bachelor of Science, Physics, Department of Physics, Sharif University of Technology Tehran, Iran.
    [Show full text]
  • Modeling the Rise in Internet-Based Petitions Taha Yasseri, Scott A
    Yasseri et al. RESEARCH Modeling the rise in Internet-based petitions Taha Yasseri, Scott A. Hale and Helen Z. Margetts Oxford Internet Institute, University of Oxford, OX1 3JS Oxford, UK Abstract Contemporary collective action, much of which involves social media and other Internet-based platforms, leaves a digital imprint which may be harvested to better understand the dynamics of mobilization. Petition signing is an example of collective action which has gained in popularity with rising use of social media and provides such data for the whole population of petition signatories for a given platform. This paper tracks the growth curves of all 20,000 petitions to the UK government over 18 months, analyzing the rate of growth and outreach mechanism. Previous research has suggested the importance of the first day to the ultimate success of a petition, but has not examined early growth within that day, made possible here through hourly resolution in the data. The analysis shows that the vast majority of petitions do not achieve any measure of success; over 99 percent fail to get the 10,000 signatures required for an official response and only 0.1 percent attain the 100,000 required for a parliamentary debate. We analyze the data through a multiplicative process model framework to explain the heterogeneous growth of signatures at the population level. We define and measure an average outreach factor for petitions and show that it decays very fast (reducing to 0.1% after 10 hours). After 24 hours, a petition's fate is virtually set. The findings seem to challenge conventional analyses of collective action from economics and political science, where the production function has been assumed to follow an S-shaped curve.
    [Show full text]