Finding a Growth Business Model at Stackoverflow-Final2
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  An Empirical Study of Obsolete Answers on Stack Overflow1 An Empirical Study of Obsolete Answers on Stack Overflow Haoxiang Zhang, Shaowei Wang, Tse-Hsun (Peter) Chen, Ying Zou, Senior Member, IEEE, and Ahmed E. Hassan, Fellow, IEEE Abstract—Stack Overflow accumulates an enormous amount of software engineering knowledge. However, as time passes, certain knowledge in answers may become obsolete. Such obsolete answers, if not identified or documented clearly, may mislead answer seekers and cause unexpected problems (e.g., using an out-dated security protocol). In this paper, we investigate how the knowledge in answers becomes obsolete and identify the characteristics of such obsolete answers. We find that: 1) More than half of the obsolete answers (58.4%) were probably already obsolete when they were first posted. 2) When an obsolete answer is observed, only a small proportion (20.5%) of such answers are ever updated. 3) Answers to questions in certain tags (e.g., node.js, ajax, android, and objective-c) are more likely to become obsolete. Our findings suggest that Stack Overflow should develop mechanisms to encourage the whole community to maintain answers (to avoid obsolete answers) and answer seekers are encouraged to carefully go through all information (e.g., comments) in answer threads. Index Terms—Q&A Website, Stack Overflow, Obsolete Knowledge, Knowledge Sharing. F 1 INTRODUCTION ECHNICAL Q&A websites are becoming an important to handle such a situation2. Additionally, a survey of 453 T and popular platform for knowledge sharing and learn- Stack Overflow users reports that outdated code on Stack ing. They have revolutionized how users seek knowledge Overflow is one of the most important issues that users on the Internet.
- 
												  Uila Supported AppsUila Supported Applications and Protocols updated Oct 2020 Application/Protocol Name Full Description 01net.com 01net website, a French high-tech news site. 050 plus is a Japanese embedded smartphone application dedicated to 050 plus audio-conferencing. 0zz0.com 0zz0 is an online solution to store, send and share files 10050.net China Railcom group web portal. This protocol plug-in classifies the http traffic to the host 10086.cn. It also 10086.cn classifies the ssl traffic to the Common Name 10086.cn. 104.com Web site dedicated to job research. 1111.com.tw Website dedicated to job research in Taiwan. 114la.com Chinese web portal operated by YLMF Computer Technology Co. Chinese cloud storing system of the 115 website. It is operated by YLMF 115.com Computer Technology Co. 118114.cn Chinese booking and reservation portal. 11st.co.kr Korean shopping website 11st. It is operated by SK Planet Co. 1337x.org Bittorrent tracker search engine 139mail 139mail is a chinese webmail powered by China Mobile. 15min.lt Lithuanian news portal Chinese web portal 163. It is operated by NetEase, a company which 163.com pioneered the development of Internet in China. 17173.com Website distributing Chinese games. 17u.com Chinese online travel booking website. 20 minutes is a free, daily newspaper available in France, Spain and 20minutes Switzerland. This plugin classifies websites. 24h.com.vn Vietnamese news portal 24ora.com Aruban news portal 24sata.hr Croatian news portal 24SevenOffice 24SevenOffice is a web-based Enterprise resource planning (ERP) systems. 24ur.com Slovenian news portal 2ch.net Japanese adult videos web site 2Shared 2shared is an online space for sharing and storage.
- 
												  Stack Overflow Question Retrieval System Vishi Cline Southern Methodist University, [email protected]SMU Data Science Review Volume 1 | Number 2 Article 13 2018 Stack Overflow Question Retrieval System Vishi Cline Southern Methodist University, [email protected] Abhishek Dharwadkar Southern Methodist University, [email protected] Rajni Goyal Southern Methodist University, [email protected] Daniel Engels Southern Methodist University, [email protected] Raghuram Srinivas Southern Methodist University, [email protected] See next page for additional authors Follow this and additional works at: https://scholar.smu.edu/datasciencereview Recommended Citation Cline, Vishi; Dharwadkar, Abhishek; Goyal, Rajni; Engels, Daniel; Srinivas, Raghuram; and Rafiqi, Sohail (2018) "Stack Overflow Question Retrieval System," SMU Data Science Review: Vol. 1 : No. 2 , Article 13. Available at: https://scholar.smu.edu/datasciencereview/vol1/iss2/13 This Article is brought to you for free and open access by SMU Scholar. It has been accepted for inclusion in SMU Data Science Review by an authorized administrator of SMU Scholar. For more information, please visit http://digitalrepository.smu.edu. Stack Overflow Question Retrieval System Creative Commons License This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License Authors Vishi Cline, Abhishek Dharwadkar, Rajni Goyal, Daniel Engels, Raghuram Srinivas, and Sohail Rafiqi This article is available in SMU Data Science Review: https://scholar.smu.edu/datasciencereview/vol1/iss2/13 Cline et al.: Stack Overflow Question Retrieval System Stack Overflow Question Retrieval System Abhishek Dharwadkar1, Vishi Cline1, Rajni Goyal1 Daniel Engels2, Raghuram Srinivas2, Sohail Rafiqi2 1 Master of Science in Data Science, Southern Methodist University 6425 Boaz Lane, Dallas, TX 75205 {avdharwadkar, rajnig, vcline, rsrinivas}@mail.smu.edu {dwe, srafiqi}@lyle.smu.edu Abstract.
- 
												  The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) - Joel on SoftwareThe Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) - Joel on Software Joel on Software The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) by Joel Spolsky Wednesday, October 08, 2003 Ever wonder about that mysterious Content-Type tag? You know, the one you're supposed to put in HTML and you never quite know what it should be? Did you ever get an email from your friends in Bulgaria with the subject line "???? ?????? ??? ????"? I've been dismayed to discover just how many software developers aren't really completely up to speed on the mysterious world of character sets, encodings, Unicode, all that stuff. A couple of years ago, a beta tester for FogBUGZ was wondering whether it could handle incoming email in Japanese. Japanese? They have email in Japanese? I had no idea. When I looked closely at the commercial ActiveX control we were using to parse MIME email messages, we discovered it was doing exactly the wrong thing with character sets, so we actually had to write heroic code to undo the wrong conversion it had done and redo it correctly. When I looked into another commercial library, it, too, had a completely broken character code implementation. I corresponded with the http://www.joelonsoftware.com/articles/Unicode.html[5/11/2014 1:30:52 AM] The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) - Joel on Software developer of that package and he sort of thought they "couldn't do anything about it." Like many programmers, he just wished it would all blow over somehow.
- 
												  Glitch.Com What Is Glitch?glitch.com What is Glitch? “Everybody has good ideas of things we want to make on the Internet. But honestly, it’s gotten too hard. Gomix fixes that. If you’re not a professional developer, the barriers are way too high. There are pretty good tools if you just went to make a marketing website or a blog, but if your idea is an app or a bot or something more interactive, you’re out of luck. It ought to be as easy to make an app as it is to edit a blog post or to change a formula in a spreadsheet.” Anil Dash, December 6, 2016 https://medium.com/gomix/introducing-gomix-aec205c421cb#.fqutfey03 https://glitch.com/about/ Glitch handles the mess • While you work with Glitch, it will seamlessly upgrade your servers and cloud infrastructure in the background. • There’s no deployment or server provisioning because it all happens automatically. Backed by a real company Glitch • Glitch is made by Fog Creek, one of the most influential small tech companies in the world. • They made Trello, FogBugz and co-created Stack Overflow. Why did they make it In some ways, Glitch is a throwback to an older era of software or the internet, when there were simpler ways to get started making cool stuff. For people who were around at that time, they'll understand Glitch easily: It brings “View Source” back. The ability to just look at the code behind something, and tweak it, and make your own thing, was essential to making the Internet fun, and weird, and diverse, in its early days.
- 
												  I RETRIEVING CURATED STACK OVERFLOW POSTS of SIMILARRETRIEVING CURATED STACK OVERFLOW POSTS OF SIMILAR PROJECT TASKS Gláucia Melo dos Santos Dissertação de Mestrado apresentada ao Programa de Pós-graduação em Engenharia de Sistemas e Computação, COPPE, da Universidade Federal do Rio de Janeiro, como parte dos requisitos necessários à obtenção do título de Mestre em Engenharia de Sistemas e Computação. Orientador: Toacy Cavalcante de Oliveira Rio de Janeiro Dezembro de 2018 i RETRIEVING CURATED STACK OVERFLOW POSTS OF SIMILAR PROJECT TASKS Gláucia Melo dos Santos DISSERTAÇÃO SUBMETIDA AO CORPO DOCENTE DO INSTITUTO ALBERTO LUIZ COIMBRA DE PÓS-GRADUAÇÃO E PESQUISA DE ENGENHARIA (COPPE) DA UNIVERSIDADE FEDERAL DO RIO DE JANEIRO COMO PARTE DOS REQUISITOS NECESSÁRIOS PARA A OBTENÇÃO DO GRAU DE MESTRE EM CIÊNCIAS EM ENGENHARIA DE SISTEMAS E COMPUTAÇÃO. Examinada por: _____________________________________________ Prof. Toacy Cavalcante de Oliveira, D.Sc. _____________________________________________ Prof. Cláudia Maria Lima Werner, D.Sc. _____________________________________________ Prof. Eber Assis Schmitz, PhD. RIO DE JANEIRO, RJ - BRASIL DEZEMBRO DE 2018 ii Santos, Gláucia Melo dos Retrieving curated Stack Overflow Posts of similar project tasks / Gláucia Melo dos Santos – Rio de Janeiro: UFRJ/COPPE, 2018. XI,118 p.:il.; 29,7cm. Orientador: Toacy Cavalcante de Oliveira Dissertação (mestrado) – UFRJ/COPPE/Programa de Engenharia de Sistemas e Computação, 2018. Referências Bibliográficas: p.92-99 1. Software Engineering. 2. Curated Stack Overflow Posts. 3. Project Task Context Similarity. I. Oliveira, Toacy Cavalcante de. II. Universidade Federal do Rio de Janeiro, COPPE, Programa de Engenharia de Sistemas e Computação. III. Título. iii Agradecimentos A única maneira de começar é agradecendo a Deus. Sou imensamente grata. Agradeço aos meus pais Fátima e Cristóvão, minha irmã Gisele, minha madrinha Lila e prima Thaisa pelo apoio ILIMITADO enquanto estive buscando o título de mestre e nas tantas outras fases.
- 
												  An Analysis of Python's Topics, Trends, and Technologies Through Mining Stack Overflow Discussions1 An Analysis of Python’s Topics, Trends, and Technologies Through Mining Stack Overflow Discussions Hamed Tahmooresi, Abbas Heydarnoori, and Alireza Aghamohammadi F Abstract—Python is a popular, widely used, and general-purpose pro- (LDA) [3] to categorize discussions in specific areas such gramming language. In spite of its ever-growing community, researchers as web programming [4], mobile application development [5], have not performed much analysis on Python’s topics, trends, and tech- security [6], and blockchain [7]. However, none of them has nologies which provides insights for developers about Python commu- focused on Python. In this study, we shed light on the nity trends and main issues. In this article, we examine the main topics Python’s main areas of discussion through mining 2 461 876 related to this language being discussed by developers on one of the most popular Q&A websites, Stack Overflow, as well as temporal trends posts, from August 2008 to January 2019, using LDA based through mining 2 461 876 posts. To be more useful for the software topic modeling . We also analyze the temporal trends of engineers, we study what Python provides as the alternative to popular the extracted topics to find out how the developers’ interest technologies offered by common programming languages like Java. is changing over the time. After explaining the topics and Our results indicate that discussions about Python standard features, trends of Python, we investigate the technologies offered web programming, and scientific programming 1 are the most popular by Python, which are not properly revealed by the topic areas in the Python community.
- 
												  Networking Best Practices for Large Deployments Google, IncNetworking Best Practices for Large Deployments Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com Part number: NETBP_GAPPS_3.8 November 16, 2016 © Copyright 2016 Google, Inc. All rights reserved. Google, the Google logo, G Suite, Gmail, Google Docs, Google Calendar, Google Sites, Google+, Google Talk, Google Hangouts, Google Drive, Gmail are trademarks, registered trademarks, or service marks of Google Inc. All other trademarks are the property of their respective owners. Use of any Google solution is governed by the license agreement included in your original contract. Any intellectual property rights relating to the Google services are and shall remain the exclusive property of Google, Inc. and/or its subsidiaries (“Google”). You may not attempt to decipher, decompile, or develop source code for any Google product or service offering, or knowingly allow others to do so. Google documentation may not be sold, resold, licensed or sublicensed and may not be transferred without the prior written consent of Google. Your right to copy this manual is limited by copyright law. Making copies, adaptations, or compilation works, without prior written authorization of Google. is prohibited by law and constitutes a punishable violation of the law. No part of this manual may be reproduced in whole or in part without the express written consent of Google. Copyright © by Google Inc. Google provides this publication “as is” without warranty of any either express or implied, including but not limited to the implied warranties of merchantability or fitness for a particular purpose. Google Inc. may revise this publication from time to time without notice.
- 
												  Predicting Web Searcher Satisfaction with Existing Community-Based AnswersPredicting Web Searcher Satisfaction with Existing Community-based Answers Qiaoling Liu§, Eugene Agichtein§, Gideon Dror†, Evgeniy Gabrilovich‡, Yoelle Maarek†, Dan Pelleg†, Idan Szpektor†, §Emory University, Atlanta, GA, USA †Yahoo! Research, Haifa, Israel ‡Yahoo! Research, Santa Clara, CA, USA {qliu26, eugene}@mathcs.emory.edu, {gideondr, gabr, yoelle, dpelleg, idan}@yahoo-inc.com ABSTRACT 1. INTRODUCTION Community-based Question Answering (CQA) sites, such “Just because Google exists doesn’t mean you should as Yahoo! Answers, Baidu Knows, Naver, and Quora, have stop asking people things.” – Alexia Tsotsis [35] been rapidly growing in popularity. The resulting archives Community-based Question Answering (CQA) sites, such of posted answers to questions, in Yahoo! Answers alone, as Yahoo! Answers, Baidu Knows, and Naver Ji-Sik-In, as already exceed in size 1 billion, and are aggressively indexed well as more social-oriented newcomers such as Vark and by web search engines. In fact, a large number of search Quora, have gained substantial popularity over the recent engine users benefit from these archives, by finding existing years, effectively filling a niche left by the mainstream Web answers that address their own queries. This scenario poses search engines. People around the globe resort to commu- new challenges and opportunities for both search engines nity help for a variety of reasons, from lack of proficiency in and CQA sites. To this end, we formulate a new problem of Web search to seeking an answer “with a human touch”. Al- predicting the satisfaction of web searchers with CQA an- though some of these sites allow for monetary payments in swers.
- 
												  Google Ngram Viewer by Andrew WeissGoogle Ngram Viewer By Andrew Weiss Digital Services Librarian California State University, Northridge Introduction: Google, mass digitization and the emerging science of culturonomics Google Books famously launched in 2004 with great fanfare and controversy. Two stated original goals for this ambitious digitization project were to replace the generic library card catalog with their new “virtual” one and to scan all the books that have ever been published, which by a Google employee’s reckoning is approximately 129 million books. (Google 2014) (Jackson 2010) Now ten years in, the Google Books mass-digitization project as of May 2014 has claimed to have scanned over 30 million books, a little less than 24% of their overt goal. Considering the enormity of their vision for creating this new type of massive digital library their progress has been astounding. However, controversies have also dominated the headlines. At the time of the Google announcement in 2004 Jean Noel Jeanneney, head of Bibliothèque nationale de France at the time, famously called out Google’s strategy as excessively Anglo-American and dangerously corporate-centric. (Jeanneney 2007) Entrusting a corporation to the digitization and preservation of a diverse world-wide corpus of books, Jeanneney argues, is a dangerous game, especially when ascribing a bottom-line economic equivalent figure to cultural works of inestimable personal, national and international value. Over the ten years since his objections were raised, many of his comments remain prescient and relevant to the problems and issues the project currently faces. Google Books still faces controversies related to copyright, as seen in the current appeal of the Author’s Guild v.
- 
												  The Power of Google: Serving Consumers Or Threatening Competition? Hearing Committee on the Judiciary United States SenateS. HRG. 112–168 THE POWER OF GOOGLE: SERVING CONSUMERS OR THREATENING COMPETITION? HEARING BEFORE THE SUBCOMMITTEE ON ANTITRUST, COMPETITION POLICY AND CONSUMER RIGHTS OF THE COMMITTEE ON THE JUDICIARY UNITED STATES SENATE ONE HUNDRED TWELFTH CONGRESS FIRST SESSION SEPTEMBER 21, 2011 Serial No. J–112–43 Printed for the use of the Committee on the Judiciary ( U.S. GOVERNMENT PRINTING OFFICE 71–471 PDF WASHINGTON : 2011 For sale by the Superintendent of Documents, U.S. Government Printing Office Internet: bookstore.gpo.gov Phone: toll free (866) 512–1800; DC area (202) 512–1800 Fax: (202) 512–2104 Mail: Stop IDCC, Washington, DC 20402–0001 VerDate Nov 24 2008 11:11 Dec 21, 2011 Jkt 071471 PO 00000 Frm 00001 Fmt 5011 Sfmt 5011 S:\GPO\HEARINGS\71471.TXT SJUD1 PsN: CMORC COMMITTEE ON THE JUDICIARY PATRICK J. LEAHY, Vermont, Chairman HERB KOHL, Wisconsin CHUCK GRASSLEY, Iowa DIANNE FEINSTEIN, California ORRIN G. HATCH, Utah CHUCK SCHUMER, New York JON KYL, Arizona DICK DURBIN, Illinois JEFF SESSIONS, Alabama SHELDON WHITEHOUSE, Rhode Island LINDSEY GRAHAM, South Carolina AMY KLOBUCHAR, Minnesota JOHN CORNYN, Texas AL FRANKEN, Minnesota MICHAEL S. LEE, Utah CHRISTOPHER A. COONS, Delaware TOM COBURN, Oklahoma RICHARD BLUMENTHAL, Connecticut BRUCE A. COHEN, Chief Counsel and Staff Director KOLAN DAVIS, Republican Chief Counsel and Staff Director SUBCOMMITTEE ON ANTITRUST, COMPETITION POLICY AND CONSUMER RIGHTS HERB KOHL, Wisconsin, Chairman CHUCK SCHUMER, New York MICHAEL S. LEE, Utah AMY KLOBUCHAR, Minnesota CHUCK GRASSLEY, Iowa AL FRANKEN, Minnesota JOHN CORNYN, Texas RICHARD BLUMENTHAL, Connecticut CAROLINE HOLLAND, Democratic Chief Counsel/Staff Director DAVID BARLOW, Republican General Counsel (II) VerDate Nov 24 2008 11:11 Dec 21, 2011 Jkt 071471 PO 00000 Frm 00002 Fmt 5904 Sfmt 5904 S:\GPO\HEARINGS\71471.TXT SJUD1 PsN: CMORC C O N T E N T S STATEMENTS OF COMMITTEE MEMBERS Page Feinstein, Hon.
- 
												  April 5, 2012 Isabelle Falque-Pierrotin Presidente CNIL 8, RueApril 5, 2012 Isabelle Falque-Pierrotin Presidente CNIL 8, rue Vivienne 75002 Paris, FR Re: Google Dear Madame la Presidente, Thank you for your letter and questionnaire of March 16, 2012, responding to our letter to you of February 28, 2012. We are happy to provide the additional information you request. Your questionnaire is wide ranging, covering a number of technical and legal areas in detail. Only some of the 69 questions you ask relate to our new Privacy Policy changes specifically, with many of the questions exploring broader privacy matters. While we are committed to providing the CNIL and all European data protection authorities with answers to the questions asked, we will, as agreed with your staff, need slightly longer than the time you have suggested to respond. As an interim step, we attach our responses to questions 1 to 24. The rest will follow as soon as complete. In addition to our written responses, Google would, as noted in our letter of February 28, 2012, also welcome the chance to meet with the CNIL to explain and discuss Google’s approach to providing information to users. This is an important issue for us. We have taken a great deal of time and care in designing our approach. In our very first letter to the CNIL on this topic, we emphasised that while we did not feel able to pause the implementation of our Privacy Policy, we would welcome the opportunity to discuss how and where Google provides information to our users. We have reached out many times to the CNIL asking for a meeting to discuss this, and we make that offer again now.