Teachers' Perception of Jupyter and R Shiny As Digital Tools for Open

Total Page:16

File Type:pdf, Size:1020Kb

Teachers' Perception of Jupyter and R Shiny As Digital Tools for Open Hanˇc et al. RESEARCH Teachers' perception of Jupyter and R Shiny as digital tools for open education and science Jozef Hanˇc1* , Peter Strauchˇ 1, Eva Paˇnkov´a1 and Martina Hanˇcov´a2 *Correspondence: [email protected] Abstract 1Institute of Physics, Faculty of Science, Pavol Jozef Saf´arikˇ During the last ten years advances in open-source digital technology, used University, Koˇsice,Slovakia especially by data science, led to very accessible ways how to obtain, store, Full list of author information is process, analyze or share data in almost every human activity. Data science tools available at the end of the article yEqual contributor bring not only transparency, accessibility, and reproducibility in open science, but also give benefits in open education as learning tools for improving effectiveness of instruction. Together with our pedagogical introduction and review of Jupyter as an interactive multimedia learning tool we present our three-years long research in the framework of a complex mixed-methods approach which examines physics teachers' perception of Jupyter technology in three groups: Ph.D. candidates in physics education research (PER) (N = 9), pre-service physics teachers (N = 33) and in-service physics teachers (N = 40). Despite the fact that open-source Jupyter notebooks are natural and easy as email or web, the results suggest that in-service teachers are not prepared for Jupyter technology and open analysis, but positively accept open education data presented via another open-source data science tool, R Shiny interactive web application, as an important form of immediate feedback and learning about the quality of their instruction. Simultaneously our instruction results in the frame of the Flipped Learning also indicate that young beginning PER researchers and pre-service physics teachers can master key digital skills to work with Jupyter technology appreciating its big impact on their learning, data and statistical literacy or professional development. All results support the ongoing worldwide effort to implement Jupyter in traditional education as a promising free open-source interactive learning tool to foster learning process, especially for the upcoming young generation. Keywords: data science tools; learning tools; interactive multimedia; data visualization; data analysis; user experience questionnaire; flipped learning Introduction arXiv:2007.11262v2 [physics.ed-ph] 23 Jul 2020 During the last decade, new digital technologies such as mobile devices, cloud in- frastructure, open data, artificial intelligence, decentralized and social networks (Downes, 2019) caused that we live in the digital and data-intensive age. Ninety percent of all data has been created in the last two years (Boaler, 2020; Marr, 2018). Data touches all aspects of our lives. The world economy, our jobs, our health, our environment and our roles as citizens increasingly depend on the knowledge, skills and technology required to work with, understand, and effectively use data (Boisvert et al, 2016). In April 2020 GitHub (GitHub, Inc., 2020; Warner, 2018), the world largest web cloud platform for storing, social coding and collaborating on any open code or Hanˇc et al. Page 2 of 32 digital content, reached a further milestone, 50 million developers working on over 100 million repositories | 3,000 times more than after its starting year 2008. Simultaneously, during last years open-source software and data science tools, namely programming languages Python and R together with environments for work- ing with them | especially Jupyter and RStudio, have conquered the data science world (kaggle, 2020) providing everybody free, open, revolutionary and very acces- sible ways how to gather, store, process, analyze, present or share data in almost every human activity. Speaking about science, in the near future, the European Open Science Cloud (EOSC, eosc-portal.eu), officially launched at the end of 2018, will offer to mil- lions of European researchers, professionals, but also university students, in science, technology, the humanities and social sciences a virtual environment with open and seamless services for storage, management, analysis and re-use of open research data, interoperable among all scientific domains or EU member states. By federat- ing, the EOSC will connect existing and being built scientific data infrastructures, currently dispersed and isolated across disciplines and borders of EU states. One of the key and easily accessible open digital research environments in EOSC (see e.g. the infrastructure EOSC project OpenDreamKit, 2015-2019, opendreamkit.org) should become the mentioned technology of Jupyter Notebooks. The highest ambi- tion of the EOSC is to change the way we do science, to open it. As for education, in 2015 from US a worldwide Call for Action to promote data literacy swept the world (Boisvert et al, 2016; EDC Oceans of Data Institute, 2015). Its signatories call for \a revolution in education, placing data literacy at its core, in- tegrated throughout K-16 education nationwide and around the world. By enabling learners to use data more effectively, we prepare them to make better decisions and to lead more secure, better-informed and productive lives." On February 2020, Jo Boaler, a professor of mathematics education at Stanford, invited a group of fifty mathematicians, data scientists, teachers and education policy leaders to start a movement, a YouCubed initiative, which will modernize the K-12 math curriculum and will prepare high school students to the data age (Boaler, 2020; Spector, 2020). Jupyter in science and education One of the current main data science tools for open science is Jupyter technology of interactive multimedia notebooks | shortly and officially called Jupyter Notebooks. From the viewpoint of users, the technology is natural and easy as email or web. It was released in 2014 in the frame of Project Jupyter (jupyter.org, Kluyver et al, 2016; Project Jupyter et al, 2018) as a free, open alternative to the well-known Mathe- matica Notebooks from commercial software Wolfram Mathematica (wolfram.com). As for the technology platform, Project Jupyter evolved from IPython, a personal side project of a Jupyter-project co-founder and former nuclear physicist, Fernando Per´ez. Jupyter is an open source, free interactive web computing environment, accessi- ble through any modern web browser, that enables users to use, modify or create interactive multimedia web documents which mix live code (over 100 computer lan- guages with a focus on Python, called kernels), interactive computations, equations, simulations, plots, images, narrative texts, annotations, audios or videos. Hanˇc et al. Page 3 of 32 Over the last 5 years, Jupyter Notebooks have become the most widely used as an environment for performing scientific calculations, open analysis, data processing and scientific reporting (Frederickson, 2019). Data and analysis of the recent Nobel prize-winning detections of gravitational waves (2015), predicted by Einstein in 1915, were released as a Jupyter Notebook. At the same time we are witnesses of an explosion of scientific and technical articles dealing with Jupyter's application in science, technology and education (Fig.1). Figure 1 Annual numbers of scientific publications dealing with Jupyter in four scholar databeses during 2015-2019. Our review as bar plots was effectively created using Scientific Python in a Jupyter notebook (data analysis library pandas). As we can see in hundreds of scholarly papers connected to education (Fig.1), e.g. Weiss(2017) { chemistry, Koehler and Kim(2018) { mathematics, Odden et al (2019) { physics, Wright et al(2020) { biology, Cardoso et al(2019) { engineer- ing, Jupyter is widely applied in open STEM and STEAM education (Khine and Areepattamannil, 2019), supporting data literacy and including also programming, statistics, data science, cognitive science, computer science, machine learning, digi- tal humanities, scientific computation or robotics (Barba et al, 2019). Today, Jupyter notebooks, as living interactive \storytelling" documents, is the technology behind many innovative educational programs and also became Hanˇc et al. Page 4 of 32 platform-of-choice for tutorials, workshops, online lessons, and even books. One of the best elaborated book examples is an open GitHub handbook Teaching and Learning with Jupyter (Barba et al, 2019) covering topics like why and how to use Jupyter in education, pedagogical instruction designs and case studies elicited from authors' real experience or technical details of Jupyter implementation in practice. Another example is Coursera (coursera.org), a world-wide MOOC platform, offering many free courses and guided projects using Jupyter as a key learning tool. The revolutionary significance of Jupyter for open science and education also became the subject of philosophical reflections, e.g. from a writer and programmer J. Sommers (Somers, 2018) or a Nobel prize laureate P. Romer (Romer, 2018) or a Canadian philosopher and open education visionary S. Downes (Downes, 2019). Jupyter as a tool of interactive teaching methods Finally in this introductory section, we would like to point out three important, but less known, Jupyter features in connection with three modern interactive instruc- tion approaches which are \designed to promote conceptual understanding through interactive engagement of students in heads-on (always) and hands-on (usually) activities which yield immediate feedback through discussion with peers and/or instructors" (Fraser
Recommended publications
  • Staff Guide to Google Classroom
    Staff Guide to Google Classroom Preston Lodge High School March 2018 Preston Lodge High School Connected Learning Contents Logging in 3 Setting up Google Classroom on your mobile device 4 Google Classroom main pages 5 Creating a class 6 Invite other teachers to a class 6 Archiving and deleting classes 7 Adding students to your class 8 Making an announcement 9 Sharing resources with your classes 10 Creating Assignments 13 Marking assignments 14 Annotating pupil assignments on mobile devices 15 Creating a Question 16 Using Google forms for surveys and assignments 18 Checking responses from Google Forms 21 Reuse a post 22 Communicating with guardians / parents 23 Using Google classroom as a markbook 24 Getting more help 25 Gareth Evans Staff Guide to Google Classroom Page 2 / 26 Preston Lodge High School Connected Learning Logging in To make use of Google Classroom, you need to login to Google using your Edubuzz account. Personal Google Accounts (such as your home @gmail.com account) will not have access to GApps for Education, which includes Google Classroom. Most browsers are compatible with Google Classroom, although Google Chrome is highly recommend. You will find this in the Applications folder on the Desktop of your school computer. Once opened, you can pin it to the taskbar (bar at the bottom of a Windows PC) by right clicking on the icon as selecting Pin this program to ​ taskbar. Internet Explorer, the default browser on school PCs, isn’t the best browser for Google Apps. ​ You can login to Google by visiting http://www.google.com and ​ ​ selecting the Sign in button at the top right of the screen.
    [Show full text]
  • Jupyter Notebooks—A Publishing Format for Reproducible Computational Workflows
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elpub digital library Jupyter Notebooks—a publishing format for reproducible computational workflows Thomas KLUYVERa,1, Benjamin RAGAN-KELLEYb,1, Fernando PÉREZc, Brian GRANGERd, Matthias BUSSONNIERc, Jonathan FREDERICd, Kyle KELLEYe, Jessica HAMRICKc, Jason GROUTf, Sylvain CORLAYf, Paul IVANOVg, Damián h i d j AVILA , Safia ABDALLA , Carol WILLING and Jupyter Development Team a University of Southampton, UK b Simula Research Lab, Norway c University of California, Berkeley, USA d California Polytechnic State University, San Luis Obispo, USA e Rackspace f Bloomberg LP g Disqus h Continuum Analytics i Project Jupyter j Worldwide Abstract. It is increasingly necessary for researchers in all fields to write computer code, and in order to reproduce research results, it is important that this code is published. We present Jupyter notebooks, a document format for publishing code, results and explanations in a form that is both readable and executable. We discuss various tools and use cases for notebook documents. Keywords. Notebook, reproducibility, research code 1. Introduction Researchers today across all academic disciplines often need to write computer code in order to collect and process data, carry out statistical tests, run simulations or draw figures. The widely applicable libraries and tools for this are often developed as open source projects (such as NumPy, Julia, or FEniCS), but the specific code researchers write for a particular piece of work is often left unpublished, hindering reproducibility. Some authors may describe computational methods in prose, as part of a general description of research methods.
    [Show full text]
  • Deep Dive on Project Jupyter
    A I M 4 1 3 Deep dive on Project Jupyter Dr. Brian E. Granger Principal Technical Program Manager Co-Founder and Leader Amazon Web Services Project Jupyter © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Project Jupyter exists to develop open-source software, open standards and services for interactive and reproducible computing. https://jupyter.org/ Overview • Project Jupyter is a multi-stakeholder, open source project. • Jupyter’s flagship application is the Text Math Jupyter Notebook. • Notebook document format: • Live code, narrative text, equations, images, visualizations, audio. • ~100 programming languages supported. Live code • Over 500 contributors across >100 GitHub repositories. • Part of the NumFOCUS Foundation: • Along with NumPy, Pandas, Julia, Matplotlib,… Charts Who uses Jupyter and how? Students/Teachers Data science Data Engineers Machine learning Data Scientists Scientific computing Researchers Data cleaning and transformation Scientists Exploratory data analysis ML Engineers Analytics ML Researchers Simulation Analysts Algorithm development Reporting Data visualization Jupyter has a large and diverse user community • Many millions of Jupyter users worldwide • Thousands of AWS customers • Highly international • Over 5M public notebooks on GitHub alone Example: Dive into Deep Learning • Dive into Deep Learning is a free, open-source book about deep learning. • Zhang, Lipton, Li, Smola from AWS • All content is Jupyter Notebooks on GitHub. • 890 page PDF, hundreds of notebooks • https://d2l.ai/ © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Common threads • Jupyter serves an extremely broad range of personas, usage cases, industries, and applications • What do these have in common? Ideas of Jupyter • Computational narrative • “Real-time thinking” with a computer • Direct manipulation user interfaces that augment the writing of code “Computers are good at consuming, producing, and processing data.
    [Show full text]
  • What Is Google Classroom? Google Classroom Is a Class-Organization
    What is Google Classroom? Google Classroom is a class-organization platform that incorporates Google's core G Suite (Google Docs, Sheets, Slides, Drive, and other Google products) so students can access everything they need for a class, including assignments, projects, and files. Can my child use a mobile device or tablet to access Google classroom? Absolutely. Like all the other Google apps available, your student can access the full functionality of Google Classroom from virtually any device. How tech savvy do I need to be to help my kid with Google Classroom? If your kids are younger, it's probably a good idea to have some familiarity with Google's G Suite so you can help your kid upload documents, check the calendar, and do other tasks. It also helps to know how the programs work so you can at least describe the problem to a teacher if anything goes wrong. Older kids may not need any help. Google Classroom is designed to be easy to use, and there's lots of online help. How do we get started? The process for setting up your child's account is pretty simple. BCCS sets up the classrooms and invites all students to the right classes. All they have to do is accept the invite. Your child can get into their classroom anytime by going to classroom.google.com. Students may have multiple classrooms for different classes. ​ From their dashboard, they can choose the class they want to view. Once they enter a classroom, students find four tabs at the top of the page.
    [Show full text]
  • Social Studies Classroom Google Maps and Apps to Improve Student Learning
    Social Studies Classroom Google Maps and Apps to Improve Student Learning Andrew San Angelo Newtown Middle School Newtown Connecticut Today’s Agenda O Google Classroom O Google Slides and Classroom O Google Map Investigation O Peardeck and Google Classroom O Questions O Making of ... Please join my Google Classroom h8wpad Google Slides How to Analyze Political Cartoons Observe Reflect Question How to Analyze Political Cartoons Observe Reflect Question The Star Spangled Banner Oh, say, can you see, by the dawn's early light, What so proudly we hailed at the twilight's last gleaming? Whose broad stripes and bright stars, thro' the perilous fight; O'er the ramparts we watched, were so gallantly streaming. And the rockets red glare, the bombs bursting in air, Gave proof through the night that our flag was still there. Oh, say, does that star-spangled banner yet wave O'er the land of the free and the home of the brave? The Star Spangled Banner Oh, say, can you see, by the _______, What so happy we met at the ______________? Whose wide stripes and _______, thro' the ____ fight; O'er the walls we watched, were so boldly flowing? And the rockets red flash, the bombs exploding in sky, Gave evidence through the evening that our flag was still there. Oh, say, does that colored flag yet flap O'er the land of the free and the house of the heroic? The Star Spangled Banner Oh, say, can you see, by the dawn's early light, What so proudly we hailed at the twilight's last gleaming? Whose broad stripes and bright stars, thro' the perilous fight; O'er the ramparts we watched, were so gallantly streaming.
    [Show full text]
  • Google-Classsroom-FAQ.Pdf
    FAQ’s Install Google Classroom on Android devices To use Classroom on your Android device, download the Classroom app from Google Play. 1. On your device, tap Google Play Store. 2. Find and install the Google Classroom app. Install Google Classroom on an iPhone To use Classroom on your iPhone® or iPad®, download the Classroom app from the App Store. 1. On your device, tap App Store. 2. Find and install the Google Classroom app. FAQ’s Sign in for the first time 1. Tap Classroom 2. Tap Get Started. 3. Tap Add account. 4. Enter your students email address and password (Passwords cannot be changed) 5. Enter your password and tap Next. 6. If there is a welcome message, read it and tap Accept. FAQ’s 7. Once logged into Google Classroom you will see a screen, similar to the below, that will list the classes your child has and grade level announcements. 8. To access homework for a particular class just click in that particular classroom. FAQ’s Sign in daily 1. Tap Classroom 2. To log into Google Classroom to check your student’s progress daily, enter your students email address and password (Passwords will be provided to you by your teacher and CANNOT be changed) 3. Once logged into Google Classroom you will see a screen, similar to the below, that will list the classes your child has and grade level announcements. 4. To access homework for a particular class just click in that particular classroom. FAQ’s How to log on from your home computer 1.
    [Show full text]
  • Student Directions for Accessing Google Classroom and Meet
    Student Directions for Accessing Google Classroom and Meet Student Directions for Accessing Google Classroom and Meet Getting to Google Classroom Joining a Meet Session Rules for Tools in Google Meet Getting to Google Classroom Students can access Google Meet through their teacher’s Google Classroom. Students can access Google Classroom in three ways. 1. Use the Chrome browser and go to classroom.google.com and login using your ​ ​ @fcpsschools.net account 2. If you are already logged in to G Suite, go to the menu at the top right of your screen and select the Classroom icon. You may need to scroll down to locate the Classroom icon. 3. If your teacher has added a link to Google Classroom to their Blackboard course, you can also go to Google Classroom through your teacher’s course. Joining a Meet Session Once logged into your teacher’s Google Classroom, there are two ways to join a Meet Session 1. Click the link in the Course Banner 2. Go to the Classwork tab and click the Meet icon. If you do not see the above options, your teacher has not enabled Google Meet in their Google Classroom. Rules for Tools in Google Meet Audio: ● Enter Meet with your audio turned off. ● Turn on the microphone when called on and turn it off when you finish speaking. Chat: ● Use kind and appropriate language and images. Video: ● Enter Meet with your video off. ● Follow your teacher’s directions on whether to turn on your video; however, students always have the option to keep their camera turned off.
    [Show full text]
  • STEM School Approved Apps & Websites
    Last Updated 8/22/2020 Resource Name Privacy Policy Terms of Service Notes For school-based activities, teachers and school administrators can act in the place of parents to provide consent for the collection of personal data from children. Schools should always notify parents about these activities. 3DS Max (Autodesk) Privacy Policy Children's Privacy 7zip Terms no need to register or enter personal info to use Abcya Privacy Policy Terms The Platform does not require Children to provide their name, address, or other contact information in order to play games. Ableton Music Software Privacy Policy Trial requires account/some personal details Schools that participate in the primary and secondary education named user offering may issue a child under 13 an enterprise-level Adobe ID, Adobe products Privacy Policy [1] Terms [2] but only after obtaining express parental consent. The personal information you provide will be used for the purpose for which it was provided - to contact you, to process an order, to Advanced IP Scanner Privacy Policy register your product Advanced Port Scanner Privacy Policy Advent of code Terms "uses OAuth to confirm your identity through other services, this reveals no information about you beyond what is already public" Affinity Photo & Designer Privacy Policy Terms Affinity Privacy Letter If you are a teacher interested in using Albert with children under 13, please contact us at [email protected] and we will work with albert.io Privacy Policy Terms you toward a school license and collecting the necessary parental consents. allsides.org Privacy Policy Terms Website directed at 13 and older.
    [Show full text]
  • Google Classroom Is a Class-Organization Platform That
    What is Google Classroom? Google Classroom is a class-organization platform that incorporates Google's core G Suite (Google Docs, Sheets, Slides, Drive, and other Google application) so students can access everything they need for a class, assignments, group projects, files, and Google Hangouts Meet with the teacher or the entire class. Why we use Google Classroom? ● Google Classroom allows for assignments to be shared with individuals, groups of students, or whole classes to allow for personalization and differentiation within the one classroom page. ● Families will only have ONE platform to locate assignments for all their K-12 students. Families have indicated that the number of emails for their students and families have been overwhelming. In some cases, students are receiving multiple emails from multiple educators in different places. ● Students can “turn in” assignments virtually, and teachers can provide feedback to their students individually. ● Google Classroom protects student privacy because it is linked to Melrose Public Schools accounts. Platforms must be COPPA and PPRA compliant by federal law. ● Google Classroom can be easily translated for students and families. (You can either use the Google Translate extension or just alt-click on the actual page to choose the language-there are dozens!) What about students in grades K-2? ● All students including those in grades K-2 have successfully learned to navigate the use of Google Classroom this year in their Digital Literacy classrooms. Students have shown proficiency in opening the platform and navigating multiple assignments. As we progress, we will adjust and support students and families to navigate the platform.
    [Show full text]
  • SENTINEL-2 DATA PROCESSING USING R Case Study: Imperial Valley, USA
    TRAINING KIT – R01 DFSDFGAGRIAGRA SENTINEL-2 DATA PROCESSING USING R Case Study: Imperial Valley, USA Research and User Support for Sentinel Core Products The RUS Service is funded by the European Commission, managed by the European Space Agency and operated by CSSI and its partners. Authors would be glad to receive your feedback or suggestions and to know how this material was used. Please, contact us on [email protected] Cover image credits: ESA The following training material has been prepared by Serco Italia S.p.A. within the RUS Copernicus project. Date of publication: November 2020 Version: 1.1 Suggested citation: Serco Italia SPA (2020). Sentinel-2 data processing using R (version 1.1). Retrieved from RUS Lectures at https://rus-copernicus.eu/portal/the-rus-library/learn-by-yourself/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. DISCLAIMER While every effort has been made to ensure the accuracy of the information contained in this publication, RUS Copernicus does not warrant its accuracy or will, regardless of its or their negligence, assume liability for any foreseeable or unforeseeable use made of this publication. Consequently, such use is at the recipient’s own risk on the basis that any use by the recipient constitutes agreement to the terms of this disclaimer. The information contained in this publication does not purport to constitute professional advice. 2 Table of Contents 1 Introduction to RUS ........................................................................................................................
    [Show full text]
  • Final Report for the Redus Project
    FINAL REPORT FOR THE REDUS PROJECT Reduced Uncertainty in Stock Assessment Erik Olsen (HI), Sondre Aanes Norwegian Computing Center, Magne Aldrin Norwegian Computing Center, Olav Nikolai Breivik Norwegian Computing Center, Edvin Fuglebakk, Daisuke Goto, Nils Olav Handegard, Cecilie Hansen, Arne Johannes Holmin, Daniel Howell, Espen Johnsen, Natoya Jourdain, Knut Korsbrekke, Ono Kotaro, Håkon Otterå, Holly Ann Perryman, Samuel Subbey, Guldborg Søvik, Ibrahim Umar, Sindre Vatnehol og Jon Helge Vølstad (HI) RAPPORT FRA HAVFORSKNINGEN NR. 2021-16 Tittel (norsk og engelsk): Final report for the REDUS project Sluttrapport for REDUS-prosjektet Undertittel (norsk og engelsk): Reduced Uncertainty in Stock Assessment Rapportserie: År - Nr.: Dato: Distribusjon: Rapport fra havforskningen 2021-16 17.03.2021 Åpen ISSN:1893-4536 Prosjektnr: Forfatter(e): 14809 Erik Olsen (HI), Sondre Aanes Norwegian Computing Center, Magne Aldrin Norwegian Computing Center, Olav Nikolai Breivik Norwegian Program: Computing Center, Edvin Fuglebakk, Daisuke Goto, Nils Olav Handegard, Cecilie Hansen, Arne Johannes Holmin, Daniel Howell, Marine prosesser og menneskelig Espen Johnsen, Natoya Jourdain, Knut Korsbrekke, Ono Kotaro, påvirkning Håkon Otterå, Holly Ann Perryman, Samuel Subbey, Guldborg Søvik, Ibrahim Umar, Sindre Vatnehol og Jon Helge Vølstad (HI) Forskningsgruppe(r): Godkjent av: Forskningsdirektør(er): Geir Huse Programleder(e): Bentiske ressurser og prosesser Frode Vikebø Bunnfisk Fiskeridynamikk Pelagisk fisk Sjøpattedyr Økosystemakustikk Økosystemprosesser
    [Show full text]
  • Exploratory Data Science Using a Literate Programming Tool Mary Beth Kery1 Marissa Radensky2 Mahima Arya1 Bonnie E
    The Story in the Notebook: Exploratory Data Science using a Literate Programming Tool Mary Beth Kery1 Marissa Radensky2 Mahima Arya1 Bonnie E. John3 Brad A. Myers1 1Human-Computer Interaction Institute 2Amherst College 3Bloomberg L. P. Carnegie Mellon University Amherst, MA New York City, NY Pittsburgh, PA [email protected] [email protected] mkery, mahimaa, bam @cs.cmu.edu ABSTRACT engineers, many of whom never receive formal training in Literate programming tools are used by millions of software engineering [30]. programmers today, and are intended to facilitate presenting data analyses in the form of a narrative. We interviewed 21 As even more technical novices engage with code and data data scientists to study coding behaviors in a literate manipulations, it is key to have end-user programming programming environment and how data scientists kept track tools that address barriers to doing effective data science. For of variants they explored. For participants who tried to keep instance, programming with data often requires heavy a detailed history of their experimentation, both informal and exploration with different ways to manipulate the data formal versioning attempts led to problems, such as reduced [14,23]. Currently even experts struggle to keep track of the notebook readability. During iteration, participants actively experimentation they do, leading to lost work, confusion curated their notebooks into narratives, although primarily over how a result was achieved, and difficulties effectively through cell structure rather than markdown explanations. ideating [11]. Literate programming has recently arisen as a Next, we surveyed 45 data scientists and asked them to promising direction to address some of these problems [17].
    [Show full text]