Saturday 1 February - First Part

SEE YOU TOMORROW
BRUSSELS 1 FEBRUARY - 2 FEBRUARY

SATURDAY 1 FEBRUARY - FIRST PART

Janson
  09:30-09:55  Welcome to FOSDEM 2020 (FOSDEM Staff)
  10:00-10:50  The Linux Kernel: We have to finish this thing one day ;): Solving big problems in small steps for more than two decades (Thorsten Leemhuis)
  11:00-11:50  LibreOffice turns ten and what's next (Michael Meeks)
  12:00-12:50  Over Twenty Years Of Automation (James Shubin)
  13:00-13:50  Blender, Coming of Age (Ton Roosendaal)

K.1.105 (La Fontaine)
  10:00-10:50  How FOSS could revolutionize municipal government: with recent real-world examples (Danese Cooper)
  11:00-11:50  The Selfish Contributor Explained (James Bottomley)
  12:00-12:50  The Ethics Behind Your IoT (Molly de Blanc)
  13:00-13:50  Freedom and AI: Can Free Software include ethical AI systems? (Justin W. Flory / Michael Nolan)

H.2215 (Ferrer)
  12:00-12:15  Civil society needs Free Software hackers (Matthias Kirschner)
  12:20-12:35  A tool for Community Supported Agriculture (CSA) management, OpenOlitor (Mikel Cordovilla)
  12:40-12:55  What's in my food? Open Food Facts, the Wikipedia of Food (Pierre Slamich)
  13:00-13:15  Web3 - the Internet of Freedom, Value, and Trust (Bruno Škvorc)
  13:20-13:35  Next, the programmable web browser (Atlas Engineer)
  13:40-13:55  Weblate: open-source continuous localization platform (Václav Zbránek)

H.1301 (Cornil)
  10:30-11:00  Getting started with quantum software development (Tomas Babej)
  11:05-11:40  Quantum machine learning with PennyLane (Joshua Izaac)
  11:50-12:25  Quantum computing hardware and control systems (Felix Tripier)
  12:35-13:10  The role of open source in building quantum computing ecosystem from scratch (Hakob Avetisyan)
  13:20-13:55  Quantum Advantage and Quantum Computing in the Real World (Mark Mattingley-Scott)

H.1302 (Depage)
  10:30-11:15  State of OpenJDK (Mark Reinhold)
  11:20-11:45  Project Loom: Advanced concurrency for fun and profit (Andrew Haley)
  11:50-12:15  TornadoVM: A Virtual Machine for Exploiting High-Performance Heterogeneous Execution of Java Programs (Thanos Stratikopoulos)
  12:20-13:00  ByteBuffers are dead, long live ByteBuffers! (Maurizio Cimadamore)
  13:05-13:30  Free at Last! The Tale of Jakarta EE (Mike Milinkovich)
  13:35-14:00  Shenandoah 2.0 (Roman Kennke)

SATURDAY 1 FEBRUARY - SECOND PART

Janson
  14:00-14:50  The Hidden Early History of Unix: The Forgotten history of early Unix (Warner Losh)
  15:00-15:50  Generation gaps (Liam Proven)
  16:00-16:50  HTTP/3 for everyone (Daniel Stenberg)
  17:00-17:50  State of the Onion (Pili)
  18:00-18:50  SCION (Mateusz Kowalski / Kamila Součková)

K.1.105 (La Fontaine)
  14:00-14:50  How Containers and Kubernetes re-defined the GNU/Linux Operating System (Daniel Riek)
  15:00-15:50  Fixing the Kubernetes clusterfuck (Kris Nova)
  16:00-16:50  Address Space Isolation in the Linux Kernel (James Bottomley / Mike Rapoport)
  17:00-17:50  Guix: Unifying provisioning, deployment, and package management in the age of containers (Ludovic Courtès)

H.2215 (Ferrer)
  14:00-14:15  Kapow! A Web Framework for the Shell (Roberto Abdelkader Martínez Pérez)
  14:20-14:35  Yjs: A CRDT framework for shared editing (Kevin Jahns)
  14:40-14:55  Encrypt your collaboration with CryptPad (Ludovic Dubost)
  15:00-15:15  Protect your data objects, not your network connections (Stephan Schwichtenberg)
  15:20-15:35  Optimizing sandbox creation with a FUSE file system (Julio Merino)
  15:40-15:55  Indexing Encrypted Data Using Bloom Filters (Claude Warren)
  16:00-16:15  Verifpal (Nadim Kobeissi)
  16:20-16:35  Mandos (Teddy Hogeborn)
  16:40-16:55  Red Wax - trust only yourself (Dirk-Willem van Gulik)
  17:00-17:15  KDE Itinerary (Volker Krause)
  17:20-17:35  Gate project (Timo Savola)
  17:40-17:55  The pool next to the ocean: How to bring OpenSource skills to more people (Johannes Tigges)
  18:00-18:15  Tracking local storage configuration on linux (Alasdair Kergon)
  18:20-18:35  Concept Programming, from ideas to code (Christophe de Dinechin)
  18:40-18:55  DeskConnD: Secure, cross-platform IPC on the network (Omer Akram)

H.1301 (Cornil)
  14:05-14:40  Quantum circuit optimisation, verification, and simulation with PyZX (John van de Wetering)
  14:50-15:25  SimulaQron - a simulator for developing quantum internet software (Axel Dahlberg)
  15:35-16:10  The Role of Open Source Frameworks in Quantum Computing and Technologies (Jack Hidary)
  16:20-16:55  Computing with the TensorNetwork Library (Stefan Leichenauer)
  17:05-17:40  Quantum classifiers, robust data encodings, and software to implement them (Ryan LaRose)
  17:50-18:25  Quantum computer brands: connecting apples and oranges (Petar Korponaić)
  18:30-19:00  Quantum Open Source Foundation (Mark Fingerhuth)

H.1302 (Depage)
  14:05-14:45  JMC & JFR - 2020 Vision (Jie Kang)
  14:50-15:15  Hacking on GraalVM: A (very) Rough Guide (Andrew Dinn / Josh Matsuoka)
  15:20-15:45  Reducing OpenJDK Java Garbage Collection times with stack allocation (Nikola Grcevski)
  15:50-16:15  G1: To infinity and beyond (Stefan Johansson)
  16:20-16:45  Just-in-time compiling Java in 2020 (Martin Doerr)
  16:50-17:15  Helpful NullPointerExceptions - The little thing that became a JEP (Christoph Langer)
  17:20-17:45  Taming Metaspace: a look at the machinery, and a proposal for a better one (Thomas Stüfe)
  17:50-18:15  The OpenJDK JVM : Securing a moving target or What could possibly go wrong? (Andrew Dinn)
  18:20-19:00  JRuby Startup and AOT (Charles Nutter / Thomas Enebo)

Please refer to the website for updates or last minute changes: https://fosdem.org/schedule/amendments

SATURDAY 1 FEBRUARY - FIRST PART

H.1308 (Rolin)
  10:30-11:10  Fundamental Technologies We Need to Work on for Cloud-Native Networking (Magnus Karlsson)
  11:10-11:30  Skydive (Sylvain Baubeau / Sylvain Afchain)
  11:30-12:10  Do you really see what's happening on your NFV infrastructure? (Emma Foley / Krzysztof Kepka)
  12:10-12:30  Endless Network Programming - An Update from eBPF Land (Quentin Monnet)
  12:30-12:50  Replacing iptables with eBPF in Kubernetes with Cilium (Michal Rostecki)
  12:50-13:10  Analyzing DPDK applications with eBPF (Stephen Hemminger)
  13:10-13:30  XDP and page_pool API (Ilias Apalodimas / Lorenzo Bianconi)
  13:30-14:10  Weave Net, an Open Source Container Network (Bryan Boreham)

H.1309 (Van Rijn)
  1*
  10:35-11:05  DNS Management in OpenStack (Graham Hayes)
  11:10-11:40  HashDNS and FQDNDHCP (Renzo Davoli)
  11:45-12:05  State of djbdnscurve6 (Erwin Hoffmann (feh))
  12:10-12:30  Testing DoH and DoT servers, compliance and performance (Stéphane Bortzmeyer)
  12:35-13:05  Improving BIND 9 Code Quality (Ondřej Surý)
  13:10-13:40  unwind(8) (Florian Obser)
  13:45-14:00  extending catalog zones (Leo Vandewoestijne)

H.2213
  10:30-10:50  Designing and Producing Open Source Hardware with FOSS/OSHW tools (Tsvetan Usunov)
  10:55-11:15  LibrePCB Status Update (Urban Bruhin)
  11:20-11:50  Open-source design ecosystems around FreeCAD (Yorik van Havre / Brad Collette)
  11:55-12:15  ngspice open source circuit simulator (Holger Vogt)
  12:20-12:40  Towards CadQuery 2.0 (Adam Urbanczyk)
  12:45-13:15  KiCad: Back to the Future (Wayne Stambaugh)
  13:20-13:40  Pocket Science Lab from Development to Production (Mario Behling)
  13:45-14:05  Designing functional objects with functional objects (Marius Kintel)

H.2214
  3*
  10:40-11:00  MySQL 8 vs MariaDB 10.4 (Peter Zaitsev)
  11:10-11:30  MyRocks in the Wild Wild West! (Alkin Tezuysal)
  11:40-12:00  How Safe is Asynchronous Master-Master Setup? (Sveta Smirnova)
  12:10-12:30  The consequences of sync_binlog != 1. (Jean-François Gagné)
  12:40-13:00  Overview of encryption features (Hrvoje Matijakovic)
  13:10-13:30  Whats new in ProxySQL 2.0? (Nick Vyzas)
  13:40-14:00  SELinux fun with MySQL and friends (Matthias C / Ivan Groenewold)

*1. DNS Devroom Opening, Shane Kerr / Pieter Lexis / Peter van Dijk, 10:30-10:35
*2. Leveraging Open Source Designs, Lasse Mönch, 14:10-14:20

SATURDAY 1 FEBRUARY - SECOND PART

H.1308 (Rolin)
  14:10-15:00  Rethinking kubernetes networking with SRv6 and Contiv-VPP (Ahmed Abdelsalam / Miroslaw Walukiewicz / Filip Gschwandtner / Daniel Bernier)
  15:00-15:40  Akraino Edge KNI blueprint (Yolanda Robla Mota / Ricardo Noriega)
  15:40-16:20  Fast QUIC sockets for cloud networking (Nathan Skrzypczak / Aloys Augustin)
  16:20-16:40  Mixing kool-aids! Accelerate the internet with AF_XDP & DPDK (Ciara Loftus / Kevin Laatz)
  16:40-17:20  Dial your Networking Code up to 11 (Bruce Richardson / Harry van Haaren)
  17:20-17:40  Userspace networking: beyond the kernel bypass with RDMA! (Benoît Ganne)
  17:40-18:00  Vita: high-speed traffic encryption on x86_64 with Snabb (Max Rottenkolber)

H.1309 (Van Rijn)
  14:05-14:25  The Different Ways of Minimizing ANY (Edward Lewis)
  15:00-15:35  Check Yourself Before You Wreck Yourself (Nic Jansma)
  15:40-16:15  Metrics and models for Web performance evaluation (Dario Rossi)
  16:20-16:55  Hint, Hint, Font Loading Matters! (Sia Karamalegos)
  17:00-17:35  The ultimate guide to HTTP resource prioritization (Robin Marx)
  17:40-18:15  Shipping a performance API on Chromium (Nicolás Peña Moreno)
  18:20-18:55  The journey of building OpenSpeedMonitor (Stefan Burnicki / Nils Kuhn)

H.2213
  2*
  Fritzing - the past, the present
  Sparselizard: a general purpose
  Open CASCADE Technology - an introduction
  News from gEDA/gaf
  Gmsh (Christophe Geuzaine)
  AXIOM - open source cinema
  Horizon EDA - Version 1.0
  OpenPiton: An Open-Source
  Designing Hardware, the Journey from
  Finite element modeling with the deal.II software library and