Application of Genetic Algorithms to Structure Elucidation of Halogenated Alkanes Considering the Corresponding C NMR Spectra*

Total Page:16

File Type:pdf, Size:1020Kb

Application of Genetic Algorithms to Structure Elucidation of Halogenated Alkanes Considering the Corresponding C NMR Spectra* CROATICA CHEMICA ACTA CCACAA 77 (1–2) 213¿219 (2004) ISSN-0011-1643 CCA-2919 Original Scientific Paper Application of Genetic Algorithms to Structure Elucidation of Halogenated Alkanes Considering the Corresponding 13C NMR Spectra* Thomas Blenkers and Peter Zinn** Lehrstuhl für Analytische Chemie, Ruhr-Universität Bochum, D-44780 Bochum, Germany RECEIVED MAY 5, 2003; REVISED JULY 19, 2003; ACCEPTED AUGUST 20, 2003 A new approach for structure elucidation using genetic algorithms is introduced. In analogy to the genetic programming paradigm developed by Koza, the new concept supports genetic ope- rations on hierarchically coded chemical line notations. The implementation of this concept consists of 5 steps. In the first step, a start population of chemical compounds is randomly gen- erated. As the second step, physical properties of each compound of the population are pre- dicted. The third step is the comparison of each individual property with the observed property of an unknown compound, resulting in the calculation of the fitness value for each generated compound. Depending on the fitness values, the candidates for the next generation are selected by a spinning wheel procedure during the fourth step. In the last step, these candidates are rear- Key words ranged by genetic mutation and crossover to form the next generation. Steps 2 to 5 of the de- genetic algorithms scribed procedure are repeated until the spectrum of one candidate is almost equal to the spec- structure elucidation trum of the unknown compound within acceptable tolerances. The introduced concept was 13C NMR spectra verified for halogenated alkanes. INTRODUCTION structures directly from powder diffraction data using Since the first publications in 1990, the interest in appli- genetic algorithms was reviewed recently.5 An application cations of genetic algorithms in chemistry or in closely of genetic algorithms to structure elucidation in a more related sciences has dramatically increased. Today, there general way was given by Meiler.6,7 This approach re- are nearly 4000 papers published, with a yearly growth quires the measured 13C NMR spectrum of the unknown of more than 600 contributions. An overview of the compound and its experimentally determined molecular varying application fields of genetic algorithms in chem- gross formula. Depending on the molecular formula, the istry was given by Leardi.1 space for searching the corresponding constitution in- Different approaches have been used to apply gene- creases rapidly. Genetic algorithms and 13C NMR spec- tic algorithms in structure elucidation.2,3,4 The objects of trum prediction by neural networks are demonstrated as these investigations are the protein fold prediction and the tools to find the constitution corresponding to the mea- pharmacophore elucidation, etc. Solving molecular crystal sured spectrum. * Dedicated to Professor Nenad Trinajsti} on the occasion of his 65th birthday. ** Author to whom correspondence should be addressed. (E-mail: [email protected]) 214 T. BLENKERS AND P. ZINN The concept of structure elucidation in the present To allow only genetic modifications that result in correct paper also focuses on 13C NMR spectra and genetic al- chemical structures, a system of corresponding constraints gorithms. As first described in 1995,8 the present method and modification rules seems to be necessary. This com- abstains from the molecular formula and allows struc- plication slows down the genetic algorithm and decreases ture elucidation without a priori structural knowledge. practicability. A completely different way of coding the individu- als of a genetic algorithm was introduced by J. Koza.11 GENERAL CONCEPT AND IMPLEMENTATION Koza’s Genetic Programming is a technique for finding STRATEGY the best mathematical formula in order to solve a given When including genetic algorithms into the structure problem. The principal idea is to code formulas in a elucidation procedure the most important difference tree-like hierarchical way and to modify them by genetic from the classical elucidation circuit is that, instead of operations from generation to generation until an opti- only one actual structure proposition, a whole popula- mal result is reached. Within a tree, sub-trees can be cut tion of possible chemical structures has to be considered. off and substituted by sub-trees from other trees in the Each time the population passes through the elucidation sense of a genetic crossover. Point mutations can be also circuit, the individual structures are modified by genetic performed easily. In the following chapter, we will dis- operations. Figure 1 shows the general concept of struc- cuss in detail the transformation of this coding to chemi- ture elucidation using genetic algorithms. cal structures. The starting point of the genetic algorithm is the ge- As the second step of the genetic structure elucida- neration of an initial population of chemical structures. tion, the prediction of a physical property of each indi- In generating an initial population the main problem is vidual chemical structure is to be performed. Physical to find a structure notation that allows genetic modifica- properties, e.g. spectral, diffraction, chromatographic, tions of the individual structures e.g. mutation and cross- thermodynamic or other data, can be used if a well de- over. Genetic algorithms can conveniently process binary fined quantitative structure property relationship (QSPR) coded individuals. These individuals are often represent- is known. Besides a single QSPR, the prediction of dif- ed as bit strings of fixed length or as the corresponding ferent properties or a combined QSPR is also of interest real numbers. On the other hand, typical computer-ori- and may improve the accuracy of the results of the ge- ented molecular codes9 are far from bit strings or real netic structure elucidation. Especially in cases when sin- number coding. The three most widespread structure no- gle QPR’s are of less predictive quality, a combination tations are the connection table, the different types of line may be successful. A set of property values correspond- notations, and the adjacency matrix.10 Transformation of ing to each individual structure of the actual population one of these representations into a bit string or a real is the result of this step. number is difficult and results in data structures unsuit- As the third step, the fitness value for each individual able for genetic operations. E.g., transformation of the has to be calculated, including the comparison between adjacency matrix to a bit string might succeed in linking the observed and the estimated property. Typical fitness the single lines or columns of the matrix one after the functions are error measures such as the root mean square, other. Genetic modifications of such bit strings could re- mean absolute error, etc. In the case of combined proper- sult in structures containing atoms with wrong valencies. ties, different fitness measures can be taken into account with respect to the reliability of the estimated properties. If the fitness value of an individual or the mean fitness of the complete population becomes better than the thre- shold value, the genetic algorithm stops; otherwise, the algorithm is continued with the next step. The fourth step is responsible for the selection of the individuals for the next generation. Depending on the fit- ness values, a random procedure decides which individ- uals or pairs of individuals are selected for mutation or crossover operations. A typical selection procedure is the spinning wheel method12 with circle sectors proportional to the fitness values of the individuals. In the fifth step, genetic operations applied to hierar- chically notated chemical structures are performed. Con- Figure 1. Overview of the iterative structure elucidation strategy cerning the genetic crossover, randomly chosen sub-trees applying genetic algorithms. Instead of one chemical compound, of two individuals are exchanged. Concerning the gene- whole generations of substances are passing through the circuit. tic mutation, a randomly chosen sub-tree of a single in- Croat. Chem. Acta 77 (1–2) 213–219 (2004) APPLICATION OF GENETIC ALGORITHMS TO STRUCTURE ELUCIDATION 215 dividual is eliminated and substituted by a randomly ge- nerated new sub-tree to form an individual for the next (a) generation. After the fifth step, the first elucidation cir- cuit is finished and a new generation of chemical struc- tures has been built to pass the next circuit, and so on. Because the genetic operations as well as the gener- ation of the start population have been verified as sym- bolic programming techniques instead of conventional numerical methods of the other steps, these operations (b) are described in more detail in the next two chapters. GENERATION OF HIERARCHICALLY CODED CHEMICAL STRUCTURES In a previous paper, we have introduced the basic list data types for performing list operations on chemical graphs.13 Among these data types, we have given an ex- ample of a tree – like hierarchical notation of a chemical Figure 2. Example of a hierarchical coded chemical line notation. structure. This hierarchical structure is based on a line Because it is not unique several notations are belonging to the notation that is a near SMILES14 implementation in the same molecule. A corresponding property prediction has to result programming language LISP.15 As an example, Figure 2 in the same value independent of the hierarchical notation of the gives the line notation of 1-bromo-2-chlorobutane. molecule. (a) Line notation of 1-bromo-2-chlorobutane. (b) Hier- archical representations of 1-bromo-2-chlorobutane. In order to generate a hierarchical representation need- ed in the genetic algorithm, a root atom has to be deter- mined. Then, corresponding to the root atom, branches and dure if the actual atom belongs to the node set. The num- sub-branches are added, characterized by additional pa- ber and kind of the elements of the terminal and node sets rentheses. The hierarchical representation of the exam- are responsible for the magnitude of the generated trees.
Recommended publications
  • Dc Prf-SPOJ-Classical.Ps
    Archives of the Sphere Online Judge classical problemset Editors: 1 u.swarnaprakash NghiaHemant Nguyen Verma Hoang Blue Mary Andrés Leonardo Rojas LukasŁukasz Mai Kuszner Adrian Kosowski Duarte Stephenbalaji Merriman Adrian Kuegel Brian YashRahul Garg Camilo Andrés Varela León Spooky RobertNeal Zane Rychcicki Jin Bin Paritosh Aggarwal VOJChinh problem Nguyen setters Thanh-Vy Hua Le Đôn Khue ?????Paweł Dobrzycki Roman Sol Csaba Noszaly KonradPatryk Pomykalski Piwakowski Wanderley Guimaraes Analysis Mode (Bogardan ZhangFrank RafaelTaizhi Arteaga Michał Czuczman Hellkite) MauroMiorel PaliiPersano Jelani Nelson (Minilek) Abhilash I P.KasthuriTomek Czajka Rangan• Daniel Gómez Didier Paul Draper SebastianPripoae Toni Kanthak Ngô Minh Đu+’c Bobby Xiao BartłomiejReinier César Kowalski Mujica Neal Wu Darek Dereniowski IvanHdez Alfonso Prasanna Nguye^~n Ha Du+o+ng OlamendyRadu Grigore Piotr Łowiec Nguyen Minh Hieu MartinMark Gordon Bader Robin Nittka Qu Jun dqdLovro Puzar Ahmed Aly Fabio Avellaneda PiotrLordxfastx Piotrowski Adam Dzedzej Hoang Hong Quan TomaszRuslan Sennov Goluch Ajay Somani Nguyen Van Quang Huy Rahulabhijith reddy d Nikola P Borisov Tomas. Bob Diego Satoba Mir Wasi Ahmed Pawel Gawrychowski Luka Kalinovcic Matthew Reeder yandry pérez Rafal clemente Marco Gallotta Tomasz Niedzwiecki Pavel Kuznetsov Andrés Mejía-Posada Robert Gerbicz Andres Galvis Chen Xiaohong Slobodan Simon Gog Alfonso2 Peterssen Kashyap KBR Krzysztof Kluczek John Rizzo Jose Daniel Rdguez Race with time Abel Nieto Rodriguez Michał Małafiejski Bogusław K. Osuch Ivan Metelsky Gogu Marian Phenomenal Le Trong Dao Nguyen Dinh Tu Muntasir Azam Khan 2 Last updated: 2009-10-09 09:00:05 3 Preface This electronic material contains a set of algorithmic problems, forming the archives of the Sphere Online Judge (http://www.spoj.pl/), classical problemset.
    [Show full text]
  • Prolog W Formie Dialogu Pomiędzy Studentem I (Cokolwiek) Sokratycznym Profesorem
    Teksty Drugie 2007, 1-2, s.127-143 Prolog w formie dialogu pomiędzy studentem i (cokolwiek) sokratycznym Profesorem. Bruno Latour Tłumaczenie zbiorowe pod kierunkiem Krzysztofa Abriszewskiego http://rcin.org.pl Bruno UTOUR Prolog w formie dialogu pomiędzy studentem i (cokolwiek) sokratycznym Profesorem^ {Gabinet w London School of Economics, późne wtorkowe popołudnie w lutym, przed pójściem do Beaver na kwartę piwa. Słychać ciche, ale natarc^we pukanie. Student za­ gląda do gabinetu.) Student: - Czy nie przeszkadzam? Profesor: - Nie, to i tak są moje godziny pracy. Proszę wejść i usiąść. S: - Dziękuję. P: - Mniemam, że... czuje się Pan trochę zagubiony? S: - Właściwie tak. Muszę przyznać, iż trudno mi zastosować Teorię Aktora-Sieci w moich badaniach nad organizacjami. P: - Nic dziwnego - nie można zastosować jej do niczego! S: - Ale uczono nas... mam na myśli... wydawało mi się, że to tutaj całkiem gorący towar. Czy mówi Pan, że jest zupełnie bezużyteczna? P: - Mogłaby być użyteczna, ale tylko jeśli nie „stosuje” się do niczego. S: - Przepraszam, ale czy to ma być jakaś sztuczka Zen? Muszę Pana ostrzec, że jestem jedynie doktorantem w badaniach nad organizacjami, więc proszę nie ocze­ kiwać... Nie jestem w temacie, jeśli chodzi o francuską myśl, przeczytałem trochę Mille Plateaux, ale nie bardzo zrozumiałem, o co tam chodzi... 1 Tłumaczenia zbiorowego pod kierunkiem Krzysztofa Abriszewskiego dokonali: Adrian Gahbler, Andrzej Kilanowski, Paweł Mil, Radosław Naworski, Natalia Organista, Dawid Piekło, Robert Szatkowski, Wojciech Wańczyk, Jakub Wolski. ^ http://rcin.org.pl Prezentacje P: - Przepraszam. Nie chciałem się wymądrzać. Chodzi o to, że ANT (skrót od ang. Actor-Network Theory - przyp. tłum.) przede wszystlsim jest negatywnym ro­ zumowaniem.
    [Show full text]
  • World Record Lunch
    World Record Lunch A group of people is trying to beat the world record for the largest number of people having lunch at the same time. In order achieve this goal, they are using the country's largest bridge and they have decided to arrange the tables following the shape of the letter 'S'. The table layout can be described by 4 integers: NH, NV, H and V. The two first integers, NH and NV, represent respectively the number or rows and number of columns in the layout. The last two integers represent respectively the number of tables in each row and column. For a given layout, the tables are numbered consecutively, starting with table #1 in the top-right corner. The following figure illustrates several possible layouts: Thousands of groups of people are expected to come, and the organizers have to define where to seat everyone. Each group needs a certain number of tables and they do not share tables with other groups. Furthermore, a group wants their tables to be together and not split among rows and columns, that is, they want a set of consecutive tables either on the same row or on the same column. If this condition cannot be met, the group prefers to go away and have lunch at another place. The groups also enjoy having some privacy and prefer unoccupied adjacent tables, that is, no one at the table exactly before the first table of the group, and no one at the table exactly after the last table of the group. If this happens, we say that the group found a private place.
    [Show full text]
  • Theoretical and Practical Aspects of Programming Contest Ratings
    Department of Informatics Faculty of Mathematics, Physics and Informatics Comenius University, Bratislava, Slovakia Theoretical and Practical Aspects of Programming Contest Ratings Dizertaèná práca v odbore doktorandského ¹túdia: 11-80-9 teoretická informatika RNDr. Michal Fori¹ek ©koliteľ: prof. RNDr. Branislav Rovan, Ph.D. Bratislava, 2009 iii Acknowledgements First and foremost, I would like to thank my advisor prof. Branislav Rovan for being an exceptional and inspirational teacher and advisor. I'm also grateful to other faculty members { and students { at our university. Thanks to many of them, both studying and teaching here has been a pleasure, and that is one of the factors that significantly influenced my career choices so far. But without any doubt my biggest thanks must go to my fiancée Jana. Without her constant love, support and understanding I would not be able to finish this Thesis. iv Contents Abstract (English) 1 Abstrakt (Slovensky) 3 1 Introduction 5 1.1 Goals of the Thesis . .5 1.2 Outline of this Thesis . .6 1.3 Motivation . .7 2 Background 11 2.1 Programming contests landscape . 11 2.1.1 Important programming contests . 11 2.1.2 Programming contests terminology . 14 2.1.3 Overview of the existing research on competitions . 16 2.2 Rating Systems and Rating Algorithms . 20 2.2.1 Introduction . 21 2.2.2 Overview of the Elo rating system . 22 2.2.3 TopCoder's event format . 23 2.2.4 TrueSkill(TM) rating algorithm . 25 2.2.5 eGenesis rating algorithm . 25 2.3 Item Response Theory . 26 2.3.1 Introduction and motivation .
    [Show full text]
  • PROGRAMMING EXERCISES EVALUATION SYSTEMS an Interoperability Survey
    PROGRAMMING EXERCISES EVALUATION SYSTEMS An Interoperability Survey Ricardo Queirós1 and José Paulo Leal2 1CRACS & INESC-Porto LA & DI-ESEIG/IPP, Porto, Portugal 2CRACS & INESC-Porto LA, Faculty of Sciences, University of Porto, Porto, Portugal Keywords: Learning Objects, Standards, Interoperability, Programming Exercises Evaluation. Abstract: Learning computer programming requires solving programming exercises. In computer programming courses teachers need to assess and give feedback to a large number of exercises. These tasks are time consuming and error-prone since there are many aspects relating to good programming that should be considered. In this context automatic assessment tools can play an important role helping teachers in grading tasks as well to assist students with automatic feedback. In spite of its usefulness, these tools lack integration mechanisms with other eLearning systems such as Learning Management Systems, Learning Objects Repositories or Integrated Development Environments. In this paper we provide a survey on programming evaluation systems. The survey gathers information on interoperability features of these systems, categorizing and comparing them regarding content and communication standardization. This work may prove useful to instructors and computer science educators when they have to choose an assessment system to be integrated in their e-Learning environment. 1 INTRODUCTION These surveys seldom address the PES interoperability features, although they generally One of the main goals in computer programming agree on the importance of the subject, due to the courses is to develop students’ understanding of the comparatively small number of systems that programming principles. The understanding of implement them. This lack of interoperability is felt programming concepts is closely related with the at content and communication levels.
    [Show full text]
  • Spoj, Topcoder, Github Í Experience May 2014 – Software Engineering Intern, Google, New York
    2609 Orchard Ave Los Angeles, CA-90007 Krishna Bharadwaj H +1 310-691-4078 B [email protected] Spoj, TopCoder, Github Í www.krishnabharadwaj.info Experience May 2014 – Software Engineering Intern, Google, New York. Present Worked on the inline browse data computation for git repositories Sep 2012 – Co-founder & Technology Lead, SMERGERS, Bangalore. Jul 2013 SMERGERS is a SME focused Mergers & Acquisitions Platform. Aug 2011 – Full stack Web Developer, BlockBeacon, PricePoint, Bangalore. Aug 2012 Worked remotely with startups in Santa Monica and San Francisco as an independent consultant. Aug 2011 – Founder & Developer, Refer a Geek, Bangalore. Sep 2012 Refer a Geek aimed at bringing the Referral model of recruting beyond any one company. Jul 2009 – Software Engineer, National Instruments India R&D, Bangalore. Jun 2011 Designed and developed physical layer algorithms for WCDMA/HSPA+ in LabVIEW. Education Fall 2013 – Masters in Computer Science, University of Southern California, Los Angeles. Present Student Programmer, Information Sciences Institute(USC) - Working with Dr. Mehdi Yahyanejad June 2009 BE, Information Science, B. M. S. College of Engineering, Bangalore. Microsoft Student Partner, Placement Coordinator & BMS Linux Users Group. Technical Skills & Interests Programming C/C++, Python, LabVIEW, PHP, Javascript Expertise Data Structures & Algorithms – Spoj, TopCoder, Github Databases MySQL, Oracle, MongoDB Web Dev HTML, CSS, jQuery, Django, CodeIgniter OS Windows & Linux system administraton A Others Qt, NumPy, Matplotlib, LTEX
    [Show full text]
  • INTRODUCTION to PROGRAMMING Dr
    INTRODUCTION TO PROGRAMMING Dr. Dipanjan Roy IIIT, Lucknow 2 Dr. Dipanjan Roy, IIIT, Lucknow A warm welcome to all of you! 3 Dr. Dipanjan Roy, IIIT, Lucknow Why are you here? 4 Dr. Dipanjan Roy, IIIT, Lucknow What is your expectation from this class? 5 Dr. Dipanjan Roy, IIIT, Lucknow Is programming important for your career? Why? 6 Dr. Dipanjan Roy, IIIT, Lucknow What is Programming? . It is the process of creating a set of instructions that tell a computer how to perform a task. It is sequence of instruction along with data for a computer. Programming can be done using a variety of programming "languages," such as SQL, Java, Python, C++, etc. 7 Dr. Dipanjan Roy, IIIT, Lucknow Hierarchy of Computer Languages High Assembly Machine Computer Level Language Language Hardware Language 8 Dr. Dipanjan Roy, IIIT, Lucknow Programming Languages . C . C++ . Java . Python . JavaScript . R . Ruby . SCALA . C# 9 Dr. Dipanjan Roy, IIIT, Lucknow Types of Programming 1. Web Development Programming 2. Desktop Application Programming 3. Distributed Application Programming 4. Core Programming 5. System Programming 6. Programming Scientist https://www.wikihow.com/Become-a-Programmer 10 Dr. Dipanjan Roy, IIIT, Lucknow How to become a programmer? WRITE CODE OPTIMIZE/ COMPILE IMPROVE DEBUG EXECUTE 11 Dr. Dipanjan Roy, IIIT, Lucknow Important Tips and Links . Resources: . https://www.geeksforgeeks.org . https://www.tutorialspoint.com/index.htm . https://stackoverflow.com/ 12 Dr. Dipanjan Roy, IIIT, Lucknow Important Tips and Links . Online Coding Platform: . https://www.topcoder.com . https://www.coderbyte.com/ . https://www.hackerrank.com/dashboard . https://www.codechef.com/ . https://www.spoj.com/ .
    [Show full text]
  • If I Am Not Good at Solving the Problems on the Competitive Programming Sites Like Codechef Or Hackerrank, Where Am I Lagging? - Quora
    9/28/2014 If I am not good at solving the problems on the competitive programming sites like CodeChef or Hackerrank, where am I lagging? - Quora QUESTION TOPICS If I am not good at solving the problems on the RELATED QUESTIONS competitive programming sites like CodeChef or HackerRank CodeChef: I am in my third year of Hackerrank, where am I lagging? university now. What should be my Codeforces strategy so that I am comfortable with I am a second year undergrad at one of the IITs and very decent with my CPI Sphere Online Judge solving problems of gene... (continue) (SPOJ) as of now. I have tried quite a few times to start with the likes of above mentioned sites but even the basic level questions take a long time for me to Computer Programming: Why am I CodeChef unable to concentrate in problem solving, get completed? If I know the programming language, if I understand the coding, reading, poor at math? TopCoder questions, where is the fallacy on my part that is preventing me from getting Competitive over them(solving questions) quickly and efficiently? I am quite motivated towards Programming programming, but there is a phase when I am unable to solve most of the problems. Software Follow Question 190 Comment Share 2 Downvote How do I ge... (continue) Programming Languages I am new to competitive programming, just joined CodeChef 10 days back. I am Indian Institutes of Sumit Saurabh finding the easy level questions very Technology (IITs) Edit Bio • Make Anonymous difficu... (continue) Computer Science Write your answer, or answer later.
    [Show full text]
  • Competitive Sport Programming and Open Source Technologies
    Competitive Sport Programming and Open Source Technologies - Ravi Kiran S ( Final Year, CSE ) - Sukin Kumar ( Final Year, CSE ) What is competitive sport programming? What are all the various competitions that happen all over the world? Various Global Programming Competitions: ● ACM-ICPC ( International Collegiate Programming Contest ) ● Google Codejam ● Facebook Hackercup ● Codechef Snackdown and many more... ACM-ICPC: Intro: ● The ACM ICPC is considered as the "Olympics of Programming Competitions". It is quite simply, the oldest, largest, and most prestigious programming contest in the world. ● The ACM-ICPC (Association for Computing Machinery - International Collegiate Programming Contest) is a multi-tier, team-based, programming competition. Headquartered at Baylor University, Texas, it operates according to the rules and regulations formulated by the ACM. The contest participants come from over 2,000 universities that are spread across 80 countries and six continents. Contest Structure: ● Online Round ● ICPC Regional Round ( Regional Sites: Amritapuri, Chennai, Kolkata, Kharagpur, Gwalior in India ) ● ICPC World Finals ACM-ICPC: Eligibility: ● Enrolled in a degree program at an institution. Contest Format: ● It is a team contest. ● Each team should have three members and one reserve (3 + 1). ● Each team must be headed by a coach, who must be a university faculty or staff member. ● The contest can have several problems (8 to 10 in general), of varying difficulty levels and mostly being algorithmic in nature. More Information: ● https://www.codechef.com/icpc Google Codejam: Intro: ● Code Jam calls on programmers around the world to put their skills to the test by solving multiple rounds of algorithmic puzzles. The online rounds conclude in the World Finals, which rotates globally.
    [Show full text]
  • Next-Generation Programming Learning Platform: Architecture and Challenges
    SHS Web of Conferences 77, 01004 (2020) https://doi.org/10.1051/shsconf /20207701004 ETLTC2020 Next-Generation Programming Learning Platform: Architecture and Challenges Yutaka Watanobe1,∗, Chowdhury Intisar1,∗∗, Ruth Cortez1,∗∗∗, and Alexander Vazhenin1,∗∗∗∗ 1Department of Computer Science and Engineering, University of Aizu Abstract. With the rapid development of information technology, programming has become a vital skill. An online judge system can be used as a programming education platform, where the daily activities of users and judges are used to generate useful learning objects (e.g., tasks, solution codes, evaluations). Intelligent software agents can utilize such objects to create an ecosystem. To implement such an ecosystem, a generic architecture that covers the whole lifecycle of data on the platform and the functionalities of an e-learning system should take into account the particularities of the online judge system. In this paper, an architecture that implements such an ecosystem based on an online judge system is proposed. The potential benefits and research challenges are discussed. 1 Introduction of the oldest major OJSs, UVa Online Judge, started pro- viding service in 1995 [1]. A number of OJSs are currently With the growing significance of technology and the ubiq- operated by various organizations and related tools are ac- uitous availability of information, a number of initiatives tively researched [2]. in computer education have been promoted by various or- Although learning materials have been dramatically ganizations. Programming has become a vital skill not improved, most major OJSs have shortcomings regarding only for information technology but also for industry and their educational use. For example, black-box tests are a academia.
    [Show full text]
  • Topcoder SRM Solution
    Codeforces #172 Tutorial xiaodao Contents 1 Problem 2A. Word Capitalization2 2 Problem 2B. Nearest Fraction3 3 Problem A. Rectangle Puzzle5 4 Problem B. Maximum Xor Secondary9 5 Problem C. Game on Tree 10 6 Problem D. k-Maximum Subsequence Sum 12 7 Problem E. Sequence Transformation 15 1 1 Problem 2A. Word Capitalization Brief Description Capitalize a given word. Tags implementation, strings Analysis As the first problem in the problemset, it intended to be simple so that any one could solve it. Just implement what the problem said under your favorite language. Negative Example h6363817, A.nahavandi, Sigizmynd, addou: There are always some competitors started write down the code before they got comprehensive understanding on the statement and the restriction. There is more than one single case and I am not going to list all of them here. LordArian, Cyber.1: Another part of our compatriots are still unfamiliar with some basic builtin-function. They are going to reinvent the wheel and it will definitely cost more of their time on implementation. And at the same time this means more possibility of making mistakes. Suggested Code astep, navi, shloub, Hohol: For situation like this, we believe that the shorter the better! There are the shortest codes on Ruby, Perl, Python and Haskell. And I hope there could be at least one of them to your taste. 2 2 Problem 2B. Nearest Fraction Brief Description x Find the nearest fraction to fraction y whose denominator is no more than n. Tags brute-force, greedy, math, number theory Analysis This problem can be solved by a straight forward method since we are at Div II, simply enumerate all possible denominator, and calculate the nearest fraction for each denominator.
    [Show full text]
  • A Survey on Online Judge Systems and Their Applications
    1 A Survey on Online Judge Systems and Their Applications SZYMON WASIK∗†, Poznan University of Technology and Polish Academy of Sciences MACIEJ ANTCZAK, Poznan University of Technology JAN BADURA, Poznan University of Technology ARTUR LASKOWSKI, Poznan University of Technology TOMASZ STERNAL, Poznan University of Technology Online judges are systems designed for the reliable evaluation of algorithm source code submitted by users, which is next compiled and tested in a homogeneous environment. Online judges are becoming popular in various applications. Thus, we would like to review the state of the art for these systems. We classify them according to their principal objectives into systems supporting organization of competitive programming contests, enhancing education and recruitment processes, facilitating the solving of data mining challenges, online compilers and development platforms integrated as components of other custom systems. Moreover, we introduce a formal definition of an online judge system and summarize the common evaluation methodology supported by such systems. Finally, we briefly discuss an Optil.io platform as an example of an online judge system, which has been proposed for the solving of complex optimization problems. We also analyze the competition results conducted using this platform. The competition proved that online judge systems, strengthened by crowdsourcing concepts, can be successfully applied to accurately and efficiently solve complex industrial- and science-driven challenges. CCS Concepts: •General and reference ! Evaluation; •Mathematics of computing ! Combinatorics; Discrete optimization; •Theory of computation ! Design and analysis of algorithms; •Networks ! Cloud computing; Additional Key Words and Phrases: online judge, crowdsourcing, evaluation as a service, challenge, contest ACM Reference format: Szymon Wasik, Maciej Antczak, Jan Badura, Artur Laskowski, and Tomasz Sternal.
    [Show full text]