Assisted Authoring for Avoiding Inadequate Claims in Scientific Reporting Anna Koroleva

Total Page:16

File Type:pdf, Size:1020Kb

Assisted Authoring for Avoiding Inadequate Claims in Scientific Reporting Anna Koroleva Assisted authoring for avoiding inadequate claims in scientific reporting Anna Koroleva To cite this version: Anna Koroleva. Assisted authoring for avoiding inadequate claims in scientific reporting. Bioin- formatics [q-bio.QM]. Université Paris-Saclay; Universiteit van Amsterdam, 2020. English. NNT : 2020UPASS021. tel-02938856 HAL Id: tel-02938856 https://tel.archives-ouvertes.fr/tel-02938856 Submitted on 15 Sep 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Assisted authoring for avoiding inadequate claims in scientific reporting Thèse de doctorat de l'université Paris-Saclay École doctorale n° 580 Sciences et Technologies de l’Information et de la Communication (STIC) Spécialité de doctorat: Informatique Unité de recherche : Université Paris-Saclay, CNRS, LIMSI, 91400, Orsay, France Référent : Faculté des sciences d'Orsay Thèse présentée et soutenue à Amsterdam, le 22 janvier 2020, par Anna KOROLEVA Sophia Ananiadou Présidente, Rapporteur Professeur, University of Manchester Paolo Rosso Rapporteur Professeur, Universitat Politècnica de València Ameen Abu-Hanna Examinateur Professeur, Universiteit van Amsterdam Evangelos Kanoulas Examinateur Professeur, Universiteit van Amsterdam Olivier Ferret Examinateur Ingénieur de Recherche, Université Paris-Saclay Nicolas Sabouret Examinateur Professeur, Université Paris-Saclay Patrick Paroubek Directeur de thèse Ingénieur de Recherche, Université Paris-Saclay Patrick M.M. Bossuyt Co-Directeur de thèse Professeur, Universiteit van Amsterdam Assisted authoring for avoiding inadequate claims in scientific reporting Anna Koroleva ISBN: 978-90-830376-4-6 Cover design: James Temple-Smith & Anna Koroleva Layout: Anna Koroleva Printed by: Print Service Ede - www.printservice-ede.nl This project was funded by the European Union’s Horizon 2020 research and innovation pro- gramme under the Marie Sklodowska-Curie grant agreement No 676207. ○c 2020 Anna Koroleva. No part of this thesis may be reproduced, stored or transmitted in any form or by any means without permission of the author or publishers of the included scientific papers. Assisted authoring for avoiding inadequate claims in scientific reporting ACADEMISCH PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Amsterdam op gezag van de Rector Magnificus prof. dr. ir. K.I.J. Maex ten overstaan van een door het College voor Promoties ingestelde commissie, in het openbaar te verdedigen in de Agnietenkapel op woensdag 22 januari 2020, te 16.00 uur door Anna Koroleva geboren te Moskou Promotiecommissie: Promotor: prof. dr. P.M.M. Bossuyt AMC-UvA Promotor: dr. P. Paroubek Université Paris-Saclay Overige leden: prof. dr. A. Abu-Hanna AMC-UvA prof. dr. E. Kanoulas Universiteit van Amsterdam prof. dr. S. Ananiadou University of Manchester prof. dr. P. Rosso Universitat Politècnica de València prof. dr. N. Sabouret Université Paris-Saclay dr. O. Ferret Université Paris-Saclay Faculteit der Geneeskunde Dit proefschrift is tot stand gekomen in het kader van het European project "MIROR" GA No. 676207, met als doel het behalen van een gezamenlijk doctoraat. Het proefschrift is voorbereid in de Faculteit der Geneeskunde van de Universiteit van Amster- dam en in de Centre National de la Recherche Scientific (CNRS) van de Université Paris-Saclay. This thesis has been written within the framework of the European project "MIROR" GA No. 676207, with the purpose of obtaining a joint doctorate degree. The thesis was prepared in the Faculty of Medicine at the University of Amsterdam and in the Centre National de la Recherche Scientific (CNRS) at the Université Paris-Saclay. Contents Introduction 1 I Algorithm scheme and annotation 9 1 Automatic detection of inadequate claims in biomedical articles: first steps. Anna Koroleva, Patrick Paroubek. Proceedings of Workshop on Curative Power of MEdical Data, Constanta, Romania, September 12-13, 2017. 11 2 Annotating Spin in Biomedical Scientific Publications: the case of Randomized Controlled Trials (RCTs). Anna Koroleva, Patrick Paroubek. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, May 7-12, 2018 25 II Algorithms 43 3 Extracting outcomes from articles reporting randomized controlled trials using pre-trained deep language representations. Anna Koroleva, Sanjay Kamath, Patrick Paroubek. Submitted 45 4 Measuring semantic similarity of clinical trial outcomes using deep pre- trained language representations. Anna Koroleva, Sanjay Kamath, Patrick Paroubek. Journal of Biomedical Informatics - X 69 5 Towards Automatic Detection of Primary Outcome Switching in Articles Reporting Clinical Trials. Anna Koroleva, Patrick Paroubek. Submitted 101 6 Extracting relations between outcomes and significance levels in Random- ized Controlled Trials (RCTs) publications. Anna Koroleva, Patrick Paroubek. Proceedings of ACL BioNLP Workshop 2019, Florence, Italy, August 2019 121 III Implementation and interface 143 7 DeSpin: a prototype system for detecting spin in biomedical publications. Anna Koroleva, Sanjay Kamath, Patrick MM Bossuyt, Patrick Paroubek. Submitted 145 IV Difficulties 163 8 What is a primary outcome? A corpus study. Anna Koroleva, Elizabeth Wager, Patrick MM Bossuyt. Submitted 165 Summary 189 Samenvatting 195 Résumé 201 PhD Portfolio 207 Contributing authors 211 Acknowledgements 213 About the author 215 Introduction Many, if not all, of us came across a situation of being prescribed a medication that did not work as it was supposed to and, maybe, even worse, had some unpleasant side effects, leaving us wondering how such a substance could get into clinical use, why our doctor chose to prescribe it, and why there was no warning of side effects. Absence of effects, or even negative effects, of drugs can occur simply because persons differ, and even established and approved drugs may not work, or do not help, everyone. But what if there are other mechanisms as well that can lead to the dissemination of less effective drugs, to doctors prescribing them, and may lead the public into believing in their effectiveness? I had no idea that such mechanisms can exist before I started my PhD and learned about spin in reporting research results. The notion of "spin" comes from the domain of politics, where it denotes a form of propa- ganda and consists in passing on a biased message to the public, to create a certain (positive or negative) perception of a person or event (Tye, 1998; Jackson and Jamieson, 2007; Boardman et al., 2017). Political spin is often related to deception and manipulation. From politics, the term "spin" came into the domain of scientific research, where it refers to presenting research results in a more positive (or, rarely, more negative) way that the obtained evidence justifies. In 2010, Boutron et al. (2010) introduced and defined the term "spin"for research domain, in particular, for randomized controlled trials (RCTs) - clinical trials studying a new intervention by comparing it to an established intervention or to a placebo. In RCTs with non-significant primary outcome, spin is defined as "the use of specific reporting strategies, from whatever motive, to highlight that the experimental treatment is beneficial, despite a statistically nonsignificant difference for the primary outcome, or to distract the reader from statistically nonsignificant results" (Boutron et al., 2010). A variety of terms was usedto denote this phenomenon, such as distorted / misleading presentation, mis-/overinterpretation of research results; "spin" is now the most commonly accepted term for this phenomenon. If one looks at the examples of spin provided in the relevant articles, it may seem that the difference between "spinned" and fair presentation of results is very slight; as aresult, people often wonder if spin is really a problem. To answer this question, Boutron et al. (2014) 1 compared how clinicians perceive the treatments presented in abstracts with spin and in the same abstract rewritten without spin. The authors showed that the clinicians overestimated the effects of the treatment after reading a "spinned" abstract. Given that the abstract isoften the only part of an article available to wide public free of charge, readers often cannot read the full text of an article to assess the correctness of conclusions in the abstract. Hence, clinical decisions can be made on the basis of the information in abstracts; and so spin in abstracts can have a highly negative impact on clinical decision-making. Interventions with unproved efficacy or safety can get into production and clinical use (and maybe this is one of thereasons we have that negative experience with taking medications?). Spin in research articles can have a wider impact: it was shown to be related to spin in press releases and health news (Haneef et al., 2015; Yavchitz et al., 2012), and thus can impact the beliefs and expectations of the public regarding new treatments. The phenomenon of spin has started to attract attention not only of the research community, but also
Recommended publications
  • Data Mining – Intro
    Data warehouse& Data Mining UNIT-3 Syllabus • UNIT 3 • Classification: Introduction, decision tree, tree induction algorithm – split algorithm based on information theory, split algorithm based on Gini index; naïve Bayes method; estimating predictive accuracy of classification method; classification software, software for association rule mining; case study; KDD Insurance Risk Assessment What is Data Mining? • Data Mining is: (1) The efficient discovery of previously unknown, valid, potentially useful, understandable patterns in large datasets (2) Data mining is the analysis step of the "knowledge discovery in databases" process, or KDD (2) The analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner Knowledge Discovery Examples of Large Datasets • Government: IRS, NGA, … • Large corporations • WALMART: 20M transactions per day • MOBIL: 100 TB geological databases • AT&T 300 M calls per day • Credit card companies • Scientific • NASA, EOS project: 50 GB per hour • Environmental datasets KDD The Knowledge Discovery in Databases (KDD) process is commonly defined with the stages: (1) Selection (2) Pre-processing (3) Transformation (4) Data Mining (5) Interpretation/Evaluation Data Mining Methods 1. Decision Tree Classifiers: Used for modeling, classification 2. Association Rules: Used to find associations between sets of attributes 3. Sequential patterns: Used to find temporal associations in time series 4. Hierarchical
    [Show full text]
  • Text Mining Tools
    Chapter 21 Text mining tools A. Zanasi TEMIS SA, France. Modena & Reggio Emilia University, Italy. Introduction Several tools are already available in the text mining (or more generally un- structured data) quickly growing market. We recognize the text mining market as composed by pure players (companies which develop text mining software, e.g. Clearforest, Inxight, Temis), indirect players (which integrate text mining into their offering, e.g. IBM, SAS, SPSS), partial players (companies which use text mining to improve their core business, e.g. Fast, Verity). A section is dedicated to each player which provided us with their company description, and a cumulative section listing the most known players in the text mining arena. Hundreds of other companies are already working successfully in the worldwide market. For a list of them, go to www.kdnuggets.com. 1 Megaputer intelligence 1.2 Company description Megaputer Intelligence (www.megaputer.com) was founded in 1993. The company provides a family of tools for the analysis of structured data and text. 1.3 Products Megaputer Intelligence offers two products: PolyAnalyst for Text™, aimed pri- marily at the analysis of incident reports, survey responses and customer comm- unications data and TextAnalyst™ . PolyAnalyst for Text™ performs semantic text analysis, record coding, identi- fication and visualization of patterns and clusters of information, automated or manual taxonomy creation and editing, taxonomy-based forced or self-learning WIT Transactions on State of the Art in Science and Eng ineering, Vol 17, © 2005 WIT Press www.witpress.com, ISSN 1755-8336 (on-line) doi:10.2495/978-1-85312-995-7/21 316 Text Mining and its Applications to Intelligence, CRM and Knowledge Management categorization of documents, text OLAP, link analysis, and interactive visual processing of various combinations of structured and unstructured data.
    [Show full text]
  • Application of Polyanalyst to Flight Safety Datat Southwest Airlines
    Application of PolyAnalyst to Flight Safety Data at Southwest Airlines TM r Proof-of-Concept Demonstration of Data and Text Mining Prepared by: Megaputer Intelligence 120 W. 7th Street, Suite 310 Bloomington, IN 47404 Project Manager: Sergei Ananyan [email protected] Primary Analysts: Your Knowledge Partne Richie Kasprzycki [email protected] Vijay Kollepara [email protected] January 2004 DISCLAIMER This project report and the results are based on a time-constrained proof-of-concept demonstration carried out by Megaputer Intelligence for Southwest Airlines and involve analysis of only a subset of data. Therefore, the results should not be used to form any conclusions or business decisions. Table of Contents ACKNOWLEDGEMENTS ......................................................................................................................... ii EXECUTIVE SUMMARY ......................................................................................................................... iii 1.0 INTRODUCTION ................................................................................................................................. 1 1.1. PURPOSE OF THE PROOF-OF-CONCEPT DEMONSTRATION .............................................................. 1 1.2 OVERVIEW OF PARTICIPATING ORGANIZATIONS ........................................................................... 2 1.3 OVERVIEW OF POLYANALYST TEXT AND DATA MINING SYSTEM................................................. 2 1.4 INPUT DATA: AVIATION SAFETY ACTION PROGRAM
    [Show full text]
  • Jaquelina Belin Rossi.Pdf (2.890Mb)
    UNIVERSIDADE FEDERAL DO PARANÁ JAQUELINA BELIN ROSSI INTELIGÊNCIA ARTIFICIAL APLICADA ÀS ROTINAS JURÍDICAS: REVISÃO SISTEMÁTICA DE LITERATURA CURITIBA 2021 JAQUELINA BELIN ROSSI INTELIGÊNCIA ARTIFICIAL APLICADA ÀS ROTINAS JURÍDICAS: REVISÃO SISTEMÁTICA DE LITERATURA Trabalho de Conclusão de Curso apresentado como requisito parcial à obtenção de grau de Bacharel no Curso de Gestão da Informação, Departamento de Ciência e Gestão da Informação do Setor de Ciências Sociais Aplicadas da Universidade Federal do Paraná. Orientadora: Prof.ª Dr.ª Denise Fukumi Tsunoda CURITIBA 2021 AGRADECIMENTOS Primeiramente, agradeço a Deus que torna tudo possível, inclusive os meus sonhos. Agradeço a minha orientadora Prof.ª Dr.ª Denise Fukumi Tsunoda, por sua tolerância e paciência nos momentos difíceis. Agradeço a Prof.ª Dr.ª Suely Ferreira da Silva e ao Prof. Dr. José Simão de Paula Pinto por fazerem parte da banca examinadora por seus insights. Agradeço ao meu marido Paulo e filhos Giovanna, Guillermo e Luigi pelo apoio, paciência e colaboração. Agradeço aos meus pais e minhas irmãs pelo suporte e incentivo. Aos amigos e colegas do curso que de uma maneira ou outra e influenciaram nesta jornada. Ao Gilson, secretário do curso. E um agradecimento especial a todos os professores do curso de Gestão da Informação, pois o conhecimento e sabedoria transmitidos são inestimáveis. RESUMO A inteligência artificial está se estabelecendo no direito para auxiliar advogados e tribunais. Apresenta uma revisão sistemática aplicada em bases de dados internacionais, relacionando a inteligência artificial e o direito. As bases utilizadas incluem Scopus, Web of Science e EbscoHost, onde os artigos que compõe a base de dados da pesquisa foram selecionados para a realização de análises bibliométricas.
    [Show full text]
  • Polyanalyst Technical Capabilities and System Requirements
    PolyAnalyst 6 Technical capabilities and system requirements © 2007 Megaputer Intelligence Inc. All rights reserved. PolyAnalyst Features Client/Server architecture Server architecture enables the implementation of PolyAnalyst 6 as an enterprise level analytical system. It facilitates the collaboration between data analysts working on the same projects and sharing various related resources such as analysis scenarios, dictionaries, taxonomies, and multi-dimensional matrices. Server architecture helps enhance the performance of the system by performing calculations on the most powerful machines, reducing data transfer over the network, scheduling execution of tasks at a given time, and generating custom reports and condition based alerts for different groups of business users. It provides the centralized management and audit of the list of system users and their actions. Security Recognizing that data is one of the most valuable and sensitive assets of a modern organization, PolyAnalyst provides solid mechanisms to ensure data security. Communications between client and server are performed in a fully encrypted manner with a new encryption key generated by the server for every communication session. PolyAnalyst supports secure user login based on user rights and passwords and keeps track of individual and group rights and sequences of actions carried out by the users. In addition, full compliance with the requirements of the HIPAA legislation facilitates PolyAnalyst implementations at healthcare and insurance organizations. Scalability and Performance PolyAnalyst provides industrial level scalability: it can handle huge amounts of data within reasonable time intervals. This scalability is ensured through a combination of several factors. PolyAnalyst utilizes hard disk instead or RAM for holding all data and supporting meta-information.
    [Show full text]
  • Linguamatics Text Mining Summit 2016 October 17–19: Chatham, MA Welcome to the Linguamatics Text Mining Summit 2016
    Linguamatics Text Mining Summit 2016 October 17–19: Chatham, MA Welcome to the Linguamatics Text Mining Summit 2016 The Linguamatics Text Mining Summit is a place where the text mining community comes together to exchange ideas, network, learn from each other, and explore the latest NLP text mining technology from Linguamatics. There will be discussions on best practice; you will hear real user case studies, and find out how organizations from the life science and healthcare sectors are using NLP text mining every day to gain better insights from their ever-growing data. John M. Brimacombe, Linguamatics Executive Chairman Agenda Monday October 17 11:00am–12:00pm Registration in the Monomoy Foyer 12:00pm–1:00pm Lunch at the STARS restaurant 1:00pm–5:00pm Training workshops: Session 1 1A Introduction to I2E: Tony Wu, Eldridge Room 1B Introduction to I2E—Healthcare focus: Erin Tavano, Alden Room Linguamatics I2E Query Hackathon: David Milward, Monomoy Meeting House 3:00pm–3:25pm Refreshments in the Monomoy Foyer 6:15pm onward Evening social event at The Beach House Tuesday October 18 8:30am–9:00am Registration in the Monomoy Foyer 9:00am start Main day presentations in the Monomoy Meeting House 9:00am–9:05am Introduction Phil Hastings, Chief Business Development Officer, Linguamatics 9:05am–9:20am Welcome address John M. Brimacombe, Executive Chairman, Linguamatics 1 Tuesday October 18—continued 9:20am–9:45am Natural language processing validation of high-risk patient characteristics using Linguamatics I2E Yoni Dvorkis, Data Consultant, Atrius
    [Show full text]
  • Web-Based Text Mining of Hotel Customer Comments Using SAS® Text Miner and Megaputer Polyanalyst®
    Web-Based Text Mining of Hotel Customer Comments Using SAS® Text Miner and Megaputer Polyanalyst® Richard S. Segall Arkansas State University, Department of Computer & Information Technology, College of Business, Jonesboro, AR 72467-0130, [email protected] Qingyu Zhang Arkansas State University, Department of Computer & Information Technology, College of Business, Jonesboro, AR 72467-0130, [email protected] Mei Cao University of Wisconsin – Superior, Department of Business and Economics, Superior, WI 54880, [email protected] ABSTRACT This paper presents text mining using SAS® Text Miner and Megaputer PolyAnalyst® specifically applied for hotel customer survey data, and its data management. The paper reviews current literature of text mining, and discusses features of these two text mining software packages in analyzing unstructured qualitative data in the following key steps: data preparation, data analysis, and result reporting. Some screen shots are presented for web-based hotel customer survey data as well as comparisons and conclusions. Keywords: Web-based data, Algorithms, Web data management, Text mining, SAS® Text Miner, Megaputer PolyAnalyst®, Unstructured Data, Hotel Customer Surveys INTRODUCTION The increasing use of textual knowledge applications and the growing availability of online textual sources caused a boost in text mining research. This paper reviews some current literature of text mining and presents comparisons and summaries of the two selected software of SAS® Text Miner and Megaputer PolyAnalyst® for text mining. This paper discusses features offered by these two-selected software in the following key steps in analyzing unstructured qualitative data: software composition for text mining, data preparation, data analysis and result reporting. This paper also provides insight with the data management example of application to an actual web-based dataset for hotel customer survey data.
    [Show full text]
  • Advanced Analytics: the Hurwitz Victory Index Report
    HURWITZ VICTORY INDEX Advanced Analytics: The Hurwitz Victory Index Report HURWITZ VICTORY INDEX IBM DOUBLE VICTOR Marcia Kaufman COO and Principal Analyst Daniel Kirsch Senior Analyst HURWITZ VICTORY INDEX Table of Contents Executive Summary . 3 I . Introduction . 4. II . Victory Index Methodology . 6 III . Market Trends in Advanced Analytics . 8 . IV . Customer Examples in Advanced Analytics . .12 . V . Vendor Scores: Victors, Leaders, and Challengers . 14 Go to Market Strength . 14. Customer Experience Strength . 18 VI . Vendor Assessment: IBM . 21. Advanced Analytics : The Hurwitz Victory Index Report Page 2 © 2014, Hurwitz & Associates HURWITZ VICTORY INDEX Executive Summary Companies use advanced analytics to discover patterns and anomalies in large volumes of data, and then use this insight to predict the outcomes of future events and interactions . In addition, advanced analytics is used for optimization and complex event processing and analysis . With advanced analytics, your There is enormous organization can adjust its plans and strategies to become more competitive, opportunity in the advanced minimize potential risk and optimize decision-making in real time . In this analytics market as more report we assess the performance of major providers of advanced analytics solutions and provide insight into how their customers are benefiting from these customers begin to move solutions . out of their comfort zone of traditional uses for analytic The Hurwitz Victory Index scores vendor performance across four dimensions tools. Some customers may – Vision, Viability, Validity, and Value . Each dimension measures important need help getting started; components of a vendor’s overall ability to deliver innovative solutions, however, we expect to see outstanding customer service, and the business and technical value customers explosive growth in this demand .
    [Show full text]
  • New Capabilities of Polyanalyst Text and Data Mining Applied to STEADES Data at the International Air Transport Association (IATA)
    New Capabilities of PolyAnalyst Text and Data Mining Applied to STEADES Data at the International Air Transport Association (IATA) A Technology Demonstration In Partnership with the Federal Aviation Administration and the Global Aviation Information Network (GAIN) Report Prepared by: Dr. Sergei Ananyan President Megaputer Intelligence Mr. Michael Goodfellow Assistant Manager, STEADES International Air Transport Association (IATA) October 2004 Disclaimers; Non-Endorsement All data and information in this document are provided “as is,” without any expressed or implied warranty of any kind, including as to the accuracy, completeness, currentness, noninfringement, merchantability, or fitness for any purpose. The views and opinions expressed in this document do not necessarily reflect those of the Global Aviation Information Network or any of its participants, except as expressly indicated. Reference in this document to any commercial product, process, or service by trade name, trademark, servicemark, manufacturer, or otherwise, does not constitute or imply any endorsement or recommendation by the Global Aviation Information Network or any of its participants (e.g., FAA) of the product, process, or service. This project report and the results are based on a time-constrained proof-of-concept demonstration carried out by Megaputer Intelligence for the International Air Transport Association (IATA) STEADES group and involved analysis of only a subset of data. Therefore, the results should not be used to form any conclusions or business decision.
    [Show full text]
  • Ustracija Suviše Malog Izbora (A) I Suviše Velikog Izbora (B)
    UNIVERZITET SINGIDUNUM DEPARTMAN ZA INFORMATIKU I RAUNARSTVO DEPARTMAN ZA MENADŽMENT Vladislav Miškovic SISTEMI ZA PODRŠKU ODLUIVANJU Prvo izdanje Beograd, 2013 SISTEMI ZA PODRŠKU ODLUIVANJU Autor: dr Vladislav Miškovic Recenzenti: dr Milan Milosavljevi dr Mladen Veinovi dr Boško Nikoli Izdava: UNIVERZITET SINGIDUNUM Beograd, Danijelova 32 www.singidunum.ac.rs Za izdavaa: dr Milovan Staniši Tehnika obrada: Vladislav Miškovic Dizajn korica: Aleksandar Mihajlovi Godina izdanja: 2013. Tiraž: 550 primeraka Štampa: Mladost Grup Loznica ISBN: 978-86-7912-509-5 Copyright: © 2013. Univerzitet Singidunum Izdava zadržava sva prava. Reprodukcija pojedinih delova ili celine ove publikacije nije dozvoljena. Sadržaj 1 Uvod u sisteme za podršku odluivanju ..................................................................... 1 2 Osnove teorije odluivanja ....................................................................................... 25 3 Modeliranje i analiza procesa donošenja odluka ...................................................... 51 4 Raunarska podrška odluivanju: koncept i tehnologija sistema za podršku ........... 83 5 Poslovna inteligencija, skladištenje podataka i otkrivanje znanja ............................ 95 6 Složeno odluivanje: grupno donošenje odluka ..................................................... 108 7 Inteligentni sistemi: podrška donošenju složenih odluka ....................................... 122 8 Metodi i tehnike mašinskog uenja ........................................................................ 147 9 Inteligentna
    [Show full text]
  • Polyanalyst™ 6.5 TECHNICAL CAPABILITIES and SYSTEM REQUIREMENTS
    PolyAnalyst™ 6.5 TECHNICAL CAPABILITIES AND SYSTEM REQUIREMENTS PolyAnalyst™ is comprehensive data and text mining system providing tools for handling all steps of a typical data analysis process: from data loading, integration, manipulation and cleansing, to advanced text analytics, machine learning and knowledge discovery, and to versatile results visualization and reporting. Tools for the analysis of structured and textual data in a single environment enables joint processing of all available data for better insights. © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES VERSION 20.19 R.1 FEATURES / PolyAnalyst™ Web Reports PolyAnalyst™ Server PolyAnalyst™ Clients DOCUMENTS / EMAILS DATA LOADING REPORTING DATA STORAGE ALERTING ANALYSIS SCHEDULING WEB / SOCIAL MEDIA OLAP Web Report Editor DATABASES Administrative Client ODBC / OLEDB Figure 1. PolyAnalyst architecture. Client/Server architecture Security Server architecture enables the implementation of Recognizing that data is one of the most valuable and PolyAnalyst™ 6.5 as an enterprise level analytical sensitive assets of a modern organization, PolyAnalyst™ system. It facilitates the collaboration between data provides solid mechanisms to ensure data security. analysts working on the same projects and sharing Communications between client and server are various related resources such as analysis scenarios, performed in a fully encrypted manner with a new dictionaries, taxonomies, and multi-dimensional encryption key generated by the server for every matrices. Server architecture helps enhance the communication session. PolyAnalyst supports secure performance of the system by performing calculations user login based on user rights and passwords and on the most powerful machines, reducing data transfer keeps track of individual and group rights and over the network, scheduling execution of tasks at a sequences of actions carried out by the users.
    [Show full text]
  • 5F5=75=BK ˆ ˜˘ 2021.Xlsx
    Приложение к приказу факультета компьютерных наук от_________________ №_________________ Список рецензентов выпускных квалификационных работ Ученая степень, Место работы Тема на английском Руководитель из Департамент/ Фамилия Имя Отчество Тема работы Рецензент ученое звание рецензента, занимаемая языке НИУ ВШЭ Кафедра рецензента должность Frontend of the департамент Клиентская часть программной Software System for Паринов Андрей анализа данных и Белова Наталья Азаров Евгений Сергеевич системы подбора площадок для к.т.н доцент ДПИ ФКН Filming Locations Андреевич искусственного Сергеевна киносъемок Selection интеллекта 1 Web-приложение для Web Application for осуществления поиска в базе Searching the Database Виноградова Ольга Школа Манахов Павел Алешина Айсылу Наилевна к.т.н. доцент ДПИ ФКН данных студенческих текстов на of Students Texts in Ильинична лингвистики Алексеевич 2 английском языке REALEC English REALEC Search Application for Приложение поиска исполнителей департамент Performers of Common Брейман Александр Меликян Алиса кандидат наук по Алтунян Айк Эдвардович повседневных бытовых услуг. программной доцент ДПИ ФКН Household Services. Давидович Валерьевна образованию Клиентская часть инженерии 3 Frontend Система для тренировки и System for Machine департамент Брейман Александр Ломазова Ирина Аль-Шедиват Радуан Файсал использования моделей машинного Learning Model программной д.ф.-м.н., профессор профессор ДПИ ФКН Давидович Александровна 4 обучения Training and Usage инженерии департамент iOS приложение для проката iOS Application
    [Show full text]