Quick viewing(Text Mode)

Visual Analysis of Periodic Time Series Data

Visual Analysis of Periodic Time Series Data

Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. in 4 koe 2020 Oktober 14. Wien, is israinhbnbegutachtet: haben Dissertation Diese Filzmoser Peter Dr.techn. Dipl.-Ing. Univ.Prof. Zweitbetreuung: Miksch Silvia Dr.rer.soc.oec. Mag.rer.soc.oec. Univ.Prof. Betreuung: Wien Universität Technischen der Informatik für Fakultät der an ohrae muainudAusreißererkennung und Imputation Vorhersage, iulAayisfrdeModellauswahl, die für Analytics Visual eidshnZeitreihen periodischen otrdrTcnshnWissenschaften Technischen der Doktor -00Wien A-1040 iuleAayevon Analyse Visuelle u ragn e kdmshnGrades akademischen des Erlangung zur il-n.Mru öl BSc Bögl, Markus Dipl.-Ing. alpaz13 Karlsplatz arklumr00625252 Matrikelnummer ehiceUiesttWien Universität Technische DISSERTATION osMaciejewski Ross igrih von eingereicht e.+43-1-58801-0 Tel. www.tuwien.at aaa Turkay Cagatay aksBögl Markus Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. dio:Ui.rf a.e.o.e.D.e.o.e.Sli Miksch Silvia Dr.rer.soc.oec. Mag.rer.soc.oec. Univ.Prof. Advisor: ina 14 Vienna, h israinhsbe eiwdby: reviewed been has dissertation The eodavsr nvPo.Dp.Ig rtcn ee Filzmoser Peter Dr.techn. Dipl.-Ing. Univ.Prof. advisor: Second Wien TU the at Informatics of Faculty the to iulAayi fProi Time Periodic of Analysis Visual muain n ule eeto sn Visual Using Detection Outlier and Imputation, th uprigMdlSlcin Prediction, Selection, Model Supporting coe,2020 October, umte nprilfllmn fterqieet o h ereof degree the for requirements the of fulfillment partial in submitted otrdrTcnshnWissenschaften Technischen der Doktor -00Wien A-1040 il-n.Mru öl BSc Bögl, Markus Dipl.-Ing. eitainNme 00625252 Number Registration alpaz13 Karlsplatz eisData Series ehiceUiesttWien Universität Technische DISSERTATION osMaciejewski Ross Analytics e.+43-1-58801-0 Tel. by www.tuwien.at aaa Turkay Cagatay aksBögl Markus Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. otatoe e innc nnme id u ee alutrAgb e uleals Quelle der Angabe unter Fall jeden auf sind, entnommen nach Sinn dem oder Wortlaut in 4 koe 2020 Oktober 14. Wien, ule n ifmte oltni neee aeudds c i tle e ret– Arbeit der habe. gemacht im Stellen kenntlich Internet Entlehnung die dem oder ich Werken anderen dass die –, und verwendeten Abbildungen die habe und ich Karten angegeben Tabellen, dass einschließlich vollständig habe, verfasst Hilfsmittel selbständig und Arbeit diese Quellen ich dass ich, erkläre Hiermit BSc Bögl, Markus Dipl.-Ing. rlrn u efsugdrArbeit der Verfassung zur Erklärung aksBögl Markus v Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ü igtudFrederik und Birgit Für ü en Familie meine Für Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. inr lxne id hitn idrr n aksWge;m nentoa collaborators international my Wagner; Markus and Niederer, Christina Rind, Alexander Aigner, Paolo Amor-Amorós, Albert Alsallakh, Bilal Schetinger, Victor Roger Gschwandtner, Piccolotto, Nikolaus Theresia Filipov, Leite, Velitchko A. Ceneda, Davide Bors, Christian Arleo, Alessio in lecture His Filzmoser. Peter advisor, second my thank to want CAT,fne yteAsra eea iityo cec,Rsac,adEooyi the in Economy and Research, Science, of Ministry Federal Austrian the by funded (CVAST), era h tdn.Ia seilygaeu oteognzr o hsaaigopruiyto opportunity amazing this for organizers the to grateful especially am I student. PhD a as year PhD. my pursuing while analytics visual yrsac a nnilyspotdb h etefrVsa nltc cec n Technology and Science Analytics Visual for Centre the by supported financially was research My forget never will me. doubts, I for my did overcome concepts. you musical me everything checking pastries, helping and dinner, babysitting, translating, vegetables, (lunch, made proofreading, gestures food and questions, tiny answering fruits bringing/preparing many fresh So possible: bringing dissertation cake), program. and this PhD of my extended phase my during final from me people the supported magnificent and several for helped gratitude who great family my express project. to research want international I Finally, an for my cooperation build in helped resulted Jörn eventually greatly notably, and papers—most This network and Andrienko. research books Gennady the and from Natalia knew Maciejewski, I Ross Kohlhammer, researchers famous those first my person in during meet 2013, in in participate UK, University, to Middlesex opportunity at the School had Summer the I Analytics as Visual feedback. well last valuable as great the and events the efforts, other for time, and community colloquium, their analytics for doctoral visual reviewers conferences, and workshops, visualization talks, the the in at people time all thank to Schumann. want Heidrun I and Lastly, Rostock, Tominski, Universität Christian from Eichner, those Christian and Luboschik, Kohlhammer; Martin Röhlig, Jörn and Martin Bernard Jürgen Wolfgang Darmstadt, Pölten, TU from St. Wien, FH TU the at from Department collaborators Statistics my Computational Mühlmann; the Christoph from and collaborators Nordhausen my Klaus to grateful also am I Lammarsch. Tim and Wien: Federico, TU at group research CVAST the from colleagues former and current all thank to and want analysis I data exploratory statistics, in interest my continue to me allowed Peter and Silvia I addition, In dissertation. encouragement, my support, finish eventually her for to Miksch, push Silvia final advisor, first and my attitude, to positive you enthusiasm, thank special a say to want I tre yjunyt iulaayis n h nedsilnr oprto between cooperation interdisciplinary the and analytics, visual to journey my started Acknowledgments xlrtr aaAayi and Analysis Data Exploratory ix Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. #283;teFFb BidSuc eaaini ieadSae (#P31881-N32). Space” and Time in Separation Methods” Source Analytics “Blind Visual by with FWF Hypotheses the Modeling (#P22883); “HypoVis: by FWF the (#I2850-N31); rcdr DC)“ISC:Vsa emnainadLbln fMliait ieSeries” Time Multivariate Agency of Lead Labeling the and by Segmentation Doctoral (FWF) Visual “VISSECT: the Fund (DACH) by Science Procedure Wien Austrian TU the Informatics; the Environmental (#822746); for initiative College Excellence of Centre Bassi Laura exceptional Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. alrzs nesüz id auemgih e nazdevsel nls e eteh,bietet Zeitreihe, der Modellaus- Analyse gesamte visuelle der die Ansatz dem der mit ermöglicht vor, Dazu Analytics-Ansatz wird. Visual unterstützt wahlprozess einen wir stellen Schritt ersten Als untersuchen und identifizieren Muster darunterliegenden die um explorieren, zu Analytics Visual zu Ausreißer und beurteilen zu Werte fehlende geschätzte interpretieren, zu erkannt besser Vorhersagen Periodizität die solange Vorteil, von Eigenschaft diese eben ist Zeitreihen von Analyse cäzne o elne aepntn uvresr n is uvroltnie.Danach vervollständigen. zu diese und verbessern zu Datenpunkten, fehlenden von Schätzungen für ( Darstellungsform Zeitreihen spezielle eine periodische wir verwenden Schritt in nächsten die Im Modellparameter wir Modellauswahlprozess. und den untersuchen Modelle Weiters gewählten Daten. der Vorhersagemöglichkeiten die genaue der die für Zusammenführung für Modelle sowie der Modellparameter Angemessenheit und bezüglich Modelle Diagnose der Auswahl die für Orientierungshilfe eine beeinflusst. positiv Zeitreihen die von Berechnungen, Analyse statistischen herausfordernde mit und mit Interaktionstechniken gemeinsam mit Zeitreihen Wahrnehmung, periodische wie menschlicher auf bzw. Betrachtungswinkel können werden visueller unterstützt neuer Verflechtung bestmöglich wird Aufgaben die es diesen und in untersucht Nutzer*innen all Analytics-Methoden wie Für Visual Ausreißern. ermittelt, von von Anwendbarkeit Erkennen die das wird sowie Schritte Werten diese fehlenden Modelle, von der Vorhersagequalität Ersetzen der und Berechnen Überprüfen das das Parametrisierung, Zeitreihenmodelle, der passender in möglichst Unterstützung Auswahl die die Zeitreihen, und von Zeitreihen Explorieren periodische das Speziellen umfassen Zeitreihen- im betreffen der Herausforderungen in Diese Herausforderungen betrachtet. verschiedenste analyse werden Dissertation vorliegenden der In von mithilfe Daten können. die zu oder Möglichkeit erkennen eine zu besteht Darstellungen identifizieren, visueller zu geeigneter solche mittels Um sie sein. gegeben versteckt darin, Daten selbst der Daten Zusammenhang den dem in aus oder direkt können Muster periodischen auszuwählen, Die Zeitreihenmodelle erkennen. passende helfen, sie kann Dann wird. die abgebildet Für richtig Muster. wiederkehrende und periodisch durch Datensätzen vielen in sich Eigen- zeigt strukturelle Zeit Diese gemessenen der ist. der schaft bedingt Struktur Kalenderstrukturen natürliche dahinterliegende die durch treffen. durch oder die zu Phänomene Zeitreihen, Entscheidungen von darauf, aufgezeichnet, Eigenschaft basierend eine wird ist und Daten Periodizität Na- analysieren von zu Wirtschaft, Art festzuhalten, in Diese Messungen mehr. es um dergleichen sei und präsent, Medizin Forschungsbereichen turwissenschaften, unterschiedlichen in sind Zeitreihen Zyklengraph bzw. yl cycle ,u i muain a eß,depassenden die heißt, das Imputation, die um ), Kurzfassung xi Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. et oi a dnizee o urienvresr z.es ermöglicht. erst bzw. verbessert Ausreißern von Identifizieren das sowie Werte nedaki e näz uc uzrinnznre n rnlc ecreeeFunktiona- beschriebene gründlich und Nutzer*innenszenarien durch und Ansätze Nutzen sowie der Anwendbarkeit verwendet Designprozesse nutzer*innenzentrierte iterative wurden Ansätze o muainnudAseßr.DeFrcugegbis egn asdeevselnDarstel- visuellen diese dass zeigen, Forschungsergebnisse Die Ausreißern. und Imputationen von ein Analytics-Ansätzen Visual bei erlaubt Darstellung visuellen der in Periodizität vorhandener u i ae itn ai iddeAsalagmsee oel z.Mdlprmtr das Modellparameter, fehlender bzw. Imputation Modelle die angemessener Modellauswahlprozess, Auswahl im Vorhersagemöglichkeiten Sichtweisen die der andere wird Miteinbeziehen Zeitreihen Damit von bieten. Analyse Daten die für die Zeit auf der Struktur Berücksichtigung periodischen gleichzeitiger speziellen bei der Datenabstraktion passenden der mit gemeinsam lungsformen Vorhersagen, von Zeitreihenmodellen, angewandten von Nachvollziehen und Verstehen besseres von Miteinbeziehen Das diskutiert. Forschungsmöglichkeiten sowie weitere beschrieben und Ansätze Fragestellungen dieser offene Auswirkungen die werden Abschließend bestätigt. litätstests vorgeschlagenen dieser jeden in Für Ausreißern kann. von werden Erkennen verwendet das Zeitreihen für periodischen und multivariaten konstruiert mithilfe Zeitreihen Zyklengraph multivariate dieser für wie Abstraktion Datenabstraktion, dieser neuartigen einer Verwendung unter wir zeigen Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. efis rps iulaayisapoc o uprigtewoepoeso selecting of process whole the supporting for approach analytics visual a propose first We epeetanvlasrcinmto oueaccepo ersnainfrmliait time multivariate for representation plot cycle a use to method abstraction novel a present we edsusdteipiain fsc ehd n ocue ihoe hlegsi these in challenges open with concluded and methods such of implications the discussed we iesre aaaeesnili ayfils ieeoois aua cecs n medicine, and sciences, natural economics, like fields, many in essential are data series Time eisa eli re ouei o ule eeto nproi iesre.Frec fthe of each For series. time periodic in detection outlier for it use to order in well as series onm e.Mauigadrcrigteedt lo st ouet nlz,admake and analyze, document, to us allow data these recording and Measuring few. a name to tutr ftm,alw o h nlsso iesre rmdffrn esetvsadprovides and perspectives different from series time of periodic analysis the the for considering allows when time, abstraction, The of and structure representation outliers. visual and adequate imputations, that predictions, approaches indicate models, analytics results applied visual the the comprehending into better periodicity for visualizing allows on focus additional Furthermore, Integrating walk-throughs. topics. illustrated thoroughly and utility scenarios the usage showed in we process; approaches design the user-centered of iterative an employed we solutions, proposed Thereafter, series. time periodic cycle in a employ values we missing Next, of imputation process. the selection support model to the representation into integrate plot model to the how investigate the of then guiding capabilities We while prediction diagnostics. the series model time and of exploration parametrization, visual selection, the model allowing models, series time appropriate analyzing in user the series. with fosters time together computations periodic visualization statistical tasks and using these techniques, series in interaction time users perception, periodic support user on can perspectives analytics new visual intertwining how investigate how we and steps, detecting these and all the values, For supporting missing models, outliers. imputing series series, performance, time time prediction appropriate periodic the for examining selecting analysis parametrization, series, series time time the of exploring stages with different starting the consider we dissertation, exploration, this the analysis.In for Visualization visual allow using and itself. patterns easily data underlying patterns such the such of in identify investigation hidden to and or perception identification, context human for the way to one due These is obvious detection. be outlier can and modeled a patterns imputation, and In prediction, periodic correctly selection, reoccurrences. identified model periodic if like by beneficial, tasks series mostly for time adequately, are these properties One of these many analysis, structure. in series calendar which time time underlying periodicity, of is the property areas or structural all measured the across phenomena finds series natural time the in either structures from natural stems most the of One decisions. Abstract xiii Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. aus n dniyn outliers. identifying and values, osblte o dniyn dqaetm eismdl,spotn h muaino missing of imputation the supporting models, series time adequate identifying for possibilities Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Introduction 1 Exposition I Contents Abstract Kurzfassung nertn rdcin nTm eisMdlSlcin53 Selection Model Series 29 Time in Predictions Integrating 3 Analysis Series Time in Selection Model for Analytics Visual 2 Development II . sg cnro...... 57 54 . . . . 54 ...... 46 ...... 50 ...... 31 37 ...... 30 ...... 32 ...... 38 ...... Scenario ...... Usage ...... Approach . . . . . Analytics 3.3 . . Visual ...... 3.2 . . . . Introduction ...... 25 ...... 3.1 . . 24 ...... 14 ...... 19 Work ...... Future . and ...... Conclusion ...... 10 . . . . Analysis . . . 2.7 Series Evaluation . . . . . Time . . . in . . . . . Selection . . . Model . . 2.6 . . . for . . . VA ...... Analysis 2.5 Requirements ...... Characterization . 2.4 ...... Problem 5 ...... Work 2.3 Related ...... 4 ...... 2.2 . . . . Introduction ...... 2.1 ...... Contributions . and . . . . . Aim ...... Questions 1.7 . Research ...... Analysis . Series 1.6 . . . Time ...... 1.5 . Methodology ...... Analytics 1.4 . Visual . Visualization and . Analysis 1.3 Data Analysis Problem and 1.2 Motivation 1.1 Contents xiii 27 xv xv xi 3 1 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Bibliography Tables of List Figures of List h utvraeCcePlot Cycle Multivariate The 5 61 Series Time Seasonal Univariate in Imputation Guided Statistically and Visually 4 Conclusion 6 Coda and Recapitulation III . ulctos...... 104 98 . . . . 93 ...... 95 . . . 98 ...... Publications . . Opportunities . Future . . and . Challenges 6.5 . . Open . . . . . 6.4 . Conclusion . . . Revisited Questions 6.3 . Research . 6.2 Summary 6.1 . eae ok...... 62 65 62 ...... 63 ...... 58 ...... 58 ...... Conclusion . . . . . and Discussion ...... Approach Imputation 4.4 . . . . Time-Series ...... Work 4.3 Related ...... 4.2 . . Introduction . . . . 4.1 ...... Conclusion and . Discussion . Work 3.5 Related 3.4 . ocuin...... 85 81 . . . 79 ...... 70 ...... 69 ...... 68 . . . 86 ...... 75 ...... 72 ...... Scenario . . . . Usage . . . for . . . . Material . . . . Supplementary ...... Appendix: ...... 5.9 . . Conclusion ...... 5.8 . . . . Discussion ...... Scenario 5.7 . Usage Environment . . . Exploration . Interactive . . . the of 5.6 . . Measures . Features Distance . for . . Requirements and . . . 5.5 Abstraction Task ...... 5.4 . . Background . . . Work 5.3 Related . . 5.2 Introduction 5.1 109 107 111 89 67 91 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Exposition atI Part 1 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ntigt hc ofimtr rcdr a o xlctyatce a decried was attached explicitly not was procedure confirmatory a which to Anything ste mhszdeatcnrain hi ehiusieial eaeless became inevitably techniques their confirmation, exact emphasized they As oa,epoaoyadcnraoycnadsol—rce ieb side. by side should—proceed can—and confirmatory and exploratory Today, s‘eedsrpiesaitc’ omte o uhw a ere rmi.[...] it. from learned had we much how matter no statistics’, descriptive ‘mere weakened.as was insights past with techniques used most the of connection The flexible. “ xcl—ocnr e hnseaty ahudrvr pcfi circumstances. specific very under each exactly, things few a confirm exactly—to neuo ie ttsiin nyepoe.Te hylandt confirm to learned they Then explored. only statisticians time, a upon Once onW ue,17.[u7,p vii] p. [Tuk77, 1977. Tukey, W. John Introduction CHAPTER ” 3 1 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 4 1. Introduction . oiainadPolmAnalysis Problem and Motivation 1.1 nlss oeatn n Control and Forecasting Analysis: codn oBsar n uac [ Kulahci and Bisgaard to According 6;atog hs aedrcnet r ua ae hsvr neetn eidcbhvo of behavior periodic interesting very this made, human are concepts calendar these although 46]; h ugse oe eeto prahb o n ekn sue nms etok ntime on textbooks most in used is Jenkins and Box by approach selection model suggested The iulzto eaefrmr oceeadeetv [ effective and concrete more technology. far and became science visualization in another or form one in existed always has data statistical of visualization tcapoc htealspattoest pl iesre ehd nfrcsig [ forecasting” in methods series time apply to practitioners enables that approach atic .. u ihteapaac fJh ue’ ineig17 book, 1977 pioneering Tukey’s John of appearance the with But [...] they [ and models” data, such to of fitted of validity models structure the mathematical the assess complex revealing to view in us to overall compete allow us see can allow to approach Graphs data other thoroughly. explore no so to behavior; data us detailed allow see Graphs to analysis. and data patterns to a critical [ lines, is color” display points, and “[d]ata of states, shading, use words, combined symbols, the numbers, of system, means coordinate by quantities measured display[ing] “visually as book famous his In [ as such analysis, series forecasts interval and point produce to (e) economic and of [...], [ adjustment analysis [...]” regression seasonal improve perform to to (d) [...], (c) data variables, between relationship dynamic the finance, [ few” as a such name fields to different management, many operations “in and applied healthcare, is engineering, analysis economics, series Time frustrating. and cated history. longer even an has contributed forecasting have that [ statisticians Tsay states of to generations According and field. data this [ series to mathematics, time time time over A of In variables analysis analysis. of statistical series measurements of so. time sequential history called do of is set to application a and techniques is research series appropriate of area requires this knowledge, and statistics, and ever in information specifically meaningful than for important data the more analyzing “Data is as [ nowadays. well measured, common” as data and extremely data, transaction collected are these data of time Collecting of over amounts sequentially amount huge vast collected to observations a centuries from of past obtained basis the the of is data it weather life, to from central phenomena. so natural is of time kinds Because all in and nature in everywhere found is [ time concepts calendar simple are time structuring philosophy, and in years documenting of of [ disciplines” thousands sources literally oldest other over many discussed and biology, is] astronomy, and physics, mathematics, [... mankind for even interest philosophers “The of nor everything. been field) [ and any exactly” everywhere (from define scientists is to neither Time how that know it. enigma true with a deal is to [time] has entity Everybody temporal time. escape can one No sdi uiesadeoois ()t td h yai tutr fapoes b oinvestigate to (b) process, a analysis of series structure time dynamic of the study purposes to main “(a) the listed economics: Tsay and forecasting, business and in series used time of history brief a Tsa00 .68.Acrigt sy h 90lnmr okb o n Jenkins, and Box by work landmark 1970 the Tsay, to According 638]. p. , h iulDslyo uniaieInformation Quantitative of Display Visual The BJR08 CB14 , BD91 Tsa00 [ BJ70 .20.“h udmna hnmnno iehsalways has time of phenomenon fundamental “The 250]. p. , BK11 , CC08 iesre nlssrahsbc o12,atog he although 1927, to back reaches analysis series time ] ,“a nipratmlsoe[.]I rvddasystem- a provided It [...] milestone important an “was ], ,mn rciinr n iesre nlsscompli- analysis series time find practitioners many ], , SS11 Cle94 , Ham94 .5.Ceeadfrhrage ht“[t]he that argues further Cleveland 5]. p. , Tuk77 , BK11 .Sne17,cagsi computer in changes 1977, Since ]. CC08 , BD02 ut defines Tufte , Tuf83 xlrtr aaAnalysis Data Exploratory , Cle94 , AMST11 CM09 .9.A Cleveland As 9]. p. , .Teei great a is There ]. BK11 , Pfa08 Tsa00 aagraphics data .4] The 45]. p. , AMST11 CC08 ieSeries Time .1.In 1]. p. , , .639]. p. , Bro11 .1]. p. , p. , ]. , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . aaAayi n Visualization and Analysis Data 1.2 speiul etoe,i ttsis ue [ Tukey statistics, in mentioned, previously As famous his presenting should While computer “[a] process. that this statistician—stated famous of Anscombe—another adjustments quartet, iterative Anscombe’s with runs multiple as well h i fti israini opeetvsa nltc prahsta upr ouin to solutions support that approaches analytics visual present to is dissertation this of aim The analytics visual of area research interdisciplinary highly the challenges, such with deal To visual computations, data, involves that task challenging and tedious a is analysis series Time introduce to was manageable effectively and easily more it make to solution proposed Tukey’s iulzto,aditrcin h dacmn fcmuainaditrciecmue systems, computer interactive and computation of advancement The interaction. and visualization, make h eta on st oka h aa(rpial rnmrcly n eemn h eut in results the determine and numerically) or (graphically data the where at analysis, look data to exploratory is consider point also central data—to the the on based hypotheses testing basically selection, model including prediction. analysis, and series imputation, time detection, statistical outlier of exploration, field parametrization, the in challenges these of some as challenges, such analysis. support series to time techniques in and apparent approaches new completely allowed have analytics, [ [ computer” the through pass one than more for calls generally and matter, routine [ understanding” to his in demands Tukey like analysis, asks data process [ confirmatory described book and this famous exploratory Essentially, often combined found. is a workflow is for of model only kind adequate not This an until cycle. times this models multiple repeating other repeated reselecting and applications, parameters, model model these the of adjusting outcome models, and/or results, the the model the at at of looking about looking models, parameters outcome, the knowledge initial applying model domain on the checking deciding with parameters, models, model insights adequate computing/estimating selecting the models, combining series adjusting metrics, bytime metrics, achieved these computing are data, mentioned, visualizing the previously visualizing data, visually, as and analysis, numerically series data time the of exploring at purposes data the the Usually of domain hand. application the experts—in practitioners—generally and representations, imagined challenge that we data and in effects, structure 1]. unimagined discover intricate p. We revealing [Cle93, way. attack, ones” other is of any “[v]isualization line in explains, front absorbed Cleveland a be As provides cannot It plots. analysis. of or data representation graphs to visual statistical a critical called is analysis is data it exploratory whether of data, aspect central a make [ analysis; to mind” data is his exploratory data mind, of her body minds, any minds—our about by handleable problem effectively basic and [ “[a] easily goals” himself, Tukey more its by it not quote but original visualization, an out to back carry we how changed have systems oehrwt eerhpors aei aavsaiain ua optritrcin n visual and interaction, computer human visualization, data in made progress research with together nlss hr h anie a osittefcsfo anycnraoydt analysis— data confirmatory mainly from focus the shift to was idea main the where analysis, TC05 both , KKEM10 calculations Tuk77 a nrdcdwt h eea dat obn n netiecomputation, intertwine and combine to idea general the with introduced was ] Ans73 .vi,btas o obnn opttosadvsa ersnain as representations visual and computations combining for also but vii], p. , and .1] eas rudta []o ttsia nlssi o purely a not is analysis statistical “[g]ood a that argued also He 17]. p. , rps ohsrso uptsol esuid ahwl contribute will each studied; be should output of sorts Both graphs. Tuk77 nrdcdteae fepoaoydata exploratory of area the introduced ] ..Dt nlssadVisualization and Analysis Data 1.2. Cle93 .2.Going 2]. p. , Ans73 Tuk77 .17]. p. , .v]. p. , 5 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 6 1. Introduction codn oCeead sqoe ale,tehsoyo vsaiaino ttsia aahsalways has data statistical of “visualization of history the earlier, quoted as Cleveland, to According ai ndt cec oaas h ehooia n cetfi dacmn nmathematics, in advancement scientific and technological The nowadays. science data in valid etfruei oatr col”[ schools” monastery in use for text ae10s h r eeal ena h netr fmdr rpia eini h literature, the in design graphical modern of inventors the years as 800 seen some generally up are shows who 1700s, time-series of late plotted history a the of in graphic wonder extant isolated later”[ next and mysterious the a since as graphics, “appears line data graph the known when oldest graph, this the Tufte can of For one left and the Venus, on by repeat is to right. which chosen meant the duration, is is at periodic x-axis pattern ends longest The periodic the other. the of each that period to recognize full relation easily time, one over in show function them to shows a way phenomena and as many a orbits, orbits of in the planetary periodicity of the the like inclinations captures nature, of also form in but this graph, uses time-line only a not representing it essentially that is of 1.1, function Figure a in as orbits planetary the of inclinations the of plot a [ time” represent to “meant graph a to [ [ [ technology” representations and graphical science of in history another or form one in existed Press. [ (1936) Funkhouser Source: Image by published unknown, author 11 literature, possibly in found graph series [ time Funkhouser first the Possibly 1.1: Figure today. hold still analysis data aspects exploratory main on the ideas Tukey’s although of techniques, powerful goals including yet and visualizations, complex advanced more of create to creation interactions, the advanced for allowed science still computer and Tukey’s and central statistics, are them. analysis reject data or exploratory confirm of to concepts order the in including tested statistics, be to then contributions can that hypotheses generate to order Tuf83 Tuf83 , Fun36 AMST11 th .2] ut eest okb ila lyaradJhn enihLmeti the in Lambert Heinrich Johann and Playfair William by work to refers Tufte 28]. p. , .20 rmtetnho osbyeeet etr,wihsest ea“compiled a be to seems which century, eleventh possibly or tenth the from 260] p. , Fun36 century. ,a ela ntxbosaotsaitcltm eisaayi [ analysis series time statistical about textbooks in as well as ], .I hw lntr risa ucino ie tdtsfo h 10 the from dates It time. of function a as orbits planetary shows It ]. Fun36 Fun36 RG10 .21,Osiris 261], p. , .20.Tefsiaigtigaotti rp,shown graph, this about thing fascinating The 260]. p. , , Fun36 , Fun37 © 96 oreyo h nvriyo Chicago of University the of courtesy 1936, n nomto iulzto,sc as such visualization, information and ] Cle93 .2.Teltrtr bu the about literature The 2]. p. , CC08 ,alrefer all ], th or Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 81 h rpsipoe vrteyas u h oeeeet r h same. the are elements core the but years, the over improved graphs The 1801. codn oFnhue,WlimPafi myb aldtefte ftegahcmto in method graphic the of father the called be “may Playfair William Funkhouser, to According oiia Atlas Political iln,i e ril bu al xeietlgah [ graphs experimental early about article her in Tilling, his published Playfair that explains Funkhouser work, Playfair’s of significance the clarify To rteiinof edition first setal,bt lyaradLmetaepeue ob h rtt pl iuldt analysis, data visual apply to first the be to presumed are Lambert and 201]. Playfair p. both [Til75, Essentially, period its determine and variation natural periodic a a detect of titled pattern Lambert periodic by the as paper on a translated variation focuses to the again refers shows also Lambert which Tilling by 1.3, graph phenomenon. Figure This in shown graph temperature. as notable soil variation, One in periodic of results. analysis experimental analyzing graphical or the presenting is for graphs to contributions and and discovery the for method 280]. new p. accompanied the [Fun37, he of trends” addition, advantages economic In the of of analysis 1.2). know expositions Figure still pointed (see we data as “with of visualization pie these the and for graph, nowadays [ bar them [...]” graph, use future circle and the graph, in were far line data still contributed quantitative was essentially reliable method He statistical of of collections development few language, the English and available the in appeared yet not had [ statistics” AMST11]. RG10, Tuf83, Til75, [Fun37, in today series it time use representing still specifically we way data, statistical the of representations graphical to contributing PLAYFAIRWilliam1801TheCommercialandPoliticalAtlas [ (1801) Playfair book Source: Image his of edition first The today. it use and know Scandi- we and as England series between time exports of and imports [ the Playfair showing countries, graph navian line series Time 1.2: Figure Fun37 hoyo Steadiness of Theory h omriladPltclAtlas Political and Commercial The a ulse n18.Ti rp stknfo h hr dto ulse in published edition third the from taken is graph This 1786. in published was .23 n ste“apaetIvno fSaitclGah”[ Graphs” Statistical of Inventor “[a]pparent the as and 273] p. , Pla01 Pla01 .Pafi’ iegah r osdrdtefis representation first the considered are graphs line Playfair’s ]. lt 12]. plate , ,i hc abr nrdcdgahclrpeettosto representations graphical introduced Lambert which in ), mg eree rmarchive.org: from retrieved Image ls iie nSp.2,2020) 24, Sept. on visited (last n18,atm hn“th term “[t]he when time a 1786, in Til75 ,dsussi ealLmetswork Lambert’s detail in discusses ], hoi e Zuverlässigkeit der Theorie ..Dt nlssadVisualization and Analysis Data 1.2. https://archive.org/details/ h omriland Commercial The Fun37 Fun37 statistics .280]. p. , .280]. p. , (roughly 7 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 8 1. Introduction n odtrieispro”[ period” its determine to and aai rp,bta hw nFgr . wsacsoe oitroeasot curve smooth a interpose to accustomed “was 1.3 Figure in shown as but graph, a in data o xlrtr aaaayi n lvln o iulzn aafrdt analysis: it data postulated Tukey for how data and visualizing analyses for data Cleveland visual and analysis in today data doing exploratory for to used are we as computation, [ differentiation” [ discovery observations” “[the] the amongst for them used also but hand, [ at trends” data economic of the analysis visualized and only not they because urn:nbn:de:bvb:12-bsb10058497-5. VII, Plate 39, Figure 407, soil. p. of 150, Phys.sp. depths [ different (1779) in Lambert temperature of Source: Image variation [ periodic Lambert of , analysis line graphical as a represented variation Periodic 1.3: Figure cetfi esnn,o aeu xmnto ftegons fmthbtenter and theory between match of goodness the of examination [ careful data” a of reasoning, scientific I a nerylso htw analtb ltigtepit.I a later a was It points. the plotting by lot a gain we that lesson early an was “It Til75 .24.Teefidnssgetta ewsaraycmiigvsaiainand visualization combining already was he that suggest findings These 204]. p. , Til75 .21.“abr a atclryaaeo h eest,a tpin step a as necessity, the of aware particularly was “Lambert 201]. p. , Til75 Lam79 Til75 .21 n nohrgah ue h ehiu fgraphical of technique the “used graphs other in and 201] p. , Fun37 .47,poie yByrsh tasilohkMnhn 4 München, Staatsbibliothek Bayerische by provided 407], p. , .21 Lmet.Lmetntol ipae his displayed only not Lambert (Lambert). 201] p. , 8](lyar n o“eetaproi variation periodic a “detect to and (Playfair) 280] , Lam79 .Ti rp sshowing is graph This ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. sdsusderir ue’ nrdcino xlrtr aaaayi swdl osdrdas considered widely is analysis data exploratory of introduction Tukey’s earlier, discussed As iulzto,sinicvsaiain n iulaayis(hc ewl nrdc below) introduce will we (which analytics visual and visualization, scientific visualization, and visualization h onrtn o oenvsa aasine ti lowl cetdi h iulanalytics visual the in accepted well also is It science. data visual modern for cornerstone the oi ae[ Ware Colin ednyo obnn l he fteefilsudrteterm rumored the a is under There fields these other. each of with three liaising all to combining closer moved of fields tendency the and off, broke and crumbled [ concepts” or data of field data data, The abstract abstract latter mapping. the spatial for obvious data; no physical visualization on with based information, usually abstract is nonphysical, which is data, it scientific is it first, the For [ cognition” amplify definition the to visualization, information actually of general field in the visualization in of researchers what great specify some to to first According important means. is it sections, following the For general: in science data visual book Tukey’s and from analytics statements visual important in some more relevant quote in very to analytics still want visual are first and that we briefly section, next visualization the [ of Tukey in field by detail the stated introducing as analysis, with data proceed exploratory we Before to analysis data confirmatory from moving [ cf. community, research visualization hc nomto secddo iuldsly.Ftigmteaia ucin to functions mathematical Fitting displays. visual on encoded is information which h t n eiul,otnlae motn set fdt nicvrd”[ undiscovered.” data of graphing aspects without 1] and important p. leaves them often fitting residuals, without and data, raw fits graphing the Just too. in needed process is a data implies visualization because course, of data—graphing needed, statistical are of Graphs structure fitting. the and visualizing to components not two often are “There is points the Plotting points. learn not the do 665] middle often p. you [Tuk77, or enough.” that smooth perhaps, be, you should until it enough as hard as emphasized not lesson, Smaiscnb eyueu,btte r o h eal. Tk7 .27] p. [Tuk77, details.” the not are they but useful, very be can “Summaries 3] serve p. [Tuk77, can step.” else first nothing the but stone—as story, foundation whole the the as be never can analysis data “Exploratory vi] it p. when [Tuk77, see.” is to picture a of value greatest “The ste endb ade l ntesm a svsaiain nychanging only visualization, as way same the in al. et Card by defined then is War00 [ CMS99 s“th s fcmue-upre,itrcie iulrpeettoso data of representations visual interactive, computer-supported, of use “[t]he is information War00 ,woudrto h emvsaiaina agahclrpeetto of representation graphical “a as visualization term the understood who ], .7.Fradfiiino iulzto,Ki ta.[ al. et Keim visualization, of definition a For 7]. p. , CMS99 .Oe h er h sap eaainaogtefilso information of fields the among separation (sharp) the years the Over ]. KKEM10 iulzto,weetedffrnele ntedt hti visualized. is that data the in lies difference the where visualization, .6.Te hndsigihbtentefil of field the between distinguish then They 6]. p. , .3,ta h rtse oadvsa nltc eerhwas research analytics visual toward step first the that 3], p. , forces st oiewa envrexpected never we what notice to us visualization ..Dt nlssadVisualization and Analysis Data 1.2. o h atseveral last the For . KMS + Cle93 information 08 scientific ee to refer ] Tuk77 data , to ]. 9 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 10 1. Introduction . iulAnalytics Visual 1.3 ihteproi tutr ftm.I h edo nomto iulzto,acorresponding a visualization, information of field the In time. of structure periodic the with hmsadCo rtdfie h emvsa nltc n20 s“h cec fanalytical of science “the as 2004 in analytics visual term the defined first Cook and Thomas (statistical) of visualization the and analysis data of problem the that shown have we far, Thus and data the seeing of idea Tukey’s to contributes visualization information of area research The dealing and analysis series time in challenges on focus our expressed already have we far, Thus er a es ic 07,temjrIE I ofrnehsbe detsda peirforum “premier as advertised been has conference VIS IEEE major the 2017), since least (at years rvdssrcueadspot nslcigaporaevsaiaintcnqe o different for techniques visualization that appropriate categorization selecting a in introduce supports importantly and most structure and provides data this visualizing for methods diverse ehooia dacmn ncmuain h eadt oewt h asv mutand the amount of massive Because the with applications. cope and to research demand in the topic relevant computation, a in advancement being technological of history long a have data & otiue ealdrve falapcso iulaayisfo h uoenperspective European the from analytics [KKEM10]. visual consortium industrial of a aspects and to all academic of European review from detailed a Commission-funded experts contributed which European R&D two-year in initiative, a VisMaster for the cornerstone by called the analytics project visual also basically of challenges was and This scope further. the about [ paper and al. formed A et community field. Keim research research the European the in a years, research few and to evolved, a analytics contributed Within visual threats. of terrorist field of application focused identification and and the influenced, and motivated, security heavily homeland was on agenda development and research the [ describing interfaces” visual interactive by facilitated reasoning data to research contribute the scientists it, domains. field, of many research out in this sense problems within analysis make established; to was analytics demand visual the of and field data, collected the of complexity increased field the never introduce fashion and a foundations these in on made analytics. extend were we visual analysis section, of following data the exploratory dialogue In a in before. building advances possible for users, techniques interaction and and data interfaces between graphical novel With results. the cyclic and linear specifically time, of structure the into categorization time. the is dissertation [ this book in a in published and extended later were contributions These purposes. they terminology their In data. as of well type as data this data of analyzing series type in time this challenges and call specific time the visualize with to deal how to in how challenges with copes research of strain change. likely will this future near some in visualization, of ieoine aawsstb inre l [ al. et Aigner by set was data time-oriented o dacsi visualization”. in advances for opeesv iwo h oi fvsaiigtm-retddt.Ms eeatfrtecontent the for relevant Most data. time-oriented visualizing of topic the of view comprehensive a 1 http://ieeevis.org/ KMS + 08 a nipratiiilcnrbto oseiytefil fvsa analytics visual of field the specify to contribution initial important an was ] ieoine data time-oriented ls iie nOt 1 2020) 11, Oct. on visited (last 1 lhuhi tl a eaaecneec rcsfrtetrefields three the for tracks conference separate has still it Although [ AMM AMM + 07 + 07 , AMST11 hnitouigasseai iwon view systematic a introducing when ] TC05 .4.Ti rtbo enn and defining book first This 4]. p. , .Tefudto fvisualizing of foundation The ]. AMST11 oprovide to ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. egtt h eal necessary. details the to get we h liaega fvsa nltc st gi nomto,isgt,adassmnsfrom assessments and insights, information, “gain to is analytics visual of goal ultimate The iulzto,hmnfcos n aaaayi.Te lutaeteivle oisadaesby areas and topics combines involved that the field illustrate a They is analysis. scope data the and Rather, factors, visualization. human only visualization, than more much now is visualization, aaae,saitc,admteais hra opeetn hs tegh ihhuman with strengths these The complementing learning, whereas machine cognition. mining, mathematics, and data and are perception part statistics, like analysis databases, automatic humans, computational of the in strength elements the core with power, computational the pcfi ento hnToa n oka vsa nltc obnsatmtdanalysis automated combines analytics “visual more as a with Cook analytics and visual book Thomas defined al. the than et definition in Keim specific research, published analytics visual were analytics in initiative visual experience with practical VisMaster problems the Solving of age: results information the into 2010, grew analytics In visual why [KMS reasons research the of of field one flourishing is a conclude and relate, perceive, to capabilities opportunities. and challenges of scope [ data” broad complex a analytics covers visual that and indicates field This interdisciplinary 1.4). Figure highly (cf. a analytics visual is of scope detailed the defining [ until al. analysis et the what Keim down show drill are, to data filter these and in zoom), aspects (e.g., interesting interactions the allow what important, analyze be first patterns. to to interesting seems massive losing need with without we dealing visualization case, overview are create this we to when In possible because, not necessary often is is adaptation it datasets, this that argued later further, 11] analyse p. zoom/filter, important, the [ show demand” first, on “Analyze details of: mantra analytics visual the into [ demand” on details zoom/filter, first, [ al. et Keim [KMS (2008) analytics. al. et visual Keim of [ scope Source: al. Image the et in Keim fields analytics, application visual and of research scope The 1.4: Figure KMS KMS ALA + + 08 08 KKEM10 dpe n xeddtefmu nomto-ekn ata“overview mantra information-seeking famous the extended and adapted ] + ru htvsa nltc,b vligfo nomto n scientific and information from evolving by analytics, visual that argue ] 18 .25;t os novscmiigtesrnt fmcie,or machines, of strength the combining involves so do to 275]; p. , .1] rtdfie n[ in defined first 11], p. , + + 08]. 8 .7] oreyo pigr©2008. © Springer of courtesy 79], p. 08, Shn96 .37 o iulyepoigdt otasomit transform to data exploring visually for 337] p. , KMS KMS [ KKEM10 + 08 + 08 .8] eme l [ al. et Keim 82]. p. , .Ilsrto fteinvolved the of ]. .I hsbo,bsdon based book, this In ]. atrn the Mastering ..Vsa Analytics Visual 1.3. KKEM10 , 11 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 12 1. Introduction hmni-h-op hnigt hmni-h-op”s httehmnwr rcse edto need processes work human the that so “human-is-the-loop,” a to thinking “human-in-the-loop” iMse rjc.Esnily hspoesdsrbsacmiaino uoai n visual and automatic of combination a describes process this Essentially, project. VisMaster a rvosypbihdb eme l [ al. et Keim by published previously was ankoldeadisgt rmtedt.Etnigo hspoesdfiiinfrvisual for definition process this on Extending to fashion data. iterative the an in from interaction insights human and involving knowledge coupling gain tight a with methods analysis h nweg eeaini op st lo eea op nprle,bcueaayt’cognitive analysts’ modelling because of parallel, idea in basic loops several The allow 1.6). to Figure is (see loops system in analytics generation knowledge visual the a of reasoning users overall human the by comprise process which loop, knowledge-generation and verification, exploration, an rcse r fe ahrcatcadsotnos n hycnwr ndffrn hypotheses, different on work can they and spontaneous, and chaotic rather often are processes [ al. et Endert by claims the recent of the process knowledge-generation emphasized al. the addition et detail Sacha in more and, user. in parts human illustrated human model, and process computer earlier the the between to distinction the argued They activities. [ al. et Sacha analytics, 1.5) Figure (see process this of predecessor A model. reference analytics visual a [ proposing al. et Keim book, their In enable to goal the serve which techniques, and tools of to creation people the [ by datasets” analytics complex visual and large of very goal decision of and basis reasoning the understanding, on effective making an for visualizations interactive with techniques [KAF (2008) al. et process. Keim analytics visual source: Image a [ in al. transitions et and Keim stages process, different analytics visual The 1.5: Figure edrcl nertdadrcgie nteaayis ah ta.[ al. et Sacha analytics. the in recognized and integrated directly be “1. .Cmuiaeteeassmn[]eetvl o cin”[KM0 .7] p. [KKEM10, action.” for effectively assessment[s] these Communicate assessments. understandable 4. and defensible, timely, unexpected. Provide the discover 3. and expected the Detect 2. n fe oflcigdata. ambiguous, conflicting dynamic, often massive, and from insight derive and information Synthesise SSS + KKEM10 14 xaddtekoldegnrto rcs fhmncognition human of process knowledge-generation the expanded ] + endahg-ee rcs o civn hs ol by goals these achieving for process high-level a defined ] 8 .16,cuts fSrne 2008. © Springer of courtesy 156], p. 08, KAF + 08 , KMS KAF KKEM10 + + 08 08 .I hw nasrc vriwof overview abstract an shows It ]. n20,a h einn fthe of beginning the at 2008, in ] SSS .7.Te lodfie the defined also They 7]. p. , EHR + 14 itnuse among distinguished ] + 14 og eodthe beyond go to ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. eso hspoesi hpe epcal iue24,weew s hspoest eciethe describe to process this use we where 2.4), Figure (especially 2 Chapter in process this show We nte diint h aiyo iulaayispoesdfiiin oue ntehptei and hypothesis the on focuses definitions process analytics visual of family the to addition Another onWly&Sn t.©2018. © Ltd. Sons & Wiley John oe eeaindrn iulaayispoesseiclyfrtm-retddt [ data time-oriented for specifically process analytics visual a during generation model [ (2018) al. et Andrienko Source: Image a or Andrienko results, building, expected model as [ analytics visual al. viewing for et workflow analytics visual A 1.7: Figure [SSS (2014) al. knowledge- et the Sacha detail. illustrates more Source: model Image in process human the this of definition, process process generation [ analytics al. visual et earlier Sacha the analytics, to visual for model Knowledge-generation 1.6: Figure these loop. into verification in trust the gained combined during have and hypotheses analysts collected the and are after insights which knowledge, new formed, to are loop insights knowledge-generation guidance process the provides with this verification formed During the are whereas exploration. ones loop, the new for verification or and interesting confirmed exploration something are of find hypotheses combination to and the exploration findings or data These visualizations allowing new hypotheses. and generate build data to and the applied analyze is to loop order exploration in The models parallel. in findings or tasks, ALA + 18 .Ti iulaayiswrflwilsrtshwtefia rdc—.. the product—i.e., final the how illustrates workflow analytics visual This ]. model ssm ic fraiyo iulaayi screated. is analysis visual a reality—of of piece some as , + ALA 4 .10] oreyo EE©2014. © IEEE of courtesy 1604], p. 14, + 18 .26,cuts fteErgahc soito and Association Eurographics the of courtesy 276], p. , SSS + 14 .I addition In ]. ..Vsa Analytics Visual 1.3. LAB + 11 ]. 13 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 14 1. Introduction . Methodology 1.4 c.Scin231i hpe ) hsvsa nltc rcs ento yLmashet Lammarsch by definition process analytics visual This 2). Chapter in 2.3.1 Section (cf. iulaayispoesfrtm eismdlslcin pligteBxJnismethodology Box-Jenkins the applying selection, model series time for process analytics visual eea iulaayispoesb eme l [ al. et Keim by process analytics visual general xetdrsls favsa nlss oprdt h rvospoesdfiiin,temain the definitions, process previous the to Compared analysis. visual a of results) expected n ouin.Tewr nti israini ae nteemtoooia foundations. methodological these on based is dissertation techniques, this methods, to in analytics methodologies work visual of The set proposing solutions. accepted and and reasonably research a analytics is visual there doing research, for follow analytics visual of field the In section. following the will in we dissertation research, this analytics in applied visual methodology in and research structure definitions additional guiding process the the the discuss consider as frameworks to proposed addition, In the follow dissertation. this in methodology research [ al. et Andrienko by those [ like al. definitions, process analytics visual Using [ methods, more analytics al. give visual et To of Andrienko design tools, the supported. and for be developers procedures, and to kind researchers needs analytics what Using visual of externalization for understanding and guidance media. is better evaluation, different promotes it development, via analytics whether model visual distributed subject, in of and the building offloaded of model or model of knowledge analyst built concept the the this the that of on is mind based paper the is this in analysis in completely of declaration main process analytics important and the visual power An because during The process idea. gained this this mind. for around in allow built fully they exactly that it complex is are keep a methods to in analytics analyst because, visual an model of for the benefit of difficult important parts occasionally An offload is and details. it externalize or model, to relationships, is aspects, models all complex represent for cannot step and data the of generalization data the “ in that reflected is work a this or in created, be to needs that satisfying product/result product knowledge a [ but goal” insight is analysis trustworthy [ analytics “any the goal” be visual stated not previously that should argued certain result They a the achieving Consequently, analyst. to analytics directed visual activity a “purposeful of a activities of process the is to refer [ activities at. analytics et visual Andrienko of to product according final terms the techniques Usually, the of using types analytics. described and visual performed in activities analysts the view, by on this used focus with instead proposed processes frameworks other conceptual these and because definitions existing complement to is idea n rpsn iulaayismtosadsltosi led n att h uzeo h used the of puzzle the to part one already is solutions and methods analytics visual proposing and l [ al. rcs ento yAdinoe l [ al. et Andrienko by definition process SSS LAB + 14 + 11 ,Lmashe l [ al. et Lammarsch ], a ecniee nitreit tp oigfo h rvosydiscussed previously the from moving step, intermediate an considered be can ] ALA [ ” h rmr neeto h nls snttedt e ebttereality the but se per data the not is analyst the of interest primary the ALA + 18 information p 7–7] hydfie h vrl olo nlss hc sthe is which analysis, of goal overall the defined They 275–276]. pp. , + 18 ALA .23.Esnily nterpoesdfiiintemdli a is model the definition process their in Essentially, 283]. p. , LAB + + 18 , 11 knowledge ,d o lal en h xetdpoutbtinstead but product expected the define clearly not do ], ALA ,adKi ta.[ al. et Keim and ], ALA model + 18 + 18 rvddasmaie cinls operform. to list action summarized a provided ] or , KKEM10 htfcsso h nlpout(.. the (i.e., product final the on focuses that ] ssm ic fraiy nte emphasis Another reality. of piece some as , insight KKEM10 hc r ahrgnrltrsand, terms general rather are which , oamr eetvsa analytics visual recent more a to ] sfaeokfrdefining for framework as ] ALA ALA + 18 + 18 ,Scaet Sacha ], .275]. p. , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hc sueu o h srt anisgt,adstse h eurdtssapidb h users the by applied tasks required the satisfies and insights, gain to user the for useful is which analytics visual interactive for criteria quality as expressiveness–effectiveness–appropriateness with eetvns) n c iulztosta epsletegvnts n ec h endgoal defined the reach and task given the solve help that visualizations (c) and (effectiveness); (appropriateness). aigtepolm o hsrao,Mkc n inr[ Aigner and Miksch reason, this For problem. the facing ehd,Mkc n inr[ Aigner and Miksch methods, e ftxnme o iulzto ngnrl hs r h famous the are These general. in visualization for taxonomies of set h aa(xrsiees;()vsaiain htaeitiie eonzbe n interpretable and recognizable, intuitive, are that visualizations (b) (expressiveness); data the akAbstraction. Task nomto visualization information [ Aigner most and The Miksch by literature. framework related triangle and design analytics the visual within mentioned the number ones a in appropriately, notable deriving presented supported are and be typologies data to and tasks the given taxonomies the support the of describe to of and order abstract analysis To in interactive knowledge. step and appropriate important insights an an generating is This in user methods. analytics intended visual of design the during authors the [ 1.8, expressiveness Figure like methods, in analytics triangle visual [ for the this effectiveness criteria In answering quality and major Specifying the applicable. specify what are hand. also and methods at representations interaction data visual specify the and and appropriate to analytical method, of clarify apply the indications to to use gives triangle about will abstraction design who are data–users–tasks user the users the use the describe to that working, is tasks is idea the user The the analytics which visual with interactive data. of data time-oriented design the on the focus support a to framework with high-level methods, a as 1.8 Figure in shown problem, the of data complex and large the handle can solution proposed the whether validate to is Framework. Data–Users–Tasks 2014. © Elsevier of courtesy 286], p. [MA14, (2014) Aigner and Miksch Source: Image together methods. paradigm data–users–tasks a using requirements specifying [ in Aigner designers and and Miksch researchers framework, triangle Design 1.8: Figure Mac86 pca ou ed ob u ntedfiiinadasrcino h tasks the of abstraction and definition the on put be to needs focus Special ,adaporaees[ appropriateness and ], yBnSniemn[ Shneiderman Ben by MA14 ydfiiin eta ati ahvsa nltc solution analytics visual each in part central a definition, By s o a nyvsaiigeatywa scnandin contained is what exactly visualizing only (a) for ask ] vW06 Shn96 .I re ogtueu iulanalytics visual useful get to order In ]. swl stecnrbtosb Brehmer by contributions the as well as ] MA14 MA14 nrdcdtedsg triangle design the introduced ] akb aatp aooyfor taxonomy type data by task .I uprsvsa analytics visual supports It ]. MA14 ..Methodology 1.4. Mac86 r a are ] ], 15 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 16 1. Introduction h elwae niae h ein hr einsuisaeapsil ehdlgclchoice. methodological possible a are studies design where regions the indicates area yellow The effectively more studies design conducting for approach methodological holistic a proposed They aiaini 09[ 2009 in validation aiiyaeas aeoie nofu ees rn rbe,asrcin encoding/interaction, abstraction, problem, wrong levels: four into categorized also are validity rvd ugsin o itnteauto ehdlge o ahlvl h ue ee sto is level outer The and levels level. nested each four for into methodologies it evaluation splitting by distinct for process suggestions design provide the in designers visualization guide to r okn ihra sr n r osleterra-ol rbes pcfial,te en a define they Specifically, problems. real-world researchers their the solve where to as: research, study try problem-driven design and of users form real a with as working 1.10. study are design Figure a in defined shown be al. as can to et chart, preconditions Sedlmair tasks location These information the and work. of clarity to clarity task which their with the in information is summarized the methodologies best of study location the design and of important support One suitability framework. the design nine-stage for proposed precondition their applying for guidance [ provided 2012 and in methodology study design a into practice and Meyer, ongoing Sedlmair, this and condensed popular, Munzner increasingly become research—has problem-driven of form Methodology. Study Design to 921]. threats p. [Mun09, possible algorithm The and choices. design evaluation different recommended the and level validate each appropriately efficiently. at techniques to the validity these to methodologies and execute threats designed, to data possible need are and identified that techniques also operations algorithms interaction Munzner into of and data definition encoding the and visual considers tasks the level the level, last abstract third of to the vocabulary is In the goal with level’s types. data second and The tasks the domain. defining the like space, problem domain the characterize and design visualization for model nested her introduced analytics Munzner visual reason, evaluating and this designing For when approach methods. user-centered a requires This process. Model. Nested 2009. © IEEE of courtesy level. 922], each p. in [Mun09, nested validity (2009) four to Munzner into threats Source: process possible Image for design methodologies the evaluation splitting suggests by and [ layers designers Munzner validation, visualization and for design guidance visualization provides for model Nested 1.9: Figure of process. set reasoning large analytical the an from information during applied describe tasks and possible select, structure, [ researchers Andrienko analytics and visual Andrienko help by typology task the is data [ Munzner and nltc ehiusi h ua atrad ydfiiin h ih nerto fteue nothe into user the of integration tight the definition, by and, factor human the is techniques analytics BM13 nadto otevsa nltc rcs oes nipratapc nvisual in aspect important an models, process analytics visual the to addition In Mun09 n cuze l [ al. et Schulz and ] .Tegnrlie fti etdmdl ssoni iue19 is 1.9, Figure in shown as model, nested this of idea general The ]. uigteyaso iulzto eerh einstudies—a design research, visualization of years the During SNHS13 .Mr pcfial osdrn time-oriented considering specifically More ]. AA06 .Teets taxonomies/typologies task These ]. Mun09 .Ti model This ]. SMM12 ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. rdadbu)suitable. blue) and (red iulaayis seilyi h einadeauto rcs fvsa nltc ouin.Lam solutions. analytics visual of process evaluation and design the in especially analytics, visual mg ore elare l 21)[M1,p 43,cuts fIE 2012. © IEEE of courtesy 2433], p. [SMM12, (2012) al. et Sedlmair Source: Image eas h ua sri uhacnrlfiuei iulzto n iulaayis ayaspects many analytics, visual and visualization in figure central a such is user human the Because ta.[ and al. visualization et to transferred are and relevant are field interaction human–computer the from Methods. Evaluation avoid stages. to relevant strategies the provided also in They pitfalls these framework. nine-stage the in especially methodology, study about aspects [ critical discussed paper some also 2008 contributed authors her already The in Munzner studies stages. community. design the visualization of the each projects in in study pitfalls authors design possible earlier other on and of reflections authors Each on based the understanding. guidance by forming practical and with ideas described refining is stages and the adapting stages, for overlapping jumps with for backward process analysis includes linear the highly but and a validation, follows precondition framework inward-facing the The for phases: phase validation. three outward-facing core nine-stage of the The consists validation, framework personal process. The for specific phase 1.11. nine-stage a Figure their with in proposed studies illustrated al. design is et framework conduct Sedlmair researches methodology, help study to design framework a defining to addition In et Sedlmair chart, location information and [ clarity al. methodology—task study Design 1.10: Figure l [ al. SMM12 SMM12 iulzto eingieie. SM2 .2432] p. [SMM12, guidelines.” design visualization rbe,vldt h ein n eetaotlsoslandi re orefine to order in learned lessons about this reflect solving and supports design, that system the visualization validate a problem, problem design real-world experts, specific domain a by analyze faced researchers visualization which in project “a LBI + 12 .Teclrdaesidct hr einsuymtoooyi ylo)o snot is or (yellow) is methodology study design a where indicate areas colored The ]. ute lbrtdo hs iflsadsrcue hmacrigt h design the to according them structured and pitfalls these on elaborated further ] otiue cnrobsdlo tteeauto ehiuso information of techniques evaluation the at look scenario-based a contributed ] ovldt iulaayissltos hyne ob vlae adequately. evaluated be to need they solutions, analytics visual validate To Mun08 ,i hc h icse ao ifls elaret Sedlmair pitfalls. major discussed she which in ], ..Methodology 1.4. 17 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 18 1. Introduction nte diint h ee udn cnro yLme l [ al. et Lam visualization. by of scenarios guiding field seven the the in to evaluations addition most Another in work reference a considered and used widely oehr h aesb a ta.[ al. et Lam by papers the Together, iulztosadietfida mhsso h vlaino loihi efrac and performance algorithmic of evaluation the on emphasis an identified and visualizations iulzto,pooigsvngiigseaisfrteeauto.Te eie hs scenarios these derived They evaluation. the for scenarios guiding seven proposing visualization, fe vra n h rcs shgl trtv,jmigbcwrsi eycmo orefine to common very is backwards jumping iterative, highly is process the and overlap often ffcieesadaporaeeso rpsdvsa nltc approach. analytics visual proposed a of appropriateness and effectiveness although solutions, visualization and years. of analysis the evaluation on data over the improvement in eases reports an rigor visualization was as of new there lack well a a as how identified still performance as They well user reasoning. as and evaluation practices experience the work in user and trends of environmental some evaluation evaluating drafted the also in They like practices practices, literature. the the on in inspections reported result They qualitative literature their field. from visualization observations their entire contributing the by gap covering visual this search and filled visualization al. scientific reflect et include not Isenberg also do analytics. therefore, should and, which community, visualization visualization information entire of the literature the [ on al. focused et primarily Isenberg are by visualizations evaluating of practice the is work This scenario. to each methods of possible application of practical overview the an of illustrate as sets to well possible examples as provided evaluation give also They an and such apply scenario. described in of they answered type scenarios, be this these to in questions evaluation of of each evaluation outputs For appropriate and goals selecting visualization. the information in given guidance a provided for and techniques reviews literature extensive from 2012. © IEEE of courtesy 2434], p. [SMM12, (2012) al. et Sedlmair Source: Image understanding. form and ideas [ studies al. design et conducting Sedlmair for framework research, nine-stage in methodology—a study Design 1.11: Figure n epcryotteeauto nodrt aiaetevsaiaindsg n td the study and design visualization phase the design validate the to as order soon in as evaluation evaluation the the out carry to analytics how visual help on and and visualizations guidance evaluating provide for They applied and solutions. used methods the grounding for SMM12 LBI .Tefaeoki osdrdlna,btbcuestages because but linear, considered is framework The ]. + 12 n snege l [ al. et Isenberg and ] IIC LBI + 13 + .Tesvngiigscenarios guiding seven The ]. 12 IIC stesseai eiwon review systematic the is ] + 13 rvd foundation a provide ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . ieSre Analysis Series Time 1.5 ihahsoyrahn akt uei 97[ 1927 in Yule to back reaching history a with hr r aygetadiprattxboso iesre nlss hc ul h ai of basis the build which analysis, series time on textbooks important and great many are There [ Tsay aibe,promn esnlajsmn,ipoigrgeso nlss n pligfor applying and analysis, regression improving adjustment, seasonal performing variables, the of in put when sense make only actions low-level these because challenging, very hsscinadaeue safudto oms otn nti israin h anones main The dissertation. this in content most to foundation a as used are and section this 2). Chapter (cf. later process We support selection to model practice. techniques iterative and analytics of theory visual kind use both this and of impact methodology terms Box-Jenkins enormous in so-called an forecasting this had and employ it analysis and series approach, time systematic modern this on delivering by series analysis time series for time cycle apply iterative [ three-stage verification” versatile and coherent, estimation, a identification, “delivered and time that at analysis [ work reference a remains Control [ and (cf. analysis series time of in history part the central In a get to breaks major structural a of having detection as diagnostics. and computing model analysis and outlier methods for in allowing advances and the impact identified he addition, between In relationship forecasting. dynamic the analyzing process, the analysis of series structures time dynamic of the goals analyzing the as summarized Tsay longer. even back forecasting—reaches namely, in background section. following required the some in give analysis we methodology, series research time the on a discussion through our utility Following the showed we two 5, these scenarios. Chapter For usage in applied 1.3. detection and Section used walk-through outlier in we illustrative for introduced 3, plot cycle ones Chapter multivariate the in the like the capabilities model, and prediction applying definition the by process integrating user analytics extension visual the the a as and well environment data as selection 2 the model Chapter interactive described in the as For methodology well study framework. data–user–task design as and the discussed used abstraction previously used we we task 5, them, and a of 4, performed all 3, For and 2, here. Chapters presented in methodologies discussed the solutions applied analytics visualizations. visual proposed of our design bigger all the In the during keep tasks low-level to the researchers defining allows when This mind tasks. in lower-level framework proposed picture to The goals analysis process. analysis main whole the the relates and goals analysis the practice of in context higher-level are task specification abstract task existing for approach the bottom-up that applied argued usually They the tasks. and and classifications goals [ bridge Munzner better and Tory, researchers Lam, analytics by contribution the is Tasks.papers to Goals from Bridging ratruhadms infiatmlsoewstebook the was milestone significant most and breakthrough Tsa00 nrdcdteterm the introduced ] ulse n17 yBxadJnis[ Jenkins and Box by 1970 in published BJR08 iesre analysis series time .BxadJnisitgae nweg bu ieseries time about knowledge integrated Jenkins and Box ]. oercn diint h rvosymnindstof set mentioned previously the to addition recent more A Tsa00 DGH06 Yul27 , BJ70 DGH06 ;oeo h ol ftm eisanalysis— series time of goals the of one ]; stesaitclaayi ftm eisdata, series time of analysis statistical the as p 4–4] talwdpattoesto practitioners allowed It 446–447]. pp. , LTM18 ,wihee oa nisfut edition fourth its in today even which ], ) ti eeal cetdta the that accepted generally is it ]), ieSre nlss Forecasting Analysis: Series Time ohl iulzto n visual and visualization help to ] ..Tm eisAnalysis Series Time 1.5. 19 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 20 1. Introduction hc r rn,proi,adirglrcmoet [ components irregular and periodic, trend, are which Athanasopoulos: ewn oseiyaddfiewa ema ytm eisdt n htteterms the what and data series time by mean we what define and specify to want we ausadae hrfr,uiait rmliait iesre,rsetvl.Acrigt the to According respectively. series, time multivariate or univariate therefore, are, and values eycrfly ntefloig euetedfiiin fteetrsfo yda and Hyndman from terms these of definitions the use we following, the In carefully. very eidccmoet—aey h esnladcci opnns codn oHyndman to According components. cyclic and seasonal the components—namely, periodic ntevs ubro ok bu iesre nlss u eeal iesre scniee to considered is series time a generally but analysis, series time about books of number vast the in eaaini oeadtedtriaino hc praht s snwatal ae nthe on based actually now is [ use Tsay to approach to which According of determination the distributions. frequency and the power gone whereas or is models, analysis separation parametric spectral and function on autocorrelation focuses of domain use the on [ rely approaches: domain to of schools frequency is two the into and divided domain originally time was analysis the series periodic time varying of with time field sequence the seasonal Historically, a a is summary, series time In cyclic a year. whereas the length lengths. during periodic level level fixed daily weekly/monthly a a a has on on series seasonality as and such week levels, a granularity more during multiple defined on be 1 The nested should least occur years, at may centuries. 2 components of or of duration component decades, a cyclic years, a at weeks, for generally component days, fluctuation cyclic as the and such of seasonal level, duration possible claimed a granularity that any noted on be occur should could it definition, this to addition In [ repeats. Athanasopoulos pattern and periodic a before observations of terms number the the using of By frequency the is component multiple or value single a be can measurements These time. over measured or collected data be seasonal [ Jenkins and Box of work landmark [ are tspatterns.html ieaueo ttsia iesre nlss iesre scmoe f3(r4 components, 4) (or 3 of composed is series time a analysis, series time statistical on literature 2 est ftebo y[ by book the of Website BJR08 hni ih ofo nicesn rn oadcesn rn.[...] trend. decreasing a to trend increasing an from go might it when er. [HA18] years.” osnthv ob ier oeie ewl ee oateda cagn direction’, ‘changing as trend a to refer will we Sometimes linear. be to have not does Seasonal eae ote‘uiescce.Tedrto fteeflcutosi sal tlat2 least at usually is fluctuations these of often duration are The and cycle’. conditions, ‘business economic the to to due related usually are fluctuations a These of frequency. always is Seasonality week. the Cyclic of day the [...] or frequency. year known the and fixed of time the as such factors “ Trend and , , BD91 ylctm series time cyclic yl cuswe h aaehbtrssadflsta r o fafixed a of not are that falls and rises exhibit data the when occurs cycle A ls iie nSp.1,2020) 16, Sept. on visited (last rn xsswe hr saln-emices rdces ntedt.It data. the in decrease or increase long-term a is there when exists trend A , esnlptenocr hnatm eisi ffce yseasonal by affected is series time a when occurs pattern seasonal A CC08 periodic 2 HA18 HA18 , SS11 ,i sipratt itnus h terms the distinguish to important is it ], , .Qoefo eto .:Tm eispatterns, series Time 2.3: Section from Quote ]. eidctm series time periodic en h xc ento n ecito fatm eisvaries series time a of description and definition exact The mean. , Ham94 . – ie h esnlcmoet nadto,periodic addition, In component. seasonal the times 5–2 BJ70 , BK11 .Bfr egv oedtiso oe selection, model on details some give we Before ]. Tsa00 , BD02 .Temi dao h iedmi approach domain time the of idea main The ]. , eidccomponent periodic Cle93 , Tsa10 , Cle94 .Teeaemsl ae nthe on based mostly are These ]. , BD10 trend https://otexts.com/fpp2/ t. esbueboth subsume we etc., , , , HA18 seasonal .Teperiodic The ]. Tsa00 and , periodic cyclic ,the ], , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. AIA models (ARIMA) hswsahee yitgaigotiraayi,apyn h eeto fsrcua ras the breaks, structural of detection the applying analysis, outlier integrating by achieved was This a for model adequate an finding of process the in challenges the on elaborate also we There au nrlto otesaoa tutr ntetm series. time the in structure seasonal time the seasonal to on relation focus in We value imputation. of task challenging the support better to analytics visual ekn a ipeieaiepoesta nlddmdlietfiain oe siain and estimation, model identification, model included that process iterative simple a was Jenkins aa nterfre ae [ paper referred the In data. eidct ntm eisdt a lasbe fmjritrs ntm eisaayi.As analysis. series time in interest major of been always [ Tsay has before, data mentioned series time in estimated Periodicity the judging better for allows which representation, plot cycle a using apply by and imputation data guided series statistically and values. visually arbitrary propose using we of 4, instead Chapter ones in missing reason, the likelihood this for maximum For values a estimated appropriate use more and in values fill arbitrary [ with outliers there values additive addition, with missing In estimation the parameters. fill model to the techniques estimating are when estimation case likelihood this maximum in the models for such apply to techniques similar [ proposed methods and estimation series parameter time data alternative spaced missing unequally on such [ based handling Jones work for by the contributions proposed from major comes The models ARIMA observations. in missing with cope to [ Tsay data, [ concern major of are issues quality 3. data Chapter Sad13 data, the in real-world of environment and results selection applications the model real-world use analytics In and visual model the the of apply part we integral criteria, an those as to predictions addition In methods models. identification adequate pattern of in advances important some and [ criteria, diagnostics. selection model model advanced of more usage to analysis period, series this time In “traditional” 1986–1988. from around computing transition developments in a important advances for was identified reason Tsay there a 1970, data. as in methods Jenkins series series and time Box time seasonal by and to impact major or model the (SARIMA, series to time ARIMA addition the seasonal In fit of also can class but we way the model, this are ARIMA In which mentioned prediction. previously models. series, and S-ARIMA) the time imputation, only periodic detection, not for apply outlier extension like and the [ use tasks, model we for “adequate” chapter, applied and that “useful” and In a used find be to then necessary can is 2. that it Chapter essentially of but 2.3.1 series, Section time in given later detail more in process this describe will We diagnostics. model to led models ARIMA these [ analysis of series success time the of and research forecasting, substantial for method series time apply simply, popularized and time proposed the as on models, focus [ parametric we Jenkins following, apply and and the Box analysis by In series analyst. time the of of aspect experience domain the and analysis the of objective ules steei loadmn o tl sn iesre nlssmtosmdl nsuch on methods/models analysis series time using still for demand a also is there As outliers. Jon85 Tsa00 , .I hpe eapyaduesc oe eeto rtrat upr h identification the support to criteria selection model such use and apply we 2 Chapter In ]. .Tegnrlie fteetcnqe st s pcaie ehd,lk amnfiltering, Kalman like methods, specialized use to is techniques these of idea general The ]. Das13 Tsa00 .W ou nti israino h suso isn bevtosdt and observations/data missing of issues the on dissertation this in focus We ]. h ytmtcapoc o n ekn rpsdalw rciinr o quite to, practitioners allows proposed Jenkins and Box approach systematic The . etoe h nrdcino amnfitr otm eisaayi norder in analysis series time to filters Kalman of introduction the mentioned ] BJ70 Jon80 Tsa00 .Teemdl r nw as known are models These ]. .Jnswsas ocre ihuigtetm eismdl for models series time the using with concerned also was Jones ]. Yul27 ae ue12 stesato ttsia nlsso ieseries time of analysis statistical of start the as 1927 Yule named ] ,Yl rpsdamto o netgtn eidcte,using periodicities, investigating for method a proposed Yule ], GMPn99 Tsa00 .Aohrapoc ol et pl muainto imputation apply to be would approach Another ]. .Tebscmdlslcinpoesb o and Box by process selection model basic The ]. uoersieitgae oigaverage moving integrated autoregressive ..Tm eisAnalysis Series Time 1.5. KHP BJ70 + 11 ] , 21 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 22 1. Introduction ieyue n fmjritrs nmn plctos hr r esnlflcutosi many in fluctuations seasonal are There applications. many in interest major of and used widely 4030B)”bsdo h nomto ht“al id fdsae r rdcdi l seasons all in produced are diseases of kinds “[a]ll that information the on based BC),” (460–370 h omnapoc odaigwt esnlt ntm eisaayi st pl esnlde- seasonal apply to is analysis series time in seasonality with dealing to approach common The opsto [ [S composition another” in than one in rather exacerbated and caused are some yet year, the and Barnett domain. health the to [ sciences Dobson natural over economics and analysis. from prominent series domains, more application time much the is length, of periodic beginning that fixed the periods with cyclic from means with considered which Dealing seasonality, already analysis. with was series Dealing length time in in fixed concern not central are a 295]. is p. and [Yul27, was moment” series the time at which of best problem seemed terminology a that our “attacking methods Using was the he used that and impact stated one, the He new by a challenged data. was was the method in [...] to his fluctuations method but and a cycle, disturbances proposed sunspot the He the of of data. length sunspot the periodic in the patterns determine periodic was interesting the 1849, with to [ concerned years back 11 dates approximately of which cycle [ dataset, periodic Zürich This in Wolf Professor data. by observation introduced sunspot well-known still the classical The Source: using Data Bögl the show Markus by charts Generated line components. stacked Source: The Image irregular and series. trend, time season, periodic data, a observed of decomposition Seasonal 1.12: Figure oto h cnmctm eisdt rsne ou ndffrn ei,lk nmlyetrates, unemployment like media, different in us to presented data series time economic the of Most BD10 DGH06 .vi ttdta saoaiyi ies a rtrcgie yHippocrates by recognised first was disease in “seasonality that stated vii] p. , 0.90 0.95 1.00 1.05 1.10 200 300 400 200 400 600 0.8 0.9 1.0 1.1 1.2 o esnlajsmn [ adjustment seasonal for ] remainder trend seasonal data AirPassengers periodic–cyclic–seasonal 9015 9415 981960 1958 1956 1954 1952 1950 Yul27 aae y[ by dataset Mor77 R , and CT82 Mor77 n vni 97Yl [ Yule 1927 in even and ] Time ggplot2 BJR08 ocmest o h esnlfluctuations. seasonal the for compensate to ] sdfie eoe eln ihperiodic with dealing before, defined as , .Tesnptatvt skont aea have to known is activity sunspot The ]. ,rtivdfo h R the from retrieved ], . Yul27 datasets + 8 .55]. p. 08, a centrally was ] package. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. oedtie icsino uhdsly n esnldcmoiini rsne ntelater the in presented is decomposition seasonal and displays such on discussion detailed more A hsgahalw o netgtn h esnlptena ela h eairi ahsubseries. each in behavior the as well as pattern seasonal the investigating for allows graph This eso C8] h eea dai osprt h he opnnso iesre notrend, into series time a of components three the separate to is idea general The [CT82]. version rpsdb lvln culysprmoe h aapit n sslnst onc h points plot the cycle connect the to contrast, lines In uses and y-axis. points the data on the points the superimposes actually of Cleveland position by correct proposed the in illustrates lines five graph the the overlay that suggesting is [ Bertin [...]” today, them community superimposition visualization superimpose the to in order used in terms cycles the up break to preferable periodicity [ obvious with 1967 series in time way representing similar of a indication first in a proposed already Cleveland had by Bertin adapted later was graph term of the each type used within This who behavior Cle94], month. the [Cle93, each as for well case, pattern—as yearly this plot—in the case, subseries this component—in seasonal the of plot 1.12. subseries Figure seasonal in shown [ as display, Cleveland single by a books in decomposition the of results the show Cleveland to 1982, is In step useful remainder/residuals). (or irregular [ Terpenning and and cases), most in (seasonal periodic adjusted seasonally this in presented usually are shipments, and numbers, production indices, price 1982. © Association [ Statistical (1982) Terpenning [ and Terpenning Cleveland and Source: Cleveland Image plot, cycle as know later plot, subseries Seasonal 1.13: Figure biu eidct ..,adtesuyivle oprsno h hsso ahcce tis it cycle, each of phases the of comparison a involves study the and [...], periodicity obvious by CT82 stacking Cle93 ssoni iue11,wihalw o netgtn h vrl pattern overall the investigating for allows which 1.13, Figure in shown as , rpsdsm rpia ehd o esnlajsmn,weeafirst a where adjustment, seasonal for methods graphical some proposed ] Ber83 , h ie bv ahohrbcuesprmoiinwudactually would superimposition because other each above lines the Cle94 .24 seFgr .4.H ugse ht [] hr is there “[i]f that, suggested He 1.14). Figure (see 214] p. , .CeeadadTrenn hnitoue h so-called the introduced then Terpenning and Cleveland ]. yl plot cycle . CT82 .5] oreyo h ora fteAmerican the of Journal the of courtesy 55], p. , Ber83 .24.Sekn in Speaking 214]. p. , ..Tm eisAnalysis Series Time 1.5. CT82 ]. 23 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 24 1. Introduction . eerhQuestions Research 1.6 eaeuigteetcnqe frpeetn eidccce nsc a norproposed our in way a such in cycles periodic representing of techniques these using are We pligvsa nltc otepolmadcalne ntm eisaayi pn pnumerous up opens analysis series time in challenges and problem the to analytics visual Applying hsfr ehv lutae n icse h rbesadcalne hthv rsni the in arisen have that challenges and problems the discussed and illustrated have we far, Thus eisaayi eln ihproi iesre o ohuiait n utvraetm series time multivariate and univariate both for series data? time periodic with dealing analysis series 1: Sub-Question into down it sub-questions derived break These to question. necessary main are: is the answer it to question, them research consolidating before comprehensive sub-questions this answer to order In question: research following the answer will we dissertation, this analysis. In data exploratory opportunities. the research and and analysis analytics series visual time of in field graphs, research statistical the as introduced visualization, also have of We history analysis. series time of field utility the show detection. then outlier We multivariate 5. for Chapter it in applying generalization series by a time approach propose novel multivariate we for this plots data, of cycle series of of time usage usefulness univariate the the using allow of to of Because limitation 4. the Chapter and in technique imputation this guided statistically and visually for solution x-axis arranged differently correct and the restructured periodicity. with a the suggested, charts, on line Bertin based like of mean periodic and, multiples the the y-axis small representing of the represents line phase on actually the same position them with the Together of in each year). points group, a the each of plot, of cycle month a each In (e.g., connected doing. are is cycle Bertin than way different a in first (1983, Bertin by graph the on based Bögl, [Ber83]. Markus 1967) by in data published different with Redrawn Source: Image cycle. each of [ Bertin cycles, in [ periodicity obvious Showing 1.14: Figure Ber83 oe eeto,prmtiain rdcin muain n ule detection? of outlier analysis and series imputation, time prediction, statistical parametrization, in selection, challenges model the support analytics visual can How ugse raigu biu eidct n ueipsn tt opr h phases the compare to it superimposing and periodicity obvious up breaking suggested ] svsa nltc naeut upr o h hlegsi ttsia time statistical in challenges the for support adequate an analytics visual Is Ber83 .A al s16 Bertin 1967 as early As ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . i n Contributions and Aim 1.7 ete aetese otescn anpart, main second the to step the take then We okbfr umrzn h otiuin oehrwt noeve ftermiigstructure remaining the of overview an with together contributions the summarizing before work the In dissertation. a of structure the in required is it as purpose, specific a has which h neto of intention The analysis, series time statistical in challenges of understanding an provide to is work this of aim The aigtd eaopoe,cnrse,adopsdutlte rnfr into transform they until opposed and contrasted, metamorphosed, variegated, nltc fproi iesre aai hpe .Atrmtvtn h oi ngnrland general in topic the motivating After 1. Chapter in data series time periodic of analytics nCatr3 eetn h oe eeto rcs n rps h nerto fprediction of integration the propose and process selection model the extend we 3, cycles. Chapter feedback In model short this with supports interfaces diagnostic prototype visual model our interactive for that through show cues task We visual selection computations. models incorporating adequate criteria by such information model finding with the in together of users for adequateness guide model the to the judging methods such of analytics and selecting fit visual for the introduce support on We adequate feedback data. of immediate lack the with a adjustment is parameter There allowing and data. published models given was the chapter for this for model of analytics appropriate content visual an The [ of al. role 2. et the Chapter Bögl covers in in analysis part series this time in in chapter selection first model The earlier. stated problems the the of aim document. the this described of and dissertation this in trend. tackled and questions basic seasonal, research some cyclic, our introduced periodic, stated between we we Next, differentiating Thereafter, analysis, series methodology. time continued scientific in We definitions the graphs analytics. of series visual time introducing foundation before historic a general summarized providing in briefly visualization We and on analysis. focused visualization series then and time and analysis of data history on short background a the provided offered we space, problem the describing part, main first the In The frills. no with part final the to the In detail. great in developed are the for themes prepare the of to where each introduced part parts, and four presented these is of material composed thematic the is form [ sonata classical theory the music that is in idea structure this music of background classical selection, from model inspired thesis of form this problem of a structure the into larger to The encapsulated applied detection. is analysis, outlier analytics and series visual imputation, time apply prediction, visual then parametrization, and in analytics, support visual provide and visualization to of ideas principle the introduce series? time multivariate and univariate in tasks detection outlier and imputation 3: Sub-Question tasks? prediction series time for parametrization and 2: Sub-Question .19—aey h oaafr of form sonata the 149]—namely, p. BAF recapitulation sa dqaevsa ersnaino eidctm eisbnfiilfor beneficial series time periodic of representation visual adequate an Is selection model of process the improve interaction and visualization can How + 13 exposition .A rvosyepand h rtse ntm eisaayi st find to is analysis series time in step first the explained, previously As ]. st eetadtesr h anteai aeiladtase it transfer and material thematic main the treasure and repeat to is eli u h onainb nrdcn h oi fvisual of topic the introducing by foundation the out laid we , coda exposition stefia atta,sml u,cnldstework. the concludes put, simply that, part final the is development , development development eew rps u ouin for solutions our propose we were , , recapitulation development h hmsaechanged, are themes the , ..AmadContributions and Aim 1.7. h egh main lengthy the , and , recapitulation coda exposition MV15 The . , , . 25 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 1. 26 Introduction ecniu ihitouigvsal n ttsial uddiptto fmsigvle in values missing of imputation guided statistically and visually introducing with continue We eeatpbiain oehrwt noeve ftecnrbtost hs aeso h author the of of list papers a these to with contributions end dissertation, research the We this the of of overview answer directions. an research thesis, with future this together and publications of relevant challenges contributions open main propose the and 6 questions, Chapter in discuss the and is conclude, dissertation this of part main and the explore cycles. Following to seasonal investigate representation and this outliers employ univariate then and We and variables multivariate plot. multiple interpret cycle for multivariate allow multivariate for to our plot measure in cycle distance this the Mahalanobis-based generalize represent a and use further of We step content series. a The time representation plot data. [ cycle al. a series et using time Bögl idea periodic in multivariate published was in 5 paper Chapter detection this in outlier series for time that multivariate values. to computed utilize representation the and plot of cycle adequateness the the extending propose of perspectives we judgment Finally, different visual from a benefit for allow to visualizing and data for data representation series the plot time cycle on periodic a for utilize imputation We statistical et of series. Bögl results time in a published in was values paper missing this imputing of content The 4. [ Chapter al. in series time seasonal univariate task. selection model insights the additional for give model to the series and of time parsimony prediction input and compute the adequateness to of the context model into the the in apply visually and prediction selection the model provide this for of part content diagnostic [ The model al. process. the et selection Bögl model in the published improve was to paper models series time of capabilities BFG + 15 .Tegnrlie fti otiuini opoieitiiegiac o h akof task the for guidance intuitive provide to is contribution this of idea general The ]. aksBögl Markus . BAF + BFG 15 .I diint h rvoscnrbto,w extend we contribution, previous the to addition In ]. + 17 eaiuainadcoda and recapitulation .I hscnrbto,w aeteimputation the take we contribution, this In ]. nwihw summarize, we which in , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Development atII Part 27 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 2–4] ntels etneo h atprgaho ae3 opeetapgbekwti the within pagebreak a prevent to 36 page on paragraph last the of sentence last the in 121–140]” iulzto n optrGraphics Computer and Visualization iha xml aae rmeieilg n neveswt w xenldmi xet in experts domain external two with interviews and epidemiology from dataset example an with h rgnlvrini vial at available is version original The h otn fti hpe a ulse n[ in published was chapter this of content The id iulaayisfrmdlslcini iesre analysis. series time in selection model for Alexander analytics and Miksch, Visual Silvia Lammarsch, Tim Rind. Filzmoser, Peter Aigner, Wolfgang Bögl, Markus [ to citation. refer we “[...] modified We authors. the from with interfaces visual interactive through is tasks TiMoVA selection cycles. that model feedback show in short scenarios experts usage domain the support and feedback to experts’ able the from scenarios insights usage The by statistics. evaluated was prototype stories The user experience. on this user For based on process task. feedback this this expert implements iterative in that experts and prototype domain TiMoVA guide the developed to we process purpose, interactive Visual through a combination propose this We However, support interfaces. computation. adequately automated not and do judgement this tools human for software statistical of used combination methodology close The a application sciences. demands many environmental in task or experts economy, domain epidemiology, for task as challenging such a areas is analysis series time in selection Model iulAayisfrMdlSlcinin Selection Model for Analytics Visual https://doi.org/10.1109/TVCG.2013.222 91)23–26 2013. 19(12):2237–2246, , BAF ieSre Analysis Series Time + 13 SS11 ]. © 03IE.Rpitd ihpermission, with Reprinted, IEEE. 2013 .1110”t [.]se[ see “[...] to 121–140]” p. , EETascin on Transactions IEEE CHAPTER SS11 p. , 29 2 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 30 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual oeta a ooecm h bv-etoe hrcmnsi,i h prto Visual of visualizations” interactive spirit with the techniques in analysis is, automated shortcomings “combine above-mentioned to the (VA), overcome Analytics to way analysis. potential series time A in selection model on working for beneficial be would literature the in exist series time these analyze to models and algorithms, methods, of range A rcia plcto cnroi,freape ulchat ffiilpeitn h number the predicting official health public a example, for is, scenario application practical A ait fpatclapiaindmis h ueaon fwr icsigAIAmodels ARIMA discussing work of amount huge The domains. application practical of variety [ xssa salse rcs o oe eeto nw sBxJnismtoooy[ methodology Box-Jenkins as known selection model for process established an exists t hrceiaint ug u ouinadaayetetssi eto ..I eto . we 2.4 Section In 2.3. Section in tasks the analyze and solution our judge to characterization its [ experts domain for support Therefore, the 4.2. of Section workflow in the argue support we 2.3. appropriately as Section not methodology, in Box-Jenkins do briefly tools underpinnings software theoretical available the currently and However, methodology this discuss and present [ class model this of importance of the reflects class large and prominent [ most models the ARIMA on seasonal focus and we ARIMA work namely our models, In computing. statistical for series. tools time a called is and dataset [ periodic a Such at optimally values. datasets missing time, The without over ideally collected flu. and Another seasonal observations intervals from from spaced year. suffering equally obtained next patients are the of analyzed in number be reasons the cardiovascular for to of prepared be because to treated is be scenario to need domains. that different people in of experts by performed task challenging a is analysis series time Statistical Introduction 2.1 see details more For refinements. iterative the model giving for and 2.5. the output transitions Figure model supporting model the (2) the of series, visualizing feedback time (3) visual the and immediate in interface, interest selection, visual of the model range via of interactively certain task selection a the choose in to experts enabling domain (1) for by guidance visual provides TiMoVA 2.1: Figure fmdlslcinfrtm eisaayi.Acrigt h hrceitc fdsg studies design of characteristics the to According analysis. series time for selection model of Mun08 BK11 KKEM10 2 1 , BJ70 , SMM12 , TC05 , BJR08 ,w edacmrhnieudrtnigo h oanpolm eprovide We problem. domain the of understanding comprehensive a need we ], .Ti assteqeto fhwt s Amtost upr h task the support to methods VA use to how of question the raises This ]. , CC08 , SS11 .Mroe,te r mlmne nms omnsoftware common most in implemented are they Moreover, ]. BK11 , BJ70 , BJR08 BJR08 , .Teemdl r ple na in applied are models These ]. CC08 , Ham94 , SS11 3 BJ70 n there and ] .We ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ebsdorwr ntetcnqe o iulzn ieoine aa[ data time-oriented visualizing for techniques the on work our based We l ao ahmtcladsaitclsfwr ol mlmn h tt-fteatmethods state-of-the-art the implement tools software statistical and mathematical major All they how and prototype the in used mechanisms interaction and encodings visual the describe we [ descriptions process VA existing on based process VA h eaaecluain n iulztosne ob ntae yteue ihrb menus by either user the by initiated be to need visualizations and calculations separate The [ TiMoVA techniques visualization information well-established use to is work this of objective main The VES[ EVIEWS n oesfrtm eisaayi,wihw eciebifl nScin23 econsidered We 2.3. Section in briefly describe we which analysis, series time for models and n nu om,o ycmadln ehd.Wa l hs prahsaemsig sthe is missing, are approaches these all What methods. line command by or forms, input and udnet rweadvsal opr oesdrcl hnslcigtemdl nta tis it Instead model. the selecting when directly models compare visually and browse to guidance interfaces. user driven menu by models and analysis series time support tools these all package [ computing statistical for project R the like tools, LAB [KKEM10, descriptions process analysis. VA series state-of-the-art time use in selection model of problem domain the to descriptions process and methods [ methodology study design the Following Work Related 2.2 by analysis visual series interactive time of in issues selection address model 2.7) the (Section ease paper to our guidance of contributions main the particular, In requirements. our fulfill assess users target our how argue to feedback two user with informal session this feedback dataset. used example a We an performing experts. by on domain experience scenarios external user usage We the the evaluated apply dataset. also example we we an where Furthermore, to 2.6, of prototype Section version the in final evaluation applying the the and evaluated present scenarios we usage design, defining iterative by the prototype and the evaluation formative the to addition In of abbreviation an is which TiMoVA, prototype formative [ using statistics experience were and user prototype visualization the of information and evaluation in process VA experts the by of judged implementation and and refined iteratively design The it. facilitate to prototype [ in used data the specify and tasks, the analysis. for series requirements time the formulate users, target the identify eto .,weew ics h Apoesdsrpinadorpooye enmdour named We prototype. our and description process VA the discuss we where 2.5, Section AMST11 • • • iulzn h oe rniin oeal h oaneprst eiewehro o the not or whether decide to experts domain the while improves. enable model expert to domain transitions model the the to visualizing results and order, model model the the order, of adjusting model feedback the about visual information immediate the function providing get autocorrelation experts partial domain and the where autocorrelation the plot, inside order model the selecting LBI , IHS13 Cle93 + 12 ,Mteaia[ Mathematica ], n pl hmt h atclrtre rbe.Frti esn edfie a defined we reason, this For problem. target particular the to them apply and ] .I h icsino h eut Scin25 n h vlain(eto 2.6), (Section evaluation the and 2.5) (Section results the of discussion the In ]. LBI Wol13 + 12 .W rsn h eut fti enmn rcs in process refinement this of results the present We ]. ,Saa[ Stata ], Mun08 C20 R Ti Sta11 , esre analysis, series me SMM12 ,adGel[ Gretl and ], ,SSJP[ JMP SAS ], KKEM10 ,w pl xsigadkonVA known and existing apply we ], + 1 TC05]. 11, , LAB CL13 SAS12 Mo + .Ecp o h base R the for Except ]. 11 e eeto,and selection, del AMST11 ,MTA [ MATLAB ], n mlmne a implemented and ] , Cle93 ..RltdWork Related 2.2. The10 and ] VA ], . 31 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 32 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual nti eto epoietecaatrzto ftedmi rbe [ selection. problem model domain the the for of necessary characterization tasks the provide we section this In Characterization Problem 2.3 motivation our ground and detail following more the in problem in target information our further. background specify necessary and characterize the to give section we solutions. their findings, and these tasks by the Motivated in differ but data, time-oriented to methods VA apply TimeSearcher series example approaches time for as statistical VA, in in work selection related distantly model [ only of is there problem knowledge, target our to specific analysis, very the have we Because directly and browse to not history but a settings provides previous them. also loading compare It for allows only. which input models, form-based computed using outliers for call of X-12-ARIMA editing the manual seasonal for interactive the parameters do of the to results user the the and enable series to time [ as the well explore as to adjustment is series approach time their of focus The software. the for interface the is well. solution repeated notable [BJ70] the One methodology support Box-Jenkins not arrange iterative do and the they models in summarize, of steps To bunch separate whole them. the a compare of compute execution to or hand by models of visualizations set in a them on decide either to necessary process. the series about time details of for selection 2.3.1 model Section for See process iterative [BJR08]. An Methodology. Box-Jenkins 2.2: Figure BAP KMST12 + 05 , .Hwvr h sritraespot h sri eetn iesre n adjusting and series time a selecting in user the supports interface user the However, ]. BPS + 07 x12 n h okfrvsa-neatv iesre rpoesn [ preprocessing series time visual-interactive for work the and ] [ KM12 x12GUI akg,wihpoie rpe ucint h X-12-ARIMA the to function wrapper a provides which package, ] [ KMST12 akg o .I ffr nitrciegahcluser graphical interactive an offers It R. for package ] SMM12 n ics the discuss and ] BRG + 12 .Both ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. aeut”mdlfratm eis u ecno sueta ti tu”o cret model. “correct” or “true” a is it that assume cannot we but series, time a for model “adequate” edsrb hsmtoooyi iue22adi h following. the in and 2.2 Figure in methodology this describe We h netit fsc oesi u tagtb sn h em“ettvl netiigamodel” a entertaining “tentatively term the using by straight put is models such of uncertainty The “Essentially, that Box George of quote famous the in summarized is selection model of crux The [ version simplified a to reduced sometimes for is used method be The subsequently can which found, is model adequate an until repeated are steps These [ methodology Box-Jenkins The ie iesre seFgr .) thsbe ieyue ntm eisaayi n san is and analysis series time in used widely been has It 2.2). Figure (see series time given a .. o-ekn Methodology Box-Jenkins 2.3.1 of Parsimony. of Principle 2.3.4). Section (see [BK11] [ [ Jenkins useful” and are Box some methodology, but wrong, are models all 2.3.5. are and (2) 2.3.4, entertained tentatively be can diagnostics that model model a as of entitled identification and the combined and (1) models of class 2.2). Figure (see forecasting parameters. its estimate and data the (4) to model the Fit (3) (2) to: necessary [BJR08] al. et Box to (1) according is it series, time given a for [ selection model for method established task. this solve model to a method find standard to a has is official methodology Box-Jenkins health The the prediction, data. cardiovascular the given from do the death To on of based program. number prevention expected a the start predict to to disease needs Angeles [ official Los study health NMMAPS in public the older a the from and that contains 2000 75 in aged to dataset evaluation people 1987 The years the of following. the disease for for cardiovascular the domain from in deaths this task of analysis from number possible dataset daily a a illustrate and chosen 2.6.1 have Section We epidemiology. and health Task. and Dataset Example parsimony ial,cektemdlwt igotc oucvrpsil ak ffi n n hi causes. their find and fit of lacks possible uncover to diagnostics with model the check Finally, rough some gives and model the in the parameters on them. of for deciding number estimates by the model determines below) This (see parsimonious order. model appropriate an select time to and extensive methods too Apply be would data, to of directly consuming. class models general these By Fitting models. components. of model class [ general al. useful et a Box consider models to practice and experience the theory and from mechanisms underlying the about knowledge theoretical incomplete the Use [ BJR08 epeetamr ealddsrpino hs eaaesesi eto 2.3.3, Section in steps separate these of description detailed more a present We . .I sdsrbdb h aahae ut fAbr isen“everything Einstein Albert of quote paraphrased the by described is It ]. BJR08 oe specification model nipratpicpei h oe eeto rcs steprinciple the is process selection model the in principle important An enaysbls obnto fAIAadsaoa ARIMA seasonal and ARIMA of combination subclass any mean ] nipratdmi hr iesre nlssi sdi public is used is analysis series time where domain important An BJ70 BJ70 s agaeta niae htteei ueu”and “useful” a is there that indicates that language a use ] sa trtv rcs oslc naeut oe for model adequate an select to process iterative an is ] BK11 tp()i eae to renamed is (3) Step . BD87 , BJR08 .44.We nrdcn h Box-Jenkins the introducing When 424]. p. , , PW04 CC08 CC08 , , SDZ ,weetedcso o general a for decision the where ], SS11 + oe fitting model .T n rslc model a select or find To ]. 00 .Apsil cnrois scenario possible A ]. ..PolmCharacterization Problem 2.3. n tp()to (4) step and 33 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 34 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual Autoregressive ihcasclrgeso ti fe o osbet xli iesre ucety[ sufficiently series time a explain to possible not often is it regression classical With AR An ARMA AIA oes nARIMA An models. (ARIMA) AM)mdl.T civ asmn,AM oescmieteieso RadM models. MA and AR of ideas the combine models ARMA parsimony, achieve To models. (ARMA) hs omldfiiin n h eal r o eesr oudrtn n ru h design the argue and understand to necessary not are details models the different and the definitions of formal ideas These key the describe briefly We exist. models alternative Therefore, oicuesaoa em namdli sncsayt obn nodnr o-esnlARIMA non-seasonal ordinary an combine to necessary is it model a in terms seasonal include To o oedtisadtefra entosw ee oteltrtr ntm eisanalysis series time in 2.6. literature and the 2.5 to refer Section we in definitions representations formal visual the the and of details models. more interpretation these For the of definition explain formal and the choices of details full the presenting without them explain and h Amdl r xeddt h iesit,cle a,o h esnlperiod seasonal the difference of non-seasonal lag, called shifts, time the period letters to seasonal extended the are to models extended MA is the that model ARIMA an with model called aggregated, are be models to These needs series called time integrated. sometimes differenced called operation, the transform also differencing series, to a or time possible applying original is by the It series recover To time trends. stationary detrending. to a due trend. to non-stationary or series is effect time series seasonal this time no a is cases, there practical that many means In which stationary, is series time the if models of principle the with because [ conflict models, Jenkins in MA and is only Box This or cases, AR these parameters. only For many with parsimony. with series of model time high-order a a model demand to would problematic it is it cases some In MA an therefore, order, model the order called of again model is average the terms and noise term white noise past white of current the of combination linear order of model values autoregressive past of number The series. SS11]. CC08, BJR08, BJ70, [BK11, Models ARIMA Seasonal and ARIMA 2.3.2 time [ given models a of for class section. following model specific adequate the a for in an introduced find was to methodology how This methodology the series. presented we section, this In time [CC08]. preferable the is represent adequately parameters least to the models with candidate model different the are there series, [ if simpler” that not means but this possible selection as simple as made be should ( uoersiemvn average moving autoregressive ? ) % oe ato order of part model ( ,@ ?, and ) & model. A)mdl xli h urn value current the explain models (AR) o h esnlmdlodr h esnldifference seasonal The order. model seasonal the for @ 3 . u ihtm hfso h esnlperiod seasonal the of shifts time with but , ? n MA a and ? . ( oigaverage Moving ,3 @ 3, ?, ARMA ? sas aldmdlodr hrfr nAR an therefore order, model called also is ( ) @ ) en htthe that means ( ,@ ?, oe ato order of part model ) oe forder of model BJ70 M)mdl xli h urn value current the explain models (MA) uoersieitgae oigaverage moving integrated autoregressive G presented ] C safnto of function a as BK11 3 @ hdffrneo h iesre san is series time the of difference th atwienietrs h number The terms. noise white past @ .1] ntepoeso model of process the In 18]. p. , ? ti osbet pl hsclass this apply to possible is It . uoersiemvn average moving autoregressive and B @ aldsaoa a.Thus lag. seasonal called , B BJ70  stecmiaino an of combination the is , hrfr,teA and AR the Therefore, . ? sas ple iethe like applied also is ( atvle ntetime the in values past @ ,wihw describe we which ], ) oe samoving a is model B ( n s capital use and ? ) oe san is model G SS11 C sa as ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hr hslna eednei nldd scle C/AFpo n nbe st choose to us enables and plot ACF/PACF called is included, is dependence linear this where the For lags. over function a as PACF the likewise is plot PACF The lags. over function a as ACF in used models of class the and 2.3.1 Section in methodology Box-Jenkins the introducing After h C stecreainbtenaytovle natm eiswt pcfi ieshift, time specific a with series time a in values two any between correlation the is ACF The al .:AFadPC eairfrAM n esnlAM oes[ Models ARMA Seasonal and ARMA for Behavior PACF and ACF 2.1: -xs nti aetelbl r esnllg,wihmasta n a ersnsoeseasonal one represents lag one that means which lags, seasonal are labels the case this In x-axes. aldlg h AFi h orlto ewe n w onswt pcfi a,weethe where lag, specific a with points two any between correlation the is PACF The lag. called ol eaeut o h o-esnladsaoa ato h model. the of parameters part of seasonal number and what and non-seasonal model the of for class adequate which be indicate PACF could the and ACF the of behaviors the on lags the show and we y-axes 2.3 the Figure on In plotted are PACF difference. plot ACF/PACF and seasonal ACF the and The plot, difference series plot. ACF/PACF of time example level the an the to for addition idea In first a model. plot, provides the ACF for the parameters with of combined number plot PACF the This removed. is between in points the of effects linear analysis series time in literature the to again refer we PACF the [ and ACF the of definition formal Plot. ACF/PACF order. model the of decision series log the time or support the operation to viewing difference checked by a are done as plots is such this transformations, Usually required achieve all series. the applying to transformation, time step After given first be the plot. The could at line look model. a that a the in of models take in order of to used the is parameters class determine goal of and a this number differencing on the of decide specifies level which the to model select is the series, goal time given the the specification, for model appropriate of task the For Specification Model 2.3.3 selection 2.3.5. model and iterative 2.3.4, this 2.3.3, of steps Sections following separate the the in discuss process we section, this in methodology that called are models average resulting moving The integrated removed. are effects seasonal additive the yl.Tennsaoa asaefatoso n,dpnigo h esnllnt.Frexample, For length. seasonal the on depending one, of fractions are lags non-seasonal The cycle. BK11 , BJ70 , PACF PACF ACF ACF where * BJR08 * * uoorlto function autocorrelation h C lti pk rp,wihi pca yeo a hr,o the of chart, bar of type special a is which graph, spike a is plot ACF The , al ffCt ffatrlag after off Cuts off Tails AR uso fe lag after off Cuts lags at off Tails AR lag after off Cuts : CC08 · B ( ( % ? SRM)mdl n r eoe sARIMA as denoted are and models (SARIMA) r utpe of multiples are ) ) B , SS11 n ieadsrpino h ai da ntefollowing: the in ideas basic the of description a give and ] : %B ? · B B al ffa lags at off Tails MA MA uso fe lag after off Cuts al ffTisoff Tails off Tails for , AF and (ACF) ( ( & @ : ) ) = B 1 , 2 . . . , ata uoorlto function autocorrelation partial : &B @ · B al ffa lags at off Tails al ffa lags at off Tails ARMA ARMA al off Tails ( ( ,& %, ,@ ?, esnlautoregressive seasonal ( ) ..PolmCharacterization Problem 2.3. ) ,3 @ 3, ?, B : : · · B B ( × ) SS11 , & , %, (PACF) .The ]. ) B . 35 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 36 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual codn oTbe21ti niae,ta naeut oe o h esnlcmoetcould component seasonal the for model adequate an that indicates, this 2.1 Table to According hc stenme fprmtr.Tedifferences The parameters. of number the is which hr r eea ehd oetmt h aaees h otipratoei the is one important most The parameters. the estimate to methods several are There dnie,i sfitdt h iesre aat siaeteukonprmtr ftemodel. the the of and parameters unknown the estimate likelihood-estimation to data series time the to fitted is it identified, [ al. et Box series. a is and result The order. model the ARIMA configure seasonal and a select example we for how model, discussed we section, previous the In Fitting Model 2.3.4 2. lag at off cuts plot PACF the models. in and off AR 1 an tailing lags be seasonal is the plot consider ACF we the If that 2.1. notice Table we in 2.3, shown Figure is models behavior in ARMA The and way. MA, similar AR, a for PACF in the and ACF is the it PACF of and [ behavior ACF, basic models, the average moving identify 1 models, to are autoregressive possible lags the seasonal of the definitions 12, the of Using length seasonal a and are year lags one non-seasonal in the months and 12 with dataset a in (see dataset example the displays plot SDZ This decide [PW04, study to 2.1. NMMAPS experts Table the domain to from enables according 2.6) lags model Section the the of of behavior order The the Lags. on over PACF and ACF 2.3: Figure rsne bv.Nt htteparameters the that Note above. presented 3 SS11 h ee ftesaoa difference seasonal the of level the , @ swl stenme fsaoa parameters seasonal of number the as well as , .Lkws,i spsil odsrb h eairfrtesaoa opnn ftemodel the of component seasonal the for behavior the describe to possible is it Likewise, ]. nodtoa es squares least unconditional ( 2 ) 12 oe.Cniun hsfrtennsaoa as egtasto osbeadequate possible of set a get we lags, non-seasonal the for this Continuing model. BJR08 te ehd r the are methods Other . s h term the use ] 12 1 , 12 2 o eal n hoeia icsinse[ see discussion theoretical and details For . . . . , ( ettvl netie model entertained tentatively ,3 @ 3, ?,  . h esnllength seasonal the , ? ( × ) ehdo moments of method , @ , , & , %, % and , + 00]. 3 % and and ) & B oe,weetelvlo difference of level the where model, eemn h re ftemodel, the of order the determine  & r rnfrain otetime the to transformations are B h ubro parameters of number the , r e codn otesteps the to according set are the , o hs netemdlis model the Once this. for es qae estimation squares least SS11 .121–140]. p. , maximum , , 2 2 . . . , . . . , ? , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. codn oorfidnsaottedmi rbe,rltdwr,adepr edak we feedback, expert and work, related problem, domain the about findings our to According adequate an get to order In samples. larger for suited well is BIC the samples, smaller for well h C ftersdasi acltdadpotdoe h ast hc htteei oremaining no is there that check to lags the over plotted and calculated is residuals the of ACF The methods diagnostic model series, time underlying the represents model the well how evaluate To iulfebc mrvstewyt copihteetssadspot h oanexperts domain the supports and tasks these accomplish to way the improves feedback visual h eiul o ahlgaeidpnet ffrallg h -ausaentsgicn o a for significant not are p-values the lags all for If independent. are lag each for residuals the r ple.Temdli igoe yaayigtersdas hc en h eann part remaining the means which residuals, the analyzing by diagnosed is model The applied. are xml ilg,ceity pdmooy cnm,o niomna eerh h sr have users The research. environmental or economy, epidemiology, chemistry, biology, example Users. Target the present as well as stories user model for using challenges them and formulate used. tasks and data the 2.3 distill Section users, in target discussed our selection describe explicitly we section this In Analysis Requirements 2.4 following the in them present and requirements the analyzed we section. this, achieve To work. and their interactions in intuitive evident, additional is with It visualizations experts. and domain by computations execute the to combining cumbersome that currently are tasks these that conclude (5) area [ in TiMoVA to of refer interface we user criteria graphical information the 2.5. in the Figure decide of criteria on to information details possible such more is show model, We it For the models, for 50–53]. tentatively. different parameters them for of values of number these one a on select on Based to is criteria. goal the the minimizing 2.3.1, thus Section in introduced as model, the and often AICc, are that the form, Criteria [ corrected model. (BIC) bias the criterion its of information fitness (AIC), Bayesian the criterion on information decide Akaike’s to include basis used good a provide They criteria. on based Criteria. is Information and possible not often is specified, decision well clear experience. is A and model readjusted. judgement the be human fulfilled, no to is is needs this there model all that the If assumed otherwise be residuals. can the it within line, autocorrelation dashed remaining the by indicated level, significance pre-specified plot quantile-quantile the expected by are checked residuals is standardized distribution the This fitted, [ well distributed. is normally model standard the be If to (4c) (4b) residuals. the process. in underlying remaining structure is a residuals part unveil standardized remaining can The the episodes (4a) series, non-random ways: Any time four the in time. assessed to over as is fitted plotted plots This well are by is noise. done model white is the like residuals behave If the to expected of 2.5. analysis Figure exploratory of The (4a-d) in model. shown the by explained not is that Diagnostics Model 2.3.5 Cle93 .(d h lto h jn-o ttsi [ statistic Ljung-Box the of plot The (4d) ]. u agtuesaedmi xet nayfil sn iesre nlss for analysis, series time using field any in experts domain are users target Our nadto otedansi lt ti osbet xmn information examine to possible is it plots diagnostic the to addition In SS11 .I otatt h Ic hc osbhv very behave does which AICc, the to contrast In ]. BP70 , LB78 sats hthlst hc if check to helps that test a is ] ..Rqieet Analysis Requirements 2.4. SS11 p. , 37 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 38 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual easm ohv nvraedt ausosre teudsatdsrt ieitraswithout intervals time discrete equidistant at observed values data univariate have to assume We and team project the of meetings repetitive the in stories user the refined and formulated We ihntepoetta,wt utmr,o te tkhles srsoiseovdfo the from evolved stories User stakeholders. other or customers, with team, project the within Aa ai odfieaV rcs ecito ooecm h ttdpolm.I hssection this In problems. stated the overcome to description process VA a define to basis a as VA forwr.T os,w rvd h ecito fatioe Apoesi eto .. htis that 2.5.1 Section in process VA tailored a of results and description contributions the provide main we the so, present to do To findings our work. on our rely of we identified 2.6 We Section domain. following problem the and the of tasks and characterization the Analysis discussed Series we Time 2.3 Section in In Selection Model for VA 2.5 values. missing The users. target epic: our following of the perspective in the and formulated from process VA is written the goal are high-level formulate stories to The stories prototype. user the these implement used We evaluation. formative the throughout Coh10]. [Coh04, The epics called specific. also very are are coverage they broad until the stories with user high-level stories the smaller user refinement, to high-level iteratively of down process broken the goals are Through These stories stories. known. user user are coverage as broad formulated are a requirements with and requirements [ and methodologies goals development high-level software only agile beginning and lean other in [ role methodology development software (XP) programming extreme as Tasks. well as models series time criteria. the selection and the interpret series and to time fitting required the model are of the skills representations These visual analysis. the series understand time and and statistics in knowledgeable be to ieSre Data. Series Time [Coh10]: requirements the of understanding detailed more a get to defined are stories User • • • • • eiei n oe sbte hnteother. the than better is model one if decide components me. seasonal . no distract if not and do influence they seasonal needed, the are compare can I model. that adequate so inputs an parameter find intuitively can I that glimpse. . so one orders with model series these time about the information fits model the well how assess . and model the on step. decide selection model the for series . time the of subregion any use A forecasting. e.g., purposes, different for model that use can A . att e l motn iulztoso h iesre n h oe ota can I that so model the and series time the of visualizations important all see to want I .., . att nld n xld h esnlcmoet ftemdladteseasonal the and model the of components seasonal the exclude and include to want I .., . att e o e eetdmdlcmae otepeiu oe ota can I that so model previous the to compares model selected new a how see to want I .., . att duttemdlodr ttepaeweetevsaiainpoie the provides visualization the where place the at orders model the adjust to want I .., oanepr ue) att n naeut oe o ie iesre ota I that so series time given a for model adequate an find to want I (user), expert domain a s oanepr ue) att eetacranrgo ntetm eiss htIcan I that so series time the in region certain a select to want I (user), expert domain a s srsoishl ofruaeterqieet nawyta ses ouei discussions in use to easy is that way a in requirements the formulate to help stories User h osdrddt o u okaetm eisa nrdcdi eto 4.1. Section in introduced as series time are work our for data considered The Coh04 Bec00 , n aea important an have and ] Coh10 .Uulyi the in Usually ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. edsrb h Apoesfrmdlslcini ealt rpr h edrfrtediscussion the for reader the prepare to detail in selection model for process VA the describe we ( Fgr .) h Apoesdsrpinfrmdlslcin(iue24 n h Box-Jenkins the and 2.4) (Figure selection model for description process VA the 2.5), (Figure h iesre nFgr . is 2.2 Figure in series time The methodology Box-Jenkins the is selection model for description process VA our of basis The rsne nFgr . n icse nScin231 h rcs ecito eso in show we description process The 2.3.1. Section in discussed and 2.2 Figure in presented oe trtvl,t n naeut oe o ie iesre.Ti rcs ecito is description process [LAB This data time-oriented series. for the time process selecting given VA for a the for process on VA model based the adequate shows an figure find The to iteratively, Selection. model Model for Process VA 2.4: Figure the using analyzed is model ( resulting parameters given the with estimated ( model the of order the the model, series ( analysis series time the about interpret To differencing. of level the and experience on based prototype the of interface user 2.2). graphical (Figure methodology the between connection the following this the about an In of 2.5.2 select underpinnings 2.3. Section to Section theoretical in in is the discussed process of briefly the details are analysis of The series goal time series. The statistical time in process given analysis. a series for time model [ in data adequate time-oriented selection for model process VA of the problem of application the is 2.4 Figure Selection Model for Description Process VA 2.5.1 2.4. Section are in results specified These a requirements and the prototype. design fulfill the final to in the implemented techniques provide and interaction we designed and 2.5.2 choices Section design In the of prototype. discussion the of implementation the for used the

? Data sue.Bsdo hskoldeadtevsa ersnain ftetm eisadtime and series time the of representations visual the and knowledge this on Based used. is ) (  8 odcd ntecaso oesadajs h ubro aaeesa elas well as parameters of number the adjust and models of class the on decide to ) Hypotheses ro Analyses Prior Knowledge Domain  Input Data ro Analyses Prior ? ,the ),

C K K n bu iulztoso iesre n iesre models series time and series time of visualizations about and ) r omd( formed are t m Data K A D Hypotheses p i i A rvddas provided d oanKnowledge Domain  iulInterfaces Visual < Hypotheses Interactive The . obidamdlbsdo h input the on based model a build to ) Models neatv iulInterfaces Visual Interactive + C V V r end ae nthe on Based refined. are .B dutn h ee fdffrnig( differencing of level the adjusting By ). d t rao srInteraction User of Area neatv iulInterface Visual Interactive Input A ..V o oe eeto nTm eisAnalysis Series Time in Selection Model for VA 2.5. p I I + v m 11]. B nFgr ..The 2.4. Figure in m bu iesre oesadmodel and models series time about I h Insights h aa“best” data the fits that Model the , LAB Hypotheses oanKnowledge Domain oanKnowledge Domain + sue ovisualize to used is 11 ntedomain the on ] Data h oe is model the (   8 .The ). 3 and ) is 39 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 40 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual 4)tepvle fteLugBxsaitc vrlags. over statistics Ljung-Box the of p-values the (4d) motn eindcsosfrorpooye eipeetdtepooyei aa For Java. in prototype the implemented We most the prototype. highlight our we detail, for in decisions ideas implementation design and important choices design the discuss we Before Choices Implementation to stories [ user feedback the user version. of acquire final formulation and the prototype implemented with working we along a 2.4) develop design (Section quickly our stories refined user We the prototype. and TiMoVA above our description process VA the on Based Prototype TiMoVA 2.5.2 The interaction. forecasting. user for through used process be the can and series, Interaction time User given of the for adequate ( is model that adequate parameters, the the to of lead process which refinement estimations the parameter the (2) visualized, is that the interpreting ( parameters model and and distribution, the normal (4c) standard lags, the the of over quantile residuals the the analysis against of residual residuals ACF the standardized the for the (4b) area of time, the over quantile in (5) residuals plots standardized and The the plots, analysis (4a) criteria. (3) residual are information toolbox, the the selection (4a-d) including model selection, history the model model (2) further the data), as (input the well plot in as series views plot ACF/PACF time multiple the the and is coordinated (1) the where showing interface, is user figure The Overview. TiMoVA 2.5: Figure igotc ( diagnostics

< swl stevsaiain ftesadrie eiul,ifraincriteria, information residuals, standardized the of visualizations the as well as ) neatv iulInterfaces Visual Interactive shglgtdi ryadidctstepoesses hr h sri atof part is user the where steps, process the indicates and gray in highlighted is + 3 .I hsieaierfieeto h process, the of refinement iterative this In ). Hypotheses ulig( building 3 2 1 (  E eiigtefins fteudryn model underlying the of fitness the deciding ) 4d 4b 4a  ℎ .Tersl samdlwt estimated with model a is result The ). SMM12 Insights 4c .Hr epeetthe present we Here ]. r andb (1) by gained are  < ,ad(3) and ), Area 5 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. C/AFpo 3 h srcnajs h ubro oe aaeesdrcl ihnteplot. the within directly parameters model of number the adjust can user the (3) plot ACF/PACF edfie nScin251 h iulztosaeisie ytevsaiain sdi n by and R in used visualizations the by inspired [ are techniques visualizations visualization The well-established 2.5.1. Section in defined we tm eispo) nti iwi spsil oepoetetm eisadslc eti range certain a select and series time the explore to possible is it view this In plot). series (time h lt nae 4-)so h eut fteprmtretmto steposfrteresidual the for plots the as estimation parameter the of the results In the selection. show model (4a-d) the area for in used plots are The (3) area in plot ACF/PACF the and (2) area in toolbox The [ views multiple and coordinated of consists prototype TiMoVA The description process VA the of workflow the on based is prototype the of interface user graphical The hs r h ra o h ofiuaino h oe re.I h olo the toolbox the In order. model the of configuration the for areas the are These etcllnsi iue26cnb rge ln h -xst eetteodro h oe,which model, the of order the select to x-axis the along dragged be can 2.6 Figure in lines vertical fteatrgesv ato h oe AR model the of part autoregressive the of parameter the for the for numbers input the the slider disables or the enables as in also well residuals which as the model, of the plot in ACF component the seasonal detail. in and more below in The plot ACF/PACF shown (4b). the are area in (3) lags plot of ACF number the the changes and (2) toolbox selection criteria. model information the the including 2.6 history Figure model In the displays (5) area in table The analysis. 2.5.2. Section of in (1) discussed corner are left selection range upper the the for in interactions shown the is of selection details range The This 2.1. selection. time over Figure model plotted the series for time used the is video displays that supplementary (1) the 2.5, Figure in In and 2.5 2.6.1. Figure Section in in shown scenarios is usage areas the five of the and interface user graphical implements that prototype analysis. a series is time result in The selection models. model the for select process VA interactively the to able is user the that Interface background. User the Graphical in running good are shows calculations interface the user to when the computer even result, the input, a allow user As which for caching, times request. and reaction calculations upon threads while them Java provide input, using and user by models to achieved responsive pre-compute is still This is out. interface large user with carried especially the are time-consuming, that very important be is can it analysis datasets, series time given. in is calculations TimeBench the using Because projects other Furthermore our R. of in interconnectivity implementing and than extensibility interactivity the regarding in possibilities [ R package more use rJava have use to the we to of us , us part enables allows is (JRI) that JRI Interface way Java/R Java. a Using with in combination variety computations. designed statistical broad is the a prototype for provides our methods R and these literature tasks. from calculation known other [ methods and computing of analysis statistical for series project time R for the toolkit hensive use to [RLA was data decision time-oriented important for One library software a TimeBench, [ used prefuse framework we software data the time-oriented used we visualizations, dynamic the ssnnmu ihtenme fprmtr.Teei n etclln for line vertical one is There parameters. of number the with synonymous is esnlDifference Seasonal nld esnlParameters Seasonal Include ldr ihthe With slider. 3 n esnlparameter seasonal and AMST11 ( hc o nbe rdsbe h ofiuaino the of configuration the disables or enables box check ? ) hr saohrvria iefor line vertical another is There . Difference , ..V o oe eeto nTm eisAnalysis Series Time in Selection Model for VA 2.5. Cle93 Urb20 .W xeddteevsaiain so visualizations these extended We ]. nR ecoeJv,bcuewith because Java, chose We R. in ] ldradthe and slider  r eetd h continuous The selected. are Rob07 HCL05 .A vriwo the of overview An ]. C20 R ? esnlDifference Seasonal hc steorder the is which , .A nAIfor API an As ]. esnlSpan Seasonal @ a lag max sacompre- a as ] hc sthe is which , + 13]. input , 41 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 42 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual hc steodro h oigaeaepr ftesaoa opnn ftemdlMA model the of component seasonal the of part average moving the of order the is which epoietecrepniglbl rmFgr .,tenme foii rmteoriginal the from origin of number the 2.4, Figure from labels corresponding the provide we 4-) e eto .. o h eal nteeplots. these on details the for 2.3.5 Section See (4a-d). (2); 3; 2, 2.5: (Figure (1); 2.2: (Figure ioAsosvsa ersnain o h oe igotc 4-)ad() nodrt evaluate to order in (5), and (4a-d) diagnostics model the for representations visual shows TiMoVA span seasonal The auso h oe nomto rtra h el ftemdlciei r ooe codn to according colored are criteria model the of cells The criteria. information model the of values nefc Fgr .:4-,5 iue22 (4); 2.2: Figure 5; 4a-d, 2.5: (Figure interface ato h srinterface. user the of part models of class general a on decide we interface, interface user user the the in in plots area the affected viewing the By of 2.5. number Figure the in and 2.2, process, Figure the in in transition is methodology each Box-Jenkins For 2.5.1 selection. Section model process VA of in the task defined facilitates the for interface cycles description user feedback the process short how creates VA and detail the in explain how We TiMoVA. describe in implemented we paragraph, this In methodology Box-Jenkins the and Description Process VA the work. Connecting in future use the to in wants include he to or feature she possible visualizations a and is tests This which process. customize the and used adjust visualizations to [ user analysis and the tests series enable standard In time R. the for in on literature methods focused the these we in a of prototype of implementations the many randomness of are the implementation there for our and tests statistics essentially in are manifold which are noise, dataset, table. white this for below tests shown and is analysis legend Residual compared The worse history. or (minimum) model better the is in model others this the for the to criterion show this columns whether other indicating the value and their model the transitions the describes in column in used second a color shows The introduced the 2.5 plots. represents as Figure residual column in the first criteria, (5) The of table information models. criteria selected the information previously The TiMoVA. included of of history we design the analysis, in residual 2.3.5, Section the with to numbered are addition residuals In the of analysis the for used plots representation the the the displaying of by area representation inspired the visual is 2.5 The not prototype Figure is noise. the In that white of R. series be interface in time to user likely the the is of in it part residuals if remaining standardized check the task and explore this model, visually for the to used by is representations described goal visual The the and 2.3.5. diagnostics Section model in the discussed We shown. are AR model the of component seasonal for the one of appear, part lines autoregressive vertical the continuous additional MA two model box, the check of the part average moving the of order h plcto ftedmi nweg ( knowledge domain the of application the h teso h oe o h ie iesre.I hsae h eut fteprmtrestimation parameter the of results the area this In series. time given the for model the of fitness the ftemdl(iue22 (3); 2.2: (Figure model the of  < .Teajsmn fterlvn aestigr h ytmt siaeteparameters the estimate to system the triggers faders relevant the of adjustment The ).   8 B + , 3 a eajse sn the using adjusted be can  , C .B dutn h ee fdffrneadtenme fmdlparameters model of number the and difference of level the adjusting By ). ? + , C ,w dniyas aldtnaieyetrandmdl(iue2.2: (Figure model entertained tentatively called so a identify we ),  8 n hwtersligdansisimdaeyi h user the in immediately diagnostics resulting the show and ) BK11

C , esnlSpan Seasonal + , 3 ? BJ70 (  , , @ ) ftesaoa opnnsaeealdby enabled are components seasonal the If . < 8 + , , r ato h sritrcin u not but interaction, user the of part are ) BJR08 C .Teisgt and( gained insights The ). pnbxi h toolbox. the in box spin , CC08 ( % , ) SS11 B % n nte n for one another and , hc steodrof order the is which , .I sdsrbeto desirable is It ].  ℎ  , E  , < and ) ( & & ) B , . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. nte motn einrqieetwst iulz h hnei h lt hnadjusting when plots the in change the visualize to was requirement design important Another etcllnsaeue oslc h iesre oe.I hsfiuete r e to set are they figure this In model. series time the select to used are lines vertical & h oe re n ieteue h oto vrteetastos hsi upre by supported is This transitions. these over control the user [ the manipulation give direct and order model the ftepos hsesrsta ahprmtrcmiaini ieetclr n h fading the and color, coloration different the a for is sequence combination cyclic parameter endless each an that as ensures used This is colors plots. of the set of This screen. a on distinguish [ ColorBrewer2 using [ [ transitions reaction fast a series. keep time and larger task for smaller even selection certain estimation model a parameter the are select model for to visualizations the series user for the time the time larger and enables a recomputed feature from are This interest they of views, changes. region region other the the as to soon linked as is updated the it or select, using If whether can views select user 2.6. other can Figure the The user to in The connected selection. cursor. is model mouse selection the possible the the for using is not selection used It the is position. remove the that its and about move, to interval Details resize, cursor time mouse time. a the of select moving resolution specifically when suitable on to demand a level on show zoom provided bottom to the are changing adjusted the When points is on time axis series. slider time time range the the time horizontal slider, through the navigate The range through and the navigation in interval. the zoom time is to upper specific area user the a the this allows in of in shown interaction selection as main the selection The and range the 2.1. series Figure and of plot (1) series corner time left the introduced we 2.5.2 Section In Environment Selection Model Guided Interactive the changing by lags seasonal lags. the non-seasonal on the focus at to opacity user the the reducing supports and interface color user The series. time this continuous four the and top the at toolbox The Plot. ACF and Toolbox Selection Model 2.6: Figure ie o h oe aaees ofcso h hnei h eutn lt,w s animated use we plots, resulting the in change the on focus To parameters. model the for lines yuigapabedn ftebr,pit,adlns h oosfrteposaeslce by selected are plots the for another colors The to display lines. of one and parameters from points, the faded bars, value, seamlessly the next are of the plots blending of alpha the direction using and the by calculated in are value model one new from the dragged is line vertical a Rob07 = ,and 0, .Ti rcs ssoni iue27adtesplmnayvdo netesie or slider the Once video. supplementary the and 2.7 Figure in shown is process This ]. % HR07 scretymvdfrom moved currently is n ieetclr htaecnitn ntecodntdadmlil views multiple and coordinated the in consistent are that colors different and ] HB03 Shn83 .W sdti olt e ulttv oo cee hc ses to easy is which scheme, color qualitative a get to tool this used We ]. sn ldr o h ee fdffrneadcniuu vertical continuous and difference of level the for sliders using ] % = to 1 % = ..V o oe eeto nTm eisAnalysis Series Time in Selection Model for VA 2.5. ,wihi h nlmdlcngrto for configuration model final the is which 2, ycrnz Displays Synchronize utnshown button ? = 2, @ = 1, 43 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 44 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual hnslciganwmdli w lce eiulvsaiain,teAFadtenormal the and ACF the visualizations, residual elected two in model new a selecting when o h e oe mrvs ahnmee o ersnsoetransition. one or represents whether row evaluate numbered to Each user improves. change the model enables the new This the show not We residuals. standardized the Plots. of Residual plot quantile-quantile Elected in Selection Model of Transitions 2.7: Figure 1 5 4 3 2 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ol eectn oeprmn ihmr dacdvsaiaintcnqe,ueadinclude and use techniques, visualization advanced more with experiment to interactions exciting the be especially would and layout, combination, the from profit can experts domain the well, fading interactive the in included are 2.5, Figure in (4a-d) area plots, analysis residual four All h e dafrtelyu fteTMV sritraei ompteBxJnismethodology Box-Jenkins the to is interface user TiMoVA the of layout the for idea key The of (5) area in shown are models selected currently and previously all for criteria information The are plot ACF/PACF the in lines vertical continuous the and parameters the adjust to toolbox The hmi ioAi re ofrhripoetets fmdlslcini iesre analysis. series time in selection model of it task work the future improve For further to contribution. order our in is TiMoVA and in goal them our was this work our of stage this In really TiMoVA. know of they the that from visualizations benefit familiar using and By confusion avoid have. to already experts is methodology, domain techniques Box-Jenkins the visualization experience the standard target established in our use steps by to separate known intention the and main for well-established The is experts. thinking domain and the working users, of way This workflow. its and Rationales Design model of the Discussion in point to each possible at is criteria it are Additionally, information which the one. criteria to previous the according the task. of models than selection values best worse the or the see better are can is which we it see new transition if a each color if for the readjusted So by is selection table. supported criteria model the information the to minimal during added the filled is find is model immediately stack to history coloring This The 2.5.2. process. Section in described as 2.5 Figure in shown is model process the This of changes worsens. the or see improves to fitness 2.7. user Figure model the the are enables if plots evaluate This residual and continuously. the configuration other configuration, the model to the one modifies from user fading the When before. new presented the process to current the from fading are plots residual plot. ACF/PACF the the When and for plot data model. configuration. ACF/PACF underlying the the the of case change order this also seasonal we In the differencing color decide easily another of more level to can lags the user non-seasonal adjusting the the Thus setting by level. plot opacity ACF/PACF for and the lines in vertical lags the seasonal dragging sliding the by When focusing model the plot. ACF/PACF of the order order in seasonal the lines the changing vertical when continuous process the fading of the one show we 2.7, Figure attention In consider users first the to keep models. and ensures of parsimony class also of relevant That principle the vertical the to appear. the to order and according seasonal a enabled models if the are non-seasonal model, for inputs simpler seasonal the plot ACF/PACF the a box the check fit of accidentally the in component ticking not lines average By does moving user vertical needed. the the is the that model and and ensures non-seasonal autoregressive disabled This are seasonal visible. input not the disabled. are difference is of model seasonal parameters order the seasonal the and the for include span lines to seasonal box the check case, the this default In By 2.6. Figure in shown colors. separate two always uses process % and & nteAFPC ltsoni iue26 h rttp supports prototype the 2.6, Figure in shown plot ACF/PACF the in ..V o oe eeto nTm eisAnalysis Series Time in Selection Model for VA 2.5. 45 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 46 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual nte eurmn a oueaporaeitrcin nTMV,s hti ses n intuitive and easy is it that so TiMoVA, in interactions appropriate use to was requirement Another seset h sreprec a vlae yifra srfebc [ feedback user informal by evaluated was experience user the assessment, isn ausi h aae.Frteeauto ftepooye eageaetetm eisto series time the aggregate we prototype, the of evaluation sums. the monthly For get dataset. dataset the the the in in on values columns focused missing relevant we The but only. America, Angeles of Los States are in United deaths the disease in cardiovascular cities of different number of data the contains dataset Dataset. Example for solution our of applicability solution and our usability improve the users. can assess target we we its how Accordingly and insights 2.6.2. learned, the Section we discuss in We what further evaluation, 2.6.1. of the Section levels explain in both We scenarios from usage dataset. gained example the an apply on and below prototype dataset the example applying and scenarios usage defining by experience user the and statistics, by and judged visualization was information [ process in refinement iterative experts This members, evaluations team team. formative the design by the results of the evaluated meetings repetitive we phase during implementation and design the During Evaluation 2.6 remain. models less or three 97.6% about in because and models sufficient, adequate be for should shortlist the This in residuals visually. remain standardized them their [ compare load Nazem and directly to models according to three planned to plots we up design diagnostic select our to the to In possible user in is the criteria. enable it information overview to their work this to future With for according material. models two of supplementary than (5) the area more in in compare detail and shown more previously as in all overcome criteria and To with information 2.5 table direction. Figure corresponding history this the a into and provide go models we to considered by models worth currently two and enough only worse is good or it comparing is better of if is limitation this decide of the can Usually think expert you domain models. sub-model the specific possible actually preview more next you this the the that if of see is immediately one plots you to diagnostic because model animated current with the transitions compare the showing of limitation are plot. ACF/PACF One transitions the These inside user TiMoVA transitions. the in the by in transitions directly occur steered animated may and apply occlusions triggered we because solution findings, blending a alpha their as use blending of and alpha Because using suggest also occlusion. They Heer possible changes. to for of According perception selection. the improve model and of users task their on [ concentrate Robertson can and they and users target the for etsfo adoaclrdsaefo h MASsuy[ study NMMAPS the from disease cardiovascular from deaths w xenldmi xet.Bsdsti rtlvlo vlain eeautdteprototype the evaluated we evaluation, of level first this Besides experts. domain external two LBI date + 12 a icse ypromn eosrtos nadto oti eeiieinternal repetitive this to addition In demonstrations. performing by discussed was ] and cvd HR07 hc stediynme fdah rmcrivsua ies.Teeaeno are There disease. cardiovascular from deaths of number daily the is which , nScin23w nrdcdteeapedtsto h al ubrof number daily the of dataset example the introduced we 2.3 Section In Naz88 nmtdtastosi ttsia aagahc r a oegg the engage to way a are graphics data statistical in transitions animated ] .37 naot8%o h iesre nyoeo w models two or one only series time the of 87% about in 307] p. , PW04 , SDZ LBI + 00 + .Teoriginal The ]. 12 consulting ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. eeaut h C/AFpo,()i iue25 codn otebhvo ftenon-seasonal the of behavior the to according 2.5, Figure in (3) plot, ACF/PACF the evaluate We hnadn h Acomponent MA the adding when with Tasto nFgr .)w e h igotcposfrteasmdmxdAM model ARMA mixed assumed the for plots diagnostic the get we 2.7) Figure in 3 (Transition iesre lt oesrih iebhvo ntenra unieqatl lt n lower and plot, quantile-quantile normal the in behavior line straight more a plot, series time hs rniin,w rvd l fte o h igotcposi upeetr mg n in and image supplementary a in of plots details diagnostic more the see video. To for supplementary them part. the of non-seasonal all the provide for we model transitions, ARMA these mixed worse a minimal get assume we 2.5 2.5 Figure Figure of (5) are area they in that criteria assumption information the The strengthens distributed. which normal residuals, standard standardized the of plots diagnostic configuration model the is in result see The can improve. we criteria transitions information both the For how this. 2.5 Figure on of details (5) further for area video to p-values changed and with image is further lags order supplementary to more the the addition shows if In statistics zero, 2.7). Ljung-Box from Figure different the in significantly 1 plots, (Transition residual plot those ACF the of of improvement lags non-seasonal the in lags ARMA mixed a have to decide parameter of we the (5) 2.7 model area sliding Figure the in When in of table component scenario criteria model. usage non-seasonal information this the the For of in explained transitions 2.5. code the Figure color show the We with other 2.1. each Table beneath in model the of the order as well as hypothesis. plot, ACF/PACF stationary. this the supports be therefore, in to and, change seems marginal the is series plots that time analysis confirms the residual differencing slider because no difference that needed, assume the be we Moving may 2.5), 2.3.2) (Figure TiMoVA Section in plot (see (PACF) series function operation time autocorrelation the partial to and time According (ACF) the function plot. consider autocorrelation first the we and 2.3.1, plot Section series in presented methodology Box-Jenkins the Following Selection Model following. the in discuss we for TiMoVA that from transitions area five diagnostic visual the model all of how the video each shows and supplementary material interactions a supplementary the provide Another clarify we To narration. transitions, 2.7. the the Figure with during in description behave plots the textual residual representations show the the to support of difficult we two is pictures, in it static transitions Because usage in the feedback expert. how demonstrate visual domain describe We We fictional and tasks. a interactivity model. these of a solve perspective selecting to the before dataset from series example second scenarios time the The on the applied in selection. range is model a prototype of usage selecting the task two high-level of defined the task and is the stories one is user first one the The used evaluation. we the 2.4 for Section scenarios in analysis requirement the on Based Scenarios Usage 2.6.1 o-esnlA oe oorder to model AR non-seasonal ? = 1 http://www.cvast.tuwien.ac.at/TiMoVA ,wihi nA()mdl yadn Acmoeto order of component MA a adding By model. AR(2) an is which 2, ? = and 2 @ = .Ti ofiuainavne h oe oso oernons nthe in randomness more show to model the advances configuration This 1. @ ? = = ,btacrigt h eairo h C/AFpo in plot ACF/PACF the of behavior the to according but 1, ? eut namr admapaac fteresidual the of appearance random more a in results 1 h igotcpossosta h duteto the of adjustment the that shows plots diagnostic the , Jl 0 03 rvstdo c.1,2020] 12, Oct. on [revisited 2013) 30, (July ? = Tasto nFgr .) See 2.7). Figure in 2 (Transition 2 @ = otemodel the to 1 1 ihaudio with ..Evaluation 2.6. 47 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 48 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual hncnutn al . o h eairo h esnlodro h oe,ta h seasonal the that model, the of order seasonal the of behavior the for 2.1 Table consulting when edsusterslso h srfebc ntefloigsection. following the in feedback user the of results the discuss we configuration: following the with nFgr .)adfrom and 2.7) Figure in 4 1 fFgr ..Apsil sg cnroi oslc rn htsat n nsa defined a at ends and starts that trend a select to is scenario usage possible A 2.1. Figure of (1) h C lto h eiul niae eann esnlatcreain nti aewe case this In autocorrelation. seasonal remaining a indicates residuals the of plot ACF The corner left upper the in show we as series, time the of range a select to user the enables TiMoVA the adjust to is step next the Therefore, yet. model the by covered not is behavior seasonal The ioA hsge ln ihtedansisbsdo h iulzto ftestandardized the of visualization the on based diagnostics the with along goes This TiMoVA. .. vlaigUe Experience User Evaluating 2.6.2 omtv vlainadieaiedsg uigtedsg n mlmnainpae where phase, implementation and design the during design iterative and [ evaluation experience formative user the of evaluation the For scenarios, usage two these in TiMoVA of application the from model. gained adequate insights an found the have to we addition that In supports model this for criteria information the Consulting model a get we Finally, model. this improves fader, difference seasonal the Sliding residuals. the of difference seasonal a consider with the is model in which intermediate model, described an different as get slightly model We a the get parameters. select fewer we We with series time and series. complete simpler time 1994 the November the to with in Compared starting point section. range data previous the last only the consider with to ending want selection. and we model scenario for usage trend following this the only In consider to and series time the in point Selection Range using BIC and AICc AIC, the for values adequate minimum residuals. an found the we with that model indicate 2.5 the Figure selected of We (5) area in model. criteria information the addition, In form: following the of model from transition the for improvement order have to likely is component parameter the sliding because model, autoregressive an consider We model. the of parts seasonal nldn h siae aaeeso h oe,w e h olwn iesre model: series time following the get we model, the of parameters estimated the Including fteslce oe.I ra()o iue25w lorcgietearp mrvmn in improvement abrupt with the component recognize also seasonal we the 2.5 including Figure when of criteria (5) information area In the model. selected the of % ihihstesaoa asaduvistecto nsaoa a ntePC.Ti indicates, This PACF. the in 2 lag seasonal on off cut the unveils and lags seasonal the highlights ( 1 ARIMA ARIMA − 0 . 3068 % ( ( ? ? =  = = 12 to 1 2 2 −  % 3 , 3 , 0 % = = . % = = 5444 = .Siigfo esnlorder seasonal from Sliding 2. sapsil ouint eoeti uoorlto in autocorrelation this remove to solution possible a as 1 = 0 0 Tasto nFgr .) hw h improvement the shows 2.7), Figure in 5 (Transition 2 @ , @ , LBI to 1  = = 24 + 0 1 12 % )( ( × ) ( × ) 1 = erl nteisgt andfo h internal the from gained insights the on rely we ] + .Wt hscngrto,w e seasonal a get we configuration, this With 2. % % 0 . 3143 = = 1 2  ,  ,  ( − = = 1 0 1 0 − . & , & , 3112 0 . 9072 = = %  ? 0 0 = ) ) 2 % = B B  ) = = to 0 G ) n seasonal and 2 12 12 = C F = C n h gradual the and 1 % = (Transition 1 % = 1. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. nte etr eadd st hwhwtemdlprom.Ti suulydn ytkn the taking by done usually is This performs. model the how show to is demanded, feature Another separate all displaying overview the is experts domain the to according feature useful very A demonstration the on and stages, intermediate the on discussions and meetings repetitive had we hsfaueo etn noeve ntepeiul eetdmdl n opr oethan more compare and models selected previously the on overview an getting of feature This see to like would experts domain The work. our improve further to suggestions also are There which session, demonstration the after reflected and paper on noted was feedback This TiMoVA. etcllns h iulspoto ouigo h esnllg o h eeto fteseasonal the of selection the for lags seasonal the continuous on the focusing using of plot ACF/PACF support this visual inside The order lines. model vertical the select directly to helpful very tp fteBxJnismtoooy tpoie l nomto eesr odcd nan on decide to necessary information all provides It methodology. Box-Jenkins the of steps sa vra ntedansi lt scniee sueu n a ofrhripoethe improve further to way a and useful models as certain considered loading is of plots concept diagnostic The the beneficial. in very overlay is an level as abstract more a on models two sn hs oeat soelyi h iesre lti osdrdfrfrhrwork. further for considered of is extension plot the series Although time model. the which this analysis, in of overlay series application as time forecasts the in these not part using and according first model, the the adequate single with on repetitive an series focused or finding time we range is given moment time the the certain along At a shown forecast boundaries. then to confidence is model forecast the This use forecasts. and ahead series step time the of part first implementation. 2.5. Figure of (5) certain area in reload shown and is overview result an the and get design to the user in concept the history this enable a included and We include to models models. was selected phase previously implementation the and of design iterative ensure the to during way suggestion a user One in graphical implemented the and using request statistics of test kind statistics. of this test for set exchangeable prepared a is from analysis solution residual selection Our the the interface. work. for customize statistics future to and be like tests to of would different going result and perform is a interface to as prefer user memory sometimes graphical in experts the available Domain in are them numbers including The and computations demand. the on residuals the of statistics more series. time larger subregion, originally interesting for or even representative model times a the reaction for selecting faster a for subregion have select nice this to to very only also only possibility consider but not the and is series is This time benefit larger task. further model a selection new A from the interest not. to of or model region improves current certain to model the model the compare one when directly whether from to results decide transition experts model and the domain the of the visualization of enables preview the This especially and another. and visualization order immediate model the according the feature changing is beneficial experts Another domain beneficial. the very to considered was model the in component is it plot, ACF/PACF the of behavior the on relies selection model the Because model. adequate this well. In as 2.7. included Section are in scenarios work usage future the discuss for We from them findings consider implementation. our and and reflection section design this the in in suggestions feedback remaining the the included We of hour. features one the about to lasted feedback The immediate gave statistics. experts domain in The PhD analysis. a series one time and for for mathematics dataset institution in an master in [ a scientists evaluation hold informal as both employed They are who research. experts statistics domain external two with session LBI + 12 a efre ydmntaigtepooyewt well-known a with prototype the demonstrating by performed was ] ..Evaluation 2.6. 49 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 50 2. iulAayisfrMdlSlcini ieSre Analysis Series Time in Selection Model for Analytics Visual eas hwdta h neatosaeaporaefrtets n httedmi xet profit experts domain the that and task the for appropriate are interactions the that showed also We Box-Jenkins the in models ARIMA seasonal and ARIMA of class the for that identified We in knowledge basic a least at with users, target the for that is evaluation the from learned we What eletbihdwyo okn n hnig aiirvsaiain,adaporaeinteractions. appropriate and visualizations, familiar thinking, and working of way well-established h olo u oki oueV ospotdmi xet ntepoeso oe selection. model of process the in experts domain support to VA use to is work our of goal The iulztos eahee hsi ioA sdsusdarayi eto ..,b focusing by 2.5.2, Section in already discussed as TiMoVA, in this achieved We visualizations. neetn hleg o uuewr st oee h oe eeto upr o multivariate for support selection Another model them. the estimate foresee to time to methods is in the work selection for future model support for for challenge visual VA provide interesting future apply and For to values missing inspiration spaced. with equally as series not limitations practical are these In and/or consider values. values we missing missing research without contains series often time data spaced equally the with applications, data to limited is the solution include Our to step. interesting diagnostic is be the model would forecasting series it for work, time model future the the For of of diagnostic performance plots. The diagnostic the the the needs. extend to enable their limited for to and analysis currently analysis is residual residual improvement the the Further customize for to methods history them. statistical user this of different using two support enable directly compare to the to is and work prototype in model future measures previous for any criteria idea load Another information history. to include developed a to and in was results interface improvement our user One of graphical evaluation the work. from future gained for insights ideas the discussed we 2.6.2 Section In their from visualizations and methodology selection model domain. well-established a of usage the from of selection by and achieved exploration are visual benefits intuitive These and TiMoVA models. easy the series do and time to description expert process VA domain resulting the the enables that prototype discovered we work, the our evaluating of By visualization the and cycles, feedback support short can transitions. we interfaces, intuitivemodel that visual an showed interactive in problem with process domain the task this of to this workflow methods the VA Applying supports way. that user-friendly tool and or technique no is there methodology Work Future and Conclusion 2.7 a with them support and have already experts domain the the interpret experience and and understand knowledge to order the in on analysis series understandable. time and study learn to to necessary easy is is it TiMoVA others models), For (ARIMA analysis series time and statistics nbigtedmi xett eettemdlodritrcieyvatevsa interface, visual the via interactively order • model the select to expert domain the enabling • • nieteAFPC lt hc rvdsafis dao h oe order, model the of idea first a provides which plot, ACF/PACF the inside epn oaneprswt h iulzto ftemdltastost eiewehror whether decide to transitions model improves. the model of the visualization not the with experts selecting domain while helping results model and the order, of model feedback the visual immediate expert domain the giving Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. h nlmdlw on noruaeseaisi ae sarte ope oe your by model complex rather a as rated is scenarios usage our in found we model final The eycmesm n iecnuig okn ihTMV eue h ubro models of number the reduced TiMoVA with Working consuming. time and cumbersome very oanepr.Fnigti oe sn xsigsaitclsfwr ol ol aebeen have would tools software statistical existing using model this Finding expert. domain noig.Teeoe hycnfcso h ako oe eeto,wihi uddb TiMoVA by guided is visual which these selection, with model work of to work. task used their the improves are on and experts focus domain can the they that Therefore, visualizations benefit well-established encodings. the the have that prototype discovered the we evaluation, in the used from insights the on Based models process. of selection subclasses model certain the already in excluded early feedback visual immediate the because considered, this for techniques visualization appropriate consider data. to of necessary kind is it Therefore, series. time ..Cnlso n uueWork Future and Conclusion 2.7. 51 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hnapyn h oe n nldn h rdcini nitrcievsa nefc ti still is it interface visual interactive an in prediction the including and model the applying When rceig fte6hItrainlErVsWrso nVsa nltc,EuroVA@EuroVis Analytics, Visual on Workshop EuroVis International 6th the of Proceedings [ in published was chapter this of content The a find to is analysis series time in goal main The domains. different many in appear series Time h rgnlvrini vial at available is version original The 05 alai adna tl,My2-6 2015 25-26, May Italy, Sardinia, Cagliari, 2015, 2015. In selection. model series time in Lammarsch, predictions Integrating Tim Gschwandtner, Rind. Alexander Theresia and Filzmoser, Miksch, Peter Silvia Aigner, full Wolfgang Original Bögl, dissertation. this Markus of style the formatting to overall modification the or to revision reference: fit without Eurographics bibliographic to used the adapted are only in article was defined the it rights of content, retained parts the All with form. accordance license exclusive in authors, the of permission values. actual of the capabilities to prediction relation the in utilize models multiple models, of series prediction interactive time the visual adjust compare a and and approach our models, select In to and model. used process parsimonious is selection and adequate model interface an the for visual in decision combine capabilities the to prediction in propose We the assist integrate methods. to current methods in fits analytical supported model well and which not Judging is models. adequately benchmark series or time values the actual process. from deviations selection compare model to the difficult in and, capabilities variations prediction small the only consider usually, show based, to may iteratively necessary sources done is These is it models plots. therefore, series residual time and of criteria selection information The on series. time given for model nertn rdcin nTm Series Time in Predictions Integrating https://doi.org/10.2312/eurova.20151107 BAF ae 37.TeErgahc Association, Eurographics The 73–77. pages , + 15 ]. © oe Selection Model 05Ergahc.Rpitd with Reprinted, Eurographics. 2015 CHAPTER 53 3 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 54 3. nertn rdcin nTm eisMdlSelection Model Series Time in Predictions Integrating hshlst dutadr-eettetm eismodels. series time the re-select and adjust to helps This ospotdmi xet ntets fmdlslcin epooeaV prahta utilizes that approach VA a propose we selection, model of task the in experts domain support To oe addts edmntaetebnfi fti prahi sg cnrouigadataset a using scenario usage a in approach multiple this of of prediction benefit the the compare demonstrate to We and process candidates. selection model model the in capabilities prediction the . iulAayisApproach Analytics Visual 3.2 siao.Frmr eal bu h RM oesc.[J0,SS11]. [BJR08, likelihood cf. maximum models a ARIMA instance, the for about by, details estimated more are For that parameters estimator. of number the specify and by (MA component ARIMA seasonal ARIMA A non-seasonal series. a time combines given a model for model [ ARIMA methodology adequate Box-Jenkins complex.an the too on getting based from is models approach prevent Our to selection, [ parsimony model of the principle during and The considered capabilities be parameters. prediction fewer to the with judge model to parsimonious experts a visual domain select during the enables environment with exploration refinement interactive capabilities model the iterative prediction in the the predictions of Including coupling interface. close selection a model propose we approach VA our In interactive visual provides therefore approach Our models. to the means of capabilities prediction the 3.3). (Section [JC14] bay Francisco San the in quality water the about ideas preliminary our of version extended and refined a addition is [ In work in model. presented This resulting the 3.2). in (Section confidence process the raises [ work and previous series our the time to given Integrating a adequacy for the on model considerably. perspective the another process of offers environment selection exploration perform the model to into capabilities possibility the prediction the improve including would that apparent prediction became actual it prototype the [SS11]. of domains evaluation medical During and [ financial prices different electricity in predicting and instance [HXG02], for analysis failure tasks, series system time prediction adequate for apply of selecting used class to of widely This and task are models. the models series (ARIMA) in average time experts moving given domain integrated a autoregressive support for seasonal to model approach (VA) a Analytics find Visual [ to values a future is predict goal to main model the this analysis, series time In Introduction 3.1 utpiaiey ohhv natrgesv opnn (AR component autoregressive an have Both multiplicatively. B opr h rdcino utpetm eismodels. series time multiple of prediction and the values, compare actual and • predicted of differences explore predictions, • of types different explore • h parameters The . BAF ( @ + ) 14 MA , .Bsdo edakaddsusosw ou u otiuint integrate to contribution our focus we discussions and feedback on Based ]. BAF ? , ( % & + and ) 13 ,adffrnetasomto ( transformation difference a ), ,w nert h rdcinfntoaiyi h oe selection model the in functionality prediction the integrate we ], @ , & BK11 ecietemdlodro h RadM components MA and AR the of order model the describe , ( BJR08 ,3 @ 3, ?, .I rvoswr [ work previous In ]. ) ihasaoa ARIMA seasonal a with 3 BJR08 ,  .Tesaoa eghi specified is length seasonal The ). ( ? ,wihdsrbshwt find to how describes which ], ) AR , BAF ( % ( ) ,3 @ 3, ?, ,amvn average moving a ), + 13 ( , & , %, ,w introduced we ], ( × ) BJR08 CENC03 , & , %, ) B needs ] model ) ], B Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ewn ocmaepeitoswt culvle,w ii hspoes h iesre is series time The process: this mimic we values, actual with predictions compare to want we 2)teatcreainadprilatcreain(C/AF lt temdlslcinis selection model (the plots (ACF/PACF) autocorrelation partial and autocorrelation the (2c) (2d) orders, model the selecting for plots autocorrelation partial and autocorrelation the are (2c) ospotteAIAmdlslcinwt h rdcincpblte ftemdl ecombine we model, the of capabilities prediction the with selection model ARIMA the support To vations and prediction one-step-ahead the between difference the visualizing for (1a-c) in shown as view, C nti ae,w ou nterlvn lmnsfrtepeito n o h rdcinis prediction the how and prediction the for elements relevant the on focus same we the paper, for this history. criteria selection In information model the the investigating (2e) for and model and the model, (2d) perform as adequate to purpose an plots for residual decide the (2d) and lines), diagnostics vertical toolbox, the prediction moving and interactively selection by model steered the (2b) values, predicted the applied, 3.1; if Figure and, (cf. series interface time user graphical The [ see environment. details for selection model interactive our in both the of estimation the [SS11]. about cf. details more terms, For error the information. and previous parameters all using point time one only G values observed obser- available the on based is prediction for model ARIMA an of application the general, models. In possible three m2, these (m1, of model interpretation different the a for discuss differences we these 3.3 shows Section 1c) In 1b, respectively. (1a, Graph m3) line, Qualizon Each the to values. (2a) actual of the the view including the history change selection can model users the approach is our (2e) In analysis, criteria. residual information the for prediction, plots and diagnostic selection the model are for display toolbox series the time is the (2b) is values, (2a) future the where of in prototype, prediction used our the shows data showing (2a-e) example the 3.3). displaying (Section environment, scenario usage selection model Interactive 3.1: Figure rdc eest h rdcin fteevle,uigtecrepnigtm eismdl If model. point series time time at corresponding split the using values, these of predictions the to refers predict n h rdce values predicted the and = = 1 / (1b) (1c) (1a) + G , . . . , n tpb-tpicesdb n until one by increased step-by-step and 2 1 C , . . . , G : 1 G , = n h rdce value predicted the and + 2 < G , . . . , BAF r hndntdby denoted then are G : C + + : 1 = 13 ih1 with , G , . . . , ttetm points time the at )cnit ffiemi ra:(a h iesre ipa hwn h input the showing display series time the (2a) areas: main five of consists ]) G ˆ : = + ain steoese-ha rdcin where prediction, one-step-ahead the is variant A . = < : < 1 , . . . , G ˆ : G G ˆ ˆ + = = h oe aaeesaeetmtdbsdon based estimated are parameters model the , 1 + r optd hs auscnb oprdwt the with compared be can values These computed. are 1 sdrvd nta a,peito sscesvl oeat done successively is prediction way, that In derived. is , . . . , C 1 C , : 2 C , . . . , = G ˆ = = + nec tp h oe sfi otedt points data the to fit is model the step, each In . < where , = h rdcin ftenext the of predictions The . < (2b) (2c) (2a) sa integer an is (2d) ..Vsa nltc Approach Analytics Visual 3.2. ≥ .Tu,teterm the Thus, 1. : (2e) < sst .. to e.g., set, is iepoints time G 1 G , . . . , : , 55 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 56 3. nertn rdcin nTm eisMdlSelection Model Series Time in Predictions Integrating esoe ieetwy fhglgtn h ieec ewe culadpeitdvle in values predicted and actual between difference the highlighting of ways different showed We epooet ipa h ieec sn ulznGah[ Graph Qualizon using difference the display to propose we from values parameter of progress the highlight to scale color diverging a use authors the where vertical save to enables band, color accuracy an such Using deviation. standard the to respect with 1-)frtredffrn oes ulznGah r xesoso oio rps[ Graphs Horizon of extensions are Graphs Qualizon models. different three for (1a-c) h srcnslc h oe addtsuigtemdlslcinhsoyi iue312.In 3.1.2e. Figure in history selection model the using candidates model the select can user The the enables which space-efficiency, vertical the is representation visual this of benefit The this. support visually to ways several are there prediction, the of accuracy the on emphasize To etclsaeadse h ieo h iepos saohrvratfraayigtedifference the analyzing for variant another As plots. line the of line the skew and space vertical pc n s hst tc iulrpeettosfrmlil oes sal oaneprsare experts domain Usually models. multiple for representations visual stack to this use and space n-tpaedpeito n h culvleo n iesre oe m,m,m)respectively. m3) m2, (m1, the model between series difference time the one showing of model Graph value for Qualizon actual prototype the a the and represents prediction in 1c) one-step-ahead integrated 1b, how be show (1a, can we line models 3.1.1a-c Each multiple Figure selection. of In predictions model. the time-series of one stacking than the more of predictions of comparison a changes. using radical By more values. unveil actual can the this and scale, predicted color the diverging between differences the for abstractions qualitative [ coloring pseudo two-tone and the limit would above approach proposed the shown, be to and/or need below predictions band model color multiple If small a just clutter. or visual avoid to value, to predicted height line and full the actual the above of use deviation They [ the in encode treatments. Like to applied color. of with difference value the intended encode the toward [ value CareCruiser initial in the like use one, to continuous suggest a we of Therefore, small, instead prediction. is scale the distance color diverging of the categorical error if a standard example for the differ, considering values large, actual or and medium, predicted much how in interested values actual the and predicted the between difference the [ encode that by to predictions work scale model the color of describe diverging number the we a the of 3.4, use context in Section in limited In mainly is compared. means, it be actually Second, can difference prediction. the the what of on boundaries details judge error to possible not is it [ prediction one-step-ahead the boundary, error prediction lower the and upper the differently shows a 3.1.2a. line with Figure dashed connected both cf. a points For addition as In model. represented are selected line. values colored which currently predicted the prediction, the using one-step-ahead prediction, values the of the predicted or types show the values Therefore, to of future computation process. used of selection the is prediction model triggers display the the series either throughout selects time time any user The done the be series. can prediction time This one-step-ahead given values. and predicted the values can within future user of values The prediction predict the controls. that show prediction to provide couple candidate the we To model and process, consulted. a series selection apply are time model model the the the with to of prediction adequateness applied the the is increases judge model iteratively to The user representations the visual transformations. data, resulting applies input and the of order exploration model the the After approach. our in integrated BAF SMY + 14 + 05 .Teeaetolmttosw att drs ee First, here. address to want we limitations two are There ]. ihqaiaieasrcin.I u ae euethese use we case, our In abstractions. qualitative with ] JSMK14 ,ete h akrudo ltcnb used be can plot a of background the either ], HJS FHR + 09 + , 14 HJM ,a eso nFgr 3.1 Figure in show we as ], + 11 ,weeteauthors the where ], GAK Rei08 + 11 ], ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. eueadvrigclrsaewt i oos ssoni iue311-.I h negative the In 3.1.1a-c. Figure in shown as colors, six with scale color diverging a use We safis tpteaaytlastedtstadepoe h a aai h iesre display series time the in AR data an raw suggests the which explores and dataset the loads analyst the step first a As Fgr ..a n h eaiu fteAFPC ntecrepnigpo Fgr 3.1.2c), (Figure plot corresponding the in ACF/PACF the of behaviour the and 3.1.2a) (Figure h nls ple gi h oe opeittosaoa ylsi h uue n recognises and future, the in cycles seasonal two predict to model the again applies analyst The is prediction the how and compared are models different of prediction the how illustrate To relation in values actual the to prediction the of deviation the unveils abstraction qualitative The h eiulpos(iue312)idct eann esnlsrcue u h nls first analyst the but structure, seasonal remaining a indicate 3.1.2d) (Figure plots residual The iltclrclasses color violet oan h aae saottewtrqaiyi h a rnic a ra[ area bay Francisco San the environmental in the quality from water scenario the usage about a is use dataset we The process, selection domain. model the in integrated s h esrmn rmoesaini etsaoe5mtr.W grgt h aato data the aggregate We meters. 5 above depths in station one from measurement the use ieto,maigta h rdce au sblwteosre value observed the below is value predicted the that meaning direction, htteerrbudr o lgtynroe o hsmdl utemr,i h information the in Furthermore, model. this for narrower slightly got boundary error the that MA an adding by model the of improvement an increases be analyst may the there AR structure, that seasonal seasonal the remaining already of shows the order which of model 3.1.2a), Because the (Figure behaviour. future and the in pattern cycles good seasonal a two predict to model the applies measurement. this on to based needs quality council water city expected the the to in predict 1986 department to years environmental series the an time from from the water analyst model the an of scenario our salinity calculated In and a temperature 2004. use water We the interpolation. on linear based with measurement month missing interpolate and averages monthly Scenario Usage scenario. usage a 3.3 in 3.1.1a-c Figure in line each of details the medium discuss use we we section, boundary, following second the the inside color but light boundary, The first color the value. actual outside the meaning larger, to distance is the and (+/-) direction  the on example based our difference in the boundaries, for chosen the the between on distance Based the value. use G we actual abstraction the qualitative and the prediction For one-step-ahead (+/-). direction each for difference as labels value observed difference The prediction. the of error standard the to to able are models the well how on impression an gives data. graph the this predict plots, residual the to addition G eiul Fgr ..d n hw h eann esnldpnec lal.Ajsigthe Adjusting clearly. dependency to seasonal order remaining model the shows and 3.1.2d) (Figure residuals @ ˆ 8 8  = ± n h bevdvalues observed the and sue o ls rdcin,weetedffrnei ihntefis onay ftedifference the If boundary. first the within is difference the where predictions, close for used is m) h eiul hwamdlta t h iesre adequately. series time the fits that model a show residuals The (m3). 1 0  . 84  ∗ n akcolor dark and , + tnaderrand error standard and G ˆ % - 8 o u xml euetobudr ees hsrslsi he lse of classes three in results This levels. boundary two use we example our For . G > =  o h esnlAR seasonal the for 1  8 euetregenclrclasses color green three use we , (  ? ntepstv ieto,maigta h rdce au saoethe above is value predicted the that meaning direction, positive the in ,  )  opnn ihorder with component G G o ieecsweetedsac sotietescn onay In boundary. second the outside is distance the where differences for 8 8 ± for 1 . 96 : ( ∗ % ≤ tnaderr h itnei sdt sinteclrclass color the assign to used is distance the error, standard ) 8 to ≤ ( % % = = ) scluae n hw nteQaio Graph. Qualizon the in shown and calculated is opnn mrvstersdasfrhr(m1). further residuals the improves component m) h nls eonzsi h residuals, the in recognizes analyst The (m2). 2 ? = 3 .Adn hscmoetipoe the improves component this Adding 1.  8 =   G ˆ h ietosaeidctdi the in indicated are directions The . 8 − G 8 ( @ ewe h rdce values predicted the between ) opnn fmdlorder model of component G ˆ 8 G < 8 euethree use we , JC14 ..UaeScenario Usage 3.3. .We ]. 57 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 58 3. nertn rdcin nTm eisMdlSelection Model Series Time in Predictions Integrating lhuhteerrbudr o h rdcino uuevle fmdl(3 arwdcompared narrowed (m3) model of values future of prediction the for boundary error the Although Fgr ..ac.I hsve h nls eonzsta hr ssilasaoa reoccurring seasonal a still is there that recognizes analyst the view this In 3.1.1a-c). (Figure h 1GI[ x12GUI The [ TimeSearcher h rdcincpblt ftm eismdl o nlzn n eetn RM models, ARIMA selecting and analyzing for models series time of [ analysis capability series time prediction of goals the main the of one is values future Predicting Conclusion and Discussion process. 3.5 selection model the in integrated not to is possibility this the to but also not provides values, but x12GUI future settings, models visualize previous single and loading For predict for allows them. which compare used. history, directly is a and input is browse form-based there call, models X-12-ARIMA the computed the of the for [ results For parameters outliers the the of and adjusting editing and series manual model time the series the as of exploration well the as on adjustment is seasonal focus The adjustment. clutter. seasonal visual for avoid to to switch series also time can the but above view, and stripe below compact full stripes more the in the use scores for They anomaly chart the line series. encode the time underlying of background the [ the along In in scores models. anomaly height ARIMA encode of to selection metaphor for the similar algorithm support a preserving the not peak to do with according and smoothing differences prediction average moving normalized the a the apply using They values actual deviation. the standard to compared prediction the of [BPS [ ARIMA al. like et methods, Hao driven model excluded to be compared to datasets events exceptional large needs requires that and approach data-driven a uses TimeSearcher prediction. Work Related 3.4 for choice better a therefore and and others, under- the This to year. analyst. compared each the (m1) of model the quarter in fourth first to minor and the is according third underestimate overestimation the models and overestimate three but purpose all quarter, analyst’s that second the recognizes and analyst for the enough Furthermore, good parameters. two is it [ parsimony (m1), of model principle of one the the to in (m2) well to so (m1) from perform areas not colored dark do in increase criteria, (m3). the finally information by and visible the is superior and This where which plots prediction. (m3), one-step-ahead residual and (m2) the models to The according others. the as prediction one-step-ahead viewthe Graph Qualizon the to 3.1.2a) (Figure selects view analyst series The time together. the close switches quite and candidates models model three three these are there 3.1.2e) (Figure criteria yai ure tfid atrsaddsly utpefrcss rvddb similarity-based by provided forecasts, multiple displays and patterns finds it queries dynamic atr ntepeito,btitrsigytemdl(1 n(a a mle ieecsin differences smaller has (1a) in (m1) model the interestingly but prediction, the in pattern HJS KMST12 + BPS 09 , HJM + 07 akg o ffr nitrcieto o h -2AIAsoftware X-12-ARIMA the for tool interactive an offers R for package ] savsaiainto osac n xlr iesre aa With data. series time explore and search to tool visualization a is ] BJR08 + 11 s etbn ihadvrigclrsaet niaeteaccuracy the indicate to scale color diverging a with heat-band a use ] h nls eie o h es ope oe m)wt only with (m1) model complex least the for decides analyst the ] KMST12 BJR08 JSMK14 .Frslcigatime a selecting For ]. , BK11 ,teatosuse authors the ], + .Integrating ]. 07]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. h sg cnroilsrtshworapoc uprsuesi oprn ieetmodels different comparing in users supports approach our how illustrates scenario usage The n oeaut t plcblt i srsuiswt ttsiin,adt aiaei nnwusage new on it validate to and approach statisticians, our with in studies quality user prediction scenarios. via the applicability same assess its the to evaluate follow important to not is and do our it they Finally, in as models comparison process. approach, these selection a the of model of enable integration adaptation to full require possible A class might is selection. environment the It interactive model to during limited series. models also time time-series and is univariate other candidates approach against and model The models the ARIMA model. of seasonal adequate adequateness of an the propose judge automatically to not expertise does user on complex relies less approach a Our in resulted parsimony. Graphs of Qualizon principle using comparison the models visual satisfies series The which time model, tools. multiple state-of-the-art of in prediction account considering the into by of taken model been a choosing not have in which assists factors, approach Our prediction. one-step-ahead show the regarding not do plots residual slightly. the only if differ even capability criteria model, information prediction adequate the an the and choose judge structures and and recognizable models, examine more directly the or to in one involvement users of human enables and This integrated models an selected uses process. the approach selection on Our feedback model. visual adequate with and workflow parsimonious analysis a finding in users supports ..Dsuso n Conclusion and Discussion 3.5. 59 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. h rgnlvrini vial at available is version original The [ in published was chapter this of content The cec n ehooy AT–Pses hcg,I,UA coe 53,2015 25-30, October USA, IL, Chicago, Posters, – VAST Technology, and Science EE 2015. IEEE, in values missing In of imputation series. guided time statistically seasonal and Visually univariate Lammarsch. Tim “seasonal”. and to “cyclic” Rind, Alexander Aigner, changed Wolfgang Miksch, and Silvia Gschwandtner, 66 Theresia Filzmoser, page Peter Bögl, on Markus mistake small one fixed We authors. the from months the (e.g., cyclic and linear in points time. time boundaries year) neighbouring uncertainty/error the the the of of visually to patterns enables respect the structure with to cyclic values according this estimated and Including the of visual data. adequateness employ by the the and in imputation judging uncertainty inherent the about time tuning information of visual fine structure additional cyclic and include the evaluating we as approach our well In as and guidance. visually values techniques a missing imputation propose the We different estimate applying applications. sensor allows to of that range failing approach, wide imputation example imputation a Thus, in for guided imputed. statistically important applications, be is to values world need missing values real such missing of many these analysis in further For problem measurements. a are values Missing muaini nvraeSeasonal Univariate in Imputation etPse wr AT2015. VAST Award Poster Best iulyadSaitclyGuided Statistically and Visually https://doi.org/10.1109/VAST.2015.7347672 rceig fteIE ofrneo iulAnalytics Visual on Conference IEEE the of Proceedings BFG + 15 ]. © 05IE.Rpitd ihpermission, with Reprinted, IEEE. 2015 ieSeries Time CHAPTER ae 189–190. pages , 61 4 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 62 4. iulyadSaitclyGie muaini nvraeSaoa ieSeries Time Seasonal Univariate in Imputation Guided Statistically and Visually epooea prahta ae h netit neeti mue ausvsbeadallows and visible values imputed in inherent uncertainty the makes that approach an propose We . eae Work Related 4.2 . Introduction 4.1 ..b aln esrmaueet codn otesaitc ieaueaotmsigdata missing about reasons, literature various statistics by the caused to be According can this measurement. [ 4.1, sensor introduction failing the a in by discussed e.g. As available. not is it fmlil muainmtosi h pedxo i ok[B2 n pae n[vB14]. in updates and [vB12] book his of appendix the in methods imputation multiple [ of computation statistical for project R the methods. [ model-based Kleinman and and Horton methods, imputation-based methods, weighting methods, case/record Methods Imputation literature. (NMAR). relevant random the at see missing details not more and and (MAR), definitions random the For at missing (MCAR), random at completely Values Missing approach. data, our time-oriented in of We used structure representations work. the visual related values, relevant relevant missing the the regarding and and important background concept the main about the discussion present brief give we section this In time. cyclic and linear in values neighbouring to them comparing for to used be can interval imputation. confidence the or of boundary uncertainty error the appropriate communicate the method, the on Depending [ estimates of variability the compute from to techniques error imputation standard multiple in [ use introduced uncertainty or is imputation resampling that the repeated uncertainty apply the Others neglects which data. value, the missing the replace and [ value provide they single output of kind the and complete on rely [ usually they since methods, [ information statistical data many for difficulties cause values particles Missing of because missing be may [ values sensor. wrangling rivers, the data in plugging in quality considered water values be measuring Missing to when needs data. example, that missing problem of quality problem data the a face constitute analysts data domains application various In ot al ae iuain lo ocmueetmtsadcndneitras[ intervals confidence and estimates compute to allow simulations based Carlo Monte mueteemsigvle.Iptto ehd r aeoie ytetp fmto itself method of type the by categorized are methods Imputation values. missing these impute bu mlmnain fteemtos efcso mlmnain n akgsaalbein available packages and implementations on focus We methods. these of implementations about LR02 LR02 , to is methods statistical established of application the enable to way common the but ], HK07 , GH07 All09 r lmnsi aae,weeaosraini isn,wihmasthat means which missing, is observation a where dataset, a in elements are , HK07 All09 .Teeaeafwseilzdmtost nlz aawt isn values missing with data analyze to methods specialized few a are There ]. r nahge ee rue no4ctgre [ categories 4 into grouped level higher a on are LR02 rvd vriwaotiptto ehd nldn discussion a including methods imputation about overview a provide ] hr r ieettpsfrtesuc fmsigdt:missing data: missing of source the for types different are there ] .I aeo eetdrsmln ti osbet opt the compute to possible is it resampling repeated of case In ]. All09 C20 R , GH07 .VnBre rvdsals fimplementations of list a provides Burren Van ]. LR02 , HK10 .Mlil muaintcnqe,e.g., techniques, imputation Multiple ]. , LR02 .Sm ehd nyipt a impute only methods Some ]. LR02 , HK07 KHP :complete ]: + 11 Sch99 .For ]. ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . er ftesaoa ati yearly). is part seasonal the if years 1.5 . ieSre muainApproach Imputation Time-Series 4.3 euecndneitrasadbxpos[ box-plots and intervals confidence use we nltc ata[ mantra Analytics h eea dai obnfi rmdslyn h mue ausfo w ieetperspectives, different two from values imputed the displaying from benefit to is idea general The imputation suitable a with values missing impute to is approach, our with support we task The analysts guiding statistically and visually for approach no is there knowledge, our of best the To values. approach. TIMI the propose to us motivates This values. iulyidct isn ausi ohves43 iesre iepo ihlna ieaxis time linear with plot line time-series 4.3, views both in values missing indicate visually iulRepresentations Visual aeylna n ylctm.Fgr . hw h eino u prah o h two the the (b) For and plot, approach. line our series of time design the the (a) shows views, [ 4.1 coordinated Figure use we time. perspectives, cyclic and linear namely imputed the of adequateness the judging for guidance statistical and visual provide and method estimated the of adequateness the judging imputation additionally different for cycle of seasonal outcome the the uses a compares only nor them but methods, of data, None of time-series GUI. exploration for form-based also and and methods menu-driven analysis imputation [ multiple see the provides consider data, that support general we package exeptional more work, that for related values solutions farther imputed (GUI) As and missing interface data. user time-series in graphical values some missing of imputation the in shown is plot cycle A cycle. seasonal the 4.1. in proximity Figure their of (b) through in related [ are plot that single values one the in to component seasonal the [ and plot trend cycle the is use we representation [ and than more (usually trend a than shorter but is part, part seasonal cyclic the the than included, duration If longer a component). cyclic be (and to variation defined remaining and season, trend, are [ Data Time-Series [ rsig ofcltt hstcnqe,i sncsayt s oevrain ftebox-plots the of variations some use to necessary is and it linking [ techniques, as this well facilitate as To view, multiple brushing. and coordinated manipulation, direct focus+context, like lrte n h eincneto ylctm ragmn norapoc,w consider we approach, our [ data in time-oriented arrangement of time part cyclic as of related concept data time-series design the and ularities nbe oaayetelvlo netit nteetmtdvle.T mrc emsVisual Keim’s embrace To values. estimated the in uncertainty of level the analyse to enables xedd oeifraievrinlk h ilnpos[ plots violin the like version informative more extended, [ Cle93 AMST11 Cle93 Pot06 AMST11 TAKP13 , , .Teccepo satcnqet iulz h eoa yl ihntm eisdata series time within cycle sesonal the visualize to technique a is plot cycle The ]. Cle94 PKRJ10 , [ plot cycle and ] Cle93 , , STA BD10 ,lk ipie eso [ version simplified a like ], .Tesaoa opnn,eg h ot nmnhydt,i rue and grouped is data, monthly in month the e.g. component, seasonal The ]. KMS + 13 sdt bandfo bevtoscletdoe ie sn iegran- time Using time. over collected observations from obtained data is , HA18 .T iulyepoetemsigvle rrte h muain fsuch, of imputations the rather or values missing the explore visually To ]. + 08 sdi u prahaesrih owr n eletbihd To well-established. and forward straight are approach our in used Cle93 norapoc eapyvrositrcintcnqe [ techniques interaction various apply we approach our in ] osdr iesre scmoe y3(r4 opnns This components. 4) (or 3 by composed as time-series considers ] ,w s oo n ofiec nevl sue y[ by used as intervals confidence and color use we ], CT82 Tuk77 , Tuf83 Cle93 eedn nteiptto ehd This method. imputation the on depending ] ,a brvae eso [ version abbreviated an ], AMST11 .Ti ae tpsil 1 odsenthe discern to (1) possible it makes This ]. STA AMST11 + HN98 13 n 2 opr ahdt value data each compare (2) and ] , CCH13 ..Tm-eisIptto Approach Imputation Time-Series 4.3. .Teltrtr nstatistics in literature The ]. .Aohripratvisual important Another ]. .Aei [ Amelia ]. PKRJ10 HKB11 yl plot cycle HKB11 ,o an or ], HS12 sa is ] ] ] 63 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 64 4. iulyadSaitclyGie muaini nvraeSaoa ieSeries Time Seasonal Univariate in Imputation Guided Statistically and Visually hnhvrn/eetn on noeve,tecrepnigpitgt ihihe nthe in highlighted gets point corresponding the view, one in point a hovering/selecting When gets window other the in point corresponding the (b), or (a) window, one in point a clicking When ihtemuecro,o yzoigwti h eprlai.Ti hw h eut fdifferent of results the shows This axis. temporal the within zooming by or cursor, mouse the with the Combining bars. vertical red by represented interval, confidence or boundaries error the with a n b,ete yepiil eetn h ee fdti n() rb neato sdsrbdin described as interaction by or (c), in detail of level the selecting explicitly by either (b), and (a) h isn ausaeetmtduigteeiiilmtosadaesona lc ostogether dots black as shown are and methods initial these using estimated are values missing The iw ih()atm eisln ltuigalna ieai,()tecrepnigccepo (for plot cycle corresponding the (b) axis, time linear a using plot line series time a (a) with views aus oln orsodn onsi hs iw,w pl idrcinllnigadbrushing. and linking bi-directional apply we views, these in points corresponding link To values. etc. Feb. then Jan., for values first showing (b), 4.1 Figure in data example our in like visualized nterpeetto sdsrbdaoeadlnigte sn oriae iw,alw to allows views, coordinated using them linking and above described as representation the in muainmto eg,tm eismdl rmlil muain.Clr r sindt the to assigned are Colors imputation). multiple the or of error models outcome by series the represented time on are (e.g., depending versions, method details box-plot imputation These modified 4.2). or Figure intervals, (cf. confidence other boundaries, value each missing to the next around methods area level imputation the the hovering setting by either 4.1c), by (Figure views, panel both configuration in the expanded in be detail can value of imputed view. specific series a time about linear mean Details the month’s in the representing part-of-the-season this plot cycle of the points in all bar highlights horizontal the Hovering/selecting imputed view. the other of adequateness the judge to to close user points the time enables the This besides cycle. also seasonal but a time, in linear other in each values neighbouring their to them compare considered is behaviour [ their structure and time applications world cyclic real a in as common very are series time Seasonal intervals confidence boundaries, error using [LR02]. the instance box-plots communicate for or and methods, quantify imputation to the allows of methods uncertainty imputation different the from values estimated imputation of [ set well R preselected as environment activate/deactivate, a statistical to the use possible in we confidence is Initially, implemented it methods or methods. panel boundary this imputation In error different add/remove 4.2). corresponding imputation as (Figure the of view list indicate detail a the to shows in color (c) interval panel assigned control an The with supplementary. methods the in explained are details More up/down. dot the dragging by well. value as estimated highlighted the adjust to user the allows in latter shown The are details 4.2. more Figure request, of Upon dots) displayed. (black are values bars) estimated (red The boundaries panel. and configuration values a missing (c) and supplementary), cf. details Coordinated imputation. guided statistically and visually for approach our of Overview 4.1: Figure a AMST11 .Arnigtedt ons seilytemsigones, missing the especially points, data the Arranging ]. b C20 R n akgs[ packages R and ] HK07 c , vB12 ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. epooe iulyadsaitclygie prahfrteiptto fmsigvle in values missing of imputation the for approach guided statistically and visually a proposed We . icsinadConclusion and Discussion 4.4 opeev h otx loi h ealdve Fgr .,se 3)w s eatczoom semantic a use we (3)) step 4.2, (Figure view detailed the in also context the preserve To ausadteriptdvle,aepsil nbt iw,()ad()i iue41 oprovide To 4.1. Figure in (b) and (a) views, both in possible are values, imputed their and values values. h ee fdti euse.Teclrecdsteiptto ehd f iue4.1c. on Figure depending cf. (3), method, and imputation (1) the as encodes The well color as The (2), interaction. requested. and mouse detail (1) and of image level level between the shown zoom is to level zoom according in varies transition missing representation with interactions the This details, demand. more on details more for interactions of Sequence 4.2: Figure imputation statistical combining By to are. user values the imputed enables the approach adequate our how how discussed in We confidence cycles. gain seasonal with series time univariate in aids and details the shows plot line the in or plot accordingly. cycle Moving value the views. imputed in both and the value in level adjusting missing supported higher a are a interactions to on described mouse value above imputed the these the All on overview demand. an on provides details This display. bifocal a using missing the under-estimates or over- method one or boundary error differently, wider values a impute has methods method the imputation one of the if spread how e.g. suggested comparing more the allows values within It imputed drag-and-drop the methods. through values imputation of is adjust adequateness user can the the user judge addition, The can In accurately. and involved plausible. can most uncertainty user is the the value of methods, which aware imputation deciding different when in uncertainties uncertainty these the consider to about possible details is these providing time. it cyclic By value, and selected linear the the the the both moving in in adjusting value simultaneously values the For and neighbouring changes consider also highlighting necessary. which By directly, if moved values and view. dragged imputed other be the can adjust dot the and value, imputation tune imputed different fine of estimates further comparing and for methods allows which 4.1c), (Figure methods imputation 1 2 3 ..Dsuso n Conclusion and Discussion 4.4. 65 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 66 4. iulyadSaitclyGie muaini nvraeSaoa ieSeries Time Seasonal Univariate in Imputation Guided Statistically and Visually sli u nteitouto,msigvle r i su ntm eisdt rmra world real from data series time in issue big a are values missing introduction, the in out laid As in applicable not is approach our that is limitation Another outlier. an can is approach really value the the Furthermore, whether imputation. the improve to order in excluded be then may which hr r eea osblte oetn u prah o utvraetm eisapossible a series time multivariate For approach. our extend to possibilities several are There au.Idctn h iepit novdi h muainmyhl dniyn upcosvalues, suspicious identifying help may imputation the in involved points time the Indicating value. oankoldeada piie iulrpeetto o esnltm series. time seasonal for representation visual optimized an and knowledge domain of use make for and views approach separate our instance, extend for components. to component, seasonal is each and for idea trend views One several with trend. strong series very time judge a decomposed to has method series imputation time the the of case outcome the compare and value suspicious a impute to used missing be a for estimate good a provide not structure. will cyclic outliers the on representing based visually extension imputations to this that techniques For is appropriate limitation values. more One imputed about the think improve to to needs used one be can variables the between correlation values, these about confidence the raise a values. values, enables unsuitable imputed This adjust the and of representations. adequacy box-plot the in values of about imputed judgement information the better detailed of imputation provides variation the and and views of uncertainty and both outcome the linking into The Using directly arrangements. embedded side. visually different by two is side these methods arrangement, with cyclic of a track series in keeping time view helps the a displaying brushing with for coordinated view axis a time provide linear we a interface, visual interactive an with methods plctos u prahepnso h osblte fiptto ehd yincorporating by methods imputation of possibilities the on expands approach Our applications. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [MT1”o ae6 opeetaoefl ieerr rgnlfl ilorpi reference: bibliographic full Original error. line overfull a prevent to 69 page on “[AMST11]” h rgnlvrini vial at available is version original The h otn fti hpe a ulse n[ in published was chapter this of content The compre- and identifying for technique visualization effective and established an is plot cycle The ulsig erne,wt emsino h uhr,i codnewt h eandrights retained the with accordance in authors, the of permission with Reprinted, Publishing. aksBg,PtrFlmsr hrsaGcwnte,TmLmash oe .Lie Silvia Leite, A. Roger Lammarsch, Tim Gschwandtner, Theresia Filzmoser, Peter Bögl, Markus ish n lxne id yl ltrvstd utvraeotirdtcinuiga using detection outlier Multivariate revisited: plot Cycle citation the abstraction. distance-based Rind. of Alexander front in and Miksch, al.” et Aigner “by overall the added to We fit to adapted dissertation. only was this it content, of article the style the to of formatting modification parts or revision All without form. used license are exclusive Publishing Eurographics/Blackwell the in defined series. time to periodic possible multivariate is in it cycle approach seasonal our the With regarding outlyingness dimensions. with the underlying plot judge the cycle of distance-based plots a cycle juxtapose of original that interpretation Cleveland’s views and multiple exploration coordinated cycle interactive employ of enable we To capacity outliers, Analytics detection. Visual outlier cycle the multivariate distance-based enhance for we this plots environment, Utilizing exploration original. interactive the an to in similar a plot overview representation using one a plot to several cycle retain dimensions across modified and multiple exist a dimension reduce that propose to We distance) patterns explore. (Mahalanobis series, to abstraction time impossible distance-based multivariate or For harder much are data. Unfortunately, visually dimensions univariate perspective. to allows to different also a limited It from is outliers cycles. it and seasonal values and extreme trends contextualize like and series, identify time periodic in patterns hending optrGahc Forum Graphics Computer h utvraeCcePlot Cycle Multivariate The https://doi.org/10.1111/cgf.13182 BFG + 63:2–3,2017. 36(3):227–238, , 17 ]. © 07Eurographics/Blackwell 2017 CHAPTER 67 5 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 5. 68 h utvraeCcePlot Cycle Multivariate The neapeaemnhyaeae ftmeaue vrmlil er,wt erylw yearly a low, yearly a with years, multiple over temperatures of averages monthly are example An nipratojciei iesre nlssi h eeto fotir,wihi multivariate in which outliers, of detection the is analysis series time in objective important An ecnie opeesv srsuiso h akpromnei uuewr.Temain The work. future in performance task the of studies user comprehensive consider we eg,mnho er ncoepoiiy hs ugop ihntesaoa yl nbethe enable cycle seasonal the within subgroups These proximity. close in year) of month (e.g., iesre fe olwaproial ecurn atr,cle eidco esnlpattern. seasonal or periodic called pattern, reoccurring periodically a follow often series Time aibe n o h utvraesae h yl ltdsrbdb lvln [ Cleveland by described plot cycle The space. multivariate the for and variables vdnefra nrae efrac forapoc sbyn h cp fti ae,but paper, are: this paper of this scope of the Empirical contributions beyond 5.7. is Section approach in our discussed of and 5.5, performance 5.6 Section increased Section in an in introduced for illustrated abstraction, evidence as further reasoning, of this anomalies aid ease detecting the to For space, allows multivariate model. the mental combine in multivariate to outliers a load like in cognitive dimensions increases individual and cycle the time such transfer these additional multiple and of takes none considering separately even variable Furthermore or each some space. for just multivariate plots in in abnormal out behave stand may time-consuming, but points difficult, dimensions, data a dimension, Single is each plots task. human of cycle unreliable the multiple values and allows observing extreme/outlying this by and mentally Although overview trends, an plot. patterns, building cycle series, seasonal original time analyze one seasonal to the by multivariate follow analyst for represented not effect be do same can that the variable achieve groups To whole each or pattern. groups seasonal the the within of behavior values cycle extreme seasonal and same outliers the of within detection points data comparing for allows and it pattern and seasonal data, these univariate of in identification trends the facilitates which technique, visualization underlying effective several the for both trends, and pattern seasonal consider to requires series time seasonal measures quality water or deaths data, of pollution number air like like [BD10]. with multivariate, univariate, connected are be disease examples may world cardiovascular by real series many caused time but seasonal domains, cases, of various influenza Examples in of appear number health. series or time seasonal economics, Such ecology, like between. in transitions smooth and high, build We cycle. seasonal series. and time trends effective like and distance), series, established (Mahalanobis time the measure seasonal upon distance multivariate multivariate distance-based in a our patterns on of visualize based construction to abstraction the an and For use patterns we outliers. explore plot univariate to cycle as plots well cycle as utilizing multivariate environment detect interactive to an propose we paper this In Introduction 5.1 h osrcino aaaoi-itnebsdccepo that plot cycle Mahalanobis-distance-based a of construction The • • nitrcieepoainevrneto oriae utpevescmiigthe to combining plots views cycle original multiple with coordinated plot cycle of distance-based environment exploration interactive An – – ssamdfidvsa noigfrteedsacs u ean h dao h original the of idea the retains but distances, plot. these cycle for encoding and visual distances modified multivariate a generalized uses on based step abstraction additional an involves yl plot cycle yCeead[ Cleveland by Cle93 ,wihi iie ounivariate to limited is which ], Cle93 san is ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ok steccepo [ plot cycle the as work, Aigner by data multivariate time-oriented visualize to proposed been have approaches of variety A aibe sats setta rae xcl akcmlxt htwrespromneof performance worsens that complexity task a exactly creates that aspect task a is variables data multivariate for visualization space-efficient Another graphs. braided called visualization outlier for methods statistical (3) and account, into time (2) of methods, aggregation structure the other take and which projections visualizations, using mainly data, multivariate for visualizations lt [ plots bv,i spsil oioaetesaoa opnn ftm eisadt efr further perform to and series time of component seasonal the approaches isolate the to in possible as is periodicity of it representation above, visual the multiple Besides including visualizations. that pixel-based assume we data, univariate tested only they though Even load. [ cognitive Selow van and Wijk Van by work related as [ well al. as et Keim data), by multivariate work original the visualizations, pixel-based For Visualization. in Time of Structure The the underlying data the of representation visual to no hard is for it measure. there method makes distance and visualization projection intervals, strong the of However, a length is patterns. the It temporal compare measures. irregular similarity and on regular based both space finding 2D a in series time [ al. et Schreck by [ multiples al. et going Haroz technique example Another [ for al. cycles. and et seasonal time Playfair for same to appropriate the back less at are variables but multiple detection, showing trend for temporal suitable are approaches all that discovered [ [ CloudLines are [ visualizations Graphs space-efficient Horizon multiples, like small arrange- using well-suited, visual For and are clustering patterns. discuss multivariate and detecting for graph stacked ment [ the graph in stacked information the additional is incorporate 1786, in Playfair by introduced already Data. Multivariate for Visualizations detection. [ al. et Work Related 5.2 [ vlaetepromneo ie-ae iulztosacrigt h akcmlxt and complexity task the to according visualizations pixel-based of performance the evaluate h ai o eea ute ulctos[ publications further several for basis the ta.[ al. et AMST11 TAS04 JME10 Pla86 AMST11 opr xsbsdvsaiain ihrda aot freape h iewel and wheel) time the example, (for layouts radial with visualizations axes-based compare ] – – – ,teapcso rnlrte n ylsaetems motn nsi h otx four of context the in ones important most the are cycles and granularities of aspects the ], eueteifrainls nvtbyacmayn h utvraedt abstraction. data multivariate the accompanying inevitably loss information the outliers, reduce multivariate of seasonality, the interpretation the considering series support time multivariate in outliers identify ihri n oriaesse ra ml utpe [ multiples small as or system coordinate one in either ] opr h rdtoa iepo,salmlils oio rps n new a and Graphs, Horizon multiples, small plot, line traditional the compare ] .Sial prahscnb aeoie notcnqe,wihpoie(1) provide which techniques, into categorized be can approaches Suitable ]. KBK11 Pla86 Cle93 ,wihaeisie yieso vnRvr[ EventRiver of ideas by inspired are which ], HKF16 SBVLK09 o h pca aeo iait aaaecnetdsatrplots, scatter connected are data bivariate of case special the for ] , Cle94 .Avr iia ocp aldtaetre sue nsmall in used is trajectories called concept similar very A ]. uprste seScin53fradtie explanation). detailed a for 5.3 Section (see them supports ] .Vsal iia r iecre [ curves time are similar Visually ]. Rei08 SFdOL04 rqetyue prahi sn eea line several using is approach used frequently A hl h tutr ftm a aydffrn aspects different many has time of structure the While , Few08 , LAB n ulznGah[ Graph Qualizon and ] + Pla86 09 , KJL14 , KKA95 BW08 LYK .Broe l [ al. et Borgo ]. Tuf83 .W ta.[ al. et Wu ]. wihas includes also (which ] BSH + vWvS99 12 .A alternative, An ]. FHR .Tmnk tal. et Tominski ]. + 16 htproject that ] + ,hsbeen has ], 14 WWS ..RltdWork Related 5.2. BPC .Javed ]. + + 10 16 ] ] 69 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 70 5. h utvraeCcePlot Cycle Multivariate The gawl[ Aggarwal The ehd o ule eeto ntm eisi vial.Piaiy erfrt uvy,taxonomies, surveys, to refer we Primarily, available. is series time in detection outlier for methods ehdadcmol sdi ttsist ademliait ules[ outliers multivariate handle to statistics in used commonly and method Approaches. Oriented Statistical ecurn yl,lk esnlcce n rn opnn,wihotnapa ntime in appear often which component, trend a and cycle, seasonal a like cycle, reoccurring a netgt h esnlcceadtetedaogtime along seasonal trend the the on and based cycle data seasonal of the [ type investigate Terpenning this and for Cleveland visualization by alternative plot an subseries as presented was It series. Plot Cycle 5.3.1 variables. some define and [Cle93] introduce Cleveland briefly by we plot plot, cycle cycle distance-based original the the construct and compute to how explain we Before Background 5.3 both approach. include oriented nor within way, statistical relation compact a in and in time intuitive time of an same structure in the the them at with visualize deal neither can Mahalanobis series, that time well-established methods multivariate find the statistical not specifically several could We found detection, we distance. Moreover, outlier multivariate data. multivariate for structure for the approaches approaches account different into take and which time approaches, of visualization identify could we summary, intuitively. meaning In interpret the to and difficult lost, is is components periodicity, principal the the especially of time, of context the e.g., reduction, used, dimensionality be can [ analysis methods component reduction dimensionality principal methods, have visualization procedures basic [ robust with outliers variables, dimensions multivariate required identify the to of used estimation be the to on outliers of influence the avoid [ methods detection outlier outlier distance-based parametric a of class is the distance in Mahalanobis method The detection detection. [ outlier Ben-Gal for series. methods time statistical in detection by work outlier recent on The part [ detection. Austin outlier and for systems Hodge hybrid and learning, topic. machine the networks, neural of breadth the cover which works other or [ data” remaining the from different of significantly definitions is varying which are point There [ data solution. literature possible in a outlier methods term visual the and time long a for statistics in [BAF prediction for used be well as [ can events models abnormal of detection as such analysis sepandi ealb etn ta.[ al. et Bettini by detail in explained is odsatn on o noeve ndffrn ye fmtos aeysaitclmodels, statistical namely methods, of types different on overview an for point starting good onsi iet agruis o xml,huscnb rue nody.‘a’i granularity, a is ‘Day’ days. into grouped be can hours example, For units. larger to time in points yl plot cycle Agg13 sarpeetto ecie yCeead[ Cleveland by described representation a is ,gvsa ndphoeve notirdtcini eea,wt particular a with general, in detection outlier on overview depth in an gives ], Agg13 Agg13 , rmliiesoa cln [ scaling multidimensional or ] BL98 BJW00 ule eeto a encniee oeotchallenge foremost a considered been has detection Outlier , FGR05 BG05 .Esnily rnlrt sagopn fdiscrete of grouping a is granularity a Essentially, ]. CT82 CTB + 5 MHR 15, .Te a esmaie y“notiri a is outlier “an by summarized be can They ]. BG05 .Fraayigadvsaiigmr hn3 than more visualizing and analyzing For ]. .Ccepos[ plots Cycle ]. + 12 granularities nrsdas uhsaoa ieseries time seasonal Such residuals. on ] ie noeve n aooyon taxonomy a and overview an gives ] + 11]. Cle93 o iesre htcontains that series time for ] Agg13 h ocp fgranularities of concept The . AMST11 BBH11 BG05 BG05 .Abodsetu of spectrum broad A ]. .Hwvr applying However, ]. .I sa established an is It ]. , , HA04 Cle93 HA04 , r sdto used are ] rvd a provide ] PnP01 .To ]. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. we’i sdt raetetc ak ntehrzna xs h aapit r hw for shown are points data The axis. horizontal the granularity on the marks of tick granule the each left) create 5.1, to Figure used (see plot is line ‘week’ conventional the In ‘week’. and ‘day’ hnvrw att xlctyrfrt h rgnltcnqe[ technique original the to refer explicitly to want we whenever one called is day specific each while ilrfrto refer will au o hsdyo eki niae yahrzna line. horizontal a by indicated is week of day this for value aus n esrmn o ahJnayo hs er.Lkws,temaue o month for measures the Likewise, years. 8 these of January each for measurement one values: C C ahgaueo ahgauaiy olwn h omlodro ie nteccepo (see plot cycle the In time. of order normal the following granularity, each of granule each o h eann ato h ae eseiyvralsadst o h xlntos We explanations. the for sets and variables specify we paper the of part remaining the For Specification Variable 5.3.2 term the use we plot following cycle the original Cleveland’s In Yet only. today. data used series and time known [ univariate commonly Cleveland represents by is work it later like the subseries, in the mean, for the representing line horizontal the [ pattern” plots [ Terpenning seasonal line seasonal and the “as make Cleveland shows shown “To that are is: plot trends plot a individual cycle are within the the week embedded and of the discernable”, objective of visually days the components other that trend All state and Hence, al. weeks. et four etc.). Tuesday, Aigner these (Monday, accordingly. week of Mondays grouped of day all by contains grouped ‘Monday’ is group axis the horizontal the right) 5.1, Figure granularities the using 5.1, Figure in this illustrate we granularities: two of grouping of order average The line. a form to connected are cycle group the one and within (left) days plot the line plot conventional cycle the the in In letter [ same (right). al. the plot et with Aigner labeled from is (adapted day plot individual cycle Each original Cleveland’s of Explanation 5.1: Figure h 2mnh sebigaya.For year. a assembling months 12 the h aepsto,but position, same the ieetyas o ntne ie otl measurements monthly given instance, For years. different eray ac,adArlwudb nee by indexed be would April and March, February, 9 1 : C , . . . , for , = 0 , . . . , : = o ipicto euse we simplification For . = ⌊ 1 = = , . . . , − B ? 1 dmninltm eisdt by data series time -dimensional ⌋ r nposition in are ie a Given . days 9 ie ute ntecasrgauaiy uha h aemnhin month same the as such granularity, coarser the in further times CT82 esnllnt ( length seasonal 8 ,tevle ftesbeisaepotduigvria ie on lines vertical using plotted are subseries the of values the ], ftesaoa yl n adding and cycle seasonal the of G granule : 8 orfrt the to refer to = and 1 ftegauaiy‘a’ h yl ltivrsthe inverts plot cycle The ‘day’. granularity the of 9 s ) = 8 iepoints time , - fweek of day for mean = 0 ? , . . . , = 2 dmninlmaueeta iepoint time at measurement -dimensional , 3 { AMST11 G , epciey ntefloigwe following the In respectively. 4 1 7, G , . . . , G Cle93 : G 8 vr8years, 8 over + a fweek of day for trend C 9 8 ∗ + B = .16.I ale okby work earlier In 176]. p. , sdsrbdi hssection. this in described as ] 9 ∗ } ol ersn January 8 represent would 9 B days where , ∗ esrda iepoint time at measured B Cle93 ie iepitin point time a gives wel pattern) (weekly part seasonal shape: overall rgnlcceplot cycle original B 8 = ,aln sused is line a ], = 2represents 12 1 AMST11 B , . . . , ..Background 5.3. and ]). , 71 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 5. 72 h utvraeCcePlot Cycle Multivariate The sabssfrdsusn h eindcsosadraosfrhwt pl h additional the apply to how for reasons and decisions design the discussing for basis a As as to referred be will and group the within points data the of median or mean the be can which ihntegroup. the within TssT-5.Te edrv h ak o ule eeto,gigbyn h ak supported tasks the beyond going detection, outlier for tasks the derive we Then T1-T5). (Tasks h rtsto ak sdrvdfo h ak upre yteoiia yl lt(Cleveland plot cycle original the by supported tasks the from derived is tasks of set first The u xml bv,tegop ersn h otso year: a of months the represent groups the above, example our lt[BFG plot [ data the representing The bars Sunday. or and points Saturday the on of pattern points lowest the basically the means to group drop is saddling a Wednesdays, a Mondays within on of with behavior peak group Friday the the and until 5.1, Thursday increase perceived Figure on steady center of lower an group plot by followed cycle the value by the lower formed in a example shape generally For the mean representation. we visual cycle, the seasonal in the of pattern terms By the group. use will we following the In Tasks criteria. these 5.4.1 meets that example prototypical an our is For which requirements. distance, a specified Mahalanobis from the the independent apply meets is we it abstraction implementation as our long Theoretically as measures, T6-T6). distance (Tasks specific plot plot cycle cycle original original the the by by supported are that tasks the derive first we abstraction, distance-based Measures Distance for Requirements and Abstraction Task 5.4 dataset. whole the of median or mean ( center group the of one within points as bins these to refer 1 dniyteoealpteno h esnlcycle. seasonal the of pattern overall the Identify T1: 3 opr hne ihnec ru otesaoa yl n cosgroups. across and cycle seasonal the to group each within changes Compare T3: group. each within behavior the Identify T2: . . . Cle93 , - 12 orsod oa nlsso h nrgauaiy .. atrso h ots average This months’ the cycle. of seasonal patterns the e.g., of granularity, pattern finer overall the the of of analysis an idea to an corresponds get to wants one component, o h eairo n ru oprst h eairo nte group. interesting another also of is behavior It the extent. to which compares to group and one series of time behavior whole the the how of patterns the drives them granularity, coarser the to years. corresponding several time over trend larger univariate months For a within identify patterns group. to e.g., each done within often subseries is the this of series, behavior the assess to needs one year. the over h niiulbhvoso hs set ftetm eis n at oko hc of which know to wants one series, time the of aspects these of behaviors individual the ,CeeadadTrenn [ Terpenning and Cleveland ], = + December 15]. - 8 ) oevr edfieagoa eeec point reference global a define we Moreover, . } rus( groups o aho h rusw a en ru eeec point reference group a define can we groups the of each For . 8 = 1 B , . . . , ^ 8 ) ihntesaoa yl.B writing By cycle. seasonal the within CT82 rus where groups, pattern ) swl sfo u xeinei pligtecycle the applying in experience our from as well as ]), ftesaoa yl and cycle seasonal the of eieteoealpteno h esnlcycle, seasonal the of pattern overall the Beside - 8 ⊆ ie iesre ihaseasonal a with series time a Given - , Ñ 8 B { = ` 1 - named , - 1 8 = = January, - ∅ behavior 8 lblcne ( center global and eidct h data the indicate we Ð - 8 B = 2 1 ihneach within = - ` 8 February, 1 ` , . . . , = Besides - - ) In . for B , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. h sastatistician. a is who series. time seasonal multivariate in detection outlier for required tasks specify we Additionally, ovsaiemliait iesre naccepo,w eda diinlasrcinse.We step. abstraction additional an need we plot, cycle a in series time multivariate visualize To both detection outlier and statistics robust about knowledge domain from derived are tasks These 6 eetmliait n nvraeotir ae nteseie boundary. specified the on based outliers univariate and multivariate Detect T6: 1 plcbefrmliait data. multivariate for Applicable R1: abstraction data the outlier requirements: for following in measurement the used distance meet commonly the to above, are needs described measures tasks distance the Moreover, support representation To visual plot. a detection. cycle retain multivariate original to for us the applicable allows to also compute, measures, similar to Distance easy concept. are well-known they a because and measure, data, distance a use to decided Measure Distance a for Requirements 5.4.2 points. data of outlyingness resulting the track and outlier-boundaries Adjust T10: other the in points data corresponding the explore and outliers univariate Detect T9: univariate the in points data respective the explore and outliers multivariate Detect T8: outliers. multivariate as well as univariate are that outliers Detect T7: [ co-authors our of one of experience long-lasting the from and literature from cycle. seasonal the from deviate that groups whole Identify T5: group. each within values extreme/outlying Detect T4: aibe swl si utvraespace. multivariate in as well as variables h esnlcce(1 sdtce,oewnst dniygop ihntesaoa cycle seasonal the within groups behavior. identify this to wants from deviate one that detected, is (T1) cycle seasonal the outliers. possible onaisseiywa eaae omlfo uligdt ons n at oadjust outliers. to borderline wants detect One to order points. in data outlying boundaries from these normal separates what specify boundaries space. multidimensional in its representation see its and as dimension well (univariate) as one dimensions in other point in data position a selecting with exploration the start plots. univariate the in values corresponding the analyze and space. univariate and multivariate both, in outliers constitute that points context. data detect to needs one This outliers. boundary. multivariate this and outside univariate for points possible data be detect easily to and needs boundary tolerance a specify to h aesaoa yl.Oewnst eetsc xrm ausadcnie hmas them consider and values extreme within such detect points to data wants to One respect cycle. with seasonal values same extreme/outlying the identify to allows plot cycle n ed omk eetoso aapit,i re oepoemliait outliers multivariate explore to order in points, data of selections make to needs One ..Ts btato n eurmnsfrDsac Measures Distance for Requirements and Abstraction Task 5.4. hsts ssmlrt 8 u n at to wants one but T8, to similar is task This h a h aai ragdi the in arranged is data the way The hnteoealptenof pattern overall the When FGR05 xedn akT6, task Extending , FRGTA14 n wants One The ], 73 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 74 5. h utvraeCcePlot Cycle Multivariate The itnemaueqatfistedsac ewe w onsi utvraesae o our For space. multivariate in points two between distance the quantifies measure distance A hc et h eurmn R3. requirement the meets which [ al. et Filzmoser to According center data the with h center The center The opr [ compare hs ehd r nune yotir.Tu,t vi h nuneo ules euerobust use we estimation, outliers, the of for influence methods the of avoid classical estimation to the using Thus, for when methods outliers. and by data, influenced the are methods on these based estimated be to need the of 0.975, quantile the e.g., quantile, certain a approximately than are distance data, Mahalanobis the squared of higher matrix a covariance the to of distance respect Mahalanobis with with center, squared chisquare-distributed the the to distribution, points normal data multivariate the of distribution. case chi-squared the in of detail, quantiles more using outliers, In multivariate identify to used be can distance the (5.1), R1. Equation requirement see our definition, fulfilling By data, measure multivariate for distance detection. applicable established outlier is an multivariate distance is Mahalanobis in it that used is and distance statistics Mahalanobis in the required. using are for methods reason robust main detection, Our outlier for used if but plot, methods cycle estimation distance-based robust to classical from ranging [ methods, different are there them estimate for a Given the of 5.3a. spread Figure the in and ellipses matrix correlation covariance the involves A which R4. data, requirement the our of distance meets structure Mahalanobis which covariance the data, the distance, the specifies Euclidean of the correlation like the measure, also distance considers above. basic described a measured to distance contrast a In on requirements detection our outlier meets multivariate and for statistics PnP01] in HA04, method [BG05, established [ an distance is Mahalanobis which the distance, use multivariate to decided we implementation prototypical Measure Distance 5.4.3 data. the of correlation the Incorporates R4: exists. value cut-off Specific R3: outliers. against Robust R2: ? j BG05 2 ? iesos n2dmnin h pedcnb lutae ihelpe,setedt onsand points data the see ellipses, with illustrated be can spread the 2-dimensions In dimensions. ecnueti uniea onayfrdcdn hte aapiti notiro not, or outlier an is point data a whether deciding for boundary a as quantile this use can We . : = , 1 FRGTA14 = , . . . , BG05 ` ` swl stecvrac matrix covariance the as well as n oainematrix covariance and n h center the and , , FGR05 .Ee huh h pcfi ehdi o eeatfrtecntuto fthe of construction the for relevant not is method specific the though, Even ]. ` n oainematrix covariance a and , , FRGTA14 MD ? ere ffreedom, of degrees ` ( FGR05 ` G and : ? sdfie as defined is `, , dmninldtstwith dataset -dimensional ,tems omnyue ehd r the are methods used commonly most the ], Σ Σ hc etrqieetR.I h ttsisliterature, statistics the In R2. requirement meet which , ,i siae ihrbs rcdrs h Mahalanobis the procedures, robust with estimated if ], Σ ) = edt eetmtdbsdo h dataset the on based estimated be to need q ( Σ G : Σ eurdt aclt h aaaoi distances Mahalanobis the calculate to required h aaaoi itnebtenpoints between distance Mahalanobis the , − j 2 ? ` hs oeta utvraeotirhas outlier multivariate potential a Thus, . ) T Σ − 1 ( G : = − observations, ` ) . Mah36 - ,ageneralized a ], = { median G 1 G , . . . , - To . (5.1) sa as G = : } , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hc hw nasrcino utvraetm eisuigdsacs hsscini ie to aimed is section This distances. using series time multivariate a of abstraction an shows which h aneeeti u neatv xlrto niomn stedsac-ae yl plot, cycle distance-based the is environment exploration interactive our in element main The eto ...W otesm transformation same the do We 5.5.1. Section otegoa center global the to iefrec ru,sriga ru aeln.I iue52w s h rgnlccepo to plot cycle original the use we 5.2 Figure In line. base group as serving group, each for distance line Mahalanobis the zero compute center we group the particular, each groups of In between all Instead 5.6). of Section center (see can line. interactively global value changed the zero absolute default the the by and use where center we number, reference, group real central this a as between is line distance center group as each considered plot, be cycle original indicate the that lines In vertical cycle showing distance-based by the supported in is centers T5 pattern and group overall like T3, the this cycle, T1, of reoccurring tasks identification time for the The required seasonal of is multivariate pattern plot. for which the year, shows but the plot plot, over cycle cycle season original original the The the as 5.4.1. tasks Section same cf. the series, support to is goal Our Distance) (Inter-Group Cycle Seasonal 5.5.1 integrated and case multivariate the its for illustrate environment. generalized exploration then is We interactive it our plot. Finally into cycle explaining case. distance-based bivariate first a a by to in plot plot construction cycle cycle original distance-based an the of of transformation construction the the of understanding the ease to plot. comparable cycle are original that the patterns in different trend form or can pattern group, seasonal each a within points the and centers group Considering plot. cycle distance-based a to centers plot group cycle original the an of Transformation 5.2: Figure Environment Exploration Interactive the of Features 5.5 the matrix. and covariance data the the estimating for of [Rou85] center estimator the for estimator robust itnebsdccepo.Drn h xlrto rcs hsgoa eeec on a be can point reference global this process exploration the During plot. cycle distance-based ` a ` ` 1 1 ` , . . . , ` ` , . . . , ecntuttebs ftegop,transformation groups, the of base the construct we , days ` B B e iue5.1. Figure see , 8 , spit omn iesre iepo n sn h distance the using and plot line series time a forming points as 8 = 1 B , . . . , n h lblcenter global the and , 5 ..Faue fteItrcieEpoainEnvironment Exploration Interactive the of Features 5.5. 1 5 2 o h atr ihnec ru.Bt,the Both, group. each within pattern the for ` 8 iiu oainedtriat(MCD) determinant covariance minimum b c ` h euti n horizontal one is result The . days 5 2 ` 5 1 sdsrbdin described as , o constructing for MD 75 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 76 5. h utvraeCcePlot Cycle Multivariate The hra h vrg pigfl otshv ml itne rmtegoa etradtherefore and center global the from distances small have months spring/fall average the whereas distance Mahalanobis the calculate we center global the calculate we setransformation (see ausi pigadfl.Ti esnlpteni eetdi h oiino h ru etrlines center group the of position the in reflected is pattern seasonal This fall. and spring in values utvraeseai a xlie nFgr .) edsusti nomto osdet the to due loss actual information an this in discuss sense We make 5.3). not Figure does be in but may explained 5.2, (as center Figure scenario global in multivariate given the example ‘below’ univariate and the ‘above’ in between applicable distinction The non-negative. always eno l aapit 8dy n ihscmie) nti iait xml hs w group two these example bivariate this In combined). nights 8 and days (8 points data all of mean lblcenter global ltw opt h aaaoi itneo h aapit in points cycle data distance-based to the the possible For of is distance years. it Mahalanobis the that over the example is compute for plot, by we granularity, cycle given plot coarser y-axis original the the the over points on in trend by position representation the represented the this see are and of granularity, points feature coarser data One the the values. of plot, their order cycle the original following the line In a on plot. cycle original the in like points data the Distance) representing (Intra-Group For Groups Within Data 5.5.2 plot, cycle x-axis. distance-based the the to in closer are peaks large as have month, appear hottest therefore, months, as and, August winter center like The global months, 5.4a). the summer Figure to the (see distances and plot month, cycle coldest distance-based as the January in like like average pattern and similar summer, a in values in high winter, in values low are there represent to 5.4b’): Figure able (see are temperature we values, distance these using By centers accordingly. group calculated the are dimensions, them two between than more transformation of by case shown In y-axis the on line group-base the of position the as used is distance the Given 5.3a). Figure (see space two-dimensional in centers points group are center global the and centers centers group the to equivalent is which combined, values humidity cycle seasonal daily the the of example example this small In a humidity. consider We and temperature case. bivariate variables: two the in of construction measurements the show we 5.3 Figure In to allows environment exploration loss. interactive information our how this reduce describe and 5.7 Section abstraction data center global to 5.2a), (Figure plot cycle original the In center constructed. global is the plot cycle distance-based the how illustrate and ru center group osrcino hsgnrlzddsac ae yl lt eapyteedsacs(group distances these apply we plot, cycle based distance generalized this of construction B ? dmninldtst nordsac-ae yl lt o xml,cnie h monthly the consider example, For plot. cycle distance-based our in datasets -dimensional = night sgopn h aapit nomaueet uigthe during measurements into points data the grouping is 2 6 3 nFgr 5.3b. Figure in o ahgop–dyadngt–w aclt iait vrg o eprtr and temperature for average bivariate a calculate we – night and day – group each For . ` ` ` 1 day ` , . . . , and ` ` nteyai.Tu,tegoa etri ersne ytexai itself x-axis the by represented is center global the Thus, y-axis. the on ) and sidctdb oiotldse ie ecmuetedsac feach of distance the compute We line. dashed horizontal a by indicated is 5 MD 1 ` B nFgr .) hsi u otefc htteedsac esrsare measures distance these that fact the to due is This 5.2). Figure in night oti lblcne idctdb h ooe etcllns.Frthe For lines). vertical colored the by (indicated center global this to ( ` day h lblcenter global the , ` `, ftewoedtst .. h iait tmeaueadhumidity) and (temperature bivariate the i.e., dataset, whole the of G 1 Σ G , . . . , ) ftedygopcenter day-group the of MD = ihntegroups the within ( ` ` night n h oainematrix covariance the and , `, , Σ ) ftengtgopcenter night-group the of ` 8 h lblcenter global the , - ` 8,...,B day day - otegoa center global the to eapyasmlrapproach, similar a apply we , 8 otergopcenter group their to ` n esrmnsduring measurements and day and Σ seScin5.5.4), Section (see ` night ` n distances and , naddition, In . ` night ` This . ` othe to 8 In . 6 ` 2 8 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ogi ute nihsit h utvraedtst epoieitrcieepoainmeans exploration interactive provide we dataset, multivariate the into insights further gain To itne r ree codn otecasrgauaiy noreapeclna as and days, calendar example our in granularity, coarser the to according ordered are distances itnebsdccepo Fgr .a losfrrpeetn nabtaynme fdimensions. of number arbitrary the an construct representing to univariate for distances exploring allows Using 5.4a) and 5.4.1). (Figure detecting Section plot as cycle in well distance-based T1–T10 tasks as (see support trends to outliers and is multivariate goal and patterns Our seasonal environment. exploration exploring interactive in our of users design the show we 5.4 Figure In Interactions and Design 5.5.3 cycle distance-based the in patterns these differently. of interpreted be interpretation to the need plot. discuss trends we and section pattern following seasonal the the In plot, cycle original the any to for contrast computed be can transformation by distance shown the as before, group, their within bars as represented centers group point respective data each the day of during to distance measures Mahalanobis of the transformation subset compute by the we 5.2c from 5.3) Figure (Figure as in example them group bivariate draw one the to for In allows the illustrate center representing we group line like this horizontal group, to each a point within with each plot chart of line bar 5.2, distance series Figure the time from Computing a illustration center. form the the group cycle group to Using each original similar centers. in is the points group plot in data the cycle for the arranged distance-based lines and the horizontal binned to the transformation of are the construction points data data univariate the With how plot. illustrate Cleveland we by 5.1 design Figure original In the up picks lines. we also vertical thus, This using and points. [CT82], points, Terpenning connected data and actual of of instead instead bars distances use represent to we chose plot cycle original the to matrix. covariance contrast the in captured data the of spread the illustrate ellipses The (transformation center abstraction. to applicable global are The the the that of to measures distances centers distance example. Mahalanobis transformation group (e.g., on bivariate the center based of a group is respective distances right with the the to plot points on cycle data plot bivariate cycle distance-based bivariate the the of of construction Construction 5.3: Figure G a night8 G night1 G night4 ` night 6 3 : ` MD 30H ( ` ` , ` night G =86ℎC ? ; dmninlsae sakyapc ple nordata our in applied aspect key a is space, -dimensional ∈ `, , - , Σ ? 30H MD ) dmninlmliait aast oee,in However, set. data multivariate -dimensional ..Faue fteItrcieEpoainEnvironment Exploration Interactive the of Features 5.5. n night and ( G ; G ` , day7 30H G ` day8 day , G Σ G < day2 ) ∈ and 6 - 1 Mahalanobis Distances =86ℎC nFgr .b sdiscussed As 5.3b. Figure in MD 6 2 b 6 and ( where , 1 G n nMahalanobis on and ) 6 6 < 1 2 ` , : : 6 MD MD 3 =86ℎC .Teuaeof usage The ). ,< ;, ( ( G ` day2 day , Σ = `, , ` , ) 1 These . day , . . . , Σ ) , Σ ) 5 8, 2 . 77 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 78 5. h utvraeCcePlot Cycle Multivariate The eecd he ye fotir sn oo.W eetdtredsigihbeclr according colors distinguishable three selected We color. using outliers of types three encode We epoiemlil ikdveswt ihihigtigrdb oeigadslcin including selection, and hovering by triggered highlighting with views linked multiple provide we b olwdb h nvraetm eisln lt() nti cenhtw s space-efficient use we screenshot this In (c). plot line series time univariate the by followed (b) h otmlf hw h oto ae o neatv xlrto d.Clrecdstetp of type the encodes Color (d). exploration interactive for panel control the shows left bottom The oepoepten n ulesi h utvraesaeadteudryn nvraedimensions, univariate underlying the and space multivariate the in outliers and patterns explore To iulztos( ) ofi oevralso h cen ntecnrlpnl() h srcan user the (d), panel control the In screen. the on variables more fit to c), & (b visualizations 5.5.1. Section in example as used temperature, variable oteLab oo pc n aiie h ecpuldsac fteslce oos compare colors, selected the of distance perceptual the maximized and space color L*a*b* the to u neatv xlrto niomn sidpneto pcfi ehdfrcomputing for method specific a of independent is environment exploration interactive Our magenta by represented multivariate cyan by [ scrollable. To are plots visualizations. univariate the the of dimensions, each more in of supported exploration are the selection allow and Highlighting selection. multiple the view, detailed plots. the line In detailed more parameters. by adjust replaced and are view sparklines sparkline and view detailed between [ switch sparklines two show [ allows we al. This 5.4, et Figure Bögl plot. In by line used values. series is missing time it containing a like series dimension, to univariate next same plot the cycle on original multivariate perspectives the the provide of multiple we variables the multiple series, and the time visualization visualizing distance-based For the representations. between univariate forth underlying and back switching for allow that of plot cycle original the is (b’) values. exploration. outlier-boundary detailed specify for to bar used scroll are a sliders with and view outlier detailed more the a on to plots changed small be The can overview. (b+c) comprehensive side a provide right plot to cycle order original in an representations, sparkline plots: univariate underlying the to next with (a) views plot multiple cycle coordinated distance-based the employs the utilizing prototype environment The exploration plot. interactive cycle our Mahalanobis-distance-based of implementation Prototype 5.4: Figure HSA + 10  .Tetretpso ulesadrsetv oosae 1 nvraeotir represented outliers univariate (1) are: colors respective and outliers of types three The ]. 2 utvraeotir ersne yorange by represented outliers multivariate (2) ,  . d a b’ b  n 3 ulesi nvraeand univariate in outliers (3) and , Tuf06 c o h univariate the for ] BFG + 15 o time for ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. euetemda ocmuetegoa center global the compute to median the use We eipeetdteitrcieepoainevrnetuiiigtedsac-ae yl plot cycle distance-based the utilizing environment exploration interactive the implemented We epoiea neatv ldri u xlrto niomn o eetn h unie see quantile, the selecting for environment exploration our in slider interactive an provide we ahvariable each grspoie ntesplmnaymtra n ee oorpooyeaalbeoln at online available prototype our to additional refer with and scenario usage material the http://cycleplot.net supplementary following T1–T10 the in tasks reader in related the provided the support to figures We refer to we 5.4.1. scenario scenario, Section usage multivariate usage in the in described this Throughout values use outlying data. We and series patterns exploration time interactive explore seasonal scenario. the to how possibilities usage and the data a advances real in visualizes environment plot prototype cycle distance-based the the how apply illustrate and prototype a in Scenario Usage 5.6 draw to namely context, points. exploratory highlighted an these in to more user the thus the of is of outlyingness detection attention of outlier the indication of an goal other. as The each serve points. from still independent data distributions not these not thus of likely and quantiles most time-dependent, the will However, are distances observations the the for because normal) met (chi-square, be distributions assumed the that Note is data 5.4d. univariate Figure the If outlier. univariate of as quantile identified the is compute it we quantile, distributed, certain normally a than higher is point of center covariance the If above. described as approach similar a matrix use we detection, outlier univariate For the apply we Therefore, matrix dimensions. covariance of global points number the the compute to to compared method group MCD each in points data matrices covariance of separate of estimation the in instability groups matrix covariance the estimate Calculations Robust 5.5.4 prototype. the our In for detection. used outlier calculations for the used on boundaries details the give detection. adjust will outlier we can for section, user parameters following the the 5.4d adjust Figure to allow in literature, example, and statistics For space in multivariate exist as methods well of as lot univariate a so, doing [ For see outliers. multivariate or univariate fteudryn nvraedsrbto.I aeteaslt au facnee nvraedata univariate centered a of value absolute the case In distribution. univariate underlying the of matrix ` 8 = BG05 colMedian - Σ ¯ Σ Σ o aae with dataset a For . 8 - setmtdrbsl,tedaoa ossso outetmtso h variance the of estimates robust of consists diagonal the robustly, estimated is sn h etrddt points data centered the using rt etrtedt onso hi ru einadetmt lblcovariance global a estimate and median group their on points data the center to or , ¯ , , HA04 ` A A = = ( - 0 o noeve.I u niomn,w ihih h dnie ulesin outliers identified the highlight we environment, our In overview. an for ] 1 8 n h variance the and , ) ? , . . . , o osrcigtedsac-ae yl lt hr r w osblte to possibilities two are there plot, cycle distance-based the constructing For . . nthe in ⌊ = Σ − B 1 ihrt siaeasprt oainemti o aho the of each for matrix covariance separate a estimate to Either . ≫ ⌋ ? dmninldtst sn h etrddt points data centered the Using dataset. -dimensional f ? A 2 ecmueteotirbsdo h eetdquantiles selected the on based outlier the compute we , n a eiefrayo hs ehd sneeded. as methods these of any for decide can one - ¯ = - 8 − # ` ` ( ,f `, 8 etn ihsm aast hwdan showed sets data some with Testing . = = Σ colMedian 2 Σ ) iefrtemliait boundary, multivariate the for Like . 8 hsi u oa fe o number low often an to due is This . covMcd ( - ( - ) ¯ n h ru centers group the and , ) ihalcnee data centered all with ..UaeScenario Usage 5.6. f - ¯ A 2 the , for 79 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 80 5. h utvraeCcePlot Cycle Multivariate The JavaScript hncmaigt h te eki umr h srdtcsti diinlvrainwt larger with variation additional this detects user the summer, in peak other the to comparing when an represents basically which center, global the than extreme more average on are months winter the of prototype the [ and R dataset in the done of was preprocessing the separated we concept, of proof a As priuaemte) eprtr,adhmdt rmtect eu grgtdt monthly to aggregated Seoul city the from humidity and temperature, matter), (particulate SO diseases), cardiovascular by caused (deaths Dc,Jn,&Fb) httedt auswti h ru aymr hni tes Especially, others. in than more vary group the within values data the that Feb.), & Jan., (Dec., h srfis at ogta vriwo h esnlbhvo ftetm eisi h multivariate the in series time the of behavior seasonal the of overview an get to wants first user The ules noreape h srcno n n ru htdvae rmtesaoa cycle seasonal the from possible deviates as that December, group in any several find and cannot June, user in the example, first others, our January, Amongst In in T4). year (Task outliers. last easily outliers can the possible user be considers may the user that highlighting the seem bars, without months large Even by These March, dimensions. values year. and extreme all first June identify across the months months for the except stable at quite length, looking be same When groups to the the roughly T3). within with & variations distances T2 (Tasks the spots detail compares he/she more user in the Next, groups across T4). (Task and center group the months to peak distances the of some in tendency a in identifies position He/She their to T3). according & groups T2 the (Tasks smooth cycle within behavior similar seasonal the the a considers variable, follows each then abstraction of user Mahalanobis-distance-based representation The plot the behavior. cycle of original the pattern transitions in seasonal season the the the Considering the of than peaks details). low January more and to for high closer material between are supplementary global months the the winter (see as other month center the summer month that one shows Selecting January, and e.g., (October). summer point, autumn that reference or means (April) This spring winter. the e.g., in month, 5.4a, average as Figure well see as months, summer the in of peaks are centers there group that the identifies to user According T1). Task (compare space appropriate using by application, web them. a connect in to visualization libraries interactive the with R in computations [ library (HEAT dataset the of preprocessing The environment. exploration interactive to refer we goal, this achieve to 5.4.1. tasks Section particular in the described on T1–T10 details tasks For the dataset. the of values outlying above. described Goal/Tasks. dataset the in outliers the and trends, User. cardio variables, 6 of averages. subset a select we illustration, the measurements For daily of caused (2000-2007). consists deaths years dataset of The several number for diseases. the as respiratory well and as diseases for cardiovascular data by project meteorological R with together the indicators in available quality is air It [LOK13]. [ Korea computing South statistical in cities major for Dataset. data meteorological Pollution Air & Mortality n uligesvle a oea ecie bv seScin .. n ..) The 5.5.4). and using 5.4.3 application web Sections as (see done was above environment exploration described interactive as the of done implementation was values outlyingness and sapsil sri ulchat ffiil h nlzsadepoe h esnlpattern, seasonal the explores and analyzes who official, health public a is user possible a As hr eipre h rcmue aafie o uuewr,oemycmiethe combine may one work, future For file. data precomputed the imported we where , h vrl oli ogtisgt notesaoa atrs rns xrm,and extreme, trends, patterns, seasonal the into insights get to is goal overall The C20 R .Tecmuaino lbladgopcnes aaaoi distances, Mahalanobis centers, group and global of computation The ]. C20 R slbaynamed library as ] h aae saottemraiy i olto,and pollution, air mortality, the about is dataset The 2 Slu ixd) NO dioxide), (Sulfur HEAT [ LOK13 .Tedtstcnan several contains dataset The ]. 2 Ntoe ixd) PM dioxide), (Nitrogen LOK13 10 ]) Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hncagn h hehl rmte09 ote09qatl T0.Ti lost interactively to allows This (T10). quantile 0.9 the to 0.95 the from threshold the changing when T) niaigotir nbt,ui n utvraesae h srimdaeyrecognizes immediately user The space. multivariate and uni- both, in outliers indicating (T7), seasonal multivariate the of out stands that month, whole one imagine can one but T5), (Task h sratvtstehglgtn fotir n eet eti uniefrteuiait and univariate the for quantile certain a selects and outliers of highlighting the activates user The prahrgrigteseie ak,adgv notoko uuework. future will on our we outlook of section an performance give the this cover and In depth, tasks, more specified in design. the limitations regarding the approach and and benefits of construction and discussion our the benefits this of for the continue design discussed decisions the briefly specific and we of sections visualization distance these the limitations a In for of requirements construction 5.5). the (Section the with accordingly argued together environment, and so 5.4 do Section distance-based to utilized relevant in the tasks measure of the abstracted construction We the outliers explained plot. and and cycle patterns series exploring time for seasonal environment multivariate exploration in interactive our introduced we far, So Discussion 5.7 overall the series. in time insight seasonal getting multivariate for the interactive vital of the is picture in plot is combined cycle it but distance-based Obviously, seasonal visualizations, the the space. environment, univariate explore multivariate underlying exploration to the the able in consult directly is also groups user to across required the and plots, within cycle patterns original and multiple pattern only using to contrast In are. outliers the extreme outliers, how as about highlighted impression month only an in are get 5.4a, bar Figure first see the example, plot, cycle For borderline distance-based are decreased. the that is in By points boundary Jun. data the Dec. and the when Mar. in track outliers, not can Nov. as user in indicated the only are slider, outlier and the univariate with a outlier boundary is univariate also outlier temperature a user the variable also changing The the is (T9). but cardio center Dec., variable the and the Nov. to outliers, in distance plot, multivariate large cycle being with distance-based besides points Nov. the that data in in recognizes as points that recognized data shows be extreme them also two Selecting can detects they magenta. user the in highlighted cardio, variable Dec., the and for plot cycle original the variables, some in NO that and reveals the exploration PM highlights This which and investigate plots. (T8), cardio further univariate outliers e.g., To the the in selects outliers. points he/she multivariate data space, all corresponding univariate are the year in last outlyingness the the in magenta Nov.–Feb. in months highlighted the months cycle winter that original these the in points in data pattern more same are the there exceptionally detects that user an recognizes the and and that, plots Oct.–Apr., Knowing considers months user Dec.–Feb. in the in only example, number outliers For large allows multivariate that (T6). are outlyingness easily there outliers the that 0.95 univariate in interesting as the patterns well of selects as overview user multivariate an The gets detect to and points. 5.4, data Figure in the shown of quantile, outlyingness indicate to boundaries multivariate pattern. 2 hyaehglgtda ulesi oh nvraeadmliait pc.Loigat Looking space. multivariate and univariate both, in outliers as highlighted are they , 10 hyaeidctda utvraeotir ny e nohr,eg,SO e.g., others, in yet only, outliers multivariate as indicated are they , ..Discussion 5.7. 2 81 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 82 5. h utvraeCcePlot Cycle Multivariate The hl h dao sn nr-ru s ne-ru itne sawl-nw taey eddnot did we strategy, well-known a is distances inter-group vs. intra-group using of idea the While sacneune l itne r ersne bv h ru etrlns(e iue52.We, 5.2). Figure (see lines center group the above represented are distances all consequence, a As ieyslcigtegauaiylvl codn otesaoa iesre,ti btato still abstraction this series, time seasonal the to according levels granularity the selecting wisely hs os h nomto bu h xc oiino htdt on,frtesk fbigal to able being of sake the for point, data that of position exact the about information the loose thus, aecnb ple o ru etr,b enn ru eeec on instead. The point reference group. reference group this a to defining by groups centers, other the group for relate applied and group be x-axis, whose can the month same on reference lie a potential then i.e. Another would group, system. line representative coordinate center one could multivariate choosing point the be reference of would This Instead point, point zero point. point. reference the reference reference basis, global different the a a instance, define to for can centers be, we line. group center horizontal the global a relate mathematical on to aligned a is seasonal centers of problem univariate group this in show tackle would pattern to same plot seasonal way cycle The no One original is center. the there pattern global also case seasonal the data, In no around series plots. is sphere time cycle there multidimensional original if a for judge span true to centers is difficult 5.3b group drawback then Figure the is in if lines It or base all level. the at horizontal therefore, same and the groups, on both nearly group to are all distance where the Mahalanobis representation on same a position the in have same to result exact to distance would the same This on the lie centers. has lines group center base the global of the all where even setting or a multiple to lead may however, distances, Using line and plots outliers. cycle like original anomalies, investigate in to and dimensions thus, intra- single and multivariate the of plots the of context connect visualizations in to the seen possible to be distances is to inter-group it needs explorational where series full environment, time The exploration multivariate interactive 4.2). for the Section plot (see cycle series distance-based time our multivariate of for power plots cycle in applied this find and account, into context. time temporal of the structure retains the of taking each By and views. plot linked cycle multiple distance-based in the dimensions environment, both, single exploration in interactive the investigations our further multivariate allows loss from information 5.5.3, abstraction this Section of reduce see type To a Our dimensions. also less lost. is to distances, is space periodicity, Mahalanobis the the By especially using time, i.e. visualizations. of approach, 2-dimensional context for the second and reduction, first dimensionality e.g., applying components, principal the of meaning [ this analysis in component [ method used principal scaling commonly like One reduction, space. dimensionality multivariate is in this case represent be to three-dimensional difficult to very and appears is two-dimensional, it for position space), even actual (maybe the space about one-dimensional information for the important While dimensions. multiple represent definition. by number non-negative a is distance distance-based A the differently. therefore, interpreted of and be representation groups), to different within needs a patterns plot and using causes cycle pattern plot This seasonal cycle the based 5.5). (i.e., distance and patterns a from 5.4.3 constructing independence Sections by the (see achieved is distances is plot Mahalanobis This cycle distance-based dimensions. the of construct number we way the the of benefit main One Limitations and Benefits 5.7.1 BBH11 .Hwvr n iiaino hstcnqei,ta ti icl oitrrtthe interpret to difficult is it that is, technique this of limitation one However, ]. H ai.I iue53tegoa center global the 5.3 Figure In -axis. Agg13 rmultidimensional or ] ` sclose is Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. omlcs-eetaayi forapoc ntebsso hnadGln[ Golan and Chen of basis the on approach our of analysis cost-benefit formal A (e.g., data the within trend strong very a be would one, previous the to related overall problem the A preserving while groups, within variations smaller the exploring for allow would distances large very are there which in case a by demonstrated is limitation possible Another rus(akT) u nyfrec ftevralssprtl.Frpcuigteepten and patterns these picturing For separately. across variables and cycle the seasonal of each the for to only group within but each behavior T3), within the (Task one changes identify groups 5.4), compare T1), Figure and (Task of T2), cycle (b) (Task seasonal column group the in each of plots patterns the overall (only identify variable each easily our for can of plot cycle benefit original the an explain only 5.4. Using better Section should in outlined discussion task following the the performing but when paper, approach this of scope the Performance Task 5.7.2 could again which analysis, plot. detailed cycle distance-based more our a for using by component seasonal [ supported the (see be separate are series There to time used multivariate components. for be other adjustment also the seasonal analyze to to approaches order usually recent in is also component adjustment seasonal Seasonal the remove seasonal-component. to the irregular- applied for or plot error- cycle time the distance-based In and our seasonal-, well. using trend-, as into plot cycle up original split [ the is component affects within series that time patterns problem the the a decomposition distort is would series again, years This, the over months. trend of steep groups A years). several over data monthly groups groups. the within between for distances comparison scale the and for one picture scale Using another scales. and center, separate global use the to transformations, to be data centers would using group solution propose of Another we distance scale. problem, also In log this can tackle a this To However, center. as plot. such global visible. cycle be the original not to would the centers in groups happen the group within of variations distance small the within the points consequence representing data lines the vertical of long variation ( mainly small scale only common and A centers groups. group the the and center global the between outlier using in of aspects because local possible this is considering groups) [FRGTA14]. done across work see few detection, as relatively well is aspects as There different time, of distances. periodic exploration Mahalanobis and The linear groups. (in of across investigation locality even further of and a of within allows flexibility points points, The data reference between abstraction. group relations as based well distance helps as the feature global by This adjustable introduced dimensions. interactively loss single point information the a the to the of reduce connection of relations to the one the and in explore space or further multidimensional plot to the cycle allows in distance-based linking the enable and in to Brushing either views representations. points linked univariate multiple multiple selected or the the single use to of We groups selection other environment. the exploration all ( interactive of center our relation group in the any investigate group select therefore to and possibility point the reference introduced global we above, mentioned issues both For BD10 .Teecnte eaaye eaaeywt prpit iulztos e.g., visualizations, appropriate with separately analyzed be then can These ]. H ai)wudtu edt itnebsdccepo htshows that plot cycle distance-based a to lead thus would -axis) GM13 CG16 sbeyond is ] ) htcan that ]), ..Discussion 5.7. ` 8 as ) 83 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 84 5. h utvraeCcePlot Cycle Multivariate The ihnec ru Ts 4 n h dnicto fwoegop htdvaefo h seasonal the from deviate that groups whole of identification the and T4) (Task group each within hl rusta eit rmtesaoa yl Ts 5,oeas eurstecascln plot line classic the requires also one T5), (Task cycle seasonal the from deviate that groups whole h anavnaeo h itnebsdccepo steageae vriwo l dimensions all of overview aggregated the is plot cycle distance-based the of advantage main The iw ncmiainwt h wfl iulzto fec aibei lsia yl ltand plot cycle classical a in variable each of visualization twofold the with combination in views ftedsac-ae btato rmtemliait aa gi codn oteproiiyin periodicity the context to in according locally again third, data, and multivariate variable, the each from for abstraction plots according cycle distance-based locally the second, original several variable, of point. each the data for in each scale periodicity of time the investigation linear to the normal additional enables the the it to and that according representations is, locally First, different plot cycle of distance-based combination the this of of abstraction benefit Another plot. line multiple full a coordinated The and view. highlighting, condensed interactions, a the in by outliers achieved like only anomalies is and power exploration patterns the directly show to combined, are that dimensions. outliers multiple as or well single as only in dimensions, of outliers more combinations univariate or all and one for multivariate in tasks both, the outliers all univariate eases affects only which This or dimensions, analysis. multivariate single the the to of perspectives, data different memorization multivariate external of abstracted views the the multiple to from coordinated possibility switch these the easily in Only to points allows groups points. data other reference the to (group) highlight compared local distances and and the select to global also changing but e.g. interactively periodicity, locality, as the of well distance-based of aspects as The position different same with outliers. the relations multivariate in points points are that data data but points the variables investigate data to investigating single allows for the allow plot of cycle not any does in only, univariate outliers plots Highlighting line no variables. and are the plots of cycle each original that in in perspective outliers outliers additional of an connections year adds the last chart, investigate the line to within allows traditional values and larger plot the cycle representations, e.g. different time, linear possible in NO like other variable anomalies, each of and to peaks relate they identify how also and to outliers, 5.4c) (Figure variable each of of representation identification and going T4) are (Task that groups within detection only outlier values univariate extreme/outlying and of detection multivariate the for beyond when T6-T10 Tasks points, the single consider plot. we of cycle If track distance-based keep the and to variables beneficial single also the is between is 5.4b) focus (Figure interaction the plots switching of cycle combination original support a several The T1-T5) the (Tasks and tasks necessary. 5.4a) these may (Figure For variables, plot 5.6). individual cycle Section many distance-based (see mentally or the outliers summarizing single multivariate by in be Also outliers not are variables. may that or single points in data just not variables, outliers possibly individual or are the variables outliers the multivariate separately, of variable any each in for outliers done be can T5) (Task values cycle extreme/outlying down detecting break of to tasks possible the not While is as it well variables. but as individual space, transitions for multivariate information and the any the peaks identify in identify to when can similarities possible inter-group One hand, is and only. it other intra- abstraction tasks, can the the these one On of for variable, behavior 5.4a) and space. the (Figure patterns plot multivariate of cycle in each distance-based in pattern the similar only patterns using a and aggregation. mentally mental transitions by derive smooth extend or and certain imagine similar a to very this are do there can If one space, multivariate the within behaviors 2 n SO and 2 nFgr .badc hssd-ysd rsnaino h aedt in data same the of presentation side-by-side This c. and 5.4b Figure in Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. hc nbe h eeaiaint nabtaynme fdmnin.Wt u interactive our With dimensions. of number arbitrary an to generalization the enables which hr r eea usin ihrgr oteuaiiyo u prahta eanoe.First, open. remain that approach our of usability the to regard with questions several are There xlrto niomn ecmiesaitc n iulzto ehiusadblnetheir balance and techniques visualization and statistics combine we environment exploration eis ihrsett h tutr ftm n h eain mn utpedimensions. multiple among relations the and time time of seasonal structure multivariate the in to outliers respect and with patterns series, analyzing visually for limitations and benefits distances, Mahalanobis the using step abstraction additional visual an [ a Cleveland includes retains by plot and plot cycle revisits cycle distance-based It identifying original for series. the time to plot seasonal similar cycle multivariate representation in distance-based outliers a and patterns utilizes seasonal environment exploration interactive Our Conclusion 5.8 test the demo. to 5.6), online possibility demonstrate (Section an the in we scenario and prototype usage material, being, the the supplementary time provided in the walk-through the above, For the and discussion by tasks comprehensive study. approach of one number our the of than to applicability more Due the require focusing performance. will task studies this and user complexity, plots formal the by their of and tackle examples, interpretation to case correct plan We use the additional on questions. providing these by answer open first can an issue users remains this real this with identify periodicity, evaluation to involving future users series Only helps time above question. discussed multivariate abstracted plots For the cycle be regarding different outliers. to performance of needs multivariate the combination and about confusion the know cause not whether may do and we This tasks Furthermore, plot. cycle user. distance, the original multivariate by the a learned to using design similar abstraction very an a encoding uses although plot, cycle distance-based the Work Future 5.7.3 plot. cycle distance-based the Cle94 .Tecntuto fthe of construction The ]. ..Conclusion 5.8. 85 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 5. 86 h utvraeCcePlot Cycle Multivariate The upeetr aeilt upr h edri h sg cnrodsusdaoei eto 5.6. Section in above discussed scenario usage the in reader the Scenario support Usage to material for Supplementary Material Supplementary Appendix: 5.9 Distance with variati with additional this detects user the summer, pea in other the to comparing when Especially, in than others. more vary group the within values data the Feb.), th Jan.,(Dec., & months peak the of some in groups according to their position in the seasonal seasonal cycle the in position their to according the groups within behavior the considers then user The the the underlying univariate cycle plots. plots. cycle univariate underlying follows follows point either to the the to either reference point global the change to allows prototype Our of the season in in season the low of and high between transitions the Considering each variable, variable, each distance Selecting for example January as global reference p reference as January global example for Selecting than than - group group based larger distances distances larger the the (cf. tasks T2tasks (cf. cycle a similar smooth smooth similar a - summer months in the multivariate space. Likewise s Likewise space. multivariate the in months summer based abstraction from multivariate space multivariate from abstraction based plot centers iue55 upeetr aeilfrteuaeseai,pg 1. page scenario, usage the for material Supplementary 5.5: Figure the the the original original the & T3). T3). & . . seasonal seasonal global global to the group center (T4). center group the to e She He/ behavior compared to the the to compared behavior center (default) (default) center pattern pattern cycle cycle identifies a tendency a tendency identifies plot representation of of representation plot of the the of or to any of of any to or ahalanobis M winter on on peaks peaks at at oint shows that that shows oint k k

- Distance Distance Distance electing July, summer months are closer. closer. are months summer July, electing - - based based - based cycle cycle cycle the other winter months are closer to January January to closer are months winter other the plot plot plot Distance - Based Based C summer ycle Plot ycle Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. to be quite stable months across all dimensions. across all months stable quite be to first year first distances with roughly the same length, except for for except length, same the roughly with distances multivariate multivariate there that interesting finds user the example, For outliers. possible Dece in several June, and in first January, in year t considers user the others, Amongst (T4). outliers possible be may that bars, large by values extreme ident easily can user the highlighting Even without months the at looking When T3). & (T2 detail more in groups across g and the within variations the compares user the Next, exceptionally large number in Dec. in number large exceptionally Knowing that, the user detects the same pattern in in pattern same the detects user the that, Knowing data points in these winter months highlighted in in highlighted months winter these in points data space. . . According to this pattern, these these pattern, this to According iue56 upeetr aeilfrteuaeseai,pg 2. page scenario, usage the for material Supplementary 5.6: Figure outliers 2000 only in months Oct. months in only June and and 2000 arch M – Feb. , he/ she spots spots she he/ , – Apr., and an an and Apr., months seem seem months Original Original mber, as mber, magenta are are he last last he ify ify the the roups roups the the ..Apni:SplmnayMtra o sg Scenario Usage for Material Supplementary Appendix: 5.9. original original (T7), indicating outliers in both, both, in outliers indicating (T7), C ycle Plot for each Variable each for Plot ycle Distance Distance Distance cycle plots and recognizes that there are more more are there that recognizes and plots cycle - - - based based based cycle cycle cycle plot plot plot no outlier during this period this during outlier no uni - and multivariate multivariate and 87 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 5. 88 h utvraeCcePlot Cycle Multivariate The R R prototype. the (Nov. 2002 and Dec. 2002). and 2002 (Nov. Jun. in the ar. the Jun. in M month in bar first the example, For decreased. is boundary the when as outliers, indicated are user can track the data points that are borderline borderline are that points data t the slider, track can the user with boundary outlier the By changing as to the 0.9 quantile (T10). This allows to interacti to This allows (T10). quantile 0.9 the to an impression about how extreme the outliers are. are. outliers the extreme how about impression an Looking at the the at Looking multivariate outliers multivariate temperature is an univariate outlier only in Nov. n Nov. in only outlier univariate is an temperature Dec., highlighted in in highlighted Dec., recognized as data points with with points as data recognized (2007) (2007) last the that recognizes immediately user The emark 1: the colored arrows ( , ) are not pa ( , not are ) arrows colored the 1: emark emark 2: selected items have their border increased border their have items selected 2: emark Distance outliers - based in Nov. in iue57 upeetr aeilfrteuaeseai,pg 3. page scenario, usage the for material Supplementary 5.7: Figure cycle , when changing the threshold from from threshold the changing , when distance plot – original original Feb. are all multivariate outliers. multivariate all Feb. are - magenta based based , , the variable cardio is also an univariate outlier i outlier univariate an is also cardio variable the cycle plot for the variable variable the for plot cycle cycle cycle . Selecting them shows that in the the in that shows them . Selecting large distance to the center center the to distance large plot plot are only highlighted highlighted only are year year the 0.95 0.95 the vely get get vely rt of of rt cardio and and ot in Dec. Dec. in ot he he and and

, the , the (T9). The user also recognizes that besides being being besides that recognizes also user The (T9). Distance Distance user detects two extreme data points in Nov. and and Nov. in points data extreme two detects user distance - - based based cycle cycle n Nov. and Dec., but the variable variable the but Dec., and Nov. n plot 2007 plot - based based 2007 cycle plot, they can also be be also can they plot, cycle 2007 2007 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. eaiuainadCoda and Recapitulation atIII Part 89 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. atflo ieadiaeut feffect. of inadequate and time of wasteful h raetvleo itr swe it when is picture a of value greatest The “ ae itrsta aet egn vrwt edn ls osetemi on are point main the see to glass reading a with over gone be they to space have the that worth Pictures not take. frequently us—are reassure to blankets’ know—‘security should data xetdt see. to expected .. Weblee tee sacerdmn htpcue ae nepoainof exploration on based pictures that demand clear a is [there] believe, [W]e [...] force ” hi esgsuo s itrsta mhsz htw already we what emphasize that Pictures us. upon messages their forces onW ue,17.[u7,p vi] p. [Tuk77, 1977. Tukey, W. John st oiewa enever we what notice to us Conclusion CHAPTER 91 6 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 92 C 6. onclusion //commons.wikimedia.org/wiki/File:Covid-19-curves-graphic-social-v3.gif hc ewsl hs h setrtoo h rp omk h rp oklk krceig(see skyrocketing like look graph the make to graph the of ratio aspect the chose wisely while he representations which visual some questioning for allow to important also is literacy Visual seFgr .) sn uhilsrtv rpswsipratt estz h ulct the to public the sensitize to important was graphs illustrative such Using 6.1). Figure (see lk . nentoa ies C-YS) mgsetatdfo i nmto eree rmWkpdaCommons: Wikipedia from retrieved animation gif a from extracted Images (CC-BY-SA). license International 4.0 Alike er sotndn napitdgahc otsc nomto sdslydi a iePlayfair like way a in displayed is information such Most graphic. printed a in done often is years iue62lf) ut [ Tufte left). [ 6.2 spending Figure government about [ graph graphs” his statistical to of “inventor refer the we Playfair, to back William Going 280], well. as skepticism some teaching (2020). Morris Toby and The Wiles Siouxsie situation, system. Morris. Source: pandemic healthcare and Image this the Wiles for of measures cases, collapse COVID-19 precautionary the preventing in of thereby people curve engage the to flattening intended was about illustration comic A 6.1: Figure situations. pandemic such in of public power general the the central control how and and convey planning to and how is making sovereignty recognized decision data also for we use are Furthermore, to predictions conveyed. agenda, “good” be their visual important to communicate teach message to to the charts important of such is representations using it adequate those Therefore, for and them. importantly, data more interpret of epidemiological and, to collapse literacy and alone a statistical left prevent with to being order deal often in to while careful had graphs more people be Suddenly, to system. and healthcare taken the measures the Twitter of on effects and published necessity was Morris Toby channels. and media Wiles different Siouxsie all by chart on comic the propagated the in famously, was curve” most line Perhaps the the watching “flatten the was message Everybody with the together while cases. presented progress exponential of were the number intervene what predicted to of in measures predictions outcome Different expected with together future. charts the for these medical meant in they politicians, growth taught is, by growth and presented exponential illustrated what are it forgotten and had at had now media looking somebody the If were in People epidemiologists. omnipresent and are professionals, into apart. steeply charts falling increasing These (COVID-19) started SARS-CoV-2 sky. it by the infections knew of we numbers as showing world charts global line the 2020, spring In Tuf83 etoe htdslyn oenetsedn n etover debt and spending government displaying that mentioned ] I’ h n ftewrda eko t[...]” it know we as world the of end the “It’s ulse ne h raieCmosAttribution-Share Commons Creative the under Published — ...(1987) R.E.M. ls iie nSp.2,2020) 26, Sept. on visited (last , rmteAlbum the from Pla01 lt 0,in 20], Plate , Document Fun37 https: p. , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. paety iesre nlssadpeito swl siptto n ule eeto r an are detection outlier and imputation as well as prediction and analysis series time Apparently, [ wars Catr1,w oiae hs hlegsi iesre nlss ete rpsdour proposed then we analysis; series time in challenges these motivated we 1), (Chapter iulaayistcnqe n prahsi hpes25t rvd upr nsligthem. solving in support provide to 2–5 Chapters in approaches and techniques analytics visual et 4 00;Tescn mg a eree rmWkpdaCommons: Wikipedia from retrieved was Playfair_interest_national_debt.png image second The 2020); 24, Sept. h neto oilsrt h syoktn oenetdb”det nnigteBiihcolonial British the financing to due debt” government “skyrocketing the illustrate to intention the elwt eti set ftecalne nsaitcltm eisaayi.I h Introduction the In analysis. series time to statistical approaches in and challenges techniques the analytics of visual aspects introduced we certain with dissertation, deal this of body main the In Summary challenges. 6.1 recent and analyses visual historic in of users context support the to into techniques them analytics embedded visual and applied so we doing dissertation, this In topic. important archive.org: [ Playfair graphs, different two in England of debt National 6.2: Figure audience. the of interpretation different the a misleading tell best to at way is a and in does visualization actually the data tweak the helps 45 as practices of story best angle such correct an Ignoring to the [ segments ratio. Cleveland with local aspect graphs; concerned banking series centrally of time concept one appropriate of the consider only integrity introduced to graphic the for important not context is was and it Tufte ratio reason, aspect this data. For with the right). data of 6.2 the Figure showing representations (see graph the ratio another in Yet have aspect to better intention. integrity a this Playfair’s underscore to to referred Tufte debt, attached paragraph, text with next wars the [ folly” colonial in “ruinous financing that, for the noted government, about also Tufte British polemically wrote increasing. Playfair rapidly graph, like this look to them make to do, to used Tuf83 https://archive.org/details/PLAYFAIRWilliam1801TheCommercialandPoliticalAtlas .65]. p. , mg ore lyar(81 [ (1801) Playfair Source: Image ls iie nSp.2,2020) 24, Sept. on visited (last Pla01 lt 0&p 129]. p. & 20 plate , https://commons.wikimedia.org/wiki/File: ° Tuf83 Pla01 o hoiga adequate an choosing for .6] enn the meaning 65], p. , .Telf n serves one left The ]. is mg eree from retrieved image First Cle93 , Cle94 ls iie on visited (last ..Summary 6.1. also ] 93 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 94 C 6. onclusion Scin231,adterltdtss iemdlseicto,mdlfitn,admdldiagnostics model and fitting, model specification, model like tasks, related the and 2.3.1), (Section iulaayisapoc,w obn iuladaayia ehd oitgaeteprediction the integrate to methods analytical and visual combine we approach, analytics visual fmlil uso aaee dutet,rcmuain n nlsso h ucm,the outcome, the of analysis and recomputation, adjustments, parameter of runs multiple of nScin23 h ls fAIASRM oesi ple nmn ieetdmis such domains, different many in applied is models ARIMA/SARIMA of class The 2.3. Section in a eapid h su ihmsigvle o ttsia ehd sta hs sal rely usually these that is methods statistical for values missing with issue The applied. be can n oeatn [ forecasting and muaino isn ausi eidcTm Series. Time Periodic in Values Missing of Imputation 1 sub-questions answer questions. to research used of were set chapter our this from from model 2 complex findings less and the a and in results discussion representation demonstrated The adequate and and selected. representation integration graph this Qualizon that a scenario directly usage using capabilities a proposed We in prediction judge models. adequate and multiple an examine for or to decision one user for the the in enabling guidance by provides model This parsimonious and process. visual our selection In interactive model models. the the benchmark in in or capabilities values predictions actual only from deviations including compare process When to the difficult into still apply 3. model is to the it Chapter is interface, of in goal capabilities the prediction selection the Because model integrate of variations. we prediction, small for show model only a may such 2, Chapter in process selection approach systematic [ [ the Jenkins particular, and In Box by forecasting. proposed with synonymous is which [ prediction, 1927 Yule produce to back dating data, series Diagnostics. Model in Prediction Using our validating and answering for 2. basis and the 1 are and sub-questions evaluation questions—specifically, scenarios prototype the research usage the of using and results walk-through process The detailed the a feedback. applied provided experts’ We and dataset experience. epidemiological user based an on prototype feedback using the expert refined and iteratively and stories We prototype user interfaces. a on visual in interactive process analytics through process visual parts analytics our these visual implemented combining a by proposed We experts domain computation. human guiding knowledge, and for analysis domain automated and as expert well of as intertwining judgment, close a involves which diagnostics, [ model Jenkins and Box by approach proposed systematic selection the Essentially, model of sciences. environmental and economy, epidemiology, as methodology Box-Jenkins as known process, process selection selection model model iterative this the illustrated in and and general particular in in analysis series time in involved tasks tedious Selection. Model Series Time these to contributions our parametrization, discuss and selection, summarize we model time sub-sections, challenges. periodic following series multivariate the and time In univariate of data. for partly series tasks detection outlier the and support imputation, prediction, to tried we Specifically, isn aai rqetdt ult rbe oades[ address to problem quality data frequent a is data missing Tsa00 .Uigifrainciei n eiulpos iew sdi h vrl RM model ARIMA overall the in used we like plots, residual and criteria information Using ]. Tsa00 ,frcsighsa vnlne itr hnsaitclaayi ftime of analysis statistical then history longer even an has forecasting ], BJ70 nScin11o hpe ,w rsne h hlegn and challenging the presented we 1, Chapter of 1.1 Section In loe rciinr oapysc oesfrforecasting for models such apply to practitioners allowed ] Yul27 .Tm eismdl eg,AIA r neddto intended are ARIMA) (e.g., models series Time ]. codn oTa’ itr ftm eisanalysis series time of history Tsay’s to According BJ70 ead ihyieaieprocess iterative highly a demands ] KHP nms elwrdapplications, real-world most In + 11 eoeaayi methods analysis before ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. enwrvsttersac usinsae nScin16adase hmbsdo h findings the on based them answer and 1.6 Section in stated question research the revisit now We task the support to series time periodic for plots cycle of benefit the from learned already We fmsigvleiptto Catr4.Bsdo hseprec,w a h dat apply to idea the had we experience, this on Based 4). (Chapter imputation value missing of uodnt usin osbeunl eiefo hma nwrt h anquestion. main the the answering to by answer start an We them from dissertation. derive this subsequently of to body questions main subordinate the in presented argumentation and Revisited Questions Research 6.2 main the as well as 3 and 1 sub-questions answer to from part question. learned research missing lessons last and the reflections contribute The inevitably chapter loss this and used. information abstraction interpretation the data the reduce multivariate helped support the also accompanying in and It outliers periodicity outliers. identify the multivariate could of considering we contextualization while approach, views series this coordinated Using time multiple multivariate with novel environment. plots exploration this cycle interactive integrated classical to an multiple also addition in with we In together plot, plot cycle plot. cycle distance-based cycle of type Mahalanobis original overview this the one to of of it construction representation reduce the established to in the abstraction proposed distance-based retain we a data, and using univariate dimension plot on cycle only modified defined a is and 5 it values Chapter Because extreme contextualizing perspectives. and univariate different identifying in from visually patterns outliers for cycle allows comprehending The and and series. series identifying time time for effective periodic periodic multivariate and we established in data, well detection multivariate is outlier with plot the dealing support are to domains idea application periodic the most in extended in outliers analysts identify Because to environment series. exploration time interactive an in representations plot cycle called are which values, other the from significantly deviate that points data Series. the Time Periodic Multivariate in Detection Outlier directly is chapter this 3. of and contribution 1 The sub-questions research values. better to imputed for linked the time of periodic appropriateness and incorporated the linear of and visual in judgment imputation optimized values of an neighboring possibilities contextualized using the and By knowledge on time. domain expanded periodic we and time, in linear periodic points for in data values representation imputed neighboring compare their interactively plot cycle of to a context chart employed the line and standard values imputed a proposed the we with in linked 4 inherent representation Chapter uncertainty In the such of integrate intervals. uncertainty to confidence approach the to or an communicate used boundaries to be error possible appropriate can is using it simulations way, imputation Carlo-based this Monte In example, intervals. for confidence calculate measure; missing error the replace an to calculate imputation [ help applying resampling of uncertainty repeated cases commonly, most the Most in is neglected concern data. is One that data methods. the statistical in established introduced apply to order in values missing [ values [ missing data complete on LR02 All09 , Jon80 ,adol e pcaie ehd r plcbei h aeof case the in applicable are methods specialized few a only and ], .Iptto sacmo a obps hsiseadrpaethe replace and issue this bypass to way common a is Imputation ]. LR02 n utpeiptto ehius[ techniques imputation multiple and ] nte su nra-ol aais data real-world in issue Another ..Rsac usin Revisited Questions Research 6.2. outliers [ Agg13 Sch99 ]. ] 95 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 96 C 6. onclusion eepoe outtcnqe o h aclto fdsacss hti sls ro rn and prone error less is it that so distances of calculation the for techniques robust employed We eitoue nutv n neatvl ikdves nw smlil oriae iw,and views, coordinated multiple as known views, linked interactively and intuitive introduced we ecudfidmliait ulesta r o ulesi n ftesprt aibe.B using By variables. separate the of any in outliers not are that outliers multivariate find could we to us allowed approach analytics visual our addition, In parameters. model’s the adapting when saoa RM)mdl nCatr n ,w hwdta hs neatv exploration interactive these that showed we 3, and 2 Chapters In model. ARIMA) (seasonal h ihyieaiepoeso h RM oe eeto,kona o-ekn methodology, Box-Jenkins as known selection, model ARIMA the of process iterative highly The ovsal nlz atrsadotir nmliait esnltm eis hsi oewith done is This series. time seasonal multivariate both in for outliers limitations and and of patterns benefits combination analyze of The visually balancing to dimensions. the of facilitates distance-based number techniques The visualization arbitrary dimensions. and an univariate statistical to the technique of this each linear apply and in can values multivariate abstraction neighboring the in comparing multivariate by time exploring outliers periodic for possible and intuitive identifying it and makes can series also representation time It this periodic easily. with environment familiar exploration users the plot, cycle to of multivariate adapt representation novel visual our established in and series known time periodic a with Furthermore, together abstraction issue. distance-based this this overcome helped distance-based and multivariate data the the of of perspective addition variable separate the each a outliers, for allowed hiding plots plot cycle cycle in univariate prone multiple be using would to which contrast only, In outliers. distance. by the influenced compute not the to distance, data Euclidean the the of like distribution measures, multivariate distance the try normal considers to to distance 5) contrast Mahalanobis (Chapter In environment We issue. exploration this interactive distributions. overcome our marginal to the in approach in view hidden multiple be a otherwise employed would visual that the outlier in an shows outliers immediately the is hiding data to [ multivariate leads Tufte in which representation. outliers dimensions, detecting or at representations aimed wrong representation the graphical choosing a in series? issue time common multivariate One and univariate in tasks detection outlier and imputation 3 Sub-Question herein. summarized means and the parameters using of prediction, adjustment the the of including integration process, SARIMA selection a model of the component improved could seasonal environments we the that by meant given reproduced series is the time cycle for input seasonal model the the well the with how together of check prediction appropriateness also the the of judge integration This to selection measures dataset. model additional the use during and performance prediction process and capabilities prediction the diagnostics incorporate model the in improvements the grasp immediately to selection, possibilities model comparison the direct improve To available of exploration. amount interactive large an the and including analysis, the R, series combine of time to power for order computational packages interactive in the proposed computing worlds: Our statistical both for prediction. of project for strength R used the is upon it built model, information environment adequate of exploration an comparison suggest numerical statistical these and for If (visualization) project criteria. plots R static the uses as mainly such and environments, computing, computation script-based in applied usually is tasks? prediction series time for parametrization and 2 Sub-Question o a iulzto n neato mrv h rcs fmdlselection model of process the improve interaction and visualization can How sa dqaevsa ersnaino eidctm eisbnfiilfor beneficial series time periodic of representation visual adequate an Is Tuf83 .1]ue h aoseapeo iait ctepo,which scatterplot, bivariate a of example famous the used 14] p. , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ls nerto fsaitclcmuainit nitiie ihyrsosv,adinteractive and responsive, highly intuitive, an into computation statistical of integration close A with research this answer to us allows discussion and argumentation this Essentially, 4. Chapter in work o mue ausi h eeatcneto ieradproi ieaddtce outliers detected sophisticated Using and time series. time periodic periodic and multivariate linear of representation of values visual context estimated adequate relevant the we an and the in addition, series in In time values periodic capabilities. imputed explore prediction for to the techniques model exploring analytics and and visual models, provided data, integrating series as time of well the support as exploring better parameters, diagnostics, enables the combination adapting this support selection, Specifically, to model analytics analysis. the visual series allows time models, in and challenges data the series time of investigation and exploration, interactive adaptation, the support to representation visual adequate using environment, exploration detection? outlier and imputation, prediction, parametrization, selection, model of analysis series Question Research Main sub-question this answer can we essence, In data. superior this a in provides outliers series and time series. patterns periodic time of multivariate investigation periodic into multivariate perspective interactive in additional outliers an this analysts, of in a For detection such charts the combined line We and support to plots 5). cycle environment (Chapter exploration univariate data with multivariate plot for distance-based cycle plot a from distance-based cycle using findings multivariate the representation the use plot cycle on to the order Based extended in we immediately. abstraction enabled imputation, values for which imputed plot time, cycle the periodic a of and using adequateness linear of the context judge the compare to in to us values beneficial missing inevitably visual for is appropriate values 4) the estimated (Chapter series, the tasks time imputation periodic we for for plot) that, as of (cycle illustrated tasks representation capabilities, also prediction We for prediction support data. adequate the series added time and Integrating approach univariate this extended diagnostics 3, criteria. Chapter model information in the introduced of and analysis plots the support residual the adequately adapting using to when how showed transitions We animated and parameters. diagnostics model model by visualizing data and univariate the integrating of supports closely selection environment model exploration ARIMA/SARIMA for visual interactive methodology Box-Jenkins an established how demonstrated we 2, Chapter series In time multivariate and univariate both for series data? time periodic with dealing analysis series 1 Sub-Question our to similar well, as with series sub-question detection time outlier periodic for multivariate 5 in Chapter tasks in applied imputation showed techniques to and the and applicable consider data linear be we series to in support, time adequate values periodic an neighboring univariate is for their it this similarlythat of introduced and context we values Although the missing for time. in values periodic values estimated represent imputed to the values. plot investigate cycle missing could the of used imputation we the an improve 4, Similarly, helps Chapter dimensions. series In multiple time among periodic relationship of the representation and adequate structure periodic the to respect yes . yes svsa nltc naeut upr o h hlegsi ttsia time statistical in challenges the for support adequate an analytics visual Is . o a iulaayisspottecalne nsaitcltime statistical in challenges the support analytics visual can How ..Rsac usin Revisited Questions Research 6.2. 97 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 98 C 6. onclusion lhuhmc fteotirdtcinadiptto liaeyne ob uoae,visual automated, be to need ultimately imputation and detection outlier the of much Although lhuhw rpsdaditoue ouin o ovn oeo h ttdpolm ntime in problems stated the of some solving for solutions introduced and proposed we Although iulaayisfrcalne ntm eisaayi xrpltsteie fTkysieson ideas Tukey’s of idea the extrapolates analysis series time in challenges for analytics visual iinadcalne nvsa nltc eerhrltdt h oiscvrdi hsdissertation this in covered topics the to related research analytics visual in challenges and vision the improve capabilities prediction integrating and selection model to approaches analytics visual eerhi iesre nlssa el nadto,teeaesm eetatce bu the about articles recent some are there addition, In analytics well. visual for as opportunities analysis that great in series research as time challenges future in discussed about the research discussion of a some include consider both We history and the forecasting, field. discussed and have analysis DGH06] series indicates [Tsa00, them time articles of Two of each and work. considered, future be for to opportunities need that research challenges open still are there analysis, series Opportunities Future and Challenges Open 6.4 approach. analytics visual is the it in series, prominently time time periodic of with structure dealing periodic When the consider outliers. to univariate important and allows multivariate abstraction this detect an for to in methods us a statistical outliers as investigating robust for well using plot as environment cycle exploration tasks interactive multivariate imputation a and visualize data to series, univariate abstraction for time distance-based plots periodic cycle the for Specifically, using of investigation representation user. direct adequate the allowing for the by capabilities dataset prediction given applying and a that measures for diagnostic stated model adequate we an where finding section, of process previous the in visual questions how showed presented research We specifically our we 1. Chapter to 5), in and answers introduced analysis 4, the series 3, time in 2, challenges (Chapters the to dissertation contributes this analytics of body main the In models and methods, algorithms, data. the different or parametrizing new and for adapting regard. for this a useful in inevitably by success judged is for analytics better critical is is series, expertise time his/her the of in integration patterns the the Therefore, nature match outlying analyst. values the human imputed judging well models, how analytics of achieved, and visual adequateness been values The in not of still taken. tackled has been been fully already box has have black steps that the Opening important opening challenge Although run. significant already. the one years on several is previews for research models and adjustments of model box do black expensive to computationally the even possible for it responses fast make fast and and reasonably systems allow calculations interactive to advanced time today’s possible in With is challenges it before. the computation, possible to not solutions were novel that allows analysis statistics series with together analytics visual Using Conclusion 6.3 detection. outlier and challenges imputation, the prediction, supporting parameterization, for selection, possibilities model new of completely provides and Applying analysis missed. data be exploratory otherwise would that we outliers time, find periodic and/or and outliers linear these in investigate components could univariate underlying periodic the multivariate with visualize together to structures, representations visual well-established with abstractions data Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. erahdtede erigaeai h enie lhuhte lo e osblte for possibilities new allow they Although meantime. the in arena learning deep the reached we it seasons, and cycles nested multiple and/or variations larger for but adaptations, simple with iulaayisfr22 [ 2020 for analytics visual an building for exploration allows and that communicating for means a is analytics, visual [ learning machine and analytics visual ahn erig enhl,AdinoadAdino nterrcn iino spatio-temporal of vision recent their in Andrienko, and Andrienko Jiang Meanwhile, opportunities. learning. and machine challenges open as [ interpretability al. and et trust the [ analytics enhancing visual on with discussion combined learning machine [ on trust diagnostics” discovery; research and and comparison understanding model machine “data accountability; of contribute and interpretability building can of analytics needs visual main where three models, identified learning They et required. Krause are to interpretation According and contribute. can analytics concern visual [ central al. and of visualization in are which including building to topic, trust challenge discussed and and widely The Interpretability a is community. models. is research research complex models More visualization complex more the for such series. understand this achieve time and to input explain methods to the and ability to techniques appropriate interactive model find the the to how understanding 2, needed and comprehensible Chapter a components allowed in model plots approach the diagnostic analytics of the ARIMA and visual the parameters our like model in models, of series adjustment used time we simpler models For SARIMA reached. and are results the how of especially understanding visualization, while results their learning to machine come complex approaches, such how learning been understanding deep has like and there techniques, explaining years, better recent for In demand drastically. strong and increased a enhanced, has been networks has of complexity ANN the of prediction, area the in research 2006, Since addition forecasting. in the challenges in power considering major their as and overfitting to predicting of for risk (ANN) networks and over-parametrization, neural complexity, artificial model to section whole a dedicated Prediction. for Models Advanced and Learning Machine sub-section. following the in models discuss advanced we such which challenges, with additional Dealing creates embedding learning. methods the machine and even example, for or from, representations approaches visual other and of models specialized handle and to advanced possible require be may may length periodic opportunity in an are variations be extensions to small likely, this and Most consider be adaptations and research. to lengths that further periodic techniques expect for different We proposed having cycles our with general. deal consider in to we required series case, time practical the periodic of is for meaning portion large applicable this cycles, a data seasonal in real-world Although considered and mainly (frequency). applications length dissertation periodic this fixed with of cycles body periodic main the in presented [LGH [ analytics visual spatio-temporal discussions—namely, following the in consider we that ylcTm eisWtotFxdProi Length. Periodic Fixed Without Series Time Cyclic optr rmdl.Te rudta av nlssmyntb wr ftebgcagsin changes big the of aware be not may analysts naive that argued They models. or computers KPB16 + JLC19 7 KPB16]. 17, ,oedsrberao o ua novmn nvsa nltc swe understanding when is analytics visual in involvement human for reason desirable one ], loicue h blt oepani hi eerhcalne bu interactive about challenges research their in explain to ability the included also ] AA20 ,psdteiseo nrtcltuti eut rdcdby produced results in trust uncritical of issue the posed ], ERT + 17 , HKPC19 titysekn,orpooe solutions proposed our speaking, Strictly , ..Oe hlegsadFtr Opportunities Future and Challenges Open 6.4. JLC19 eGojradHnmn[ Hyndman and Gooijer De ERT KPB16 ,adpeitv iulanalytics visual predictive and ], + 17 rvdsamr detailed more a provides ] .Rcn state-of-the-art Recent ]. DGH06 AA20 ], ] 99 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 100 C 6. onclusion ihpirkoldeadas nbetedfiiino betvsfrtefrcssta are that forecasts the for objectives of definition the enable also and knowledge prior with osdrdi h oe eeto rcdr.Asaeo-h-r uvyo rdcievisual predictive of survey state-of-the-art A procedure. selection model the in considered ayapiain,advsa nltc eerhhspce pti su n rvddspotin support provided and issue this up picked has research analytics visual and applications, for many needed specific and time involved other and complexity, parameters One [ computational of computation estimation, number research. parameter large for series difficulty the time resulting is future the methods for multivariate challenge existing a with as challenge series for very missing time still support multivariate are software in these appropriate [ use, Tsay is to addition, there methods In particular, are In applications. there apply. and to done complex been and already difficult has practice and theory in [ Hyndman and Gooijer Models. Series Time Multivariate of Parametrization knowledge prior good integrating process. a as selection well be be model as to to a objectives methods such need such analytics into met judging visual be and consider to expressing, still objectives defining, We the for understanding, fit too. the their integrated of in and capabilities because, but defined prediction step formally model, the first more the into a select expectation only to and is domain objective the It and of adjust model. form data to informal way the adaptive an about knowledge form an allows prior to simply and also approach data Our process the selection on direction. model based this the model into into step capabilities first prediction simple the a integrate as to 3 Chapter [ bias? in amount?” introducing of contribution right of benefits the costs the is the get human-in-the-loop minimize we much and that How intuition, so emotional integration knowledge and regarding constrain social questions knowledge, or unanswered domain “regulate raised to and specifically knowledge possible al. user is et integrating it Lu how when arise predictions. that adapt issues to the them allowing discussed also They research. future for [ al. et Lu by analytics Prediction. for Objectives [ Specifying Hyndman and Knowledge Prior Integrating challenges extensive these to decision. attention this pay outlier with to like come important tasks, that is for it methods prediction, learning and machine imputation, and detection, models advanced and complex more using bias and literature trust, mentioned interpretability, previously [ in the detection involvement to human concordance interpretability, in with are primarily analytics These concerned visual them. where solving From challenges, to achieve. high pressing contribute to most the can challenging the of especially derived is Because they survey, it comprehensive approaches, approaches. their learning learning deep of deep be size in to and needs analytics complexity interpretations visual and using trust interpretability [ that al. and gain interpretability et to situation, Hohman effort this analysts. the In by but accepted understanding. essential, better are a building get the trust to trust and effort just the may analysts make experienced not while and slightly numbers parameters model adapting when results HKPC19 DGH06 DGH06 dnie h edfrmdlslcinpoeue ouetedt together data the use to procedures selection model for need the identified ] p 71] hncnieigetnigorvsa nltc approaches analytics visual our extending considering When 17–18]. pp. , .Teprmtiaino oesadagrtm so eta ocr in concern central of is algorithms and models of parametrization The ]. LGH DGH06 + 17 a utvraetm eismdl.Atog xesv research extensive Although models. series time multivariate was ] loietfidteitgaino srkoldea challenge a as knowledge user of integration the identified also ] Tsa00 HKPC19 oea h itrso iceeadcniuu variables continuous and discrete of mixtures the foresaw ] pcfial icse h susi understanding in issues the discussed specifically ] LGH nte hleg etoe yDe by mentioned challenge Another + 17 p 5–5]W osdrour consider We 554–555] pp. , eGojrand Gooijer De Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. ihvsaiaint rvd iulaayisslto o hschallenge. this for solution analytics visual a provide to visualization with [ series time spaced unequally of rastering a such for support visual providing work plcto fsadr ehd fewrs h uhro hsdsetto otiue oa to contributed the dissertation allows this but of concerns, author and The issues own afterwards. its methods has approach standard This of application series. time such transform to eismdlta ol tsm at ftetm eisaeutl sntaporaei other in appropriate not is adequately series time the of parts some fit would that model series rdcieVsa Analytics. Visual Predictive uheet,pten,adbhvo,i sncsayt obn ttsia opttoa methods detect computational to statistical order combine In to methods accordingly. necessary into models is look applied it to behavior, the important and adapt is patterns, to it effects, order Therefore, such in data. them a the understand is in and there nowadays, effects detect available been of to data not kinds long-term has these and group, of high-volume chance resulting the each higher with for addition, models In series yet. time pursued different applying time then labeling dissertation and and approach this segmenting an of for and author support similarities The visual their such [ classes. about series to work segmented according some the series to of time contributed each already the to of models possible parts A different segment apply them. following then or segments partition the or to breaks is and shifts solution the around exactly as such segments, shifts, level breaks, structural is [ outliers spaced shifts to unequally addition location these in and compensating challenge interesting methods into Another and more models series. look time specialized time only spaced apply unequally not as such should explore well research and as This investigate series help done. also be but to transformations, work such and supporting research more is there but addition In prediction. [ and analysis approaches the model stated for and series important series time more time specific be such will to in occurrence observations of between trend times duration a the foresaw time that also the Tsay detail example, challenge. more For specific in investigate one them. to as with observations cope spaced to unequally research simple mentioned additional on he require as will well and as [ methods Tsay series In existing time 5). challenge considered. spaced and be equally 4 to of (Chapters values need detection missing that outlier on series focused time have in we work, structures this other interesting several are there Series. data, Time in Structures Interesting of Detection selection. model series be time can multivariate analytics of visual challenges how investigate the could support models research and state-spaced Future applied and applied. ARMA practically vector stated more these he be methods, and can computational variables, of between advances relationships the dynamic with the the that, investigating of in influence interest the to the of solution satisfy understanding possible [ to an Tsay a shape is results. help analytics the and visual on parameters that parameters of state number can large we a experience, support this parameters of With influence the results. investigating and the data to series [ time al. labeling and this et segmenting Röhlig of for by author approach work the a example, in For involved was domains. dissertation application and methods different for parametrization l [ al. LGH ABG + 17 rsne eetavne nvsa nltc upr o rdcieaayis where analytics, predictive for support analytics visual in advances recent presented ] + 14 , RLS DGH06 + 14 , RLK .Teesrcua rasadsit setal enta time a that mean essentially shifts and breaks structural These ]. na xesv uvyatceo rdcievsa nltc,L et Lu analytics, visual predictive on article survey extensive an In + Tsa00 15 , BDB xlrdteiprac fmliait oesi order in models multivariate of importance the explored ] + Tsa00 16 Jon85 , BBB dnie oeseilfaue ntedt that data the in features special some identified ] ,oepsil prahi oapyrastering apply to is approach possible one ], + RLK 18 , npatclapiain n real-world and applications practical In ..Oe hlegsadFtr Opportunities Future and Challenges Open 6.4. BBGM18 + 15 htepoe iulanalytics visual a employed that ] ,btti dao sn such using of idea this but ], BBGM17 ], 101 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 102 C 6. onclusion nrek n nrek [ Andrienko and Andrienko rn yeo sr o ntne nedue ih s nya nta aaerzto famodel a of parametrization initial an only use might end-user an instance, For user. of type wrong 6]as etoe h agro h iueo iesre ehd,sc si hr r outliers are there if as such methods, series time of misuse the of danger the mentioned also 461] h buzzword The iulzto n iulaayistcnqe sdvlpda eerhpooye n onot do and prototypes research as developed is techniques analytics visual and visualization irre,mil o urn tt-fteatdt cec agae ieRadPto,t make to Python, and R open-source like in languages approaches science and data methods, Although state-of-the-art techniques, current future. these for near publishing mainly the in libraries, in trend software current reliable a and is accessible there into transferred be to likely seem discussed as [ solutions, Andrienko analytics challenge and need visual related Andrienko statistics practical Another by robust appropriate community. in of forecasting advances accessibility the and the in availability that attention is suggested more They with considered detect. p. be to [DGH06, to difficult Hyndman are and Gooijer that De shifts or 92]. p. thus [AA20, and humans” think, by and controlled understand, and see, led cannot be which to reasoning computers, need and of weaknesses understanding the human the of of emphasized awareness “primacy Andrienko and the and analytics, Andrienko visual Therefore, of outcome. philosophy the underlying trusts uncritically result, a as mistrust. consequently, and, decisions and results faulty to lead may each so for be do techniques to to and Failing needs methods distinction are. appropriate this type the research, user what analytics determine visual from to in distinguished and addition, investigated and In individually considered design. be course the to of of needs can beginning this prediction the but and backgrounds, modeling different supporting with for users understanding methods target analytics in-depth hand, Visual no other but domain. the techniques, the on and of experts, models Modeling a predictive involved. have about data knowledge experts the advanced Domain have and domain methods. of the or domain of specific models understanding the predictive great in about experts knowledge neither have end-users, are knowledge: nor their End-users application on experts. visual depending modeling analytics, three predictive and visual identified predictive a experts, They of domain with process. scope the the interacting in to users user contribute of can of types knowledge type his/her how the and system between analytics relationship User. the of on Type discussion on Depending Tools Appropriate regard. this in explain to ability and interpretability of challenge discussed. have the we with as deal models, to such complex in and trust large increasingly and such interpretability learning of in deep trend challenges considered This the al. increases trend. et models current Lu a Furthermore, as as well models. prediction as for complex knowledge approaches more user and of larger integration to to the correspond scaling challenges identified the named also They their of findings herein. Some mentioned their challenges discussions. on the based internal analytics and outlook survey visual an literature predictive offered the in also from directions they systems research analytics literature, and visual the challenges the in future found on for overview analytics predictive their for to applied addition techniques modeling, In and statistical mining. from techniques data prediction and for learning, term machine umbrella an as used is analytics predictive eas /ei nbet oewt h dac dpainpsiiiiso h aaerzto and, parametrization the of possibilities adaptation advance the with cope to the unable for is technique s/he or because method not inadequate or an up providing mixing by and example, type for user caused, be intended can the This specifying strictly algorithm. analysis an by generated results of xlial I(rica intelligence) (artificial AI explainable AA20 AA20 ttdtedne fuciia rs essbigoel critical overly being versus trust uncritical of danger the stated ] .Te lie httevs muto otiuin in contributions of amount vast the that claimed They ]. a enue o eerheot ntrying in efforts research for used been has ue l [ al. et Lu LGH + 17 asda important an raised ] Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. eetnieydsusdoe hlegsadftr eerhopruiista pndu during up opened that opportunities research future and challenges open discussed extensively We hncetn iulaayissltos nrek n nrek aldi h “effective the it called Andrienko and Andrienko solutions. analytics visual creating when ecudiaiebigbnfiili iesre oeln n rdcinseaiswt data with scenarios prediction and modelling series time in beneficial being imagine could we h oko hsdsetto okmr ieadrsucsta xetdadoccasionally and expected than resources and time more took dissertation this on work The in systems the to compared users of effort the measuring for metrics of lack the argue even They iiino ao ewe h ua n h optr [ computer” the and human the between labor of division oiae h uhrsfe ie e h uhrlandaltproal,sinicly and scientifically, personally, lot a learned author the yet time, free author’s the dominated aacn h ffr ewe ua n ahn nvsa nltc prahs h ot of costs The approaches. analytics visual in machine and human between effort the balancing n oi,temr o elz hti etoe ob oe ntefloigscin elist we and section, PhD following papers. his the the during In of Bögl, each Markus done. to dissertation, be contributions this to his of open discuss author into left the investigate by is you published what further publications realize the the you projects, research more in the usual topic, As one dissertation. this on research the shifts. and breaks that structural changes approach containing data fascinating the a if is and, This model automatically. the adaptations in model of changes knowledge such would such the foresee trigger which accordingly, on adequately, data—to rather—based the data but in the adaption, changes maps the expected new execute longer the time to no every analyst model environment human building the a model require and always analytics changes visual data a basic or in The arrives reduce model environments. and data a changing computer readjust in the to times to not reaction tasks is and more idea costs, move involvement, helps human idea of This effort the dynamics. data [ to mechanisms” reacting adaptation 91] an model p. with automatic discussion “orchestrated this call they on embarked topic—what exciting Andrienko and entities. Andrienko two is these systems. there between mixed-initiative computer, such effort the by the faster balance or to user how the on by evidence done empirical better generalizable Endert are no that still herein. sub-tasks to into referred them those divide including and papers, recent in [ discussed al. widely [ et been undertaking” cost-intensive has user and This unpredictable the 6]. an getting p. be since can analysis process visual analysis the interactive and in towards many possible involved favored “[i]n as Keim, is much Daniel analytics as by automated analysis represented cases prominently automated As on necessary. emphasis where involvement an human been analytics visual already of has human beginning of there the centrality research, Since the analytics. the and visual available, information, in data comprehend (“human-is-the-loop”) big to involvement of capabilities amounts human massive of ever-growing limitations the natural of because years recent in increased consider to relevant more becoming are machines and users human between needed time and effort [ data spatio-temporal for HKPC19 analytics visual in challenges Effort. Machine and Human Responsibility—Balancing user. of type the on depending support appropriate to connection visual [ and al. visualization et Lu appropriate in experience such lack because [ who techniques misuse, people analytics of by danger adapted a often again are is libraries there environments web-based through accessible them ERT ,adpeitv iulaayis[ analytics visual predictive and ], LGH + 17 rudta,atog hr r salse ehdlge odcmoetasks decompose to methodologies established are there although that, argued ] + 17 loietfidtecalneo mrvn h sreprec,wihhsa has which experience, user the improving of challenge the identified also ] AA20 .Frabodruiiaino h rdcievsa nltc system, analytics visual predictive the of utilization broader a For ]. LGH + 17 aedsusdteiseo cnmc in economics of issue the discussed have ] ..Oe hlegsadFtr Opportunities Future and Challenges Open 6.4. AA20 AA20 eetppr rsnigresearch presenting papers Recent ,mcielann [ learning machine ], .9] hscalnehas challenge This 91]. p. , ERT KMT09 AA20 + 17 , , , 103 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. 104 C 6. onclusion Graphics inr lxne id n i amrc.Vsal n ttsial uddiptto of imputation guided statistically and Visually Lammarsch. Tim and Rind, Alexander Aigner, h nig ftersac odce yMru öldrn i h rga eepresented were program PhD his during Bögl Markus by conducted research the of findings The 05 alai adna tl,My2-6 2015 25-26, May Italy, Sardinia, Cagliari, 2015, o rdcieAnalytics Predictive for for In diagnostics guide predictions. to model methods series analytics Visual time Rind. Alexander and Miksch, Silvia Lammarsch, [BAF – 2015 selection. model series time in predictions Integrating In Rind. Alexander and Miksch, Silvia marsch, detection outlier Multivariate revisited: abstraction. plot distance-based Cycle a Rind. using Alexander and Miksch, Silvia Leite, [BFG Publications Main 6.5.1 PhD his list during we co-author a addition, as In contributed each below. dissertation are this publications and program. of main self-contained author are the the 5 in which to listed and publications publications 4, this research 3, of 2, the author Chapters on the of publications. based the contributions of and each roles in the explains dissertation and publications main the summarizes international in published were results and The EuroVIS, audience. like (EG) scientific journals, Eurographics high-profile a VIS, of IEEE front like in EuroVA, symposia, and conferences international at the to back come to section. want this I of conclude, beginning To the horizons. from and song complete shores R.E.M. nearly new a the is for from was it off Serendipitously, quote result set phase. The to final me the allowing obstacles. now, in evolved overcoming that and accomplishment work of of feeling plenty forward moving occupationally, [BAF . Publications 6.5 isn ausi nvraesaoa iesre.In series. time seasonal univariate in values missing [BFG rceig fte6hItrainlErVsWrso nVsa nltc,EuroVA@EuroVis Analytics, Visual on Workshop EuroVis International 6th the of Proceedings + + + + 14] 15] 17] 15] Workshop swl si ofrnepoedns(GErVS EEVS.Tefloiglist following The VIS). IEEE EuroVIS, (EG proceedings conference in as well as , aksBg,Wlgn inr ee izoe,Teei shadnr Tim Gschwandtner, Theresia Filzmoser, Peter Aigner, Wolfgang Bögl, Markus Lam- Tim Gschwandtner, Theresia Filzmoser, Peter Aigner, Wolfgang Bögl, Markus aksBg,PtrFlmsr hrsaGcwnte,TmLmash oe A. Roger Lammarsch, Tim Gschwandtner, Theresia Filzmoser, Peter Bögl, Markus aksBg,PtrFlmsr hrsaGcwnte,Sli ish Wolfgang Miksch, Silvia Gschwandtner, Theresia Filzmoser, Peter Bögl, Markus optrGahc Forum Graphics Computer 04– 2014 , Workshop rceig fte21 EEVSWrso nVisualization on Workshop VIS IEEE 2014 the of Proceedings optrGahc Forum Graphics Computer and I’ h n ftewrda eko tadIfeel I alone” and time it some know had we I as fine—time world the of end the “It’s EETascin nVsaiainadComputer and Visualization in Transactions IEEE ae 37.TeErgahc Association, Eurographics The 73–77. pages , — rceig fteIE ofrneon Conference IEEE the of Proceedings ...(1987) R.E.M. 63:2–3,21 – 2017 36(3):227–238, , , rmteAlbum the from Document Journal Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. iulAayisSineadTcnlg,VS otr,Ciao L S,Otbr25-30, October USA, IL, Chicago, Posters, – VAST Technology, and Science Analytics Visual hl rtn h aes egtee edakfo h oatosaditgae h suggested the integrated and co-authors the from feedback gathered he papers, the writing While lxne id iulaayisfrmdlslcini iesre analysis. series time in selection model for analytics Visual Rind. Alexander eulc ue48 2018 4-8, June Republic, EuroVA@EuroVis Analytics, Visual on Workshop EuroVis International 10th the of Proceedings Chicago, in conference VAST IEEE 2015 the at Award Poster Best the awarded was poster This 09 ue3 09 ot,Portugal Porto, 2019, 3, June 2019, 2015 [BBGM18] [BAF h rttpclipeetto o h hwae hc a mlmne yoeo h co- advised the He of one by visualization. implemented the was for which showcase, data the the for prepared implementation prototypical and the computed and method abstraction etto n iulaayi fmliait iesre.In series. time seg- multivariate automated of the analysis Combining visual Kohlhammer. and Jörn mentation and Schumann, Heidrun Miksch, Silvia ner, Czech Brno, Posters, – 2018 EuroVis In Visualization, data. on series Conference time Eurographics labeling 20th and the segmenting of of process the in uncertainties rizing In pre-processing. series time multivariate in uncertainty Quantifying Miksch. Silvia and mer, [BBB Publications Additional 6.5.2 conference. scientific USA. international Illinois, the at [ presentation paper the for the papers poster For a the visualization. prepared the For and for designed data he the [ Bögl. prepared paper and the Markus computed For by and showcased visualization. written the for text data the main based prepared co-author the same from the explanations by prepared [ and was following text 5.6) in (Section the reader the scenario on usage support to the 5.9 on Section discussion in the attached material supplementary The authors. [ paper the For ideas. overall the fit they as improvements. improvements iterative about and feedback meetings for scientific project co-authors the for with of responsible discussions front and also in topics was presentation respective the He the as well conferences. as international writing process at paper, submission the community entire of the organization and abstraction, paper, data the design, of visualization idea, main the including BAF Contributions. Graphics Computer and Visualization on [BBB BAF + ae 8–9.IE,21 – 2015 IEEE, 189–190. pages , + + + + 15 14 13] 19] 18] , , BFG BAF aksBg,Wlgn inr ee izoe,TmLmash ivaMkc,and Miksch, Silvia Lammarsch, Tim Filzmoser, Peter Aigner, Wolfgang Bögl, Markus ügnBrad hita os aksBg,CrsinEcnr hrsaGschwandt- Theresia Eichner, Christian Bögl, Markus Bors, Christian Bernard, Jürgen Kohlham- Jörn Gschwandtner, Theresia Bögl, Markus Bernard, Jürgen Bors, Christian aksBg,CrsinBr,Teei shadnr n ivaMkc.Catego- Miksch. Silvia and Gschwandtner, Theresia Bors, Christian Bögl, Markus + + 15 15 , aksBögl Markus , BAF BFG + + ae 54.TeErgahc soito,2018 Association, Eurographics The 45–47. pages , 14 15 , ,Mru öldatdtefiue o h einadcmue and computed and design the for figures the drafted Bögl Markus ], BAF a h edato falteaoelse aes[ papers listed above the all of author lead the was + 13 ae 13.TeErgahc soito,2019 Association, Eurographics The 31–35. pages , Poster n a epnil o oto h cetfi contributions, scientific the of most for responsible was and ] 91)23–26 03– 2013 19(12):2237–2246, , BAF + 13 BFG rceig fte9hInternational 9th the of Proceedings ,h mlmne h rttp as prototype the implemented he ], + 17 ,h mlmne h data the implemented he ], Journal EETransactions IEEE Proceedings BFG BFG ..Publications 6.5. + + 15 17 ], , 105 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. .C 6. 106 onclusion AT21,Ciao L S,Otbr2-0 2015 25-30, October USA, IL, Chicago, 2015, VAST h ehrad,Jn -,2016 6-7, June Netherlands, The 2017 14-16, August Thailand, rhdAnot ehr hnasr n hlp öle.Avsa nltc prahto approach analytics visual A Zöllner. Philipp and Thonhauser, Gerhard Arnaout, Arghad uoi okhpo iulAayis uoAErVs21,Sasa K ue91,2014 9-10, 2014 June Association, UK, Swansea, Eurographics 2014, The EuroVA@EuroVis Analytics, Visual on Workshop EuroVis – VAST Technology, and 2014 Science 25-31, Analytics October Visual France, on Paris, Posters, Conference IEEE 2014 the of Proceedings 4, June Republic, Czech Brno, 2018, EuroVA@EuroVis Analytics, Visual on Workshop EuroVis ypsu nVsa nomto omncto n neato,VNI21,Bangkok, 2017, VINCI Interaction, and Communication Information Visual on Symposium 2018 otCI21,Mrbr lvna uy13 03 Proceedings 2013. 1-3, July Slovenia, Maribor, 2013, SouthCHI önKhhme.Vsa-neatv emnaino utvraetm eis In series. time multivariate of segmentation Visual-interactive Kohlhammer. Jörn nCmlx ntutrd i aa-TidItrainlWrso,HIKD21,Hl at Held 2013, HCI-KDD Workshop, International Third - Data Big Unstructured, Complex, in In symbolic data. for transformation time-oriented visual of Interactive representation Rind. Alexander and Miksch, Silvia Gschwandtner, In data. series time multivariate labeling and segmenting In labeling. and segmentation time-series on influence parameter Analyzing Miksch. Silvia and [RLS visual by recognition activity Supporting Miksch. Silvia In and analytics. Alsallakh, Bilal Bögl, Markus [RLK Groningen, 2016, EuroVA@EuroVis Analytics, Visual on Workshop EuroVis International 7th the [BDB Graphics study. Computer user and comparative A uncertainty: temporal of encodings [GBFM16] In series. time spaced unequally of rastering for support [ABG [BBGM17] [LAB nCmue Science Computer in ae 95.TeErgahc soito,2018 Association, Eurographics The 49–53. pages , + + + + + 14] 16] 13] 15] 14] rceig fte21 EECneec nVsa nltc cec n Technology, and Science Analytics Visual on Conference IEEE 2015 the of Proceedings atnRhi,Mri uoci,HirnShmn,Mru öl ia Alsallakh, Bilal Bögl, Markus Schumann, Heidrun Luboschik, Martin Röhlig, Martin i amrc,Wlgn inr lsi etn,Mru öl Theresia Bögl, Markus Bertone, Alessio Aigner, Wolfgang Lammarsch, Tim and Vögele, Anna Röhlig, Martin Bögl, Markus Dobermann, Eduard Bernard, Jürgen ia lalk,Mru öl hrsaGcwnte,Sli ish ia Esmael, Bilal Miksch, Silvia Gschwandtner, Theresia Bögl, Markus Alsallakh, Bilal Schumann, Heidrun Kirste, Thomas Krüger, Frank Luboschik, Martin Röhlig, Martin hrsaGcwnte,Mru öl al eeio n ivaMkc.Visual Miksch. Silvia and Federico, Paolo Bögl, Markus Gschwandtner, Theresia hita os aksBg,Teei shadnr n ivaMkc.Visual Miksch. Silvia and Gschwandtner, Theresia Bögl, Markus Bors, Christian ae 0–1.Srne,2013 Springer, 400–419. pages , 21:3–4,2016 22(1):539–548, , ae 35.AM 2017 ACM, 53–57. pages , ae 13.TeErgahc soito,2016 Association, Eurographics The 31–35. pages , ua-optrItrcinadKoldeDiscovery Knowledge and Interaction Human-Computer ae 6–7.IE,2014 IEEE, 269–270. pages , ae 14.IE,2015 IEEE, 41–48. pages , rceig fte1t International 10th the of Proceedings rceig fte5hInternational 5th the of Proceedings EETascin nVisualization on Transactions IEEE oue74 of 7947 volume , rceig of Proceedings etr Notes Lecture . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . upeetr aeilfrteuaeseai,pg 87 86 71 ...... 65 ...... 78 . . 77 ...... 40 ...... 2 . . . page . . scenario, 64 . . usage . . the . . . for . 1 . material page . . . Supplementary . scenario, . usage . . . the . 75 for . plot . . material . 5.6 . cycle Supplementary . 36 multivariate . . example 44 . the . bivariate utilizing . . a environment . . exploration 5.5 with . plot . Interactive plot . cycle . cycle distance-based . . distance-based 32 . generalized the . a . . . of 5.4 . to Construction . . plot . . . cycle . . . univariate . . . a . . 5.3 43 of . . . Transformation plot . . cycle . . . . univariate . . a . . . . of 5.2 . . Explanation . . . . 55 ...... details . . . 5.1 . more . . for . . . 39 interactions . . of . . imputation . Sequence . . series . . . time . . . guided statistically . . . . 24 . 4.2 and . Visually . . . environment . . . selection . . . model . . . interactive 4.1 . . . the 30 . . . into . . . integrated . . . predictions . . . . Model ...... Plots . 23 . . Residual 3.1 . . . in . . . . Selection . . Model Plot . . ACF/PACF . . of the . . Transitions . . 22 of . . Part . . . . and . . Toolbox . . . Selection 2.7 . . . . Model ...... 2.6 . Selection Overview . TiMoVA Model . . . . for . . Description . . . . Process . . Analytics 2.5 . . . Visual ...... Lags . 16 over PACF . . . . 2.4 . and 11 ACF ...... Methodology 2.3 . environment Box-Jenkins . . selection . . model . . interactive . . . An . . 2.2 – TiMoVA . . . . . 13 12 . . . . 15 . 17 2.1 ...... cycles . . in . . plot periodicity . 18 cycle obvious . . . . Showing as . know ...... later 1.14 plot, ...... series subseries time Seasonal ...... periodic 13 a . 1.13 . . . . . of studies decomposition . . . . . design . Seasonal conducting . . . for . . 1.12 framework methodology—nine-stage . chart . . . . study location Design information . . . . . and . clarity 1.11 . validation . . . methodology—task and study . . design . . . Design visualization . for . . . 8 . model 1.10 framework Nested . . . data–users–tasks . 7 building triangle: . . . model design as analytics 1.9 . . . analytics Visual visual . . viewing . for workflow . . . analytics . 1.8 Visual . . . analytics . . visual . . al. for . 1.7 et model . Knowledge-generation . Keim . by . . defined . process . analytics . 1.6 . Visual analytics . visual temperature of soil 1786 scope 1.5 in in The variation appeared periodic first of 1801, analysis from 1.4 graphical Playfair Lambert’s W. by graph line series 1.3 Time 1.2 . osbytefis iesre rp rmte10 the from graph series time first the Possibly 1.1 th r11 or th ito Figures of List etr 6 ...... century 107 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . lyarrpeetn ainldb fEgad...... 93 . . 92 . . . . . 88 ...... England . of debt cases . COVID-19 national of representing . curve Playfair the . flattening about . comic 6.2 A . 3 page scenario, 6.1 usage the for material Supplementary 5.7 108 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. . C n AFbhvo o RAadSRM oes...... 35 ...... models SARIMA and ARMA for behavior PACF and ACF 2.1 ito Tables of List 109 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [AMM [ALA [BAF [AMST11] [Ans73] [All09] Aggarwal. C. Charu [Agg13] [ABG [AA20] [AA06] + + + 13] + 18] 14] 07] iulzto fTm-retdData Time-Oriented of Visualization eprlDt:ASseai Approach Systematic A Data: Temporal Graphics analytics visual A Zöllner. Philipp and Thonhauser, Gerhard Arnaout, Arghad Psychology 04 wne,U,Jn -0 2014 9-10, June UK, Swansea, 2014, 2020s. for vision EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE n lxne id iulaayisfrmdlslcini iesre analysis. series 2013. time in selection model for analytics Visual Miksch, Silvia Rind. Lammarsch, Alexander Tim and Filzmoser, Peter Aigner, Wolfgang Bögl, Markus analysis. statistical 1973. Feb. in 27(1):17–21, Graphs Anscombe. J. Francis Tominski. Christian and Schumann, Heidrun Miksch, Silvia Aigner, Wolfgang view. systematic data—a time-oriented Chris- Visualizing and Tominski. Schumann, tian Heidrun Müller, Wolfgang Miksch, Silvia Aigner, Wolfgang In data. Missing Allison. model Paul as analytics visual Viewing Rind. Alexander building. and Daniel Fuchs, Miksch, Georg Silvia Andrienko, Keim, Gennady Lammarsch, Tim Andrienko, Natalia EuroVA@EuroVis Analytics, Visual on Workshop In EuroVis data. International 5th series the time of multivariate labeling and segmenting to approach Esmael, Bilal Miksch, Silvia Gschwandtner, Theresia Bögl, Markus Alsallakh, Bilal a analytics: visual Spatio-temporal Andrienko. Gennady and Andrienko Natalia Andrienko. Gennady and Andrienko Natalia 13:0–0,2007. 31(3):401–409, , optrGahc Forum Graphics Computer hpe ,pgs7–9 AE 2009. SAGE, 72–89. pages 4, chapter , ora fSailIfrainScience Information Spatial of Journal ule Analysis Outlier h AEHnbo fQatttv ehd in Methods Quantitative of Handbook SAGE The pigr odn K 2011. UK, London, Springer, . 76:7–9,2018. 37(6):275–299, , h uorpisAscain 2014. Association, Eurographics The . pigrNwYr,2013. York, New Springer . pigrSine&Bsns ei,2006. Media, Business & Science Springer . xlrtr nlsso pta and Spatial of Analysis Exploratory Bibliography h mrcnStatistician American The 002)8–5 2020. 2020(20):87–95, , 19(12):2237–2246, , optr & Computers Proceedings 111 , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [BBGM18] [BBB [BAF [BAF [BD87] [BBH11] [BBGM17] [BBB 112 [BAP + + + + + 15] 14] 05] 18] 19] otr,Bn,CehRpbi,Jn -,2018 4-8, June Republic, Czech Brno, Posters, – okhpo iulzto o rdcieAnalytics Predictive for Visualization on Workshop IC 07 ago,Taln,Ags 41,2017 14-16, August Thailand, Bangkok, 2017, VINCI VDA Analysis, Data and Visualization soito,2018. Association, rceig fte2t uorpisCneec nVsaiain uoi 2018 EuroVis Visualization, on Conference Eurographics 20th the of Proceedings uoAErVs21,Bn,CehRpbi,Jn ,2018 4, June Republic, Czech Brno, 2018, EuroVA@EuroVis Surfaces 2015 nentoa ypsu nVsa nomto omncto n Interaction, and Communication Information Visual on Symposium International igotc o iesre oe rdcin.In guide predictions. to model methods series analytics time Visual for Tim diagnostics Rind. Gschwandtner, Alexander and Theresia Miksch, Filzmoser, Silvia Peter Lammarsch, Aigner, Wolfgang Bögl, Markus ereE .BxadNra .Draper. R. Norman and Box P. E. George framework general A Hammer. mapping. 2011. 24(3):771–804, visualization Barbara data and dimensionality-reducing Biehl, for Michael Bunte, Kerstin In data. series time labeling and segmenting of process Catego- the Miksch. in Silvia uncertainties and Gschwandtner, rizing Theresia Bors, Christian Bögl, Markus In series. time spaced unequally Visual of Miksch. rastering Silvia for and Gschwandtner, support Theresia Bögl, Markus Bors, Christian 2019. Portugal Association, Porto, Eurographics 2019, The 3, 31–35. June 2019, EuroVA@EuroVis time Analytics, Visual multivariate Jörn on in Gschwandtner, uncertainty In Theresia Quantifying pre-processing. series Bögl, Miksch. Markus Silvia and Bernard, Kohlhammer, Jürgen Bors, Christian 2018. Association, Eurographics series. time Theresia multivariate of Eichner, analysis In visual and Christian Com- segmentation Kohlhammer. automated Bögl, the Jörn bining and Markus Schumann, Heidrun Miksch, Bors, Silvia Gschwandtner, Christian Bernard, Jürgen In series. time Shneiderman. in Ben and search Khella, pattern Amir Plaisant, Interactive Catherine Aris, Aleks Buono, Paolo 25-26, May Italy, Sardinia, Cagliari, 2015, EuroVA@EuroVis Analytics, Visual time on in In predictions selection. Integrating model Tim Rind. series Alexander Gschwandtner, and Theresia Miksch, Filzmoser, Silvia Peter Lammarsch, Aigner, Wolfgang Bögl, Markus rceig fte9hItrainlErVsWrso nVsa Analytics, Visual on Workshop EuroVis International 9th the of Proceedings ae 37.TeErgahc soito,2015. Association, Eurographics The 73–77. pages , onWly&Sn,N,UA 1987. USA, NY, Sons, & Wiley John . rceig fte1t nentoa uoi Workshop EuroVis International 10th the of Proceedings rceig fte6hItrainlErVsWorkshop EuroVis International 6th the of Proceedings ae 7–8,2005. 175–186, pages , miia oe uligadResponse and Building Model Empirical rceig fteCneec on Conference the of Proceedings rceig fte21 EEVIS IEEE 2014 the of Proceedings ae 54.TeEurographics The 45–47. pages , 2014. , ae 35.AM 2017. ACM, 53–57. pages , rceig fte10th the of Proceedings erlComputation Neural ae 95.The 49–53. pages , pages , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [Bec00] [BG05] [BFG [Ber83] [BDB [BD10] [BD02] [BK11] [BJW00] [BJ70] [BFG [BD91] [BJR08] + + + 15] 17] 16] oeatn n Control and Forecasting iigadKoldeDsoeyHandbook Discovery Knowledge and Mining aaae,Dt iigadTmoa Reasoning Temporal and Mining Data Databases, guided statistically and Visually Lammarsch. Tim and Rind, Alexander Aigner, Control 2015 25-30, October USA, IL, Chicago, h uorpisAscain 2016. Association, 2016 Eurographics The 6-7, June Netherlands, The Groningen, 2016, EuroVA@EuroVis Example pigrSre nSaitc.Srne,NwYr,N,UA n dto,1991. edition, 2nd USA, NY, York, New Springer, Davis. Statistics. A. in Series Richard Springer and Brockwell J. Peter øe igadadMrtKulahci. Murat and Bisgaard Søren Wang. X. Sean and Jajodia, G. Sushil Bettini, Claudio Reinsel. C. Gregory and Jenkins, M. Gwilym Box, P. E. George Jenkins. M. Gwilym and Box P. E. George editors, Rokach, Lior and Maimon Oded In detection. Outlier Ben-Gal. Irad Multivariate revisited: plot 2017. abstraction. Cycle 36(3):227–238, distance-based a Rind. using Alexander detection outlier and Miksch, A. Roger Silvia Lammarsch, Tim Leite, Gschwandtner, Theresia Filzmoser, Peter Bögl, Markus Posters, – VAST Technology, and Science Analytics In Visual series. on Conference time IEEE seasonal the univariate in values missing of imputation Wolfgang Miksch, Silvia Gschwandtner, Theresia Filzmoser, Peter Bögl, Markus graphique" "Sémiologie french 1967. in in published originally Berg, J. William by translated Bertin. Jacques 2000. MA, Reading, Beck. Kent series. time multivariate of In segmentation Vögele, Visual-interactive Anna Kohlhammer. Röhlig, Jörn Martin and Bögl, Markus Dobermann, Eduard Bernard, Jürgen 2010. Heidelberg, Berlin Springer, Health. and Biology for Dobson. J. Annette and Barnett G. 2002. Adrian edition, 2nd USA, NY, York, New Springer, Statistics. in Texts Springer Davis. A. Richard and Brockwell J. Peter rceig fte7hItrainlErVsWrso nVsa Analytics, Visual on Workshop EuroVis International 7th the of Proceedings odnDy a rnic,C,UA 1970. USA, CA, Francisco, San Holden-Day, . onWly&Sn,Hbkn S,2011. USA, Hoboken, Sons, & Wiley John . xrm rgamn xlie:EbaeChange Embrace Explained: Programming Extreme eilg fGraphics of Semiology onWly&Sn,Hbkn S,4heiin 2008. edition, 4th USA, Hoboken, Sons, & Wiley John . ieSre nlssadFrcsigby Forecasting and Analysis Series Time h nvriyo icni rs,1983. Press, Wisconsin of University The . nrdcint ieSre n Forecasting and Series Time to Introduction ae 8–9.IE,2015. IEEE, 189–190. pages , nlsn esnlHat Data Health Seasonal Analysing ieSre nlss oeatn and Forecasting Analysis: Series Time ae 3–4.Srne S 2005. US, Springer 131–146. pages , ieSre:Ter n Methods and Theory Series: Time pigr 2000. Springer, . optrGahc Forum Graphics Computer ieGauaiisin Granularities Time ieSre Analysis: Series Time Addison-Wesley, . rceig of Proceedings ae 31–35. pages , Statistics . Data 113 , . . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [CCH13] 114 [CB14] [BRG [BPC Lewis. Toby and Barnett Vic [BL98] [BP70] [BPS [CC08] [BW08] [BSH [Bro11] [BM13] + + + + 07] 10] 16] 12] 91)27–35 2013. 19(12):2376–2385, mrcnSaitclAssociation Statistical American rnatoso iulzto n optrGraphics Computer and Visualization on Transactions isn aaExploration Data Missing iulitrciepercsigo iesre aa nAdesKre n Stefan and Kerren Andreas In data. series time of preprocessing Visual-interactive Graphics R oue8,pgs3–8 ikpn nvriyEetoi rs,2012. Press, Electronic University Linköping 39–48. pages 81, volume nentoa ofrneo nomto Visualisation Information on Conference International ate rhe n aaaMnnr ut-ee yooyo btatvi- abstract of typology multi-level A tasks. sualization Munzner. Tamara and Brehmer Matthew iou hn,Dan ok n ek Hofmann. Heike and Cook, Dianne Cheng, Xiaoyue Chan. Kung-Sik and Cryer D. Jonathan Publishing. Research Scientific time. of Publisher: 3 problem The Number: 2014. Balestra. 5(3):250–258, Luisella and Chong Choy Li aesthetics. & geometry – graphs Stacked Wattenberg. Martin and Byron Lee temporal 2016. of 22(1):559–568, patterns visualize to data. time in Folding evolution Curves: Time Grabowski, Tom Dragicevic. Madhyastha, Pierre Tara and Heulot, Nicolas Shi, Conglei Bach, Benjamin editor, 2011. Lovric, Science Miodrag Statistical In of series. clopedia Time Brockwell. J. Peter editors, Seipel, Kohlhammer. Jörn and May, Thorsten Goroll, Oliver Ruppert, Tobias Bernard, Jürgen In forecasting. series time for interface plot simultaneous river with forecasting A Similarity-based previews: Jank. Wolfgang Shneiderman, and Ben Aris, Shmueli, Aleks Galit Simeone, Adalberto Plaisant, Catherine Buono, Paolo effectiveness the on resolution block visualization. and pixel-based Thornton. demands of Ian task and Murray, of Tavi impact Jänicke, the Heike Evaluating Chen, Min Proctor, Karl Borgo, Rita autocorrelations models. residual series of time Distribution average moving Pierce. autoregressive-integrated A. in David and Box P. E. George pigrTxsi ttsis pigr e ok Y S,2deiin 2008. edition, 2nd USA, NY, York, New Springer, Statistics. in Texts Springer . 66:6–7,2010. 16(6):963–972, , rceig fSGA,ItrcieVsa nlsso Data of Analysis Visual Interactive SIGRAD, of Proceedings EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE 03 akg eso 0.1-5. version package R 2013. , ulesi ttsia data statistical in Outliers EETascin nVsaiainadComputer and Visualization on Transactions IEEE ae 6110.Srne,Bri,Heidelberg, Berlin, Springer, 1601–1605. pages , 532:5912,1970. 65(332):1509–1526, , ieSre nlss ihApiain in Applications With Analysis. Series Time isnDtGI U for GUI A MissingDataGUI: ae 9–9.IE,2007. IEEE, 191–196. pages , 46:2515,2008. 14(6):1245–1252, , ie,3deiin 1998. edition, 3rd Wiley, . nentoa Ency- International rceig fthe of Proceedings oenEconomy Modern ora fthe of Journal IEEE , , , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [Das13] [CT82] [Coh10] [Coh04] [CMS99] [CM09] [EHR [DGH06] [CTB Cleveland. S. William [Cle93] [CL13] [CG16] [CENC03] [Cle94] + + 12] 14] ely pe adeRvr J 2010. NJ, River, Saddle Upper Wesley, rnatoso iulzto n optrGraphics Computer and Visualization on Transactions //gretl.sourceforge.net ora fItlietIfrainSystems Information Intelligent of Journal adoko aaQuality Data of Handbook prices. electricity next-day predict to models ARIMA Systems nentoa ora fForecasting of Journal International VAST Technology, and Science Analytics Visual on Conference IEEE airCnrrs oai sioa rnic .Ngls n noi .Conejo. J. Antonio and Nogales, J. Francisco Espinola, Rosario Contreras, Javier n hitpe nrw.Tehmni h op e ietosfrvsa analytics. visual for directions new loop: the is Fiaux, human Patrick forecasting. The North, Andrews. Chris series Christopher Ramakrishnan, and Naren time Hossain, of Shahriar M. years Endert, Alex 25 Hyndman. J. Rob and Gooijer De G. Jan editor, Sadiq, Shazia In data. your in Monsters glitches: Data Dasu. Tamraparni 2012. In decomposition. event seasonal-trend abnormal for using analytics examination media and social detection Spatiotemporal S. David Ertl. Maciejewski, Thomas Ross and Jang, Yun Ebert, Bosch, Harald Thom, Dennis Chae, Junghoon seasonal for methods Graphical Terpenning. J. adjustment. Irma and Cleveland S. William Cohn. Mike 2004. USA, CA, City, Redwood Cohn. Mike 1999. Inc., think. Publishers to Kaufmann vision Morgan Using Shneiderman. Ben and In Mackinlay, D. Jock Card, K. Stuart 2009. USA, NY, York, New Springer, R! Use Metcalfe. V. Andrew and Cowpertwait S.P. Paul 1994. USA, Cleveland. S. William 1.9.12 Version Guide, User’s - Library Time-series Lucchetti. and Riccardo and Cottrell Allin optimize? processes visualization may What Golan. Amos and Chen Min ednsi nomto iulzto:uigvso othink to vision using visualization: information in Readings 83:0412,2003. 18(3):1014–1020, , srSoisApid o gl otaeDevelopment Software Agile For Applied: Stories User ora fteAeia ttsia Association Statistical American the of Journal uceigwt gl:Sfwr eeomn sn Scrum Using Development Software Agile: with Succeeding iulzn data Visualizing h lmnso rpigdata graphing of elements The ae 6–7.Srne elnHiebr,2013. Heidelberg, Berlin Springer 163–178. pages , ls iie nOt 2 2020). 12, Oct. on visited (last 23:4–7,2006. 22(3):443–473, , oatPes umt J S,1993. USA, NJ, Summit, Press, Hobart . 33:1–3,2014. 43(3):411–435, , rt:GURgeso,Econometric Regression, GNU Gretl: nrdcoyTm eiswt R with Series Time Introductory EETascin nPower on Transactions IEEE oatPes umt NJ, Summit, Press, Hobart . 21)21–62 2016. 22(12):2619–2632, , ac 2013. March , 737:26,1982. 77(377):52–62, , Addison-Wesley,. ae 579–581. pages , ae 143–152, pages , Proceedings Addison- . http: IEEE 115 . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [GAK [HA04] [GM13] [GH07] [Fun36] [FRGTA14] [Few08] [ERT [Fun37] [FGR05] [GBFM16] [GMPn99] [FHR 116 + + + 17] 14] 11] 1936. dacdVsa nefcs AVI Interfaces, Visual Advanced rica nelgneReview Intelligence Artificial RM oes kpigapoc essadtv ule approach. outlier additive versus approach Skipping models: ARIMA effects and events, plans, visualizing and Exploring CareCruiser: Seyfang. Andreas ls iie nOt 3 00,Jn 2008. June 2020), 13, Oct. on visited (last PacificVis Econometrics business_intelligence/time_on_the_horizon.pdf gai ízBac,adFbieRsi h tt fteati nertn machine integrating in art the Nabney, of analytics. state Ian visual The into Wong, Rossi. learning William Fabrice and Turkay, Blanco, Cagatay Díaz Ribarsky, Ignacio William Endert, Alex itraJ og n i utn uvyo ule eeto methodologies. detection outlier of survey A Austin. Jim and Hodge J. Victoria in observations Missing Peña. Daniel and Maravall, Agustín Gómez, Víctor 2013. Analysis, Economic of Tech- Bureau adjustment. report, seasonal nical to approach multivariate A Greenaway-McGrevy. Ryan Models level/Hierarchical Hill. Jennifer and Gelman Andrew Graphics Computer study. and user Visualization comparative on A Visual uncertainty: Miksch. temporal Silvia of and encodings Federico, Paolo Bögl, Markus Gschwandtner, Theresia In interactively. and Miksch, Silvia Kaiser, Katharina Aigner, Wolfgang Gschwandtner, Theresia representation data. graphical statistical the of of development Historical Funkhouser. Gray Howard graph. century tenth a on note A Funkhouser. Gray Howard of Identification outliers. multivariate Thomas-Agnan. local Christine and Ruiz-Gazen, Anne Filzmoser, Peter qualitative In with visualization abstractions. time-series Space-efficient Silvia Graphs: and Qualizon Aigner, Wolfgang Miksch. Rind, Alexander Hoffmann, Stephan Intelli- Federico, Paolo Business Visual outlier Multivariate 2005. Reimann. geochemistry. Clemens exploration in and detection Garrett, G. Robert horizon. Filzmoser, Peter the on Time Newsletter, gence Few. Stephen ae 35,2011. 43–50, pages , 82:4–6,1999. 88(2):341–363, , rceig fte1t nentoa okn ofrneon Conference Working International 12th the of Proceedings rceig fte4hIE aicVsaiainSymposium, Visualization Pacific IEEE 4th the of Proceedings Osiris http://www.perceptualedge.com/articles/visual_ :6–0,1937. 3:269–404, , abig nvriyPes e ok S,2007. USA, York, New Press, University Cambridge . ttsia Papers Statistical 22:516 2004. 22(2):85–126, , optrGahc Forum Graphics Computer ae 7–8.AM 2014. ACM, 273–280. pages , aaAayi sn ersinadMulti- and Regression Using Analysis Data optr Geosciences & Computers 21:3–4,2016. 22(1):539–548, , 51:94,2014. 55(1):29–47, , 68:5–8,2017. 36(8):458–486, , ie a.2,2016, 20, Mar. cited , EETransactions IEEE Osiris 31(5):579–587, , 1:260–262, , ora of Journal Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [HCL05] [HK07] [HKPC19] [HK10] [HB03] [HN98] [HJM [HA18] [HR07] [HKF16] [Ham94] [HKB11] [HJS + + 09] 11] mrcnStatistician American 2009 VAST ’05) (CHI Systems Computing in Factors rnatoso iulzto n optrGraphics Computer and Visualization on Transactions h mrcnStatistician American The Graphics Forum Graphics ls iie nOt 2 2020). 12, Oct. on visited (last EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE Txs ebun,Asrla n dto,2018. edition, 2nd Australia, Melbourne, OTexts, Athanasopoulos. George and Hyndman J. Rob 2007. graphics. data statistical in transitions Animated Robertson. George and Heer Jeffrey synergism. trace plot-density box A plots: Violin Nelson. D. Ray and Hintze L. Jerry frontiers. Visual next the for Chau. survey Horng interrogative Duen An and learning: Pienta, deep Robert in analytics Kahng, Minsuk Hohman, Fred for scatterplot series. connected time The paired Franconeri. presenting Steve and Kosara, Robert Haroz, for Steve program A II: Amelia Blackwell. Matthew data. missing and King, Gary time-series Honaker, in James values missing about do to What data. cross-section King. Gary and Honaker James models. regression data of incomplete comparison fit A to software nothing: and about methods ado data Much missing Kleinman. Ken and Horton Nicholas 2009. Technology, and In Science series. Analytics Visual time on of Symposium prediction IEEE Visual at A. presented Poster: Daniel Dayal, Castellanos. Umeshwar Malu Sharma, and K. Keim, Ratnesh Janetzko, Halldór Hao, C. Ming series. time seasonal analytics large visual A of prediction Sharma. peak-preserving K. for Ratnesh Umeshwar approach and Hill, Marwah, William Manish Keim, Mittelstädt, A. Sebastian Daniel Dayal, Janetzko, Halldór Hao, C. Ming interactive for toolkit A Prefuse: In Landay. A. visualization. James information and Card, K. Stuart Heer, Jeffrey selecting for tool maps. for online schemes An colour Colorbrewer.org: Brewer. Cynthia and Harrower Mark 1994. USA, NJ, Hamilton. D. James 29:1428,2016. 22(9):2174–2186, , ae 2–3,2009. 229–230, pages , ora fSaitclSoftware Statistical of Journal 03:9–0,2011. 30(3):691–700, , mrcnJunlo oiia Science Political of Journal American iesre analysis series Time 11:99,2007. 61(1):79–90, , 22:8–8,My1998. May 52(2):181–184, , atgahcJunl The Journal, Cartographic rceig fteSGH ofrneo Human on Conference SIGCHI the of Proceedings EETascin nVsaiainadComputer and Visualization on Transactions IEEE ae 2–3.AM 2005. ACM, 421–430. pages , rneo nvriyPes Princeton, Press, University Princeton . 57:–7 2011. 45(7):1–47, , oeatn:picpe n practice and principles Forecasting: https://otexts.com/fpp2 01:73,2003. 40(1):27–37, , 58:6429,2019. 25(8):2674–2693, , 42:6–8,2010. 54(2):561–581, , 13(6):1240–1247, , Computer Poster IEEE The 117 . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [IHS13] [JC14] [HS12] [IIC [KAF [HSA 118 [JSMK14] [JME10] [JLC19] [HXG02] [Jon80] [Jon85] + 13] + + 10] 08] 66:2–3,2010. 16(6):927–934, nml eeto o iulaayiso oe osmto data. consumption power Graphics of analytics visual for detection Anomaly visualization. evaluating of practice the on review systematic A for routes flight display to maps of generation automatic An Vinot. Queue ls iie nOt 2 2020). 12, Oct. on visited (last Engineering nomto iulzto:HmnCnee susadPerspectives and Issues Human-Centered Visualization: Information (AVI’10) Interfaces Visual Advanced on Conference International i rffi otolr:Srcueadclrotmzto.In optimization. color and Structure controllers: traffic air eryHe n e hedra.Itrciednmc o iulanalysis. visual for dynamics Interactive Shneiderman. Ben and Heer Jeffrey nCmue cec,pgs1415 pigr 2008. Springer, 154–175. pages Science, Computer editors, in North, Chris challenges. and and Fekete, Jean-Daniel process, Stasko, Definition, T. John analytics: Kerren, Andreas Visual In Melançon. Guy Kohlham- and Jörn Görg, mer, Carsten Fekete, Jean-Daniel Andrienko, Gennady Keim, Daniel Keim. A. Daniel and Mittelstädt, Sebastian Stoffel, Florian Janetzko, Halldór In data. spaced unequally with analysis statistics series Time Jones. H. Richard with series time to models arma of observations. fitting missing likelihood Maximum Jones. H. Richard of perception Graphical Elmqvist. series. Niklas time multiple and McDonnel, Bryan Javed, Waqas interactive on advances research Recent learning. Chen. machine Changjian and Liu, Shixia Jiang, Liu 0.4-1. version package R Cloern. 2014. E. James and Jassby D. Alan Graphics Computer and Visualization on Möller. Thorsten and Sedlmair, Michael Chen, Jian Isenberg, Petra Isenberg, Tobias Inc. Global, IHS and network neural of study prediction. comparative series A time Goh. in N. modeling Thong ARIMA and Box-Jenkins Xie, Min Ho, L S. 2010. Jean-Luc and Tabart, Gilles Alonso, Roland Serrurier, Mathieu Hurter, Christophe 02:03–05,2012. 10(2):30:30–30:55, , oue5 ae 5–7.Esve,Asedm ehrad,1985. Netherlands, Amsterdam, Elsevier, 157–178. pages 5, volume , 82–7 2014. 38:27–37, , 22:7–7,2002. 42(2):371–375, , Ves eso 8 Version EViews, ora fVisualization of Journal EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE Technometrics rie A 2013. CA, Irvine, . 23:8–9,1980. 22(3):389–395, , q xlrn ae ult oioigdata monitoring quality water Exploring wq: 91)21–87 2013. 19(12):2818–2827, , 22:0–1,2019. 22(2):401–417, , http://www.eviews.com optr Industrial & Computers rceig fthe of Proceedings EETransactions IEEE ae 233–240, pages , etr Notes Lecture , optr & Computers adokof Handbook ACM , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [KJL14] [KKEM10] [KPB16] [KMT09] Templ. Matthias and Schopfhauser, Daniel Meraner, Angelika Kowarik, Alexander [KMST12] [KM12] [KKA95] [KBK11] [LAB [KMS [KHP + + + 09] 11] 08] 9 (Vis95) ’95 ol o iulAnalytics Visual for Tools lmts A S,20.IEEE. 2009. USA, CA, Alamitos, 2010. Germany, Goslar, Association, ofrneo uoenSaitcas okSsino ttsia aaEditing Data Statistical on Session Work Statisticians, European of Conference optrGraphics Computer iulzto n o uhanalytics? much how and visualization io rtjć nioBrii n ailA em luLns opc display Compact time-series. multiple CloudLines: in Keim. episodes A. event Daniel of and Bertini, Enrico Krstajić, Miloš entoa ofrneo nomto iulsto,I 2009 IV Visualisation, Information In on Conference ternational visualizations. interactive pixel-based and for patterns views temporal Hierarchical aggregated Smuc. Mayr, Michael Eva and Gärtner, Miksch, Johannes Silvia Bertone, Alessio Aigner, Wolfgang Lammarsch, Tim (WHI2016) Learning interpret Machine to in In analytics visual models. Using learning machine Bertini. predictive Enrico and Perer, Adam Krause, Josua much how analytics: Visual Thomas. Jim and Mansmann, Florian Keim, In A. Europe. Daniel for R. Commission Economic in - Nations data United dependent 2012. Norway, Oslo, time of detection outlier and adjustment Interactive 2008. Heidelberg, Berlin, H. Michael Simoff, Simeon In editors, Mazeika, challenges. Arturas and and Hartmut Boehlen, Scope and Thomas, analytics: Jim Visual Schneidewind, Jörn Ziegler. Mansmann, Florian Keim, A. Daniel R-project.org/package=x12 processing batch for Meraner. structure Angelika and Kowarik Alexander Analytics Visual with Problems Solving Mansmann. - Age Florian Information and The Ellis, ing Geoffrey Kohlhammer, Jörn Keim, A. Daniel In A data. of pattern: amounts Recursive large very Ankerst. visualizing Mihael for and technique Kriegel, Hans-Peter Keim, A. Daniel long- of 2014. In visualization trend sets. Multi-scale data temperature Liu. term Jiayi and Jusufi, Ilir Kerren, Andreas data. transformations credible and and Visualizations usable wrangling: for data in directions Ham, Paolo Research and van Brodbeck, Buono. Dominique Frank Lee, Kennedy, Bongshin Weaver, Jessie Chris Riche, Plaisant, Henry Nathalie Catherine Heer, Jeffrey Kandel, Sean ae 7–8,1995. 279–286, pages , 71)23–49 2011. 17(12):2432–2439, , etr oe nCmue cec LC) Springer, (LNCS). Science Computer in Notes Lecture , nomto Visualization Information rceig fSGA 04-Vsa Computing Visual - 2014 SIGRAD of Proceedings 02 akg eso 1.0-3. version package R 2012. , e ok Y S,2016. USA, NY, York, New , ls iie nOt 2 2020). 12, Oct. on visited (last iulDt iig hoy ehiusand Techniques Theory, Mining: Data Visual CLWrso nHmnInterpretability Human on Workshop ICML IKDExplorations SIGKDD EETascin nVsaiainand Visualization on Transactions IEEE 1:X2-waprfnto and function wrapper - X12 x12: rceig fte1t In- 13th the of Proceedings 04:7–8,2011. 10(4):271–288, , rceig Visualization Proceedings 12:–,2009. 11(2):5–8, , ae 44,Los 44–49, pages , Eurographics . http://CRAN. Master- 119 , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [MA14] [LGH [LAB [LAB [LR02] [LTM18] 120 [LYK Lambert. Heinrich Johann [Lam79] [Mac86] [LOK13] [LB78] [LBI + + + + + 12] 12] 13] 11] 17] okhp C-D 03 eda otCI21,Mrbr lvna uy1-3, July Slovenia, Maribor, 2013, SouthCHI at Held 2013, HCI-KDD Workshop, rnatoso iulzto n optrGraphics Computer and Visualization on Transactions Graphics Computer and Visualization on Transactions ie os ooe,UA n dto,2002. edition, 2nd USA, Hoboken, Sons, & Wiley a 1-Jn ,2011 3, June - 31 May Graphics 0–1.Srne,2013. Springer, 400–419. h tt-fteati rdcievsa analytics. visual predictive in state-of-the-art The Norway, Bergen, 2011, EuroVA@EuroVis Analytics, Visual on Workshop EuroVis 03 Proceedings 2013. id oad ocp o h tutr ftm a upr h iulanalytics visual editors, the Santucci, G. support and can Miksch time S. of In structure process. the how Alexander concept and a Towards Miksch, Silvia Rind. Bertone, Alessio Aigner, Wolfgang Lammarsch, Tim okMcily uoaigtedsg fgahclpeettoso relational of presentations graphical of design information. the Automating Mackinlay. Jock data. time-oriented 2014. data–users–tasks of 38:286–290, a analytics Applying visual time: to of triangle matter design A Aigner. Wolfgang and Miksch Silvia references. Keim. temporal with Daniel collections and text Ribarsky, exploring William Visually Krstajic, EvenRriver: Milos Yang, Jing Luo, Dongning with tasks to goals reports. from analysis Bridging study Munzner. design Tamara and Tory, Melanie Lam, Heidi Rubin. Donald and Little Roderick (HEAT) Temperature Kim. and Ho and Ohn, Il-Sang Lim, Youn-Hee 2017. 36(3):539–562, Maciejewski. Ross and Gleicher, Michael Hansen, Brett Garcia, Rolando Lu, Yafeng Sheelagh and scenarios. Plaisant, Seven Catherine visualization: information Isenberg, in studies Petra Empirical Bertini, Carpendale. series Enrico time in Lam, fit of Heidi lack of measure a On Box. P. E. models. George and Ljung M. Greta International Third - Data Big Unstructured, Complex, in Discovery In Knowledge and data. Theresia time-oriented of Bögl, representation transformation symbolic visual Markus for Interactive Rind. Bertone, Alexander and Alessio Miksch, Silvia Gschwandtner, Aigner, Wolfgang Lammarsch, Tim Biometrika 41:3–4,2018. 24(1):435–445, , C rnatoso Graphics on Transactions ACM oue74 of 7947 volume , 52:9–0,1978. 65(2):297–303, , ae –2 egn owy 2011. Norway, Bergen, 9–12, pages , 03 akg eso 1.2. version package R 2013. , Pyrometrie EETascin nVsaiainadComputer and Visualization on Transactions IEEE ttsia nlsswt isn Data Missing with Analysis Statistical etr oe nCmue Science Computer in Notes Lecture eln 1779. Berlin, . ET elhEet fArPollution Air of Effects Health HEAT: ()1011 1986. 5(2):110–141, , rceig fte2dInternational 2nd the of Proceedings ua-optrInteraction Human-Computer optrGahc Forum Graphics Computer 81:315 2012. 18(1):93–105, , 2012. 18(9):1520–1536, , optr Graphics & Computers pages , John . IEEE IEEE , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [PnP01] [Pla01] [PKRJ10] [Pfa08] [Naz88] [Mun09] [Mor77] [Pla86] [MV15] [Mun08] [MHR [Mah36] [Pot06] + 11] iulzto fLreadUsrcue aaSt,G-dto etr oe in Notes Lecture GI-Edition Sets, Data Unstructured and Large of Visualization iulzto:HmnCnee susadPerspectives and Issues Human-Centered Visualization: rnatoso iulzto n optrGraphics Computer and Visualization on Transactions uigshct o e nägnbszrGegenwart zur bis Anfängen den von Musikgeschichte elg äerie,21.4 Auflage. 4. 2015. Bärenreiter, Verlag; Graphics uoi 2010) Eurovis eisA(General) A Series nomtc (LNI) Informatics fteNtoa nttt fSine (Calcutta) Sciences of Institute National the statistics. of in distance generalized the On Mahalanobis. Chandra Prasanta rsi otr ehd o rsnigsaitclifrain h o plot. box The information: statistical presenting for Methods robust Potter. and Kristin detection outlier Multivariate Prieto. estimation. matrix J. covariance Francisco and Peña 1801. edition, Daniel ed. 3rd etc, century Wallis, eighteenth J. the for of Burton T. whole by the during England expenditure revenues, of commerce, debts the and of progress the charts, copper-plate stained Playfair. William 2005 Press, University Cambridge I. 1786. Spence edition, and H. Wainer by introduced and edited Playfair. William Visualizing Johnson. uncertainty. R. and Chris statistics and summary Riesenfeld, Richard Kniss, Joe Potter, Kristin 2008. USA, NY, York, New Springer, Pfaff. Bernhard 1988. USA, NY, Inc., Dekker, Marcel Monographs. and Textbooks Statistics: Nazem. M. Sufi Vogel. Günter and Michels Ulrich validation. and design visualization for model nested A Munzner. Tamara Springer. 2008. Heidelberg, 134–153, editors, pages North, C. and Fekete, J.-D. Stasko, J.T. research Kerren, visualization A. information In writing papers. in pitfalls and Process Munzner. Tamara cycle. sunspot the Forecasting Morris. J. M. a - hotspots Forecasting Ebert. S. David approach. and analytics predictive Cleveland, A. S. Michael Larew, William G. Mitchell, Stephen Rudolph, Stephen Hafen, Ryan Maciejewski, Ross 74:4–5,2011. 17(4):440–453, , 93:2–3,2010. 29(3):823–832, , ple ieSre nlssfrBsns n cnmcForecasting Economic and Business for Analysis Series Time Applied nlsso nertdadCitgae ieSre ihR with Series Time Cointegrated and Integrated of Analysis -:716 2006. S-4:97–106, , h omriladpltclals:rpeetn,b en of means by representing, : atlas political and commercial The 4()4748 1977. 140(4):437–468, , h omriladPltclAlsadSaitclBreviary Statistical and Atlas Political and Commercial The Technometrics EETascin nVsaiainadComputer and Visualization on Transactions IEEE optrGahc ou Poednsof (Proceedings Forum Graphics Computer t-ta ui:sseaice Teil: systematischer Musik: dtv-Atlas ora fteRylSaitclSociety. Statistical Royal the of Journal 33:8–1,2001. 43(3):286–310, , :95,1936. 2:49–55, , 56:2–2,2009. 15(6):921–928, , etce Taschenbuch Deutscher . oue45 of 4950 volume , odn Printed London: . Proceedings Information s R! Use . LNCS IEEE 121 , . . Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [RLA [PW04] [RLK [Sad13] [RLS C20] [R 122 [Rob07] [RG10] [Rei08] [S [Rou85] + 08] + + + 14] 13] 15] 924–26 2013. 19:2247–2256, iWe okhpFo hoyt rcie ein iinadVisualization and Vision Design, Practice: to Theory From Workshop VisWeek eprt Ages Temperate Timeline 14.IE,2015. IEEE, 41–48. xlntosadRfrne oteMs osdrbeWiesi hsc and Physick in Writers Considerable Most the to References and Explanations hlspy ohAcetadMdr:t hc r de,AhrssUo the Upon Aphorisms Added, are which to Modern: and Ancient Both Philosophy, ml-o,Mals n te itmes o oWl nw oFre More Former to Known Well So Not Distempers, Other and Measles, Small-pox, In visualization. 2014 25-31, October France, Paris, Posters, – VAST Technology, and Science 2015 25-30, October USA, IL, Chicago, 2015, VAST Technology, and Science In analytics. visual oe .Pn n ehJ et.TeNMPdt package. NMMAPSdata The Welty. 2004. J. Leah and Peng D. Roger hzaSdq rlge eerhadpatc ndt ult aaeet In management. 2013. quality Heidelberg, data in editor, practice Sadiq, and Shazia Research Prologue: Sadiq. Shazia al. et Sprengell J. C. Applications point. breakdown and Statistics high ical with estimation Multivariate Rousseeuw. J. Peter (CMV) exploratory Visualization in views Exploratory multiple in & Coordinated art: the of State Roberts. C. Jonathan 2014. IEEE, 269–270. segmentation time-series In on influence labeling. parameter and Analyzing Miksch. Alsallakh, Bilal Silvia Bögl, and Markus Schumann, Heidrun Luboschik, Martin Röhlig, Martin by recognition activity Supporting Miksch. Silvia and Alsallakh, Schumann, Bilal Heidrun Bögl, Kirste, Markus Thomas Krüger, Frank Luboschik, Martin Röhlig, Martin of analytics visual for library software and model data. data time-oriented Silvia A and Alsallakh, TimeBench: Bilal Aigner, Miksch. Wolfgang Lammarsch, Tim Rind, Alexander Grafton. Anthony and Rosenberg Daniel In 2008. graph. horizon the of development The Reijner. Hannes 2020. Austria, Vienna, Computing, Statistical for Foundation Team. Core R rneo rhtcua rs,21.Gol-ok-D DqWqKVzipToC. Google-Books-ID: 2010. Press, Architectural Princeton . .Bnik .Femn .Gown n tes 1708. others, and Goodwin, T. Freeman, W. Bonwick, R. . :ALnug n niomn o ttsia Computing Statistical for Environment and Language A R: rceig fteCneec nCodntdadMlil Views Multiple and Coordinated on Conference the of Proceedings rceig fte21 EECneec nVsa Analytics Visual on Conference IEEE 2014 the of Proceedings rceig fte21 EECneec nVsa Analytics Visual on Conference IEEE 2015 the of Proceedings EETascin nVsaiainadCmue Graphics Computer and Visualization on Transactions IEEE poim fHportsadteSnecso ess With Celsus: of Sentences the and Hippocrates of Aphorisms adoko aaQuality Data of Handbook :8–9,1985. 8:283–297, , ae 17.IE,2007. IEEE, 61–71. pages , atgahe fTm:AHsoyo the of History A Time: of ae –1 pigrBerlin Springer 1–11. pages , News R rceig fthe of Proceedings 4(2):10–14, , Mathemat- pages , pages , R . , , Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [Shn96] [SDZ [SSS [SNHS13] [SMM12] [SFdOL04] [SMY [Shn83] [SBVLK09] [SAS12] [Sch99] [SS11] + + 14] + 00] 05] ihRexamples. R With iulzto InfoVis Visualization uaHsia n auiKsd.Totn suoclrn:Cmatvisualiza- Compact coloring: pseudo Two-tone Kaseda. Takumi and Hoshiya, Yuka rnatoso iulzto n optrGraphics Computer and Visualization on Transactions utpeVesi xlrtr iulzto,CMV04 Visualization, Exploratory in Views Multiple iulcutraayi ftaetr aawt neatv ooe maps. Kohonen interactive with data trajectory of analysis cluster Visual optrGraphics Computer Research iulztos In visualizations. InfoVis Computer IEEE ay C oebr2012. November NC, Cary, Inc. Institute SAS n ailA em nweg eeainmdlfrvsa analytics. visual for model generation Knowledge Ellis, Keim. Geoffrey Kwon, A. Chul Daniel Bum and Stoffel, Florian Stoffel, Andreas Sacha, Dominik Stoffer. S. David and Shumway H. Robert tasks. visualization A of Schumann. Heidrun space and Heitzler, design Magnus Nocke, Thomas Schulz, Hans-Jörg In data. one-dimensional for tion Saito, Hiroki Yamamoto, Mitsuyoshi Miyamura, Nakamura Hiroko Saito, Takafumi Graphics Computer stacks. and the and methodology: trenches study the Design from Munzner. Reflections Tamara and Meyer, Miriah Sedlmair, Michael IEEE. IEEE, 1996. information USA, for Colorado, taxonomy Boulder, type 336–343, data by task A it: have eyes The Shneiderman. Ben languages. programming beyond step A manipulation: Direct Shneiderman. Ben case A data: spatio-temporal In of Haim study. exploration and assist Oliveira, to de F. views Christina Coordinated Maria Levkowitz. Flores, F. Edilson Shimabukuro, H. Milton Cambridge, Institute, Effects Health Methods 94, 2000. I: Report Research Part study. issues. pollution Douglas methodologic air and and and Schwartz, mortality, morbidity, Joel national Zeger, The Scott Dockery. Dominici, Francesca Samet, Jonathan primer. a imputation: Multiple Schafer. Joseph oisShek ügnBrad ain o adsegr n önKohlhammer. Jörn and Landesberger, Von Tatiana Bernard, Jürgen Schreck, Tobias ae 7–8.IE,2005. IEEE, 173–180. pages , ()31,1999. 8(1):3–15, , rceig fte2dItrainlCneec nCodntdand Coordinated on Conference International 2nd the of Proceedings 68:76,1983. 16(8):57–69, , rceig 96IE ypsu nVsa Languages Visual on Symposium IEEE 1996 Proceedings pigr e ok r dto,2011. edition, 3rd York, New Springer, M 0Mdln n utvraeMethods Multivariate and Modeling 10 JMP 91)26–35 2013. 19(12):2366–2375, , ()1–9 2009. 8(1):14–29, , 81)23–40 2012. 18(12):2431–2440, , http://www.jmp.com EESmoimo nomto Visualization, Information on Symposium IEEE EETascin nVsaiainand Visualization on Transactions IEEE ieSre nlssadisApplications. its and Analysis Series Time EETascin nVisualization on Transactions IEEE ttsia ehd nMedical in Methods Statistical ls iie nOt 2 2020). 12, Oct. on visited (last ae 0–1.IE,2004. IEEE, 107–117. pages , 01)10–63 2014. 20(12):1604–1613, , A nttt Inc., Institute SAS . Information pages , IEEE 123 Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [STA [Sta11] [Til75] 124 [vB14] [vB12] Tufte. R. Edward [Tuf06] [Tuf83] [Tsa00] [TC05] [TAKP13] [Urb20] Tk7 onW Tukey. W. John [Tuk77] [Tsa10] [The10] [TAS04] + 13] iulzto n muaino isn Values Missing of Imputation and Visualization est fteauthor the of website eeomn gnafrVsa Analytics Visual for Agenda Development X 2011. TX, Science mathworks.com StataCorp. 00;Nweiino v1]aalbe[vB18]]. available [vB12] of edition New 2020); [ Avail- from 2014] 18, Feb. at [accessed online 2014. able Online, imputation. Multiple Buuren. van Stef 2012. USA, Raton, Boca Burren. van Stef version package R 2020). 2020. interface, java to R 0.9-13. Low-level rJava: Urbanek. Simon 1983. USA, CT, Cheshire, Tufte. R. Edward 2010. edition, edition 3rd Sons, & Wiley John Statistics. Tsay. S. Ruey research. Association future Statistical and American history the Brief of forecasting: and series Time Tsay. S. Ruey graphs. experimental Early Tilling. Laura Inc. MathWorks The Cook. A. Kristin and Thomas J. James SAC’04 computing In layouts. visualiza- Axes-based radial with Schumann. tions Heidrun and Abello, James Tominski, Christian Prantner. Bernd and Kowarik, Alexander Alfons, Andreas Templ, Matthias 0.9.0. version package and Kowarik, Prantner. Alexander Bernd Alfons, Andreas Templ, Matthias Schopfhauser, Daniel vB12 ()1323 1975. 8(3):193–213, , http://CRAN.R-project.org/package=rJava http://www.stata.com tt ttsia otae ees 12 Release Software: Statistical Stata ;[paefo c.1,22:Wbientacsil nmr.New anymore. accessible not Website 2020: 13, Oct. from [Update ]; http://www.multiple-imputation.com nlsso iaca ieSeries Time Financial of Analysis xlrtr aaAnalysis Data Exploratory IGI iulzto n muaino isn Values Missing of Imputation and Visualization VIMGUI: ls iie nOt 2 2020). 12, Oct. on visited (last lxbeIptto fMsigData Missing of Imputation Flexible euiu Evidence Beautiful h iulDslyo uniaieInformation Quantitative of Display Visual The ae 2214.AM 2004. ACM, 1242–1247. pages , https://stefvanbuuren.name/ ALB(R2009a) MATLAB rceig fte20 C ypsu nApplied on symposium ACM 2004 the of Proceedings rpisPes hsie T S,2006. USA, CT, Cheshire, Press, Graphics . ls iie nOt 2 2020). 12, Oct. on visited (last 540:3–4,2000. 95(450):638–643, , luiaigtePt:TeRsac and Research The Path: the Illuminating EE o lmts A S,2005. USA, CA, Alamitos, Los IEEE, . Addison-Wesley, 1977. . h rts ora o h itr of History the for Journal British The aik A 2010. MA, Natick, . 03 akg eso 4.0.0. version package R 2013. , ie eisi rbblt and Probability in Series Wiley . ttCr P olg Station, College LP, StataCorp . hpa n Hall/CRC, and Chapman . ls iie nOt 13, Oct. on visited (last paeo pedxA Appendix of Update , ls iie nOt 12, Oct. on visited (last rpisPress, Graphics . http://www. 03 R 2013. , Journal VIM: Die approbierte gedruckte Originalversion dieser Dissertation ist an der TU Wien Bibliothek verfügbar. The approved original version of this doctoral thesis is available in print at TU Wien Bibliothek. [Wol13] [vW06] [Yul27] [WWS [War00] [vB18] [vWvS99] + 16] iulzto (InfoVis’99) Visualization ihseilrfrnet ofrssnptnumbers. sunspot wolfer’s to reference special with optrGraphics Computer hsclCharacter Physical graphs. stacked of understanding better Toward n dto,2018. edition, 2nd Buuren. van Stef h oa oit fLno.Sre ,Cnann aeso ahmtclor Mathematical a of Papers Containing A, Series London. of Society Royal series, the disturbed periodicities investigating of method a on VII. Yule. Udny George Graphics Computer and PieceStack: Cui. Weiwei and Qu, Huamin Shi, Conglei Wu, Yingcai Wu, Tongshuang http://www.wolfram.com/mathematica Inc. Research, Wolfram 2000. edition, 1st Inc., Publishers Ware. Colin visual- based calendar and Cluster In data. Selow. series van time R. of Edward ization and Wijk van J. Jarke visualization. on Views Wijk. van J. Jarke nomto iulzto ecpinfrdesign for Perception - Visualization Information lxbeIptto fMsigData Missing of Imputation Flexible 2(3)2728 1927. 226(636):267–298, , 24:2–3,2006. 12(4):421–432, , 26:6015,2016. 22(6):1640–1651, , ae –,SnFacso A S,19.IEEE. 1999. USA, CA, Francisco, San 4–9, pages , ahmtc,Vrin9.0.1 Version Mathematica, rceig 99IE ypsu nInformation on Symposium IEEE 1999 Proceedings EETascin nVsaiainand Visualization on Transactions IEEE ls iie nOt 2 2020). 12, Oct. on visited (last EETascin nVisualization on Transactions IEEE hlspia rnatosof Transactions Philosophical hpa n Hall/CRC, and Chapman . hmag,I,2013. IL, Champaign, . ognKaufmann Morgan . 125