On Incompressible Multidimensional Networks∗ Felipe S

Total Page:16

File Type:pdf, Size:1020Kb

On Incompressible Multidimensional Networks∗ Felipe S 1 On incompressible multidimensional networks∗ Felipe S. Abrahao˜ †, Klaus Wehmuth†, Hector Zenil‡ and Artur Ziviani†, Senior Member, IEEE Abstract In order to deal with multidimensional structure representations of real-world networks, as well as with their worst-case irreducible information content analysis, the demand for new graph abstractions increases. This article presents an investigation of incompressible multidimensional networks defined by generalized graph representations. In particular, we mathematically study the lossless incompressibility of snapshot-dynamic networks and multiplex networks in comparison to the lossless incompressibility of more general forms of dynamic networks and multilayer networks, from which snapshot-dynamic networks or multiplex networks are particular cases. We show that incompressible snapshot-dynamic (or multiplex) networks carry an amount of algorithmic information that is linearly dominated by the size of the set of time instants (or layers). This contrasts with the algorithmic information carried by incompressible general dynamic (or multilayer) networks that is of the quadratic order of the size of the set of time instants (or layers). Furthermore, we prove that incompressible general multidimensional networks have edges linking vertices at non-sequential time instants (layers or, in general, elements of a node dimension). Thus, representational incompressibility implies a necessary underlying constraint in the multidimensional network topology. Index Terms Algorithmic information, Complex networks, Lossless compression, Multidimensional systems, Network topology I. INTRODUCTION Complex networks science has been showing fruitful applications to the study of biological, social, and physical systems [5,33,50]. This way, as the interest and pervasiveness of complex networks modeling and network analysis increase in graph theory and network science, proper representations of multidimensional networks into new extensions of graph-theoretical abstractions has become an important topic of investigation [30, 31, 36]. In a general sense, a multidimensional network (also known as high-order network) is any network that has additional representational structures. For example, this is the case of dynamic (i.e., time-varying) networks [19, 35, 39, 42], multilayer networks [11, 24, 30], and dynamic multilayer networks [44,45,47]. In this regard, the general scope of this paper is to study the lossless incompressibility of multidimensional networks as well as network topological properties in such generalized graph representations. Unlike traditional methods from random graphs theory, such as the probabilistic method, and from data compression algorithm analysis, which is dependent on the chosen programming language, we present a theoretical investigation of algorithmic complexity and algorithmic randomness. First, this enables worst-case compressibility and network complexity analyses that do not depend on the choice of programming language [37, 52]. Secondly, it enables us to achieve mathematical proofs of the existence of certain network topological properties in generalized graphs, which are not necessarily generated or defined by stochastic processes [14, 34]. With this purpose, algorithmic information theory [15, 28, 34] has been giving us computably universal tools for studying data compression of individual objects [6, 26, 34, 43, 51]. On the one hand, first note that the source coding theorem [20] in classical information theory ensures that, as SV (G)S = n → ∞, every recursively labeled representation of a random graph G on n vertices and edge probability p = 1~2 in the classical Erdos–R˝ enyi´ model G(n; p)—from an independent and identically distributed stochastic process—is expected to be losslessly incompressible with probability arbitrarily close to one [32,34]. In this sense, as shown in [25,33,38,52], approaches to network complexity based on classical (or statistical) information theory have presented useful tools to find, estimate, or measure underlying graph-theoretic topological properties in random graphs or in complex networks. On the other hand, for some graphs G (with n vertices) displaying maximal degree-sequence entropy [53] or exhibiting a Borel-normal distribution of presence or absence of edges [7–9], the edge set E(G) is computable and, therefore, is algorithmically compressible on a logarithmic order [15, 28, 34]. That is, even though their inner structures might seem to arXiv:1812.01170v7 [cs.IT] 29 Apr 2020 be statistically homogeneous or to be following a uniform probability distribution, these graphs G may be compressed into O (log(n)) bits and thus are very far from being incompressible objects. Moreover, it has been shown that statistical estimations are not invariant to language description in general [52]. This comes from the reliance on probability distributions that require making a choice of feature of interest relevant to the measure of interest. For example, the node degrees for the study of their distribution in a network disregards other representations of the same object (network) and its possible underlying generating mechanism. While some of these statistical approaches may be useful when a feature of interest is selected they can only capture the complexity of the representation and not the object. This is in contrast to universal measures of randomness such as ∗In [4], a preliminary version of part of this paper is presented as an extended abstract. †National Laboratory for Scientific Computing (LNCC) – 25651-075 – Petropolis, RJ – Brazil. ‡Oxford Immune Algorithmics – RG1 3EU – Reading – U.K. Algorithmic Dynamics Lab, Unit of Computational Medicine, Department of Medicine Solna, Center for Molecular Medicine, Karolinska Institute – SE-171 77 – Stockholm – Sweden. Algorithmic Nature Group, Laboratoire de Recherche Scientifique (LABORES) for the Natural and Digital Sciences – 75005 – Paris – France. Emails: [email protected], [email protected], [email protected], [email protected] 2 algorithmic complexity whose invariance theorem guarantees that different representation will have convergent values [15,28,34] (up to a constant that only depends on the choice of the universal programming languages). Of course, the complication is how to achieve the estimation of those universal measures which by virtue of being universal are also semi-computable and their application require, therefore, a much higher degree of methodological care compared to those measures that are simply computable such as those based on traditional statistical approaches, e.g., entropy, or graph-theoretic approaches, e.g., node degree. In this article, we apply the results on computable labeling and algorithmic randomness introduced in [14,29,52]. In particular, we extend these ones for string-based representations of classical graphs to string-based representations of multiaspect graphs (MAGs), as shown in [1] . MAGs are formal representations of dyadic (or 2-place) relations between two arbitrary n-ary tuples [45,46] and have shown fruitful representational properties to network modelling [2,3,45,49] and analysis of multidimensional networks [19, 44, 47, 48], such as dynamic networks and multilayer networks. First, we compare the algorithmic complexity and incompressibility of snapshot-dynamic networks and multiplex networks with more general forms of dynamic networks and multilayer networks, respectively. In turn, dynamic networks and multilayer networks are considered as distinct types of multidimensional (or high-order) networks. Secondly, we demonstrate some multidimensional topological properties of incompressible general multidimensional networks. To tackle these problems in the present paper, we apply a theoretical approach by putting forward definitions, lemmas, theorems, and corollaries. In Sections II and III, we present some background results upon which we build the contributions of this paper. In Section IV, we investigate the algorithmic randomness of snapshot-dynamic networks and multiplex networks through the calculation of the worst-case lossless compression of the characteristic string of the network. This way, one can compare these two kinds of networks with more general forms of dynamic networks and multilayer networks, respectively. Further, in Section V, we investigate some multidimensional network topological properties of incompressible multidimen- sional networks. Such topological properties include, more specifically, degree distribution of composite vertices, composite diameter, and composite k-connectivity. As shown in [1], if the randomness deficiency is asymptotically bounded above by a logarithmic term of the network size, these findings follow from the fact that incompressible multidimensional networks inherit the topological properties from their respective isomorphic graphs [45]. Hence, we also demonstrate the presence of a new multidimensional network topological property that may not correspond to underlying structural constraints in real-world networks, e.g., of snapshot-dynamic networks. In particular, we show the presence of transtemporal or crosslayer edges (i.e., edges linking vertices at non-sequential time instants or layers). II. PRELIMINARY DEFINITIONS AND NOTATION A. Multiaspect graphs We directly base our definitions and notation regarding classical graphs on [12, 13, 27]; and regarding generalized graph representation of dyadic relations between n-tuples (i.e. multiaspect graphs) on [45, 46]. Definition II.1. Let G = (A ; E ) be a multiaspect graph
Recommended publications
  • Opinion-Based Centrality in Multiplex Networks: a Convex Optimization Approach
    Opinion-Based Centrality in Multiplex Networks: A Convex Optimization Approach Alexandre Reiffers-Masson and Vincent Labatut Laboratoire Informatique d'Avignon (LIA) EA 4128 Universit´ed'Avignon, France June 12, 2017 Abstract Most people simultaneously belong to several distinct social networks, in which their relations can be different. They have opinions about certain topics, which they share and spread on these networks, and are influenced by the opinions of other persons. In this paper, we build upon this observation to propose a new nodal centrality measure for multiplex networks. Our measure, called Opinion centrality, is based on a stochastic model representing opinion propagation dynamics in such a network. We formulate an optimization problem consisting in maximizing the opinion of the whole network when controlling an external influence able to affect each node individually. We find a mathematical closed form of this problem, and use its solution to derive our centrality measure. According to the opinion centrality, the more a node is worth investing external influence, and the more it is central. We perform an empirical study of the proposed centrality over a toy network, as well as a collection of real-world networks. Our measure is generally negatively correlated with existing multiplex centrality measures, and highlights different types of nodes, accordingly to its definition. Cite as: A. Reiffers-Masson & V. Labatut. Opinion-based Centrality in Multiplex Networks. Network Science, 5(2):213-234, 2017. Doi: 10.1017/nws.2017.7 1 Introduction In our ultra-connected world, many people simultaneously belong to several distinct social networks, in which their relations can be different.
    [Show full text]
  • Modeling Customer Preferences Using Multidimensional Network Analysis in Engineering Design
    Modeling customer preferences using multidimensional network analysis in engineering design Mingxian Wang1, Wei Chen1, Yun Huang2, Noshir S. Contractor2 and Yan Fu3 1 Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208, USA 2 Science of Networks in Communities, Northwestern University, Evanston, IL 60208, USA 3 Global Data Insight and Analytics, Ford Motor Company, Dearborn, MI 48121, USA Abstract Motivated by overcoming the existing utility-based choice modeling approaches, we present a novel conceptual framework of multidimensional network analysis (MNA) for modeling customer preferences in supporting design decisions. In the proposed multidimensional customer–product network (MCPN), customer–product interactions are viewed as a socio-technical system where separate entities of `customers' and `products' are simultaneously modeled as two layers of a network, and multiple types of relations, such as consideration and purchase, product associations, and customer social interactions, are considered. We first introduce a unidimensional network where aggregated customer preferences and product similarities are analyzed to inform designers about the implied product competitions and market segments. We then extend the network to a multidimensional structure where customer social interactions are introduced for evaluating social influence on heterogeneous product preferences. Beyond the traditional descriptive analysis used in network analysis, we employ the exponential random graph model (ERGM) as a unified statistical
    [Show full text]
  • Multilayer Networks
    Journal of Complex Networks (2014) 2, 203–271 doi:10.1093/comnet/cnu016 Advance Access publication on 14 July 2014 Multilayer networks Mikko Kivelä Oxford Centre for Industrial and Applied Mathematics, Mathematical Institute, University of Oxford, Oxford OX2 6GG, UK Alex Arenas Departament d’Enginyeria Informática i Matemátiques, Universitat Rovira I Virgili, 43007 Tarragona, Spain Marc Barthelemy Downloaded from Institut de Physique Théorique, CEA, CNRS-URA 2306, F-91191, Gif-sur-Yvette, France and Centre d’Analyse et de Mathématiques Sociales, EHESS, 190-198 avenue de France, 75244 Paris, France James P. Gleeson MACSI, Department of Mathematics & Statistics, University of Limerick, Limerick, Ireland http://comnet.oxfordjournals.org/ Yamir Moreno Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, Zaragoza 50018, Spain and Department of Theoretical Physics, University of Zaragoza, Zaragoza 50009, Spain and Mason A. Porter† Oxford Centre for Industrial and Applied Mathematics, Mathematical Institute, University of Oxford, by guest on August 21, 2014 Oxford OX2 6GG, UK and CABDyN Complexity Centre, University of Oxford, Oxford OX1 1HP, UK †Corresponding author. Email: [email protected] Edited by: Ernesto Estrada [Received on 16 October 2013; accepted on 23 April 2014] In most natural and engineered systems, a set of entities interact with each other in complicated patterns that can encompass multiple types of relationships, change in time and include other types of complications. Such systems include multiple subsystems and layers of connectivity, and it is important to take such ‘multilayer’ features into account to try to improve our understanding of complex systems. Consequently, it is necessary to generalize ‘traditional’ network theory by developing (and validating) a framework and associated tools to study multilayer systems in a comprehensive fashion.
    [Show full text]
  • Evolving Networks and Social Network Analysis Methods And
    DOI: 10.5772/intechopen.79041 ProvisionalChapter chapter 7 Evolving Networks andand SocialSocial NetworkNetwork AnalysisAnalysis Methods and Techniques Mário Cordeiro, Rui P. Sarmento,Sarmento, PavelPavel BrazdilBrazdil andand João Gama Additional information isis available atat thethe endend ofof thethe chapterchapter http://dx.doi.org/10.5772/intechopen.79041 Abstract Evolving networks by definition are networks that change as a function of time. They are a natural extension of network science since almost all real-world networks evolve over time, either by adding or by removing nodes or links over time: elementary actor-level network measures like network centrality change as a function of time, popularity and influence of individuals grow or fade depending on processes, and events occur in net- works during time intervals. Other problems such as network-level statistics computation, link prediction, community detection, and visualization gain additional research impor- tance when applied to dynamic online social networks (OSNs). Due to their temporal dimension, rapid growth of users, velocity of changes in networks, and amount of data that these OSNs generate, effective and efficient methods and techniques for small static networks are now required to scale and deal with the temporal dimension in case of streaming settings. This chapter reviews the state of the art in selected aspects of evolving social networks presenting open research challenges related to OSNs. The challenges suggest that significant further research is required in evolving social networks, i.e., existent methods, techniques, and algorithms must be rethought and designed toward incremental and dynamic versions that allow the efficient analysis of evolving networks. Keywords: evolving networks, social network analysis 1.
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • Design and Analysis of Graph-Based Codes Using Algebraic Lifts and Decoding Networks Allison Beemer University of Nebraska - Lincoln, [email protected]
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations, Theses, and Student Research Papers Mathematics, Department of in Mathematics 3-2018 Design and Analysis of Graph-based Codes Using Algebraic Lifts and Decoding Networks Allison Beemer University of Nebraska - Lincoln, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/mathstudent Part of the Discrete Mathematics and Combinatorics Commons, and the Other Mathematics Commons Beemer, Allison, "Design and Analysis of Graph-based Codes Using Algebraic Lifts nda Decoding Networks" (2018). Dissertations, Theses, and Student Research Papers in Mathematics. 87. https://digitalcommons.unl.edu/mathstudent/87 This Article is brought to you for free and open access by the Mathematics, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Dissertations, Theses, and Student Research Papers in Mathematics by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. DESIGN AND ANALYSIS OF GRAPH-BASED CODES USING ALGEBRAIC LIFTS AND DECODING NETWORKS by Allison Beemer A DISSERTATION Presented to the Faculty of The Graduate College at the University of Nebraska In Partial Fulfilment of Requirements For the Degree of Doctor of Philosophy Major: Mathematics Under the Supervision of Professor Christine A. Kelley Lincoln, Nebraska March, 2018 DESIGN AND ANALYSIS OF GRAPH-BASED CODES USING ALGEBRAIC LIFTS AND DECODING NETWORKS Allison Beemer, Ph.D. University of Nebraska, 2018 Adviser: Christine A. Kelley Error-correcting codes seek to address the problem of transmitting information effi- ciently and reliably across noisy channels. Among the most competitive codes developed in the last 70 years are low-density parity-check (LDPC) codes, a class of codes whose structure may be represented by sparse bipartite graphs.
    [Show full text]
  • The Key Technology of Computer Network Vulnerability Assessment Based on Neural Network
    Wang J Wireless Com Network (2020) 2020:222 https://doi.org/10.1186/s13638-020-01841-y RESEARCH Open Access The key technology of computer network vulnerability assessment based on neural network Shaoqiang Wang* *Correspondence: [email protected] Abstract School of Computer With the wide application of computer network, network security has attracted more Science and Technology, Changchun University, and more attention. The main reason why all kinds of attacks on the network can pose Changchun 130022, Jilin, a great threat to the network security is the vulnerability of the computer network China system itself. Introducing neural network technology into computer network vulner- ability assessment can give full play to the advantages of neural network in network vulnerability assessment. The purpose of this article is by organizing feature map neural network, and the combination of multilayer feedforward neural network, the training samples using SOM neural network clustering, the result of clustering are added to the original training samples and set a certain weight, based on the weighted iterative update ceaselessly, in order to improve the convergence speed of BP neural network. On the BP neural network, algorithm for LM algorithm was improved, the large matrix inversion in the LM algorithm using the parallel algorithm method is improved for solving system of linear equations, and use of computer network vulnerability assess- ment as the computer simulation and analysis on the actual example designs a set of computer network vulnerability assessment scheme, fnally the vulnerability is lower than 0.75, which is benefcial to research on related theory and application to provide the reference and help.
    [Show full text]
  • Multidimensional Analysis of Complex Networks
    Alma Mater Studiorum Universita’ di Bologna Computer Science Department Ph.D. Thesis Multidimensional analysis of complex networks Possamai Lino Ph.D. Supervisors: Prof. Massimo Marchiori Prof. Alessandro Sperduti Ph.D. Coordinator: Prof. Maurizio Gabbrielli DOTTORATO DI RICERCA IN INFORMATICA - Ciclo XXV Settore Concorsuale di afferenza: 01/B1 Settore Scientifico disciplinare: INF/01 Esame Finale Anno 2013 Ai miei genitori. Abstract The analysis of Complex Networks turn out to be a very promising field of research, testified by many research projects and works that span different fields. Until recently, those analysis have been usually focused on deeply characterize a single aspect of the system, therefore a study that considers many informative axes along with a network evolve is lacking. In this Thesis, we propose a new multidimensional analysis that is able to inspect networks in the two most important dimensions of a system, namely space and time. In order to achieve this goal, we studied them singularly and investigated how the variation of the constituting parameters drives changes to the network behavior as a whole. By focusing on space dimension, we were able to characterize spatial alteration in terms of abstraction levels. We propose a novel algorithm that, by applying a fuzziness function, can reconstruct networks under different level of details. We call this analysis telescopic as it recalls the magnification and reduction process of the lens. Through this line of research we have successfully verified that statistical indicators, that are frequently used in many complex networks researches, depends strongly on the granularity (i.e., the detail level) with which a system is described and on the class of networks considered.
    [Show full text]
  • Research Article Multilayer Network-Based Production Flow Analysis
    Hindawi Complexity Volume 2018, Article ID 6203754, 15 pages https://doi.org/10.1155/2018/6203754 Research Article Multilayer Network-Based Production Flow Analysis Tamás Ruppert, Gergely Honti, and János Abonyi MTA-PE “Lendület” Complex Systems Monitoring Research Group, Department of Process Engineering, University of Pannonia, Veszprém, Hungary Correspondence should be addressed to János Abonyi; [email protected] Received 24 February 2018; Revised 18 May 2018; Accepted 27 May 2018; Published 16 July 2018 Academic Editor: Dimitris Mourtzis Copyright © 2018 Tamás Ruppert et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. A multilayer network model for the exploratory analysis of production technologies is proposed. To represent the relationship between products, parts, machines, resources, operators, and skills, standardized production and product-relevant data are transformed into a set of bi- and multipartite networks. This representation is beneficial in production flow analysis (PFA) that is used to identify improvement opportunities by grouping similar groups of products, components, and machines. It is demonstrated that the goal-oriented mapping and modularity-based clustering of multilayer networks can serve as a readily applicable and interpretable decision support tool for PFA, and the analysis of the degrees and correlations of a node can identify critically important skills and resources. The applicability of the proposed methodology is demonstrated by a well-documented benchmark problem of a wire-harness production process. The results confirm that the proposed multilayer network can support the standardized integration of production-relevant data and exploratory analysis of strongly interconnected production systems.
    [Show full text]
  • Frequent Itemset Mining and Multi-Layer Network-Based Analysis of RDF Databases
    mathematics Article Frequent Itemset Mining and Multi-Layer Network-Based Analysis of RDF Databases Gergely Honti † and János Abonyi *,† MTA-PE Complex Systems Monitoring Research Group, University of Pannonia, 8200 Veszprem, Hungary; [email protected] * Correspondence: [email protected] † These authors contributed equally to this work. Abstract: Triplestores or resource description framework (RDF) stores are purpose-built databases used to organise, store and share data with context. Knowledge extraction from a large amount of interconnected data requires effective tools and methods to address the complexity and the underlying structure of semantic information. We propose a method that generates an interpretable multilayered network from an RDF database. The method utilises frequent itemset mining (FIM) of the subjects, predicates and the objects of the RDF data, and automatically extracts informative subsets of the database for the analysis. The results are used to form layers in an analysable multidimensional network. The methodology enables a consistent, transparent, multi-aspect-oriented knowledge extraction from the linked dataset. To demonstrate the usability and effectiveness of the methodology, we analyse how the science of sustainability and climate change are structured using the Microsoft Academic Knowledge Graph. In the case study, the FIM forms networks of disciplines to reveal the significant interdisciplinary science communities in sustainability and climate change. The constructed multilayer network then enables an analysis of the significant disciplines Citation: Honti, G.; Abonyi, J. and interdisciplinary scientific areas. To demonstrate the proposed knowledge extraction process, we Frequent Itemset Mining and search for interdisciplinary science communities and then measure and rank their multidisciplinary Multi-Layer Network-Based Analysis effects.
    [Show full text]
  • Do-It-Yourself Local Wireless Networks: a Multidimensional Network Analysis of Mobile Node Social Aspects
    Do-it-yourself Local Wireless Networks: A Multidimensional Network Analysis of Mobile Node Social Aspects Annalisa Socievole and Salvatore Marano DIMES, University of Calabria, Ponte P. Bucci, Rende (CS), Italy Keywords: Opportunistic Networks, Multi-layer Social Network, Mobility Trace Analytics, Wireless Encounters, Facebook. Abstract: The emerging paradigm of Do-it-yourself (DIY) networking is increasingly taking the attention of research community on DTNs, opportunistic networks and social networks since it allows the creation of local human- driven wireless networks outside the public Internet. Even when Internet is available, DIY networks may form an interesting alternative option for communication encouraging face-to-face interactions and more ambitious objectives such as e-participation and e-democracy. The aim of this paper is to analyze a set of mobility traces describing both local wireless interactions and online friendships in different networking environments in order to explore a fundamental aspect of these social-driven networks: node centrality. Since node centrality plays an important role in message forwarding, we propose a multi-layer network approach to the analysis of online and offline node centrality in DIY networks. Analyzing egocentric and sociocentric node centrality on the social network detected through wireless encounters and on the corresponding Facebook social network for 6 different real-world traces, we show that online and offline degree centralities are significantly correlated on most datasets. On the contrary, betweenness, closeness and eigenvector centralities show medium-low correlation values. 1 INTRODUCTION for example, an alternative option for communication is necessary. Do-it-yourself (DIY) networks (Antoniadis et al., DIY networks are intrinsically social-based due 2014) have been recently proposed new generation to human mobility and this feature is well suitable networks where a multitude of human-driven mo- for exchanging information in an ad hoc manner.
    [Show full text]
  • Interactive Visualization of a News Clips Network a Journalistic Research and Knowledge Discovery Tool
    Interactive Visualization of a News Clips Network A Journalistic Research and Knowledge Discovery Tool Jose´ Devezas and Alvaro´ Figueira CRACS/INESC TEC, Faculdade de Ciencias,ˆ Universidade do Porto Rua do Campo Alegre, 1021/1055, 4169-007 Porto, Portugal Keywords: Visualization, Networks, Communities, Interactive, Named Entities, News Clips. Abstract: Interactive visualization systems are powerful tools in the task of exploring and understanding data. We describe two implementations of this approach, where a multidimensional network of news clips is depicted by taking advantage of its community structure. The first implementation is a multiresolution map of news clips that uses topic detection both at the clip level and at the community level, in order to assign labels to the nodes in each resolution. The second implementation is a traditional force-directed network visualization with several additional interactive aspects that provide a rich user experience for knowledge discovery. We describe a common use case for the visualization systems as a journalistic research and knowledge discovery tool. Both systems illustrate the links between news clips, induced by the co-occurrence of named entities, as well as several metadata fields based on the information contained within each node. 1 INTRODUCTION Our goal was to create an interface for the user to explore the already available information in a user- Interactively exploring data through visualization en- friendly and insightful manner. We largely took ad- ables the users to improve their understanding of the vantage of the community structure of the news clips provided information. They gain knowledge by “con- network — identified by the Breadcrumbs system us- necting the dots”, establishing mental relationships ing methodologies such as the Louvain method (Blon- between the individual pieces of data in an intuitive del et al., 2008) or Tang’s multidimensional integra- manner.
    [Show full text]