Vega et al. Applied Network Science (2018) 3:25 Applied Network Science https://doi.org/10.1007/s41109-018-0082-3 RESEARCH Open Access Foundations of Temporal Text Networks Davide Vega* and Matteo Magnani *Correspondence:
[email protected] Abstract InfoLab, Department of Information Three fundamental elements to understand human information networks are the Technology, Uppsala University, Uppsala, Sweden individuals (actors) in the network, the information they exchange, that is often observable online as text content (emails, social media posts, etc.), and the time when these exchanges happen. An extremely large amount of research has addressed some of these aspects either in isolation or as combinations of two of them. There are also more and more works studying systems where all three elements are present, but typically using ad hoc models and algorithms that cannot be easily transfered to other contexts. To address this heterogeneity, in this article we present a simple, expressive and extensible model for temporal text networks, that we claim can be used as a common ground across different types of networks and analysis tasks, and we show how simple procedures to produce views of the model allow the direct application of analysis methods already developed in other domains, from traditional data mining to multilayer network mining. Keywords: Network, Text, Time, Model, Temporal text network, Human information network Introduction A large amount of human-generated information is available online in the form of text exchanged between individuals at specific times. Examples include social network sites, online forums and emails. The public accessibility of several of these sources allows us to observe our society at various scales, from focused conversations among small groups of individuals to broad political discussions involving heterogeneous audiences from large geographical areas (Zhou et al.