ARTICLES PUBLISHED ONLINE: 10 AUGUST 2015 | DOI: 10.1038/NPHYS3422

Spectrum of controlling and observing complex networks

Gang Yan1†, Georgios Tsekenis1†, Baruch Barzel2, Jean-Jacques Slotine3,4, Yang-Yu Liu5,6 and Albert-László Barabási1,6,7,8*

Recent studies have made important advances in identifying sensor or driver nodes, through which we can observe or control a complex system. But the observational uncertainty induced by measurement noise and the energy required for control continue to be significant challenges in practical applications. Here we show that the variability of control energy and observational uncertainty for di￿erent directions of the state space depend strongly on the number of driver nodes. In particular, we find that if all nodes are directly driven, control is energetically feasible, as the maximum energy increases sublinearly with the system size. If, however, we aim to control a system through a single node, control in some directions is energetically prohibitive, increasing exponentially with the system size. For the cases in between, the maximum energy decays exponentially when the number of driver nodes increases. We validate our findings in several model and real networks, arriving at a series of fundamental laws to describe the control energy that together deepen our understanding of complex systems.

any natural and man-made systems can be represented as concentration of a in a metabolic network35,the networks1–3, where nodes are the system’s components and geometric state of a chromosome in a chromosomal interaction links describe the interactions between them. Thanks to network14, or the belief of an individual in opinion dynamics29,36. M T these interactions, perturbations of one node can alter the states The vector u(t) u (t),u (t),...,u (t) represents the external =[ 1 2 ND ] of the other nodes4–6. This property has been exploited to control control inputs, and B is the input matrix, with B 1 if control ij = a network—that is, to move it from an initial state to a desired input uj(t) is imposed on node i. The adjacency matrix A captures final state7–9—by manipulating the state variables of only a subset the interactions between the nodes, including the possibility of self- 10,11 10–26 of its nodes . Such control processes play an important role in loops Aii representing the self-regulation of node i. the regulation of protein expression27, the coordination of moving robots28, and the inhibition of undesirable social contagions29.Atthe Control energy same time the interdependence between nodes means that the states The system (1) can be driven from an initial state xo to any desired of a small number of sensor nodes contain sucient information final state x within the time t 0, ⌧ using an infinite number d 2 [ ] about the rest of the network, so that we can reconstruct the system’s of possible control inputs u(t). The optimal input vector aims to ⌧ full internal state by accessing only a few outputs30. This can be minimize the control energy7 u(t) 2dt, which captures the 0 k k utilized for biomarker design in cellular networks, or to monitor in energy of electronic and mechanical systems or the amount of eort R real time the state and functionality of infrastructural31 and social– required to control biological and social systems. If at t 0the 32 33 = ecological systems for early warning of failures or disasters . system is in state xo 0, the minimum energy required to move the = 7,16–18 Although recent advances in driver and sensor node system to point xd in the state space can be shown to be identification constitute unavoidable steps towards controlling and T 1 observing real networks, in practice we continue to face significant (⌧) x G (⌧)x (2) E = d c d challenges: the control of a large network may require a vast amount ⌧ T of energy16–18, and measurement noise34 causes uncertainties in where G (⌧) eAt BBTeA t dt is the symmetric controllability c = 0 the observation process. To quantify these issues we formalize the Gramian. When the system is controllable all eigenvalues of G (⌧) R c dynamics of a controlled network with N nodes and ND external are positive. Equation (2) indicates that for a network A and an input control inputs as7–10 matrix B the control energy (⌧) also depends on the desired state E xd. Consequently, driving a network to various directions in the state x(t) Ax(t) Bu(t) (1) space requires dierent amounts of energy. For example, to move ˙ = + the weighted network of Fig. 1a to the three dierent final states xd where the vector x(t) x (t), x (t), ... , x (t) T describes the with x 1, we inject the optimal signals u(t) shown in Fig. 1b =[ 1 2 N ] k dk = states of the N nodes at time t and xi(t) can represent the onto node 1, steering the system along the trajectories shown in

1Center for Research and Department of Physics, Northeastern University, Boston, Massachusetts 02115, USA. 2Department of Mathematics, Bar-Ilan University, Ramat-Gan 52900, Israel. 3Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. 4Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. 5Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA. 6Center for Cancer , Dana Farber Cancer Institute, Boston, Massachusetts 02115, USA. 7Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA. 8Center for , Central European University, H-1051 Budapest, Hungary. †These authors contributed equally to this work. *e-mail: [email protected]

NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics 1

© 2015 Macmillan Publishers Limited. All rights reserved ARTICLES NATURE PHYSICS DOI: 10.1038/NPHYS3422

a Controlling e Observing u(t) y(t) −3.2 1 −3.2 1

1.3 1.0 1.3 1.0

0.7 0.7 −2.7 2 3 −2.2 −2.7 2 3 −2.2

x˙(t) = Ax(t) + Bu(t)˙x(t) = Ax(t) y(t) = Cx(t) + w(t)

bf150 0.25 100 0.00

50 )

) t

t (

( u 0 y −0.25 −50

−100 −0.50 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0123 t t

cg

x 8 o 1.0 xd x 6 o

4 3 3

x 0.5 x 2

0 0.0 0.2 6 4 15 0.5 2 5 0.1 −5 0.0 x 0 x x2 x 2 −2 −15 1 0.0 −0.5 1

dh

1 4 0.25 2 0 2 0.00 1 −1 −0.25 2 0 0 0.5 0 −2 −1 0.0 x∼ ≡ ^x − x −0.5 −2 o o −2 −4 1234 (log-scaled) log( ) Control energy Observational uncertainty

Figure 1 | Controlling and observing a network. a, The control of a three-node weighted network with one external signal u(t) that is injected to the red driver node. Hence the input matrix is B 1,0,0 T. The nodes have negative self-loops that make all eigenvalues of the adjacency matrix A negative. =[ ] b, Optimal control signals that minimize the energies required to move the network from the initial state x 0,0,0 T to three di￿erent desired states x o =[ ] d with x 1 in the given time interval t 0,3 . c, Trajectories of the network state x(t) driven respectively by the control signals in b. d, Control energy k dk= 2[ ] surface, showing the amount of energy required to move the network by one unit distance (that is, x 1) in di￿erent directions. The surface is an k dk= ellipsoid spanned by the eigen-energies for the controllability Gramian’s three eigen-directions (arrows). The squares correspond to the three final points used in b and c. e, Observing the network with one output y(t). Node 1 is selected as the sensor (green), thus the output matrix is C 1,0,0 . The =[ ] measurement noise w(t) is assumed as Gaussian white noise with zero mean and variance one. f, A typical output y(t) that is used to approximate the initial state x . g, A typical trajectory of the system state x(t). h, Estimation errorx ˜ xˆ x , wherex ˆ is the maximum-likelihood estimator of the initial o = o o o state. Starting from the same initial state we ran the system 5,000 times independently, each dot representing the estimation error of one run. The uncertainty ellipsoid (black) corresponds to the standard deviation ofx ˜ in any direction.

Fig. 1c. The corresponding minimum energies are shown in Fig. 1d. decays quickly to a nonzero stationary value when the control The control energy surface for all normalized desired states is an time ⌧ increases16. Henceforth we focus on the control energy ellipsoid, implying that the required energy varies drastically as we (⌧ ) and the controllability Gramian G G (⌧ ). E ⌘E !1 ⌘ c !1 move the system in dierent directions. Given a network A and an input matrix B, the controllability As real systems normally function near a stable state—that Gramian G is unique, embodying all properties related to the is, all eigenvalues of A are negative37—the control energy (⌧) control of the system. To uncover the directions of the state space E

2 NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics

© 2015 Macmillan Publishers Limited. All rights reserved NATURE PHYSICS DOI: 10.1038/NPHYS3422 ARTICLES

a 100 bc100 0.15 Erdős−Rényi

10−2 10−2 0.10 ( ) ( ) ( ) p p SF_assortative p 10−4 10−4 (r = 0.25) 0.05

Model networks SF_disassortative Scale-free (r = −0.24) Prediction Prediction 0.00 10−6 10−6 100 101 102 100 101 102 0 5 10 15 20 25 30

de100 100 f0.5 Power grid (US) Exponential fit 0.4 10−2 −2 10 0.3

−4 ( ) ( ) 10 ( ) p p p 0.2 10−4 10−6 0.1 Open_flights AS_CAIDA Infrastructural networks Prediction Prediction 0.0 10−6 10−8 100 101 102 103 100 101 102 103 104 0 5 10 15 20

gh10−1 100 i0.8 Inter-companies Exponential fit −1 10 0.6 10−3 10−2 0.4 ( ) ( ) ( ) p p p 10−3 10−5

Social networks 0.2 −4 Social net (Israel) 10 Online forum Prediction Prediction 0.0 10−7 10−5 101 102 103 104 100 101 102 103 02468

jk100 100 l Brain coactivation 0.3 Exponential fit 10−1 10−2 10−2 0.2 ( ) ( ) ( ) p p p 10−3 10−4 0.1 Hetero. net Biological networks 10−4 PPI (human) (human) Prediction Prediction 0.0 10−5 10−6 100 101 102 100 101 102 103 051015

Figure 2 | Controlling a network through all nodes (N N). Distribution p( ) of eigen-energies required to control several model and real systems. D = E a, Scale-free model network without degree–degree correlation. b, Scale-free model networks with positive (r 0.25) or negative (r 0.24) = = degree–degree correlation. c, Erdős–Rényi model network. d, Airline transportation network. e, Internet AS-level network. f, US power grid network. g, Israeli . h, User-interaction network of an online forum. i, Interlocking network of Norwegian companies. j, Human protein–protein interaction network. k, Human heterogeneous network. l, Functional coactivation network of the human brain. The straight lines show prediction (4) and the error bars represent standard deviations. For model networks the edges’ weights A are uniformly drawn from [0,1]. The self-loops A A , ij ii = j ij where 0.25, representing a small perturbation to diagonal entries, to ensure that the network is stable. The data sources and basic characteristics of = P these networks are discussed in Supplementary Section VI A. requiring dierent energies, we explore the eigen-space of G. Denote undirected networks all eigenvalues of A are negative, thus we by the eigen-energies (that is, the minimum energy required denote the eigenvalues by so that the absolute eigenvalues are Ei i to drive the network to the eigen-directions of G). According to i >0 for all i. We sort the absolute eigenvalues in ascending order equation (2) 1/µ ,withµ corresponding to the eigenvalues of 0< < < < , finding that (Supplementary Section I) Ei = i i 1 2 ··· N G. Generally, the energy surface for a network with N nodes is a super-ellipsoid spanned by the N eigen-energies of G. To determine G V (V TBBTV ) C V T (3) the distribution of these eigen-energies we decompose the adjacency = [ ] matrix as A V 3V T, where V represents the eigenvectors of A where denotes Hadamard product, defined as (X Y ) X Y , = ij = ij ij and 3 diag , ,..., are the eigenvalues. For stable and C is a matrix with entries C 1/( ). For a given = { 1 2 N } ij = i + j

NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics 3

© 2015 Macmillan Publishers Limited. All rights reserved ARTICLES NATURE PHYSICS DOI: 10.1038/NPHYS3422

a 1.0 bc1.0 1.0

0.8 0.8 0.8 ( ) ( ) ( ) p p p 0.6 0.6 0.6 ( ) ( ) ( ) > > > p 0.4 p 0.4 p 0.4

Model networks 0.2 0.2 SF: r = +0.34 0.2 Scale-free SF: r = −0.36 Erdős−Rényi Prediction Prediction Prediction 0.0 0.0 0.0 100 1070 10140 10210 100 1070 10140 10210 100 1070 10140 10210

de1.0 1.0 f1.0

0.8 0.8 0.8 ( ) ( ) ( ) p p p 0.6 0.6 0.6 ( ) ( ) ( ) > > > p 0.4 p 0.4 p 0.4

0.2 0.2 0.2 AS_1997 IEEEtest_grid

Infrastructural networks Airlines_sampled Prediction Prediction Prediction 0.0 0.0 0.0 100 1070 10140 10210 100 1060 10120 10180 100 1070 10140 10210

gh1.0 1.0 i1.0

0.8 0.8 0.8 ( ) ( ) ( ) p p p 0.6 0.6 0.6 ( ) ( ) ( ) > > > p 0.4 p 0.4 p 0.4 Social networks 0.2 0.2 0.2 Social_pattern Email_comm Phone_call Prediction Prediction Prediction 0.0 0.0 0.0 100 1050 10100 10150 100 1080 10160 10240 100 1090 10180 10270

jk1.0 1.0 l1.0

0.8 0.8 0.8 ( ) ( ) ( ) p p p 0.6 0.6 0.6 ( ) ( ) ( ) > > > p 0.4 p 0.4 p 0.4

Biological networks 0.2 0.2 0.2 Gene_coexpress Mutualism_net Cat_cortex Prediction Prediction Prediction 0.0 0.0 0.0 100 1050 10100 10150 100 1040 1080 10120 100 1050 10100 10150

Figure 3 | Controlling a network through a single node (N 1). Panels show the complementary cumulative distribution p>( ) of eigen-energies required D = E to control several model and real systems. a, Scale-free model network without degree–degree correlation. b, Scale-free model networks with positive (r 0.34) or negative (r 0.36) degree–degree correlation. c, Erdős–Rényi model network. d, Sampled airline network. e, Internet AS-level network in = = 1997. f, IEEE power grid test network. g, Human-contact network of the ACM Hypertext 2009 conference. h, Email interaction network. i, Phone call network between di￿erent countries. j, Connected component of the human -coexpression network. k, Mutualism ecological network in Mauritius. l, Inter-region network of cat cortex. The insets are log–log plots of probability distributions p( ) with logarithmic binning3. The straight lines show E prediction (5) and the error bars represent standard deviations. The data sources and basic characteristics of these networks are discussed in Supplementary Section VI A. network, (3) captures the impact of the input matrix B on the control eigenvectors. Thus 2 and p( ) (1/2)p(), meaning the Ei = i E = properties of the system, allowing us to analyse the distribution of distribution of eigen-energies is proportional to the distribution eigen-energies for dierent numbers of driver nodes and determine of the network’s absolute eigenvalues. We add self-loops as N the required energy for each direction. Aii ( j 1 Aij), where > 0 is a small perturbation to ensure = + = that all eigenvalues of A are negative. This scheme has been widely Controlling a system through all nodes used in previousP studies on dynamical processes taking place 29 38 If we can control all nodes (that is, ND N), B becomes a unit on networks, such as opinion dynamics , synchronization and = T 16 1–3 diagonal matrix. In this case G V diag 1/2 V and the eigen- control . For networks with degree distribution p(k) k the = { i} ⇠ directions of the controlled system are the same as the network’s distribution of the absolute eigenvalues of A also obeys a power

4 NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics

© 2015 Macmillan Publishers Limited. All rights reserved NATURE PHYSICS DOI: 10.1038/NPHYS3422 ARTICLES

a 0.3 bc105 5 ) ND/N = 0.5, log( max) )

D N = 2 max 0.2 N peak 104 4 log( log( log( ND/N = 0.5 [log( )] 0.1 p 103 3 0.0 0 2 4 6 8 10 log( ) log( N ) 102 2 D log( )

0.2 1 ND/N = 0.25, Npeak = 4 10 1 ) ) D max N 0.1 100 0 [log( )] log( log( log( 0.0 0.1 0.2 0.3 p p[log( )] 0.0 0 2 4 6 8 10 log( ) d 90 log( max) 0.10 log( N ) N /N = 0.2, N = 5 60 D D peak Prediction ) ) D N max 30 0.05 log( [log( )] log( log( p 0 0 10 20 30 40 50 0.00 0 2 4 6 8 10 N/ND log( )

efg 60 North_Euro_grid 60 Inter_companies 90 Brain coactivation Prediction Prediction Prediction ) ) ) 60

max 40 max 40 max

log( log( 20 log( 20 log( 30

0 0 0 0 10 20 30 40 0 10 20 30 40 0 10 20 30 40 50

N/ND N/ND N/ND hij90 Metabolic (C. elegans) AS_December 1998 25 Online forum 60 ) 70 Prediction ) Prediction ) Prediction

max max max 40 50 15 log( log( 30 log( log( 20 10 5 0 0 10 20 30 40 50 0 5 10 15 20 0 10 20 30

N/ND N/ND N/ND

Figure 4 | Controlling a network through a finite fraction of its nodes. a, Multi-peak distributions p[log( )] for 1

39,40 law p() (see Supplementary Section II A). Consequently, network (Fig. 2g), the user-interaction network of an online ⇠ forum (Fig. 2h), the human protein–protein interaction network p( ) (4) (Fig. 2j) and the human heterogeneous network (Fig. 2l), in line E ⇠E with the prediction (4). In contrast, for several networks with indicating that the system can be easily driven in most directions bounded degree distribution, such as the ErdÆs–Rényi random of the state space, requiring a small . A few directions require network (Fig. 2c), the US power grid network (Fig. 2f), the E considerable energy and the most dicult direction needs41 interlocking network of Norwegian companies (Fig. 2i) and the 1/( 1) N . The fact that is sublinear in N for > 2 functional coactivation network of the human brain (Fig. 2l), Emax ⇠ Emax indicates that, when N N, the energy density /N remains p( ) is also bounded, as predicted by in (4). Such D = E D E !1 bounded. In Fig. 2 we test the prediction (4) for several model, networks require even less energy to control their progress in infrastructural, social and biological networks. We find that p( ) their most dicult direction. Taken together, we find that for E follows a power law for uncorrelated or correlated scale-free N N the distribution of eigen-energies is uniquely determined D = model networks (Fig. 2a,b), the airline transportation network by , and we lack significant energetic barriers (Fig. 2d), the Internet AS-level network (Fig. 2e), an Israeli social for control.

NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics 5

© 2015 Macmillan Publishers Limited. All rights reserved ARTICLES NATURE PHYSICS DOI: 10.1038/NPHYS3422

Controlling a system through a single node Table 1 | Controlling complex networks with di￿erent numbers If there is not degeneracy in A’s eigenvalues we can control an of driver nodes. undirected network by driving only a single node11,42. In this case V TBBTV V V (1/N), where h is the index of the ={ ih jh} ⇠ O Number of Distribution of eigen- Maximum control chosen driver node. Thus V TBBTV can be viewed as a small driver nodes energies energy 1/( 1) perturbation to the matrix C in (3). The statistical behaviour of ND Np( ) max N = E ⇠E E ⇠ N/ND the eigenvalues of G is mainly determined by the eigenvalues of C, 1( ) E p( 0)d 0. Based on (5) we w(t), which we assume to be a Gaussian white noise with E = E E E obtain p>( ) (ln max ln ), decreasing linearly with ln .We zero mean and variance one. We aim to estimatex ˆ o of the E ⇠ E E R E ⌧ 2 test our prediction on several network models (Fig. 3a–c) and real initial state xo while minimizing the dierence y(t) yˆ(t) dt 0 k k networks (Fig. 3d–l), finding that the corresponding eigen-energies between the output y(t) that is actually observed and the output At R span over a hundred orders of magnitude and this exceptional range yˆ(t) Ce xˆ o that would be observed in the absence of noise. With = 45 of variations is reasonably well approximated by (5) for both p>( ) the maximum-likelihood approximation , the expectation xˆ o xo E 45 T 1 h i= and p( ). Taken together, if we attempt to control a network from and the covariance matrix xx Go (⌧), where x xˆ o xo is the E h˜ ˜ ⌧i=ATt T At ˜ ⌘ a single node (N 1), the required energy varies enormously for estimation error and Go(⌧) e C Ce dt is the observability D = = 0 dierent directions, almost independently of the network structure, Gramian. Therefore, the variance 2 of the approximation in R making some directions prohibitively expensive energetically. direction x is ˜ 2 T 1 Controlling a system through a finite fraction of its nodes (⌧) x Go (⌧)x (6) (1 ) =˜ ˜ When p( ) , the distribution p( ) e Eˆ , where ln . E ⇠ E Eˆ ⇠ Eˆ ⌘ E Thus, if ND N, p( ) is an exponential (one-peak) distribution indicating that the estimation uncertainty varies with the direction = Eˆ 1 for > 2 in (4); if N 1, as p( ) in (5), p( ) is a uniform of the state space. To illustrate this, consider the network in Fig. 1e D = E ⇠ E Eˆ distribution. To understand the transition from (4) for N N to that moves along the trajectory of Fig. 1g, while we measure the D = (5) for N 1, we investigate the distribution p( ) when 1

6 NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics

© 2015 Macmillan Publishers Limited. All rights reserved NATURE PHYSICS DOI: 10.1038/NPHYS3422 ARTICLES degree correlations47 and community structure48 (Supplementary References Table 1). To assess the impact of these topological characteristics 1. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. we perform the degree-preserved randomization49 on each network, Mod. Phys. 74, 47–97 (2002). eliminating local clustering, degree correlations and modularity. 2. Cohen, R. & Havlin, S. Complex Networks: Structure, Robustness and Function (Cambridge Univ. Press, 2010). We find that the distribution of eigen-energies required to drive 3. Newman, M. E. J. Networks: An Introduction (Oxford Univ. Press, 2010). each randomized network follows the predictions (4) and (5) 4. Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D. Complex (Supplementary Figs 6 and 7), indicating that degree distribution networks: Structure and dynamics. Phys. Rep. 424, 175–308 (2006). is the main factor determining p( ). When the number of 5. Barrat, A., Barthelemy, M. & Vespignani, A. Dynamical Processes on Complex E driver nodes increases, the maximum control energy for the Networks (Cambridge Univ. Press, 2008). randomized networks decreases exponentially, as predicted earlier 6. Barzel, B. & Barabási, A.-L. Universality in . Nature Phys. 9, 673–681 (2013). (Supplementary Fig. 8). We also validate the predictions (4) and (5) 7. Rugh, W. J. Linear System Theory (Prentice-Hall, 1996). on model networks with positive or negative degree correlations 8. Sontag, E. D. Mathematical : Deterministic Finite Dimensional (Figs 1b and 2b). All the tests indicate that the strength of local Systems (Springer, 1998). clustering, degree correlations or community structure have only 9. Slotine, J.-J. & Li, W. Applied Nonlinear Control (Prentice-Hall, 1991). minor influence on the behaviour of control energy. Consequently, 10. Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Controllability of complex networks. our calculations for uncorrelated networks capture the correct Nature 473, 167–173 (2011). 11. Yuan, Z., Zhao, C., Di, Z., Wang, W.-X. & Lai, Y.-C. Exact controllability of fundamental dependence of control energy for real networks. complex networks. Nature Commun. 4, 2447 (2013). Many real networks have dead ends (that is, nodes with one 12. Sorrentino, F., di Bernardo, M., Garofalo, F. & Chen, G. Controllability of degree), which can undermine the stability of complex systems50. complex networks via pinning. Phys. Rev. E 75, 046103 (2007). To test the impact of dead ends on control energy we explored 13. Yu, W., Chen, G. & Lü, J. On pinning synchronization of complex dynamical several real networks that contain a considerable number of one- networks. Automatica 45, 429–435 (2009). degree nodes (see Supplementary Table 1). As shown in Figs 2–4 14. Rajapakse, I., Groudine, M. & Mesbahi, M. Dynamics and control of state-dependent networks for probing genomic organization. Proc. Natl Acad. and Supplementary Figs 6–8, the predictions are robust against such Sci. USA 108, 17257–17262 (2011). dead ends. 15. Nepusz, T. & Vicsek, T. Controlling edge dynamics in complex networks. Nature Phys. 8, 568–573 (2012). Discussion 16. Yan, G., Ren, J., Lai, Y.-C., Lai, C.-H. & Li, B. Controlling complex networks: The energy required for control is a significant issue for practical How much energy is needed? Phys. Rev. Lett. 108, 218703 (2012). control of complex systems. By exploring the eigen-space of 17. Sun, J. & Motter, A. E. Controllability transition and nonlocality in network control. Phys. Rev. Lett. 110, 208701 (2013). controlled systems we found that if all nodes of a system are directly 18. Pasqualetti, F., Zampieri, S. & Bullo, F. Controllability metrics, limitations and driven, the eigen-energies can be heterogeneous or homogeneous, algorithms for complex networks. IEEE Trans. Control Netw. Syst. 1, depending on the structure of underlying networks. Yet, if we wish 40–52 (2014). to control a system through a single node, the eigen-energies are 19. Tang, Y., Gao, H., Zou, W. & Kurths, J. Identifying controlling nodes in enormously heterogeneous, almost independently of the network neuronal networks in dierent scales. PLoS ONE 7, e41375 (2012). 20. Jia, T. et al. Emergence of bimodality in controlling complex networks. Nature structure. Finally, if a finite fraction of nodes are driven, the Commun. 4, 2002 (2013). maximum control energy decays exponentially with the increasing 21. Ruths, J. & Ruths, D. Control profiles of complex networks. Science 343, number of driver nodes. Taken together, our results indicate that 1373–1376 (2014). even if controllable, most systems still have directions which 22. Menichetti, G., Dall’Asta, L. & Bianconi, G. Network controllability is are energetically inaccessible, suggesting a natural mechanism to determined by the density of low in-degree and out-degree nodes. Phys. Rev. avoid undesirable states. Indeed, many complex systems, such as Lett. 113, 078701 (2014). transcriptional networks for gene expression51 and sensorimotor 23. Summers, T. H., Cortesi, F. L. & Lygeros, J. On submodularity and 52 controllability in complex dynamical networks. Preprint at systems for motion control , need only to function in a low- http://arXiv.org/abs/1404.7665v2 (2014). dimensional subspace. Owing to the duality between controllability 24. Tzoumas, V., Rahimian, M. A., Pappas, G. J. & Jadbabaie, A. Minimal actuator and observability, our results also imply that, if we monitor only a placement with optimal control constraints. Preprint at small fraction of nodes, the observation can be extremely unreliable http://arXiv.org/abs/1503.04693 (2015). in certain directions of the phase space. 25. Cornelius, S. P., Kath, W. L. & Motter, A. E. Realistic control of network dynamics. Nature Commun. 4, 1942 (2013). It is worth noting that linear dynamics captures the behaviour 26. Whalen, A. J., Brennan, S. N., Sauer, T. D. & Schi, S. J. Observability and of nonlinear systems in the vicinity of their equilibria. The controllability of nonlinear networks: The role of symmetry. Phys. Rev. X 5, formalism (1) has been widely used to model diverse networked 011005 (2015). systems14,18,24,29,36 (see also Supplementary Section VII A, B), 27. Menolascina, F. et al. In-vivo real-time control of protein expression from allowing us to reveal the role of the network topology on the endogenous and synthetic gene networks. PLoS Comput. Biol. 10, fundamental control properties of complex systems10,11,14–24. Indeed, e1003625 (2014). 28. Rahmani, A., Ji, M., Mesbahi, M. & Egerstedt, M. Controllability of multi-agent if the linearized system (1) is controllable, the original nonlinear 53 systems from a graph-theoretic perspective. SIAM J. Control Optim. 48, system is locally controllable . The corresponding control energy 162–186 (2009). is also highly heterogeneous for dierent directions, if we constrain 29. Acemoglu, D., Ozdaglar, A. & ParandehGheibi, A. Spread of (mis)information the system’s trajectory to be local (Supplementary Section VII C). in social networks. Games Econ. Behav. 70, 194–227 (2010). Moreover, if the linearized dynamics of a nonlinear system is 30. Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Observability of complex systems. controllable along a specific trajectory, the original nonlinear system Proc. Natl Acad. Sci. USA 110, 2460–2465 (2013). 53 31. Yang, Y., Wang, J. & Motter, A. E. Network observability transitions. Phys. Rev. is locally controllable along the same trajectory . This implies that Lett. 109, 258701 (2012). our results can be potentially extended to describe control properties 32. Pinto, P. C., Thiran, P. & Vetterli, M. Locating the source of diusion in of nonlinear systems in the vicinity of their stability basin50,54. Yet, in large-scale networks. Phys. Rev. Lett. 109, 068702 (2012). this case, the linearized dynamics becomes time-varying, and the 33. Scheer, M. et al. Anticipating critical transitions. Science 338, required energy for controlling time-varying systems remains an 344–348 (2012). open problem that deserves future attention. 34. Friedman, N. Inferring cellular networks using probabilistic graphical models. Science 303, 799–805 (2004). 35. Almaas, E., Kovács, B., Vicsek, T., Oltvai, Z. N. & Barabási, A.-L. Global Received 17 December 2014; accepted 30 June 2015; organization of metabolic fluxes in the bacterium Escherichia coli. Nature 427, published online 10 August 2015 839–843 (2004).

NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics 7

© 2015 Macmillan Publishers Limited. All rights reserved ARTICLES NATURE PHYSICS DOI: 10.1038/NPHYS3422

36. Castellano, C., Fortunato, S. & Loreto, V. Statistical physics of social dynamics. 52. Todorov, E. & Jordan, M. I. Optimal feedback control as a theory of motor Rev. Mod. Phys. 81, 591–646 (2009). coordination. Nature Neurosci. 5, 1226–1235 (2002). 37. May, R. M. Stability and Complexity in Model Ecosystems (Princeton Univ. 53. Coron, J.-M. Control and Nonlinearity (American Mathematical Society, 2009). Press, 1974). 54. Menck, P. J., Heitzig, J., Marwan, N. & Kurths, J. How basin stability 38. Pecora, L. M. & Carroll, T. L. Master stability functions for synchronized complements the linear-stability paradigm. Nature Phys. 9, 89–92 (2013). coupled systems. Phys. Rev. Lett. 80, 2109–2112 (1998). 39. Chung, F., Lu, L. & Vu, V. Spectra of random graphs with given expected degrees. Proc. Natl Acad. Sci. USA 100, 6313–6318 (2003). Acknowledgements 40. Kim, D. & Kahng, B. Spectral densities of scale-free networks. Chaos 17, We thank E. Guney, C. Song, J. Gao, M. T. Angulo, S. P. Cornelius, B. Coutinho and A. Li 026115 (2007). for discussions. This work was supported by Army Research Laboratories (ARL) 41. Cohen, R., Erez, K., ben Avraham, D. & Havlin, S. Resilience of the Internet to Network Science (NS) Collaborative Technology Alliance (CTA) grant ARL NS-CTA W911NF-09-2-0053; DARPA Social Media in Strategic Communications project under random breakdowns. Phys. Rev. Lett. 85, 4626–4628 (2000). agreement number W911NF-12-C-002; the John Templeton Foundation: Mathematical 42. Cowan, N. J., Chastain, E. J., Vilhena, D. A., Freudenberg, J. S. & and Physical Sciences grant number PFI-777; European Commission grant numbers FP7 Bergstrom, C. T. Nodal dynamics, not degree distributions, determine the 317532 (MULTIPLEX) and 641191 (CIMPLEX). structural controllability of complex networks. PLoS ONE 7, e38398 (2012). 43. Antoulas, A. Approximation of Large-Scale Dynamical Systems (SIAM, 2009). 44. Del Genio, C., Gross, T. & Bassler, K. All scale-free networks are sparse. Phys. Author contributions Rev. Lett. 107, 178701 (2011). All authors designed and performed the research. G.Y. and G.T. carried out the 45. Kailath, T., Sayed, A. & Hassibi, B. Linear Estimation (Prentice-Hall, 2000). numerical calculations. G.Y. did the analytical calculations and analysed the empirical 46. Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. data. G.T., B.B., J.-J.S., Y.-Y.L. and A.-L.B. analysed the results. G.Y. and A.-L.B. were the Nature 393, 440–442 (1998). main writers of the manuscript. G.T., B.B. and Y.-Y.L. edited the manuscript. G.Y. and G.T. contributed equally to this work. 47. Newman, M. E. J. Assortative mixing in networks. Phys. Rev. Lett. 89, 208701 (2002). 48. Girvan, M. & Newman, M. E. J. Community structure in social and biological Additional information networks. Proc. Natl Acad. Sci. USA 99, 7821–7826 (2002). Supplementary information is available in the online version of the paper. Reprints and 49. Xulvi-Brunet, R. & Sokolov, I. M. Reshuing scale-free networks: From permissions information is available online at www.nature.com/reprints. random to assortative. Phys. Rev. E 70, 066102 (2004). Correspondence and requests for materials should be addressed to A.-L.B. 50. Menck, P. J., Heitzig, J., Kurths, J. & Schellnhuber, H. J. How dead ends undermine power grid stability. Nature Commun. 5, 3969 (2014). 51. Müller, F.-J. & Schuppert, A. Few inputs can reprogram biological networks. Competing financial interests Nature 478, E4 (2011). The authors declare no competing financial interests.

8 NATURE PHYSICS | ADVANCE ONLINE PUBLICATION | www.nature.com/naturephysics

© 2015 Macmillan Publishers Limited. All rights reserved