LETTER Communicated by P. Reed Montague
Synaptic Pruning in Development: A Computational Account
Gal Chechik
Isaac Meilijson Downloaded from http://direct.mit.edu/neco/article-pdf/10/7/1759/813980/089976698300017124.pdf by guest on 02 October 2021 School of Mathematical Sciences, Tel-Aviv University, Tel Aviv 69978, Israel
Eytan Ruppin Schools of Medicine and Mathematical Sciences, Tel-Aviv University, Tel Aviv 69978, Israel
Research with humans and primates shows that the developmental course of the brain involves synaptic overgrowth followed by marked selective pruning. Previous explanations have suggested that this intriguing, seem- ingly wasteful phenomenon is utilized to remove “erroneous” synapses. We prove that this interpretation is wrong if synapses are Hebbian. Under limited metabolic energy resources restricting the amount and strength of synapses, we show that memory performance is maximized if synapses are first overgrown and then pruned following optimal “minimal-value” deletion. This optimal strategy leads to interesting insights concerning childhood amnesia.
1 Introduction
One of the fundamental phenomena in brain development is the reduction in the amount of synapses that occurs between early childhood and pu- berty. In recent years, many studies have investigated the temporal course of changes in synaptic density in primates, revealing the following picture. Beginning at early stages of fetal development, synaptic density rises at a constant rate, until a peak level is attained (at 2–3 years of age in hu- mans). Then, after a relatively short period of stable synaptic density (until the age of 5 in humans), an elimination process begins: synapses are being constantly removed, yielding a marked decrease in synaptic density. This process proceeds until puberty, when synaptic density stabilizes at adult levels, which are maintained until old age. The peak level of synaptic den- sity in childhood is 50% to 100% higher than adult levels, depending on the brain region. The phenomenon of synaptic overgrowth and pruning was found in humans (Huttenlocher, 1979; Huttenlocher, De Courten, Garey, & Van der Loos, 1982; Huttenlocher & De Courten, 1987), as well as in other mammals such as monkeys (Eckenhoff & Rakic, 1991; Bourgeois & Rakic, 1993; Bourgeois, 1993; Rakic, Bourgeois, & Goldman-Rakic, 1994), cats (In- nocenti, 1995) and rats (Takacs & Hamori, 1994). It was observed throughout
Neural Computation 10, 1759–1777 (1998) c 1998 Massachusetts Institute of Technology 1760 Gal Chechik, Isaac Meilijson, and Eytan Ruppin widespread brain regions including cortical areas: visual (Bourgeois & Ra- kic, 1993; Huttenlocher et al., 1982), motor and associative (Huttenlocher, 1979), the cerebellum(Takacs & Hamori, 1994), projection fibers between hemispheres (Innocenti, 1995), and the dentate gyrus (Eckenhoff & Rakic, 1991). The time scale of synaptic elimination was found to vary by cortical area, coarsely following a dorsal to frontal order (Rakic et al., 1994). The
changes in synaptic density are not a result of changes in total brain vol- Downloaded from http://direct.mit.edu/neco/article-pdf/10/7/1759/813980/089976698300017124.pdf by guest on 02 October 2021 ume, but reflect true synaptic elimination (Rakic et al., 1994). In some cases, synaptic elimination was shown to be correlated with experience-dependent activity (Stryker, 1986; Roe, Pallas, Hahm, & Sur, 1990). What advantage could such a seemingly wasteful developmental strat- egy offer? Some researchers have treated the phenomenon as an inevitable result of synaptic maturation lacking any computational significance. Oth- ers have hypothesized that synapses that are strengthened at an early stage might be later revealed as harmful to overall memory function, when ad- ditional memories are stored. Thus, they claim, synaptic elimination may reduce the interference between memories, and yield better overall perfor- mance (Wolff, Laskawi, Spatz, & Missler, 1995). This article shows that in associative memory networks models, these previous explanations do not hold, and it puts forward a different expla- nation. Our proposal is based on the assumption that synapses are a costly resource whose efficient utilization is a major optimization goal guiding brain development. This assumption is motivated by the observation that the changes in synaptic density along brain development are highly cor- related to the temporal course of changes in energy consumption (Roland, 1993) and by the fact that the brain consumes a large fraction of total energy consumption of the resting adult (Roland, 1993). By analyzing the network’s performance under various synaptic constraints such as limited number of synapses or limited total synaptic strength, we show that if synapses are properly pruned, the performance decrease due to synaptic deletion is small compared to the energy saving. Deriving optimal synaptic pruning strate- gies, we show that efficient memory storage in the brain requires a specific learning process characterized by initial synaptic overgrowth, followed by judicious synaptic pruning. The next section describes the models studied and our analytical results, which are verified numerically in section 3. Section 4 discusses the pos- sible benefits of efficient synaptic elimination and its implications for the phenomenon of childhood amnesia.
2 Analytical Results
In order to investigate synaptic elimination, we address the more general question of optimal modification of a Hebbian memory matrix. Given pre- viously learned Hebbian synapses, we apply a function that changes the synaptic values and investigate the effect of such a modification function. Synaptic Pruning in Development 1761
First, we analyze the way memory performance depends on a general synap- tic modification function. Then we derive optimal modification functions under different constraints. Finally, we calculate the dependency of perfor- mance on the deletion levels.
2.1 The Models. We investigate synaptic modification in two Hebbian
models. The first model is a variant of the canonical Hopfield model. M Downloaded from http://direct.mit.edu/neco/article-pdf/10/7/1759/813980/089976698300017124.pdf by guest on 02 October 2021 memories are stored in an N-neuron network, forming approximate fixed points of the network dynamics. The initial synaptic efficacy Wij between the jth (presynaptic) neuron and the ith (postsynaptic) neuron is
1 M W , 1 i j N W 0, (2.1) ij = √ i j 6= ; ii = M 1 X= M where 1 are 1 binary patterns representing the stored memories. { } = The actual synaptic efficacy Jij is
J g(W ) 1 i j N J 0, (2.2) ij = ij 6= ; ii = where g is a general modification function over the Hebbian weights, such that g(z) has finite moment if z is normally distributed. The updating rule t for the state Xi of the ith neuron at time t is
N t 1 t X + (f ), f J X , (2.3) i = i i = ij j j 1 X= where f is the neuron’s input field, and is the function (f ) sign( f ). The i = overlap m (or similarity) between the network’s activity pattern X and the 1 N memory is m N j 1 j Xj. = = The second model is a variant of the low-activity, biologically motivated P model described by Tsodyks and Feigel’man (1988), in which synaptic effi- cacies are described by
1 M J g(W ) g ( p)( p) , (2.4) ij = ij = √ i j p(1 p) M 1 ! X= where are 0, 1 memory patterns with coding level p (fraction of firing { } neurons), and g is a synaptic modification function. The updating rule is similar to equation 2.3,
N t 1 t X + (f ), f J X T , (2.5) i = i i = ij j j 1 X= 1762 Gal Chechik, Isaac Meilijson, and Eytan Ruppin
1 sign( f ) where now denotes the step function (f ) + , and T is the neuronal = 2 threshold, set to its optimal value (see equation A.9). The overlap m in this model is defined by
1 N m ( p)X . = Np( p) j j 1 j 1 = X Downloaded from http://direct.mit.edu/neco/article-pdf/10/7/1759/813980/089976698300017124.pdf by guest on 02 October 2021 2.2 Pruning Does Not Improve Performance. To evaluate the impact of synaptic pruning on the network’s retrieval performance, we study its effect on the signal-to-noise ratio (S/N) of the neuron’s input field in the modified Hopfield model (see equations 2.2 and 2.3). The S/N is known to be the primary determinant of retrieval capacity (ignoring higher-order correlations in the neurons input fields) (Meilijson & Ruppin, 1996) and is calculated by analyzing the moments of the neuron’s field. The network is started at a state X with overlap m with a specific memory ; the overlap with other memories is assumed to be negligible. Therefore W i j is the ij √M sum of M 1 independent variables with zero expectation and standard x2/2 variation √M and is distributed N(0, 1). Denoting (x) e , we use the = √2 fact that (x) x (x) and write 0 =
E f NE g(W )X Nm E g(W ) (2.6) i| i = ij j = ij j = h i 1 1 Nm E g(Wij) 1 Nm E g(Wij) 1 = 2 | j =+ 2 | j = = h i h i 1 ∞ i Nm g(Wij) Wij dWij = 2 √M Z ∞ 1 ∞ i Nm g(Wij) Wij dWij 2 + √M Z ∞ 1 ∞ i Nm g(Wij) (Wij) 0(Wij) dWij 2 √M Z ∞ 1 ∞ i Nm g(Wij) (Wij) 0(Wij) dWij 2 + √M = Z ∞ ∞ i Nm g(Wij) Wij (Wij)dWij = √M = Z ∞ Nm i E zg(z) = √M where z is a random variable with standard normal distribution. The vari- Synaptic Pruning in Development 1763 ance of the field is similarly calculated to be
V( f ) NE g2(z) . (2.7) i| i = h i Hence the S/N is