<<

Proceedings of the International Joint Conference IBERAMIA/SBIA/SBRN 2006 - 1st Workshop on Computational Intelligence (WCI’2006), Ribeir˜aoPreto, Brazil, October 23–28, 2006. CD-ROM. ISBN 85-87837-11-7

A Connectionist Model based on Physiological Properties of the

Alberione Braz da Silva Joao˜ Lu´ıs Garcia Rosa Ceatec, PUC-Campinas Ceatec, PUC-Campinas Campinas, SP, Brazil Campinas, SP, Brazil [email protected] [email protected]

Abstract 2 Intraneuron Signaling

Recent research on artificial neural network models re- 2.1 The biological neuron and the intra- gards as relevant a new characteristic called biological neuron signaling plausibility or realism. Nevertheless, there is no agreement about this new feature, so some researchers develop their The intraneuron signaling is based on the principle of own visions. Two of these are highlighted here: the first is dynamic polarization, proposed by Ramon´ y Cajal, which related directly to the cerebral cortex biological structure, establishes that “electric signals inside a nervous cell flow and the second focuses the neural features and the signaling only in a direction: from neuron reception (often the den- between . The proposed model departs from the pre- drites and cell body) to the trigger zone” [5]. vious existing system models, adopting the standpoint that a Based on physiological evidences of principle of dy- biologically plausible artificial neural network aims to cre- namic polarization the signaling inside the neuron is per- ate a more faithful model concerning the biological struc- formed by four basic elements: Receptive, responsible for ture, properties, and functionalities of the cerebral cortex, input signals; Trigger, responsible for neuron activation not disregarding its computational efficiency. threshold; Signaling, responsible for conducting and keep- ing the signal; and Secretor, responsible for signal releasing to another related neuron. 1 Introduction These elements are related to the dendritic region, cell body, axon, and presynaptic terminals of the biological neu- ron, respectively [5]. Classical connectionist models are based on the pioneer- The cell body is the neuron portion where cell genes are ing McCulloch-Pitts neuron proposal [9] although their op- and is the place where protein syntheses occur. Consid- erations are far from the way human brain works. Artificial ered the neural cell metabolic center, the cell body is the Neural Networks (ANN) nowadays are designed to mimic, main control place of ordinary neural electric signals. From with limited realism, some brain functionalities including the cell body - or - originate two signal conducting pattern recognition, new information acquisition, and mo- branches, the axon and the . tor performance [7, 3]. The main of a is the reception of sig- More recently, some researchers of this area have ap- nals originated from other cells. The axon, in general, proached the proposition of new models based on cerebral comes from a cell body area called axon hillock [5], located structure, that is, on cerebral cortex biological properties. on the opposite side of the tree trunk formed by dendrites, This new direction on research makes noticeable a new and ended in a micro-ramification form called axonal tree characteristic to be incorporated to ANNs: the biological or axonal branch. These branches have in their terminals plausibility. synaptic buttons, responsible for connectivity between neu- However there is no agreement about the definition of rons. this new concept. For this reason, biological plausibility The axon is considered the main neuron conductivity is still being analyzed under several aspects. One of these unit [5], responsible for the transmission of electric signals main visions concerns the cerebral cortex structure. (action potentials or spikes) from the axonal cone - area sit- Taken the ideas presented previously into consideration, uated at the end of cell body - to the presynaptic terminals, this paper proposes a new artificial neuron model based on keeping the same amplitude and duration, varying only in biological properties of the physiological neuron. intensity or frequency.

1 The transmission rules of neuron output signaling are none [3, 5], originating in a specific trigger zone - the be- limited, but very efficient. This efficiency is verified ginning of the axon. The fact that the , the through fast propagation and signal properties maintenance neuron conducting signal, is an “all-or-none” signal or bi- between initial and final points of involved [5, 6]. nary is motivated by the fact that a stimulus smaller than There are other elements, co-responsible for propagation threshold does not produce signals and a stimulus greater speed and amplitude maintenance of action potentials, for than threshold produces the same signal independently of example, the glial cell and the Ranvier nodules. The glial amplitude variation and stimulus duration. Consequently, cell involves the neural cell bodies and their axons in a pro- there are only two pulse characteristics for the informa- cess called myelination, which provides as an insulating fi- tion transmission process to be taken into consideration: the nal product, the myelin sheaths, white fatty extensions of amount of action potentials and the time intervals between neuroglial cells (or Schwann cells), involving the axon in them [5, 12, 4]. several layers. These sheaths are interrupted at regular in- The action potential is generated only if the input signal, tervals by the nodes of Ranvier, which are electrically non- or the weighted sum of the input signals by their synaptic insulated and are responsible for the action potential regen- weights, is greater than a certain activation threshold, that eration [5, 6]. is, once the threshold is reached, the additional increase in There is also the ionic mechanism, which can act as a amplitude of the input signal implies in an increase of the signaling structure in an excitable cell, independently of action potential frequency and not in an increase of the po- synapse localization. This mechanism refers to the cur- tential amplitude [5]. Since action potentials are of the kind rent flow generated by the ionic movement through protein “all-or-none” and conducted without alterations through the membranes - channels - generating the signals (membrane whole axon, the information contained in the signal is rep- potentials) in neural cells. These alterations in resting po- resented only by the frequency and pulse amount, without tentials of the cell membrane provoked by the ionic current taking into account their amplitude. So, the larger the stim- flowing through open channels produce input signals that ulus duration, the longer the action potential sequence lasts, develop the intraneuron signaling process. and consequently, the larger the amount of pulses [5]. The intraneuron signaling process begins after neuron Once the synaptic terminals are reached, action poten- binding, either directly or by means of modulation (see item tials stimulate chemical neurotransmitter releasing by the about interneuron signaling), altering the membrane poten- cell, considered as analogical output signals. In this graded tial and transforming chemical potential energy in an elec- transmitter releasing, there is, repeatedly, the transforma- tric signal called synaptic potential. The synaptic potential tion of digital signal, represented by action potential, in is graded and its amplitude is directly related to volume and analogical signal. Also in this process, the exact amount releasing time of chemical neurotransmitters in interneuron of neurotransmitters to be released by the cell is defined by signaling process. Once generated, the input signal - synap- the total number of action potentials in a given time inter- tic potential - often flows through dendrites and cell body val [5]. After an action potential releasing, a very short re- where it is accumulated until reaching a certain threshold, fractoriness time period happens, that is, a period of lesser responsible for pulse or action potential generation. This in- excitability. This time period - called refractory period - can put signal can be excitatory or inhibitory, so the latter tries be divided in: (1) absolute, when happens immediately after to prevent action potential from releasing. the action potential; and (2) relative, which is subsequent to The over stimulus of the target cell makes the inputs the absolute refractory period. compete. In the “struggle,” the weaker synapses are elimi- In the absolute refractory period, it is impossible to ex- nated and the stronger synapses keep the inputs to the post- cite a cell; the same does not happen in the relative period, synaptic cell [5]. These competitive inputs are combined when a pulse can be released [5, 11]. into the postsynaptic neuron through a process called neu- ronal integration, which sum up all the signals received by 2.2 The artificial neuron and the intraneu- a neuron and takes the decision of firing or not an action ron signaling potential, which reflects the brain fundamental ability of decision-making in the neuron [5, 11]. In neuronal inte- The neuron is considered the fundamental operational gration process, the brain ability in opting for one of the unit of a neural network - natural or artificial -, due to its ca- competitive alternatives (fire or not fire), selecting one and pacity of processing and transmitting information [5, 12, 4]. suppressing the other, is known as the integrative action of In artificial neural network primordial research, McCul- the nervous system [5]. loch and Pitts [9] define a neuron model that served as basis Action potentials are electric signals used for manipu- for building several artificial neurons (for instance, Rosen- lating brain information - reception, analysis, and transmis- blatt’s [3]). sion. They are very fast and transient, of the kind all-or- McCulloch-Pitts model [9] is based on an “all-or-none” property of a pulse releasing for a specific neuron. The ba- which are located in the pre and postsynaptic cell mem- sic idea to represent the “all-or-none” property proposed by branes and serve as a cytoplasmatic connection between the the two researchers consists in dividing the refractory period two cells. At chemical synapses, there is a small cellular in time units so that, at most, one pulse could be generated separation between the cells called synaptic cleft, instead of by the neuron in each interval. Besides, the model output cytoplasmatic continuity. is binary, that is, it is 0 (zero) or 1 (one) according to the At electrical synapses, part of electric current injected in value of the induced local field or neuron activation poten- presynaptic cell escapes through the resting channels and tial. McCulloch and Pitts [9] modeled, indeed, an artificial the remaining current is driven to the inside of the postsy- neuron as a time-independent binary element. naptic cell through the gap junction channels, which con- Maass [7] proposes a classification based on model gen- nect cytoplasms of involved cells. erations to situate his proposition in a biological realism At chemical synapses, in the presynaptic terminal spe- evolutionary scale. Maass [7] suggests three generations: cialized zones, there are vesicles containing neurotransmit- ter molecules. When the action potential reaches these 1. The first generation artificial model is based on synaptic vesicles, the neurotransmitters are released to the McCulloch-Pitts neuron, and its main feature is the synaptic cleft, and the interneuron signaling chemical pro- ability to produce only digital outputs - zero or one; cess, or chemical synapse, takes place. 2. The second generation model is based on the applica- These vesicles, when stimulated, flow towards the plas- tion of the as a computational unit, matic membrane, with which are melted, releasing neuro- which produces a continuous set of output values for a transmitters to the synaptic cleft. The amount of neuro- weighted (or polynomial) sum of inputs; and transmitters released is directly related to the depolarization amplitude inside presynaptic terminals and to the frequency 3. The third generation model uses the duration time of of the action potentials that cross the axon [5, 6]. an action potential to encode information, that is, uses The releasing of chemical neurotransmitters towards the time as a computational and interneuron communica- synaptic cleft operates as an output signal. After releasing, tion resource. the neurotransmitters spread along the synaptic cleft in or- der to reach the receptors of the postsynaptic cell membrane Maass’s integrate and fire neuron (spiking neuron) model of another neuron. The neurotransmitter binding at the post- is argued to be a third generation model [7]. That is, synaptic cell generates a synaptic potential which produces Maass [7, 8] employs the spike transmission time, the re- excitatory or inhibitory synapse, depending on the recep- fractory period, and the membrane electric potential in the tor type. In other words, receptors are the elements which pulse trigger zone as model’s main elements. define the type of synapse and the occurrence of modula- The model proposed by Rosa [13] considers the clas- tion [5, 13]. The fact that the receptors define the type of sic neuron model basic elements like connection weights, synapse agrees with Ramon´ y Cajal’s principle of connec- activation threshold, and activation function. In addition, tional specificity [5]. Rosa [13] introduces three more elements that act directly In some cases, neurotransmitters are enzymatically di- in intraneuron and interneuron signaling. These elements vided, synthesizing new transmitters, therefore increasing are the transmitters T, the receptors R, and the controllers the amount of substrate in the synaptic transmission pro- C, representing the amount of substrate, the binding affin- cess [13]. ity between transmitters and receptors, and the genes - name The chemical neurotransmitters are classified based on and expression. However, Rosa [13] does not consider some their control over the ionic channels of the postsynaptic intraneuron signaling features, as for instance, the associ- cell, acting directly or indirectly through direct or indirect ation of the pulse frequency with the amount of substrate chemical synaptic actions, respectively. In direct way, the released by transmitter T. receptors responsible for ionic channels control belonging to its structure, recognize the neurotransmitter and through 3 Interneuron signaling its binding, open these channels. In indirect way, the re- ceptor does not take part effectively of the ionic channel 3.1 The biological neuron and the in- structure, but there is an activation through a fixing protein terneuron signaling - called G protein - which is consequence of the binding between transmitter and receptor. This protein activates a The interneuron signaling occurs by means of electrical set of second messengers which modulates transmission at or chemical synapse, which have completely different mor- ionic channel. phologies. At electrical synapses, the transmission occurs The direct interference of a receptor over ionic chan- through special ionic channels called gap junction channels, nels triggers chemical synaptic actions, very fast in general, which often takes part in behavior, in direct way. In indirect Nvr = { P , W , Θ, τ, η, ξ, C } interference, the synaptic actions which serve also to mod- ulate behavior are slower, but alter neuron excitability and where each basic element can be described as: synaptic connection strengths of neural circuits. • P represents the membrane potential in neuron trigger The modulation process in chemical synaptic actions oc- zone - the place in neuron soma where the pulses are curs also through peptides that can act as neurotransmit- initiated, ters, but in a broader way - in relation to actuation area - and longer-lasting - in relation to action time. These pep- • W represents the connection weight set, tides are characterized by acting in restricted areas, show- • Θ represents the activation threshold, ing low conductance, without sustaining high frequency im- pulses, showing persistency and small excitatory effects, • τ represents the pulse transmission time between in- and not producing enough depolarization to excite a cell by volved neuron somas, itself [13]. In the case of depolarization, although insuffi- • η represents the time the neuron is kept inert between cient, the peptide transmission can provoke a fast excitatory pulses, effect by means of a second input, also excitatory. Mutation happens through modification of gene expres- • ξ represents the response function that defines the type sion, and it is another factor associated with synaptic modu- of synapse (excitatory or inhibitory), and lation process. Mutation is often due to the second messen- ger intense action in changes of binding affinities between • C represents the controller set. transmitters and receptors. In this proposal, the controller is considered an external element for the artificial neuron composition, because it acts 3.2 The artificial neuron and the interneu- at the interface of chemical synaptic transmission process. ron signaling The controller is an element set that represents the binding affinity degree, the amount of substrate, and the modifica- Maass [7] defines his model taking into account the bio- tion of gene expression in modulation. Here, the elements logical neuron output and the frequency of individual action P , W , Θ, τ, η, and ξ act on information transmission and potentials as the mechanism for information representation. on binding affinity control due to direct synaptic action. The Rosa [13], otherwise, takes into consideration Ramon´ y element set C acts on binding affinity control as a result of Cajal’s connective specificity principle, shown previously. the modulation effect, on the amount of substrate control That is, Rosa [13] proposes the inclusion of three new vari- due the action of acetylcholine or other transmitters that act ables - Transmitter (T), Receptor (R), and Controller (C) - in a similar way, and on the modification of gene expression in McCulloch-Pitts classical model [9], responsible for the as a consequence of the second messenger action. affinity control between neurons and the way signal trans- Here, the artificial neural network is a finite set of mission occurs at chemical synapse. The new elements, ac- spiking-response control neurons Nvr, defined as: cording to Rosa [13], are absent in conventional models. • A set of synapses S ⊆ Nvr × Nvr, 4 The biologically plausible artificial neuron • A synaptic weight w ≥ 0 for each pre(nu) and post(nv) synaptic neuron binding, proposal + • A response-function ξnu,nv : R → R for each There is no agreement about the characteristics that the synapse between the pre(nu) and post(nv) synaptic + proposal of a biologically more realistic artificial neuron neuron ∈ S, where R = {x ∈ R : x ≥ 0}, model can be based on, but there are some fundamental + + • A threshold function Θnv : R → R for each presy- principles, which provide a basis for such a proposal. naptic neuron nv ∈ Nvr [7], Focusing on neural signaling, there are two large groups of characteristics that must be taken into consideration: the • A set of controllers C ⊆ Nvr × Nvr, intraneuron signaling and the interneuron signaling. This • A label or number for the origin and target genes proposal considers both and is based mainly on Maass’ [7] (name), neuron formalism and on Rosa’s [13] model functionali- ties, because they present most of the basic principles for • A modulatory synaptic function mnu,nv ∈ {γ, ϕ, ψ}, biological plausibility characteristics. Merging Maass [7] where γ, ϕ, and ψ represent a gene expression, a bind- and Rosa [13] models and eliminating the redundancies, the ing affinity degree, and a substrate variation, respec- proposed model can be defined as: tively, and + + • A control function χnu,nv : R → R , ∀mnu,nv • θ(t) represents the heaviside function [2], which is the [13]. transformation of the differential equation ξnu,nv with initial conditions in an algebraic equation, in order to So, if a firing time set s of presynaptic neuron nu is de- obtain a solution of these conditions in an indirect way, F ⊆ R+ fined by nu , then the trigger zone potential of post- without calculating the general solution of ξnu,nv by synaptic neuron nv in time t is defined by [7]: means of integrals and derivatives, and P (t) = nv • χnu,nv represents the control function of postsynaptic P P w · ξ (t − s) nu:∈S s∈Fnu:s

• ϕnv represents the new binding affinity degree of re- applied to the function η(t), and ceptor (at target cell), • χnu,nv represents the control function. • ψnu represents the increasing of the amount of sub- strate (at origin cell), and The control function χnu,nv is given only by gene ex- pression of the target cell, because there is no pulse and no • ρnu,nv represents the type of postsynaptic potential refractory period without gene expression affinities between (excitatory or inhibitory) in relation to the type of the two involved signaling neurons. transmitter and receptor, by means of direct action. Finally, these formalisms aim to guarantee the new pro- posed model adherence to the biological neural signaling The negative signal of the inhibitory synapse is defined basic principles. This way, it adds new biological plausibil- by the response function ξ (t − s) of the postsynaptic nu,nv ity characteristics to Maass’ [7] and Rosa’s [13] models. neuron, through control element ρnu,nv, since the biological synapse does not change signals during learning process. For this reason, the synaptic weights have only positive or 5 Simulation zero values. The response function or postsynaptic potential is given The behavioral comparative analysis of the proposed by [2]: model is based on three variations of learning mod- els and algorithms: with Back- ξnu,nv(t) = 1 t−δ t−δ propagation (MLP+BackProp); Multilayer Perceptron with (exp(− ) − exp(− )) · θ(t) · χnu,nv 1−(τs/τm) τm τs GeneRec (MLP+GeneRec); and Spiking Response Model where with GeneRec (SRM+GeneRec). For the proposed model, the learning algorithm • τs represents the synapse time constant, GeneRec, proposed by O’Reilly [10, 11], was employed. The reason for this choice relies on its presumable biolog- • τm represents the membrane time constant, ical plausibility. Instead Back-propagation, considered bi- • t represents the current time, ologically implausible [1], GeneRec is argued to be more biologically plausible: learning happens through synaptic • δ represents the delay time constant in axonal trans- weight modifications using only the local information avail- mission, able in synapses. In simulation, the features of employed networks for 6 Conclusion comparison and of the proposed model are: the in- put, hidden, and output layers for the three models - The proposed model considers all the functionalities dis- MLP+BackProp, MLP+GeneRec, and SRM+GeneRec - are cussed earlier, and, specially, tries to merge Maass’ [7] and built by eighty, ten, and ten neurons respectively, and the Rosa’s [13] neuron model elements, adopting a new for- weights had their values assigned randomly. Specifically, mat for the element controller. This model aims to consider the values of the structural elements of the SRM base model a better modulation process, regarding the amount of sub- are assigned randomly in defined range for τm between 40 strate, the binding affinity degree, and the modification of and 2,000 ms, τs between 20 and 1,000 ms, θ between 0.1 gene expression. This way, the proposed model can add va- and 0.8 mV, τ between 2,000 and 8,000, δ between 20 and riety to the models considered biologically plausible. 200 ms. The same happens with the structural elements of the proposed model, where randomly values were assigned References to γnv between 0 and 1, φnv between 0.6 and 1.0, ψnu between 1 and the amount of neurons of the post-synaptic [1] F. Crick and C. Asanuma. Certain aspects of the anatomy layer, and ρnu,nv receives value 3 or 5. and physiology of the cerebral cortex. In D. E. Rumelhart In learning process, 100 patterns were presented, divided and J. L. McClelland, editors, Parallel Distributed Process- in 10 variations for each value. These patterns were pre- ing - Explorations in the Microstructure of Cognition, vol- sented to the network in an alternate way to avoid recog- ume 2, pages 333–371. A Bradford Book, MIT Press, 1986. nizing problems, through 101 iterations and learning rate of [2] W. Gerstner. Spiking neurons. In W. Maass and C. M. Bishop, editors, Pulsed Neural Networks, pages 3–53. The 0.2. MIT Press, 1999. In recognizing process, 10 patterns with varied complex- [3] S. Haykin. Neural Networks: A Comprehensive Foundation. ity were presented, and 90% of the patterns was recognized Prentice Hall, 2 edition, 1999. effectively. The unrecognized pattern is the same for every [4] E. M. Izhikevich. Which model to use for cortical spiking network, and this result was expected because this pattern neuron? IEEE Trans. Neural Networks, 15(5):1063–1070, resembles a pattern between digits 3 and 9 (see figure 1). 2004. [5] R. Kandel, J. H. Schwartz, and T. M. Jessell. Principles of Neural Science. McGraw-Hill, 4 edition, 2000. [6] C. Koch and O.¨ Bernander. Axonal modeling. In M. A. Arbib, editor, The Handbook of Brain Theory and Neural Network, pages 135–140. A Bradford Book, The MIT Press, 2 edition, 2003. [7] W. Maass. Networks of spiking neurons: The third genera- tion of neural network models. Neural Networks, 10:1659– Figure 1. Detail of the unrecognized pattern 1671, 1997. [8] W. Maass. Spiking neurons, computation with. In M. A. Ar- presented to the involved networks. bib, editor, The Handbook of Brain Theory and Neural Net- work, pages 1080–1084. A Bradford Book, The MIT Press, 2 edition, 2003. About the unrecognized pattern, a 10-fold cross valida- [9] W. S. McCulloch and W. Pitts. A logical calculus of the tion technique was applied, employing the training patterns ideas immanent in nervous activity. Bulletin of Math. Bio- that represent the digit nine. The test considered a total of physics, 5:115–137, 1943. ten correct patterns, a total of a hundred selected patterns, [10] R. C. O’Reilly. Six principles for biologically-based compu- and six correctly selected patterns for the MLP+GeneRec tational models for cortical cognition. Trends in Cognitive and MLP+BackProp models and eight correctly selected Science, 2:455–462, 1998. [11] R. C. O’Reilly and Y. Munakata. Computational Explo- patterns for the SRM+GeneRec model. From the results rations in Cognitive Neuroscience - Understanding the Mind shown on table 1 it is possible to notice that learning was by Simulating the Brain. A Bradford Book, MIT Press, more effective in the SRM + GeneRec model. 2000. [12] M. G. Paulin, L. F. Hoffman, and C. Assad. Dynamics and the single spike. IEEE Trans. Neural Networks, 15(5):987– Table 1. Performance of the models (in %). 994, 2004. Model Recall Precision F-Measure [13] J. L. G. Rosa. An artificial neural network model based on neuroscience: Looking closely at the brain. In V. Kurkova,´ MLP+GeneRec 43.75 41.18 42.42 N. C. Steele, R. Neruda, and M. Karn´ y,´ editors, Artificial SRM+GeneRec 50.00 47.37 48.65 Neural Nets and Genetic Algorithms - Proc. Intl. Conf. in MLP+BackProp 40.00 37.50 38.71 Prague, Czech Republic, pages 138–141, Prague, Czech Re- public, 2001. Springer-Verlag.