ORIGINAL RESEARCH published: 15 July 2015 doi: 10.3389/fncom.2015.00090

Pattern activation/recognition theory of mind

Bertrand du Castel *

Schlumberger Research, Houston, TX, USA

In his 2012 book How to Create a Mind, Ray Kurzweil defines a "Pattern Recognition Theory of Mind" which states that the neocortex uses millions of pattern recognizers, plus modules to check, organize, and augment them. In this article, I further the theory to go beyond pattern recognition and include also pattern activation, thus encompassing both sensory and motor functions. In addition, I treat checking, organizing, and augmentation as patterns of patterns instead of separate modules, therefore handling them the same as patterns in general. Henceforth I put forward a unified theory I call "Pattern Activation/Recognition Theory of Mind." While the original theory was based on hierarchical hidden Markov models, this evolution is based on their precursor: stochastic grammars. I demonstrate that a class of self-describing stochastic grammars allows for unifying pattern activation, recognition, organization, consistency checking, metaphor, and learning, into a single theory that expresses patterns throughout. I have implemented the model as a probabilistic programming language specialized in activation/recognition grammatical and neural operations. I use this prototype to compute and present diagrams for each stochastic grammar and corresponding neural circuit. I then discuss the theory as it relates to artificial network developments, common coding, neural reuse, and unity of mind, concluding by proposing potential paths to validation.

Keywords: recurrent, neural, stochastic, autapse, self-description, metaphor, grammar, hylomorphism

Edited by: Hava T. Siegelmann, Rutgers University, USA

Reviewed by: Cees van Leeuwen, KU Leuven, Belgium; Kyle I. Harrington, Harvard Medical School, USA

*Correspondence: Bertrand du Castel, Schlumberger Research, 5599 San Felipe St., Houston, TX 77056, USA; [email protected]

Received: 25 October 2014; Accepted: 25 June 2015; Published: 15 July 2015

Citation: du Castel B (2015) Pattern activation/recognition theory of mind. Front. Comput. Neurosci. 9:90. doi: 10.3389/fncom.2015.00090

Introduction

In his book How to Create a Mind (Kurzweil, 2012), Ray Kurzweil proposes to model the brain with a unified processing paradigm (pp. 5, 7, 23) that consists in a hierarchy of self-organizing pattern routines (pp. 29, 172) "...constantly predicting the future and hypothesizing what we will experience" (p. 31) and "...involved in our ability to recognize objects and situations" (p. 32). Furthermore, Kurzweil proposes that the model be implemented with hierarchical hidden Markov models (HHMMs, p. 144) whose parameters are learned via genetic algorithms (p. 146).

Importantly, Kurzweil calls his model "Pattern Recognition Theory of Mind." While I fully concur with the hypothesis of a unified processing paradigm involving patterns, I suggest in this article that the model Kurzweil presents in his book does not fully achieve his goal and that it can be more ambitious. First, I note that Kurzweil adds to the central pattern recognition processing two modules: a "critical thinking module" (p. 175) that checks for consistency of patterns, and an "open question module" (p. 175) that uses metaphors to create new patterns. These two modules invoke additional apparatus, separate from that of his pattern recognition modules. Second, Kurzweil proposes that self-organization of pattern recognition modules is achieved via linear programming (p. 174), which is yet another apparatus.


I suggest that while functions of consistency checking, metaphorical creation, and self-organization are indeed needed, their treatment should not require extra mechanisms. The theory should handle them just as any other pattern, since they are actually patterns of patterns, an assertion that I will formally substantiate further on. Moreover, I consider that limiting pattern processing to recognition is limiting the scope of a unified processing paradigm. Indeed, patterns of activation are not different from patterns of recognition; for example, drawing a circle or recognizing a circle both involve the same pattern (circle), and so the same unified pattern mechanism should apply in both cases, with the difference being determined only by the type of action (Mumford and Desolneux, 2010). In fact, Kurzweil does mention that HHMMs can be used in both activation and recognition (p. 143); but he did not include that in his model, nor did he consider them being used at the same time in activation and recognition, a key element of my analysis going forward.

In this article, I therefore advance an evolution of the model, extending it to both activation and recognition. I call it "Pattern Activation/Recognition Theory of Mind." It is based on stochastic grammars, i.e., the superset model from which HHMMs were originally derived (Fine et al., 1998). Stochastic grammars differ from HHMMs in that they are fully recursive, or, said otherwise, fully recurrent. I will demonstrate that they are capable of self-description, allowing them to handle both patterns and patterns of patterns. Consequently, they can account for pattern consistency, since consistency is a pattern of patterns; similarly, stochastic grammars can use metaphors to create new patterns, in a process which also involves patterns of patterns. Since the self-describing stochastic grammars I present can both activate and recognize patterns, they meld in a single model the motor and sensory experiences of the brain.

More generally, I will show that in addition to providing a unified processing paradigm, self-describing stochastic grammars capable of both activation and recognition have a natural mapping to neuronal circuits. Recursion (recurrence), which enables self-description, is a property of neural circuits, an example being a neuron whose axon feeds back to the neuron itself via an autapse (van der Loos and Glaser, 1972). In addition, stochastic grammar probabilities express the stochastic nature of synapses, with excitation (more likely to trigger; probability above .5) and inhibition (less likely to trigger; probability below .5). Furthermore, activation is what an axon does, while recognition is what a dendrite does. Self-description allows reading, modifying, and creating neuronal circuitry, modeled by stochastic grammars both activating and recognizing other stochastic grammars. This is equivalent to saying that patterns can both activate and recognize other patterns, thus providing a general mechanism to create, modify, and exercise patterns and patterns of patterns for both activation and recognition.

In order to make the correspondence between stochastic grammars and neural circuits both clear and explicit, I present each grammar of the text together with an associated neural diagram. Neural properties to watch for are those that map stochastic grammar properties: hierarchy, recurrence, directionality, probabilities, and, most importantly for this presentation, learning afforded by self-description. Diagrams are produced by running a prototype implementation of the model in the form of a probabilistic programming language belonging to the lineage of PRISM (Sato and Kameya, 1997) and Church (Goodman et al., 2008), but dedicated to the self-describing activation/recognition grammatical and neural operations described in this article.

In summary, I present here a model that augments and completes Kurzweil's software ambition and has a natural interpretation in terms of the brain's hardware. Subsequently, I discuss the theory as it relates to artificial network developments, common coding, neural reuse, and unity of mind, and I propose potential paths to validation. Finally, Kurzweil places evolution at the center of his learning model, with genetic algorithms. I prefer here to use instead swarming as a learning mechanism, as evolution is a phylogenetic property (which happens over time), while swarming is an ontogenetic property (which happens in real time). Swarming is readily achieved with neuronal circuits in lower and higher animals, and therefore readily asserted in neural circuits. With this considered, I can now describe the Pattern Activation/Recognition Theory of Mind.

Materials and Methods

I first introduce activation/recognition grammars, a class of probabilistic context-free grammars that can function both separately and simultaneously in activation and recognition. I show that activation/recognition grammars have the power needed for biologically-inspired computations, that is, the power of Turing machines, while providing an additional level of expressiveness and theory that maps neural circuitry. I demonstrate with an example from vision how a probabilistic version of these grammars can learn colors through reinforcement by expressing a swarm of lower grammars. I then show with metaphor and composition that this mechanism generalizes up to a self-describing grammar that provides the root of a hierarchical and recursive unified model of pattern activation and recognition in neural circuits.

Activation/Recognition

I now introduce grammars that can simultaneously activate and recognize patterns.

A grammar executes a set of rules until rules no longer apply (Chomsky, 1957). Grammars herein are probabilistic (a.k.a. stochastic) context-free grammars without null symbol (Booth, 1969; Chi, 1999). They are coded in a minimal subset of Wirth's syntax notation (Wirth, 1977), augmented with probabilities. These grammars are not traditional, because they are not limited to functioning solely either in activation or recognition; instead, they can function both separately and simultaneously in activation and recognition, which I will show is a small but critically consequential specification in regard to biologically-inspired interpretation of grammar theory and practice.

Grammar "Draw = DrawSquare DrawCircle DrawTriangle." produces a square, a circle, and a triangle. Conversely, when presented with a square, a circle, and a triangle, grammar "Spot = SpotSquare SpotCircle SpotTriangle." recognizes them.
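To make the notation concrete, the sketch below interprets such a grammar in plain Python. It is only an illustration of the activation/recognition convention used in this article, not the prototype implementation described later; the Draw/Spot prefixes, the function names, and the treatment of input as a list of labeled stimuli are assumptions of the sketch.

```python
# A minimal sketch (not the article's prototype): a grammar is a dictionary
# mapping a non-terminal to its sequence of symbols. Terminals prefixed with
# "Draw" act as actuators (they produce output); terminals prefixed with
# "Spot" act as sensors (they consume, i.e., recognize, input).

def run(grammar, start, stimuli):
    """Expand `start` depth-first; return (productions, ok) where `ok` is
    False if some sensor failed to recognize the next stimulus."""
    produced = []
    stimuli = list(stimuli)          # copy: sensors consume from the front

    def expand(symbol):
        if symbol in grammar:                     # non-terminal: structural only
            return all(expand(s) for s in grammar[symbol])
        if symbol.startswith("Draw"):             # actuator terminal
            produced.append(symbol[len("Draw"):].lower())
            return True
        if symbol.startswith("Spot"):             # sensor terminal
            expected = symbol[len("Spot"):].lower()
            return bool(stimuli) and stimuli.pop(0) == expected
        raise ValueError("unknown symbol: " + symbol)

    return produced, expand(start)

# "Mix = DrawSquare SpotCircle DrawTriangle." recognizes a circle and
# produces a square and a triangle.
mix = {"Mix": ["DrawSquare", "SpotCircle", "DrawTriangle"]}
print(run(mix, "Mix", ["circle"]))   # (['square', 'triangle'], True)
print(run(mix, "Mix", ["square"]))   # (['square'], False): SpotCircle fails
```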


Production and recognition can mix, as with grammar "Mix = DrawSquare SpotCircle DrawTriangle." This grammar recognizes a circle, and produces a square and a triangle.

Formally, the three grammars above are unified in a single paradigm of production and recognition; this is captured by a common rule "A = B C D." that expresses, with arbitrary symbols (de Saussure, 1916), the abstract pattern constituted by any three entities. This pattern is then specialized with rules that terminate abstract symbols with activation and recognition functions. The first grammar above is thus formally written "A = B C D. B = DrawSquare. C = DrawCircle. D = DrawTriangle." while the second one is written "A = B C D. B = SpotSquare. C = SpotCircle. D = SpotTriangle." and the third one "A = B C D. B = DrawSquare. C = SpotCircle. D = DrawTriangle." (Figure 1).

In activation/recognition grammar rules, sequencing does not convey order, except that recognition comes ahead of production. Grammars "A = B C. B = DrawSquare. C = DrawCircle." and "A = B C. B = DrawCircle. C = DrawSquare." are equivalent. They both produce a square and a circle, in any order. The number of rules involved in a derivation conveys order, whether for production or recognition. In the two previous examples, the number of rules needed to produce a square and a circle is the same for both grammars: two rules are involved, so the grammars are equivalent. But the grammar "A = B C. B = DrawSquare. C = D. D = DrawCircle." is not equivalent to the preceding grammars, as drawing a square requires the application of two rules, whereas drawing a circle requires the application of three rules; consequently, the square is ordered ahead of the circle (Figure 2).

Further varying the rules, an infinity of situations can be conveyed. For example, grammar "A = Look B. B = See C. C = Describe." expresses looking for an object and, on seeing it, describing it (Stock and Stock, 2004). Abstract symbols such as A, B, and C, that expand to other symbols, are called non-terminal and have no other role than structural. Symbols such as DrawSquare, SpotCircle, Look, and See, that do not expand further, are called terminal and have a functional role; as explained later, they can also be placeholders for further grammatical expansion. Terminals can be actuators or sensors; DrawSquare and Look are actuators, as they produce output, while SpotCircle and See are sensors, as they recognize input (Figure 3).

Taken as a whole, the above account of grammars differs from tradition (du Castel, 1978) only in that it melds activation and recognition, or, otherwise stated, actuation and sensing, or action/perception (Clark, 2013). Grammars have long been known to work alternatively in production ("performance") and recognition ("competence"), but their working simultaneously in both modes is new; and with this addition, activation/recognition grammars find a new role in biologically-inspired computation.

FIGURE 1 | Three grammars are shown in the top row, while the second row shows a graphical form of the grammars, and the third row presents their associated neural circuits. Arrows in grammatical descriptions indicate when a grammar is activating (down arrow), and when it is recognizing (up arrow). In neural descriptions, an inverted triangle denotes a soma, while a segment cut by an arc denotes an axonal extension and a synapse, further connecting to a dendrite.


FIGURE 2 | Four grammars are shown with their neural counterpart. The two first grammars are equivalent, as their ordering is not set by rule sequencing but rather by the number of rules applied; the production of the square requires just two rule applications, the same as for the circle. The third grammar is not equivalent to the two preceding ones, as three rule applications are required for the circle, and two for the square. The fourth grammar is different from the three preceding grammars, as the square requires three rule applications, and is therefore ordered after the circle, which takes two.

FIGURE 3 | This grammar and its neural equivalent demonstrate variety of purpose; here, actions are described instead of geometrical figures. A, B, and C are non-terminals, while Look, See, and Describe are terminals. Look and Describe are in production, See is in recognition. The particular grammatical and neuronal pattern of this example will be repeatedly found further down in self-describing grammars and their neural equivalents.

Expressivity

I now add probabilities to activation/recognition grammars, and I show that these grammars have the same power as Turing machines, while presenting additional modeling capabilities.

Rules can have alternates, marked by the symbol "|" (or) and weighted with probabilities. Grammar "A = .3 B | .7 C. B = DrawSquare. C = DrawCircle." prints a square with probability .3 and a circle with probability .7, such that the mean of the grammar's distribution is 30% squares and 70% circles. Grammar "A = .3 B | .7 C. B = SpotSquare. C = SpotCircle." recognizes a square with probability .3, and a circle with probability .7, such that the grammar's expectation is 30% squares and 70% circles. Note that grammar probabilities sum up to 1 for convenience of presentation, but this is not necessary; weights can substitute for probabilities without affecting the descriptive value of a grammar (Smith and Johnson, 2007). For style, unspecified probabilities are read as equipartition, such that grammar "A = B | C | D | E." is actually "A = .25 B | .25 C | .25 D | .25 E." (Figure 4).

Completing this enumeration of fundamental properties of activation/recognition grammars, non-terminal symbols can be used in feedback loops (Bellman, 1986; Buzsáki, 2006; Joshi et al., 2007). Grammar "A = DrawSquare A." repeats producing small squares to infinity, while grammar "A = SpotCircle B. B = DrawSquare A." only repeats as long as circles are recognized, drawing as many squares as there are circles (Figure 5).


FIGURE 4 | Three grammars and their neural equivalent are presented, with probabilities. In grammars, alternate paths are marked by a horizontal bar, with corresponding probabilities indicated for each path. In neural circuits, probabilities are associated with synapses.
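As an illustration of how weighted alternates determine a grammar's distribution, the following sketch samples the grammar "A = .3 B | .7 C. B = DrawSquare. C = DrawCircle." many times; the encoding of alternates as (probability, body) pairs is an assumption of this sketch, not the article's notation or prototype.

```python
import random

# Illustrative only: alternates are (probability, body) pairs, and each
# expansion samples one alternate according to its weight.
grammar = {
    "A": [(0.3, ["B"]), (0.7, ["C"])],
    "B": [(1.0, ["DrawSquare"])],
    "C": [(1.0, ["DrawCircle"])],
}

def derive(symbol, out):
    if symbol not in grammar:                 # terminal actuator: emit it
        out.append(symbol)
        return
    weights = [p for p, _ in grammar[symbol]]
    body = random.choices([b for _, b in grammar[symbol]], weights=weights)[0]
    for s in body:
        derive(s, out)

counts = {"DrawSquare": 0, "DrawCircle": 0}
for _ in range(10_000):
    out = []
    derive("A", out)
    counts[out[0]] += 1
print(counts)   # roughly 30% squares and 70% circles
```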

FIGURE 5 | Two recursive (recurrent) grammars are shown with their neural circuits. In the grammatical schema, the recursion is only shown for three levels for reason of presentation (in further figures it is typically cut to one or two for the sake of clarity). The neural diagram does not suffer from the same limitation of presentation, so it is faithful to the original grammatical formulation. The recurring synapse of the first circuit connects the soma onto itself, so it categorizes as autapse. The second neural circuit has two synapses terminating on the same soma, one being recurrent. The recurrent synapse of the second circuit is categorized as regular, as it connects one soma to a different one.
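The recursion of grammar "A = SpotCircle B. B = DrawSquare A." can likewise be written directly as mutually recursive functions, one per rule; this is only a minimal sketch of the recurrence, with my own encoding of stimuli and productions.

```python
# Tiny sketch of "A = SpotCircle B. B = DrawSquare A.": the recursion on A
# is a recursive call, mirroring the feedback (autapse-like) loop of the
# circuit. A failed sensor simply ends the recursion.
def A(stimuli, produced):
    if stimuli and stimuli[0] == "circle":        # SpotCircle (sensor)
        return B(stimuli[1:], produced)
    return produced

def B(stimuli, produced):
    return A(stimuli, produced + ["square"])      # DrawSquare (actuator), then A again

print(A(["circle", "circle", "circle"], []))      # ['square', 'square', 'square']
```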

Equipped with alternates and recursion, activation/recognition grammars can express Turing machines (Turing, 1936). For proof, I consider the seminal Turing example, which is the generation of the infinite sequence 001011011101111... The following activation/recognition grammar transliterates Turing's specification using its exact original notation: "b = PeRPeRP0 RRP0LL o. o = 1RPxLLL o | 0 q. q = Any RR q | None P1 L p. Any = 0 | 1. p = xER q | eR f | None L L p. f = Any RR f | None P0 L L o." In the terminology of activation/recognition grammars, terminals Pe (Print e), R (Move right), P0 (Print 0), L (Move left), Px (Print x), P1 (Print 1), and E (Erase), are actuators, while terminals 1 (Read 1), 0 (Read 0), None (Read None; meaning blank square), x (Read x), and e (Read e), are sensors. Therefore, mixed production and recognition overcomes computational limitations of traditional context-free grammars (Chomsky and Schützenberger, 1963), allowing them to perform most biologically-inspired computations, if not all (Penrose, 1989; Siegelmann, 1995) (Figure 6).

Turing's infinite sequence is actually an enumeration of the natural number set, perhaps not a biological object. However, recognition of numbers is certainly a common and early human activity (Libertus et al., 2011). Grammar "A = B | C. B = 0 | 1. C = B A." expresses any digit sequence; the first rule differentiates finishing a sequence and pursuing one, the second rule produces digits, and the third rule adds to sequences. Following the same pattern, but with different terminals, grammar "A = B | C. B = DrawSquare | DrawCircle. C = B A." produces a number of squares and circles in sequences mapping the production of digits, grammar "A = B | C. B = SpotSquare | SpotCircle. C = B A." recognizes such a sequence, and grammar "A = B | C. B = SpotSquare | DrawCircle. C = B A." mixes identifying squares and producing circles. The non-terminal grammar part common to digits and geometrical figures expresses a metaphor from one domain (digits) to another (figures), a subject I will come back to further on (Figure 7).

I have shown that activation/recognition grammars have the same power as Turing machines, to formally demonstrate that these grammars are universal. However, they are more than Turing machines. They constitute a higher-level model that expresses in particular hierarchical and recursive (recurrent) concepts that are not explicit in Turing machines. It is this extra modeling capability that I use to build the theoretical apparatus that on one hand meets the extra requirements I have put on Kurzweil's original theory, and on the other finds a natural expression in neural circuits.

Now, while it is yet possible to suggest that they are natural models of biologically-inspired processes, a plausible origin has to be found for activation/recognition grammars.

Learning

I now show that probabilistic activation/recognition grammars can learn with swarms, using color learning as an example.

The use of probabilities allows producing and recognizing patterns expressing gradients. Assuming base colors red and green, grammars can produce from them composite colors, such as yellow grammar "A = B A. B = .5 C | .5 D. C = PrintGreenPoint. D = PrintRedPoint." and orange grammar "A = B A. B = .61 C | .39 D. C = PrintGreenPoint. D = PrintRedPoint." or gold grammar "A = B A. B = .54 C | .46 D. C = PrintGreenPoint. D = PrintRedPoint." Replacing actuators by sensors, "C = ReadGreenPoint. D = ReadRedPoint." lets grammars recognize, instead of produce, corresponding colors with variations respecting probability distributions. Yellow, orange, and gold grammars are very similar, since they differ only by their gradient probabilities (Figure 8).

The purpose of this section is to study reinforcement learning with grammar swarms. Swarms are well-understood (Kennedy and Eberhart, 1995), and grammar swarms have been studied in production (von Mammen and Jacob, 2009). Here I will consider one aspect of swarm learning, albeit a critical one, in recognition. I will consider the capability of a swarm to select for the orange color. The problem is equivalent to that of a swarm exploring for food (Dasgupta et al., 2010); instead of sampling a geometrical space, the swarm samples the green-red spectrum. The swarm is made of color grammars with different probabilities. The swarm is presented with varied colors; when a color is presented, the grammar of the swarm closest to that color recognizes it.

FIGURE 6 | Here is the Turing grammar with its neural circuit. Note that the grammar has actually been simplified for the sake of presentation, as ordering should be introduced following the precepts previously enunciated regarding the number of rules executed.


FIGURE 7 | Varying terminals. The first grammar and its neural circuit produce binary digit sequences. The second grammar is the same as the first one except for its terminals. Instead of producing digits 0 and 1, it recognizes squares and produces circles. It does that following the same pattern, where production of 0 is replaced by recognition of a square, and production of 1 by production of a circle. As will be discussed further down, this constitutes a metaphor, applying a set pattern (expressed by the non-terminal part of the grammar) from one domain (digits) to another (geometrical figures) by varying the terminals.
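A small sketch of the point made by Figure 7: the non-terminal skeleton "A = B | C. B = ... | ... . C = B A." stays fixed while its two terminals are rebound, first to digits and then to geometrical figures. The helper function and the uniform stopping probability are assumptions of the sketch, not part of the article's prototype.

```python
import random

# The same skeleton with different terminal bindings: this is the basic
# mechanism the article calls a metaphor (illustrative Python only).
def sequence(t1, t2, stop_prob=0.3):
    out = [random.choice([t1, t2])]          # rule B: pick one terminal
    while random.random() > stop_prob:       # rule A: finish or pursue (rule C recurses)
        out.append(random.choice([t1, t2]))
    return out

random.seed(1)
print(sequence("0", "1"))                    # digit domain
print(sequence("DrawSquare", "DrawCircle"))  # geometrical-figure domain
```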

FIGURE 8 | Yellow, orange, and gold grammars are shown with their neural circuits. The only difference between these three grammars is probabilities, which express the mixture of green and red constitutive of each color.
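The gradient grammars can be pictured as sampling points with the grammar's probabilities; the sketch below produces an "orange" field of green and red points with weights .61/.39 and then recovers the weights from the points, standing in for the Print/Read terminal pair. The point encoding is my own and purely illustrative.

```python
import random

# Sketch of a gradient grammar such as the orange grammar
# "A = B A. B = .61 C | .39 D. C = PrintGreenPoint. D = PrintRedPoint.":
# production samples green/red points with the grammar's probabilities, and
# the sensing variant estimates those probabilities from observed points.
def produce_color(p_green, n_points=1000):
    return ["green" if random.random() < p_green else "red" for _ in range(n_points)]

def recognize_color(points):
    return sum(p == "green" for p in points) / len(points)   # estimated green weight

orange_points = produce_color(0.61)
print(round(recognize_color(orange_points), 2))   # close to 0.61
```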

If the recognized color is close to orange, the described grammar produces orange in return. This is the essence of swarm learning, as the swarm can then be modified to concentrate more and more on the exact orange color, a process that I do not describe here but that can be performed by the methods presented in the next section.

Color grammars presented above differ only by their probabilities. Grammars can themselves be represented by grammars (Harris, 1968). Therefore, patterns (such as individual color grammars, with set probabilities) can be represented by other patterns (such as general color grammars, with varying probabilities). My demonstration consists in showing that a grammar can describe another one in a way that expresses the swarm capability sought.

For this, I first need to introduce a simple case of a grammar describing another one. The most simple grammar I can consider is "A = PrintGreenPoint." The key elements of the grammar are A and PrintGreenPoint. The equal sign ("=") and dot (".") are markers that are necessary in the linear form of grammar writing, but that do not need to be reproduced in a grammar-describing grammar, because their function is accomplished by the structure of that grammar (cf. Methods Summary further below). Thus, grammar "A = PrintGreenPoint." is fully described by grammar "A = QuoteA B. B = QuotePrintGreenPoint." This describing grammar uses the quote operator. This operator allows specifying a symbol that is not otherwise active, which is necessary to avoid confusion between the non-terminal node A of the describing grammar and the terminal node A referring to the described grammar. This operator, recognized as such by Harris (Harris, 1968), is that of Gödel's proposition-describing propositions (Gödel, 1931/1986); it was also identified, independently, by McCarthy (1960) and Wirth (1977). Without this operator, the describing grammar would go into inappropriate recursion. Instead it describes the simple grammar as desired, expressing that the recursion domain of the described grammar is different from that of the describing grammar (Figure 9).

FIGURE 9 | Simple description. The first grammar is the very simple grammar. The second grammar describes the first grammar, using the quote operator, to point to appropriate nodes of the grammar described. The second neural circuit combines the neural circuits of the describing and described grammars. The grammatical function of the quote operator is fulfilled by the projection of the neural circuit of the describing grammar to the neural circuit of the described grammar.

I now consider the slightly more complex grammar "A = .61 B | .39 C. B = PrintGreenPoint. C = PrintRedPoint." It is a subset of the orange grammar, with probabilities expressing the proper mix of green and red that produces orange. Grammar "A = B C D. B = QuoteA E. E = F G. F = QuotePoint61 QuoteB. G = QuotePoint39 QuoteC. C = QuoteB H. H = QuotePrintGreenPoint. D = QuoteC I. I = QuotePrintRedPoint." describes this orange grammar subset, including probabilities (Figure 10).

The more complex orange grammar is just an expansion of the simpler one to multiple points. For the sake of simplicity, I will therefore consider the simplified orange grammar for my demonstration. Varying the probabilities of the grammar, it is possible to produce any color grammar of the green-red spectrum. For example, it is possible to produce a grammar that recognizes colors with probabilities .5 and .5, as easily as one recognizing probabilities .6 and .4, or one recognizing .9 and .1. It is then possible to produce these three grammars at once with grammar "A = B | C | D. B = .5 E | .5 F. E = SpotGreenPoint. F = SpotRedPoint. C = .6 G | .4 H. G = SpotGreenPoint. H = SpotRedPoint. D = .9 J | .1 K. J = SpotGreenPoint. K = SpotRedPoint." This assembly of grammars, all similar but for one element, is called a grammar swarm (von Mammen and Jacob, 2009). When presented with a color, this swarm of three grammars activates the grammar closest in color to the input (Figure 11).

The swarm is then a tool of color detection. Let's assume that the swarm receives as input the color orange, that is, a mix of .61 green and .39 red. The branch of the swarm that detects that color is the second branch of the grammar since its probabilities, namely .6 and .4, are the closest to those of orange (the distribution of .6 and .4 is closest to that of .61 and .39). For that detection to be confirmed, this swarm branch needs to be connected to an orange production which acknowledges the detection and therefore presents an input for reinforcement. This capability is present once the swarm expands into production rules, as in "A = B C. B = .6 D | .4 E. D = SpotGreenPoint. E = SpotRedPoint. C = F G H. F = QuoteA J. J = K L. K = QuotePoint6 QuoteC. L = QuotePoint4 QuoteD. G = QuoteC M. M = QuotePrintGreenPoint. H = QuoteD N. N = QuotePrintRedPoint." When the second branch is triggered, it allows closure of the reinforcement loop, which is validated by presentation of the expected orange color (Figure 12).

Any other gradient can be substituted to red-green, be it a different spectrum, tone, or other probabilistically-determined base for classification. However, this example of linking input to output for reinforcement learning needs to be augmented with more powerful capabilities. Consequently, I turn to demonstrating that the model is both general and organized.

Self-Description

I now show that probabilistic activation/recognition grammars can self-describe, allowing patterns to activate/recognize other patterns, such as with metaphor and composition.


FIGURE 10 | Probabilistic description. The first grammar is a subset of the orange grammar. The second grammar describes the first grammar. The second neural circuit combines the neural circuits of the describing and described grammars. Probabilities quoted in the describing grammar project to the described grammar, just as probabilities referred by the describing circuit project to the circuit described.

FIGURE 11 | A grammar swarm. This is a swarm of three color grammars, side by side with its neural equivalent.
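The selection step performed by the swarm, as described above, amounts to picking the branch whose probabilities are closest to those of the input; the sketch below uses a simple absolute-difference distance, which is my own choice since the article does not specify a measure.

```python
# Sketch of swarm selection: each branch of the swarm grammar carries its own
# green/red probabilities (.5/.5, .6/.4, .9/.1), and the branch whose
# distribution is closest to the observed input "recognizes" the color.
swarm = {"branch1": 0.5, "branch2": 0.6, "branch3": 0.9}   # green weights

def closest_branch(observed_green):
    return min(swarm, key=lambda b: abs(swarm[b] - observed_green))

print(closest_branch(0.61))   # 'branch2', the .6/.4 branch, as in the text
```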


FIGURE 12 | Reinforcement learning. The first grammar is the target orange grammar. The second grammar is the branch of the grammar swarm that is triggered when presented with a color close to orange. The first part of the second grammar is the triggering mechanism; the second part is the description of the target orange grammar. In the corresponding neural circuit, the top part is the trigger, the bottom right part is the description, and the bottom left part is the activated circuitry resolving to orange.

Grammar-describing grammars can be recursively defined upward in an increasing generalization. The fixed point of that recursion is an ultimate grammar that can produce and recognize any grammar, including self. This root grammar "A = .5 B | .5 C. B = Symbol D. D = .5 E | .5 F. E = Weight G. G = .5 Symbol | .5 H. H = Symbol G. F = E D. C = B A." defines the most general pattern, from which all other patterns derive (cf. Methods Summary, further below). Like all activation/recognition grammars, this top grammar can function in production, recognition, and mixed mode. The top grammar provides for a unified model since it covers all patterns (Figure 13).

This begs studying grammar learning over the full activation/recognition grammatical landscape. Kurzweil's two separate modules that I have not yet integrated in the theory are dealing with augmentation and organization. Augmentation is related to metaphor, the capability of creating new patterns as well as applying existing patterns to new domains. Organization is related to the general problem of associating patterns with one another. Metaphors allow reusing circuitry: for example, if a particular circuit counts items, one cannot imagine this circuit being replicated for the infinite number of possible items that can be counted. Therefore, there needs to be a mechanism that allows reusing the counting circuit to apply it to different items, which is what I will present.


FIGURE 13 | The root grammar can describe any grammar, including itself. Like the grammar it corresponds to, the neural circuit self-describes, providing root circuitry from which all circuits can derive.

Organizational capabilities are an extension of metaphoric capabilities. If a circuit counts items, it should not only be capable of counting any item, it should also be able to count anything; for example, the number of times an orange color is produced. In the following, I show that self-description allows handling both metaphor and organization within the Pattern Activation/Recognition Theory of Mind.

Regarding metaphors, a pattern can be described by another one, while the describing grammar can also perform other operations, a feature I have already used for swarms. For the sake of presentation, I consider a very simple pattern, that of counting two squares, done by grammar "A = B C. B = DrawSquare. C = DrawSquare." This grammar is described by grammar "A = B C D. B = QuoteA E. E = QuoteB QuoteC. C = QuoteB F. F = DrawSquare. D = QuoteC G. G = DrawSquare." This describing grammar can be augmented to draw a circle each time it detects a square, with "A = B C D. B = QuoteA E. E = QuoteB QuoteC. C = QuoteB F. F = UnquoteDrawSquare DrawCircle. D = QuoteC G. G = UnquoteDrawSquare DrawCircle." The unquote operator is the reverse of the quote operator. With the addition of the new rules, instead of producing a square, the described grammar forwards the drawing operation to the describing grammar that then produces a circle. In other words, the describing grammar retargets the counting pattern from one domain (squares) to another (circles), which is the essence of metaphors, defined as "a cross-domain mapping in the conceptual system" (Lakoff, 1979). Of course, this is illustrating only the basic mechanism which metaphors rely on, while I have published elsewhere with Yi Mao a more complete account, using Montague grammars (Montague, 1974; du Castel and Mao, 2006) (Figure 14).

Regarding organization, I now consider two grammars previously discussed, digit grammar "A = B | C. B = 0 | 1. C = B A." and orange grammar "A = .61 B | .39 C. B = PrintGreenPoint. C = PrintRedPoint." Digit grammar outputs digits, and orange grammar outputs points. Let's imagine that instead of producing the digit 1, I want the digit grammar to produce an orange point, therefore producing a pattern of orange points that follows the pattern of 1's of the digit grammar. A way to do this could be to just replace digit 1 of the digit grammar by an orange grammar expansion, as in "A = B | C. B = 0 | D. C = B A. D = .61 E | .39 F. E = PrintGreenPoint. F = PrintRedPoint." While this static combination produces the desired result, it does not provide a plausible model for the brain. Like in the case of metaphor discussed above, this process of combining grammars directly would accrue an infinity of combinations in the brain, not a possibility. What is needed is rather a means to combine the two circuits while preserving them, so that they can be combined in an infinite number of ways in a dynamic fashion. Here again, we turn to self-description, by invoking a grammar that describes these two grammars and combines them while keeping them intact. This is achieved by grammar "A = B C D. B = QuoteA E. E = QuoteB QuoteC. C = QuoteB F. F = Quote0 J. J = Unquote1 K. D = QuoteC G. G = QuoteB H. H = QuoteA. K = L M N. L = QuoteD O. O = P Q. P = QuotePoint61 QuoteE. Q = QuotePoint39 QuoteF. M = QuoteE R. R = QuotePrintGreenPoint. N = QuoteF S. S = QuotePrintRedPoint." In addition to providing a means to reuse circuitry, this dynamic combination, by leaving the two circuits combined intact, allows them to be learned independently of each other. Independent learning helps limit the size of spaces explored by swarms (Bengio et al., 2009) (Figure 15).

While I have previously studied swarm learning for gradients, other variations can be considered, such as structural properties of grammars. In the same way as a grammar can generate a swarm of other grammars with different probabilities, it can generate other grammars with different structural properties, and then learn from the swarm, with branches that fulfill some target function.


FIGURE 14 | Actualization of a metaphor. The first grammar counts two squares, and is described by the second grammar. Each time the first grammar outputs a square, the second grammar transforms it into a circle, thereby allowing counting circles instead of squares. In the second neural circuit, the left part is the original square-counting circuit, the middle part activates the square-counting circuit, and the right part transforms counting squares into counting circles.

If learning the particular precedes the general, a possible but not necessary hypothesis for this argument, hierarchies of activation/recognition grammars are then learned from the bottom up, accumulating experience in the memory of validated grammars until reaching the top grammar. In that model, pattern circuits build up over time, and the discovery of new patterns is an incremental operation.

Methods Summary

I now provide a method allowing better understanding of stochastic self-description, and I explain computation of the grammatical and neural diagrams.

Since grammar patterns are formal and structural, complex ones are better understood by considering them together with an expressive representation. For example, consider self-describing grammar "A = .5 B | .5 C. B = Symbol D. D = .5 E | .5 F. E = Weight G. G = .5 Symbol | .5 H. H = Symbol G. F = E D. C = B A." For legibility, A can be recast as Rules, B as Rule, C as RulesSequence, D as Alternates, E as Alternate, F as AlternatesSequence, G as Symbols, and H as SymbolsSequence. Then the grammar is expressed as "Rules = .5 Rule | .5 RulesSequence. Rule = Symbol Alternates. Alternates = .5 Alternate | .5 AlternatesSequence. Alternate = Weight Symbols. Symbols = .5 Symbol | .5 SymbolsSequence. SymbolsSequence = Symbol Symbols. AlternatesSequence = Alternate Alternates. RulesSequence = Rule Rules." This method can be generalized to any grammar for a better understanding of its function.

Earlier versions of some of the grammars were posted online as unpublished works and are presented here in new versions with permission from the author (du Castel, 2013a,b).

The model is fully implemented as a prototype which I ran to produce the grammatical and neural diagrams of this article as well as all describing grammars. The prototype compiles grammars expressed in Wirth's format into both their graphical expression and the expression of their self-description. These compilations drive execution of the stochastic grammars in two modes: one provides the grammatical diagrams, while the other provides the neural diagrams. This unity of execution ensures that both diagrams are indeed issued from a single description, as presented by the model, and that there is in fact a one-to-one correspondence between the grammars and their neural representations.
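As an illustration of the recast root grammar presented in the Methods Summary, the sketch below holds that grammar as data and samples it to generate the skeleton of an arbitrary grammar. Treating Symbol and Weight as terminals that emit fresh placeholder names and random weights is my own reading, for illustration only, and is not the article's prototype.

```python
import random
import itertools

# The recast root grammar as data: each non-terminal maps to its alternates
# (two equiprobable alternates where the grammar writes .5 | .5).
ROOT = {
    "Rules":              [["Rule"], ["RulesSequence"]],
    "RulesSequence":      [["Rule", "Rules"]],
    "Rule":               [["Symbol", "Alternates"]],
    "Alternates":         [["Alternate"], ["AlternatesSequence"]],
    "AlternatesSequence": [["Alternate", "Alternates"]],
    "Alternate":          [["Weight", "Symbols"]],
    "Symbols":            [["Symbol"], ["SymbolsSequence"]],
    "SymbolsSequence":    [["Symbol", "Symbols"]],
}

fresh = ("S%d" % i for i in itertools.count(1))   # placeholder symbol names

def sample(symbol):
    if symbol == "Symbol":
        return next(fresh)                        # emit a fresh symbol
    if symbol == "Weight":
        return "%.2f" % random.random()           # emit a random weight
    return " ".join(sample(s) for s in random.choice(ROOT[symbol]))

random.seed(3)
print(sample("Rules"))   # a randomly drawn grammar skeleton of symbols and weights
```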


FIGURE 15 | Composition. Grammars 1 and 3 are combined by grammar 2. The neural circuit of grammar 2 activates the neural circuit of grammar 1, and then the neural circuit of grammar 3, joining them to make them interdependent.
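The quote/unquote mechanism behind Figures 14 and 15 can be sketched as executing a grammar held as data while selectively retargeting its terminals, leaving the described grammar intact; the dictionary encoding and the remap argument below are assumptions of this sketch, not the article's operators.

```python
# Sketch of the quote/unquote idea used for metaphor and composition: the
# described grammar is held as data (quoted), and a describing layer executes
# it while intercepting chosen terminals and forwarding them to another
# action, leaving the described grammar itself unchanged.
counting = {"A": ["B", "C"], "B": ["DrawSquare"], "C": ["DrawSquare"]}   # counts two squares

def execute(grammar, symbol, emit, remap=None):
    remap = remap or {}
    if symbol in grammar:                        # quoted non-terminal: expand its rule
        for s in grammar[symbol]:
            execute(grammar, s, emit, remap)
    else:                                        # terminal: unquote, possibly retargeted
        emit(remap.get(symbol, symbol))

out = []
# Metaphor: retarget the square-counting pattern to circles without editing it.
execute(counting, "A", out.append, remap={"DrawSquare": "DrawCircle"})
print(out)   # ['DrawCircle', 'DrawCircle']
```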

The prototype is quite elaborate in its treatment of grammars, as it handles all the cases of this article and more, but is less so in its handling of sampling and distributions, where it is far from the state of the art. While the prototype implementation is not in a form appropriate for general use, I present next a screen shot showing both the core execution code and the results of the execution on a very simple example (Figure 16).

My implementation of self-describing activation/recognition stochastic grammars, which affords universal computation, meta-circularity, and probabilities, belongs, however modestly, to a general class of languages well-represented by PRISM (Sato and Kameya, 1997), which is a probabilistic offspring of Prolog, and Church (Goodman et al., 2008), which is a probabilistic offspring of Lisp. From a theoretical perspective, a difference between the three is that PRISM is oriented toward logic, Church toward general modeling, while the present implementation is oriented toward properties discussed here: activation/recognition symmetry, grammar/neural diagrams, grammatical inference, metaphors, and composition. Being universal, the three languages support each other's computational model, but in each case less naturally than with the original. To further illustrate this point, a language like Mathematica (Wolfram, 1988) presents similar capabilities but is rather a natural fit to mathematics at large. Perhaps anecdotally, I suggest that the less natural character of advanced logic, probabilistic modeling, or sophisticated mathematics transcribed by self-describing activation/recognition stochastic grammars reflects well the reality of human performance, where each of these activities involves pragmatic graduate-level experience, whereas grammars are arguably inherently a construct of early childhood and a more natural human experience.


FIGURE 16 | This is the implementation. The left side shows executing code, while the right side shows the result of execution for a simple grammatical and neural description.

This said, from a practical perspective, my prototype could be implemented as a new module of PRISM and Church, focused on the self-describing activation/recognition grammatical and neural functions of this article but benefiting from the statistical capabilities and support found in these two systems.

Results

While grammars are one of the oldest subjects of scientific study, neither activation/recognition grammars nor the theoretical model of stochastic grammars recursively describing hierarchies of other grammars has, to my knowledge, been studied before. This model describes circuits in stochastic relationships; furthermore, it describes that very description, and keeps doing so recursively up to self-description. At each recursion, patterns can be produced, recognized, and learned via reinforcement and composition, building up from past descriptions.

Following is a recapitulation of the figures in this article showing grammars, their schema, and their neural circuit, with comments now framed by the whole of this article:

Figure 1: Activation/recognition grammars correspond to neural circuits, with recognition represented by cell input, and activation by cell output.
Figure 2: Activation/recognition grammars express both parallel and sequential execution in neuronal circuits.
Figure 3: Circuits of activation/recognition grammars represent concepts in both space and time.
Figure 4: Probabilities of activation/recognition grammars map synaptic probabilities.
Figure 5: Recursion in activation/recognition grammars corresponds to recurrence in neural circuits, via autapses or regular synapses.
Figure 6: An activation/recognition grammar and its neural circuit can represent a Turing machine, asserting universal computation.
Figure 7: Activation/recognition grammars with varying terminals express a metaphor in neural circuits.
Figure 8: By varying probabilities, activation/recognition grammars and their circuits can express gradient patterns.
Figure 9: An activation/recognition grammar can describe another one, and its neural circuit activates the circuit of the grammar described.
Figure 10: Activation of one circuit by another projects probabilities from the latter to the former.
Figure 11: An activation/recognition grammar can express a swarm of similar circuits.
Figure 12: A branch of a swarm of neural circuits enables reinforcement learning by forwarding results to another circuit.
Figure 13: The root grammar can describe any other grammar, including self, and ditto for the corresponding root neural circuit.


Figure 14: A self-describing neural circuit activates a metaphor by directing the output of another neural circuit.
Figure 15: Any two circuits can be connected dynamically by a third one using self-description.

With this general exhibit of properties of activation/recognition grammars and their neural circuits, a model emerges that allows neural computations to be expressed in the well-researched mathematical framework of grammar theory. With their corresponding neural circuits, self-describing activation/recognition grammars form a biologically-inspired model that can produce, recognize, and learn neurons and circuits of neurons from a single source, suggesting a unitary model for their development and function.

Discussion

Theory

My goal was to provide a unified model that augments and completes Kurzweil's theory. Activation/recognition grammars further that theory by allowing both activation and recognition of patterns; this corresponds to neural activation and recognition functions, including motor and sensory stimuli.

Stochastic grammars are one step up from Kurzweil's HHMMs, allowing full recursion, which maps naturally with neural circuits that are analogically recurrent (Gilbert and Li, 2013). The combination of activation/recognition with probabilities warrants adaptive pattern processing that exploits the stochastic nature of synaptic connections (Staras and Branco, 2009).

By using activation/recognition stochastic grammars with swarming patterns, it is possible to include learning in the model. By combining self-description with these properties, learning extends from simple patterns to patterns of patterns, allowing the treatment of consistency checking as yet another pattern.

Stochastic grammars can both describe patterns and generate new ones, using a combinatorial learning process that allows varying patterns for purpose. New patterns are modifications of existing ones, which expresses Kurzweil's metaphoric model of discovery.

Validation

I now discuss, in regard to validation, three claims of the new theory, which are: patterns are governing throughout; activation and recognition share descriptions; and patterns are self-describing. In terms of neural computation, the algebra is that of stochastic grammars; neural circuits implement their rules; and computation stems from recursively nested descriptions.

Back-propagation (LeCun et al., 1998) and spiking (Maass, 1997) networks are perhaps paragons of current computational artificial neural networks. The former provides strong results on recognition benchmarks while aiming at biological plausibility (Bengio et al., 2015); the latter roots more in biology while aiming at matching the former in performance (Schmidhuber, 2015). Neither addresses activation/recognition symmetry nor self-description.

I suggest, however, that activation/recognition symmetry and self-description are not in conflict with current artificial networks, but are rather complementary to them, so that the way forward may be a mix of current practice blended with the new requirements. Another possibility is that a new brand of analytical (Carpenter and Grossberg, 1987; Herbort and Butz, 2012) neural networks based on stochastic grammars and their distributional and sampling properties emerges with millions of diachronically and synchronically learned patterns (Rodríguez-Sánchez et al., 2015). Additionally, the same principles can be applied to structuring biological artificial neural networks (Markram, 2012).

The symmetry of activation and recognition has been theorized in particular with the introduction of common coding (Prinz, 1990), and of a shared circuit model (Hurley, 2007) supported by the discovery of mirror neurons (Gallese et al., 1996). The new theory supports activation and recognition sharing a same circuit as well as them sharing a same description but with different circuitry. It can therefore be proposed as an underlying formalism for both common coding (same description) and the shared circuit model (same circuit). Neurophysiological evidence for these models would then favor the new theory as well. Nevertheless, I think that definite validation entails mapping the circuitry presented in this paper (Mikula and Denk, 2015).

While stochastic grammatical self-description is new, it finds a home in theories of neural reuse (Anderson, 2010) and cultural recycling (Dehaene and Cohen, 2007). In both theories, it has been recognized that co-optation of an existing circuit for a new function does not negate pre-existing functions. For example, hand gestures accompanying speech reflect the source domain of a spoken metaphor (McNeill, 1992; Marghetis et al., 2014). This is in accord with the activation mechanism of this article, which only adds, not substitutes, to the output of a neuron. New and old functions coexist, and may or may not find an outward expression, depending on activation of downward circuitry. When it does, behavior consistent with the theory ensues, such as gestures associated with speech. General validation, however, would consist in a broad understanding of conditions under which situations of co-opted neural output actually affect behavior.

With the above examples a comprehensive underpinning model of neural circuitry emerges, but ultimate validation of the theory relates to the unity of the brain. Indeed, the theory suggests that neural circuits are composed by self-description into circuitry of increasing generality, up to the root stochastic grammar describing all stochastic grammars. Should neuroscience reach the point where neural circuits can be followed along that path (Laumonnerie et al., 2015), direct validation of the theory would then be effected, arguably advancing a formal version of Aristotle's hylomorphism.

Conclusion

I must emphasize that the Pattern Activation/Recognition Theory of Mind should reflect more than the neuronal circuitry that I have considered. In particular, neuron/interneuron distinctions (DeFelipe et al., 2013), columns and layers arrangements, modular specialization, and brain waves, are to be part of an elaboration of the model. While I do not know that any one or more of these properties would invalidate the model at this point, I would, a contrario, like them to reinforce it.

But crucially, the model predicts that we should find in the brain self-descriptive neural circuits of the kind modeled by the theory. That would be the discovery that shows that this new model of the brain answers the millenary quest for understanding the unity of mind.

Acknowledgments

I am indebted to four colleagues for discussions that encouraged me as I was finding my way to this article. In chronological order, Harry Barrow, with stochastic grammars and vision, Mike Sheppard, with Bayesian reasoning and teleology, Craig Beasley, with marine biology and modeling, and Tom Zimmerman, with swarms and learning, pushed me to explore to its theoretical limit my interest in the role of grammars in neuroscience. Additionally, Susan Rosenbaum and Eric Schoen facilitated the publication of this article, and Michel Koenig encouraged me to surmount obstacles and deliver. I also thank Clémente du Castel for her relentless editing. Finally, I am indebted to three Frontiers reviewers for comments that prompted me to improve upon the paper by providing neural schemas for each individual grammar, by developing the validation section, and by presenting the computational implementation of the model, while bringing caution to the biologically-inspired perspective.

References

Anderson, M. L. (2010). Neural reuse: a fundamental organizational principle of the brain. Behav. Brain Sci. 33, 245–313. doi: 10.1017/S0140525X10000853
Bellman, R. E. (1986). The Bellman Continuum, ed R. S. Roth (Singapore: World Scientific).
Bengio, Y., Lee, D. H., Bornschein, J., and Lin, Z. (2015). Towards Biologically Plausible Deep Learning. arXiv:1502.04156.
Bengio, Y., Louradour, J., Collobert, R., and Weston, J. (2009). "Curriculum learning," in Proceedings of the 26th Annual International Conference on Machine Learning, eds D. Pohoreckyj, L. Bottou, and M. L. Littman (Montreal: ACM), 382.
Booth, T. L. (1969). "Probabilistic representation of formal languages," in IEEE Record of Tenth Annual Symposium on Switching and Automata Theory (Waterloo), 74–81.
Buzsáki, G. (2006). Rhythms of the Brain. Oxford: Oxford University Press.
Carpenter, G. A., and Grossberg, S. (1987). A massively parallel architecture for a self-organizing neural pattern recognition machine. Comput. Vision Graphics Image Process. 37, 54–115. doi: 10.1016/S0734-189X(87)80014-2
du Castel, B. (1978). Form and interpretation of relative clauses in English. Linguist. Inq. 9, 275–289.
du Castel, B. (2013a). Formal Biology. Available online at: http://www.researchgate.net/publication/259343850.
du Castel, B. (2013b). Probabilistic Grammars Model Recursive Neural Feedback. Available online at: http://www.researchgate.net/publication/259344086.
du Castel, B., and Mao, Y. (2006). "Generics and Metaphors Unified under a Four-Layer Semantic Theory of Concepts," in Acts of the Third Conference on Experience and Truth, ed T. Benda (Taipei: Soochow University Press).
Chi, Z. (1999). Statistical properties of probabilistic context-free grammars. Comput. Linguist. 25, 131–160.
Chomsky, N. (1957). Syntactic Structures. The Hague: Mouton.
Chomsky, N., and Schützenberger, M.-P. (1963). "The algebraic theory of context-free languages," in Computer Programming and Formal Systems, eds P. Braffort and D. Hirschberg (North Holland; New York, NY), 118–161.
Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav. Brain Sci. 36, 181–204. doi: 10.1017/S0140525X12000477
Dasgupta, S., Das, S., Biswas, A., and Abraham, A. (2010). Automatic circle detection on digital images with an adaptive bacterial foraging algorithm. Soft Comput. 14, 1151–1164. doi: 10.1007/s00500-009-0508-z
DeFelipe, J., López-Cruz, P. L., Benavides-Piccione, R., Bielza, C., Larrañaga, P., Anderson, S., et al. (2013). New insights into the classification and nomenclature of cortical GABAergic interneurons. Nat. Rev. Neurosci. 14, 202–216. doi: 10.1038/nrn3444
Dehaene, S., and Cohen, L. (2007). Cultural recycling of cortical maps. Neuron 56, 384–398. doi: 10.1016/j.neuron.2007.10.004
de Saussure, F. (1916). "Principes généraux," in Cours de Linguistique Générale, eds C. Bailly, A. Sechehaye, and A. Riedlinger (Lausanne: Payot), 98–101.
Fine, S., Singer, Y., and Tishby, N. (1998). The hierarchical hidden Markov model: analysis and applications. Mach. Learn. 32, 41–62. doi: 10.1023/A:1007469218079
Gallese, V., Fadiga, L., Fogassi, L., and Rizzolatti, G. (1996). Action recognition in the premotor cortex. Brain 119, 593–609. doi: 10.1093/brain/119.2.593
Gilbert, C. D., and Li, W. (2013). Neural correlations, population coding and computation. Nat. Rev. Neurosci. 14, 350–363. doi: 10.1038/nrn3476
Gödel, K. (1931/1986). "On formally undecidable propositions of Principia Mathematica and related systems," in Collected Works: Vol. I, ed S. Feferman (Oxford: Oxford University Press), 144–195.
Goodman, N. D., Mansinghka, V. K., Roy, D. M., Bonawitz, K., and Tenenbaum, J. B. (2008). "Church: a language for generative models," in Proceedings of the 24th Conference in Uncertainty in Artificial Intelligence, eds D. McAllester and P. Myllymaki (Corvallis: AUAI Press), 220–229.
Harris, Z. S. (1968). Mathematical Structures of Language. New York, NY: John Wiley and Son.
Herbort, O., and Butz, M. V. (2012). Too good to be true? Ideomotor theory from a computational perspective. Front. Psychol. 3:494. doi: 10.3389/fpsyg.2012.00494
Hurley, S. (2007). The shared circuits model: how control, mirroring and simulation can enable imitation, deliberation, and mindreading. Behav. Brain Sci. 31, 1–22. doi: 10.1017/S0140525X07003123
Joshi, P., and Sontag, E. D. (2007). Computational aspects of feedback in neural circuits. PLoS Comput. Biol. 3:e165. doi: 10.1371/journal.pcbi.0020165
Kennedy, J., and Eberhart, R. (1995). "Particle swarm optimization," in IEEE Record of International Conference on Neural Networks (Perth, WA), 1942–1948.
Kurzweil, R. (2012). How to Create a Mind: The Secret of Human Thought Revealed. New York, NY: Viking Books.
Lakoff, G. (1979). "The contemporary theory of metaphor," in Metaphor and Thought, ed A. Ortony (Cambridge: Cambridge University Press), 203–204.
Laumonnerie, C., Tong, Y. G., Alstermark, H., and Wilson, S. I. (2015). Commissural axonal corridors instruct neuronal migration in the mouse spinal cord. Nat. Commun. 6, 7028. doi: 10.1038/ncomms8028
LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998). "Gradient-based learning applied to document recognition," in Proceedings of the IEEE, Vol. 86 (Piscataway, NJ), 2278–2324.
Libertus, M. E., Feigenson, L., and Halberda, J. (2011). Preschool acuity of the approximate number system correlates with school math ability. Dev. Sci. 14, 1292–1300. doi: 10.1111/j.1467-7687.2011.01080.x
Maass, W. (1997). Networks of spiking neurons: the third generation of neural network models. Neural Netw. 10, 1659–1671. doi: 10.1016/S0893-6080(97)00011-7
Marghetis, T., Núñez, R., and Bergen, B. K. (2014). Doing arithmetic by hand: hand movements during exact arithmetic reveal systematic, dynamic spatial processing. Q. J. Exp. Psychol. 67, 1579–1596. doi: 10.1080/17470218.2014.897359
Markram, H. (2012). The human brain project. Sci. Am. 306, 50–55. doi: 10.1038/scientificamerican0612-50
McCarthy, J. (1960). Recursive functions of symbolic expressions and their computation by machine, Part I. Commun. ACM 3, 184–195. doi: 10.1145/367177.367199
McNeill, D. (1992). Hand and Mind: What Gestures Reveal About Thought. Chicago, IL: University of Chicago Press.
Mikula, S., and Denk, W. (2015). High-resolution whole-brain staining for electron microscopic circuit reconstruction. Nat. Methods 12, 541–546. doi: 10.1038/nmeth.3361
Montague, R. (1974). Formal Philosophy. New Haven, CT: Yale University Press.
Mumford, D., and Desolneux, A. (2010). Pattern Theory: The Stochastic Analysis of Real-World Signals. Natick, MA: A. K. Peters, Ltd.
Penrose, R. (1989). The Emperor's New Mind. Oxford: Oxford University Press.
Prinz, W. (1990). A Common Coding Approach to Perception and Action. Berlin: Springer.
Rodríguez-Sánchez, A., Neumann, H., and Piater, J. (2015). Beyond simple and complex neurons: towards intermediate-level representations of shapes and objects. Künstliche Intell. 29, 19–29. doi: 10.1007/s13218-014-0341-0
Sato, T., and Kameya, Y. (1997). "PRISM: a language for symbolic-statistical modeling," in Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (Nagoya), 1330–1339.
Schmidhuber, J. (2015). Deep learning in neural networks: an overview. Neural Netw. 61, 85–117. doi: 10.1016/j.neunet.2014.09.003
Siegelmann, H. (1995). Computation beyond the Turing limit. Science 268, 545–548. doi: 10.1126/science.268.5210.545
Smith, N., and Johnson, M. (2007). Weighted and probabilistic context-free grammars are equally expressive. Comput. Linguist. 33, 477–491. doi: 10.1162/coli.2007.33.4.477
Staras, K., and Branco, T. (2009). The probability of neurotransmitter release: variability and feedback control at single synapses. Nat. Rev. Neurosci. 10, 373–383. doi: 10.1038/nrn2634
Stock, A., and Stock, C. (2004). A short history of ideo-motor action. Psychol. Res. 68, 176–188. doi: 10.1007/s00426-003-0154-5
Turing, A. (1936). "On computable numbers, with an application to the Entscheidungsproblem," in Proceedings of the London Mathematical Society (London), 2, 230–265.
van der Loos, H., and Glaser, E. M. (1972). Autapses in neocortex cerebri: synapses between a pyramidal cell's axon and its own dendrites. Brain Res. 48, 355–360. doi: 10.1016/0006-8993(72)90189-8
von Mammen, S., and Jacob, C. (2009). The evolution of swarm grammars – growing trees, crafting art and bottom-up design. Comput. Intell. 4, 10–19. doi: 10.1109/MCI.2009.933096
Wirth, N. (1977). What can we do about the unnecessary diversity of notation for syntactic definitions? Commun. ACM 20, 822–823. doi: 10.1145/359863.359883
Wolfram, S. (1988). Mathematica: A System for Doing Mathematics by Computer. Boston, MA: Addison-Wesley.

Conflict of Interest Statement: The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Copyright © 2015 du Castel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
