Visual Neuroscience(1995), 12, 1027-1052. Printed in the USA. Copyright @ 1995 Cambridge University Press0952-5238/95 $11.00 + .10

ALAN GOVE,t STEPHEN GROSSBERG,2 AND ENNIO MINGOLLA2 IM1T Lincoln Laboratory, 244 Wood Street, Lexington 20epartment of Cognitive and Neural Systemsand Center for Adaptive Systems, Boston University, Boston (RECEIVED November 30, 1994; ACCEPTED April 4, 1995)

Abstract A neural network model is developed to explain how visual thalamocortical interactions give rise to boundary percepts such as illusory contours and surface percepts such as filled-in brightnesses.Top-down feedback interactions are needed in addition to bottom-up feed-forward interactions to simulate these data. One feedback loop is modeled between lateral geniculatenucleus (LGN) and cortical area VI, and another within cortical areas VI and V2. The first feedback loop realizesa matching process which enhances LGN cell activities that are consistent with those of active cortical cells, and suppressesLGN activities that are not. This corticogeniculate feedback, being endstoppedand oriented, also enhances LGN ON cell activations at the ends of thin dark lines, thereby leading to enhanced cortical brightness percepts when the lines group into closed illusory contours. The second feedback loop generatesboundary representations, including illusory contours, that coherently bind distributed cortical features together. Brightness percepts form within the surface representationsthrough a diffusive filling-in processthat is contained by resistive gating signals from the boundary representations.The model is used to simulate illusory contours and surface brightnessesinduced by Ehrenstein disks, Kanizsa squares,Glass patterns, and cafe wall patterns in single contrast, reverse contrast, and mixed contrast configurations. Theseexamples illustrate how boundary and surface mechanismscan generatepercepts that are highly context-sensitive,including how illusory contours can be amodally recognized without being seen, how model simple cells in VI respond preferentially to luminance discontinuities using inputs from both LGN ON and OFF cells, how model bipole celis in V2 with two colinear receptive fields can help to complete curved illusory contours, how short-range simple cell groupings and long-range bipole cell groupings can sometimesgenerate different outcomes, and how model double-opponent, filling-in and boundary segmentationmechanisms in V4 interact to generatesurface brightness percepts in which filling-in of enhancedbrightness and darkness can occur before the net brightness distribution is computed by double-opponent interactions. Keywords: Brightness, Illusory contours, Lateral geniculate nucleus, , Neural networks, Adaptive resonancetheory

Introduction with certain illusory contours. A neural modelof theseLGN- VI interactions is developedand used,as part of a larger theory, This article describesa previouslyunsuspected linkage between to simulate the illusory brightening and darkening effects that the mechanismsof binocular vision, illusory contour formation, are generatedalong with illusory contours in responseto Ehren- and brightnessperception that was first reported in Gove et al. stein, Kanizsa, Glass,and cafe wall input patterns. In some of (1994). The binocular vision mechanismsinclude corticogenic- thesepatterns (the single contrast patterns), all of the inducing ulate feedback pathways,one of whose functional roles is hy- image elementshave the same sign with respectto the image pothesizedto be the selectionof monocular LGN cells whose background. In other patterns (the mixed contrast and reverse activation is consistentwith that of cortical cells that are acti- contrast patterns), someinducing elementshave opposite con- vated during binocular and monocular viewing (Grossberg, trasts with respectto the background. Correlated changes in 1976,1980; Murphy&Sillito, 1987;Singer, 1979).We propose brightness and contour percepts in responseto both types of that this feedbackfrom cortical area VI to the LGN has test- patterns are simulated using the model. able effects on the brightnesspercepts that are generatedalong We have selectedthis particular set of data for analysis be- cause it presentsconceptual challenges to all models of . For example, how are curved, even circular, illu- Reprint requeststo: StephenGrossberg, Department of Cognitive and Neural Systemsand Center for Adaptive Systems,III Cumming- sory contours generatedfrom just a few image contrasts, none ton Street, Boston, MA 02115, USA. of which is colinear with the contour (Fig. IA)? How are illusory

1027 1028 A. Gove,S. Grossberg,and E. Mingo//a

Fig. 1. (A) Ehrensteinillusion: The circular illusory contour enclosesa disk of enhancedbrightness. The different brightnessesinside and outside the circle ren- der the disk visible. (8) A vertical illusory contour is readily recognizedeven though it is not "seen" in the senseof separatingtwo regionsof different brightness and color.

A B

contours recognizedeven if they do not separatean image into make the Ehrensteindisk appear darker, rather than brighter. two regions of visibly different brightness or color (Fig. 18)? than its surround. How do illusory contours sometimesseparate an image with To seewhy this is so, assumeas in Fig. 28 that the thin line constant background luminance into two regions of different is black (low luminance)and surrounded by a white (high lumi- brightness or color (Fig. lA)? How is the relative brightnesses nance)background. Since OFF cells respond best to low lumi- of the two regions determined?All of thesequestions illustrate more generalissues concerning how the visual systemgenerates boundary and surface representations,of which illusory con- tours and brightness perceptsprovide particularly compelling ON-center OFF-center examples. For a review of the functional significance of illu- OFF-surround ON-surround sory contours and brightness percepts in the broader scheme /-:;:"-- + of visual boundary and surface representation, seeGrossberg I - (1994) and Grossberg et al. (1989). ..,~:- ,+,- + -+, '. -I '+ I, A '---" The perception of "brightness buttons" at line ends The interior of the Ehrensteindisk that is surroundedby the illu- OFF-center cells: sory contour in Fig. lA is brighter than its exterior. This appar- Maximum response at line end (interior): ently simple percepthas attracted a great deal of attention from vision scientistsbecause one could imagine many reasonswhy a-':,~ ,/'--:+' no brightness difference or the reverse brightness difference might have beenseen instead (Lesher, in press).Kennedy (1979) ON-center cells:\."" +-;-,, r~~' has attempted to explain this percept by positing that "bright- B nessbuttons" occur at the ends of dark (low luminance)lines. Other authors have used terms suchas "dissimilation" or "line Maximum response along sides (exterior): end contrast" to describethis perceptualphenomena, which has long beenthought to be distinct from classical"area" contrast, whereby the luminance that engendersperceptual contrast in a target regioncompletely encloses its area (Frisby & Clatworthy, 1975; Day & Jory, 1978; Halpern, 1981).Thetextbookmecha- \ ~ nism for explaining brightness (area) contrast has, in turn, for c ~. decadesbeen an appeal to the on-center, off-surround recep- tive fields of early . Fig. 2. Retinal center-surroundcells and their optimal stimuli (A). The An analysis of how suchcells respond to dark lines shows, ON cell, on the left, respondsbest to a high luminancedisk surrounded however, that they cannot, by themselves,explain brightness by a low luminanceannulus. The OFF cell, on the right, respondsbest buttons. More generally, neither on-center off-surround cells to a low luminance disk surrounded by a high luminance annulus (8). (called ON cells below) nor off-center on-surround cells (called OFF cells respond to the inside of a black line. The OFF cell centered at the line end respondsmore strongly than the OFF cell centered in OFF cells below) can explain this phenomenon. We interpret the middle, becausethe surround region of the former cell is closer to this to meanthat the ON and OFF cells that occur in the LON optimal. In (C) ON cellsrespond to the white background just outside (Schiller, 1992),and that are the source of cortical brightness the black line. The amount of overlap of each ON cell's surround with percepts(De Yoe & van Essen, 1988)cannot, without further the black line affects the strength of the cell's response.As seenin the processing,explain brightnessbuttons. Fig. 2 shows that what- ON cell's optimal stimulus (C), the more of the surround that is stimu- ever contribution to area contrast is generated at the ends of lated by a black region, the better the ON cell will respond. Thus, an thin lines by ON or OFF cells must be less in magnitude than ON cell centeredjust outsidethe side of the line will respondbetter than that generatedalong their sides.As explainedbelow, this should a cell centeredjust outside the end of the line.

.+),~,~~ Brightness, contours, and corticogeniculatefeedback 1029

nance in their receptive-fieldcenter and high luminance in their in a surfacerepresentation system that is called the FeatureCon- surround, OFF cells whosecenters lie insidethe line will be acti- tour System,or FCS. Output signals from the BCS form com- vated. Furthermore, OFF cells nearthe line end (but still inside partments within the FCS, within which LGN inputs to the FCS the line) will be more strongly activated then OFF cells in the initiate a diffusive filling-in processthat generatesa surfacerep- middle of the line, becausethe line end is more like a black resentation,including a percept of brightness:ln the complete disk surrounded by a white background than the line middle binocular BCS/FCS model (Grossberg,1994), this surfacerep- is (Fig. 2B). That is, an OFF cell whose centerlies in the line resentationemerges only after severalstages of BCS and FCS end receivesless inhibition from its surround than does a cell interaction occur. The model simulated hereinhas been simpli- centered in the middle of the line, becausea larger area of the fied to focus upon the targeted data base. former cell's surround lies in the white background. The full BCS models aspectsof the interblob cortical pro- A similar analysis can be applied to the ON cells. An ON cessingstream from area VI to V4, and the FCS modelsthe blob cell is excited by high luminance in the center of its receptive processing stream (see Grossberg, 1994, for a review). Many field and low luminance in its surround. The ON cells that are brightnessdata have beensimulated within this modeling frame- active, then, are those centered outside the bar. An ON cell work (Andreou & Boahen, 1991; Arrington, 1994; Cohen & whosecenter is just outsidethe side of the line will respondmore Grossberg, 1984;Grossberg & Todorovic, 1988;Pessoa et al., strongly than an ON cell centered just outside the end of the 1994). Lacking in these accounts, however, was a mechanism line (Fig. 2C). for generatingthe distribution of brightnessinputs that could Given that LGN ON and OFF cells, by themselves,cannot underlie the perceptualphenomena associated with brightness explain brightness buttons, it still remains to explain how a buttons. In particular, previousversions of the BCS/FCS model brighter Ehrensteindisk could be generatedwere brightness but- incorrectlypredicted that the Ehrensteindisk should look darker tons to obtain. Clues were provided by Kennedy (1979), who than its surround. Given that so many brightness data had analyzed a number of illusory contour stimuli. He argued that beencorrectly predicted by the model, including data collected the effect of brightness buttons could often go unnoticed for after its publication (Arrington, 1994;Paradiso & Nakayama, isolatedline segments,but could somehowbe pooledand ampli- 1991;Watanabe & Sato, 1989;Watanabe & Takeichi, 1990),the fied in perceptualsalience when severalbrightness buttons oc- question arose of how the model's description was incomplete curred in proximity or within a figurally complete region (see or incorrect. Such an accountis developedin the presentwork, Fig. 3). Grossbergand Mingolla (19850)presented an analysis which shows howthe addition of the corticogeniculatefeedback and interpretation of Kennedy's remarks through their devel- loop helps to explajn brightnessbuttons without disturbing the opment of a neural model of visual boundary and surface rep- model's previous explanations of other brightnessphenomena. resentation. In their model, the crucial mechanisticsupport for The gist of the presentmodel can be summarizedas follows. perceptually noticeable brightness buttons is a boundary seg- Brightness buttons are by definition an effect of an oriented mentation that separatesthe regioncontaining the buttons from structure (such as a line, or more generally a corner or sharp other regions of a scene. Such a boundary segmentationmay bend in a contour) on perceived featural quality (brightness). be generated by image edges, textures, or shading, and may Within the prior versionsof the BCS and FCS model equations, give rise ~oillusory contours. Boundary segmentationwithin the the computations of the FCS were unoriented, in the sensethat model is'accomplishedby the filtering, competitive, and coop- they were mediatedeither by cells with circularly symmetric ker- erative interactions of a grouping network called the Bound- nels governing their processingof inputs from a prior stage,or ary Contour System, or BCS. by an isotropic diffusion. How then could the effects of ori- The boundaries within the BCS do not carry a perceptually ented filtering be usedto modulate the inputs to the FCS that visible signal. As explained in greater detail below, BCS out- produce brightness buttons? Indeed, oriented filtering alone puts are rendered insensitive to contrast polarity by pooling could not suffice. Interactions must exist among the oriented boundary signals that are sensitiveto opposite contrast polari- filters to determinethe location of the endsof the lines, at which ties. Visible brightnessand color perceptsare assumedto emerge the brightnessbuttons occur. A natural candidate for the lat- ter interactions is the endstopping processthat converts corti- cal complex cells into endstoppedcomplex, or hypercomplex, cells (Hubel & Wiesel, 1977).Where should the results of this endstoppedprocessing have their effect on inputs to the FCS? Having come this far, it is plausible to propose that the cor- tex influences LGN cells via top-down feedback,which it is well ~ known to do (Guillery, 1967).It is not plausible, however, that ( ) this massive feedbackpathway exists just to make Ehrenstein disks appearbright. Grossberg(1976, 1980)suggested that cor- ticogeniculate feedbackexists for a potentially important func- tional reason; namely,to enhancethe activity of LGN cells that ~~ support the activity of presentlyactive cortical cells,and to sup- pressthe activity of LGN cells that do not. In addition, bottom- ~ up retinal input, by itself, was hypothesizedto supraliminally Fig. 3. Brightness buttons lie outside thin lines. When the induced activate LGN cells, but top-down corticogeniculate feedback, boundary contour (denoted by the dashed line) lies on one side of the by itself, was not. buttons, their perceptualeffect is enhanced. If the induced boundary Theserules realizea type of matching that has beenproposed instead divided the brightnessbuttons, they would have lessof a per- in Adaptive ResonanceTheory, or ART; see Carpenter and ceptual effect. Grossberg (1991,1993) for reviews. In this theory, matched

0 1030 A. Gove,S. Grossberg,and E. Mingo/la bottom-up and top-down thalamocortical signal exchanges coherentlybind and synchronizethe activities of cellswhose fea- 000000 0 00@@00 0 tures code the same object part or other unitized event. Once A coherence or resonanceis achieved, learning of new tuning 0 @00000 0 curvesor associationsis triggered.The reciprocalmatching inter- actions betweenLON and cortexwere therebyproposed to con- OOOOOG trol and stabilize adaptive synaptic changesin responseto the flood of visual experience.As noted below, Sillito et al. (1994) have reported neurophysiologicalLON data that are consistent with these model predictions. The following sectionsdescribe +1 + +1 + how this feedbackpathway can also subservethe formation of B +1 + +1 + brightnessbuttons, as an epiphenomenonof its posited primary functional role. Said another way, this analysispredicts that a weakening of top-down feedbackcould generatedark Ehren- steindisks while removing oriented influenceson LON cellsand destabilizing the adaptive tuning of binocular cortical cells, assuming that the rest of the cortex is still functional. Brightness button signals can, in fact, be generatedin two ways that are consistentwith reportedphysiology: (I) Excitatory feedback from cortical endstopped cells can enhance LON cell activity near line ends. (2) Net inhibitory feedbackfrom long- field cells,modulated by LON interneurons,can suppress activ- ity in LON cells coding the sides of lines, making brightness contrast at line ends relatively stronger. A combination of the two mechanismswould have the same properties. Data avail- able at present favor the first hypothesis,and that is the one investigated in the presentwork.

Model LGN circuit The model LGN ON and OFF cells receive input from retinal ON and OFF cells. (See Schiller, 1992, for a review.) Because these ON and OFF cells have antagonistic surroundsand obey shunting,or membrane,equations (see the Appendix), they help to discount the illuminant, normalize image activities, and ex- trac~ ratio contrasts from an image (Grossberg,1983). These image preprocessingproperties are neededto simulate eventhe most basic brightnesspercepts (Grossberg & Todorovic, 1988). As a result of thesemechanisms, ON and OFF cells processline ends in the manner summarized in Fig. 2. Other properties of the LGN, such as the existenceof M and P channelsand of lagged cells, are not neededto explain the targeted data, and so are omitted for simplicity. The LGN model also receivesfeedback from model corti- cal cells, and this feedbackcan causethe resultant LGN activ- ity to differ under certaincircumstances from that causedsolely by its retinal input. For instance,the feedbacksignals increase both ON and OFF activity near line ends and other areas of sharp boundary discontinuity (Fig. 4). This increasein activity of ON and OFF relay cells is effectively an increase in ON- OFF contrast,which is manifestedafter filling-in within the FCS as an increasein brightnesscontrast. This is the model analog of "brightnessbuttons" (Kennedy, 1979)and of "line end con- trast" (Day & Jory, 1978).The increased LGN activity at the line end can also better activate cortical simple cells located at A B the line end, resulting in stronger boundary formation perpen- Fig. 5. LGN model diagram. (A) In Version I, feedback signalsorigi- dicular to the line end, as in the circular illusory contour of nate in cortical endstoppedcells and enter a center-surroundcompeti- Fig. lA, than would otherwise be the case. tion within the LGN. (8) In Version II, the feedback from cortical In the model, cortical feedbackto LGN cells derivesfrom a endstoppedcells directly excites LGN relay cells (solid line), and also population of endstoppedcells; namely,from the outputs of the activates LGN interneurons, which inhibit nearby relay cells (dashed first hypercomplex(endstopped) cell stageof the BCS (Fig. SA). lines). VersionsI and II are functionally equivalent. VersionsII was used These cells excite relay cells in the LGN whose receptive-field for simulations. Brightness, contours, and corticogenicu!atefeedback 1031 centersare topographicallyaligned with their correspondingcor- and visual cortex. However, the LGN also has an intri- tical cells. Nearby relaycells are inhibited by the activity of LON cate local circuitry that is much more complex than a mere relay interneurons. An alternative model implementation is slightly stationwould require. Thereare two basic cell types in the LGN: more complicated and reflects more closelythe corticogenicu- relay cells and interneurons. For present purposes, extrinsic late feedbackseen in the cat (Fig. 6). Here, model feedbackis interneurons located in the perigeniculate nucleus can also be sentto both relaycells and LON interneurons(Dubin & Cleland, considered LGN interneurons. Retinal ganglion cells project 1977;Weber et al., 1989).Both feedbackstreams are excitatory; directly to both types of cells (Dubin & Cleland, 1977).While however, the interneurons, which becomemore active due to relay cells in turn project to visual cortex, the axons of inter- the feedback, inhibit nearby relay cells (Fig. sB). The circuit neurons remain in the LGN. Unlike relay cells, interneurons in Fig. 58 achievesthe samefunctional resultas that in Fig. SA, stain positively for GABA (Montero & Zempel, 1985)and are but also obeysDale's principle that all synaptictargets of a given therefore believedto have an inhibitory effect on the relay cells cell be either excitatory or inhibitory, but not both. they contact (Sillito & Kemp, 1983). Since endstoppedcells respond best to line ends and short Both LGN cell types receive feedback from layer 6 pyrami- line segments,the activity of LON relay celIs near line ends is dal cells in striate cortex (Guillery, 1967). This corticogenicu- increased by feedbackto theseareas. The feedbackalso acti- late feedback is massive,with more fibers going from cortex vates topographicalIy correspondinginterneurons in the LON, to LGN than viceversa. Of all of the synapsesin the LGN, about which inhibit relaycelIs in a local neighborhood.In all, the feed- 50% originate in cortex, compared to only 20% that originate back instantiatesa center-surroundcompetition. The inhibitory in the retina (Robson, 1983).The feedback, which comes from surround can depress LON signals in areasaway from the line cells that are binocularand orientation selective(Gilbert & Kelly, ends, suchas along the line sides(Fig. 48). Since ON and OFF 1975),is also topographic, with a strict correspondencebetween channelsare segregatedin the LON (SchilIer, 1992),the feed- the locations of the visual fields of the bottom-up and top-down back is applied to the ON and OFF layers separately. signalsconverging on a LGN cell (Updyke, 1975). If the LGN werejust a relay station, there would be no need for the preci- sion or amount of feedback that is seen. Neurophysiological LGN data From neurochemicalevidence the feedbackis believedto be The LGN model is supported by a variety of anatomical and excitatory (Montero, 1990),but since it directly activates both physiological data from studies of the cat and monkey. The relay cells (Dubin & Cleland, 1977)and interneurons (Weber LGN is often thought of as a visual relay station betweenthe et al., 1989),the overall effect of feedbackon the LGN is hard to predict from the known anatomy (seeFig. 6). Neurophysi- ologists havealso had difficulty in assessingthe function of the corticogeniculatefeedback. Researcherstrying to measurethe effect'of feedbackon LGN transmission of retinal signalshave met with inconsistentresults. Some studies have found excit- VI atory effects(e.g. Kalil & Chase,1970), while others have found inhibitory effects (e.g. Hull, 1968),and still others found mixed excitatoryand inhibitory effects(Marrocco & McClurkin, 1985). The modulation of cat LGN by cortical feedbackchanges with arousallevel and brain-stemactivity (Funke & Eysel, 1992). The feedback can serveas a gain control mechanism for the entire thalamus, boosting the gain of signals from one modal- I-N: Interneuron ity while suppressingsignals from other modalities. This can- not be the only purpose, however, as it would not justify such a large and complex feedbacksystem; a global arousal signal would suffice. , Corticogeniculate feedbackis also involved in cortical bin- I ~-profile ocular processing(Grossberg, 1976, 1980; Singer, 1977;Varela & Singer, 1987).As noted above, Grossberg (1976,1980) sug- gested that the feedback pathway realizes a top-down pattern matching processthat helps to selectively amplify activities of monocular LGN cells that support the activities of binocular cortical cells, and to suppressthe activities of LGN cells that do not, via positive corticogeniculate feedback linked to inter- nal LGN opponent processes.Topographic correspondenceis necessaryto carry out sucha matching process.A similar mod- ulatory role for top-down feedbackis assumedto be active dur- RGC: Retinal Ganglion Cell ing monocular viewing. This role for corticogeniculate feedbackwas hypothesized Fig. 6. Schematicdiagram of the VI-LGN local circuit. All VI-LGN to be part of a more generaland ubiquitous model of top-down pathwaysare excitatory, but somesynapse directly on dendritesof relay feedbackin stabilizingadaptive synapses in thalamocorticaland cells, while others synapseon inhibitory interneurons, at a site distinct corticocortical circuits, while also regulating the gain of these from the "F-profile," which receivesinput from retinal ganglion cells. circuits. In this more generalAdaptive ResonanceTheory, or Adapted with permission from Weber et al. (1989). ART, modeling framework, bottom-up processing in the ab- 1032 A. Gove,S. Grossberg,and E. Mingolla senceof top-down processingcan activate its target circuits, top- down processingrepresents a form of hypothesistesting that can subliminally prime these circuits, and a combination of bottom-up and top-down processingcan selectthose bottom- up activations that are consistent with top-down feedbackand A suppressthose that are not. This more generalrole for top-down processingis consistentwith the fact that corticogeniculatefeed- back is presentin all parts of the visual field, not just the por- tion with binocular overlap, and the feedbackis also present in specieswith little or no binocular overlap (Koch, 1987). In striking support of this ART prediction,Sillito et al. (1994) reported that "cortically induced correlation of relay cell activ- ity produces coherent firing in those groups of relay cells with B receptive-field alignments appropriate to signal the particular orientation of the moving contour to the cortex. ..this increases the gain of the input for feature-linked eventsdetected by the cortex. ..the cortico-thalamic input is only strong enoughto exert an effect on those dLGN cells that are additionally polar- ized by their retinal input. ..the feedbackcircuit searchesfor correlations that support the 'hypothesis' representedby a par- ticular pattern of cortical activity" (pp. 479-482). Becausecorticogeniculate feedbackhas both excitatory and inhibitory (via interneurons)components, the cortex can selec- tively suppressor enhanceparticular featuresof an image.One c feature known to be affected is stimulus length. Murphy and Sillito (1987)showed that cortical feedbackcauses significant length-tuning in cat LGN cells. As in cortical endstopping,the responseto a line grows rapidly as a function of line lengthand then abruptly declinesfor longerlines. The responseto long lines is herebydepressed. Redies et al. (1986)found that cat dorsal LGN cells and strongly endstopped cortical complex cells re- Fig. 7. Some examplesof the effects of line end shape on boundary sponded best at line ends, both for single lines and for a set of completion strength and brightness. (A) Very thin lines do not induce parallel lines shifted to form a perpendicular illusory contour, strongcompletions. (8) Completionsare bestwhen they are alignedwith as in Fig. 7A. In other words, the responseof the LGN cells the contour of the line end. (C) Thin line ends can produce a "glow" to line endswas enhancedrelative to the responseto line sides. that is predicted by the LGN model. Patterned after Kennedy(1988).. Computer simulationsdescribed below showthat the modelcor- ticogeniculate feedbackis also length-tuned and enhancesline ends, as seenin the data of Redies et al. (1986)and Murphy and Sillito (1987). and figure-ground separation (Gillam & Goodenough, 1994; Grossberg,1994). A thin line is predicted to produce brightness buttons, even LGN stage predictions if it does not produce strong endcuts,since the LGN relay cells By enhancing LON responsesat line ends, model corticogenic- nearthe line end will be excited by feedback,as in Fig. 7C. How- ulate feedbackalso increasesthe input to model cortical sim- ever, becausethe formation of illusory contours to contain the ple cells whosepreferred orientations are perpendicularto the enhancedbrightness signals is suboptimal for thin line ends,any line end. These perpendicular activations, called endcuts, help noticeable brightness difference may be diffuse rather than to initiate the formation of illusory contours in an orientation sharp. Kennedy (1988)has studied thesevarious effects of line that is perpendicularto the line ends, as in Fig. IA. Computer ends on illusory contours and brightness,and his resultsare con- simulations have shown that the width and orientation of the sistent with the analysis outlined above. line end are important parametersfor determining illusory con- tour and brightness strength. If a line end is too thin, then A unified explanation of illusory contour enhanced LON contrast at the line end is not sufficient to acti- and brightness properties vate a cortical simple cell oriented perpendicularto the line, and thus boundariesinduced by very thin lines should be weakened, Fig. 8 summarizesthe macrocircuit of the LGN-cortical model as illustrated in Fig. 7A. that is simulated herein. The model includes ON and OFF reti- The orientation of a line end need not be perpendicularto nal and LGN cells; cortical simple, complex, hypercomplex, the line, and the modelpredicts that, other things equal, bound- higher-orderhypercomplex, and bipole cells of the cortical inter- ary completion should occur preferentially along the contour blob processingstream; and ON and OFF opponentand double- of the line end, as in Fig. 7B, becausethat is the orientation opponent filling-in cells of the cortical blob processingstream. of the maximally activated simple cell. The generalpreference Using this model system,a set of simulations was carried out for perpendicularcompletion is a local effect that may be over- with a fixed setof parametersto illustrate how the model emu- ridden by global cues for three-dimensionalsurface formation lates a wide range of illusory contour and brightness percepts. Brightness, contours, and corticogeniculatefeedback 1033

.1g. 8. (A) Model macrocircuit. BCS stagesare designatedby octago- nal boxes, FCS stagesby rectangularboxes. (B) Model stagesschema- tized by cell icons and intercellular circuits. While most of the terms used are self-explanatory or explained in the text, "spatial impenetra- bility" refers to the need to prevent the cooperative bipole cells from forming spuriousor inappropriategroupings wherelocal evidenceover- ruleslong-range coalignments, and "reset" refersto the temporal aspects of formation, persistence,and dissolution of perceptualsegmentations. Note that the "four leaf clover" icon in the diagram is not drawn to scale,but representscompetition among long-rangebipoles, such as the one partially depicted at the top of the CC Loop. See text for details.

8 :, Cooperative,Competitive :

I L I I cop I .I .I , , ,I I I ,I I I I I I , , I I , """ "11+" .. I I I I I dO .~~ Static: ~ ---+

Oriented : Contrast :+-/.\+. ~--NEW! Filter .~". . (sac Filter) ~ .\J: : + + -,- -

monocular preprocessing 1034 A. Gove,S. Grossberg,and E. Mingo//a

Figs. 9-15 summarize simulations of the Ehrenstein disk, the by the model retinal and LON ON and OFF cells as shown in reverse-contrast Ehrenstein disk, the Kanizsa square, the mixed- Fig. 17. Note that the representationsof LON activity in panel contrast Kanizsa square, the Glass pattern, the mixed-contrast (B) of Figs. 9-15 indicate both ON and OFF activity, as in Glass pattern, and the cafe wall illusion, respectively. In each Fig. 17, but with a middle gray placed in locations having no figure, panel (A) represents the input image, panel (B) the LGN circle in Fig. 17. ' activation pattern, panel (C) the boundary segmentation, and The LON cell outputs activate the first stageof cortical BCS panel (0) the filled-in surface representation. To comment fur- processing,the simple cells (seeFig. 8) whose oriented recep- ther about the simulations, the model stages depicted in Fig. 8 tive fields respondto a prescribedcontrast polarity, or direction- and their functional role will be described qualitatively, includ- of-contrast. The model LON cells input to pairs of like-oriented ing a description of how the present version of the model refines simple cells that are sensitiveto opposite directions-of-contrast. previous versions. Then some key properties of the simulated The simple cell pairs, in turn, send their rectified output sig- percepts will be further discussed. The model is described math- nals to like~oriented complex cells. By pooling outputs from ematically in the Appendix, along with details of how the com- oppositely polarized simple cells, complex cells are rendered puter simulations were carried out. insensitiveto direction-of-contrast, as are all subsequentBCS cell types in the model. Complex cells activate hypercomplex cells through an on- Model overview centeroff-surround network, or spatial competition, whose off- We illustrate model dynamics by tracing how different model- surround carries out an endstopping operation (seeFig. 18B). ing stagesrespond to two black horizontal bars on a light back- In this way, complexcells excite hypercomplexcells of the same ground deliveredto the modelretina. In Fig. 16,the smallcircles orientation and position, while inhibiting hypercomplex cells representsmall luminances,the large circles large luminances, of the same orientation at nearby positions. One role of this at the corresponding image pixels. This image is transformed spatial competition is to spatially sharpenthe neural responses

A "'

C Fig. 9. (A) The Ehrensteinfigure. (B) The LON stageresponse. Both ON and OFF activities are coded as rectified deflections from a neutral gray. Note the brightnessbuttons at the line ends. (C) The equilibrium BCS boundaries. (0) In the filled-in result, the central circle contains strongerFCS signalsthan the background, correspondingto the perceptionof increasedbright- ness. Note that in this and subsequentfigures displaying BCS output (including Figs. 9-14), the representationof boundaries at multiple orientations are superimposed.Photographic reduction prohibits inspectionof responsesof individual orientations, as is apparent in Fig. 18. Brightness, contours, and corticogeniculatefeedback 1035

A

D

Fig. 10. Inverse Ehrensteinfigure. (A) The input image has the luminance values of the original Ehrenstein figure reversed. (8) The LGN stageresponse. Note that the "brightness" buttons at the line ends are darker than the background; they are "darkness buttons." (C) The equilibrium BCS boundariesare the sameas for the standard Ehrensteinfigure. (D) In the filled- in result'the central circle contains weaker FCS signalsthan the background. Thus the model correctly predicts that the circle will appear darker than the background.

to oriented luminanceedges. Another role is to initiate the pro- determine which orientation is receivingthe largest amount of cess,called end cutting, whereby boundaries are formed that cooperative support (see Figs. 8 and 18E). The next stage of abut a line end at orientation perpendicular or oblique to the competitiontakes place across nearby locations to selectthe best orientation of the line itself, as in Fig. 9C; ..,;;;i, spatial location of the emergingboundary (seeFig. 18F). These The hypercomplexcells input to a competition acrossorien- competitive interactions are neededto selectand sharpen the tations at eachposition among hypercomplexcells (see Fig. 18C). best boundary grouping becausethe bipole cell receptivefields This competition acts to sharpenup orientational responsesat are themselvesrather broad. Broad bipole receptive fields are eachposition. Output from the higher-orderhypercomplex cells neededbecause, in many situations, neitherthe imagecontrasts feed into bipole cells that initiate long-range boundary group- to be grouped nor the cortical cells that group them are pre- ing and completion (seeFig. 180). Bipole cells have two ori- cisely aligned across space. Broad receptive fields allow the ented receptivefields. Their cell bodies fire only if both of their grouping to get started and the competitive interactions shar- receptive fields are sufficiently activated by appropriately ori- pen and deform it. Hypercomplex cells that receivethe most ented hypercomplex cell inputs. Bipole cells act like a type of cooperative support from bipole grouping after cooperative- statisticaland-gate that controls long-rangecooperation among competitive feedbackacts further to excite the corresponding the outputs of activehigher-order hypercomplex cells. For exam- bipole cells. ple, a horizontal bipole cell is excited by activation of horizon- This cycle of bottom-up and top-down interaction between tal hypercomplex cells that input to its horizontally oriented hypercomplexcells and bipole cells rapidly convergesto a final receptivefields. A horizontal bipole cell is also inhibited by acti- boundary segmentation(see Fig. 180). Feedbackamong bipole vation of vertical hypercomplexcells. cells and hypercomplexcells herebydrives a resonantcoopera- Output signals from bipole cells feed back to the hypercom- tive-competitive decisionprocess that completesthe statistically plex cells after undergoing two stagesof competitive process- most favored boundaries,suppresses less favored boundaries, ing. First, bipole cell outputs compete across orientation to and coherentlybinds togetherappropriate featurecombinations 1036 A. Gove,S. Grossberg,and E. Mingolla

A

Fig. ...(A) The Kanizsasquare. (B) The LON stageresponse. (C) The equilibrium BCSboundaries. (0) In the filled-in result the square contains stronger FCS signals than the background, corresponding to the perception of increasedbrightness.

in the image. The equilibrium boundary segmentationsshown Ehrenstein figure simulations in panel (C) of Figs. 9-15 are all recorded at the higher-order hypercomplex cells. The first simulation (Fig. 9) shows the completions that occur Each BCS boundary segmentationgenerates topographic betweenline segmentsarranged around a circle in a radial con- output signals to the ON and OFF Filling-In DOmains, or figuration (Ehrenstein, 1941). The input image is shown in FIOOs (seeFig. 8). TheseFIOOs also receiveinputs from the Fig. 9A.Fig. 9B shows brightness buttons in the model LGN ON and OFF LGN cells,respectively. The LGN inputs activate cell activations.The complexcell responses(see Fig. 8)are stron- their target cells, which allow activationto diffuse rapidly across gest at the ends of the line segments,reflecting the effects of gap j4nctions to neighboringFIOO cells. This diffusive filling- the LGN stage.These line end responsesare strong enough to in processis restricted to the compartments derived from the induce completions perpendicular to the lines, thereby form- BCS boundaries,which createbarriers to filling-in by decreas- ing the circular illusory contour (Fig. 9C). Due to the bright- ing the permeability of their target gap junctions. The filled-in nessbuttons in Fig. 9B, the filled-in surface representation of OFF activities are subtracted from the ON activities at double- the centraldisc in Fig. 9D has strongeractivation than the back- opponent cells,whose activities representthe surfacebrightness ground, which is consistentwith the perceptgenerated by this of each percept(see Fig. 8). This double-opponent representa- figure in humans (seeFig. IA). tion is shown in panel (D) of Figs. 9-15 and Fig. 19. The resultscomputed by the individual ON and OFF filling- The model in Fig. 8 is simplified relative to known cortical in domains in Fig. 8 are shownin Fig. 20. Some"leakage" across architecture and to known models thereof. It is a single-scale, boundariesoccurs in the individual ON and OFF filling-in do- monocular model. For generalizations to. multiple-scale and mains, but its effects are effectivelycancelled by subtracting the binocular model interactions, seeGrossberg (1994), Grossberg combined output, as shownin Fig. 9D. This combination of ON et al. (1994b), and Pessoa et al. (1994). The simulations in (on-centeroff-surround) and OFF (off-center on-surround) cell Figs. 9-14 are only shown at equilibrium. For simulations of processingfollowed by opponent subtraction generatesa type temporal network dynamics,see Arrington (1994),Francis and of double-opponentreceptive field. Grossbergand Wyse (1991) Grossberg (1994, 1995)and Francis et al. (1994). analyzed how double-opponentinteractions can cancelleakage Brightness, contours, and corticogeniculatefeedback 1037

A B

c D

I-lg. 12. (A) The "!ixed contrast Kanizsasquare. (B) The LGN stageresponse. (C) The equilibrium BCS boundaries. (0) The filled-in square contains FCS signals similar to those in the background. I """C ~, due to filling-in acrossweak or incomplete boundaries. Some In the mixed-contrast Kanizsa square (Fig. 12A), pairs of examplesof brightnessassimilation may be understoodas effects pac man figureshave opposite contrast with respectto the back- of suchspreading that, due to figural asymmetries,are not can- ground. Under suitable viewing conditions, most subjects rec- celled by subsequentON/OFF interactions. , , ognize the ""ompletedoutline of a square in the center of this The Ehrensteinillusion is also simulated with white lines on figure, as in Fig.12C, although anyvisible brightnessenhance- a dark background. In sucha reverse-contrastEhrenstein fig- ment on one side of the square boundary is greatly reduced,as ure (Fig. lOA), a circular boundary is generatedas in the stan- in Fig. 120. This illusion is important for at leasttwo reasons. dard Ehrensteinfigure (Fig. 10C), but the interior of the circle First, it illustrates in a particularly vivid setting that long-range appears darker than the background (Fig. 100). This simula- boundary completion (and thus illusory contours) can occur tion shows that the mechanismsthat generatedbrightness but- betw.eenelements of opposite contrast. Second,the figure pro- tons can also generate"darkness buttons" under appropriate ducesan illusory contour but not a strong brightnesseffect. In conditions (Fig. lOB). The enhanceddarkness spreads in the Fig. 120, the square is filled-in with approximately the same FCS during filling-in to produce a dark circle. brightnesslevel as that in the background. Human perceptsare consistent with this simulation. Kanizsa square simulations .. A mixed-contrast Kanizsa square can generate an illusory squarethat canbe recognized(Fig. 12C)without necessarilygen- Fig. II shows the simulation of a Kanizsa square. The LON erating a brightnessdifference within that square that can be stage generates"brightness corners" at the interior corner of seen(Fig. 120). This fact has historically causeda great deal eachpac man figure that are enhancedrelative to the complex of controversy as the distinction betweenseeing and thinking, cell output computed in absenceof the LON stage(not shown). or the related distinction betweenmodal and amodal percep- The boundaries of the square are completed by cooperative- tion (Coren & Harland, 1993; Epstein, 1993; Gregory, 1993; competitive feedbackamong the hypercomplexand bipole cells Kanizsa,1979; Kellman & Shipley, 1991;Michotte et al., 1964). in responseto the pac man boundaries(Fig. IIC). The enhanced Within the presenttheory, this property follows from the fact brightnessof the square(Fig. II D) is causedby filling-in of the that boundaries within the BCS carry no perceptualsign -"all brightness corners in Fig. liB within the square boundary. boundariesare invisible" -because the outputs of the BCS pool 1038 A. Gove,S. Grossberg,and E. Mingo//a

A A

B

c

Fig. 13. Glasspattern. (A) The input image is a Stevens(1978) style Fig. 14. Reversecontrast Glass pattern. (A) The input imageisa reverse Glasspattern. Note the circular organizationcharacteristic of Glasspat- contrast Stevens-styleGlass pattern. The circular organization is much terns. (B) The complex cell stage output shown here ingray-scale for- lessapparent in this case.(B) The OC Filter output shown here in gray- mat capturessome of the circular impression. (C) The strong circular scaleformat does not givea circular impression at all, nor does the CC groupings become much more apparent after processing by the CC Loop output (C), which has completions that are predominatelyradial. Loop, as seenin the equilibrium output of Competition 2.

BCS signalsto the ORS are interpreted to come from extrastri- opposite contrast polarities and are, in this sense,insensitive to ate cortical area V4 (Desimone et al.. 1985;Zeki, 1983a,b)and contrast polarity, or direction-of-contrast. The theory predicts the ORS is interpreted to include inferotemporal cortex (Mish- that boundaries are seenonly if a filled-in brightness or color kin, 1982;Mishkin & Appenzeller. 1987;Schwartz et al., 1983), difference is generatedon either side of the boundary positions among other areas. These proposed BCS ...ORS interactions within the surface representationsof the FCS. are described in greater detail in Grossberg(1994) and Gross- Boundaries may nonethelessbe recognizedby direct output berg et al. (1994a). signals from the BCS to an Object Recognition System (ORS) (Grossberg,19870, 1994; Grossberg & Mingolla, 1985b). Thus, Glass pattern simulations one can "know" or "think about" a BCS input to the ORS even if the sameBCS input to the FCS does not causea difference The model's ability to generate boundary segmentations in in filled-in FCS activities that one can "see,"as in Fig. lB. The responseto statistically derived imagesis illustrated in Figs. 13 Brightness, contours, and corticogeniculatefeedback 1039

I L-

J B ~I n...

I i ! , i I ., I , j L~".~J --".--' ~-. .--~- '1T'~~f ,.-. ~. i I!,: i Fig. 16. The exampleinput figure (above) is a 56 x 60 pixel image con- C' r sisting of two low luminance bars on a high luminance background. , ' , . The discreterepresentation (below) of the figure showsthe magnitude Fig. 15. (A) A small segmentof the cafe wall image. (B) Complex cell of the simulated luminance at each pixel, as indicated by the size of responsesto the segmentare straight. (C) The CC Loop adds bound- each circle. aries between the bricks, skewing them in the samedirections as the perceptual effect.

can preferentially respond to the pairs of dots in Fig. 13A, " becausethe dots have the same contrast relative to the back- and 14 with the Glass pattern and mixed-contrastGlass pattern ground. These colinear correlations are passedonto the com- simulations. A Glass pattern may be constructedby superimpos- plex cells (Fig. 13B). The complexcell responsesare then passed ing a slightly rotated copy of a random field of dots onto the through the hypercomplexcells before being linked together by original. For a large range of viewing distances,this gives the bipole-hypercomplexcooperative-competitive feedback, the out- impressionof a circular structure (Glass, 1969).The impression put of which is shown in Fig. 13C. Thus, although bipole cell can be strengthenedby placing the original dots on a randomly receptivefields can pool imagecontrasts with opposite direction- perturbed grid and setting a constant rotation distance,so that of-contrast, their segmentations can become sensitive to for everypair of correspondingdots, the distancebetween them direction-of-contrast through the prior action of simple cells. is fixed (Stevens,1978). This construction method was usedto The strong circular componentof the boundary completions generate the Glass pattern input figure shown in Fig. 13A. is both long-rangeand sharp. The oriented filtering is responsi- The model suggeststhat this perceptis due to the combina- ble for some of the circular appearanceof the BCS boundaries, tion of short-rangecorrelations detectedby the simple cells and but the boundary completion definitely adds to the impression. long-rangecorrelations detectedby the bipole cells. Recall that To quantify this fact, the orientations of all of the nodes in the simple cells are sensitiveto contrast polarity. As a result, they complex cell output were compared to the angle of the tangent

f~-11\ 1040 A. Gove,S. Grossberg,and E. Mingo//a

What makes the mixed-contrast Glass pattern percept dif- ferent from the Glass pattern percept?The key model differ- enceconcerns the responseof simplecells. Because model simple cells are sensitive to a definite direction-of-contrast, they are not optimally activated in a direction parallel to the dot-pair orientations of Fig. 14A. The bipole groupingsare thus also dif- ferent. This basic property of the modelhas beenmisunderstood by some investigators (e.g. Elder & Zucker, 1993),who have mistakenly inferred, because output cells of the BCS pool both directions-of-contrast, that BCS boundary segmentations are insensitive to image direction-of-contrast. Comparison of Figs. 13Cand 14C shows that this is not correct. Another difference betweenthe simulations of like-contrast and mixed-contrastpercepts is worth emphasizing. In Figs. 13 and 14,the switch to mixed imagecontrasts changes the bound- A ary segmentation. In Figs. II and 12, it does not. Why is this? As noted in Grossbergand Mingolla (1985b),the difference lies in the simple cell response.In the caseof the Glass pattern, the switch to mixed contrasts changesthe orientations of the max- imally activatedsimple cells from being colinearto the dot pairs ..00 0 00...000..0000.000000.0.00.00 0 000... towards being perpendicular to the bisector of the dot pairs. .o g a. . .a .a.. This difference in simple cell responseschanges the long-range ..ao.o oo.oo.oo.oo.oo oooo.oa... bipole-mediatedboundary groupings from a circular to a radial .'.".0.000.00..0..0.0.0000000.0.0...',.,.....000000000000000000000000000000000.00 tendency. In the caseof the Kanizsa square, the switch to mixed con- trasts does not alter the orientations of maximally activated simplecells. Eachpac man elementof a Kanizsasquare presents a consistentcontour to the simple cells, whether its contrast is light-to-dark or dark-to-light. The bipole cells then generatea Kanizsa square in both casesby colinearly completing the pac man boundaries.In summary,the simple cell carries out a short- range grouping and the bipole cell a long-rangegrouping. The global patterning of contrastsin the image mayor may not alter the way in which thesetwo grouping scalesinteract. SeeGross- B berg (1994) for other examples of this theme, particularly in explanations of how occluding and occluded contours, depth, Fig. 17. Activation of the retinal and LGN networks. Unfilled circles and transparencyinteract. representthe ON channeloutput; filled circlescode the OFF response. Note the redistribution of activation in LqN (B) comparedto the reti- nal pattern (A). The strongest signals in both the LGN ON and OFF The cafe wall illusion channelsare near the line end, whereasin the retinal stageoutput the strongest signalsare found along the sides of the line. Additional evidence that these short-range and long-range grouping mechanismsexist and interact as modeled can be seen in the cafe wall illusion. The cafe wall image (Fig. 15A) con- sists of only horizontal and vertical edges,yet is appearsto have at eachnode's location to a circle centeredin the middle of the strong oblique components.The rectangleelements, or "bricks," Glass pattern. If a node's activity is greater than 10070of the appearto be trapezoids.The BCS simulationsuggests that diag- maximum activity and the node's orientation is within "lr/8 radi- onal groupings betweenbricks are responsiblefor this percept. ans of the true tangent, then that node is considered "quasi- In the BCS output shown in Fig. 15C,the once horizontal com- tangent." In the complex cell output (Fig. 13B),27.9070 of the plex cell boundaries of the central brick in Fig. 15B have been nodesare quasi-tangent,as comparedto 50.2070for the full BCS deformed so that the bricks now appearstrapezoidal. A simu- output (Fig. 13C). Thus, by the constructed measure,bound- lation on a smaller scalewas presentedin Grossbergand Min- ary completion contributes significantly to the circular appear- golla (1985b). Morgan and Moulden (1986)have presenteda ance of the simulated Glass pattern. similar explanation of this phenomenon. In the mixed-contrastGlass pattern, dot pairs consistof one Note that, in the cafe wall illusion, the simple and complex dot of positive contrast and a second of negativecontrast rela- cells track the local image contrasts but the bipole cell group- tive to the background (Fig. 14A). The circular organization ings do not. For the mixed-contrast Glass pattern, the simple present in the original Glass pattern is much weaker in the and complex cells do not track the local image contrasts and reverse-contrastversion. This is reflected in the model's com- the bipole cell groupings follow suit. The cafe wall illusion plex cell (Fig. 14B)and hypercomplexcell (Fig. 14C)responses. herebyemphasizes that the long-range bipole grouping mecha- The measuredefined abovequantifies this percept. In th~ com- nism cangenerate nonveridical segmentations,even if its short- plex cell output, 13.5070of the nodes are quasi-tangent. In the range inputs are veridical, in its effort to reconcile all of the full BCS output, even fewer nodes, 4.0070,are quasi-tangent. statistical correlations that it senseson a larger spatial scale. Brightness, contours, and corticogeniculatefeedback 1041 ,.. \1 " II Ii ~~~~ ~~~~ :~==::::=::::::::::::::::::::::::::;!~

I I I I II " I I ' i ll I , I I III'" III ' I ' I I " II: I I , ,,:1 " ,~II ~:::::;---~:::: "Ii ",,

A E

Ii II It

'1- " " II 1:- ;1 B

" oj Ii :1 ,~ II F

c

G

Fig. 18. (A) Rectified outputs of simplecells of opposite contrast polar- ity are pooled to form complexcell responses.(8) Feedforward output of hypercomplexcells. (C) Feedforward output of higher-order hyper- complexcells. (0) Feedforwardbipole cell output. (E) Feedforwardout- put of feedbackorientational competition. (F) Feedforward output of feedbackspatial competition. (0) Equilibrium output of higher-order hypercomplex cells shows the completions due to feedback. These D boundaries are used to contain the filling-in of FCS signals.

~ 1042 A. Gove,S. Grossberg,and E. Mingo//a

How do cortical simple cells respond preferentially to luminance discontinuities? The mathematicalequations that define the !"1lodeldepicted in Fig. 8 are given in the Appendix. This and the next sectionhigh- light two key features of the model to further clarify how its LGN and cortical mechanismswork together to simulate the above data properties. The first feature concernsthe designof cortical simple cells. How do simple cells integrate LGN signals from ON and OFF cells in such a way that true luminance discontinuities are favored, say,above ramps of equal net contrast? Fig. 21 depicts a model circuit in which ON cells turn on one-half of a simple cell receptive field and OFF cells turn on the other half. This is done for pairs of simple cells that are sensitive to opposite direction-of-contrast, or contrast polarity. Then the two cells inhibit one anotherbefore generatinga rectified net output sig- nal. This type of simple cell interaction has beenreported exper- imentally (Ferster, 1988; Liu et al., 1992). The ON and OFF terms work together to ensure that simple cells favor regions betweenadjacent ON and OFF activity, where true luminance discontinuitiesoccur. Although computing visual features such as edgesis not a difficult problem in simple images,in process- ing complex imagesthe combination of ON, OFF, and oppo- nent inhibition plays a critical role in attenuating spuriousimage contrasts(Cruthirds et al., 1992;Grossberg et al., 1994b;Pessoa et al., 1994).

How does cortical cooperation generatecurved illusory boundaries? The next cortical feature worth emphasizingis the shape of the bipole cell receptivefields that carry out cooperative boundary completion. Model bipole cells have a pair of colinear oriented receptivefields that must both be activatedto complete an inter- vening illusory contour. Bipole cells were predicted to exist in Cohen and Grossberg (1984) and Grossberg (1984) shortly before cortical cells with similar propertieswere reported by von Fig. 19. The filled-in result which is the model analogof the visual per- cept. The result more closelyreSembles the visualpercept when presented der Heydt et al. (1984).At around the time of the von der Heydt as a gray-scaleimage (below). Zero-valuednodes correspond to medium et al. report, Grossbergand Mingolla (1985a,b)used bipole cell gray pixels in this image. properties to simulate and explain a variety of data about illu-

A

Fig. 20. The filled-in ON (A) and OFF (8) syncytiabefore beingcombined. Note that in (8), higher activity in the OFF channel is coded by brighter pixels, unlike the representation in Figs. 9-15 of net LGN activity. Brightness, contollrs, and corticogeniculatefeedback 1043 8C§-~~ +~ iOFF1@Y+ ~ -=-~ '" y~+ f + t

Fig. 22. The top and bottom imagesdepict relatablecontours. the mid- 0 0 dle caseis not. See text for details.

Unlike the original bipole cell (e.g. Grossberg & Mingolla, 1985b)in which the optimal orientation for any filter location + 0 was radial (i.e. pointing toward the cell's origin), the optimal orientation for a point (p,q) in the newbipole filter is tangent to the circle C centeredon the y-axis that passesthrough (p, q) and the origin (0,0) of the receptive field (Fig. 23). Note that + the circle is different for different filter locations. As in previous versions of the model, the actual filter values fall off exponen- tially as their orientations deviate from the optimal orientation Fig. 21. Circuitry for LGN ON and OFF cell inputs to cortical simple determined by the tangentline. As seenin Fig. 23, the tangent cells. The table illustrates how ON. OFF, and inhibitory signals work line forms a larger angle than a line between (p,q) and the ori- together. gin would, so that the new bipole receptive field satisfies the relatability condition most of the time. In certain circumstances, such as two parallel edgesthat are almost colinear, a comple- sory contour formation, neon color spreading,and texture seg- tion can occur in the model that mathematically violates the regation. Later reports extendedthis analysisto data setsabout Kellman and Shipley-(1991)formulation of spatial relatability hyperacui~y,shape-from-shading, depth perception, binocular but is within a perceptually acceptable error range. That is, rivalry, ~nd the McCollough effect, among others (Gross!,erg, humansas well as the model at times perform completions that 1987a,b; Grossberg & Mingolla. 1987). This ever broadening technically violate the relatability formula. explanatoryrange has allowed the accumulatingweight of exper- Filter valuesare further modulated by the slope of their tan- imental evidence to refine model receptive fields, including gent (small slopesare preferred to large slopes)and by their dis- bipole cell fields. tance from the origin. A similar bipole filter with optimal A persistentquestion about bipole cells has been: How can orientations determined by parabolic equations was used to cells with such large and elongated receptive fields generate carry out BCS boundary segmentationsof synthetic aperture curved and sharp boundaries. such as those seenin all of the radar images(Cruthirds et al., 1992).The overall shapeof both above simulations?The nonlinearcooperative-competitive feed- filters is similar to the original bipole cell of Grossbergand Min- back between hypercomplex cells and bipole cells controls golla (1985b)and to the "associationfield" ofField et al. (1993). boundarysharpness. Bipole receptive-fieldshape determines the After the bipole cells compute how much evidence for a ability to track curved boundaries. boundary exists in each orientation, cells compete across ori- The presentversion of the model usesa bipole filter whose entation with other cells at the same position (seeFig. 8). This receptive-fieldshape has propertiesconsistent with Kellmanand processselects the best orientation(s)for cells that receivecoop- Shipley's (1991)"spatial relatability" condition. This condition erative feedback. Severalorientations rnay be active at points constrainsthe circumstancesunder which boundary completion receivingcooperative feedback. The competition selectsat each should be allowed to occur. Two boundaries are relatable, or position the best orientation, or orientations (at a corner), in can support a completion betweenthem, when their extensions order to produce completions that are as smooth as possible. intersect in an obtuse or right angle. To seepart of the motiva- This is especiallyimportant whenthe boundarybeing completed tion for this. consider two parallel line segmentsseparated by is not a straight contour, as occurs is the Ehrenstein figure a gap (Fig. 22). Whenthe segmentslie on the sameline, the com- (Figs. lA and 9). pletion is straightforward. When the segmentsare offset. the extensionsdo not intersect. Thus the relatability condition is Concluding remarks violated in this case.Note that the only possible smooth com- pletion has an inflection point. The bottom of Fig. 22also shows The neural model in this article suggestshow reciprocal LGN- a case in which the segmentsare not parallel. Thesesegments VI and striate-extrastriatecircuits may work together to gen- are relatable becausetheir extensionsdo intersect an form an erate emergentboundary segmentationsand filled-in surface obtuse angle. Here the completion has no inflection point. brightness properties that match a challenging set of psycho-

'.~ 1044 A. Gave,S. Grossberg,and E. Mingolla

., , , . ." ., ,.~- ., ' . ". .~."""""---" , .

~ .

.~ , "',~. .-, , . ., , , .

A

---, . ., --- ., ., --. ~ /~ ---, . 'e' c

Fig. 23. (A) An example of a bipole "in-field" using the formula of Grossberg& Mingolla (1985b). The length of each seg- ment is proportional to the "weight" afforded input to a bipole from a hypercomplex cell whose receptive-field center is at the center of the line segmentand whosepreferred orientation is that of the line segment.(8) In the present formulation, the optimal orientation for a bipole filter element at (p,q) is determined by the tangent / at that point to a circle C centered on the y-axis. See text for details. (C) An example of a bipole in-field generatedaccording to the construction of (8). (D) The "association field" of Field et at. (1993)can be describedby bipole interactions. Note that connections from the three Gabor filters on the left to the central one are denoted by solid lines, indicating that those combinations of preferred orientation~ support grouping, whereasconnections from the central element to those on the right, denoted by broken lines, are between units of incompatible orientations, and therefore weak or absent. (Adapted with permission from Field et al., 1993).

physical data. The LGN-VI circuit suggestshow endstopped Projects Agency (AFOSR 90-0083). Stephen Grossberg was partially top-down VI-+ LGN feedbackcan select LGN signals that are supported by the Air Force Office of Scientific Research (AFOSR consistentwith the tuning curvesof their VI targets,and thereby F49620-92-J-0499), the Advanced Research Projects Agency (ONR induce the brightness buttons that lead to perception of en- NOOOI4-92-J-4015), and the Office of Naval Research (ONR NOOOI4- 91-J-4100). Ennio Mingolla was partially funded by the Air Force Office hanced brightness in Ehrensteindisks. This enhancementalso of Scientific Research (AFOSR F49620-92-J-0334) and the Office of helpsto synchronizethe firing of LGN signalsand to strengthen Naval Research (ONR NOOO14-9I-J-4 100 and ONR NOOOI4-94-I-O597). boundary signals at line ends. The authors wish to thank Carol Y. Jefferson and Robin L. Locke for Model cortical processing takes place in two parallel but their valuable assistance in the preparation or the manuscript. interacting streams that simulate aspectsof the parvocellular blob stream (FCS) and the interblob stream(BCS). The BCS References models interactions between simple, complex, hypercomplex, higher-order hypercomplex,and bipole cells that communicate ANDREOU, A.G. & BOAHEN, K.A. (1991). Modeling inner and outer plexiform retinal processing using nonlinear, coupled resistive net- with each other via feedforward and feedbackpathways. Pre- works. In Human Vision, Visual Processing, and Digitai Display vious articles (e.g. Francis et al., 1994; Grossberg,1987 b, 1994) II, SPIE, 1453, pp. 270-281. Bellington, Washington: Society of have reviewed experimentalevidence supporting the existence Photo-Optical Instrumentation Engineers. of eachof theseprocessing stages. Other studieshave reviewed ARRINGTON,K. (1994). The temporal dynamics of brightness filling- in. Vision Research 34,3371-3387. experimental evidence supporting the existenceof FCS pro- CARPENTER,G.A. & GROSSBERG,S. (1991). Pattern Recognition by cesses,particularly experiments about brightness, color, and Self-Organizing Neural Networks. Cambridge, Massachusetts: MIT depth perception and the temporal dynamics of filling-in (Ar- Press. rington, 1994; Cohen & Grossberg, 1984; Grossberg, 1994; CARPENTER,G.A. & GROSSBERG,S. (1993). Normal and amnesic learn- Grossberg & Todorovic, 1988; Paradiso & Nakayama, 1991; ing, recognition, and memory by a neural model of cortico-hippo- campal interactions. Trends in Neurosciences 16, 131-137. Pessoaet al., 1995). In addition, the present model satisfiesa COHEN,M.A. & GROSSBERG,S. (1984). Neural dynamics of brightness test that few biologicallyderived models have heretofore passed: perception: Features, boundaries, diffusion, and resonance. Percep- it works. The modelhas proven itself capableof processingcom- tion and Psychophysics 36, 428-456. plex imagery (Cruthirds et al., 1992; Grossberget al., 1994b; COREN, S. & HARLAND, R.E. (1993). Subjective contours and visual- geometric illusions: Do they share common mechanisms? Italian Waxman et al., 1993),and thus has demonstratedthe type of Journal of Psychology 20, 709-730. computational power that is needed to function in the real CRUTHIRDS,D.R., GOVE, A., GROSSBERG,S., MINGOLLA, E., NOWAK, world. N. & WILLIAMSON,J. (1992). Processing of synthetic aperture radar images by the Boundary Contour System and Feature Contour Sys- tem. Proceedings of the International Joint Conference on Neural Acknowledgments Networks, IV, pp. 414-419. Picataway, New Jersey: IEEE Service Center. Alan Gove's work was done while he was conducting doctoral research DAUGMAN,J.G. (1980). Two-dimensional spectral analysis of cortical at Boston University and wassupported under the Advanced Research receptive field profiles. Vision Research 20, 847-856. Brightness, contours, and corticogeniculatefeedback 1045

DAY, R.H. & JORY, M.K. (1978). Subjective contours, visual acuity, perception: Boundary completion, illusory figures, and neon color and line contrast. In Visual Psychophysics and Physiology, ed. spreading. Psychological Review 92, 173-211. ARMINGTON, J.C., KRAUSKOPF,J.E. & WOOTEN, B., pp. 331-349. GROSSBERG,S. & MINGOLLA, E. (1985b). Neural dynamics of percep- New York: Academic Press. tual grouping: Textures, boundaries, and emergent segmentations. DE YOE,E.A. & VAN ESSEN,D.C. (1988). Concurrent processing streams Perception and Psychophysics 38, 141-171. in monkey visual cortex. Trends in Neuroscience II, 219-226. GROSSBERG,S. & MINGOLLA, E. (1987). Neural dYllamics of surface DESIMONE, R., SCHEIN, S.J., MORAN, J. & UNGERLEIDER,L.G. (1985). perception: Boundary webs, iIIuminants, and shape-from-shading. Contour, color, and shape analysis beyond the striate cortex. Vision Computer Vision, Graphics, and Image Processing 37, 116-165. Research 25, 441-452. - GROSSBERG,S., MINGOLLA,E. & Ross, W.D. (1994a). A neural theory DUBIN, M. W. & CLELAND, B.G. (1977). Organization of visual inputs of attentive visual search: Interactions of boundary, surface, spa- to interneurons of lateral geniculate nucleus of the cat. Journal of tial, and object representations. Psychological Review 101, 470-489. Neurophysiology 40,410-427. GROSSBERG,S., MINGOLLA, E. & WILLIAMSON, 1. (1994b). Synthetic EHRENSTEIN,W. (1941). Uber Abwandlungen der L. Hermannschen Hel- aperture radar processing by a multiple scale neural system for ligkeitserscheinung. Zeitschrijl fiir Psychologie 150, 83-91. boundary and surface representation. Technical Report CAS/CNS- ELDER, J. & ZUCKER, S. (1993). The effect of contour closure on the TR-94-00I, Boston, MA: Bost°.n University. Neural Networks (in rapid discrimination of two-dimensional shapes. Vision Research press). 33,981-991. GROSSBERG,S., MINGOLLA, E. & TODOROVIC,D. (1989). A neural net- EpSTEIN,W. (1993). On seeing Ihat thinking is separate and on thinking work architecture for preattentive vision. IEEE Transactions on Bio- that seeing is the same. Italian Journal of Psychology 20, 731-747. medical Engineering 36, 65-83. FERSTER,D. (1988). Spatially opponent excitation and inhibition in GROSSBERG,S. & TODOROVIC,D. (1988). Neural dynamics of I-D and simple cells of the cat visual cortex. Journal of Neuroscience 8, 2-D brightness perception: A unified model of classical and recent 1172-1180. phenomena. Perception and Psychophysics 43,241-277. FIELD, D.J., HAYES,A. & HESS,R.F. (1993). Contour integration by GROSSBERG,S. & WYSE, L. (1991). A neural network architecture for the human : Evidence for a local "association field." figure-ground separation of connected scenic figures. Neural Net- Vision Research 33, 173-193. works 4, 723-742. FRANCIS,G. & GROSSBERG,S. (1994). Cortical dynamics of form and GUILLERY, R. W. (1967). Patterns of fiber degeneration in the dorsal motion integration: Persistence, apparent motion, and illusory con- lateral geniculate nucleus of the cat following lesions in the visual lours. Technical ReportCAS/CNS- TR-94-011, Boston, MA: Bos- cortex. Jo/lrnal of Comparative Ne/lrology 130, 197-222. ton University. Vision Research (in press). HALPERN, D.F. (1981). The determinants of illusory-contour percep- FRANCIS, G. & GROSSBERG,S. (1995). Cortical dynamics of bound- tion. Perception 10, 199-213. ary segmentation and reset: Persistence, , and residual HUBEL, D.H. & WIESEL,T.N. (1977). Functional architecture of macaque press).~traces (Technical ' Report CAS/CNS- TR-95-002). Perception (in monkey visual cortex. Proceedings of the Royal Society B (London) 198, I-59. FRANCIS, G., GROSSBERG,S. & MINGOLLA, E. (1994). Cortical dynam- HuLL, D.H. (1968). Corticofugal influence in the macaque lateral genic- ics of feature binding and reset: Control of visual persistence. Vision ulate nucleus of the cat. Vision Research 8, 1285-1298. Research 34, 1089-1104. KALIL, R.E. & CHASE,R. (1970). Corticofugal influence on activity of FRISBY, J.P. & CLATWORTHY,J.L. (1975). Illusory contours: Curious lateral geniculate nucleus neurons in the cat. Journal of Neurophys- cases of simultaneous brighlness contrast? Perception 4, 349-357. iology 33, 459-474. FUNKE, K. & EYSEL, U. T. (1992). EEG-dependent modulation of re- KANIZSA,G. (1979). Organization in Vision. New York: Praeger Pub- sponse dynamics of cat dLGN relay cells and the contribution of lishing. . corticogeniculate feedback. Brain Research 573, 217-227. KELLMAN,P.l. & SHIPLEY,T.F. (1991). A theory of interpolation in object GILBERT,C.D. & KELLY, J.P. (1975). The projections of cells in differ- perception. Cognitive Psychology 23, 141-221. ent layers o[ the cat's visual cortex. Journal of Comparative Neu- rology 163, 81-106. . KENNEDY,1.M. (1979). Subjective contours, contrast, and assimilation. In Perception and Pictorial Representation, ed. NODINE, C.F. & GILLAM, B.J. & GOODENOUGH,B. (1994). Subjective contours at line FISHER, D.F., New York: Praeger. terminators -The effect of the relationship between terminator ar- KENNEDY, 1.M. (1988). Line endings and subjective contours. Spatial rangement and line arrangement. Investigative Ophthalmology and Vision 3, 151-158. Visual Science 35,1627. (" KOCH, C. (1987). The action of the corti co fugal pathway on sensory GLASS,L. (1969). Moire effect from random dots. Nature 223, 578-580. thalamic nuclei: A hypothesis. Neuroscience 23, 399-406. GOVE, A.N., GROSSBERG,S. & MINGOLLA, E. (1994). A link between LESHER,G. W. Illusory contours: Toward a neurally based perceptual brightness perception, illusory contours, and binocular corticoge- theory. Psychological Bulletin (in press). ' niculate feedback. Investigative Ophthalmology and Visual Science Llu, Z., GASKA, 1.P., lACOBSON,L.D. & POLLEN,D.A. (1992). Inter- 35,1437. ,neuronal interaction between members of quadrature phase and GREGORY,R.L. (1993). Seeing and thinking. Italian Journal of Psychol- anti-phase pairs in the cat's visual cortex. Vision Research 32, ogy 20,749-769. 1193-1198. GROSSBERG,S. (1976). Adaptive pattern classification and universal MARCEUA, S. (1980). Mathematical description of the responses of sim- recoding, 11: Feedback, expectation, olfaction, and illusions. Bio- ple cortical cells. Journal of the Optical Society of America 70, logical Cybernetics 23, 187-202. 1297-1300. GROSSBERG,S. (1980). How does a brain build a cognitive code? Psy- MARROCCO,R. T. & MCCLURKIN, 1.W. (1985). Evidence for spatial struc- chological Review 87, I-51. ture in cortical input to the monkey lateral geniculate nucleus. Exper- GROSSBERG,S. (1983). The quantized geometry of visual space: The imental Brain Research 59, 50-56. . coherent computation of depth, form, and lightness. Behavioral and MICHOTTE, A., THINES, G. & CRABBE,G. (1964). Les complements Brain Sciences 6, 625-657. amodaux des structures perceptives. Louvain: Publications Univer- GROSSBERG,S. (1984). Outline of a theory of brightness, color, and form sitaires de Louvain. perception. In Trends in Mathematical Psychology, ed. DEGREEF, MISHKIN, M. (1982). A memory system in the monkey. Philosophical E. & VAN BUGGENHAUT,J., pp. 59-86. Amsterdam: Elsevier/North- Transactions Royal Society B (London) 298, 85-95. Holland. MISHKIN, M. & ApPENZELLER,T. (1987). The anatomy of memory. Sci- GROSSBERG,S. (1987a). Cortical dynamics of three-dimensional form, entific American 256, 80-89. color, and brightness perception, I: Monocular theory. Perception MONTERO,Y.M. (1990). Quantitative immunogold analysis reveals high and Psychophysics 41, 87-116. glutamate levels in synaptic terminals of retino-geniculate cortico- GROSSBERG,S. (1987 b). Cortical dynamics of three-dimensional form, geniculate, and geniculo-cortical axons in the cat. Visual Neurosci- color, and brightness perception, II: Binocular theory. Perception ence 4, 437-443. and Psychophysics 41,117-158. . MONTERO,Y.M. & ZEMPEL,1. (1985). Evidence for two types of GABA- GROSSBERG,S. (1994). 3-D vision and figure-ground separation by visual containing interneurons in the A-laminae of the cat lateral genicu- cortex. Perception and Psychophysics 55, 48-120. late nucleus: A double-label HRP and GABA-immunocytochemcal GROSSBERG,S. & MINGOLLA, E. (1985a). Neural dynamics of form study. Experimental Brain Research 60, 603-609. 1046 A. Gove, S. Grossberg,and E. Mingoffa

MORGAN, M.J. & MOULDEN, B. (1986). The Miinsterberg figure and the text. Additional modifications from equationsemployed for twisted cords. Vision Research 26, 1793-1800. previously published simulations are also included, including MURPHY, P.C. & SILLlTO, A.M. (1987). Corticofugal influences on ON and OFF LGN cells, and improved cortical bipole cell re- the generation of length tuning in the visual pathway. Nature 329, 727-729. ceptive fields and hypercomplex-bipole feedback interactions PARADISO,M. & NAKAYAMA,K. (1991). Brightness perception and filling- (Cruthirds et al., 1992; Grossberg & Wyse, 1991). The struc- in. Vision Research 31, 1221-1236. ture of the improved model, as shown in Fig. 8, includes an PESSOA,L., MINGOLLA, E. & NEUMANN, H. (1995). A contrast- and OFF channel, an LGN stage,and feedback from the cortex to luminance-driven multiscale network model of _brightness percep- tion. Visual Research 35, 2201-2233. the LGN stage. PETZOLD,L.R. (1983). Automatic selection of methods for solving stiff In addition to the mathematical description of each stage, and non-stiff systems of ordinary differential equations. SIAM Jour- convolution filters are displayed graphically, and the output nal of Scientific and Statistical Computing 4, 136-148. of eachstage is shown for an image of sufficient simplicity to POLLEN,D.A. & RONNER,S.F. (1981). Phase relationships between adja- allow a detailed inspectionof the effects of eachstage (Fig. 16). cent simple cells in the visual cortex. Science 212, 1409-1411. REDIES, C., CROOK,J.M. & CREUTZfELDT, O.D. (1986). Neuronal re- Parametervalues are also listed for eachstage. These values are sponses to borders with and without luminance gradients in cat visual used for all the simulations in this article. cortex and dLGN. Experimental Brian Research 61, 469-481. ROBSON,J.A. (1983). The morphology of corticofugal axons to the dor- sallateral geniculate nucleus in the cat. Journal of Comparative Neu- ON and OFF retinal shunting networks rology 216, 89-103. SCHILLER,P. (1992). The ON and OFF channels of the visual system. This first stage of processing involves two parallel center- Trends in Neuroscience IS, 86-92. surround networks. These networks compensate for variable SCHWARTZ,E.L., DESIMONE,R., ALBRIGHT, T. & GROSS,C.G. (1983). illumination ("discount the illuminant") while suppressingnoise Shape recognition and inferior temporal neurons. Proceedings of and computing contrasts in the image. In the ON channel, the the National Academy of Sciences of the U.S.A. 80, 5776-5778. SILLlTO, A.M., JONES, H.E., GERSTEIN,G.L. & WEST, D.C. (1994). centeris excitatory while the surround is inhibitory (Fig. 24A), Feature-linked synchronization of thalamic relay cell firing induced whereas in the OFF channel, the center is inhibitory and the by feedback from the visual cortex. Nature 369, 479-482. surround is excitatory (Fig. 248). The cells in each channel SILLlTO,A.M. & KEMP, J.A. (1983). The influence of GABAergic inhib- obey membrane, or shunting, equations coupled by distance- itory processes on the receptive field structure of X and Y cells in cat dorsal lateral geniculate nucleus (dLGN). Brain Research 277, dependentinteractions (Grossberg, 1983), whereby inputs Ipq 63-77. at position (p,q) are filtered by convolution filters that are SINGER,W. (1977). Control of thalamic transmission by corticofugal defined by isotropic two-dimensional Gaussians.The ON and caland Reviewascending 57, reticular386-420. pathways in the visual system.. Physiologi- OFF channelsthus have activities Xu and Xi}' re~pectively,at cell positions (i,j) that satisfy the following equations: SINGER,W. (1979). Central-core control of visual-cortex functions. In Neurosciences Fourth Study Program, ed. SCHMITT,1;'.0. et al., Cam- On retinal cells bridge, Massachusetts: M.I. T. Press. SPITZER, H. & HOCHSTEIN,S. (1985). A complex-cell receptive field model. Journal of Neurophysiology 33, 1266-1286. ~ Xi] = -DXi] + (U -Xi])}::; Cpqijlpq STEVENS;K.A. (1978). Computation of locally parallel structure. Bio- dt p,q logical Cybernetics 29, 19-28. -(Xi] + L) }::; SpqijI~ UPDYKE, B.V. (1975). The patterns of projection of cortical areas 17, (1) p,q 18, and 19 onto the laminae of the dLGN in the cat. JournalofCom- parative Neurology 163, 377-396. VARELA,F.J. & SINGER,W. (1987). Neuronal dynamics in the visual cor- and ticothalamic pathway revealed through binocular rivalry. Experi- OFF retinal cells mental Brain Research 66, 10-20. VON DER HEYDT, R., PETERHANS,E. & BAUMGARTNER,G. (1984).lIIu- -x"7d = -Dx."7 + (u -x"7 ) ~ S I sory contours and cortical neuron responses. Science 224, 1260-1262. dt I} /} I} £..I pql} pq WATANABE,T. & SATO,T. (1989). Effects of luminance contrast on color p,q spreading and illusory contour in the neon color spreading effect. -(Xi) + L) b Cpqijlpq Perception and Psychophysics 45, 427-430. .p,q (2) WATANABE,T. & TAKEICHI,H. (1990). The relation between color spread- ing and illusory contours. Perception and Psychophysics 47,457-467. WAXMAN, A.M., SEIBERT,M., BERNARDON,A.M. & FAY, D.A. (1993). where U and L are the upperand lower bounds of the activities Neural syste!lls for automatic target learning and recognition. The x+ and x-, Lincoln Laboratory Journal 6, 77-116. WEBER, A.J., KALIL, R.E. & BEHAN, M. (1989). Synaptic connections between corticogeniculate axons and interneurons in the dorsallat- Cpq;j= Cg2(p,q,i,j,uc)' Spqij = Sg2(P,Q,;,j,Us) (3) eral geniculate nucleus of the cat. Journal ofC;omparative Neurol- ogy 289, 156-164. ZEKI. (1983a). Colour coding in the cerebral cortex: The reaction of and the two-dimensional Gaussian function g2 is defined as cells in monkey visual cortex to wavelengths and colours. Neuro- science 9, 741-765. ZEKI. (1983b). Colour coding in the cerebral cortex: The responses of g2(p,q,i,j,u)= ~ exp(-~ «p -i)2 + (q _j)2)} (4) wavelength-selective and colour coded cells in monkey visual cor- tex to changes in wavelength composition. Neuroscience 9,767-791. At equilibrium the ON channel activity is defined by

Appendix: An improved neural model of boundary completion and surface filling-in (5) This sectiondescribes the revisedBCS and FCS equations after incorporation of the enhancementsand revisions discussedin Brightness, contours, and corticogenicu/atefeedback 1047

000 0

A B

" , o 0

.'..00 g00..".O .,."00,,..,. '..00000..'.ooO

, , , c D

~ ~ ;. ..~ ~ --. .--~ ~ -.

E Fig. 24. DOG filters for the ON (A) and OFF (B) channel retinal output cell. Unfilled circles code pos- itive values, filled circles represent negative values. Both filters havea 3-by-3 center and a l5-by-15 sur- .00000 round. (C) DiscreteGabor filter for a horizontal sim- ple cell. (D) Filter for Competition 1. (E) Cooperative F bipole filter. (F) Filter for Competition 1F.

and the OFF channel activity is defined by The parametersC and Sin eqn. (3)were chosenso that when the two Gaussianfunctions are combined into a DOG filter, the positive- and negative-valuedfilter elements both have an absolute sum of I. Thus C = 1.19and S = 1.20. The input image (6) I in this simulation has values in the range [0.1,I]. The interior regions of the bars have pixel value 0.1, while the background pixels all have value 1.0. A threshold-linear.or half-wave rectified, output function T(x) is applied to the ON and OFF activities of this stage,where

T(x) = max(O,x) (7) LON circuit In the model, LGN relay cells r receive excitatory signals T This function is also used in other stages. The input image (Fig. 16)consists of two parallel bars of from retina, excitatory signalsE from cortical endstoppedcells and inhibitory signalsM from LON interneurons,which are also low luminance on a high luminance background. The output of the retinal stageis shown in Fig. 17A. The parameter val- activated by feedback. The model equations for ON and OFF ues for eqns. (2)-(4) are D = I, U = I, L = I, ac = 0.58, and relay cells are given by as = 2.90. As a result, Xi; + Xi} = O. Therefore, it is possible LON ON relay cells to display both outputs in a singleimage. In Fig. 17A, open cir- cles key ON channelvalues, and filled circles key OFF channel values. As illustrated in Fig. 2, the ON channelresponds near the outside of the bars, and the OFF channelcells respond to the interior of the bars. Far from the bars the input pattern is -(rt + L) L Spq;jMpq (8) uniform, so neither channel responds. P.q

~ 1048 A. Gove,S. Grossberg,and E. Mingo//a

and Simpleand complexcell layers LON OFF relay cells The oriented cells used here are odd-symmetric Gabor filters (Daugman, 1980; Marcelja, 1980). Including even Gabor cells may improve the system's boundary detection capabilities (Cruthirds et al., 1992;Pollen & Ronner, 1981;Spitzer & Hoch- -(';] + L) }::; Spq;jMpq (9) stein, 1985),but adequateresults were here obtained without p,q them. Each odd cell has two subregions,A and B, that are sym- where Cpqij= Cg2(p,q,;,j,uc) and Spqij= Sg2(P,q,;,j,us) are metric about the cell's long axis (Fig. 24C). RegionA is excited on-centerand off-surround Gaussiankernels, respectively,and by net ON channelsignals while region B is excited by net OFF U and L are the upper and lower bounds, respectively, of r's channelsignals. For eachof the K discreteorientations used in activity. The feedbackto this stage is computed by summing the model, there are two simple cells of opposite polarity. Thus over orientation the activity of cortical cells from the outputs there are 2K simple cells, whose activities are half-wave recti- E of endstopped hypercomplexcells: fied and combined pairwise to activate K complex cells (Gross- berg & Mingolla, 1985b). Eachsimple cell subregioncomputes the net ON minus OFF (10) (or OFF minus ON) response for the entire subregion:

The activity of Wijk is defined by eqns. (21)-(24). The inter- A;jk = 2.; (T(r;j) -T(r~»T(G~&) (14) neuron activity M is similarly defined by p,q and

M-I) - T (II) Bijk = ~ (T(rpq) -T(rp~»T(-G~:b) (15) p,q Note that in the excitatory terms of eqns. (8) and (9) the where the oriented Gabor filters G~~Jj,with orientation k, are feedback is gated by the bottom-up signal Xi}' As a result, rotated versions of the horizontal filter with orientation k = 0 the feedback can only enhance the activity of cells that already and frequency w, that is defined by receive retinal input. In the inhibitory terms the feedback does not interact with the bottom-up input. Both terms are consis- G~~~j= G sin(27rw(q -j»exp( -! «(p -i)/Uh>.2 tent with the discussion of LGN circuitry in cat by Weber et al. (1989). Although bottom-up retinal input, by itself, can acti- + «q -j)/Uv)2)} (16) vate model LGN cells, top-down corticogeniculate feedback can- not, by Itself, activate model LGN cells. When both bottom-up For this and all anisotropic filters used in the model, the equa- and top-down inputs are active, a match between bottom-up tion for the horizontal filter is given. Computation of filter retinal input with top-down cortical input can enhance LGN orientations other than horizontal is done by first rotating the processing, while LGN processing of bottom-up retinal inputs filter plane to align the filter's long axis with the x-axis, and then that have no top-down support is suppressed. calculating each filter value by applying the equation for the The result of processing the bar image with the LGN stage horizontal filter. For a filter with orientation index k, for exam- is shown in Fig. 17B. The output LGN output has stronger sig- ple, location (p,q) is first rotated by (Jk= -k?r/Kto give new nals at the ends of the bars rather than at its sides. These are coordinates (p',q') defined by the model analog of brightness buttons.; ..,

By eqns. (8) and (9), at equilibrium, the ON and OFF cell p' =PCOS(Ok) -qsin(Ok) (17) activities of the LGN stage obey the equations and

UT(x;j> + UT(x;j> }::; Cpq;jEpq -1. }::; Spq;jMpq q' =pSin('(Jk) + qCOS«(Jk) (18) r..+- - p,q p,q Ii (12) Then the equation for the horizontal filter at (p,q) is applied to (p',q'). In this way, filter values are computed for filters of every orientation. and The simple cell receptive field is designedso that its activity Sijkis largest when a correctly oriented net ON signal occurs in one half of the cell's receptive field and on equal net OFF signaloccurs in the other half. This property is captured by the ,"7 = 11 (13) D + T(x;j) + T(x;j) }::: Cpq;jEpq + }::: Spq;jMpq equation p,q p,q Simplecells

Default parameters for eqns. (\2) and (\3) of the LON stage Sjjk = T(Ajjk + Bjjk -alAjjk -Bjjkl> (19) are D = \, U = \, L = \, C = \00, Uc= \.0, S = 10, Us= 3.0, W = 0.\6. (Subscripts on parameters such as U, L, C, and S The first two terms compute the net ON and OFF activity of are omitted in their severaluses for simplicity.) the two subregions,which can be positive or negative.The sum Brightness, contours, and corticogeniculatefeedback

A;jk + B;jk in eqn. (19) can be interpreted as the net effect of while the inhibitory convolution filter includes nearby orienta. two operations. In the first operation, ON cells turn on one- tions as well: half of a simple cell's receptive field, as in term T(r~)O(~j), and OFF cells turn on the other half, as in term T(r;j)t( -0;':&). S pqij(r,k) = Sg2 ( p,q,l,j,fJs" )gl ( T,k ,fJr ) (23) This is done for pairs of simple cells that are sensitiveto op- posite direction-of-contrast. Then the two cells inhibit one an- where g. is a one-dimensional Gaussian other before generating a net output signal, as diagrammed in Fig. 21. The remaining term -aIA;jk -B;jkl in eqn. (19) reduces the simple cell responsewhen the subregionsare not gJ(r,k,a) = (27ra2)-J/2exp[- equally activated,with the parametera determiningthe strength of the "penalty." Theseterms work togetherto ensurethat sim- ple cells favor regions betweenadjacent ON and OFF activity, (seeFig. 24D). This form of inhibition prevents non-dominant where true luminance discontinuities typically occur. orientations from producing undesired effects. In particular, if competitionwere strictly among cells of identical orientations, The complex cell activity C;jkis the sum of half-wave recti- fied signals from pairs of like-oriented simple cells of opposite then cells of the preferred orientation in a given region could inhibit one another, while a single, or small number, of "noise" contrast-polarity: Complexcells cells of a nearby orientation could have its activity (relatively) enhanced for lack of competitors. At equilibrium, the activi- ties of the first competitive stage are thus defined by Cjjk = Sjjk + Sjj(k+K) for 0 oSk < K (20)

The original simpleand complexcell layer equationin Grossberg Wijk = and Mingolla (19850)combined area normalization (in the de- nominator) with oriented filtering. Becausethe input to the new version is already normalized with respectto area by the reti- (25) nal stage,normalization is not neededhere, and is omitted for which generalizesthe first competitive stage of Grossbergand simplicity. Parameter values for the simple cell stage are", = 0.2, Mingolla (1987).As in that model, the difference of Gaussians (]h = 1.833, (]v = 0.833, IX = 1.3, and K = 12. ParameterG in term in the numerator of eqn. (25) intensifies the competition eqn. (16) was chosenso that the absolute sum of the positive betweennearby cells of the same orientation so that it is possi- and of the negativevalues in the Gabor filter is I. Becauseof ble to drive a losing cell's activity down to zero. When the pixel sampling differences for different orientations, the value input to the competition has boundaries that are severalpixels of G varied slightly with orientation. In the horizontal (k = 0) wide, the thickness of the boundaries can be reduced to one case,G = 0.556. The output of the complex cell layer for the or two pixels. , -.., two-bar simulati'!n is'~hown in Fig. 18A. . As in Grossbergand Mingolla (1985b, 1987),this stagealso performs endstopping. The convolution filter has a center- surround structure so that a line that fits within the central region will produce a stronger response than a line that also Hypercomplexcells: Spatialcompetition extendsinto the surround. Becausethe input to this stageis ori- This stage uses competition across space, or an endstopping ented, the isotropic convolution filter can appearto favor lines operation, among like-oriented cells to convert output signals of a certain length at a specific orientation. The enhancement from model complex cells into input signals to the first pop- of the line ends (and the relative weakening of the line sides) ulation of model hypercomplex cells, which is called the first can be seenin the output of this stage (Fig. 18B). Parameter competitive stage, or Competition 1. The output of the first values for eqn. (25) are D = 1, U = 1, L = I, J = 0.01, and competitive stageis used as feedbackto the LON stage,as in F = 0.03. Filter parameters in eqns. (22)and (23)are C = 1.0, eqns. (10)and (11), as well as input to the higher-order hyper- Ur = 1.0, S = 1.0, Us= 3.5, and u, = 2.0. complexcells. Competition 1 takesthe form of a standardshunt- ing equation, with two additional terms, a tonic input J and Higher-order hypercomplexcells: Orientational competition feedback v that derives from a later stage in the cooperative- competitive grouping network, or CC Loop: This competition takes place acrossthe orientation dimension. At each spatial position, cells compete with other cells that have same position but different orientation. The result is ori- ~ W;jk= -DW;jk + (U- W;jk)(~Cpq;jCpqk + FT(V;jk) + J) entational sharpening at image locations where no single ori- entation is the clear winner. The other effect of this stage is disinhibition of signals perpendicularto those that were inhib- -(W;jk + L) ~ St;q;J>cpqr (21) p,q,r ited below the tonic level J in Competition 1. This can cause newsignals to appear that flank and are perpendicularto exist- ing boundaries. These new endcut signals help to generate The excitatory convolution filter is a two-dimensionalGauss- boundaries,such as the Ehrensteincircle of Figs. lA and 9, that ian across space: are perpendicularor obliquely oriented with respectto line ends (Grossberg & Mingolla, 1985b, 1987). Cell activity for Com- Cpqij = Cg2(p,q,i,j,u,,) (22) petition 2 hypercomplexcells is governed by 1050 A. Gove,S. Grossberg,and E. Mingo//a d . dt Yijk = -DYijk + (U -Yijk) ~ CrkT( Wijr) where

Dpq;j = "(p -;)2 + (q -j)2 -(Yijk + L) ~ SrkT( Wijr) (26) (35) r where interaction betweencells at orientations k and r is defined and by one-dimensional Gaussiankernels g2: -I ( p-i ). Fpqij=tan S-(q-j) s*q-j (36) Crt = Cg1(r,k,u,,) (27) Variable s in eqn. (36) is given by 8'k = 8g.(r,k,us) (28) At equilibrium, (p -;)2 + (q _j)2 s= (37) 2(q -j) The first term in the exponential of eqn. (34) modulates filter Yijk = (29) values based on their distance Dpq;jfrom the bipole's center, D + ~ (Crk+ Srk)T(w;jr) where p is the optimal distance from the center. The second term computes the slope of the tangent at (p,q) of the circle The output of Competition 2 is displayed in Fig. 18C. Be- centered at (O,s) which passesthrough (0,0) (the bipole cell's cause the output of Competition 1 has orientationally sharp origin) and (p, q); see Fig. 23. This circle has equation responsesfor the example input, the sharpening of Competi- tion 2 is not obvious. In addition, the endcuts in this example p2 + (q -S)2 = S2 (38) merely strengthenthe direct filter responsesat the (thick) line end. Parameterchoices for eqn. (29) are D = I, U = I, and and by implicit differentiation its tangent is L = 1. Filter parameter for eqns. (27)and (28)are C = 4.323. a.. = 1.208,S = 4.323, and as = 1.932. ~=-E- dp s -q (39) Bipole cells: Long-range cooperation Note that the radius s will vary from point to point. The sec- The cooperative stage sums input from two lobes of a bipole ond term in the exponential penalizesorientations in the filter cell filter (Fig. 24E) that is activated by boundary signals from that have large tangentvalues. The most favorable orientations the previous hypercomplexcell stage. If there is sufficient acti- (for this term) are those similar to the bipole's main axis. The vation in both lobes, feedbacksignals are generated that ini- third term of eqn. (34)measures the similarity of the orientation tiate bo,undarycompletion. The equation for this stage is of point (p, q, r) and the angle formed by the tangent at that point. The tangentdefines the optimal orientation for that point. Filter elementorientations closerto this optimal value will have (30) greater strength than those at larger angular separations. At equilibrium, the unthresholded bipole cell activity is, by where eqns. (30)-(33),

f(x) = E +T(x) T(x) (31)

A;jk = }::; (T(Yijr) -T(Y;jR»T(Z'/q;J» (32) (40) p,q,r and The new filter is consistentwith the data of Field et al. (1993) and of Kellman and Shipley (1991). The new equation is also Bijk = ~ (T(Yijr) -T(YijR»T( -Z<;qiJ) more easilymodified and scaled than the original of Grossberg (33) and Mingolla (1985b) becauseall of the terms are contained p,q,r within a single exponentialfunction. A similar bipole cell equa- In eqns. (31)and (32), R is the orientation perpendicularto r. tion, basedon parabolasinstead of circles,was used by Cruthirds The new bipole filter is defined by et al. (1992). The changeto circles simplifies the equation as much as possible. The output for this stage(Fig. 180) showswhere boundary z(r,O) pqij =Zsgn(p-i) enhancementand completiontake place. The boundaries of the X exp ( -(Dpqij2- p)2 -~ -K ("If -Fpqij )2 \ long (horizontal) side of the bars are long enoughto stimulate horizontal bipole cells, as well as some oblique angle bipoles.

2u. 2U2 2 In addition, the short, vertical boundaries of the bars are able 2 2uJ ) to stimulate vertical bipole cells that lie in the gap betweenthe (34) lines. These vertical activations will ultimately give rise to two Brightness, contours, and corticogeniculatefeedback 1051 boundariesconnecting the lines at their ends. Parametervalues eachorientation. The feedbacksignals are spatially sharpened for eqns. (31)and (34)are E= 0.15, Z = I, (11= 4.0, p = 10.0, in this stage before being added back into the loop, as in (12= 0.3, and (13= 0.1. In an image processing application of BCS/FCS mecha- nisms, Grossberg et al. (1994b)employed a variation in the bipole weighting function where p = 0, resulting in the stron- gest "weights" at locations near the bipole center. Such an ar- rangementallows bipole activity to remainstrong from interiors -(Vijk + L) L S~~&T(Upqk) p,q,r of segmentsright up to endpoints, while still preventing out- ward completion beyond inducers. where

C Ia) C ( .. Top-downorientation competition pqij = g3 p,q,l,j,Uc,Ug ) This stageis homologous to Competition 2. It is called Com- 8 ( .. 8(0)pqij = g3 p,q,l,j,U..,Uh) petition 2F, with the 'F' indicating that the competition is in the feedback portion of the CC Loop. This stage, which was not and g3 is defined by included in the original BCS, sharpens boundary completions that have components from severalorientations by enhancing the orientations at each point that are most highly favored by g3(p,q,i,j,C1I,C12) the cooperation. The equation for this stage is given by exp -21 (( P-i-;;;- )2 + (~q_j )2)J (49) = (27ru) U2)- [

When ag :#: ac: or ah :#: as the filters are anisotropic, taking an -(U;jk + L) LSrkH(Z;jr: (41) elliptical shape, as in Fig. 24F. When this is the case, rotated r versions of the filters are applied within each orientation. This where filter responds well to a line of activity while at the same time limiting the thickness of the line. At equilibrium, Crk = Cg\(r,k,u(") (42)

Srt = Sg.(r,k,us) (43)

H(z) = HT(z -J) (44) The output of this stage (Fig. 18F)is a sharper version of its The signal function H(z) determines whether a cooperative input wherein many of the flanking pixels in Fig. 18Ehave dis- bipole cell is activated enough by both of its receptive fields appearedor beensubstantially weakened. Parameter values for to generate output signals that participate in the competition. eqn. (46) are D = I, U = I, and L = I. Filter parameters for At equilibrium, eqns. (47)and (48)are C = 47.6, Uc= 0.95, Ug= 1.0, S = 120.0, Us= 1.0, and Uh= 1.0. ~ (UCrk-LSrk)H(Z;jr) r Uijk = (45) Boundarycompletion D + L (Crk + Srk)H(Zijr) r The output of Competition IF is fed back into Competition I [eqn. (21)] to close the CC Loop. Cooperative boundaries are added to the bottom-up boundaries in Competition I, and the The output of Competition 2F is shown in Fig. 18E. Normally circuit computes the completed boundaries, including illusory this competition reducesthe orientational spreadof the coop- contour boundaries,of the input image. The output of the CC erative signals,as in the complex imagery processedin Gross- Loop is the equilibrium activation of Competition 2 (Fig. 180). berg et al. (1994b). In the presentexample, however, most of These completed boundaries are sharply localized at the cor- the input signalsare already orientationally as sharp as possi- rect spatial positions. They are used in the FCS to contain the ble. The effectsof sharpeningcan be seenby examiningthe pix- spreading of brightness signals. els in the middle of eachbar. Parametervalues for eqns. (41) and (44) are D = I, U = I, L = I, H = I, and J = 1.2. Filter parameters in eqns. (42) and (43) are C = 4.95, U" = 0.865, Filling-in S = 4.95, and Us= 1.385. The brightnesssignals that fill-in this stageare derived from the ON and OFF channels of the LGN stage. As noted in Co- hen and Grossberg (1984)and Grossberg & Todorovic (1988), Top-downspatial competition filling-in uses the ON and OFF signals to recover a surface This feedbackstage is bomologous Competition I, hence it is reconstruction that is relatively uncontaminated by variations called Competition IF, in that it occur across position within in illumination. Filling-in occurs separatelyvia nearest-neighbor 1052 A. Gave,S. Grossberg,and E. Mingo//a

diffusion in ON and OFF filling-in domains, or FIOOs. The Jr ij = Sij+ -Sij - final output is the difference of the ON and OFF FlOO acti- (57) vations at each location, hence a double-opponent response Default parametervalues for the filling-in eqns. (51) and (53) (Grossberg, 1987b; Grossberg& Wyse, 1991). The FIOOs are are D = 0.001, {) = 1000,and 'Y = 10000. two-dimensionalisotropic networks, so that filling-in proceeds equally in all directions until i~ is blocked by a boundary or attenuated with distance. Computer implementation Cell activity in each syncytium is described by a diffusion The computer implementation of the BCS/FCS model is writ- equation ten in C and runs on a Silicon Graphics Iris 4D/280S machine. The equilibrium equations for each stage are used. The LGN d di Sij = -Dsij + T(rij) + feedback loop is computed by cycling once through the rele- L (spq -S;j)Ppq;j (51) p,qeNij vant stages, giving the following order of processing: Retinal stage, LGN stage with no feedback, Complex Cells, Competi- in which a FillO cell Sijreceives input from LGN cell activ- tion I, LGN stage with feedback, Complex Cells, CC Loop, ity 'if. as defined by eqn. (12), and from FIDO cells in the Filling-in. neighborhood The CC Loop is computed by cycling multiple times through the equilibrium equations for Competition I, Competition 2, N;j(i,j-l),(i- ,j),(i + I,j),(i,j + I») (52) Cooperation, Competition 2F, and Competition I F, in that The conductance coefficient Ppqij between two neighboring order. The cycle ends when there is no significant change in the cells depends on the strength of the boundary betweenthem: values of Competition I from the previous cycle. Five cycles are often sufficient, translating to a runtime of about 5 min for a 128 x 128 image. (53) The accuracy of the CC Loop approximation was checked by integrating the dynamic equation for Competition I (while where the equilibrium BCS boundaries from Competition 2 solving the other stages at steady state) using the LSODA soft- of eqn. (29) are summed over orientation: ware integration package (Petzold, 1983). At convergence, the results were indistinguishable from those obtained by the iter- (54) ative method described above. A similar test of the LGN stage yielded equally good results. In both cases, integration takes much longer than the respective approximations. The equilibrium ON and OFF syncytial activities are the solu- In order to achieve a better discrete approximation of filter tions to the sets of simultaneous equations defined, respec- functions than could be obtained by evaluation at a single pixel, tively, by several subpixel calculations were used. For example, the ker- nels of eqn. 3 centered at location (p, q) were evaluated in the range p -0.5 to p + 0.5 in increments of 0.1, and similarly (55) for q values. The average of all these evaluations was used to compute ON and OFF cell responses. Index variables rand k, used to denote orientational tun- and ing, are implicitly modular. Their values "wrap around" at the number of discrete orientations used in the simulations. Thus, to compute cross-orientation competition for 12 orientations, (56) values of r-k must always be between -5 and 5. If r-k is not in this range, 12 is added or subtracted, as necessary. For the sake of visual clarity, the display of the simulated The final filled-in double-opponentciutputjij' shown in Fig. 19, output of eqns. 21-25, shown in Fig. 18B, does not show activity is calculated by subtracting the OFF channel output from the in locations whose only positive input is that of the tonic exci- ON channel output: tation, J.