Reinforcement Learning Using Local Adaptive Models

Total Page:16

File Type:pdf, Size:1020Kb

Reinforcement Learning Using Local Adaptive Models Linköping Studies in Science and Technology Thesis No. 507 Reinforcement Learning Using Local Adaptive Models Magnus Borga LIU-TEK-LIC-1995:39 Department o f Electrical Engineering Linköpings universitet, SE-581 83 Linköping, Sweden Reinforcement Learning Using Local Adaptive Models © 1995 Magnus Borga Department of Electrical Engineering Linköpings universitet SE-581 83 Linköping Sweden ISBN 91-7871-590-3 ISSN 0280-7971 Abstract In this thesis, the theory of reinforcement learning is described and its relation to learn- ing in biological systems is discussed. Some basic issues in reinforcement learning, the credit assignment problem and perceptual aliasing, are considered. The methods of tem- poral difference are described. Three important design issues are discussed: information representation and system architecture, rules for improving the behaviour and rules for the reward mechanisms. The use of local adaptive models in reinforcement learning is suggested and exempli®ed by some experiments. This idea is behind all the work pre- sented in this thesis. A method for learning to predict the reward called the prediction matrix memory is pre- sented. This structure is similar to the correlation matrix memory but differs in that it is not only able to generate responses to given stimuli but also to predict the rewards in reinforcement learning. The prediction matrix memory uses the channel representation, which is also described. A dynamic binary tree structure that uses the prediction matrix memories as local adaptive models is presented. The theory of canonical correlation is described and its relation to the generalized eigen- problem is discussed. It is argued that the directions of canonical correlations can be used as linear models in the input and output spaces respectively in order to represent in- put and output signals that are maximally correlated. It is also argued that this is a better representation in a response generating system than, for example, principal component analysis since the energy of the signals has nothing to do with their importance for the response generation. An iterative method for ®nding the canonical correlations is pre- sented. Finally, the possibility of using the canonical correlation for response generation in a reinforcement learning system is indicated. ii To Maria Copyright c 1995 by Magnus Borga Acknowledgements Although I have spent most of the time by my self this summer writing this thesis, there are a number of people who have made it possible for me to realize this work. First of all, I would like to thank all my colleagues at the Computer Vision Laboratory for their professional help in the scienti®c work and, most of all, for being good friends. Many thanks to my supervisor Dr. Hans Knutsson whose ideas, inspiration and enthusi- asm is behind most of the work presented in this thesis. I also thank professor GÈosta Granlund for running this research group and letting me be a part of it. Among my other colleagues I would in particular like to thank Tomas Landelius and JÈorgen Karlholm for spending a lot of time discussing ideas and problems with me, for reading and constructively criticizing the manuscript and for giving me a lot of good ref- erences. A special thank to Tomas for providing me with interesting literature. Slainte! I would also like to thank Dr. Mats Andersson for his kind help with a lot of technical details which are not described in this thesis and Dr. Carl-Johan Westelius for the beer supply that eased the pain of writing during the warmest days of the summer. I should also express my gratitude to Niclas Wiberg who evoked my interest in neural networks. I gratefully acknowledgethe ®nancial support from NUTEK, the Swedish National Board for Industrial and Technical Development. Finally I would like to thank my wife, Maria, for her love and support, for spending all her summer vacation here in the city with me instead of being at the summer cottage and for proof-reading my manuscript and giving constructive critique of the language in my writing. Remaining errors are to be blamed on me, due to ®nal changes. ªWe learn as we do and we do as well as we have learned.º - Vernon B. Brooks Contents 1 INTRODUCTION 1 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1.1 Background : 1 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1.2 The thesis : 2 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1.3 Notation : 3 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 1.4 Learning : 5 : : : : : : : : : : : : : : : : : : : : : : : 1.4.1 Machine learning : 6 : : : : : : : : : : : : : : : : : : : : : : : : 1.4.2 Neural networks : 7 2 REINFORCEMENT LEARNING 13 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 2.1 Introduction : 13 : : : : : : : : : : : : : : : : : : 2.2 Problems in reinforcement learning : 16 : : : : : : : : : : : : : : : : : : : : : : : 2.2.1 Perceptual aliasing : 16 : : : : : : : : : : : : : : : : : : : : : : : 2.2.2 Credit assignment : 18 : : : : : : : : : : : : : : : : 2.3 Learning in an evolutionary perspective : 19 : : : : : : : : : : : : : : : : 2.4 Design issues in reinforcement learning : 22 : : : : : : 2.4.1 Information representation and system architecture : 22 : : : : : : : : : : : : : : : 2.4.2 Rules for improving the behaviour : 24 : : : : : : : : : : : : : : : 2.4.3 Rules for the reward mechanisms : 27 : : : : : : : : : : : : : : : : 2.5 Temporal difference and adaptive critic : 29 : : : : : : : : : : : : : : : : : : : : : : : 2.5.1 A classic example : 33 : : : : : : : : : : : : : : : : : : : : : : : : : 2.6 Local adaptive models : 34 : : : : : : : : : : : 2.6.1 Using adaptive critic in two-level learning : 37 : : : : : : : : : : : : : : : : : : : : : 2.6.2 The badminton player : 38 3 PREDICTION MATRIX MEMORY 43 : : : : : : : : : : : : : : : : : : : : : : : 3.1 The channel representation : 43 : : : : : : : : : : : : : : : : : : : : : : : : : 3.2 The order of a function : 48 : : : : : : : : : : : : : : : : : : : : 3.3 The outer product decision space : 48 : : : : : : : : : : : : : : : : : : : : : : 3.4 Learning the reward function : 49 : : : : : : : : : : : : : : : : : : : : 3.4.1 The Learning Algorithm : 51 : : : : : : : : : : : : : : : : : : : : 3.4.2 Relation to TD-methods : 53 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 3.5 A tree structure : 54 : : : : : : : : : : : : : : : : : : : : : 3.5.1 Generating responses : 54 : : : : : : : : : : : : : : : : : : : : : : : : 3.5.2 Growing the tree : 57 vii viii : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 3.6 Experiments : 60 : : 3.6.1 The performance when using different numbers of channels : 61 : : : : : : : : : : : : : : : : : : : 3.6.2 The use of a tree structure : 62 : : : : : : : : : : : : : : : : : : : : : : : 3.6.3 The XOR-problem : 63 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 3.7 Conclusions : 65 4 CANONICAL CORRELATION 67 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 4.1 Introduction : 67 : : : : : : : : : : : : : : : : : : : : : 4.2 The generalized eigenproblem : 69 : : : : : : : : : : : : : : : : : : : : : 4.2.1 The Rayleigh quotient : 69 : : : : : : : : : : : : : : : : : : 4.2.2 Finding successive solutions : 71 : : : : : : : : : : : : : : : 4.2.3 Statistical properties of projections : 72 : : : : : : : : : : : : : : : : : : : : : 4.3 Learning canonical correlation : 76 : : : : : : : : : 4.3.1 Learning the successive canonical correlations : 77 : : : : : : : : : : : : : : : : : : : : : : : : : : 4.4 Generating responses : 79 : : : : : : : : : : : : : : 4.5 Relation to a least mean square error method : 82 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 4.6 Experiments : 83 : : : : : : : : : : : : : : : : : : : : : 4.6.1 Unsupervised learning : 84 : : : : : : : : : : : : : : : : : : : : 4.6.2 Reinforcement learning : 87 4.6.3 Combining canonical correlation and the prediction matrix mem- : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : ory : 89 : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : 4.7 Conclusions : 92 5 SUMMARY 95 A PROOFS 97 : : : : : : : : : : : : : : : : : : : : : : : : : : A.1 Proofs for chapter 3 : 97 : : : : : : : : : : : : : : A.1.1 The constant norm of the channel set : 97 : : : : : : : : : A.1.2 The constant norm of the channel derivatives : 99 : : : : : : : : : : : : A.1.3 Derivation of the update rule (eq. 3.17) : 99 : : : : : : : : : : : : A.1.4 The change of the father node (eq. 3.36) : 100 : : : : : : : : : : : : : : : : : : : : : : : : : : A.2 Proofs for chapter 4 : 100 : : : : : : : : A.2.1 The derivative of the Rayleigh quotient (eq. 4.3) : 100 : : : : A.2.2 Orthogonality in the metrics A and B (eqs. 4.5 and 4.6) : 100 : : : : : : : : : : : : : : : : : : : : : : A.2.3 Linear independence : 101 : : : : : : : : : : : : : : : : : : : : : A.2.4 The range of r (eq. 4.7) 101 : : : : : : : : : : : : : : : A.2.5 The second derivative of r (eq. 4.8) 102 : : : : : : : : : : A.2.6 Positive eigenvalues of the Hessian (eq. 4.9) : 102 : : : : : : : : : : : : : A.2.7 The successive eigenvalues (eq. 4.10) : 103 : : : : : : : : : : A.2.8 The derivative of the covariance (eq. 4.15) : 103 : : : : : A.2.9 The derivatives of the correlation (eqs. 4.22 and 4.23) : 104 : : : : : : : : : : : : : : : : : : : : A.2.10 The equivalence of r and 104 : : A.2.11 Invariance with respect to linear transformations (page 85) : 105 ix C C : : : : : : yy A.2.12 Orthogonality in the metrics xx and (eq. 4.40) 106 : : : : A.2.13 The equilibrium
Recommended publications
  • Rule-Based Reinforcement Learning Augmented by External Knowledge
    Rule-based Reinforcement Learning augmented by External Knowledge Nicolas Bougie12, Ryutaro Ichise1 1 National Institute of Informatics 2 The Graduate University for Advanced Studies Sokendai [email protected], [email protected] Abstract Learning from scratch and lack of interpretability impose some problems on deep reinforcement learning methods. Reinforcement learning has achieved several suc- Randomly initializing the weights of a neural network is in- cesses in sequential decision problems. However, efficient. Furthermore, this is likely intractable to train the these methods require a large number of training model in many domains due to a large amount of required iterations in complex environments. A standard data. Additionally, most RL algorithms cannot introduce ex- paradigm to tackle this challenge is to extend rein- ternal knowledge limiting their performance. Moreover, the forcement learning to handle function approxima- impossibility to explain and understand the reason for a de- tion with deep learning. Lack of interpretability cision restricts their use to non-safety critical domains, ex- and impossibility to introduce background knowl- cluding for example medicine or law. An approach to tackle edge limits their usability in many safety-critical these problems is to combine simple reinforcement learning real-world scenarios. In this paper, we study how techniques and external knowledge. to combine reinforcement learning and external A powerful recent idea to address the problem of computa- knowledge. We derive a rule-based variant ver- tional expenses is to modularize the model into an ensemble sion of the Sarsa(λ) algorithm, which we call Sarsa- of experts [Lample and Chaplot, 2017], [Bougie and Ichise, rb(λ), that augments data with complex knowledge 2017].
    [Show full text]
  • Personality and Individual Differences 128 (2018) 162–169
    Personality and Individual Differences 128 (2018) 162–169 Contents lists available at ScienceDirect Personality and Individual Differences journal homepage: www.elsevier.com/locate/paid Risk as reward: Reinforcement sensitivity theory and psychopathic T personality perspectives on everyday risk-taking ⁎ Liam P. Satchella, , Alison M. Baconb, Jennifer L. Firthc, Philip J. Corrd a School of Law and Criminology, University of West London, United Kingdom b School of Psychology, Plymouth University, United Kingdom c Department of Psychology, Nottingham Trent University, United Kingdom d Department of Psychology, City, University of London, United Kingdom ARTICLE INFO ABSTRACT Keywords: This study updates and synthesises research on the extent to which impulsive and antisocial disposition predicts Personality everyday pro- and antisocial risk-taking behaviour. We use the Reinforcement Sensitivity Theory (RST) of Reinforcement Sensitivity Theory personality to measure approach, avoidance, and inhibition dispositions, as well as measures of Callous- Psychopathy Unemotional and psychopathic personalities. In an international sample of 454 respondents, results showed that Callous-unemotional traits RST, psychopathic personality, and callous-unemotional measures accounted for different aspects of risk-taking Risk-taking behaviour. Specifically, traits associated with ‘fearlessness’ related more to ‘prosocial’ (recreational and social) risk-taking, whilst traits associated with ‘impulsivity’ related more to ‘antisocial’ (ethical and health) risk-taking. Further, we demonstrate that psychopathic personality may be demonstrated by combining the RST and callous- unemotional traits (high impulsivity, callousness, and low fear). Overall this study showed how impulsive, fearless and antisocial traits can be used in combination to identify pro- and anti-social risk-taking behaviours; suggestions for future research are indicated. 1.
    [Show full text]
  • Fact Sheet - Reinforcement
    FACT SHEET - REINFORCEMENT Positive Reinforcement Because many children with autism have difficulty with communication, play skills, and socialization, it is often difficult to motivate them to engage in activities that incorporate these skills. Positive reinforcement can provide additional motivation to help shape and increase developmentally approriate behaviors. A positive reinforcer is anything that is added following a behavior that increases the likelihood of the behavior occuring again in the future. Rewards are often given to children when they engage in desirable behaviors, but if the reward does Contact Information not cause those behaviors to increase in the future, then the reward is not actually a Website: positive reinforcer. www.coe.fau.edu/card/ Various Forms of Reinforcement Boca Raton Campus 777 Glades Road Natural Reinforcement: A child’s positive behaviors and social interactions Boca Raton, FL. 33431 are reinforced naturally. The natural consequences of positive behaviors become reinforcing themselves. Successful interactions become motivating to the child. Main Line: 561/ 297-2023 Toll Free: 1-888-632-6395 Examples: Fax: 561/297-2507 ♦ There is a ball out of reach for a child. The child says, “Ball,” and an adult Port St Lucie Campus hands the ball to the child. Access to the ball is reinforcing and increases the likelihood of the child requesting “ball” in the future. 500 NW California Blvd. ♦ A child is struggling with a difficult puzzle. The child says, “Help,” and an Port St Lucie, FL. 34986 adult helps the child. Completion of the puzzle is reinforcing. This successful interaction increases the likelihood of the child attempting puzzles Main Line: 561/ 297-2023 in the future and requesting help when needed.
    [Show full text]
  • The Building Blocks of Treatment in Cognitive-Behavioral Therapy
    Isr J Psychiatry Relat Sci Vol 46 No. 4 (2009) 245–250 The Building Blocks of Treatment in Cognitive-Behavioral Therapy Jonathan D. Huppert, PhD Department of Psychology, The Hebrew University of Jerusalem, Jerusalem, Israel Abstract: Cognitive behavioral therapy (CBT) is a set of treatments that focus on altering thoughts, sensations, emo- tions and behaviors by addressing identified maintenance mechanisms such as distorted thinking or avoidance. The current article describes the history of CBT and provides a description of many of the basic techniques used in CBT. These include: psychoeducation, self-monitoring, cognitive restructuring, in vivo exposure, imaginal exposure, and homework assignments. Cognitive-behavioral therapy (CBT) is probably progress has been made in understanding the psy- better called cognitive and behavioral therapies, chological mechanisms involved in treatment (e.g., given that there are many treatments and traditions 10), though there is still much to understand. that fall under the rubric of CBT. These therapies The basic tenets of CBT theory of mental illness have different emphases on theory (e.g., cognitive is that psychopathology is comprised of maladap- versus behavioral) and application (e.g., practical tive associations among thoughts, behaviors and vs. theoretical). Historically, behavior therapy de- emotions that are maintained by cognitive (at- veloped out of learning theory traditions of Pavlov tention, interpretation, memory) and behavioral (1) and Skinner (2), both of whom considered processes (avoidance, reinforcement, etc.). Within learning’s implications for psychopathology. More CBT theories, there are different emphases on direct clinical applications of behavioral prin- aspects of the characteristics of psychopathology ciples were developed by Mowrer (3), Watson and and their maintenance mechanisms (e.g., 11–14).
    [Show full text]
  • Throwing More Light on the Dark Side of Personality: a Re-Examination of Reinforcement
    City Research Online City, University of London Institutional Repository Citation: Broerman, R. L., Ross, S. & Corr, P. J. (2014). Throwing more light on the dark side of psychopathy: An extension of previous findings for the revised Reinforcement Sensitivity Theory. Personality and Individual Differences, 68, pp. 165-169. doi: 10.1016/j.paid.2014.04.024 This is the accepted version of the paper. This version of the publication may differ from the final published version. Permanent repository link: http://openaccess.city.ac.uk/16257/ Link to published version: http://dx.doi.org/10.1016/j.paid.2014.04.024 Copyright and reuse: City Research Online aims to make research outputs of City, University of London available to a wider audience. Copyright and Moral Rights remain with the author(s) and/or copyright holders. URLs from City Research Online may be freely distributed and linked to. City Research Online: http://openaccess.city.ac.uk/ [email protected] Throwing More Light on the Dark Side of Personality: A Re-examination of Reinforcement Sensitivity Theory in Primary and Secondary Psychopathy Scales Broerman, R. L. Ross, S. R. Corr, P. J. Introduction Due to researchers’ differing opinions regarding the construct of psychopathy, the distinction between primary and secondary psychopathy, though it has long been recognized to exist, has yet to be fully understood. This distinction, originally proposed by Karpman (1941, 1948), suggests two separate etiologies leading to psychopathy. Whereas primary psychopathy stems from genetic influences resulting in emotional deficits, secondary psychopathy is associated with environmental factors such as abuse (Lee & Salekin, 2010). Additionally, primary psychopathy is characterized by lack of fear/anxiety, secondary psychopathy is thought more to represent a vulnerability to experience higher levels of negative affect in general (Vassileva, Kosson, Abramowitz, & Conrad, 2005).
    [Show full text]
  • PSYCO 282: Operant Conditioning Worksheet
    PSYCO 282 Behaviour Modification Operant Conditioning Worksheet Operant Conditioning Examples For each example below, decide whether the situation describes positive reinforcement (PR), negative reinforcement (NR), positive punishment (PP), or negative punishment (NP). Note: the examples are randomly ordered, and there are not equal numbers of each form of operant conditioning. Question Set #1 ___ 1. Johnny puts his quarter in the vending machine and gets a piece of candy. ___ 2. I put on sunscreen to avoid a sunburn. ___ 3. You stick your hand in a flame and you get a painful burn. ___ 4. Bobby fights with his sister and does not get to watch TV that night. ___ 5. A child misbehaves and gets a spanking. ___ 6. You come to work late regularly and you get demoted. ___ 7. You take an aspirin to eliminate a headache. ___ 8. You walk the dog to avoid having dog poop in the house. ___ 9. Nathan tells a good joke and his friends all laugh. ___ 10. You climb on a railing of a balcony and fall. ___ 11. Julie stays out past her curfew and now does not get to use the car for a week. ___ 12. Robert goes to work every day and gets a paycheck. ___ 13. Sue wears a bike helmet to avoid a head injury. ___ 14. Tim thinks he is sneaky and tries to text in class. He is caught and given a long, boring book to read. ___ 15. Emma smokes in school and gets hall privileges taken away. ___ 16.
    [Show full text]
  • Integrating the Olweus Bullying Prevention Program and Positive Behavioral Interventions and Supports in Pennsylvania
    Integrating the Olweus Bullying Prevention Program and Positive Behavioral Interventions and Supports in Pennsylvania 2 Integrating the Olweus Bullying Prevention Program and Positive Behavioral Interventions and Supports in Pennsylvania Overview of Workgroup and Method This report was prepared with input from This report was produced to summarize the Pennsylvania OBPP-PBIS workgroup. the workgroup’s findings related to the The workgroup included representation following questions: from statewide leadership organizations that support the dissemination of Olweus • Is it possible to implement both OBPP Bullying Prevention Program (OBPP) and and PBIS in a school? Positive Behavioral Interventions and • What strategies support co-implemen- Supports (PBIS) in the commonwealth, tation of OBPP and PBIS? as well as leaders from schools that have • What considerations are warranted experience with both programs/frame- when a school is selecting an evidence- works. The workgroup met on six different based school climate improvement occasions and conducted site visits of program, such as OBPP or PBIS? model implementation sites. Definitions of Bullying Among Youths Bullying is any unwanted aggressive behavior(s) by another youth or group of youths who are not siblings or current dating partners that involves an observed or perceived power imbalance and is repeated multiple times or is highly likely to be repeated. Bullying may inflict harm or distress on the targeted youth including physical, psychological, social or educational harm. – Centers for
    [Show full text]
  • Giving Students the Tools to Reduce Bullying Behavior Through the Blending of School-Wide Positive Behavior Support, Explicit In
    Giving students the tools to reduce bullying behavior through the blending of school-wide positive behavior support, explicit instruction, and a redefinition of the bullying construct. CONTENTS ii. Before We Intervene ................................................................................... ii-1 1. Student Curriculum (Part 1) ..................................................................... 1-1 Objectives and Procedure ............................................................................................................... 1-1 Teaching the Social Responsibility Skills .................................................................................... 1-3 2. Student Curriculum (Part 2) ...................................................................... 2-1 Responding to Stop/walk/talk ...................................................................................................... 2-2 Group Practice .................................................................................................................................... 2-2 3. Gossip .............................................................................................................. 3-1 Stop/walk/talk with gossip ............................................................................................................ 3-2 Group Practice .................................................................................................................................... 3-2 4. Inappropriate Remarks ..............................................................................
    [Show full text]
  • Top 10 Behaviors of an Expert Animal Trainer
    The Top 10 Behaviors of Expert Animal Trainers Steve Martin Natural Encounters, Inc. Abstract Think of a trainer you recognize as an expert. Now, think of the characteristics that inspire you to call that person an expert. Is it the person's knowledge, skills, charisma, confidence, reputation or ... something else? This presentation will operationalize some of the most important characteristics that expert animal trainers exhibit, from my point of view. Introduction We all know great trainers in our lives, people we look up to, admire, talk about favorably with others. But, how does a person earn that reputation as a great trainer? And, what separates a great trainer from an average trainer? To answer these questions, we need to start by operationalizing the construct “training skill.” What does a trainer do to earn a reputation and label of “Expert?” “Expert” Operationalized Curators, managers, supervisors, veterinarians, directors and more would benefit from a description of the observable training skills of their staff. Since everyone’s training these days, how does a leader with no experience in training judge the skills of their staff? Because a person has read Don’t Shoot the Dog (a great resource by the way), has a whistle around their neck or a clicker in their hand, and uses jargon that confuses non- trainers, does not mean a person is a highly-skilled trainer. When a vet, curator or director watches a training session how are they to know skillful training when they see it? When the trainer tells them the animal is acting up, distracted by their presence, or messing with their minds, how does the director know the real problem isn’t the trainer encroaching on the animal’s personal space, unclear criteria, low rate of reinforcement, poor antecedent arrangement, or one of many other common training mistakes? For that matter, how does the trainer know? Good training involves the artful application of scientific principles.
    [Show full text]
  • Reinforcement
    Reinforcement This article is about the psychological concept. For behavior but this term may also refer to an enhancement the construction materials reinforcement, see Rebar. of memory. One example of this effect is called post- For reinforcement learning in computer science, see training reinforcement where a stimulus (e.g. food) given Reinforcement learning. For beam stiffening, see shortly after a training session enhances the learning.[2] Stiffening. This stimulus can also be an emotional one. A good ex- In behavioral psychology, reinforcement is a ample is that many people can explain in detail where they were when they found out the World Trade Center was attacked.[3][4] Reinforcement is an important part of operant or instrumental conditioning. 1 Introduction B.F. Skinner was a well-known and influential researcher who articulated many of the theoretical constructs of reinforcement and behaviorism. Skinner defined rein- forcers according to the change in response strength (re- sponse rate) rather than to more subjective criteria, such as what is pleasurable or valuable to someone. Accord- Diagram of operant conditioning ingly, activities, foods or items considered pleasant or en- joyable may not necessarily be reinforcing (because they consequence that will strengthen an organism’s future be- produce no increase in the response preceding them). havior whenever that behavior is preceded by a specific Stimuli, settings, and activities only fit the definition of antecedent stimulus. This strengthening effect may be reinforcers if the behavior that immediately precedes the measured as a higher frequency of behavior (e.g., pulling potential reinforcer increases in similar situations in the a lever more frequently), longer duration (e.g., pulling future; for example, a child who receives a cookie when a lever for longer periods of time), greater magnitude he or she asks for one.
    [Show full text]
  • Repeat Victimisation, Retraumatisation and Victim Vulnerability
    Send Orders for Reprints to [email protected] 36 The Open Criminology Journal, 2015, 8, 36-48 Open Access Repeat Victimisation, Retraumatisation and Victim Vulnerability Nicola Graham-Kevan*, Matthew Brooks, VJ Willan, Michelle Lowe, Phaedra Robinson, Roxanne Khan, Rachel Stokes, May Irving, Marta Karwacka and Joanne Bryce School of Psychology, University of Central Lancashire, UK Abstract: This study explores the contribution that traumatic experiences and psychological post-traumatic stress symptoms make to predicting subsequent revictimisation in a sample of violent crime victims. In addition, the timing of first trauma exposure was also explored. Fifty-four adult victims (27 male and 27 female) of police recorded violent crime were interviewed and their traumatic exposure history, trauma symptomology, age at first trauma exposure as well as psychological and psychosocial functioning were assessed. These victims were followed longitudinally and subsequent revictimisation between six and twelve months post index victimisation measured. A greater number of types of trauma exposure was related lower emotional stability, higher trauma symptomology and revictimisation. Those victims with childhood traumatic exposure reported more trauma symptomology exposure than those without prior exposure. The implications for law enforcement and victim services are discussed. Keywords: Crime, victims, violence, psychological trauma, post traumatic press. Interest in revictimisation (revictimisation refers here to of subsequent victimisation increases. This could be through any subsequent victimisation after the recorded index violent maladaptive coping (Fortier, DiLillo, Messman-Moore, victimisation) has been increasing over the past decade Peugh, DeNardi & Gaffey, 2009), such as substance use (Farrell, 2005) and so the factors that help to explain this (Dumais, De Benedictis, Joyal, Allaire, Lessage & Côte, phenomena are an important area to research (Davis, 2013; Hassel, Nordfjærn & Hagen, 2013), hypervigilance Maxwell, & Taylor, 2006).
    [Show full text]
  • How New Ties Facilitate the Mutual Reinforcement of Status and Bullying in Elementary Schools Rozemarijn Van Der Ploeg, Christian Steglich and René Veenstra
    The way bullying works: How new ties facilitate the mutual reinforcement of status and bullying in elementary schools Rozemarijn van der Ploeg, Christian Steglich and René Veenstra The self-archived postprint version of this journal article is available at Linköping University Institutional Repository (DiVA): http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-154080 N.B.: When citing this work, cite the original publication. van der Ploeg, R., Steglich, C., Veenstra, R., (2019), The way bullying works: How new ties facilitate the mutual reinforcement of status and bullying in elementary schools, Social Networks. https://doi.org/10.1016/j.socnet.2018.12.006 Original publication available at: https://doi.org/10.1016/j.socnet.2018.12.006 Copyright: Elsevier http://www.elsevier.com/ ACCEPTED MANUSCRIPT / UNCORRECTED PROOF The Way Bullying Works: How New Ties Facilitate the Mutual Reinforcement of Status and Bullying in Elementary Schools Rozemarijn van der Ploega,b, Christian Steglicha,c, René Veenstraa a Department of Sociology, University of Groningen, Groningen, the Netherlands b Department of Pedagogy and Educational Science, University of Groningen, Groningen, the Netherlands c Institute for Analytical Sociology, Linköping University, Linköping, Sweden Article in press Social Networks: https://doi.org/10.1016/j.socnet.2018.12.006 Correspondence regarding this paper should be addressed to Rozemarijn van der Ploeg, University of Groningen, Grote Rozenstraat 38, 9712 TJ Groningen, the Netherlands. Electronic mail may be addressed to [email protected]. Telephone: 0031 50 363 2486. 1 ACCEPTED MANUSCRIPT / UNCORRECTED PROOF Highlights Younger children punished bullying by a refusal to attribute status to bullies.
    [Show full text]