Towards Using AI to Augment Human Support in Digital Mental Healthcare

Towards Using AI to Augment Human Support in Digital Mental Healthcare Prerna Chikersal Abstract School of Computer Science To address the need for more access to, and increase Carnegie Mellon University, US the effectiveness of, mental health treatment, internet- [email protected] delivered psychotherapy programs such as iCBT are shown to achieve clinical outcomes comparable to face- Gavin Doherty to-face therapy. While offering ubiquitous access to School of Computer Science & Statistics healthcare, a key concern of digital therapy programs is Trinity College Dublin, IRL to sustain users’ engagement with the treatment to [email protected] attain desired benefits. Research has demonstrated that including a trained ‘human supporter’ to the digital Anja Thieme mental health ecosystem can provide useful guidance Healthcare Intelligence and motivation to its users, and lead to more effective Microsoft Research Cambridge, UK outcomes than unsupported interventions. Within this [email protected] context, we describe early research that makes use of machine learning (ML) approaches to better understand how the behaviors of these human supporters may benefit the mental health outcomes of clients; and how such effects could be maximized. We discuss new opportunities for augmenting human support through personalization, and the related ethical challenges. Author Keywords Mental health; digital behavioral intervention; ethics; responsible AI; machine learning; NLP; data mining. CSS Concepts •Human-centered computing → Human computer Submission to CHI 2020 workshop on Technology Ecosystems: Rethinking interaction; •Computing methodologies → Machine Resources for Mental Health learning; Change & Improvement Introduction these outcomes as features in K-means clustering, Rates as Clinical Outcomes Our research applies unsupervised machine learning, through which we obtain k=3 clusters of supporters Message-level Change (MC): and statistical and data mining methods to identify whose messages are generally linked with either ‘high’, The clinical score after a message is highly what supporter behaviors – delivered as part of an ‘medium’ or ‘low’ improvements in client outcomes. We dependent on the clinical score before that message, as clients that have more severe online psychotherapy intervention – correlate with hypothesize that there are differences in the messages symptoms before the message also tend to improve more on average after the message. better client mental health outcomes [1]. Our analysis sent by supporters in the ‘high’ versus ‘low’ outcome Hence, we measure Message-level Change as the difference between actual change and the is based on a fully anonymized dataset of 234,735 clusters; and that these differences will help us identify expected change given the client score before supporter messages to clients (sent by 3,481 what may constitute more effective support strategies. the message. That is, for each supporter S with NM messages, compute: supporters to 54,104 clients) from an established internet-delivered Cognitive Behavioral Therapy (iCBT) Linguistic Strategies used in More Successful Messages program for “depression and anxiety” [7]. This program As a next step, we want to identify what semantic or is one of the most frequently used treatments on the linguistic strategies occur significantly more often in the SilverCloud platform (www.silvercloudhealth.com). messages of supporters in the ‘high’ outcome cluster; Message-level Improvement Rate (MR): If Message-level Change > 0, then the client Accessed online or via mobile, the program presents a and this result needs to be consistent independent of improved more than expected post-message, self-guided intervention with seven core psycho- differences across clients (e.g., variations in their and we label the message as “improved (1)”. Otherwise, we label the message as “not educational and psycho-therapeutic modules that are mental health state, program use). To analyze the improved (0)”. For each supporter S with NM messages, we average these labels across all delivered through interactive multi-modal contents and content of the supporter messages, we used a lexicon- messages to compute this outcome. tools [2]. Clients work through the program at their based text-mining approaches that extract Client-level Change (CC): own pace and time, and with regular, personalized comprehensive text characteristics without risking to While MC captures changes in clinical scores across all messages by a supporter, CC feedback messages sent by a trained human supporter. identify any text content so as to preserve anonymity. normalizes these changes across all clients of the supporter. For each supporter S, we first compute the MC for each client of S separately using the messages that S sent to Aiming to better understand what linguistic strategies Amongst others, this included the: (i) use of the NRC them. Then, we average the MCs per client across all clients of S to get CC. For example, characterize the feedback messages of supporters Emotion Lexicon [6] to capture the sentiment and if a supporter sends 6 messages to client A whose clients have better mental health outcomes; we emotional tone of a message by extracting the whose change is +1 after each messages and 4 messages to client B whose change is employed a set of ML and data mining methods. percentages of positive and negative words, and words always 0, the MC will be 6 = 0.6 while the 6+4 6 0 related to eight emotion categories (e.g., fear, joy, ( )+( ) CC will be 6 4 = 0.5. Thus, MC can be high 2 Understanding Successful Support Strategies anger); (ii) extraction of first person plural pronouns even when only a few clients improve, whereas CC will only be high when these To better understand what support strategies (e.g., we, us, our) as indicator of a supportive alliance; improvements are consistent across all/ many clients. This makes CC more robust to a characterize supporter messages that are correlated (iii) uses of encouraging phrases (e.g., ‘well done’, single client’s changing situations or symptoms. with better clinical outcomes for clients, we first needed ‘good job’); and (iv) use of the Regressive Imagery Client-level Improvement Rate (CR): to identify what constitutes ‘successful support Dictionary [5] to assess percentages of words related For each supporter S with clients NC, we first messages’ based on clinical outcomes. To this end, we to mental health processes such abstraction (e.g., compute the average Message-level Improvement Rate using messages S sent to computed 4 clinical outcomes by averaging post- know, thought), social behavior (e.g., call, say, tell), or each client separately, and then sum these rates across all clients and divide by the total message change in PHQ-9 (for depression) [3] and temporal references (e.g., when, now, then). number of clients. GAD-7 (for anxiety) [4] scores across the messages sent by each supporter (see sidebar). We then use Our semantic analysis revealed statistically significant the well-known frequent item set mining and findings that more successful supporter messages association rule learning Apriori algorithm. It first consistently used more positive and less negative generates a set of frequent items that occur together, words, and had less occurrences of negative emotions and then extracts association rules that explain the conveying sadness and fear. Further, more successful relationship between those items. We extracted messages consistently employed first person plural association rules separately for both our ‘more’ and pronouns more frequently than less successful ‘less’ successful outcome clusters; and then calculated messages, and consistently contained significantly the salience for each of the identified rules as the more encouraging phrases. For our mental process absolute confidence difference between the two clusters Figure 1. Mean percentage of variables, we found that more successful messages (more salient rules are used more frequently by words in ‘more’ and ‘less’ consistently employed more words associated with supporters in either the ‘more’ or ‘less’ successful successful support messages that social behavior and less words associated with cluster). We derived the 1584 most salient rules, are associated with Social abstraction (see Figures 1 and 2; and [1] for details). comprised of 8 support strategies and 66 multi- Behavior (here plotted across at dimensional contexts. clients with different frequencies of looking at content pages of the Client Context Specific Support Strategies iCBT program: ContentViews). The above analysis indicates characteristics of support Upon analyzing these we observed, for example, that messages that tend to correlate with improved client support messages that use few words of fear and more outcomes independent of differences in the clients’ first person plural pronouns have high salience in more specific context (e.g., their mental health, platform successful messages when a user hasn’t engaged much use), when treating different context variables in with the intervention (e.g., no content views, no isolation. Next, we want to better understand the more content shared). This means that writing messages complex relationship that likely exists between the use with less words related to fear, and more first person of support strategies and various client context plural pronouns are strongly associated with more variables. In other words, how may considering the successful support messages, and this

Towards Using AI to Augment Human Support in Digital Mental Healthcare

A Study on Hierarchical Clustering for Identifying Asthma Endotypes (J4R/ Volume 03 / Issue 05 / 004)

Atopic Dermatitis and Respiratory Allergy: What Is the Link

Pediatric Pulmonology, on Pediatric Pulmonology, International Congress Th 15 ISSN 8755-6863 PEDIATRIC

Neurips 2020 Workshop Book

Ds3-Datascience-Polytechniqu…

Ascertaining Patterns of Asthma Symptoms in Childhood Using Machine Learning Methods

Elsevier Editorial System(Tm) for the Lancet Respiratory Medicine Manuscript Draft

Challenges in Using Hierarchical Clustering to Identify Asthma Subtypes: Choosing the Variables and Variable Transformation Mate

Features of Asthma Which Provide Meaningful Insights for Understanding the Disease Heterogeneity

Advances in Data Science 2018: Final Speakers & Discussion Themes

Developmental Profiles of Eczema, Wheeze, and Rhinitis: Two Population-Based Birth Cohort Studies

Uncertainty As a Form of Transparency: Measuring, Communicating, and Using Uncertainty