A Hierarchical Contextual Attention-Based GRU Network For

Total Page:16

File Type:pdf, Size:1020Kb

A Hierarchical Contextual Attention-Based GRU Network For A Hierarchical Contextual Attention-based GRU Network for Sequential Recommendation Qiang Cui1 Shu Wu1 Yan Huang1 Liang Wang1;2 1Center for Research on Intelligent Perception and Computing National Laboratory of Pattern Recognition 2Center for Excellence in Brain Science and Intelligence Technology Institute of Automation, Chinese Academy of Sciences [email protected], fshu.wu, yhuang, [email protected] Abstract mance on different tasks, for example next basket rec- ommendation (Yu et al. 2016), location prediction (Liu Sequential recommendation is one of fundamental tasks for et al. 2016c), product recommendations (Cui et al. 2016; Web applications. Previous methods are mostly based on Hidasi et al. 2016), and multi-behavioral prediction (Liu, Markov chains with a strong Markov assumption. Recently, recurrent neural networks (RNNs) are getting more and more Wu, and Wang 2017). These RNN methods mostly employ a popular and has demonstrated its effectiveness in many tasks. common strategy: using the last hidden state of RNN as the The last hidden state is usually applied as the sequence’s rep- user’s final representation then making recommendations to resentation to make recommendation. Benefit from the nat- users. ural characteristics of RNN, the hidden state is a combina- In recommender systems, a better recommendation tion of long-term dependency and short-term interest to some should capture both the long-term user-taste and short- degrees. However, the monotonic temporal dependency of term sequential effect (Rendle, Freudenthaler, and Schmidt- RNN impairs the user’s short-term interest. Consequently, the hidden state is not sufficient to reflect the user’s final inter- Thieme 2010; Wang et al. 2015). They are referred as long- est. In this work, to deal with this problem, we propose a term dependency and short-term interest in this work. The Hierarchical Contextual Attention-based GRU (HCA-GRU) hidden state of RNN has both characteristics in nature and network. The first level of HCA-GRU is conducted on the in- is very suitable for recommendation. With the help of gated put. We construct a contextual input by using several recent activation function like long-short term memory (Hochreiter inputs based on the attention mechanism. This can model the and Schmidhuber 1997) or gated recurrent unit (Cho et al. complicated correlations among recent items and strengthen 2014), RNN can better capture the long-term dependency. the hidden state. The second level is executed on the hidden This advantage allows the hidden state to be able to con- state. We fuse the current hidden state and a contextual hid- nect previous information for a long time. Due to the re- den state built by the attention mechanism, which leads to a current structure and fixed transition matrices, RNN holds more suitable user’s overall interest. Experiments on two real- world datasets show that HCA-GRU can effectively generate an assumption that temporal dependency has a monotonic the personalized ranking list and achieve significant improve- change with the input time steps (Liu, Wu, and Wang 2017). ment. It assumes that the current item or hidden state is more sig- nificant than the previous one. This monotonic assumption is reasonable on the whole sequence, and it also enables hid- 1 Introduction den state to capture the user’s short-term interest to some Recently, with the development of Internet, some applica- degrees. tions of sequential scene have become numerous and multi- There is a problem that the monotonic assumption of lateral, such as ad click prediction, purchase recommenda- RNN restricts the modeling of user’s short-term interest. Be- arXiv:1711.05114v3 [cs.IR] 5 Feb 2018 tion and web page recommendation. A user’s behaviors in cause of this problem, the forementioned common strategy such applications is a sequence in chronological order, and is not sufficient for better recommendation. First, short-term his following behaviors can be predicted by sequential rec- interest should discover correlations among items within a ommendation methods. Taking online shopping for instance, local range. This will make the recommendation very re- after a user buys an item, the application would predict a list sponsive for the coming behaviors based on recent behav- of items that the user might purchase in the near future. iors (Wang et al. 2015). Next, for short-term interest in rec- Traditional sequential methods usually focus on Markov ommender systems, we can not say that one item or hidden chains (Rendle, Freudenthaler, and Schmidt-Thieme 2010; state is must more significant than the previous one (Liu, Chen, Wang, and Wang 2015; He and McAuley 2016a). Wu, and Wang 2017). The correlations are more compli- However, the underlying strong Markov assumption has dif- cated, but the monotonic assumption can not well distin- ficulty in constructing more effective relationship among guish importances of the several recent factors. For example, various factors. Lately, with the great success of deep learn- when a user buys clothes online, he might buy it for some- ing, recurrent neural network (RNN) methods become more one else. Small weights should be provided for such kind of and more popular. They have achieved promising perfor- behaviors. Take another example, we can consider the regu- lar three meals a day as a sequence. The connection may be closer among breakfasts instead of between breakfast and Table 1: Notation lunch or dinner. Therefore, the short-term interest should be Notation Explanation carefully examined and needs to be integrated with the long- U, I, Iu set of users, set of items, sequence of items of user u term dependency. p, q positive item, negative item t In this paper, we propose a Hierarchical Contextual x^upq preference of user u towards item p and q at the t-th time step Attention-based GRU (HCA-GRU) network to address the d dimensionality of input vector above problem. It can greatly strengthen the user’s short- x, h input and hidden state of GRU term interest represented in each hidden state, and make a xc, hc contextual input, contextual hidden state w w non-linear combination of long-term dependency and short- x, h window widths U, W , b transition matrices and bias of GRU term interest to obtain user’s overall interest. The first level V transition matrix for xc of our HCA-GRU is conducted on the input to make hidden E, F embedding matrices for h and hc states stronger. When we input the current item to GRU, we ho overall interest of h and hc also input a contextual input containing information of pre- rx, Qx weight vector and matrix for input vious inputs. Inspired by a learning framework for word vec- rh, Qh weight vector and matrix for hidden state tors in Natural Language Processing (NLP) (Le and Mikolov ax, ah attention weights 2014), we firstly collect several recent inputs as the contex- tual information at each time step. Then the attention mecha- nism is accordingly introduced to assign appropriate weights (Liu et al. 2016b). Incorporated with geographical distance to select important inputs. The contextual input is a weighted and time interval information, RNN gains the state-of-the-art sum of this context. The second level of our HCA-GRU is performance for location prediction (Liu et al. 2016c). The executed on the hidden state to make a fusion of recent hid- RLBL model combines RNN and Log-BiLinear (Mnih and den states. Each time, the user’s overall interest is a com- Hinton 2007) to make multi-behavioral prediction (Liu, Wu, bination of the current hidden state and a contextual hid- and Wang 2017). Furthermore, extensions of RNN like long- den state. Encouraged by the global context memory initial- short term memory (LSTM) (Hochreiter and Schmidhuber ized by all hidden states with average pooling in a Computer 1997) and gated recurrent unit (GRU) (Cho et al. 2014) are Vision (CV) work (Liu et al. 2017), we use several recent greatly developed, as they can better hold the long-term de- hidden states to construct local contextual information. It is pendency (Bengio, Simard, and Frasconi 1994). Here, our used to establish the current contextual hidden state via at- proposed network relies on GRU. tention. Finally, parameters of HCA-GRU are learned by the Contextual information has been proven to be very im- Bayesian Personalized Ranking (BPR) framework (Rendle portant on different tasks. When learning word vectors in et al. 2009) and the Back Propagation Through Time (BPTT) NLP, the prediction of next word is based on the sum or algorithm (Werbos 1990). The main contributions are listed concatenation of recent input words (Le and Mikolov 2014). as follows: About 3D action recognition, GCA-LSTM network applies • We propose a novel hierarchical GRU network for se- all the hidden states in ST-LSTM (Liu et al. 2016a) to ini- quential recommendation which can combine the long- tialize global context memory by average pooling (Liu et al. term dependency and user’s short-term interest. 2017). On scene labeling, the patch is widely used. Episodic CAMN model makes contextualized representation for ev- • We propose to employ contextual attention-based model- ery referenced patch by using surrounding patches (Ab- ing to deal with the monotonic assumption of RNN meth- dulnabi et al. 2017). As the winner of the ImageNet ob- ods. This can automatically focus on critical information, ject detection challenge of 2016, GBD-Net employs differ- which greatly strengthens the user’s short-term interest. ent resolutions and contextual regions to help identify ob- • Experiments on two large real-world datasets reveal that jects (Zeng et al. 2016). Our contextual modeling is mainly the HCA-GRU network is very effective and outperforms focused on the input and hidden state of GRU, and we do the state-of-the-art methods.
Recommended publications
  • Combining Semi-Quantitative Risk Assessment, Composite Indicator
    www.nature.com/scientificreports OPEN Combining semi‑quantitative risk assessment, composite indicator and fuzzy logic for evaluation of hazardous chemical accidents Huyen Thi Thu Do1*, Tram Thi Bich Ly1 & Tho Tien Do2 In this study, a combination of semi‑quantitative risk assessment, composite indicator and fuzzy logic has been developed to identify industrial establishments and processes that represent potential environmental accidents associated with hazardous chemicals. The proposed method takes into consideration the root causes of risk probability of hazardous chemical accidents (HCAs), such as unsafe onsite storing and usage, inadequate operation training, poor safety management and analysis, equipment failure, and factors afected by the potential consequences of the accidents, including human health, water resources, and building and construction. These issues have been aggregated in a system of criteria and sub‑criteria, demonstrated by a list of non‑overlapping and exhaustive categorical terms. Semi‑quantitative risk assessment is then applied to develop a framework for screening industrial establishments that exhibit potential HCAs. Fuzzy set theory with triangular fuzzy number deals with the uncertainty associated with the data input and reduces the infuence of subjectivity and vagueness to the fnal results. The proposed method was tested among 77 industrial establishments located within the industrial zones of Ho Chi Minh City, Vietnam. Eighteen establishments were categorized as high HCA risk, 36 establishments were categorized as medium HCA risk, and 23 ones were of low HCA risk. The results are compatible with the practical chemical safety situation of the establishments and are consistent with expert evaluation. Hazardous chemical accidents (HCAs) can occur at any location in the production, use, storage, disposal or transportation cycle of the chemical.
    [Show full text]
  • CORRELATION BETWEEN the CHEMOTAXONOMIC CLASSIFICATIONS of the ESSENTIAL OILS of 48 Eucalyptus SPECIES HARVESTED from TUNISIA and THEIR PHYLOGENETIC CLASSIFICATION
    Journal of Experimental Biology and Agricultural Sciences, March - 2014; Volume – 2(1S) Journal of Experimental Biology and Agricultural Sciences http://www.jebas.org ISSN No. 2320 – 8694 CORRELATION BETWEEN THE CHEMOTAXONOMIC CLASSIFICATIONS OF THE ESSENTIAL OILS OF 48 Eucalyptus SPECIES HARVESTED FROM TUNISIA AND THEIR PHYLOGENETIC CLASSIFICATION Elaissi Ameur1*, Medini Hanene 1, Rouis Zied2, Khouja Mohamed Larbi3, Chemli Rachid1 and Harzallah-Skhiri Fethia1 1Laboratory of The Chemical, Galenic and pharmacological Drug Development, Faculty of Pharmacy, University of Monastir, Avenue Avicenne, 5019 Monastir, Tunisia. 2Laboratory of Genetic, Biodiversity and Bio-resources Valorisation, Higher Institute of Biotechnology of Monastir, University of Monastir, Avenue Tahar Haddad, 5000 Monastir, Tunisia. 3National Institute for Research on Rural Engineering, Water and Forestry, Institution of Agricultural Research and Higher Education, BP. N.2, 2080 Ariana, Tunisia. Received – January 24, 2014; Revision – February 19, 2014, Accepted – March 16, 2014 Available Online - March 31, 2014 KEYWORDS ABSTRACT Eucalyptus Various chemical classes (monoterpenes hydrocarbons, oxygenated monoterpenes, sesquiterpenes Essential oils hydrocarbons, oxygenated sesquiterpenes, esters, ketones, non classified coumpounds and non identified compounds) and twenty five of the main components from the essential oils of 48 Tunisian Eucalyptus 1, 8-cineole species has been reported. The compounds includes 1,8-cineole, torquatone, p-cymene, spathulenol, trans-pinocarveol,
    [Show full text]
  • IPC Classification
    A guide to current Internaonal Paralympic Commiee Classificaons Archery is open to athletes with a physical impairment and classificaon is broken up into three classes: ARW1: spinal cord and cerebral palsy athletes with impairment in all four limbs ARW2: wheelchair users with full arm funcon ARST (standing): athletes who have no impairments in their arms but who have some impairment in their legs. This group also includes amputees, les autres and cerebral palsy standing athletes. Some athletes in the standing group will sit on a high stool for support but will sll have their feet touching the ground. Athlecs uses a system of leers and numbers is used to disnguish between them. A leer F is for field athletes, T represents those who compete on the track, and the number shown refers to their impairment. 11-13: track and field athletes who are visually impaired 20: track and field athletes who are intellectually disabled 31-38: track and field athletes with cerebral palsy 41-46: track and field amputees and les autres T 51-56: wheelchair track athletes F 51-58: wheelchair field athletes Blind athletes compete in class 11 and are permied to run with a sighted guide, while field athletes in the class are allowed the use of acousc signals, for example electronic noises, clapping or voices, if they compete in the 100m, long jump or triple jump. Athletes in classes 42, 43 and 44 must wear a prosthesis while compeng, but this is oponal for classes 45 and 46. Boccia (a bowling game) is open to athletes with cerebral palsy and other severe physical impairments (eg, muscular dystrophy) who compete from a wheelchair, with classificaon split into four classes: BC1 : Athletes may compete with the help of an assistant, who must remain outside the athlete's playing box.
    [Show full text]
  • 9059 Medical Multiple 005
    Schedule of Accreditation issued by United Kingdom Accreditation Service 2 Pine Trees, Chertsey Lane, Staines-upon-Thames, TW18 3HR, UK HCA International Limited, operating as HCA Laboratories Issue No: 008 Issue date: 17 September 2021 Shropshire House Contact: Dorothy Kasibante Shropshire Place/Queens Yard Tel: +44 (0)207 034 4616 9059 Capper Street Fax: +44 (0)207 487 2721 London E-Mail: [email protected] Accredited to WC1E 6JA Website: hcalaboratories.co.uk ISO 15189:2012 Testing performed by the Organisation at the locations specified below Locations covered by the organisation and their relevant activities Laboratory locations: Location details Activity Location code Shropshire House Local contact Microbiology A Shropshire Place/Queens Yard Dorothy Kasibante Serology Capper Street Risk & Compliance Clinical Immunology London Manager Biochemistry WC1E 6JA Haematology Mr Charles Percy - CEO Blood Transfusion Administration Tel: +44 (0)207 486 5091 Referrals Molecular Specimen Reception Cytopathology (inc. gynaecological & diagnostic) Histopathology (inc. routine & special staining, Decalcification & frozen sections) London Bridge Hospital Laboratory Local contact Sample Reception B London bridge Hospital As above Biochemistry Tooley Street Haematology London Blood Transfusion SE1 2PR EBUS and Slide production, staining & mounting of fine needle aspirates for diagnostic cytology LOC laboratory Local contact Biochemistry C 59/97 Harley Street As above Haematology London W1G 6AF Sydney Street Lab Local contact Biochemistry
    [Show full text]
  • Strength Analysis of Hull Structures in Bulk Carriers
    CLASSIFICATION NOTES No. 31.1 STRENGTH ANALYSIS OF HULL STRUCTURES IN BULK CARRIERS JUNE 1999 DET N ORS KE VERIT AS Vcritasveien 1, N-1322 Hfl!vik, Norway Tel.: +47 67 57 99 00 Fax: +47 67 57 99 11 FOREWORD DET NORSKE VERJTAS is an autonomous and independent Foundation with the objective of safeguarding life, property and the environment at sea and ashore. DET NORSKE VERlTAS AS is a fully owned subsidiary Society of the Foundation. It undertakes classification and certification of ships, mobile offshore units, fixed oftshore structures, facilities and systems for shipping and other industries. The Society also carries out research and' development associated with these functions. DET NORSKE VERITAS operates a worldwide network ot survey stations and is authorised by more than 120 national administrations to carry out surveys and, in most cases, issue certificates on their behalf. Classification Notes Classification Notes are publications that give practical information on classification of ships and other objects. Examples of design solutions, calculation methods, specifications of test procedures, as well as acceptahle repair methods for some components are given as interpretations of the more general rule requirements. A list of Classification Notes is found in the latest edi6on of the Introduction booklets to the "Rules for Classification of Ships", the "Rules for Classification of Mobile Offshore Units" and the "Rules for Classification of High Speed and Light Craft". In "Rules for Classification of Fixed Offshore Installations", only those Classification Notes that are relevant for this type of structure, have been listed. The list of Classification Notes is also included in the current "Classification Services - Publications" issued by the Society, which is available on request.
    [Show full text]
  • A Hierarchical Contextual Attention-Based Network for Sequential Recommendation
    Neurocomputing 358 (2019) 141–149 Contents lists available at ScienceDirect Neurocomputing journal homepage: www.elsevier.com/locate/neucom A hierarchical contextual attention-based network for sequential recommendation ∗ Qiang Cui, Shu Wu , Yan Huang, Liang Wang Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA) and University of Chinese Academy of Sciences (UCAS), Beijing 100190, China a r t i c l e i n f o a b s t r a c t Article history: The sequential recommendation is one of the most fundamental tasks for Web applications. Recently, Received 2 June 2018 recurrent neural network (RNN) based methods become popular and show effectiveness in many se- Revised 28 March 2019 quential recommendation tasks, such as next-basket recommendation and location prediction. The last Accepted 25 April 2019 hidden state of RNN is usually applied as the sequence’s representation to make recommendations. RNN Available online 10 May 2019 can capture the long-term interest with the help of gated activations or regularizers but has difficulty in Communicated by Dr. B. Hammer acquiring the short-term interest due to the ordered modeling. In this work, we aim to strengthen the short-term interest, because it is beneficial to generate responsive recommendation according to recent Keywords: Sequential recommendation behaviors. Accordingly, we propose a Hierarchical Contextual Attention-based (HCA) network. First, RNN Recurrent neural network is extended to model several adjacent factors at each time step. Such kind of multiple factors can be con- Short-term interest sidered as a context where the short-term interest comes from.
    [Show full text]
  • USAF Oral History Program Interview of Mr. Doyle Northrup
    Declassified Under Authority of the lnteragency Security Classification Appeals Panel, E.O. 13526, sec. 5.3(b)(3) ISCAP Appeal No. 2015-107, document 1 Declassification Date: November 23, 2020 ~ . ~ -·~ "' It. q'":.i,t«: "' ....., IUL'tt\,.,r•• 'l'-•-·····".,t...... .,. Ftblft DECLA.88mEO Cl A .r:•··~ ,<"r; '.Y SUD,! S(H! r: 0 }L . j_ -;--.._ Wa,:,hington and llved t!ierc until l <;ZS. In 1934, be :married !ybll ...J <¾ vl vl Louiac Crosby of New .Bedford, Mas~achusetts. nb-tto'rwi!-p,f-----''---' pre;ccntly reside in Alexantl:ria., Yi:::gin.ia. ~.':. ile an undergrac!uito £tutlent at Whitman College, Walla Vi;,lla, Wanhington,. !n l <;Z&~ !...!r., Nvrth.rup wac elected to Ph:l ,Bqta I~appa. Uc x-cc-chcd a Bachelor of ~cicnee dc5rec uVlith Hono;rs" !rom. 1 \ /hitn::an in I r;z9., 1n 193Z, he was awarded a ~ta$'ter of Scie::ice degree £roni ?,!assz.cht:.st!t?.s L--istitt:.te o! Te:.chnolcgy. From 1932 to 1940 ho \'.'a~ a :l.esearcl>. /sut:oei,;.t<: wifa t!1e F!lyai<:s Department of Ma.uacliusetts Institute of Technolo::;y. Se ,,,as A ~cro- 1{,n de 6,va/f ber o! a f'our-nlZo.n technical grm.:.:o. .which de,.,..clo~cd the,1 electrostatic generator and demon:..t:-atcd its application as a v.:..!ua'ble tool i::i nuclear • From 1',40 to 1S'\13 Mr, Northrup wa~ associ.:t~eJ with th" Na...,al Or'1n:i.r.co Labor::.tory. E.'lrly i.n thl3 period he ~crvcd as Chfo! Pl_,y10ki,:t -..
    [Show full text]
  • ITERATED INDEX and the MEAN EULER CHARACTERISTIC 3 Little Interest for the MEC Calculations for Specific Manifolds
    ITERATED INDEX AND THE MEAN EULER CHARACTERISTIC VIKTOR L. GINZBURG AND YUSUF GOREN¨ Abstract. The aim of the paper is three-fold. We begin by proving a formula, both global and local versions, relating the number of periodic orbits of an iterated map and the Lefschetz numbers, or indices in the local case, of its iterations. This formula is then used to express the mean Euler characteristic (MEC) of a contact manifold in terms of local, purely topological, invariants of closed Reeb orbits, without any non-degeneracy assumption on the orbits. Finally, turning to applications of the local MEC formula to dynamics, we use it to reprove a theorem asserting the existence of at least two closed Reeb orbits on the standard contact S3 (by Cristofaro-Gardiner and Hutchings in the most general form) and the existence of at least two closed geodesics for a Finsler metric on S2 (Bangert and Long). Contents 1. Introduction 1 2. Iterated index 4 2.1. Iterated index of a map 4 2.2. Iterated index of a germ 6 3. Mean Euler characteristic 9 3.1. Notation and conventions 9 3.2. Examples 10 3.3. Local formula for the MEC 12 3.4. Preliminaries and the proof 13 3.5. Asymptotic Morse inequalities 17 4. Closed Reeb orbits on S3 19 References 23 arXiv:1311.0547v1 [math.SG] 3 Nov 2013 1. Introduction In this paper we prove an elementary formula relating the number of periodic orbits of an iterated map and the Lefschetz numbers of the iterations. We also establish a local version of this formula which connects the number of periodic orbits, suitably defined, of a germ at an isolated fixed point and the indices of its iterations.
    [Show full text]
  • USA CYCLING Annual Report 2008 USA Cycling National Headquarters
    USA CYCLING Annual REPORT 2008 USA CYCLING NATIONAL HEADQuarTERS 210 USA Cycling Point, Suite 100 • Colorado Springs, CO 80919 (T) 719.434.4200 • (F) 719.434.4300 (E) [email protected] • (W) www.usacycling.org EXECUTIVE STAFF Chief Executive Officer, Steve Johnson 719.434.4250 [email protected] Chief Operating Officer, Sean Petty 719.434.4252 [email protected] Chief Financial Officer, Todd Sowl 719.434.4255 [email protected] Human Resources Director/Executive Assistant, Debbie Francis 719.434.4257 [email protected] ACCOUNTING Accounting Manager, Bob Plutt 719.434.4266 [email protected] Accounting Assistant, Connie Ennis 719.434.4267 [email protected] Paralegal, Pam Castle 719.434.4270 [email protected] COMMUNICATIONS Director of Communications, Andrea Smith 719.434.4274 [email protected] Communications Manager, Keri Kahn 719.434.4272 [email protected] MARKETING Director of Sponsorship & Business Development, Jess Schwartzkopf 719.434.4278 [email protected] Sponsorship and Affinity Manager, Mercedes Hickey 719.434.4280 [email protected] ATHLETICS Director of Athletics/Endurance Program Manager, Jim Miller 719.434.4222 [email protected] Director of Logistics & Operations, Gregory Cross 719.434 [email protected] National Mountain Bike Development Director, Marc Gullickson 719.434.4232 [email protected] Sport Science & Coaching Education Manager, Sam Callan 719.434.4224 [email protected] Junior Programs Manager, Benjamin Sharp 719.434.4226 [email protected] BMX Program Director, Mike King 719.434.4230 [email protected] Sports Science Coach, Clay Worthington 719.434.4236 [email protected] Office & Programs Manager, Marlis Johnson 719.434.4225 [email protected] Cover: Kristin Armstrong became just the second woman in U.S.
    [Show full text]