
An Explainable Autoencoder For Collaborative Filtering Recommendation

Pegah Sagheb Haghighi, Olurotimi Seton, Olfa Nasraoui
Knowledge Discovery and Web Mining Lab, Dept. of Computer Science and Engineering, University of Louisville, Louisville
[email protected], [email protected], [email protected]

arXiv:2001.04344v1 [cs.IR] 23 Dec 2019

ABSTRACT

Autoencoders are a common building block of Deep Learning architectures, where they are mainly used for representation learning. They have also been successfully used in Collaborative Filtering (CF) recommender systems to predict missing ratings. Unfortunately, like all black box models, they are unable to explain their outputs. Hence, while predictions from an Autoencoder-based recommender system might be accurate, it might not be clear to the user why a recommendation was generated. In this work, we design an explainable recommendation system using an Autoencoder model whose predictions can be explained using the neighborhood-based explanation style. Our preliminary work can be considered a first step towards an explainable deep learning architecture based on Autoencoders.

KEYWORDS

recommender systems, explainability, neural networks, autoencoder, deep learning

1 INTRODUCTION

The use of information filtering tools that discover or suggest relevant and personalized items has become essential to avoid information overload. For instance, recommender systems assist users by providing them with personalized suggestions. They recommend items in a variety of domains (music, movies, books, travel, etc.). However, the most accurate recommendation models tend to be black boxes [7] that cannot justify why items are recommended to a user. The ability of a recommender system to explain the reasoning behind its recommendations can serve as a bridge between humans and recommender systems. Explanations can serve different purposes, such as building trust, improving transparency and user satisfaction, and helping users make informed decisions [7, 14].

Latent factor models have been the state of the art in Collaborative Filtering recommender systems. For instance, Matrix Factorization (MF) learns hidden interactions between entities to predict possible user ratings for items. However, a common challenge with this method is the difficulty of explaining recommendations to users using the latent dimensions. To solve this problem, some explainable recommendation algorithms have recently been proposed. For instance, Explicit Factor Models (EFM) [17] generate explanations based on the explicit features extracted from users' reviews. Explainable Matrix Factorization (EMF) [1, 3] is another algorithm that uses an explainability regularizer, or soft constraint, in the objective function of classical matrix factorization. The constraining term tries to bring the user's and the explainable item's latent factor vectors closer, thus favoring the appearance of explainable items at the top of the recommendation list. When it comes to Autoencoder recommender models, however, very little research has been conducted to address the explainability of the predictions. In this paper, we propose an Autoencoder model that finds top-n recommendations that are explainable without the use of additional data modalities such as content.

The rest of the paper is organized as follows: Section 2 reviews related work, Section 3 presents a novel explainable Autoencoder for Collaborative Filtering, Section 4 presents the experimental results, and Section 5 concludes the paper. For clarity, all notations used in this paper are summarized in Table 1.

Table 1: Summary of notations

U : set of all users
I : set of all items
R : rating matrix
r^(u) : sparse rating vector of user u
r^(i) : sparse rating vector of item i
R^test : ratings in the test set
U_{i,x} : set of users who gave rating x to item i
N_u : set of users who are most similar to user u
σ : sigmoid activation function
σ′ : identity activation function
b, b′ : biases
W_1, W_2 : connection weights between layers
E : explainability matrix
e : explainability vector

2 RELATED WORK

Recently, a growing body of research has involved using neural networks such as Autoencoders for collaborative filtering. An Autoencoder is an unsupervised learning model and a form of neural network which aims to learn important features of the dataset. Architecturally, an Autoencoder is a three-layer feed-forward neural network consisting of an input layer, a hidden layer and an output layer. Figure 1 depicts a classical Autoencoder architecture. An Autoencoder consists of two parts, an encoder and a decoder. The encoder maps input features into a hidden representation and the decoder aims to reconstruct the input features from the hidden features [8]. The encoder and decoder may each have a deep architecture consisting of multiple hidden layers. It is worth noting that the output layer has the same dimension, or number of neurons, as the input layer.

Figure 1: An Autoencoder Architecture

A collaborative filtering based Autoencoder was introduced by Sedhain et al. [11] and Ouyang et al. [10]. They achieved a better accuracy than state-of-the-art methods such as Matrix Factorization at predicting the missing values of the user-item matrix. Their model works as follows: given a hidden layer, the encoder section of the Autoencoder takes the ratings data r of each user and maps it to a latent representation h, see Eq. (1). User preferences are encoded in a sparse matrix of ratings R ∈ R^{m×n}, where m and n are the number of users and items, respectively. U = {1, ..., m} represents the set of all users and I = {1, ..., n} represents the set of all items. Each user u ∈ U is represented by a sparse vector r^(u) = {R_{u1}, ..., R_{un}}, and each item i ∈ I is represented by a sparse vector r^(i) = {R_{1i}, ..., R_{mi}}. The hidden or latent representation is given by

$$ h = \sigma(W_1 \cdot r + b) \qquad (1) $$

where σ is the activation function, in this case the sigmoid function σ(x) = 1/(1 + e^{-x}), W_1 is a weight matrix, and b is a bias vector. The decoder maps the latent representation h into a reconstruction output r̂ given by

$$ \hat{r} = \sigma'(W_2 \cdot h + b') \qquad (2) $$

where σ′ is an activation function, in this case the identity. The goal of the Autoencoder is to minimize the reconstruction error by minimizing the following reconstruction loss function

$$ \min \sum_{u} \| r^{(u)} - \hat{r}^{(u)} \|^2 + \frac{\lambda}{2} \left( \| W_1 \|_F^2 + \| W_2 \|_F^2 \right) \qquad (3) $$

where λ is the regularization coefficient and ‖·‖_F is the Frobenius norm.
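To make Eqs. (1)-(3) concrete, the following is a minimal NumPy sketch of this U-AutoRec-style forward pass and regularized reconstruction loss. It is an illustrative re-implementation rather than the authors' code; in particular, restricting the squared error to observed ratings (the mask below) is standard AutoRec practice and an assumption on our part, since Eq. (3) is written over the full sparse vectors.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def autorec_forward(r, W1, W2, b, b_prime):
    """Eqs. (1)-(2): encode one user's sparse rating vector r and reconstruct it.

    r       : (n,)   ratings of one user, 0 marks a missing rating
    W1      : (k, n) encoder weights        b       : (k,) encoder bias
    W2      : (n, k) decoder weights        b_prime : (n,) decoder bias
    """
    h = sigmoid(W1 @ r + b)        # Eq. (1): sigmoid encoder
    r_hat = W2 @ h + b_prime       # Eq. (2): identity-activation decoder
    return r_hat

def autorec_loss(R, W1, W2, b, b_prime, lam=0.01):
    """Eq. (3): reconstruction error plus L2 (Frobenius) weight regularization."""
    loss = 0.0
    for r in R:                                   # one row of R per user
        mask = (r > 0).astype(float)              # assumption: score only observed ratings
        r_hat = autorec_forward(r, W1, W2, b, b_prime)
        loss += np.sum((mask * (r - r_hat)) ** 2)
    return loss + (lam / 2.0) * (np.sum(W1 ** 2) + np.sum(W2 ** 2))
```

In practice this loss is minimized with ordinary gradient-based training (backpropagation), exactly as for any feed-forward network.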
Wu et al. [15] introduced the Collaborative Denoising Auto-Encoder (CDAE), which has one hidden layer that encodes a latent vector for the user. Strub and Mary [13] proposed a Stacked Denoising AutoEncoder neural network (SDAE) with sparse inputs for recommender systems. Zhang et al. [16] developed an Autoencoder model which learns feature representations from side information to improve recommendations.

When it comes to neural network models, there are a few related works that focus on explainable recommender systems. Seo et al. [12] designed an interpretable Convolutional Neural Network (CNN) model that predicts ratings using text reviews and rating data. The authors used two attention layers: a local attention layer and a global attention layer. The local layer learns which keywords are more informative, whereas the global layer learns word sequences from the original review. Finally, the outputs of these two layers are combined and passed through fully connected layers.

Costa et al. [6] presented a model based on a Long Short Term Memory (LSTM) Recurrent Neural Network (RNN) to automatically generate natural language explanations for recommender systems. The authors tried to improve the work proposed by [9] by considering a vector of auxiliary data instead of only one dimension of auxiliary information. The vector of auxiliary data in their model is a set of rating scores for different features of items.

Chen et al. [5] used both implicit feedback and textual reviews to introduce a visually explainable recommendation. By combining images and reviews, their model generates natural language explanations which can describe the highlighted image regions to the user.

Recently, Bellini et al. [4] introduced the SEM-AUTO approach, which makes use of Autoencoders and semantic information via the DBpedia knowledge graph. The authors mapped a set of categorical features of an item into the hidden layers for each user. As a result, this model is capable of retrieving the categorical information of the items rated by the user. The proposed model aims to capture the shared side information of all the items rated by a user. Hence, the information associated with positively rated items gets a value close to 1, while that of negatively rated items tends to be closer to 0. This approach can therefore help Autoencoders find the potential features that match a user's tastes.

To summarize our review, neural network models, and in particular Autoencoders, have recently received increasing attention in the Recommender Systems field. However, like other latent factor models, Autoencoders are black-box models that are unable to provide an explanation for their output. The research questions that we are trying to answer are: (1) Can we generate an explainable Collaborative Filtering recommender system using the Autoencoder architecture? (2) Can the explainable Autoencoder maintain an accuracy that is comparable to the classical Autoencoder?
3 THE EXPLAINABLE AUTOENCODER (E-AUTOREC)

Our work is inspired by U-AutoRec [11] and [16], with one important distinction: we feed an explainability vector of the same size as the ratings to additional input layer units, in a way that is similar to adding an explainability layer to the Explainable Restricted Boltzmann Machine [2]. The architecture of our model is shown in Figure 2. In [13], the rationale for adding side information to the Autoencoder inputs in the form of an additional (hybrid) modality, such as user profile features, was that when many input ratings are missing, the side information would take over to solve the cold start problem or to alleviate the sparsity of the ratings. In our case, the side information consists of the explainability scores, which likewise can help compensate for the sparsity of the ratings. But unlike the hybrid side information in [13], our side information makes up for the lack of rating data using the explainability of an item to a user, which is itself a byproduct of the ratings only and not of external or hybrid data. In fact, the explainability of an item to a user is derived directly from the rationale of collaborative filtering, by aggregating the nearest neighboring users' ratings on that item. In the following, the reader should refer to Table 1 as a reference for all the notation symbols used in our equations.

Figure 2: Explainable AutoEncoder (E-AutoRec)

The proposed encoder takes as input each user's ratings vector r ∈ R^n and a user's explainability score vector e ∈ R^n that is pre-computed offline for each user (calculated as explained in Eqs. (5)-(7), similarly to the explainability scores in [3]), and maps them to the latent feature vector h ∈ R^k, where k is the number of hidden units, as follows

$$ h = \sigma\left( W_1^{r+e} \cdot (r + e) + b \right) \qquad (4) $$

while both the output, see Eq. (2), and the loss function, see Eq. (3), remain the same. It is worth noting that W_1^{r+e} ∈ R^{2×n×k} is the connection weight matrix between the hidden layer and the input layer, which accepts both the ratings and the explainability scores.
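The following is a minimal sketch of the modified encoder in Eq. (4), under our reading that r and e are concatenated into a single 2n-dimensional input feeding the additional input layer units (so the encoder weight matrix has shape (k, 2n)); the function and variable names are ours, not the authors'.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def e_autorec_forward(r, e, W1_re, W2, b, b_prime):
    """Eq. (4): encode the ratings together with the pre-computed explainability scores.

    r, e    : (n,)    one user's ratings and explainability scores (row u of E)
    W1_re   : (k, 2n) encoder weights over the concatenated input [r; e]
    W2      : (n, k)  decoder weights; output (Eq. 2) and loss (Eq. 3) are unchanged
    """
    x = np.concatenate([r, e])     # additional input units carry e alongside r
    h = sigmoid(W1_re @ x + b)     # Eq. (4)
    r_hat = W2 @ h + b_prime       # only the ratings are reconstructed
    return r_hat
```

Only the encoder input changes; the decoder and the reconstruction loss are exactly those of the baseline AutoRec, which is why explainability need not be traded against prediction accuracy.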

We computed the explanation scores offline based on a collaborative filtering neighborhood rationale. The nearest neighbors are determined based on cosine similarity. The explainability score for the user-based neighbor explanation style [3] is given by

$$ \mathrm{Exp.Score}(u,i) = E(r_{u,i} \mid N_u) = \sum_{x \in X} x \times \Pr(r_{u,i} = x \mid v \in N_u) \qquad (5) $$

where X is the set of possible rating values and

$$ \Pr(r_{u,i} = x \mid v \in N_u) = \frac{|N_u \cap U_{i,x}|}{|N_u|} \qquad (6) $$

Here, r_{u,i} is the rating user u gave to item i, U_{i,x} is the set of users who have given the same rating x to item i, and N_u is the set of neighbors of user u.

The explainability matrix E is a thresholded version of the neighborhood explanation scores [3], as follows

$$ E_{u,i} = \begin{cases} \mathrm{Exp.Score}(u,i) & \text{if } \mathrm{Exp.Score}(u,i) > \theta \\ 0 & \text{otherwise} \end{cases} \qquad (7) $$

where θ is a user-defined threshold value that decides whether item i is explainable to user u, and Exp.Score(u,i) is the expected value calculated in Eq. (5).
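As a concrete illustration of the offline pre-computation in Eqs. (5)-(7), here is a small sketch that builds the explainability matrix E from a dense user-item rating matrix (0 = missing), using cosine similarity between users' rating vectors to pick the neighbors; the function name, loop structure and default parameter values are our own choices.

```python
import numpy as np

def explainability_matrix(R, n_neighbors=50, theta=0.0, rating_values=(1, 2, 3, 4, 5)):
    """Eqs. (5)-(7): expected neighbor rating per (user, item), thresholded at theta.

    R : (m, n) rating matrix with 0 for missing ratings.
    """
    m, n = R.shape
    # cosine similarity between users' rating vectors
    norms = np.linalg.norm(R, axis=1, keepdims=True) + 1e-12
    sim = (R / norms) @ (R / norms).T
    np.fill_diagonal(sim, -np.inf)                      # a user is not their own neighbor
    E = np.zeros((m, n))
    for u in range(m):
        N_u = np.argsort(sim[u])[-n_neighbors:]         # indices of the nearest neighbors
        for i in range(n):
            # Eq. (6): Pr(r_ui = x | v in N_u) = |N_u ∩ U_{i,x}| / |N_u|
            # Eq. (5): expected value of the neighbors' ratings on item i
            score = sum(x * np.mean(R[N_u, i] == x) for x in rating_values)
            E[u, i] = score if score > theta else 0.0   # Eq. (7): threshold at theta
    return E
```

With θ = 0 (the setting used in the experiments below), any item that at least one neighbor has rated receives a non-zero explainability score.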

4 EXPERIMENTAL RESULTS

4.1 Data and Metrics

We tested our approach on the MovieLens benchmark data. This dataset consists of 100K ratings on a scale of 1-5, given by 943 users to 1682 movies, where each user has rated at least 20 movies. The rating data was randomly split into 90% training and 10% testing subsets, and the ratings were normalized between 0 and 1. We compared our results with the user-based AutoRec baseline method [11], which uses the same input as our Autoencoder (that is, ratings). We did not compare to [13] and [16] because they are hybrid approaches benefiting from an additional user feature modality, unlike our pure rating-based collaborative filtering approach.

To assess the rating prediction accuracy, we use the Root Mean Squared Error (RMSE), see Eq. (8), for both the AutoRec baseline and the proposed E-AutoRec (Explainable Autoencoder), as shown in Figure 3, as well as the Mean Average Precision (MAP). We also measured the explainability of the models using the Mean Explainability Precision (MEP) given in Eq. (9), as proposed by [1].

$$ \mathrm{RMSE} = \sqrt{ \frac{1}{|R^{test}|} \sum_{(u,i) \in R^{test}} \left( r_{u,i} - \hat{r}_{u,i} \right)^2 } \qquad (8) $$

where |R^test| is the total number of ratings in the test set, r_{u,i} is the true rating, and r̂_{u,i} is the predicted rating for user u and item i.

Figure 3: RMSE vs. epochs on the test set

The Mean Explainability Precision (MEP) for the top-n recommended list is computed as follows: let I_rec be the top-n recommended list for user u and I_exp be the set of explainable items for user u. Then

$$ \mathrm{MEP@}n = \frac{1}{|U|} \sum_{u \in U} \frac{|I_{exp} \cap I_{rec}|}{|I_{rec}|} \qquad (9) $$

where |I_exp ∩ I_rec| is the number of explainable items present in the top-n recommendation list for user u, and |U| is the number of users in the test set.

Finally, to evaluate the top-n recommendation performance, we compute the Mean Average Precision (MAP), shown in Figure 4.
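The two measures in Eqs. (8)-(9) can be computed as in the short sketch below; the dictionary/set representation of the test ratings, top-n lists and explainable-item sets is our own choice for illustration.

```python
import numpy as np

def rmse(test_ratings, predictions):
    """Eq. (8): root mean squared error over the test set.

    test_ratings, predictions : dict mapping (user, item) -> true / predicted rating
    """
    errors = [(r - predictions[ui]) ** 2 for ui, r in test_ratings.items()]
    return float(np.sqrt(np.mean(errors)))

def mep_at_n(top_n_lists, explainable_items):
    """Eq. (9): mean fraction of explainable items in each user's top-n list.

    top_n_lists       : dict user -> list of the n recommended items (I_rec)
    explainable_items : dict user -> set of items with E[u, i] > 0 (I_exp)
    """
    fractions = []
    for u, rec in top_n_lists.items():
        hits = len(set(rec) & explainable_items.get(u, set()))
        fractions.append(hits / len(rec))
    return float(np.mean(fractions))
```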

4.2 Analysis of Results

The results in Figures 3-6 show that the proposed Explainable Autoencoder (E-AutoRec) succeeds in making predictions that are at least as accurate as those of the baseline Autoencoder (AutoRec), while promoting the explainability of the items in the top-n recommendation list (MEP is higher across all tested parameter ranges).

Figure 4 shows the MAP (a) and MEP (b) metrics for both models as the number of recommended items (n) increases. In all experiments where we varied a parameter, we fixed the remaining parameters as follows: the explainability threshold θ = 0, the neighborhood size |N_u| = 50, and the number of hidden units k = 300. In the case of top-n recommendations, both models appear to perform well up to a top-20 recommendation list, beyond which both suffer a significant drop in performance (Figure 4a). However, the MEP for both models improves well beyond the top 20 recommended items (Figure 4b).

Figure 4: (a) MAP and (b) MEP vs. n, the size of the top-n recommendation list

To study the effect of the number of hidden units and of the neighborhood size (used for generating the explainability matrix E), we varied both values and measured the resulting MAP and MEP metrics, as shown in Figure 5 and Figure 6. These results confirm that the accuracy and explainability are significantly better in E-AutoRec, compared to the baseline AutoRec, across all parameter ranges. Although at first one's intuition is that accuracy has to pay a price for each increase in explainability, this does not occur in E-AutoRec. This is because the explainability values are integrated within the actual machine learning process, and the learning process is based on the same reconstruction loss function for rating prediction, thus maintaining the prediction accuracy. However, explainability is enhanced because the explainability scores are used as side information, thus compensating for sparse inputs, and as a result effectively enhancing both the accuracy and the explainability of the predictions.

Figure 5: (a) MAP and (b) MEP vs. number of hidden units

Figure 6: (a) MAP and (b) MEP vs. number of neighbors used to compute the explainability scores; (c) MEP vs. explainability threshold θ

It is worth noting that the neighborhood size used to compute the explainability scores acts like a control parameter that could lead to a trade-off between accuracy and explainability: notice how more neighbors increase the explainability precision (MEP) but only slightly (and slowly) reduce MAP; even with this small decrease, however, MAP for E-AutoRec remains higher than for the non-explainable baseline (AutoRec). We also investigated the effect of the explainability threshold θ on the explainability metric (Figure 6c). While our proposed E-AutoRec model maintains its higher explainability in comparison with the conventional AutoRec, the explainability decreases for both methods as we impose more restrictions (higher θ) on the explainability score. This result is expected since a higher threshold reduces the number of explainable items relative to a user (resulting in more zeros in the explainability matrix E).

5 CONCLUSION

We proposed an explainable Autoencoder approach (E-AutoRec) which feeds, in addition to the user ratings to be reconstructed, user-neighborhood collaborative filtering justified explainability scores as side information. Our experiments showed that E-AutoRec outperforms the baseline in terms of RMSE, explainability and MAP metrics for a wide range of tested parameters. The results demonstrate that using the pre-computed explainability scores as additional side information fed to the input layer helps the Autoencoder make predictions that are simultaneously more accurate and more explainable. Thus there was no sacrifice in accuracy to pay for the increase in explainability of E-AutoRec. This is because the explainability values are integrated within the actual machine learning process, which keeps the same reconstruction loss function as the pure AutoRec model. However, explainability is enhanced specifically because the explainability scores are used as side information to compensate for sparse inputs in the Autoencoder architecture, thus effectively enhancing both the accuracy and the explainability of the predictions. In the future, we plan to compare the proposed model with additional baseline models and investigate additional explainable Deep Learning architectures.
REFERENCES

[1] Behnoush Abdollahi and Olfa Nasraoui. 2016. Explainable matrix factorization for collaborative filtering. In Proceedings of the 25th International Conference Companion on World Wide Web. International World Wide Web Conferences Steering Committee, 5-6. https://doi.org/10.1145/2872518.2889405
[2] Behnoush Abdollahi and Olfa Nasraoui. 2016. Explainable restricted Boltzmann machines for collaborative filtering. arXiv preprint arXiv:1606.07129 (2016).
[3] Behnoush Abdollahi and Olfa Nasraoui. 2017. Using explainability for constrained matrix factorization. In Proceedings of the Eleventh ACM Conference on Recommender Systems. ACM.
[4] Vito Bellini, Vito Walter Anelli, Tommaso Di Noia, and Eugenio Di Sciascio. 2017. Auto-Encoding User Ratings via Knowledge Graphs in Recommendation Scenarios. In Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems. ACM, 60-66.
[5] Xu Chen, Yongfeng Zhang, Hongteng Xu, Yixin Cao, Zheng Qin, and Hongyuan Zha. 2018. Visually Explainable Recommendation. arXiv preprint arXiv:1801.10288 (2018).
[6] Felipe Costa, Sixun Ouyang, Peter Dolog, and Aonghus Lawlor. 2018. Automatic Generation of Natural Language Explanations. In Proceedings of the 23rd International Conference on Intelligent User Interfaces Companion. ACM, 57.
[7] Jonathan L. Herlocker, Joseph A. Konstan, and John Riedl. 2000. Explaining collaborative filtering recommendations. In Proceedings of the 2000 ACM Conference on Computer Supported Cooperative Work. ACM, 241-250.
[8] Geoffrey E. Hinton and Ruslan R. Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786 (2006), 504-507. https://doi.org/10.1126/science.1127647
[9] Zachary C. Lipton, Sharad Vikram, and Julian McAuley. 2015. Capturing meaning in product reviews with character-level generative text models. arXiv preprint arXiv:1511.03683 (2015).
[10] Yuanxin Ouyang et al. 2014. Autoencoder-based collaborative filtering. In International Conference on Neural Information Processing. Springer, Cham, 284-291.
[11] Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. 2015. AutoRec: Autoencoders meet collaborative filtering. In Proceedings of the 24th International Conference on World Wide Web. ACM, 111-112.
[12] Sungyong Seo, Jing Huang, Hao Yang, and Yan Liu. 2017. Interpretable convolutional neural networks with dual local and global attention for review rating prediction. In Proceedings of the Eleventh ACM Conference on Recommender Systems. ACM, 297-305.
[13] Florian Strub and Jeremie Mary. 2015. Collaborative filtering with stacked denoising autoencoders and sparse inputs. In NIPS Workshop on Machine Learning for eCommerce.
[14] Nava Tintarev and Judith Masthoff. 2007. Effective explanations of recommendations: user-centered design. In Proceedings of the 2007 ACM Conference on Recommender Systems. ACM, 153-156.
[15] Yao Wu, Christopher DuBois, Alice Zheng, and Martin Ester. 2016. Collaborative denoising auto-encoders for top-n recommender systems. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining. ACM, 153-162. https://doi.org/10.1145/2835776.2835837
[16] Shuai Zhang, Lina Yao, Xiwei Xu, Sen Wang, and Liming Zhu. 2017. Hybrid collaborative recommendation via semi-autoencoder. In International Conference on Neural Information Processing. Springer, Cham.
[17] Yongfeng Zhang, Guokun Lai, Min Zhang, Yi Zhang, Yiqun Liu, and Shaoping Ma. 2014. Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. ACM, 83-92.
