Title and link (description as bullets below each title)

Improving Distributional Similarity with Lessons Learned from Word ...
  • similarity
  • analogy
Dependency-Based Word Embeddings
  • qualitative: compare neighborhoods across embeddings
  • word similarity
Multi-Granularity Chinese Word Embedding
  • word similarity
  • analogy
  • qualitative: local neighborhoods
A Mixture Model for Learning Multi-Sense Word Embeddings
  • word similarity
  • analogy
Learning Crosslingual Word Embeddings without Bilingual Corpora
  • multilingual
  • word similarities (monolingual)
Word Embeddings with Limited Memory
  • word similarity
  • phrase similarity
A Latent Variable Model Approach to PMI-based Word Embeddings
  • analogy
Prepositional Phrase Attachment over Products
  • extrinsic
A Simple Word Embedding Model for Lexical Substitution
  • lexical substitution (nearest neighbors after vector averaging)
Segmentation-Free Word Embedding for Unsegmented Languages
  • extrinsic
Evaluation methods for unsupervised word embeddings
  • word similarity
  • analogy
  • concept categorization
  • selectional preference
Dict2vec: Learning Word Embeddings using Lexical Dictionaries
  • word similarity
  • extrinsic
Task-Oriented Learning of Word Embeddings for Semantic Relation ...
  • extrinsic
Refining Word Embeddings for
  • extrinsic
Language classification from bilingual word embedding graphs
  • word similarity
Learning principled bilingual mappings of word embeddings while ...
  • multilingual
A Word Embedding Approach to Identifying Verb-Noun Idiomatic ...
  • extrinsic
Deep Multilingual Correlation for Improved Word Embeddings
  • word & bigram similarity
Siamese CBOW: Optimizing Word Embeddings for Sentence ...
  • sentence similarity (word averaging + nearest neighbor)
Spectral Graph-Based Method of Multimodal Word Embedding
  • word similarity
Cross-lingual Models of Word Embeddings: An Empirical Comparison
  • multilingual
How to Train good Word Embeddings for Biomedical NLP
  • word similarity
  • extrinsic
D-GloVe: A Feasible Least Squares Model for Estimating Word ...
  • word similarity
  • analogy
The Interplay of Semantics and Morphology in Word Embeddings
  • word similarity
  • qualitative: morphological features (of nearest neighbors)
Multilingual Training of Crosslingual Word Embeddings
  • word similarity
  • multilingual
Learning Sentiment-Specific Word Embedding for Twitter Sentiment ...
  • extrinsic
Unsupervised Morphology Induction Using Word Embeddings
  • word similarity
Word Embedding Distance Pattern for Keyphrase Classification in ...
  • extrinsic
Revisiting Word Embedding for Contrasting Meaning
  • find most contrasting word in a set of candidates (similar to analogy task in this particular embedding)
AutoExtend: Extending Word Embeddings to Embeddings for ...
  • synset / lexeme similarity (similar to word similarity in this particular embedding)
A Comparison of Word Embeddings for English and Cross-Lingual ...
  • multilingual
Modeling Context Words as Regions: An Ordinal Regression ...
  • word similarity
  • analogy
Mimicking Word Embeddings using Subword RNNs
  • word similarity
  • qualitative: nearest neighbors
Morphological Priors for Probabilistic Neural Word Embeddings
  • extrinsic
  • word similarity
  • qvec
An Error-Oriented Approach to Word Embedding Pre-Training
  • find unlikely word in a sentence to predict errors
Adapting Pre-trained Word Embeddings For Use In Medical Coding
  • extrinsic
The Role of Context Types and Dimensionality in Learning Word ...
  • extrinsic
  • word similarity
Entity Extraction in Biomedical Corpora: An Approach to Evaluate ...
  • extrinsic
A Strong Baseline for Learning Cross-Lingual Word Embeddings ...
  • multilingual
Symmetric Pattern Based Word Embeddings for Improved Word ...
  • word similarity
  • analogy
Word Embedding-based Antonym Detection using Thesauri and ...
  • synonym / antonym detection (specifically encoded in embedding)
Diachronic Word Embeddings Reveal Statistical Laws of Semantic ...
  • nearest neighbors (multiple embeddings)
On Approximately Searching for Similar Word Embeddings
  • nearest neighbors (strategies to make search more efficient)
Evaluation of acoustic word embeddings
  • embeds acoustic samples instead of written words
Using word embedding for bio-event extraction
  • extrinsic
Investigating Language Universal and Specific Properties in Word ...
  • extrinsic
Using Word Embedding for Cross-Language Plagiarism Detection
  • multilingual
Word Re-Embedding via Manifold Dimensionality Retention
  • word similarity
Intrinsic Subspace Evaluation of Word Embedding Representations
  • extrinsic
Learning Semantic Word Embeddings based on Ordinal Knowledge ...
  • extrinsic
  • synonym selection
  • word similarity
  • sentence completion (using co-occurrence probability)
Beyond Bilingual: Multi-sense Word Embeddings using Multilingual ...
  • multilingual
Determining Gains Acquired from Word Embedding Quantitatively ...
  • extrinsic
Specializing Word Embeddings for Similarity or Relatedness
  • extrinsic
  • synonym selection
Right-truncatable Neural Word Embeddings
  • word similarity
  • sentence completion (using co-occurrence probability)
Bilingual Word Embeddings from Parallel and Non-parallel Corpora ...
  • multilingual
  • extrinsic
MGNC-CNN: A Simple Approach to Exploiting Multiple Word ...
  • extrinsic
Intrinsic Evaluations of Word Embeddings: What Can We Do Better?
  • critical discussion of evaluation methods; call for more exploratory analysis of word embeddings
Tracing armed conflicts with diachronic word embedding models
  • nearest neighbors (multiple embeddings)
Analyzing Word Embeddings through Multilingual Evaluation
  • extrinsic
Automated WordNet Construction Using Word Embeddings
  • multilingual
Word Embeddings as Metric Recovery in Semantic Spaces
  • analogy
  • series completion (based on vector offset; similar to analogy)
  • concept categorization
Predicting the Compositionality of Nominal Compounds: Giving ...
  • identify nouns (through nearest neighbor search)
Better Summarization Evaluation with Word Embeddings for ROUGE
  • text summary evaluation (based on similarity of phrases to texts obtained through averaging word vectors)
Nonparametric Spherical Topic Modeling with Word Embeddings
  • extrinsic
Exploring Word Embedding for Drug Name Recognition
  • extrinsic
Word Embeddings based on Fixed-Size Ordinally Forgetting Encoding
  • word similarity
Lexical Comparison Between Wikipedia and Twitter Corpora by ...
  • linguistic study (with lots of uses of word similarity)
A Simple Regularization-based Algorithm for Learning Cross ...
  • multilingual
  • extrinsic
PPDB 2.0: Better paraphrase ranking, fine-grained entailment ...
  • that now supports word embeddings
Centroid-based Text Summarization through Compositionality of ...
  • text summarization (based on similarity of phrases to texts obtained through averaging word vectors)
Word Similarity Based on Word Embedding and Knowledge Base
  • word similarity
Bilingual Word Embeddings for Phrase-Based
  • multilingual
  • extrinsic
Predicting Polarities of Tweets by Composing Word Embeddings ...
  • extrinsic
Unsupervised POS Induction with Word Embeddings
  • extrinsic
Semantic Annotation Aggregation with Conditional Crowdsourcing ...
  • extrinsic
Bilingual Word Embeddings from Non-Parallel Document-Aligned ...
  • multilingual
Recognizing in Twitter Using Word Embeddings
  • extrinsic
Arabic Textual Entailment with Word Embeddings
  • extrinsic
Comparing Fifty Natural Languages and Twelve Genetic Languages ...
  • multilingual
  • word similarity
  • compare embeddings of different languages to measure similarity
Cross-Lingual Word Embeddings for Low-Resource Language ...
  • multilingual
  • word similarity
Delexicalized Word Embeddings for Cross-lingual Dependency ...
  • extrinsic
How Well Can We Predict Hypernyms from Word Embeddings? A ...
  • extrinsic
Evaluating word embeddings with fMRI and eye-tracking
  • extrinsic
Adjusting Word Embeddings with Semantic Intensity Orders
  • semantic intensity (similar to word similarity)
An Improved Crowdsourcing Based Evaluation Technique for Word ...
  • embedding evaluation through crowd sourcing
Word Embedding for Response-To-Text Assessment of Evidence
  • extrinsic
Chinese Grammatical Error Diagnosis Using Single Word Embedding
  • extrinsic
Syntax-Aware Multi-Sense Word Embeddings for Deep ...
  • qualitative: nearest neighbors
  • word similarity
  • extrinsic
A Probabilistic Model for Learning Multi-Prototype Word Embeddings
  • qualitative: nearest neighbors
  • word similarity
Learning Sense-specific Word Embeddings By Exploiting Bilingual ...
  • multilingual
Learning bilingual word embeddings with (almost) no bilingual data
  • multilingual
Elucidating Conceptual Properties from Word Embeddings
  • study into what single dimensions of word embeddings encode
Temporal dynamics of semantic relations in word embeddings: an ...
  • conflict prediction (based on nearest neighbor search after linear mapping of vectors)
Morphological Word-Embeddings
  • measure encoded morphological information (through nearest neighbor search / comparison)
A Joint Model for Word Embedding and Word Morphology
  • word similarity
  • analogy
  • classify morpheme boundaries (based on an embedding of subwords)
Learning Compositionality Functions on Word Embeddings for ...
  • extrinsic
A Simple but Tough-To-Beat Baseline for Sentence Embeddings
  • extrinsic
  • sentence similarity (similar to word similarity)
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
  • measure gender bias by projecting words on concept axes within an embedding
Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors
  • word similarity
  • synonym selection
  • concept categorization
  • selectional preferences
  • analogy
Empath: Understanding Topic Signals in Large-Scale Text
  • word similarity
  • crowd sourcing
Retrofitting Word Vectors to Semantic Lexicons
  • word similarity
  • analogy (targeting syntactic relations)
  • synonym selection
  • extrinsic
What can you do with a rock? Affordance extraction via word embeddings
  • analogy
  • explore embeddings by mapping words on concept axes
Fast Training of Representations Using n-gram Corpora
  • multilingual
  • word similarity
  • extrinsic
A framework for analyzing semantic change of words across time
  • nearest neighbors (multiple embeddings)
  • sentiment analysis (compare sentiment words in the neighborhood in multiple embeddings)
Statistically Significant Detection of Linguistic Change
  • changes of a word vector over time (based on an alignment between multiple embeddings)
Linguistic Regularities in Sparse and Explicit Word Representations
  • analogy
Efficient Estimation of Word Representations in Vector Space
  • analogy (qualitative and quantitative)
Distributed Representations of Words and Phrases and their Compositionality
  • analogy (qualitative and quantitative)
GloVe: Global Vectors for Word Representation
  • word similarity
  • analogy
  • extrinsic
Learning Word Meta-Embeddings
  • word similarity
  • analogy
  • extrinsic
Evaluation of Word Vector Representations by Subspace Alignment
  • qvec
  • word similarity
  • extrinsic

Additional Vis / HCI papers (that are primarily discussed as related work):

Visual Tools for Debugging Neural Language Models
  • word similarity (and changes during training, including the words responsible for these changes)
  • activation patterns in the neural network used for training
  • corpus analysis during training
Visual Exploration of Semantic Relationships in Neural Word Embeddings
  • word similarity
  • analogy
  • nearest neighbors
Embedding Projector: Interactive visualization and interpretation of embeddings
  • compare embeddings
  • nearest neighbors
  • word similarity
  • finding global structure
  • meaningful directions (e.g., for analogy)
ConceptVector: Text Visual Analytics via Interactive Lexicon Building Using Word Embedding
  • nearest neighbors
  • concept synthesis based on nearest neighbors
cite2vec: Citation-Driven Document Exploration via Word Embeddings
  • special embedding (words & citations)
  • finding global structure (through 2D projection using t-SNE)
  • analyzing citations through their position in the word / citation space
  • analyzing documents through their position in the word space

• word similarity: 43
• analogy: 22
• neighborhoods (simple: 10 / compare: 7)
• synonym selection: 4
• concept projection: 2
• similarity between compound entities (phrases, sentences; incl. selectional preferences): 6
• context prediction: 3 (sentence completion / error prediction)

• concept categorization: 3
Concept categorization clusters the word vectors and measures the purity of the resulting clusters against known semantic groups. While it is an intrinsic evaluation, it uses an additional unsupervised machine learning step.
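The clustering-plus-purity idea can be illustrated with a minimal sketch. It assumes a plain k-means seeded with the first k vectors and uses hypothetical 2-D toy vectors with hand-assigned gold groups; the surveyed papers may use other clustering algorithms and real category benchmarks.

```python
import math
from collections import Counter

def kmeans(vectors, k, iterations=20):
    """Plain k-means; seeds the centroids with the first k vectors (toy choice)."""
    centroids = [list(v) for v in vectors[:k]]
    assignment = [0] * len(vectors)
    for _ in range(iterations):
        # Assign each vector to its closest centroid (Euclidean distance).
        assignment = [
            min(range(k), key=lambda c: math.dist(v, centroids[c]))
            for v in vectors
        ]
        # Move each centroid to the mean of its current members.
        for c in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return assignment

def purity(assignment, gold_labels):
    """Fraction of items that carry the majority gold label of their cluster."""
    clusters = {}
    for cluster_id, label in zip(assignment, gold_labels):
        clusters.setdefault(cluster_id, Counter())[label] += 1
    majority_total = sum(max(counts.values()) for counts in clusters.values())
    return majority_total / len(gold_labels)

# Hypothetical toy "embeddings" and gold semantic groups (not from any paper).
words = ["cat", "car", "dog", "bus", "horse", "train"]
vectors = [(0.9, 0.1), (0.1, 0.9), (0.8, 0.2), (0.2, 0.8), (0.7, 0.4), (0.3, 0.7)]
gold = ["animal", "vehicle", "animal", "vehicle", "animal", "vehicle"]

assignment = kmeans(vectors, k=2)
score = purity(assignment, gold)
print(dict(zip(words, assignment)), score)
```

A purity of 1.0 means every cluster contains words from a single gold group; lower values indicate that the embedding mixes semantic groups.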

• multilingual: 17
• extrinsic: 43
• qvec: 2

• qualitative: Some papers provide a qualitative evaluation by listing a specific pattern from the embeddings, for example the nearest neighbors of some chosen words.
• extrinsic: Extrinsic evaluation is based on downstream NLP applications. Here, we use the label extrinsic as follows: every evaluation that uses additional machine learning methods, e.g., classification of word vectors, is labeled extrinsic.
• crowd sourcing: One popular way of evaluating word embeddings comparatively is to extract the nearest neighbors of a specific word from multiple embeddings. Through crowd sourcing, labelers are asked to decide which of these extracted neighbors is closest to the query word. This compares the intuitiveness of the semantic / linguistic knowledge encoded within the embeddings.
• qvec: Evaluation metric based on the correlation of the dimensions of a word embedding with word descriptions extracted from lexicons (including semantic relations and part-of-speech).
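The candidate extraction step of the crowd sourcing setup (nearest neighbors of one query word taken from several embeddings) can be sketched as follows. This is a minimal illustration assuming cosine similarity and two hypothetical toy embeddings stored as plain dictionaries; the names `embedding_a`, `embedding_b`, and `nearest_neighbors` are made up for this example.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def nearest_neighbors(embedding, query, n=1):
    """Return the n words most similar to `query`, excluding the query itself."""
    q = embedding[query]
    others = [w for w in embedding if w != query]
    return sorted(others, key=lambda w: cosine(q, embedding[w]), reverse=True)[:n]

# Two hypothetical toy embeddings over the same vocabulary (assumed values).
embedding_a = {"king": (1.0, 0.9), "queen": (0.9, 1.0), "man": (1.0, 0.2), "apple": (0.0, 1.0)}
embedding_b = {"king": (1.0, 0.1), "queen": (0.5, 0.9), "man": (0.9, 0.2), "apple": (0.1, 1.0)}

# One candidate per embedding; labelers would then pick the candidate
# they consider closest to the query word.
candidates = {
    name: nearest_neighbors(emb, "king", n=1)
    for name, emb in {"A": embedding_a, "B": embedding_b}.items()
}
print(candidates)
```

With these toy values the two embeddings propose different neighbors for the same query, which is exactly the situation in which a human judgment is informative.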