DOCSLIB.ORG
Explore
Sign Up
Log In
Upload
Search
Home
» Tags
» MinHash
MinHash
Mining of Massive Datasets
Applied Statistics
Arxiv:2102.08942V1 [Cs.DB]
Similarity Search Using Locality Sensitive Hashing and Bloom Filter
Compressed Slides
Lecture Note
Distributed Clustering Algorithm for Large Scale Clustering Problems
Minner: Improved Similarity Estimation and Recall on Minhashed Databases
Setsketch: Filling the Gap Between Minhash and Hyperloglog
Strand: Fast Sequence Comparison Using Mapreduce and Locality Sensitive Hashing
Efficient Minhash-Based Algorithms for Big Structured Data
Hash-Grams: Faster N-Gram Features for Classification and Malware Detection Edward Raff Charles Nicholas Laboratory for Physical Sciences Univ
December, 2018 2018
Set Similarity Search Beyond Minhash∗
SRS: Solving C-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index
Secure Similar Document Detection with Simhash
A Review for Weighted Minhash Algorithms
Fundamental Data Structures Zuyd Hogeschool, ICT Contents
Top View
Fundamental Data Structures
Dimension Independent Similarity Computation
Google News Personalization: Scalable Online Collaborative Filtering 1 Outline
Finding Similar Items:Nearest Neighbor Search
In Defense of Minhash Over Simhash
Scalable Techniques for Similarity Search
A Probabilistic Molecular Fingerprint for Big Data Settings
Real-Time Clustering for Large Sparse Online Visitor Data