#similarity-search #similarity #search #lsh #all-pairs

all-pairs-hamming

All pairs similarity search on binary sketches in the Hamming space

1 unstable release

0.1.0 Sep 25, 2022

#8 in #lsh

Download history 186/week @ 2024-11-19 148/week @ 2024-11-26 156/week @ 2024-12-03 60/week @ 2024-12-10 107/week @ 2024-12-17 49/week @ 2024-12-24 27/week @ 2024-12-31 57/week @ 2025-01-07 180/week @ 2025-01-14 136/week @ 2025-01-21 119/week @ 2025-01-28 112/week @ 2025-02-04 52/week @ 2025-02-11 138/week @ 2025-02-18 203/week @ 2025-02-25 174/week @ 2025-03-04

580 downloads per month
Used in find-simdoc

MIT/Apache

32KB
794 lines

All pairs similarity search on binary sketches in the Hamming space

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

This library is a part of find-simdoc.


lib.rs:

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

Dependencies

~1MB
~17K SLoC