Questions tagged [murmurhash]

MurmurHash is a non-cryptographic hash function suitable for general hash-based lookup.

Murmurhash generates roughly the same number of collisions as alternate hashes over a wide range of input data.

95 questions
1
vote
0 answers

How do you compute a CurseForge API fingerprint for a WoW Retail Addon?

There is no documentation on this. I got as far as to understand that Murmur2 is used for hashing stuff (that is not mentioned anywhere either..) But I don't exactly know how to hash the addon. Do I need to hash the filenames, the contents, what…
Walialu
  • 4,096
  • 2
  • 27
  • 29
1
vote
2 answers

Bloom Filters with the Kirsch Mitzenmacher optimization

I have recently started playing around with Bloom Filters, and I have a use case in which this calculator suggests to use 15 different and unrelated hash functions. Hashing 15 times a string would be quite computing-intensive, so I started looking…
1
vote
1 answer

How the hash function by partitioner in Cassandra is decided for a particular data set to ensure even distribution of data across multiple cluster?

As we know from Cassandra's documentation[Link to doc] that partitioner should be such that the data is distributed evenly across multiple nodes to avoid read hotspots. Cassandra offers various partitioning algorithms for that - Murmur3Partitioner,…
khush
  • 2,702
  • 2
  • 16
  • 35
1
vote
1 answer

Python 3.7 spaCy Help Needed- Environment Inconsistency Issue?

I am facing an issue when trying to call spaCy into my Jupyter notebook. When I run import spacy I get the below: I have used spaCy before many times with no issue, but I noticed this problem began after I was trying to also install from…
PugFanatic
  • 31
  • 2
  • 9
1
vote
0 answers

Murmur3_128 in Javascript

I struggle with finding a way to produce a HASH in javascript that would be equivalent to one produced in java with this code private static final HashFunction MURMUR_3_128 = Hashing.murmur3_128(SEED); public static String hash(String string) { …
Mugetsu
  • 1,739
  • 2
  • 20
  • 41
1
vote
0 answers

Quality of murmur3f and xxhash different variations

From https://github.com/rurban/smhasher/, it seems that murmur3f and xxhash function (except xxh3) have the same quality, with respect that both functions return "uniform" results across different of workload. Are there any other results regarding…
user3563894
  • 331
  • 3
  • 13
1
vote
0 answers

Convert 32 bit uniform distribution to uniform distribution on any int

Given a discrete uniform distribution D~U([0:2^N-1] from which a sample yields a number in the inclusive integer range [0, 2^N-1] for an integer N, I need a function convert such that for a sample d~D, convert(d, m) will have an integer uniform…
Gulzar
  • 23,452
  • 27
  • 113
  • 201
1
vote
1 answer

How do I hash integers and strings inputs using murmurhash3

I'm looking to get a hash value for string and integer inputs. Using murmurhash3, I'm able to do it for strings but not integers: pip install murmurhash3 import mmh3 mmh3.hash(34) Returns the following error: TypeError: a bytes-like object is…
Niv
  • 850
  • 1
  • 7
  • 22
1
vote
0 answers

Hashing quality not constant over bits

I am attempting to derive a pseudo-random 32bit value from a 32bit input. For this, I am using this murmur hash: uint32_t murmur(uint32_t key, uint32_t seed) { uint32_t k = ( key ^ seed ) * 0x5bd1e995; k = k ^ (k>>24); return…
Bram
  • 7,440
  • 3
  • 52
  • 94
1
vote
1 answer

Do we need to consider python platform bits or Linux ones for finding the suitable pyhash - murmur3_x**_128 function?

I want to use pyhash murmur3 128 bits algorithm in my program. It has 2 different variants i.e. murmur3_x64_128 and murmur3_x86_128. Is it referring to python platform or Unix platform? Eg usage: (Both are working on my system, but my python and…
SmiP
  • 155
  • 2
  • 2
  • 16
1
vote
1 answer

MurmurHash3_32 Java returns negative numbers

I am trying to replicate the file hashing of MobileSheetsPro, an Android app, where there is a hashcodes.txt which includes a hash for each file, as well as the path, last modified date and filesize. We'll just focus on the hashing part. So for the…
Taxel
  • 3,859
  • 1
  • 18
  • 40
1
vote
0 answers

C++ std:hash in python

I want to use the same hash as C++ std::hash in python. I have found that std::hash uses murmurhash and i have also found several python libraries with that hashing algorithm, but non returns the same value as std::hash. I am looking for either a…
user13004139
1
vote
1 answer

Murmurhash of different language version get different result

I've tried three version of murmurhash in java(jedis and guava), go and python. The result of java(guava),go and python version output same hash code but different with java(jedis). All the murmurhash code are shown as follow. I'm confused about the…
Alexander
  • 523
  • 5
  • 21
1
vote
1 answer

Apply scikit-learn murmurhash3_32 on a Pandas dataframe

I try to apply murmurhash on a pandas dataframe. I wanted to use scikit-learn murmurhash3_32 (any other easy proposition would be appreciated). I tried import pandas as pd from sklearn.utils.murmurhash import murmurhash3_32 df = pd.DataFrame({'a':…
azerty
  • 698
  • 7
  • 28
1
vote
0 answers

import fails for murmur2 package in Redshift UDF

I am trying to import murmur2 package as a library in Redshift database. I did following steps Run the module packer $ ./installPipModuleAsRedshiftLibrary.sh -m murmur2 -s s3://path/to/murmur2/lib Create library on redshift CREATE OR REPLACE…
jimy
  • 4,848
  • 3
  • 35
  • 52