Is there a way to set the seed value for using the ruby hash function (i.e. murmur hash in 1.9, don't know JRuby?) so that I can get the same hash code every time I run the script (i.e. in parallel on multiple processes or on different nodes)
so that
puts "this is a test".hash
is the same whenever I run this , today, tomorrow, 3 weeks from now, etc
I want to do this so I can implement MinHash in parallel
I can see in the murmur_hash gem that the murmur hash accept a seed so I assume I can set the seed and get the hash code deterministically whenever I choose the same seed