
> but you'll have a non-zero rate of pages you think you've downloaded but you actually haven't, depending how you tune it.

My (not very fresh) memory of what a Bloom filter actually is tells me that this "non-zero rate" you're talking about must be HUUUUGE. On the order of millions of pages. Am I right?



You're right if OP was using a small Bloom filter, and wrong if it was a big one. Hence the phrase "depending how you tune it."


You can construct a Bloom filter with an arbitrarily low error rate; the price is more bits of memory per stored item.
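
To put rough numbers on "depending how you tune it", here is a minimal sketch using the textbook sizing formulas for a standard Bloom filter. The function names and the one-million-page figure are my own illustrative assumptions, not anything OP described, and the formulas assume ideal, independent hash functions.

```python
import math


def bloom_parameters(n_items, target_fp_rate):
    """Return (bits, hash_count) for a standard Bloom filter.

    Textbook formulas, assuming ideal independent hash functions:
        bits   m = -n * ln(p) / (ln 2)^2
        hashes k = (m / n) * ln 2
    Both names here are hypothetical helpers for illustration.
    """
    if n_items <= 0 or not (0.0 < target_fp_rate < 1.0):
        raise ValueError("need n_items > 0 and 0 < target_fp_rate < 1")
    bits = math.ceil(-n_items * math.log(target_fp_rate) / (math.log(2) ** 2))
    hashes = max(1, round(bits / n_items * math.log(2)))
    return bits, hashes


def expected_fp_rate(n_items, bits, hashes):
    """Approximate false-positive rate: (1 - e^(-k*n/m))^k."""
    return (1.0 - math.exp(-hashes * n_items / bits)) ** hashes


if __name__ == "__main__":
    pages = 1_000_000  # assumed crawl size, purely illustrative
    for p in (0.01, 0.001, 0.000001):
        bits, hashes = bloom_parameters(pages, p)
        rate = expected_fp_rate(pages, bits, hashes)
        print(f"target={p:g} bits/item={bits / pages:.1f} "
              f"hashes={hashes} expected_rate={rate:.2e}")
```

Under these formulas a 1% false-positive rate costs roughly 9.6 bits per item, and every further factor of ten costs about 4.8 more bits per item. So the share of pages wrongly treated as already downloaded is set by how much memory you give the filter, not by the size of the crawl.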



