They tried Cython first, so PyPy wouldn't have been faster than that. The big speedup for them was in making the code parallel, which is tricky in python and typically requires processes instead of threads and copying instead of sharing, either of which may eat most of the gains in this case.