I doubt spaCy will ever be faster on PyPy (the neural network library Thinc is currently 50% slower). It'd still be really great to get it running, so people who benefit from PyPy for other parts of their stack don't have to manage two Python environments.
Just keep rewriting C as Python. I still remember the day I switched from Numpy and CPython to array.array on PyPy for a 60x boost in benchmarks. (Only a 20x speedup on actual running code; this was for geometry generation in a networked video game server.)
Tests failing:
* https://github.com/explosion/spaCy
Confirmed working:
* https://github.com/explosion/thinc
* https://github.com/explosion/preshed
* https://github.com/explosion/cymem
* https://github.com/explosion/murmurhash
I doubt spaCy will ever be faster on PyPy (the neural network library Thinc is currently 50% slower). It'd still be really great to get it running, so people who benefit from PyPy for other parts of their stack don't have to manage two Python environments.