To me, this looks about 40% faster. WIP though. API needs to change to take out error output argument. Further optimization is possible by merging loops.
To me, this looks about 40% faster. WIP though. API needs to change to take out error output argument. Further optimization is possible by merging loops.