Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
BUGGED AMEND ! remove slide and handle jump in avx512
This removes the extra overhead and is cycle for cycle tied with pre-avx512 decision on non avx512. The PCALIGN on avx512 improves the performance from 28.5GiB/s to 30GiB/s on 4K. This also removes avo since I couldn't get it to jump to an other function. BUG: `go test .` for avx512 fails many times, probably some dub error while rewriting.
- Loading branch information