remove slide and handle jump in avx512
This removes the extra overhead and, on non-avx512 CPUs, is cycle-for-cycle tied with the code from before the avx512 decision.
The PCALIGN on avx512 improves performance from 28.5 GiB/s to 30 GiB/s on 4K, and from 25 GiB/s to 28 GiB/s on 10M.
This also removes avo because it was getting in my way: I couldn't get it to jump to another function, and cespare expressed reservations about using it.
Jorropo committed Jun 20, 2024
1 parent 5103d35 commit e7870b6
Showing 10 changed files with 185 additions and 5,490 deletions.
248 changes: 0 additions & 248 deletions gen/avx512.go

This file was deleted.

13 changes: 0 additions & 13 deletions gen/go.mod

This file was deleted.

10 changes: 0 additions & 10 deletions gen/go.sum

This file was deleted.

13 changes: 0 additions & 13 deletions gen/main.go

This file was deleted.
