Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support the "SIMD"-like intrinsics #81

Open
eyalroz opened this issue Jun 23, 2020 · 0 comments
Open

Support the "SIMD"-like intrinsics #81

eyalroz opened this issue Jun 23, 2020 · 0 comments
Labels

Comments

@eyalroz
Copy link
Owner

eyalroz commented Jun 23, 2020

CUDA offers many functions:

https://docs.nvidia.com/cuda/cuda-math-api/group__CUDA__MATH__INTRINSIC__SIMD.html

for working with multiple 1-byte and 2-byte values packed into the native 4-byte integers.

We should offer both explicit access to these, which would be better structured and not a heap of idiosyncratic names (perhaps via the kat::array type? some other way?)

We should also check our existing code, to see when specializations are in order which would ensure we benefit from these instructions (e.g. in sequence operations or collaboration primitives).

@eyalroz eyalroz added the Task label Jun 23, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant