Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Jun 7, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
Expressive Vector Engine - SIMD in C++ Goes Brrrr
PyTurboJPEG is a highly optimized Python wrapper of libjpeg-turbo (TurboJPEG API) which supports x86 and ARM architecture.
Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
SIMD-based linear algebra and statistics for data science with dart
GPU-accelerated 3D vortex methods solver with easy GUI
Two-dimensional flow solver with GUI using vortex particle and boundary element methods
Corium is a modern scripting language which combines simple, safe and efficient programming.
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
This repository lists 4 problems solved using C. Each problem has its own serial and parallel implementations. For the latter, the OpenMP API was utilized.
Image filters using SSE Instructions (Streaming SIMD Extensions) of Intel® x86-64 Architecture.
A portable modern C++ primitive performance library for 3D Vision & Photo-Mechanics.
DSL for SIMD Sorting on AVX2 & AVX512
System benchmarks over JVM with JMH - SIMD (superscalar processing), Branch prediction, False sharing.
An implementation of dot product using CUDA, x64, and SIMD using the integer data type (32-bits) in C Language.
SIMD discrete Fourier transform tests and discussion
EinsteinDB is a Hybrid memory system consisting of DRAM and Non-Volatile Memory configured to persist data fast.
SIMD-accelerated Vector math lib
n-body-simulation performance test suite
A fast and simple c# hex-decode function using AVX2 and SSSE3 Intel intrinsics.
Add a description, image, and links to the simd-parallelism topic page so that developers can more easily learn about it.
To associate your repository with the simd-parallelism topic, visit your repo's landing page and select "manage topics."