| VcDevel/Vc | 1,349 | | 0 | 1 | over 2 years ago | 3 | May 20, 2022 | 85 | bsd-3-clause | C++ | SIMD Vector Classes for C++ |
| VcDevel/std-simd | 467 | | 0 | 0 | about 3 years ago | 0 | | 19 | other | C++ | std::experimental::simd for GCC [ISO/IEC TS 19570:2018] |
| WojciechMula/sse-popcount | 297 | | 0 | 0 | over 2 years ago | 0 | | 0 | bsd-2-clause | C++ | SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html |
| altimesh/hybridizer-basic-samples | 220 | | 0 | 0 | over 2 years ago | 0 | | 9 | mit | C# | Examples of C# code compiled to GPU by hybridizer |
| minio/md5-simd | 123 | | 1,714 | 2,325 | over 3 years ago | 12 | October 06, 2021 | 0 | apache-2.0 | Go | Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel. |
| ashvardanian/ParallelReductionsBenchmark | 116 | | 0 | 0 | 9 months ago | 0 | | 1 | | C++ | Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast! |
| PatwinchIR/ultra-sort | 18 | | 0 | 0 | over 7 years ago | 0 | | 1 | mit | C++ | DSL for SIMD Sorting on AVX2 & AVX512 |
| tugrul512bit/VectorizedKernel | 5 | | 0 | 0 | almost 3 years ago | 0 | | 0 | gpl-3.0 | C++ | Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures |