| jfalcou/eve |
764 |
|
0 |
0 |
about 2 years ago |
0 |
|
74 |
bsl-1.0 |
C++ |
| Expressive Vector Engine - SIMD in C++ Goes Brrrr |
| romeric/Fastor |
579 |
|
0 |
0 |
almost 3 years ago |
0 |
|
32 |
mit |
C++ |
| A lightweight high performance tensor algebra framework for modern C++ |
| agenium-scale/nsimd |
184 |
|
0 |
0 |
over 4 years ago |
0 |
|
15 |
mit |
C |
| Agenium Scale vectorization library for CPUs and GPUs |
| ashvardanian/ParallelReductionsBenchmark |
116 |
|
0 |
0 |
9 months ago |
0 |
|
1 |
|
C++ |
| Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal, and Rust - all it takes to sum a lot of numbers fast! |
| cjmcv/hpc |
47 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
apache-2.0 |
C++ |
| Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. ) |
| ChenYangyao/hipp |
8 |
|
0 |
0 |
about 3 years ago |
0 |
|
0 |
gpl-3.0 |
C++ |
| HIPP: Modern C++ Toolkit for HPC |
| sandialabs/p3a |
8 |
|
0 |
0 |
about 3 years ago |
0 |
|
0 |
mit |
C++ |
| Portably Performant Physical Algebra |
| jfalcou/kyosu |
5 |
|
0 |
0 |
over 2 years ago |
0 |
|
1 |
bsl-1.0 |
C++ |
| Complex Without Complexes - C++20 library for Cayley-Dickson algebra computations (complex,quaternion,octonion) |
| mratsim/compute-graph-optim |
5 |
|
0 |
0 |
almost 7 years ago |
0 |
|
1 |
|
Nim |
| Experiments in compute graph optimisations and ML and HPC compilers frontend |