| horovod/horovod | 13,755 | | 20 | 16 | about 2 years ago | 77 | June 12, 2023 | 372 | other | Python |
| Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. |
| openwall/john | 8,749 | | 0 | 0 | about 2 years ago | 0 | | 509 | | C |
| John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs |
| mind/wheels | 885 | | 0 | 0 | almost 7 years ago | 0 | | 20 | | |
| Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI) |
| devitocodes/devito | 493 | | 0 | 1 | about 2 years ago | 15 | October 16, 2023 | 109 | mit | Python |
| DSL and compiler framework for automated finite-differences and stencil computation |
| openhackathons-org/gpubootcamp | 479 | | 0 | 0 | over 2 years ago | 0 | | 25 | apache-2.0 | Jupyter Notebook |
| This repository consists of GPU bootcamp material for HPC and AI |
| NVIDIA/AMGX | 420 | | 0 | 0 | over 2 years ago | 0 | | 79 | bsd-3-clause | Cuda |
| Distributed multigrid linear solver library on GPU |
| alibaba/libgrape-lite | 345 | | 0 | 0 | over 2 years ago | 1 | January 05, 2022 | 4 | apache-2.0 | C++ |
| 🍇 A C++ library for parallel graph processing (GRAPE) 🍇 |
| mpi4jax/mpi4jax | 328 | | 0 | 2 | over 2 years ago | 61 | December 11, 2023 | 17 | mit | Python |
| Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡ |
| NVIDIA/multi-gpu-programming-models | 327 | | 0 | 0 | over 3 years ago | 0 | | 0 | bsd-3-clause | Cuda |
| Examples demonstrating available options to program multiple GPUs in a single node or a cluster |
| QMCPACK/qmcpack | 273 | | 0 | 0 | about 2 years ago | 0 | | 406 | other | C++ |
| Main repository for QMCPACK, an open-source, production-level, many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids, with full performance-portable GPU support |