| broadinstitute/gatk |
1,549 |
|
0 |
2 |
about 2 years ago |
46 |
March 16, 2023 |
1,299 |
other |
Java |
| Official code repository for GATK versions 4 and up |
| bigdatagenomics/adam |
966 |
|
20 |
17 |
about 2 years ago |
14 |
December 16, 2020 |
35 |
apache-2.0 |
Scala |
| ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed. |
| aehrc/VariantSpark |
121 |
|
0 |
0 |
about 3 years ago |
40 |
October 03, 2025 |
62 |
other |
JavaScript |
| machine learning for genomic variants |
| GenomicsDB/GenomicsDB |
88 |
|
0 |
2 |
about 2 years ago |
35 |
October 12, 2023 |
28 |
other |
C++ |
| High performance data storage for importing, querying and transforming variants. |
| TileDB-Inc/TileDB-VCF |
79 |
|
0 |
0 |
about 2 years ago |
0 |
|
18 |
mit |
C++ |
| Efficient variant-call data storage and retrieval library using the TileDB storage library. |
| bigdatagenomics/cannoli |
37 |
|
0 |
1 |
over 2 years ago |
11 |
December 17, 2020 |
1 |
apache-2.0 |
Scala |
| Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed. |
| gorpipe/gor |
37 |
|
0 |
0 |
over 2 years ago |
0 |
|
7 |
agpl-3.0 |
Java |
| GORpipe is a tool based on a genomic ordered relational architecture and allows analysis of large sets of genomic and phenotypic tabular data using declarative query language, in a parallel execution engine. |
| mcapuccini/spark-tutorial |
34 |
|
0 |
0 |
about 10 years ago |
0 |
|
0 |
apache-2.0 |
Scala |
| Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics |
| jtnystrom/Discount |
14 |
|
0 |
0 |
about 3 years ago |
6 |
February 13, 2023 |
0 |
gpl-3.0 |
Scala |
| Very large scale k-mer counting and analysis on Apache Spark. |
| allenday/spark-genome-alignment-demo |
13 |
|
0 |
0 |
almost 10 years ago |
0 |
|
2 |
|
Scala |
| An example of bioinformatics and bigdata tools can playing nicely together |