Skip to content
Change the repository type filter

All

    Repositories list

    • adam

      Public
      ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
      Scala
      Apache License 2.0
      309997343Updated Aug 26, 2024Aug 26, 2024
    • Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
      Shell
      Apache License 2.0
      363811Updated Aug 13, 2024Aug 13, 2024
    • cannoli

      Public
      Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
      Scala
      Apache License 2.0
      173800Updated Apr 24, 2024Apr 24, 2024
    • utils

      Public
      General utility code used across BDG products. Apache 2 licensed.
      Scala
      Apache License 2.0
      261810Updated Jan 3, 2024Jan 3, 2024
    • Web Site for the Big Data Genomics Group
      HTML
      71000Updated Sep 3, 2023Sep 3, 2023
    • mango

      Public
      A scalable genome browser. Apache 2 licensed.
      Scala
      Apache License 2.0
      30124577Updated Dec 2, 2022Dec 2, 2022
    • convert

      Public
      Conversions to and from Big Data Genomics Avro Formats. Apache 2 licensed.
      Java
      Apache License 2.0
      5020Updated Dec 2, 2022Dec 2, 2022
    • workflows

      Public
      Toil workflows for bigdatagenomics tools. Apache 2 licensed.
      Python
      Apache License 2.0
      5581Updated Apr 22, 2021Apr 22, 2021
    • Dockerfile
      Apache License 2.0
      2101Updated Aug 23, 2020Aug 23, 2020
    • deca

      Public
      Distributed exome CNV analyzer. Apache 2 licensed.
      Scala
      Other
      4391Updated Oct 15, 2019Oct 15, 2019
    • Awesome list of applications that extend Big Data Genomics ADAM. CC0 licensed.
      41100Updated Jul 11, 2019Jul 11, 2019
    • avocado

      Public
      A Variant Caller, Distributed. Apache 2 licensed.
      Scala
      Apache License 2.0
      4271196Updated Mar 11, 2019Mar 11, 2019
    • gnocchi

      Public
      Scala
      Apache License 2.0
      106101Updated Apr 24, 2018Apr 24, 2018
    • lime

      Public
      Distributed Set Theory for Genomics
      Scala
      Other
      3572Updated Mar 27, 2018Mar 27, 2018
    • rice

      Public
      An RNA pipeline built on top of ADAM. Apache 2 licensed.
      Scala
      Apache License 2.0
      171962Updated Jan 19, 2018Jan 19, 2018
    • quinine

      Public
      A refreshing treatment for all quality control ailments. Apache 2 licensed.
      Scala
      Apache License 2.0
      6252Updated Oct 13, 2016Oct 13, 2016
    • Exemplar API that mediates Toil with a WDL front-end and workflow tracking.
      Java
      Apache License 2.0
      1101Updated Aug 1, 2016Aug 1, 2016
    • eggo

      Public
      Ready-to-go Parquet-formatted public 'omics datasets
      Python
      Apache License 2.0
      830213Updated Nov 2, 2015Nov 2, 2015
    • recipes

      Public
      Recipes using BDG projects. Apache 2 licensed.
      Shell
      Apache License 2.0
      4410Updated Mar 25, 2015Mar 25, 2015
    • PacMin

      Public
      Assembler for PacBio reads. Apache 2 licensed.
      Scala
      Apache License 2.0
      3340Updated Mar 14, 2015Mar 14, 2015
    • corretto

      Public
      Read error correction utilities.
      Apache License 2.0
      2020Updated Mar 1, 2015Mar 1, 2015
    • Notebook tools for Big Data Genomics. Apache 2 licensed.
      JavaScript
      Apache License 2.0
      653300Updated Mar 1, 2015Mar 1, 2015
    • Utility classes for wrapping services or other interfaces around a Spark/ADAM cluster. Apache 2 licensed.
      Java
      Apache License 2.0
      8520Updated Nov 17, 2014Nov 17, 2014