The Monster Team

New to the team? Start here.

People

Name	Role	GitHub
Jeff Korte	Product Owner	@JeffKorte
Quazi Hoque	Software Engineer	@quazi-broad
Drew Herbst	Tech Lead	@aherbst-broad

GitHub Teams

DSP Monsters - Team for repositories under the broadinstitute org
Monster - Team for repositories under the DataBiosphere org

Projects

Data Modeling

Linked Data definitions for the Terra Core Data Model, with extensions for unmodeled datasets.

Documentation

GitHub repos

TerraCore Data Model - Data Model definitions and examples

Data Ingest

Pipelines for moving data into the Jade Data Repository.

Documentation

Google Docs

GitHub repos

ClinVar - ETL pipeline for the ClinVar dataset
ENCODE - ETL pipeline for the ENCODE dataset
Dog Aging - ETL pipeline for the Dog Aging Project dataset
HCA - ETL pipeline for the HCA

Ingest Utilities

Tools and libraries used to support the top-level ingest pipelines.

GitHub repos

Base utilities - Common utilities shared across our batch ETL projects
XML-to-JSON-list - Command-line tool for mechanical conversion of XML into Beam-friendly JSON

Operations

Infrastructure, configuration, and shared code used to manage developing and deploying our services.

GitHub repos

Helm charts - Custom Helm charts for pieces of Monster infrastructure
Core deployments - Terraform modules, Helm releases, and deploy scripts for Monster's GCP environments
setup-chart-releaser - GitHub Action to install Chart Releaser

Semi-Archived

The repositories in this section are still being used, but we're trying to move away from them.

Data Ingest Framework

Our first stabs at data ingest envisioned a framework of dataset-agnostic services. We shifted away from that pattern because it introduced significant overhead vs. custom pipelines using common command-line tools.

GitHub repos

Transporter - Bulk file-transfer system
Storage Libs - Utility libraries for I/O against external storage systems

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
archive		archive
getting-started		getting-started
tech-docs		tech-docs
templates/python-project		templates/python-project
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Monster Team

People

GitHub Teams

Projects

Data Modeling

Documentation

GitHub repos

Data Ingest

Documentation

GitHub repos

Ingest Utilities

GitHub repos

Operations

GitHub repos

Semi-Archived

Data Ingest Framework

GitHub repos

About

Releases

Packages

Contributors 4

Languages

broadinstitute/monster

Folders and files

Latest commit

History

Repository files navigation

The Monster Team

People

GitHub Teams

Projects

Data Modeling

Documentation

GitHub repos

Data Ingest

Documentation

GitHub repos

Ingest Utilities

GitHub repos

Operations

GitHub repos

Semi-Archived

Data Ingest Framework

GitHub repos

About

Resources

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages