This is an aim to share knowledge, so that anyone may feel comfortable running analysis pipelines, or at least be familiar with them.

This space will contain documentation for software/ pipelines/ workflows that our lab routinely uses to analyze data. Each software/ pipeline/ workflow will contain details regarding: what it is designed to analyze, how to run it, and other general information.

The steps in the pipeline will contain links to the tools on the cornell biohpc website, as well as from documentation sources (github, etc). At any particular step of any of the pipelines, you may switch out tools for others - there are pros and cons to each tool.

When contributing to this page, please write documentation that you would want to see if you were running an unfamiliar pipeline.

Happy coding.

Prokaryotic Genome Analyses

Intro to bioinformatics + microbial genomics

Pangenome workflow

Viral sequence detection

Reference based transcriptome assembly