Installation

There are many tools that you will need to use for the purposes of this analysis. These tools will require you to have a basic understanding of package management and tool usage at the command line, python, and R.

There are many resources out there for how to install specific packages so I will not spend too much time on the details of installation.

Mandatory Note

I highly recommend creating a new conda environment for the purposes of only this analysis.

As you generate new datasets you can copy this environment over to them, but as you make changes you should leave this one unchanged so that you can always come back to the same analysis without any unexpected quirks.

Commandline Tools

  • conda
    • Used for environment management
  • kallisto bustools
    • Used for sequence alignment to the transcriptome
  • kallisto
    • is the core tool for pseudoalignment and pseudoalignment reference building
    • a version of this will be packaged into kallisto-bustools but if you don't want to search for the binary you can just install this separately
  • bustools
    • a tool for interacting with *.bus files which are used to organize and interact with single-cell data results.
    • Some specifications about the BUS File Format
    • a version of this will be packaged into kallisto-bustools but if you don't want to search for the binary you can just install this separately
  • jupyter
    • Used for generating an interactive environment to explore your data (overview)

Python Modules

  • scanpy
    • Used for single-cell analysis
  • scvelo
    • Used for single-cell RNA velocity analysis (Optional)
  • geomux
    • used for cell-guide assignment
  • hgsig
    • used for cluster enrichment/depletion and significance testing

R Libraries

  • edgeR
    • used for differential expression analysis