This pipeline is based on Snakemake and the Drop-seq tools provided by the McCarroll Lab. It takes you from the raw data of your single-cell RNA-seq experiment to the final count matrix, with QC plots along the way.
We use this tool in our lab to improve our wet-lab protocol and to provide an easy framework for reproducing and comparing experiments run with different parameters.
It uses STAR to map the reads. It works with any single-cell protocol that produces two reads, where the first read holds the cell and UMI barcodes and the second read holds the RNA. Here is a non-exhaustive list of compatible protocols/brands:
This package tries to be as user-friendly as possible. One hope is that non-bioinformaticians can use it without too much hassle. It still requires some command-line execution; it is not going to be a fully interactive package.
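To make the two-read layout described above more concrete, here is a minimal shell sketch that splits a read 1 sequence into its cell barcode and UMI. The 12 bp barcode and 8 bp UMI lengths follow the original Drop-seq design and are an assumption for this example; other protocols use different lengths, and the sequence itself is made up. The pipeline does this step for you, so this is only an illustration.

```shell
# Illustration only: these lengths are assumptions (Drop-seq-style layout).
CELL_BARCODE_LENGTH=12
UMI_LENGTH=8

# Hypothetical read 1 sequence: cell barcode followed directly by the UMI.
read1="ACGTACGTACGTTTGGCCAA"

# The UMI starts right after the cell barcode (cut uses 1-based positions).
umi_start=$((CELL_BARCODE_LENGTH + 1))
umi_end=$((CELL_BARCODE_LENGTH + UMI_LENGTH))

cell_barcode=$(printf '%s' "$read1" | cut -c "1-${CELL_BARCODE_LENGTH}")
umi=$(printf '%s' "$read1" | cut -c "${umi_start}-${umi_end}")

echo "cell barcode: ${cell_barcode}"
echo "UMI:          ${umi}"
```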
If you simply want to use this workflow, download and extract the latest release. If you intend to modify and further develop this workflow, fork this repository. Please consider providing any generally applicable modifications via a pull request.
In any case, if you use this workflow in a paper, don't forget to give credit to the authors by citing the URL of this repository and, once available, its DOI.
Configure the workflow according to your needs by editing the file config.yaml and the samples.tsv, following those instructions
All you need to execute this workflow is to install Snakemake via the Conda package manager. Software needed by this workflow is automatically deployed into isolated environments by Snakemake.
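One possible way to install Snakemake with Conda is sketched below. The function name install_snakemake and the default environment name snakemake are choices made for this example, not something the workflow requires; check the documentation for the Snakemake version the workflow expects before installing.

```shell
# Sketch: create a dedicated Conda environment that contains Snakemake.
# $1 = environment name (hypothetical default: "snakemake").
install_snakemake() {
    env_name="${1:-snakemake}"
    # Snakemake is distributed through the bioconda channel,
    # with its dependencies coming from conda-forge.
    conda create -y -c conda-forge -c bioconda -n "$env_name" snakemake
}
```

After calling install_snakemake, activate the environment with conda activate snakemake before running the commands below.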
Test your configuration by performing a dry-run via
snakemake --use-conda -n --directory $WORKING_DIR
Execute the workflow locally via
snakemake --use-conda --cores $N --directory $WORKING_DIR
using $N cores, with $WORKING_DIR as the working directory. Alternatively, it can be run in cluster or cloud environments (see the docs for details).
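The dry-run and the local execution can be chained so that the real run only starts when the dry-run succeeds. The wrapper below is a small sketch of that idea; run_workflow is a hypothetical helper name, and it only uses the two commands shown above.

```shell
# Sketch: validate the configuration with a dry-run, then execute locally.
# $1 = working directory, $2 = number of cores (defaults to 1 in this example).
run_workflow() {
    working_dir="$1"
    cores="${2:-1}"
    # "&&" ensures the real run is skipped if the dry-run reports an error.
    snakemake --use-conda -n --directory "$working_dir" &&
        snakemake --use-conda --cores "$cores" --directory "$working_dir"
}
```

For example, run_workflow /path/to/experiment 8 (the path is a placeholder) would dry-run the workflow and then execute it on 8 cores.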
If you not only want to fix the software stack but also the underlying OS, use
snakemake --use-conda --use-singularity
in combination with any of the modes above.
After successful execution, you can create a self-contained report with all results via:
snakemake --report report.html
You can find the documentation here
I'm actively seeking help to implement the points listed below. Don't hesitate to contact me if you wish to contribute.
I hope it can help you out in your single cell experiments!
Feel free to comment and point out potential improvements via issues.