This e-book contains resources for mastering NGS analysis. It has been generated by the Bioinformatics team at NYU Center For Genomics and Systems Biology in New York and Abu Dhabi.
TOPICS
Next-Generation Sequencing Analysis Resources
Introduction to Linux
Linux Exercise
Nano Tutorials
Pre-Requisites
NGS Sequencing Technology and File Formats
How Sequencing Works
FastA Format
FastQ Format
Quality Scores
SAM/BAM/CRAM Format
BED Format
VCF Format
GFF3 Format
Alignment
Trimming with Trimmomatic
Visualization
Variant Calling
Pre-Processing
Variant Discovery
RNA-seq Analysis
Aligning RNA-seq data
Introduction to R
DESeq
DESeq 2
Gene Set Enrichment Analysis with ClusterProfiler
Over-Representation Analysis with ClusterProfiler
Salmon & kallisto: Rapid Transcript Quantification for RNA-Seq Data
Instructions to install R Modules on Dalma
HPC
Resources for editing files on the HPC
Atom
SSH Mounts
Neovim
SLURM
Modules
Gencore Infrastructure
Gencore Variant Detection Example
Software
HPCRunner
BioX Workflow
ChipSeq analysis
CHiP-seq considerations
Prerequisites, data summary and availability
Deeptools2 bamCoverage
Deeptools2 computeMatrix and plotHeatmap using BioSAILs
Exercise part4 – Alternative approach in R to plot and visualize the data
De novo genome assembly
Pre-processing and QC
Exercise in de novo assembly
Individual Commands
Single cell RNA sequencing
Prerequisites
Seurat part 1 – Loading the data
Seurat part 2 – Cell QC
Seurat part 3 – Data normalization and PCA
Seurat part 4 – Cell clustering
Loading your own data in Seurat & Reanalyze a different dataset