Workflow for single-cell RNAseq (scRNAseq)

/media/jhuang/Elements(Denise_ChIPseq)/Data_Jingang/GSE163973_KF_NS_done/seurat0**.Rmd[R]

https://nbisweden.github.io/workshop-scRNAseq/

https://github.com/NBISweden/workshop-scRNAseq/

https://github.com/NBISweden/workshop-scRNAseq/blob/master/labs/seurat/seurat_06_celltyping.qmd

https://github.com/NBISweden/workshop-scRNAseq/blob/master/labs/seurat/seurat_07_trajectory.qmd

Project Workflow

The analysis is organized into seven steps, each corresponding to an R Markdown file:

  1. 01_qc – Quality control of the raw data.
  2. 02_dimension_reduction – Dimensionality reduction (e.g., PCA, UMAP).
  3. 03_integration – Data integration using the Harmony library to align datasets and mitigate batch effects. This step also includes normalization (NormalizeData) and selection of highly variable genes (FindVariableFeatures) for each dataset before integration.
  4. 04_clustering – Identification of cell clusters.
  5. 05_dge – Differential gene expression analysis.
  6. 06_celltype – Cell type annotation.
  7. 07_trajectory
  8. 08_spatial – if applicable

This structure should help you follow the workflow step by step.

Harmony in R: Integration vs. Batch Effect Removal

Harmony is a tool commonly used in single-cell RNA-seq analysis. Its main purpose is data integration, but it also effectively removes batch effects. Here’s a breakdown:


1. What Harmony Does

  • Aligns multiple datasets (from different batches, labs, conditions, or technologies) in a shared low-dimensional space (e.g., PCA).
  • Reduces technical variation (batch effects) while preserving biological differences.
  • Produces a “corrected” dataset suitable for downstream analysis like clustering or visualization.

2. Integration vs. Batch Effect Removal

Term Meaning
Batch Effect Removal Focuses only on removing technical variation between batches. May distort biological differences.
Integration Aligns datasets from different batches or conditions, minimizing batch effects while keeping biological variation intact. Harmony achieves this by iteratively adjusting cells’ embeddings.

3. Conceptual Diagram

Batch 1: A1 A2 A3 →

Batch 2: B1 B2 B3 —-> [Harmony] —> Integrated Space (similar cell types cluster together) Batch 3: C1 C2 C3 →

  • Arrows indicate the mapping of cells from separate batches into a shared low-dimensional space.
  • Cells of the same type cluster together, regardless of batch.

4. Summary

  • Harmony = integration tool.
  • Batch effect removal = part of the integration process.
  • Integration = alignment of datasets with biological signals preserved.

Leave a Reply

Your email address will not be published. Required fields are marked *