Skip to content

gauravrmore007/Gene-Expression-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

7 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧬 Gene Expression Analysis (scRNA-seq)

This project performs a basic single-cell RNA-seq (scRNA-seq) analysis using Python and the Scanpy library. It includes steps for preprocessing, clustering, visualization, and marker gene detection.


πŸ“ Project Structure

gene_project/ β”œβ”€β”€ data/ # Contains the .h5 dataset (download manually) β”œβ”€β”€ results/ # Stores generated plots and outputs β”œβ”€β”€ script.py # Main analysis script β”œβ”€β”€ gene_expression_analysis.py # Alternative or extended script β”œβ”€β”€ Requirement.txt # List of Python dependencies


πŸ“₯ Dataset

Download the sample dataset from 10x Genomics:

πŸ“Ž PBMC 3k filtered_feature_bc_matrix.h5 (HDF5)

Save this file into the data/ folder and rename it if needed (e.g., your_file.h5).


▢️ How to Run

  1. Install required packages:
    pip install -r Requirement.txt
    
  2. Run the analysis:
    python script.py
    
    

πŸ”¬ Pipeline Steps

  • Load and filter scRNA-seq data
  • Normalize and transform
  • PCA + UMAP for visualization
  • Cluster with Leiden algorithm
  • Detect marker genes

πŸ“Š Outputs

  • UMAP plot of clusters
  • PCA plot
  • Marker gene heatmap

🧠 Extensions

  • Add sample comparison (e.g. healthy vs diseased)
  • Train ML model for cell type classification

πŸ‘¨β€πŸ’» Author

Gaurav More

About

A complete single-cell RNA-seq analysis pipeline using Scanpy on 10x Genomics PBMC data, including clustering, UMAP visualization, and marker gene detection.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages