GSEA: Difference between revisions

From 太極
Revision as of 14:16, 20 March 2021

GSEA

https://en.wikipedia.org/wiki/Gene_set_enrichment_analysis

GSEA determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states.
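
As a rough illustration of the idea, the sketch below walks down a ranked gene list and keeps a running sum that goes up at genes in the set and down otherwise; the largest deviation from zero is the enrichment score. This is the unweighted (Kolmogorov-Smirnov-like) form only. The published GSEA method additionally weights hits by their ranking metric and assesses significance by permutation, neither of which is shown here. The function name and the toy gene names are made up for the example.

```python
def enrichment_score(ranked_genes, gene_set):
    """Unweighted running-sum enrichment score (illustrative sketch only)."""
    members = set(gene_set)
    n_hit = sum(1 for g in ranked_genes if g in members)
    n_miss = len(ranked_genes) - n_hit
    if n_hit == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")

    running = 0.0
    best = 0.0  # signed value of the largest deviation from zero seen so far
    for gene in ranked_genes:
        if gene in members:
            running += 1.0 / n_hit    # step up at a gene in the set
        else:
            running -= 1.0 / n_miss   # step down at a gene outside the set
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical ranked list, most up-regulated first.
ranked = ["g1", "g2", "g3", "g4", "g5", "g6"]
es_top = enrichment_score(ranked, {"g1", "g2"})     # set sits at the top
es_bottom = enrichment_score(ranked, {"g5", "g6"})  # set sits at the bottom
print(es_top, es_bottom)
```

A set concentrated at the top of the list gives a positive score and a set concentrated at the bottom gives a negative one.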

Two categories of GSEA procedures:

  • Competitive: compares the genes in the test set with all other genes.
  • Self-contained: tests whether the gene set is more differentially expressed (DE) than one would expect under the null hypothesis of no association between the two phenotype conditions, without reference to other genes in the genome. An example is the method by Jiang & Gentleman (Bioinformatics, 2007).
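
The two null hypotheses can be contrasted with a toy example. The sketch below uses the same set statistic (mean absolute difference in group means) for both tests and changes only what is permuted: the competitive test compares the set against every other gene set of the same size, while the self-contained test relabels the samples and looks only at genes in the set. It enumerates all possibilities exhaustively, which is only feasible for tiny data. It is not the Jiang & Gentleman method or any package's implementation, and all names and numbers are invented for illustration.

```python
from itertools import combinations


def mean(xs):
    return sum(xs) / len(xs)


def abs_mean_diff(values, group1):
    """|mean(group 1) - mean(group 0)| for one gene; group1 is a set of sample indices."""
    in1 = [v for i, v in enumerate(values) if i in group1]
    in0 = [v for i, v in enumerate(values) if i not in group1]
    return abs(mean(in1) - mean(in0))


def set_statistic(expr, genes, group1):
    """Mean absolute group difference over the genes in a set."""
    return mean([abs_mean_diff(expr[g], group1) for g in genes])


def competitive_p(expr, gene_set, group1):
    """Null: the set is no more DE than other gene sets of the same size."""
    genes = sorted(gene_set)
    observed = set_statistic(expr, genes, group1)
    null = [set_statistic(expr, other, group1)
            for other in combinations(sorted(expr), len(genes))]
    return sum(s >= observed for s in null) / len(null)


def self_contained_p(expr, gene_set, group1, n_samples):
    """Null: no association between the set's genes and the phenotype labels."""
    genes = sorted(gene_set)
    observed = set_statistic(expr, genes, group1)
    null = [set_statistic(expr, genes, set(relabel))
            for relabel in combinations(range(n_samples), len(group1))]
    return sum(s >= observed for s in null) / len(null)


# Hypothetical data: 6 genes x 6 samples; samples 3-5 form phenotype group 1.
expr = {
    "A": [1, 1, 1, 5, 5, 5],
    "B": [2, 2, 2, 6, 6, 6],
    "C": [3, 4, 3, 4, 3, 4],
    "D": [5, 4, 5, 4, 5, 4],
    "E": [2, 3, 3, 2, 3, 3],
    "F": [4, 4, 4, 4, 4, 5],
}
group1 = {3, 4, 5}
p_comp = competitive_p(expr, {"A", "B"}, group1)
p_self = self_contained_p(expr, {"A", "B"}, group1, n_samples=6)
print(p_comp, p_self)
```

Note that the self-contained test never touches genes C to F, whereas the competitive p-value depends entirely on how the set compares with them.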

See also BRB-ArrayTools -> GSEA.

ssGSEA

  • https://github.com/broadinstitute/ssGSEA2.0
  • To use "ssgsea-gui.R":
    setwd("~/github/ssGSEA2.0/")
    source("ssgsea-gui.R")
    # select a folder containing gct files; e.g. PI3K_pert_logP_n2x23936.gct 
    # select a gene set file; e.g. <ptm.sig.db.all.flanking.human.v1.8.1.gmt>
    

    A new folder (e.g. 2021-03-01) will be created under the same parent folder as the gct file folder.

    tree -L 1 ~/github/ssGSEA2.0/example/gct/2021-03-20/                         
    
    ├── PI3K_pert_logP_n2x23936_ssGSEA-combined.gct
    ├── PI3K_pert_logP_n2x23936_ssGSEA-fdr-pvalues.gct
    ├── PI3K_pert_logP_n2x23936_ssGSEA-pvalues.gct
    ├── PI3K_pert_logP_n2x23936_ssGSEA-scores.gct
    ├── PI3K_pert_logP_n2x23936_ssGSEA.RData
    ├── parameters.txt
    ├── rank-plots
    ├── run.log
    └── signature_gct
    
    tree ~/github/ssGSEA2.0/example/gct/2021-03-20/rank-plots | head -3 
    # 102 files. One file per matched gene set
    ├── DISEASE.PSP_Alzheime_2.pdf
    ├── DISEASE.PSP_breast_c_2.pdf
    
    tree ~/github/ssGSEA2.0/example/gct/2021-03-20/signature_gct | head -3                    
    # 102 files. One file per matched gene set
    ├── DISEASE.PSP_Alzheimer.s_disease_n2x23.gct
    ├── DISEASE.PSP_breast_cancer_n2x14.gct
    
  • Some discussions on biostars.org; search the page for "ssgsea".
  • Some papers: Proteogenomic Characterization Reveals Therapeutic Vulnerabilities in Lung Adenocarcinoma (2020).
  • [Bioinformatics Analysis 3] How to read GSEA and ssGSEA analysis results
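
The gct file names above end in a suffix such as n2x23936 or n2x14. Assuming this follows the common GCT naming convention of n<columns>x<rows> (an assumption, not something stated in the ssGSEA2.0 output itself), a small helper can pull the dimensions out of a file name. The function name gct_dims is made up for this sketch.

```python
import re

# Assumed convention: "_n<columns>x<rows>" followed by "." or "_".
DIM_SUFFIX = re.compile(r"_n(\d+)x(\d+)(?=[._])")


def gct_dims(filename):
    """Return (n_columns, n_rows) parsed from a gct file name, or None if absent."""
    match = DIM_SUFFIX.search(filename)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


for name in [
    "PI3K_pert_logP_n2x23936_ssGSEA-scores.gct",
    "DISEASE.PSP_breast_cancer_n2x14.gct",
    "parameters.txt",
]:
    print(name, gct_dims(name))
```

For the combined, scores and p-value files, the suffix appears to be carried over from the input file name, so it likely describes the input matrix rather than the output.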