GEO: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 94: | Line 94: | ||
== Series, Samples, Platforms, DataSets == | == Series, Samples, Platforms, DataSets == | ||
[[File:Geo series.png|250px]] [[File:Geo samples.png|250px]] [[File:Geo platform.png|250px]] [[File:Geo datasets.png|250px]] | [[File:Geo series.png|250px]] [[File:Geo samples.png|250px]] [[File:Geo platform.png|250px]] [[File:Geo datasets.png|250px]] | ||
= R packages = | |||
== [http://bioconductor.org/packages/release/bioc/html/GEOmetadb.html GEOmetadb] == | |||
* http://gbnci.abcc.ncifcrf.gov/geo/ Meltzerlab GEO Microarray Tool | |||
* https://nsaunders.wordpress.com/2010/08/30/geo-database-curation-lagging-behind-submission/ | |||
* http://rpubs.com/seandavi/GEOMetadbSurvey2014 dplyr and the GEOmetadb package for mining NCBI GEO metadata] | |||
== [http://bioconductor.org/packages/release/bioc/html/GEOquery.html GEOquery] == | |||
* http://bioconductor.org/packages/release/bioc/vignettes/GEOquery/inst/doc/GEOquery.html | |||
* http://watson.nci.nih.gov/~sdavis/tutorials/CSHL2010/publicRepos.html | |||
* [http://watson.nci.nih.gov/~sdavis/tutorials/publicdatatutorial/ Accessing Public Data using R and Bioconductor] | |||
* http://www2.warwick.ac.uk/fac/sci/moac/people/students/peter_cock/r/geo | |||
* https://www.biostars.org/p/4896/, https://www.biostars.org/p/111791/ | |||
== [http://bioconductor.org/packages/release/bioc/html/SRAdb.html SRAdb] == |
Revision as of 10:46, 10 June 2015
Gene Expression Omnibus (GEO) website is located at http://www.ncbi.nlm.nih.gov/geo/. GEO is a public functional genomics data repository supporting MIAME-compliant data submissions. Array- and sequence-based data are accepted. Tools are provided to help users query and download experiments and curated gene expression profiles.
Browse Content
Repository Browser/Summary
Click on 'Browse Content' > 'Repository Browser' to go to the summary page. It has 4 tabs.
Series/Series Type
Expression profiling by array 40,319 Expression profiling by genome tiling array 639 Expression profiling by high throughput sequencing 4,772 Expression profiling by SAGE 242 Expression profiling by MPSS 20 Expression profiling by RT-PCR 329 Expression profiling by SNP array 13 Genome variation profiling by array 596 Genome variation profiling by genome tiling array 1,068 Genome variation profiling by high throughput sequencing 63 Genome variation profiling by SNP array 826 Genome binding/occupancy profiling by array 174 Genome binding/occupancy profiling by genome tiling array 2,114 Genome binding/occupancy profiling by high throughput sequencing 3,940 Genome binding/occupancy profiling by SNP array 12 Methylation profiling by array 556 Methylation profiling by genome tiling array 718 Methylation profiling by high throughput sequencing 764 Methylation profiling by SNP array 9 Protein profiling by protein array 167 Protein profiling by Mass Spec 6 SNP genotyping by SNP array 514 Other 1,147 Non-coding RNA profiling by array 2,166 Non-coding RNA profiling by genome tiling array 104 Non-coding RNA profiling by high throughput sequencing 1,478 Third-party reanalysis 135
Platform/Technology
Technology Count in situ oligonucleotide 5,657 spotted oligonucleotide 2,852 spotted DNA/cDNA 2,869 antibody 24 MS 17 SAGE NlaIII 67 SAGE Sau3A 4 SAGE RsaI 1 SARST 2 MPSS 18 RT-PCR 277 other 174 oligonucleotide beads 227 mixed spotted oligonucleotide/cDNA 16 spotted peptide or protein 110 high-throughput sequencing 2,073
Samples/Samples Type
Sample type Count RNA 1,017,959 genomic 244,511 protein 12,860 SAGE 1,763 mixed 3,976 other 7,509 SARST 9 MPSS 207 SRA 135,247
Organism
A partial list:
Organism Series Platforms Samples Homo sapiens 22,477 4,590 792,844 Mus musculus 15,758 1,959 240,935 Rattus norvegicus 2,358 475 68,583 Saccharomyces cerevisiae 1,790 550 37,435 Arabidopsis thaliana 2,416 331 30,709 Drosophila melanogaster 2,422 317 23,601 Sus scrofa 405 107 9,809 Caenorhabditis elegans 1,154 183 8,898 Zea mays 265 91 8,667 Bos taurus 462 147 7,780 Oryza sativa 493 173 5,616 Glycine max 179 41 5,863 Gallus gallus 375 105 5,509 Escherichia coli 508 127 5,056 Macaca mulatta 245 40 4,504 Xenopus laevis 111 25 1,054
Series, Samples, Platforms, DataSets
R packages
GEOmetadb
- http://gbnci.abcc.ncifcrf.gov/geo/ Meltzerlab GEO Microarray Tool
- https://nsaunders.wordpress.com/2010/08/30/geo-database-curation-lagging-behind-submission/
- http://rpubs.com/seandavi/GEOMetadbSurvey2014 dplyr and the GEOmetadb package for mining NCBI GEO metadata]
GEOquery
- http://bioconductor.org/packages/release/bioc/vignettes/GEOquery/inst/doc/GEOquery.html
- http://watson.nci.nih.gov/~sdavis/tutorials/CSHL2010/publicRepos.html
- Accessing Public Data using R and Bioconductor
- http://www2.warwick.ac.uk/fac/sci/moac/people/students/peter_cock/r/geo
- https://www.biostars.org/p/4896/, https://www.biostars.org/p/111791/