get_platform_datasets: Retrieve all experiments using a given platform

View source: R/allEndpoints.R

get_platform_datasetsR Documentation

Retrieve all experiments using a given platform

Description

Retrieve all experiments using a given platform

Usage

get_platform_datasets(
  platform,
  offset = 0L,
  limit = 20L,
  raw = getOption("gemma.raw", FALSE),
  memoised = getOption("gemma.memoised", FALSE),
  file = getOption("gemma.file", NA_character_),
  overwrite = getOption("gemma.overwrite", FALSE)
)

Arguments

platform

A platform numerical identifier or a platform short name

offset

The offset of the first retrieved result.

limit

Defaults to 20. Limits the result to specified amount of objects. Has a maximum value of 100. Use together with offset and the totalElements attribute in the output to compile all data if needed.

raw

TRUE to receive results as-is from Gemma, or FALSE to enable parsing. Raw results usually contain additional fields and flags that are omitted in the parsed results.

memoised

Whether or not to save to cache for future calls with the same inputs and use the result saved in cache if a result is already saved. Doing options(gemma.memoised = TRUE) will ensure that the cache is always used. Use forget_gemma_memoised to clear the cache.

file

The name of a file to save the results to, or NULL to not write results to a file. If raw == TRUE, the output will be the raw endpoint from the API, likely a JSON or a gzip file. Otherwise, it will be a RDS file.

overwrite

Whether or not to overwrite if a file exists at the specified filename.

Value

A data table with information about the queried dataset(s). A list if raw = TRUE. Returns an empty list if no datasets matched.

The fields of the output data.table are:

  • experiment.shortName: Shortname given to the dataset within Gemma. Often corresponds to accession ID

  • experiment.name: Full title of the dataset

  • experiment.ID: Internal ID of the dataset.

  • experiment.description: Description of the dataset

  • experiment.troubled: Did an automatic process within gemma or a curator mark the dataset as "troubled"

  • experiment.accession: Accession ID of the dataset in the external database it was taken from

  • experiment.database: The name of the database where the dataset was taken from

  • experiment.URI: URI of the original database

  • experiment.sampleCount: Number of samples in the dataset

  • experiment.batchEffectText: A text field describing whether the dataset has batch effects

  • experiment.batchCorrected: Whether batch correction has been performed on the dataset.

  • experiment.batchConfound: 0 if batch info isn't available, -1 if batch counfoud is detected, 1 if batch information is available and no batch confound found

  • experiment.batchEffect: -1 if batch p value < 0.0001, 1 if batch p value > 0.1, 0 if otherwise and when there is no batch information is available or when the data is confounded with batches.

  • experiment.rawData: -1 if no raw data available, 1 if raw data was available. When available, Gemma reprocesses raw data to get expression values and batches

  • geeq.qScore: Data quality score given to the dataset by Gemma.

  • geeq.sScore: Suitability score given to the dataset by Gemma. Refers to factors like batches, platforms and other aspects of experimental design

  • taxon.name: Name of the species

  • taxon.scientific: Scientific name for the taxon

  • taxon.ID: Internal identifier given to the species by Gemma

  • taxon.NCBI: NCBI ID of the taxon

  • taxon.database.name: Underlying database used in Gemma for the taxon

  • taxon.database.ID: ID of the underyling database used in Gemma for the taxon

Examples

head(get_platform_datasets("GPL1355"))

PavlidisLab/Gemma-API documentation built on Oct. 25, 2024, 10:25 a.m.