normalize {EnrichmentBrowser} | R Documentation |
This function wraps commonly used functionality from limma for microarray normalization and from EDASeq for RNA-seq normalization.
normalize(se, norm.method = "quantile", within = FALSE, data.type = c(NA, "ma", "rseq"))
se |
An object of class |
norm.method |
Determines how the expression data should be normalized.
For available microarray normalization methods see the man page of the limma
function |
within |
Logical. Is only taken into account if data.type='rseq'.
Determine whether GC content normalization should be carried out (as
implemented in the EDASeq function |
data.type |
Expression data type. Use 'ma' for microarray and 'rseq' for RNA-seq data. If NA, data.type is automatically guessed. If the expression values in 'se' are decimal numbers they are assumed to be microarray intensities. Whole numbers are assumed to be RNA-seq read counts. Defaults to NA. |
Normalization of high-throughput expression data is essential to make results within and between experiments comparable. Microarray (intensity measurements) and RNA-seq (read counts) data exhibit typically distinct features that need to be normalized for. For specific needs that deviate from these standard normalizations, the user should always refer to more specific functions/packages. See also the limma's user guide http://www.bioconductor.org/packages/limma for definition and normalization of the different expression data types.
Microarray data is expected to be single-channel. For two-color arrays, it
is expected here that normalization within arrays has been already carried
out, e.g. using normalizeWithinArrays
from limma.
RNA-seq data is expected to be raw read counts. Please note that normalization for downstream DE analysis, e.g. with edgeR and DESeq, is not ultimately necessary (and in some cases even discouraged) as many of these tools implement specific normalization approaches. See the vignette of EDASeq, edgeR, and DESeq for details.
An object of class SummarizedExperiment
.
Ludwig Geistlinger <Ludwig.Geistlinger@sph.cuny.edu>
readSE
for reading expression data from file;
normalizeWithinArrays
and normalizeBetweenArrays
for normalization of microarray data;
withinLaneNormalization
and
betweenLaneNormalization
for normalization of RNA-seq data.
# # (1) simulating expression data: 100 genes, 12 samples # # (a) microarray data: intensity measurements maSE <- makeExampleData(what="SE", type="ma") # (b) RNA-seq data: read counts rseqSE <- makeExampleData(what="SE", type="rseq") # # (2) Normalization # # (a) microarray ... normSE <- normalize(maSE) # (b) RNA-seq ... normSE <- normalize(rseqSE) # ... normalize also for GC content gc.content <- rnorm(100, 0.5, sd=0.1) rowData(rseqSE)$gc <- gc.content normSE <- normalize(rseqSE, within=TRUE)