sepiraRegAct {SEPIRA} | R Documentation |
sepiraRegAct
calculates TF activity scores in user input data set. It could be a gene expression dataset or a DNA methylation dataset
sepiraRegAct(data, type = c("mRNA", "DNAm"), regnet, norm = c("c", "z"), ncores = 4)
data |
A gene expression or DNA methylation data matrix, with rows referring to genes and columns to samples. |
type |
A character, "mRNA" for gene expression data; "DNAm" for DNA methylation data. |
regnet |
A matrix, the regulatory network inferred from |
norm |
A character indicating the method used to normalize your input data set, "c" for "centering"; "z" for "z-score normalization". |
ncores |
A numeric, the number of cores to use. See |
sepiraRegAct
is one of the two main functions in SEPIRA
package. It takes the output regulatory network from sepiraInfNet
as input, and computes the activity of all TFs in this network from user provided data
.
The data
matrix could be gene expression data or DAN methylation data, with rows are genes and columns are samples. Duplicated row names are not allowed, so you should average the these rows before running sepiraRegAct
.
Note that it's very important that you use the same gene identifier through out the whole analysis.
A matrix of TF activity score with rows referring to TFs, columns to samples.
# gene expression dataset data("GeneExp") # TFs data("TFeid") # run the function cf <- "Blood" coln <- colnames(GeneExp) degth <- c(0.3,0.3) # 'degth = c(0.05, 0.05)' is recommended net.o <- sepiraInfNet(GeneExp,coln,"Lung",cf,TFeid,sigth=0.05,degth=degth,minNtgts=5,ncores=1) # normalized LSCC DNAm data set from TCGA data("LUSCmeth") # estimate TF activity TFact.lusc <- sepiraRegAct(LUSCmeth,type="DNAm",regnet=net.o$netTOI,norm="z",ncores=1) TFact.gtex <- sepiraRegAct(GeneExp,type="exp",regnet=net.o$netTOI,norm="z",ncores=1)