MatToVec {hicrep}    R Documentation

Convert the Hi-C matrix format to vector format

Description

The matrix format is the standard input for the HiCRep reproducibility analysis. It has dimensions N*(3+N): the first three columns hold the chromosome name and the mid-point coordinates of the two contacting bins, and the remaining N columns hold the N*N contact counts. The converted (vector) format has three columns: the first two are the mid-point coordinates of the two contacting bins, and the third is the number of reads for that contact.

Usage

MatToVec(dat)

Arguments

dat

an intra-chromosomal Hi-C matrix in N*N format (without the chromosome name and coordinate columns).

Value

Vectorized Hi-C data with three columns. The first two columns are the mid-point coordinates of the two contacting bins. The third column is the number of reads for that contact.
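
To make the shape of the returned data concrete, the following base-R sketch flattens a toy 3 x 3 matrix into the three-column layout described above. It is an illustration only, not the package's implementation: the helper name mat_to_vec_sketch, the toy counts, the row ordering, and the choice of which coordinate goes in which column are all assumptions.

```r
# Hypothetical helper (not part of hicrep): flattens a square contact matrix
# whose row and column names are bin mid-point coordinates.
mat_to_vec_sketch <- function(mat) {
  mat <- as.matrix(mat)
  row_mid <- as.numeric(rownames(mat))
  col_mid <- as.numeric(colnames(mat))
  cbind(
    rep(col_mid, each = nrow(mat)),   # mid-point of the column bin
    rep(row_mid, times = ncol(mat)),  # mid-point of the row bin
    as.vector(mat)                    # counts, read column by column
  )
}

# Toy symmetric matrix at 1 Mb resolution; the counts are made up.
mids <- c(500000, 1500000, 2500000)
labels <- format(mids, scientific = FALSE, trim = TRUE)
toy <- matrix(c(10, 4, 1,
                 4, 8, 3,
                 1, 3, 6),
              nrow = 3, byrow = TRUE,
              dimnames = list(labels, labels))

vec <- mat_to_vec_sketch(toy)
dim(vec)
```

In this sketch every one of the N*N cells becomes a row, so the toy input yields nine rows and three columns. Whether MatToVec itself keeps all cells and uses this ordering should be checked against the package source.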

References

HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient. Tao Yang, Feipeng Zhang, Galip Gurkan Yardimci, Ross C Hardison, William Stafford Noble, Feng Yue, Qunhua Li. bioRxiv 101386; doi: https://doi.org/10.1101/101386.

Examples

data(HiCR1)

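# Drop the three annotation columns, then label rows and columns
# with the bin coordinates (column 3 minus half the resolution)
resol <- 1000000
ref_Rep1 <- HiCR1[, -c(1, 2, 3)]
rownames(ref_Rep1) <- colnames(ref_Rep1) <- HiCR1[, 3] - resol / 2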

vec_HiC_R1 <- MatToVec(ref_Rep1)
head(vec_HiC_R1)

[Package hicrep version 1.11.0 Index]