CscoreTool-M

Multiple sub-compartment analysis from Hi-C data

Purpose

This program performs multiple sub-compartment analysis on Hi-C data. Test data are also provided.

Installation

The program is a simple one-file executable for Linux systems. Download CscoreTool-M, then run the following command:

chmod +x CscoreTool-M

Then you can type ./CscoreTool-M to run it.

The executable was compiled on a CentOS Linux machine. If it doesn't work on your system, download the CscoreTool-M.cpp, twister.h and twister.cpp files and put them in the same folder. Then run

g++ CscoreTool-M.cpp twister.cpp -fopenmp -O3 -o CscoreTool-M

You'll get an executable file named CscoreTool-M.

Usage: CscoreTool-M <windows_Rabl.bed> <input.summary> <OutputPrefix> <N_subcompartments> <N_Rablwindow> <N_session> [Blacklist.bed] [ExcludedInteractions.txt]

Input parameters

a. windows_Rabl.bed This file specifies the genomic windows to analyze. It should be a bed file of equal-length windows whose fourth column is the Rabl position, i.e. the relative position between the centromere (0) and the telomere (1).

b. input.summary This is the main input file for Hi-C interactions. It uses the same HiCsummary format as HOMER runHiCpca.pl; see http://homer.ucsd.edu/homer/interactions/HiCtagDirectory.html. An example file, Test.summary.gz, can be downloaded. It contains 0.5% randomly selected reads on chr1 from the high-resolution GM12878 Hi-C dataset (Rao, 2014).

c. OutputPrefix This is the prefix for output files.

d. N_subcompartments The number of sub-compartments to analyze.

e. N_Rablwindow The size of the Rabl matrix output.

f. N_session The number of sessions to use. The right choice depends on the computing resources available. The six arguments a-f are required.

g. Blacklist.bed This is the ENCODE blacklist file, which lists the regions excluded from ChIP-seq analysis, possibly due to copy number variations. If the program is given 7 arguments, it takes the 7th as Blacklist.bed.

h. ExcludedInteractions.txt This file lists regions whose mutual interactions should be excluded because they carry translocations. The format is:

chrName1a_chrStart1a_chrEnd1a chrName1b_chrStart1b_chrEnd1b

chrName2a_chrStart2a_chrEnd2a chrName2b_chrStart2b_chrEnd2b chrName2c_chrStart2c_chrEnd2c

......

On each line, several genomic regions are listed in chrName_chrStart_chrEnd format, separated by tabs. Any interactions between genomic regions on the same line are excluded from the analysis. Note that the expected values for these interactions are also excluded from the calculation, so this differs from simply deleting the interactions from the input file. If the program is given 8 arguments, it takes the 8th as ExcludedInteractions.txt.
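To make the ExcludedInteractions.txt format concrete, here is a minimal Python sketch of how such a file could be read and expanded into excluded region pairs. It is not taken from CscoreTool-M.cpp. The helper names `parse_region` and `excluded_pairs` and the coordinates are hypothetical, and the sketch only covers pairs of distinct regions on a line; whether the program also excludes interactions within a single listed region is not stated here.

```python
from itertools import combinations


def parse_region(token):
    """Split 'chrName_chrStart_chrEnd' into (chrom, start, end)."""
    # Split from the right so chromosome names that contain '_' stay intact.
    chrom, start, end = token.rsplit("_", 2)
    return chrom, int(start), int(end)


def excluded_pairs(lines):
    """Yield every unordered pair of regions listed on the same line."""
    for line in lines:
        tokens = line.rstrip("\n").split("\t")
        regions = [parse_region(t) for t in tokens if t]
        yield from combinations(regions, 2)


# Hypothetical example lines (illustrative coordinates only).
example = [
    "chr9_133000000_134000000\tchr22_23000000_24000000",
    "chr1_1000000_2000000\tchr2_5000000_6000000\tchr3_0_500000",
]
pairs = list(excluded_pairs(example))
for region_a, region_b in pairs:
    print(region_a, region_b)
```

A line with two regions yields one excluded pair, and a line with three regions yields three, so longer lines are a compact way to exclude all interactions among a group of translocated regions.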

Example run:

./CscoreTool-M hg38_500k_Rabl.bed Test.summary Test_500k_1 5 20 12

./CscoreTool-M hg38_500k_Rabl.bed Test.summary Test_500k_2 5 20 12 hg38_blacklisted.bed

./CscoreTool-M hg38_500k_Rabl.bed Test.summary Test_500k_3 5 20 12 hg38_blacklisted.bed Test_excluded.txt
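For genomes without a ready-made windows file such as hg38_500k_Rabl.bed, the fourth column could be computed along the following lines. This is a sketch under stated assumptions, not the procedure used to build the provided file: it treats the centromere as a single coordinate, scales position linearly along each arm, evaluates each window at its midpoint, and drops a trailing partial window so all windows have equal length. The function names and the toy chromosome are hypothetical.

```python
def rabl_position(mid, centromere, chrom_len):
    """Relative position of `mid`: 0 at the centromere, 1 at the telomere.

    Assumption: linear scaling within each chromosome arm.
    """
    if mid <= centromere:
        return (centromere - mid) / centromere
    return (mid - centromere) / (chrom_len - centromere)


def rabl_windows(chrom, chrom_len, centromere, window):
    """Return (chrom, start, end, rabl) rows for full-length windows only."""
    rows = []
    # Stop before a trailing partial window so every window has equal length.
    for start in range(0, chrom_len - window + 1, window):
        end = start + window
        mid = (start + end) / 2
        rabl = round(rabl_position(mid, centromere, chrom_len), 4)
        rows.append((chrom, start, end, rabl))
    return rows


# Toy chromosome with made-up length and centromere coordinate.
rows = rabl_windows("chrT", chrom_len=4_000_000, centromere=1_000_000, window=500_000)
for chrom, start, end, rabl in rows:
    print(f"{chrom}\t{start}\t{end}\t{rabl}")
```

Real centromere coordinates would have to come from an annotation source for the genome in question; none are assumed here.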

Output

There are 3 output files (XXX is the OutputPrefix).

XXX_cscore.txt This is the C-score estimated for each genomic window.

XXX_cscore.bedgraph This is the bedgraph file made for visualization. The c-scores are normalized to add up to 1 for each genomic region. Regions with no reads are not shown.

XXX_rabl.txt This is the estimated Rabl effect, shown as a matrix.
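The normalization described for the bedgraph output can be illustrated with a short Python sketch. It assumes each genomic window has one non-negative score per sub-compartment and that a window with no reads shows up as an all-zero row; both are assumptions made for illustration, and `normalize_window` is a hypothetical helper, not a function of the tool.

```python
def normalize_window(scores):
    """Scale one window's sub-compartment scores so they add up to 1.

    Returns None for an all-zero window, mirroring the rule that
    regions with no reads are not shown (assumed to mean zero total).
    """
    total = sum(scores)
    if total == 0:
        return None
    return [s / total for s in scores]


# Made-up scores for three windows and three sub-compartments.
windows = [[2.0, 1.0, 1.0], [0.0, 0.0, 0.0], [1.0, 3.0, 0.0]]
normalized = [normalize_window(w) for w in windows]
shown = [row for row in normalized if row is not None]
for row in shown:
    print(row)
```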
