API References
- BIT(file, output_path, show=TRUE, plot.bar=TRUE, format=NULL, N=5000, bin_width=1000, burnin=NULL, genome=c("hg38", "mm10"))
Main interface to run BIT method, please set the input file path, input file format, number of iterations and bin width.
file: file path to the user-input file.
output_path: absoluate or relative directory to save the output.
show: TRUE / FALSE. Whether to display the ranking table. Default: TRUE.
plot.bar: TRUE / FALSE. Whether to plot the top 10 TRs BIT scores in a horizontal bar plot. Default: TRUE.
format: One of “bed”, “narrowPeak”, “broadPeak”, “bigNarrowPeak”, “csv”, or NULL. Specifies the format of the input file. Default: NULL. If set to NULL, BIT will automatically determine the file format based on its extension.
N: Integer. The number of iterations in the Gibbs sampler. Default: 5000.
bin_width: Integer. The width of the bin used to divide the chromatin into non-overlapping bins. Default: 1000. Only change this if you compile a different reference database.
burnin: NULL or an integer. Specifies the burn-in period when deriving the Bayesian inference for TR-level parameters. Default: NULL, which will use N/2.
genome: hg38 or mm10 for TR ChIP-seq data collected from different genome.
- BIT_compare(file1, file2, output_path, show=TRUE, plot_scatter=TRUE, format=c(NULL, NULL), N=5000, bin_width=1000, burnin=NULL, genome=c('hg38', 'mm10'))
compare BIT identifid TRs for two user input epigenomic region sets.
file1: file path to the user-input file 1.
file2: file path to the user-input file 2.
output_path: absoluate or relative directory to store the Gibbs sampler data.
format: One of “bed”, “narrowPeak”, “broadPeak”, “bigNarrowPeak”, “csv”, or NULL. Specifies the format of the input file. Default: NULL. If set to NULL, BIT will automatically determine the file format based on its extension.
N: Integer. The number of iterations in the Gibbs sampler. Default: 5000.
bin_width: Integer. The width of the bin used to divide the chromatin into non-overlapping bins. Default: 1000. Only change this if you compile a different reference database.
burnin: NULL or an integer. Specifies the burn-in period when deriving the Bayesian inference for TR-level parameters. Default: NULL, which will use N/2.
genome: hg38 or mm10 for TR ChIP-seq data collected from different genome.
- display_tables(file_path, output_path, burnin=NULL)
To show the ranking table by inspecting the results of Gibbs sampler.
file_path: path to the saved BIT Gibbs sampling results.
output_path: path to save the rank table.
burnin: number of samples used for burn-in. If not specify, BIT will use the half of the iterations as burn in.
- rank_plot(file_path=NULL, output_path, burnin=NULL, n=10, colors="NPG", main=NULL, xlab="BIT score", ylab="TR symbols")
To draw a barplot for the top n TRs.
file_path: path to the saved BIT Gibbs sampling results.
output_path: path to save the barplot.
burnin: number of samples used for burn-in. If not specify, BIT will use the half of the iterations as burn in.
n: top n TRs will show in the barplot, default: 10.
colors: colors for each bar, default “NPG” for n<=10, has to be manually specified if n>10.
main: main title for the barplot, default: NULL.
xlab: x axis label, default: BIT score.
ylab: y axis label, default: TR symbols.
- compare_scatter_plot(file1_path, file2_path, output_path, burnin=NULL)
To draw a scatterplot for the comparison bewteen two input region sets.
file1_path: path to the saved BIT Gibbs sampling results of input 1.
file2_path: path to the saved BIT Gibbs sampling results of input 2.
output_path: path to save the barplot.
burnin: number of samples used for burn-in. If not specify, BIT will use the half of the iterations as burn in.
- load_chip_data(data_path, bin_width, genome=c('hg38', 'mm10'))
load the pre-compiled chip-seq data.
data_path: path to the ChIP-seq data folder, can be absolute or relative path.
bin_width width of bin, which should be in 100/500/1000 and map with your ChIP-seq data.
genome: hg38 or mm10 for TR ChIP-seq data collected from different genome.
- import_input_regions(file, format=NULL, bin_width=1000, genome=c('hg38', 'mm10'))
Transform the input regions to binary vector.
file: file path to the user-input.
bin_width: desired width of bin, default: 1000.
genome: the genome of TR ChIP-seq data, either as “hg38” or “mm10”.
- alignment_wrapper(input_vec, bin_width, genome=c('hg38', 'mm10'))
Count the ‘good’ and ‘informative’ cases by comparing input with the reference database.
input_vec: A input vector contains index of transformed regions by applying import_input_regions.
bin_width: desired width of bin, default: 1000.
genome: the genome of TR ChIP-seq data, either as “hg38” or “mm10”.