This function tests for the presence of (calls) individual mutations from a predefined list.
dreams_vc_parallel(
  mutations_df,
  bam_file_path,
  reference_path,
  model,
  alpha = 0.05,
  use_turboem = TRUE,
  calculate_confidence_intervals = FALSE,
  batch_size = NULL,
  ncores = 1,
  log_file = NULL
)
mutations_df: A data.frame() with candidate mutations (SNVs): chromosome, position, reference allele and alternative allele.
bam_file_path: Path to the BAM file.
reference_path: Path to the reference genome, e.g. a FASTA file.
model: A dreams model. See train_dreams_model().
alpha: Alpha level used for testing and confidence intervals. Default is 0.05.
use_turboem: Logical. Should turboEM::turboem() be used for the EM algorithm? Default is TRUE.
calculate_confidence_intervals: Logical. Should confidence intervals be calculated? Default is FALSE.
batch_size: Number of positions to process at a time. Default is NULL.
ncores: Number of processing cores. Default is 1.
log_file: Path to write the log file to. Default is NULL.
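The arguments above can be put together as in the following minimal sketch. It assumes the function is exported by a package named dreams. The column names of mutations_df (chr, pos, ref, alt), the positions, and the file paths are illustrative assumptions and are not taken from this page. A trained model object named model is assumed to exist already, for example from train_dreams_model(). The call is guarded so that the snippet does nothing when these prerequisites are missing.

```r
# Candidate SNVs. The column names and positions are assumptions made for
# illustration; check which names the package expects.
mutations_df <- data.frame(
  chr = c("chr1", "chr7"),
  pos = c(1000L, 2000L),
  ref = c("T", "A"),
  alt = c("C", "T"),
  stringsAsFactors = FALSE
)

# Hypothetical input paths; replace with real files.
bam_file_path  <- "sample.bam"
reference_path <- "reference.fa"

inputs_exist <- all(file.exists(bam_file_path, reference_path))

# Run only when the package, the input files and a trained model are available.
if (requireNamespace("dreams", quietly = TRUE) && inputs_exist && exists("model")) {
  calls <- dreams::dreams_vc_parallel(
    mutations_df = mutations_df,
    bam_file_path = bam_file_path,
    reference_path = reference_path,
    model = model,
    alpha = 0.05,
    use_turboem = TRUE,
    calculate_confidence_intervals = FALSE,
    batch_size = NULL,
    ncores = 2,
    log_file = "dreams_vc.log"  # hypothetical log path
  )
  print(head(calls))
}
```

Naming every argument keeps the call readable and makes it robust to changes in argument order.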
A data.frame() with information about the individual mutation calls, including:
The genomic position of the mutation.
The reference and alternative allele.
Whether the EM algorithm converged.
Number of steps and function evaluations by the EM algorithm.
The estimated tumor fraction (allele fraction).
The confidence interval of tf_est.
The expected count of the alternative allele under the error (null) model.
The count of the alternative allele.
The coverage used by the model (only reads with the reference or alternative allele).
The total coverage of the position (for reference).
The observed frequency of the alternative allele.
The value of the log-likelihood function under the null (tf=0) and alternative (tf>0) hypotheses.
The chi-squared test statistic, degrees of freedom and p-value of the statistical test.
Whether the mutation was detected at the supplied alpha level.
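How the reported quantities relate to each other can be sketched in base R. This assumes a plain likelihood-ratio test with one degree of freedom and an observed frequency computed as alternative count over model coverage; the package's exact procedure may differ. All variable names and numbers below are illustrative and are not output from the function.

```r
# Illustrative values only (not produced by the package).
alt_count <- 12       # reads supporting the alternative allele
coverage  <- 4000     # reads used by the model
ll_null   <- -105.2   # log-likelihood under the null (tf = 0)
ll_alt    <- -101.7   # log-likelihood under the alternative (tf > 0)
alpha     <- 0.05

# Observed frequency of the alternative allele.
obs_freq <- alt_count / coverage

# Likelihood-ratio statistic: -2 * (null log-likelihood - alternative log-likelihood).
q_stat <- -2 * (ll_null - ll_alt)

# Upper-tail chi-squared p-value, assuming 1 degree of freedom.
df <- 1
p_value <- pchisq(q_stat, df = df, lower.tail = FALSE)

# The mutation is called when the p-value falls below the alpha level.
detected <- p_value < alpha
```

Lowering alpha makes the call more conservative: the same test statistic must then yield a smaller p-value before the mutation is reported as detected.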