Builds one or more random forest models from a table of MLST profiles. In most workflows you would build many models, one per bootstrap replicate, and then make predictions from each of them downstream.
Usage
bootstrapping(
  input,
  output,
  ncores = 1L,
  bootstrap_reps = 1L,
  loci_start_with = "LMO",
  my_seed = 23L
)
Arguments
- input
(character) Path to the input file of MLST profile data, in CSV (.csv) or gzip-compressed CSV (.csv.gz) format.
- output
(character) The output directory for random forest models.
- ncores
(integer, default: 1L) The number of cores to use for parallel processing.
- bootstrap_reps
(integer, default: 1L) The number of bootstrap replicates.
- loci_start_with
(character, default: "LMO") The prefix for loci names.
- my_seed
(integer, default: 23L) The seed for reproducibility.
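Examples
The call below is a minimal sketch of how the arguments fit together, not an example taken from the package itself. The file name mlst_profiles.csv.gz, the rf_models output directory, and the values chosen for ncores and bootstrap_reps are hypothetical placeholders. The guard only runs the call when bootstrapping() is available in the session and the input file exists.

```r
# Hypothetical input and output locations; replace with your own.
input_path <- "mlst_profiles.csv.gz"
output_dir <- file.path(tempdir(), "rf_models")

# Collect the arguments in one place so they are easy to inspect and reuse.
args <- list(
  input           = input_path,
  output          = output_dir,
  ncores          = 2L,    # assumed value: use two cores
  bootstrap_reps  = 10L,   # assumed value: ten bootstrap replicates
  loci_start_with = "LMO", # documented default
  my_seed         = 23L    # documented default
)

# Call bootstrapping() only if it is available and the input file exists.
if (exists("bootstrapping", mode = "function") && file.exists(input_path)) {
  do.call(bootstrapping, args)
} else {
  message("bootstrapping() or the input file is not available; skipping the call.")
}
```

Keeping my_seed fixed between runs is what makes the replicates reproducible. Raise bootstrap_reps when you want more models to predict from downstream.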