Skip to contents

Whole retention time prediction workflow. Function creates predictor set with RCDK based on SMILES. Trains a chosen predcition model and validates the approach with a cross validation.

Usage

fastret.workflow(
  data,
  method = "glmnet",
  verbose = FALSE,
  data_set_name = "data set",
  final_model = TRUE,
  preprocessed = FALSE,
  interaction_terms = FALSE,
  nfolds = 2,
  include_polynomial = FALSE,
  degree_polynomial = 2,
  scale = TRUE
)

Arguments

data

data.frame with columns NAME, RT, SMILES

method

prediction algorithm, either glmnet or xgboost

verbose

additional print outputs to user if TRUE

data_set_name

name of dataset will appear on validation plot

final_model

TRUE if final model trained on whole dataset should be returned

preprocessed

TRUE if data is already preprocessed and descriptor varialbes are already added

interaction_terms

TRUE if interaction terms between all variables should be added

nfolds

number of folds for cross validation

include_polynomial

TRUE if polynomial terms should be added to descriptor set

degree_polynomial

specifies degree up until which polynomials will be added if include_polynomials == TRUE

scale

if TRUE, all variables will be centered to a mean of 0 and scaled to a standard deviation of 1