Split a data.frame into training and test sets.
data_split( data = get_data("german"), varname = "credit_risk", p_test = 0.2, p_quiz = 0.5 )
data | data.frame |
---|---|
varname | string. output variable name |
p_test | real. proportion of samples in the test set |
p_quiz | real. proportion of samples from the test set in the quiz set |
list with members
training set with output variable
test set without output variable
test set output variable
indices of quiz samples in the test set