Split a data.frame into training and test sets.
data_split( data = get_data("german"), varname = "credit_risk", p_test = 0.2, p_quiz = 0.5 )
| data | data.frame |
|---|---|
| varname | string. output variable name |
| p_test | real. proportion of samples in the test set |
| p_quiz | real. proportion of samples from the test set in the quiz set |
list with members
training set with output variable
test set without output variable
test set output variable
indices of quiz samples in the test set