Given data on explanatory and explained variables, where the explained variablecan take two values { 1 }, find a function that gives the “best” separation betweenthe “-1” cases and the “+1” cases:
Given: ( x1, y1 ), … , ( xm , ym ) n { 1 }
Find: : n { 1 }
“best function” = the expected error on unseen data ( xm+1, ym+1 ), … , ( xm+k , ym+k ) is minimal
Existing techniques to solve the classification task:
Linear and Quadratic Discriminant Analysis
Logit choice models (Logistic Regression)
Decision trees, Neural Networks, Least Squares SVM