Class Times: Tuesdays and Thursdays 11:40-12:55 Location: First week: Old Chem 025 after that 2240b CIEMAS Instructors: Sayan Mukherjee
Office Hours: By appointment Email Contact : sayan at stat.duke Similar class: Statistical learning theory Course description
The problem of supervised learning will be developed in the framework of statistical learning theory. Two classes of machine learning algorithms that have been used successfully in a variety of applications will be studied in depth: regularization algorithms and voting algorithms. Support vector machines (SVMs) are an example of a popular regularization algorithm and AdaBoost is an example of a popular voting algorithm. The course will
1) introduce these two classes of algorithms
2) illustrate practical uses of the algorithms
via problems in computational biology and computer graphics
3) state theoretical results on the generalization and consistency of these algorithms.Prerequisites
Familiarity with probability, functional analysis, and linear algebra will be very helpful. We try to keep the mathematical prerequisites to a minimum, but we will introduce complicated material at a fast pace.Grading
Three problem sets for 50% of the grade. A final project for 50% of the grade.Syllabus
Follow the link for each class to find a detailed description, suggested readings, and class slides. Some of the later classes may be subject to reordering or rescheduling.
Date Title Class 01 Thur 13 Jan Course at a glance Class 02 Tues 18 Jan Learning problem in perspective (Second lecture in notes). Class 03 Thur 20 Jan Regularization and Reproducing Kernel Hilbert Spaces Class 04 Tues 25 Jan Kernel ridge-regression Class 05 Thur 27 Jan Support Vector Machines for classification Class 06 Tues 1 Feb Spline models and regularization networks Class 07 Thur 3 Feb Voting algorithms and weak learners Class 08 Tues 8 Feb Probably approximately correct framework and the boosting hypothesis Class 09 Thur 10 Feb Adaptive boosting (Adaboost) Class 10 Tues 15 Feb Applications in computational biology Class 11 Thur 17 Feb Applications in computer graphics Class 12 Tues 22 Feb Multiclass classification and text classification Class 13 Thur 24 Feb Generalization and consistency Class 14 Tues 1 Mar One-dimensional concentration inequalities Class 15 Thur 3 Mar Vapnik-Chervonenkis classes, shattering dimensions, and covering numbers Class 16 Tues 8 Mar Uniform Glivenko-Cantelli classes Class 17 Thurs 10 Mar Discussion about final projects SPRING BREAK Class 18 Tues 22 Mar Symmetrization and Rademacher averages Class 19 Thur 24 Mar Kolmogorov chaining and Dudley's entropy integral Class 20 Tues 29 Mar Stability and generalization Class 21 Thurs 31 Mar Stability of Tikhonov regularization Class 22 Tues 5 Apr Necessary and sufficient conditions for the consistency or error minimization Class 23 Thurs 7 Apr Bounds for error minimization Class 24 Tues 12 Apr Bounds for boosting algorithms I Class 25 Thur 14 Apr Bounds for boosting algorithms II Class 26 Tues 19 Apr Project presentations Class 27 Thur 21 Apr Project presentations
Math Camp 1 TBD Analysis and basic probability theory Math Camp 2 TBD More analysis and probability theory Reading List
There is no textbook for this course.
I am preparing a series of lectures that will grow in length and hopefully clarity as the course continues. These lecture notes will contain in much greater depth all topics I will cover. I will update the notes such that contents relevant for each lecture will appear at least a day before the lecture. Note, I have not yet had a chance to add references.The books and papers listed below are useful general reference reading, especially from the theoretical viewpoint.
- S. Mukherjee Statistical Learning: algorithms and theory.
- V. N. Vapnik. The Nature of Statistical Learning Theory.
- V. N. Vapnik. Statistical Learning Theory.
- L. Devroye, L. Gyorfi, and G. Lugosi. A Probabilistic Theory of Pattern Recognition.
- N. Cristianini and J. Shawe-Taylor. Introduction To Support Vector Machines.
- T. Evgeniou and M. Pontil and T. Poggio. Regularization Networks and Support Vector Machines.
- F. Cucker and S. Smale. On The Mathematical Foundations of Learning.
- R.E. Schapire. The boosting approach to machine learning: An overview.
- L. Breiman. Bagging predictors.
- J. Friedman, T. Hastie, and R. Tibshirani. Additive logistic regression: a statistical view of boosting.
- G. Lugosi. Concentration-of-measure inequalities.
- S. Mendelson. A few notes on Statistical Learning Theory.
- G. Lugosi. Pattern classification and learning theory.
- O. Bousquet and A. Elisseeff. Stability and generalization.