Number of hours
- Lectures -
- Projects -
- Tutorials -
- Internship -
- Laboratory works -
- Written tests -
ECTS
ECTS 3.0
Goal(s)
The goal of this course is to introduce students to classical approaches of statistical learning. The information age has resulted in masses of multivariate data in many fields: finance, marketing, economy, biology, environmental sciences, and the knowledge to handle them in a rigorous and self-critical manner is of great importance in research and industry. We will give equal importance to theoretical and practical aspects of statistical learning, showing several applications in class and proposing practical sessions in which the student has to perform actual data analysis using Python software. This course is intended for students from IF, ISI, and MMIS, as well as those form the M1AM master.
Pedro Luiz COELHO RODRIGUES
Content(s)
Review of multivariate statistics. Simple and multivariate linear regression. Cross-validation, model selection, bias-variance. Principal component analysis. Linear classification: discriminative and generative approaches. Decision trees. Ensemble methods: bagging and boosting. Performance metrics and overfitting. Introduction to network analysis and community detection in graphs.
PrerequisitesNotions of probability theory: probability distribution, joint probability density function for random vectors, conditional distribution, expectation, variance, covariance, Gaussian distribution. Notions of mathematical statistics: estimator, confidence interval, statistical tests. Notions of linear algebra: matrix reductions, eigenvalue decomposition. Bonus: Elementary notions of Python programming, writing Rmd files, and typesetting equations in LaTeX.
Evaluation : 50% of TP notés and 50% of Examen sur machine (3h)
Resit : Examen oral (exposé, soutenance, etc..) (30min)
Final grade consists of 50% from average in TP reports and 50% in final exam
The course exists in the following branches:
- Curriculum - Master in Applied Mathematics - Semester 8
Course ID : WMM8AM29
Course language(s):
You can find this course among all other courses.
James et al. "Introduction to statistical learning with applications to R"
(Available at https://www.statlearning.com/)
Shalizi "The truth about linear regression”
(Available at https://www.stat.cmu.edu/~cshalizi/TALR/)