Statistical Analysis and document mining - 4MMSADM
A+Augmenter la taille du texteA-Réduire la taille du texteImprimer le documentEnvoyer cette page par mailPartagez cet articleFacebookTwitterLinked InGoogle+Viadeo
Number of hours
Lectures : 15.5
Tutorials : 2.0
Laboratory works : 15.5
ECTS : 3.0
The aim of this course is to present the statistical approaches for analysing multivariate data. The information age has resulted in masses of multivariate data in many different field: finance, marketing, economy, biology, environmental sciences,...The theoretical and practical aspects of multivariate data analysis are given equal importance. This balance is achieved through practicals involving actual data analysis using the R software.
Contact Jean-Baptiste DURAND
1. Multiple linear regression. Least squares, Gaussian linear model, test of linear hypotheses, one-way analysis of variance. 2. Principal Components Analysis (PCA). 3. Classification, linear discriminant analysis, perceptron, Naive Bayes 4. Text mining, numeric representation of texts, connexion with graph clustering.
Applied Probability 2 (1st year), Statistical Principles and Methods (Semester 2)
Practical exam with R (3 h) and reports on supervised practicals. Authorized document and material: handwritten notes only. An electronic version of the booklet will be provided by the teacher.