The goal of this project is to design and automatically deploy a distributed data processing application. The application will be based on the main frameworks used in the Big Data community. The application will be automatically deployed in a public Cloud infrastructure.
The students will work in teams of 5 students.
The students will build a distributed data processing system. These systems are very often used today in different domains (analysis of the stock market, analysis of sensors data, analysis of data coming from tracking systems, etc.). The students will be free to pick the domain targeted by their application.
A data processing system includes several components, each of them being distributed over several machines:
For this project, the students will use the standard technologies that are used by the main companies in the domain (Google, Facebook, LinkedIn, etc.). For example, the students could use:
Furthermore, the students will have to set up the software infrastructure that will allow to configure, deploy and automatically reconfigure their application to be able to execute it on a Cloud computing platform (Ex: AWS, Azure, etc.). The tools used for this stage could include:
Networks, distributed systems, databases.
Demo of the running application. Report and documentation.
pas de rattrapage
The course exists in the following branches:
Course ID : 5MMSDTD7
The course is attached to the following structures:
You can find this course among all other courses.
Date of update June 18, 2017