Descriptif
This module will present concepts, architectures and algorithms for data storage, management, and analysis, at a very large scale, especially in distributed settings. The following topics will be covered, each illustrated with a representative system, whose main features will be detailed during lectures:
- Introduction to distributed systems (consistency, availability, and the CAP theorem; ACID vs BASE)Massively distributed (cloud-based) filesystems (e.g., HDFS/GFS)Modern distributed computing: MapReduceDistributed NoSQL databases:
- Dynamic Hash Tables (DHTs)Key-value stores“Big Table” - style systemsGraph databases: Neo4J, PregelDistributed triple storesDocument stores: MongoDB
Format des notes
Numérique sur 20Littérale/grade européenPour les étudiants du diplôme Data & Knowledge (D-K)
Le rattrapage est autorisé (Note de rattrapage conservée)- Crédits ECTS acquis : 2.5 ECTS
Le coefficient de l'UE est : 2.5
La note obtenue rentre dans le calcul de votre GPA.
Programme détaillé