CC5212-1 Procesamiento Masivo de Datos 2020, Otoño
Welcome! This is the homepage for the course.
I will be putting slides and other material up here after the classes.
Lecture Material
The PDF format doesn't preserve animation, so some (few) slides sometimes may not make much sense in that format, though I try to keep them ordered. On the other hand, the PPTX files can be big. :)
- Lecture 1: Introduction (pptx, pdf)
- Lecture 2: Distributed Systems (pptx, pdf, mp4, lab-mp4)
- Lecture 3: GFS/HDFS & MapReduce/Hadoop (pptx, pdf, mp4, lab-mp4, bonus-mp4)
- Lecture 4: Apache Pig (pptx, pdf, mp4)
- Projects, Optional Assignments (pptx, pdf, mp4)
- Lecture 5: Apache Spark (pptx, pdf, mp4)
- Control 1 (mp4)
- Control 1 Solutions (mp4)
- Lecture 6: Streaming / Kafka (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 7: Information Retrieval / Crawling & Indexing (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 8: Information Retrieval / Ranking (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 9: NoSQL / Overview (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 10: NoSQL / MongoDB (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 11: NoSQL / Neo4J (pptx, pdf, 1.mp4, 2.mp4)
- Lecture 12: Conclusion (pptx, pdf, 1.mp4)
Contact
- Email
- aidhog@gmail.com
- Office
- 231 Poniente