This is an old revision of the document!
Big Data Analytics
News
- Course start: lessons will start on 5th October 2017.
General Info
Teacher:
- Dr. Massimo Callisto De Donato <massimo.callistodedonato[at]gruppofilippetti.it>
Lessons schedule:
- 42 h - lecture and exercise sessions
- Thursday: 15:00 pm – 18:00 pm
Students Office hours:
- Send an e-mail to the teacher to fix an appointment.
Course Objectives
- The course gives an introduction to the Big Data models and related techniques required to perform data analysis in real world examples.
- The course focuses on data with “Big Data characteristics” such as data that can generated by any kind of systems with an high volume, data that grows very fast, data highly semi-structured or un-structured.
- The course highlights the correlations between Big Data and related fields of IoT and Smart Cities.
- The course introduces all relevant state-of-the-art concepts, methods and technologies enabling Big Data Analysis in real world business cases.
Course Contents
- Acquire knowledge and competence on Big Data methodologies, techniques and technologies.
- Know most common techniques of Big Data analysis and how they apply to real world examples.
- Apply Big Data Analysis techniques into practical case studies.
Syllabus
- Introduction to Big Data
- What are Big Data
- Big Data in the real world
- Needs and Challenges of Big Data
- Characterize Big Data
- Big Data models: from storage to processing
- The V model
- From datawarehouse to Big Data
- Aspect of Distributing Storage
- Distributed Processing
- NoSQL
- Distributed Search
- Big Data Analisys: Methodologies and Techniques
- How enabling Big Data Processing
- The Hadoop framework
- HDFS: hadoop filesystem
- Computational framework: MapReduce and YARN
- HBase as NoSQL database
- Other related hadoop processing frameworks
- Practical Examples
- Apache Spark
- Apache Cassandra
- Batch processing Vs RealTime processing
- Apache Spark Streaming
- Apache Storm
- Big Data Analysis applications
- Big Data in practice: connecting with IoT
- Use cases scenario
- Advance Big Data Analysis
- Distribuited graph modelling framework
- Machine Learning
- Baysian Newtork
Study material
Course Slides
- -
- Reference materials
- Slides course.
- Material provided by the teacher.
- Examples
- Homework
- -
Exams
Exam Dates A.Y. 2017/2018
- 08/02/2018 - 15:00
- 08/03/2018 - 15:00
- 21/06/2018 - 15:00
- 12/07/2018 - 15:00
- 20/09/2018 - 15:00
- 07/02/2019 - 15:00
- 21/02/2019 - 15:00
Exam rules:
- Writing Examination on the topics of the course
- Open or multiple-choice questions + Exercise
- 2 h
- Homework assignment evaluation
Exam Results