====== Big Data Analytics ======

----

===== News =====

  * **Exam results 2017-11-30**: results have been published {{ https://drive.google.com/file/d/1Nnlvq2ONdqK_kpw7daIlXVRsHZH7Ji3w/view?usp=sharing | link }}. Students can confirm their grade by email and register at the next exam session.
  * **Papers 2016-17**: available {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pV2hsTXRPZGJaZEE?usp=sharing | here}}.
  * **Exam results 2017-09-21**: results have been published {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pclVYRkFfY0FlNWM/view?usp=sharing | link }}. Students can confirm their grade by email and register at the next exam session.
  * **Exam results 2017-07-13**: results have been published {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pOF9ibWMxOHVuaTg/view?usp=sharing | link }}. Students can confirm their grade by email and register at the next exam session.
  * **Exam update**: The exam scheduled for 13th July will be held in room AB2, Polo Ludovici.
  * **Exam results 2017-06-22**: results have been published {{https://drive.google.com/file/d/0B3Jl2zBf8Y-pWWF3OHQ1MjdiRjQ/view?usp=sharing | link }}. Students can confirm their grade by email and register at the next exam session of 2017-07-13.
  * **Exam results 2017-03-30**: results have been published {{https://drive.google.com/file/d/0B3Jl2zBf8Y-pVTg5X1BUYXNpUVU/view?usp=sharing | link }}. Students can confirm their grade by email.
  * **For further information about the exam**: I can be available on Saturday morning at the Campus, Giurisprudenza building (Aula 1, ground floor). If you are interested, please send me an email to schedule an appointment.
  * **Exam results 2017-03-16**: results have been published {{https://drive.google.com/file/d/0B3Jl2zBf8Y-pTkc5aV9zSXMxUEk/view?usp=sharing | link }}. Students can register their grade at the end of the next exam session of 2017-03-30.
  * **For students who need information or have questions**: I can be available on Saturday morning at the Campus, Giurisprudenza building (Aula 1, ground floor). If you are interested, please send me an email to schedule an appointment.
  * **Exam update**: The exam scheduled for 16th March will be held in room AB3, Polo Ludovici.
  * **No lesson 9th February 2016**: Lecture suspended. The next lecture is on 16th February 2016, from 15:00 to 19:00. Slides of the suspended lesson have been uploaded.
  * **WebEx**: https://unicam.webex.com/meet/massimo.callisto.
  * **Course start**: lectures will start on 15th December at Polo Ludovici, room AB2, from 15:00 to 18:00.
  * **Course stop**: lectures suspended due to earthquake.
  * **New lecture room**: next lectures will be held in room E. W. Dijkstra.
  * **Course start**: lessons will start on 6th October 2016.

----

===== General Info =====

**Teacher**:
  * **Dr. Massimo Callisto De Donato**

**Lessons schedule**:
  * 42 h - lecture and exercise sessions
  * Thursday: 15:00 – 18:00

**Student office hours**:
  * Send an email to the teacher to arrange an appointment.

----

===== Course Objectives =====

  * The course introduces Big Data models and the related techniques required to perform data analysis on real-world examples.
  * The course focuses on data with "Big Data characteristics", such as data that can be generated by any kind of system at high volume, data that grows very fast, and data that is highly semi-structured or unstructured.
  * The course highlights the correlations between Big Data and the related fields of IoT and Smart Cities.
  * The course introduces the relevant state-of-the-art concepts, methods and technologies enabling Big Data analysis in real-world business cases.

----

===== Course Contents =====

  * Acquire knowledge and competence in Big Data methodologies, techniques and technologies.
  * Know the most common techniques of Big Data analysis and how they apply to real-world examples.
  * Apply Big Data analysis techniques to practical case studies.

----

===== Syllabus =====

  * **Introduction to Big Data**
    * What is Big Data
    * Big Data in the real world
    * Needs and challenges of Big Data
    * Characterizing Big Data
  * **Big Data models: from storage to processing**
    * The V model
    * From data warehouse to Big Data
    * Aspects of distributed storage
    * Distributed processing
    * NoSQL
    * Distributed search
  * **Big Data Analysis: Methodologies and Techniques**
    * How to enable Big Data processing
    * The Hadoop framework
    * HDFS: the Hadoop filesystem
    * Computational framework: MapReduce and YARN
    * HBase as a NoSQL database
    * Other related Hadoop processing frameworks
    * Practical examples
    * Apache Spark
    * Apache Cassandra
    * Batch processing vs. real-time processing
    * Apache Spark Streaming
    * Apache Storm
  * **Big Data Analysis applications**
    * Big Data in practice: connecting with IoT
    * Use case scenarios
  * **Advanced Big Data Analysis**
    * Distributed graph modelling framework
    * Machine Learning
    * Bayesian Network

----

===== Study material =====

**Course Slides**
  * slide 1st_0 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pdmpSUmdIdzRCTk0/view?usp=sharing | slides_1}}
  * slide 1st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pck5Hd1FES0pmVW8/view?usp=sharing | slides_1_1}}
  * slide 2st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pQV9HNHh3SnVZOTQ/view?usp=sharing | slides_2_1}}
  * slide 2st_2 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pVkNTSzlUMWhTZWM/view?usp=sharing | slides_2_2}}
  * slide 2st_3 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pYWZJck13Vm1TMTA/view?usp=sharing | slides_2_3}}
  * slide 3st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pSnJ3d1Z3dTBqS3c/view?usp=sharing | slides_3}}
  * slide 4st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pcVc1Um5maFRIUnc/view?usp=sharing | slides_4}}
  * slide 4st_2 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pQ1h3azIxZUtOams/view?usp=sharing | slides_4_2}}
  * slide 5st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pLXE4SThZZ2ZDMFE/view?usp=sharing | slide 5_1}}
  * slide 5st_2 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pZU5NWVZEVURSZDQ/view?usp=sharing | slide 5_2}}
  * slide 6st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pODVvTFdxQndWX2M/view?usp=sharing | slide 6_1}}
  * slide 7st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pRWdsMFAzQTBRQ3M/view?usp=sharing | slide 7_1}}
  * slide 8st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pbVg3WU9jRDJ5R1k/view?usp=sharing | slide 8_1}}
  * slide 9st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-peGxlMEpWaHRtWGs/view?usp=sharing | slide 9_1}}
  * slide 10st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pTnBXWjB0VXJuMFE/view?usp=sharing | slide 10_1}}
  * slide 11st_1 lesson {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pazd1dGlGMTJrRVE/view?usp=sharing | slide 11_1}}
  * **Reference materials**
    * Course slides.
    * Material provided by the teacher.
    * Examples: {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pdjdYeGRUWDBjQ0k/view?usp=sharing | wordcount }} {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pNkNTeTlyM2RvR1E/view?usp=sharing | exportdbms}} {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pakNOZi14d3JFb28?usp=sharing | hbase}} {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pN1oybzN0NzhmMVU?usp=sharing | spark}} {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pTGJRaTZ0czBuZzg?usp=sharing | spark-streaming}} {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pNFV4QVg4UDh1QW8?usp=sharing | storm}}
  * **Homework**
    * {{ https://drive.google.com/file/d/0B3Jl2zBf8Y-pR3dSNWtXMGxjbDA/view?usp=sharing | info}}
    * {{ https://docs.google.com/document/d/1ziU3J8xGDXAixTMndIdQfseVb5uuTAH5Jyrhew6T3lE/edit?usp=sharing | hw-bidata-001}} {{ https://docs.google.com/document/d/1Wnf1vF-9Ltjv_kJAlh5A6wWQnFx6d7MfLgpLUgbqI6w/edit?usp=sharing | hw-bidata-002}} {{ https://docs.google.com/document/d/1wEZWjBktuFu9yL10IcXlGr17OB5WhTUCBwvJsNtv23w/edit?usp=sharing | hw-bidata-003}} {{ https://docs.google.com/document/d/1HMwSg6zk5TVFaC9oJ3Y5kI94uQEJAURvgf3Hjh259qc/edit?usp=sharing | hw-bidata-004}} {{ https://docs.google.com/document/d/1Jq-NUFgCbyk044HSUiblPrdjAutqfmDb97u8ZQXUeLE/edit?usp=sharing | hw-bidata-005}} {{ https://docs.google.com/document/d/1RSkyV3ZTFP6_zDjC0U0aj_cPCngUgwZhJUoEw5mTjY4/edit?usp=sharing | hw-bidata-006}}
    * {{ https://drive.google.com/drive/folders/0B3Jl2zBf8Y-pV2hsTXRPZGJaZEE?usp=sharing | Papers 2016-17}}

----

===== Exams =====

**Exam Dates A.Y. 2015/2016**
  * 16/03/2017 - 15:00
  * 30/03/2017 - 15:00
  * 22/06/2017 - 15:00
  * 13/07/2017 - 15:00
  * 21/09/2017 - 15:00
  * 01/02/2018 - 15:00
  * 22/02/2018 - 15:00

**Exam rules**:
  * Written examination on the topics of the course
  * Open or multiple-choice questions + exercise
  * 2 h
  * Homework assignment evaluation

**Exam Results**
  * {{https://drive.google.com/file/d/0B3Jl2zBf8Y-pVTg5X1BUYXNpUVU/view?usp=sharing | 2017-03-30 }}
  * {{https://drive.google.com/file/d/0B3Jl2zBf8Y-pTkc5aV9zSXMxUEk/view?usp=sharing | 2017-03-16 }}