Networking for Big Data and Laboratory
Professor: Paolo Di Lorenzo, Andrea Baiocchi, Antonio Cianfrani
Degree in: Master Degree in Data Science
Semester: II
Mailing list:
Lectures Time
Academyc Year 2023-2024
-
Monday 15:00 - 17:00 (classroom B2 - building RM102 - via Ariosto 25)
-
Tuesday 15:00 - 19:00 (classroom 7 - building CU033 - piazzale Aldo Moro 5)
-
Friday 13:00 - 15:00 (classroom B2 - building RM102 - via Ariosto 25)
Office Hours
Prof. Antonio Cianfrani - Office hours will take place on Thursday from 3 pm to 4 pm. It is possible to fix videocalls in different days sending an email to antonio.cianfrani@uniroma1.it
Prof. Andrea Baiocchi - Office hours will take place on Monday morning (from 11:00 to 12:00). It is possible to arrange dedicated Skype calls sending an email to andrea.baiocchi@uniroma1.it
Prof. Paolo Di Lorenzo - Office hours will take place on Thursday from 3 pm to 4 pm. It is possible to fix videocalls in different days sending an email to paolo.dilorenzo@uniroma1.it
Course Object
The first aim of the Networking for Big Data and Laboratory course is to provide students the principles of Internet networking. The course also focuses on recent advances in networking protocols to efficiently support Distributed Data Centers infrastructures. Finally, Data Center architecture is presented and two major issues are investiaged with some detail, namely scheduling and congestion control. The course has also a practical part, mainly devoted to IP and SDN networks configuration and troubleshooting.
Prerequisite
A basic understanding of programming logic. Basics of probability and statistics. Basics of matrix algebra.
Outline of the Course
Part 1 - Networking fundamentals (Prof. Antonio Cianfrani)
TCP/IP protocol stack
Transport layer: TCP and UDP
IP layer: addressing and routing
Link layer: forwarding
References: [xx]-[xx]
Part 2 - Signal Processing and Digital Communications (Prof. Paolo Di Lorenzo)
Signals and data
Fourier analysis
Linear Processing
Fundamentals of digital communications: Source coding, digital modulation
References: [1], [7]-[8]
Part 3 - Data Centers (Prof. Andrea Baiocchi)
Outline of cloud computing.
Data centers architectures, topologies, addressing, routing.
Job scheduling and load balancing.
Congestion control (QCN, DCTCP).
References: [1]-[6]
Textbooks and resources:
[2] Liu, Y., Muppala, J. K., Veeraraghavan, M., Lin, D., & Hamdi, M. (2013). Data center networks: Topologies, architectures and fault-tolerance characteristics. Springer Science & Business Media.
[3] Da Fonseca, Nelson LS, and Raouf Boutaba, eds. Cloud services, networking, and management. John Wiley & Sons, 2015.
[4] Zhou, X., Wu, F., Tan, J., Sun, Y., & Shroff, N. (2017). Designing low-complexity heavy-traffic delay-optimal load balancing schemes: Theory to algorithms. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 1(2), 1-30.
[5] Alizadeh, M., Atikoglu, B., Kabbani, A., Lakshmikantha, A., Pan, R., Prabhakar, B., & Seaman, M. (2008, September). Data center transport mechanisms: Congestion control theory and IEEE standardization. In 2008 46th Annual Allerton Conference on Communication, Control, and Computing (pp. 1270-1277). IEEE.
[6] Alizadeh, M., Greenberg, A., Maltz, D. A., Padhye, J., Patel, P., Prabhakar, B., ... & Sridharan, M. (2010, August). Data center tcp (dctcp). In Proceedings of the ACM SIGCOMM 2010 Conference (pp. 63-74).
[7] Prandoni, P., & Vetterli, M. (2008). Signal processing for communications. EPFL press.
[8] Gonzales, Rafael C., and Paul Wintz. Digital image processing. Addison-Wesley Longman Publishing Co., Inc., 1987.
----------------------------------------------------------------------------------------------------------------------
Course reference wensite for academic year 2023/2024:
https://sites.google.com/site/paolodilorenzohp/teaching/networking-for-b...
----------------------------------------------------------------------------------------------------------------------
Bibliography
James F. Kurose and Keith Ross, “Computer Networking: A Top-Down Approach Featuring the Internet”, Pearson
Netkit Emulator, Netkit Documentation page
Additional material provided by the lecturer on the Data Centers
Software Defined Networking - a survey
Course website Moodle (Prof. Cianfrani): https://elearning.uniroma1.it/course/view.php?id=14634