Networking for Big Data and Laboratory

Professor: Paolo Di Lorenzo, Andrea Baiocchi, Antonio Cianfrani

Degree in: Master Degree in Data Science

Semester: II

Mailing list

 

Lectures Time

Academyc Year 2023-2024

  • Monday 15:00 - 17:00 (classroom B2 - building RM102 - via Ariosto 25)

  • Tuesday 15:00 - 19:00 (classroom 7 - building CU033 - piazzale Aldo Moro 5)

  • Friday 13:00 - 15:00 (classroom B2 - building RM102 - via Ariosto 25)

 

Office Hours

Prof. Antonio Cianfrani - Office hours will take place on Thursday from 3 pm to 4 pm. It is possible to fix videocalls in different days sending an email to antonio.cianfrani@uniroma1.it

Prof. Andrea Baiocchi - Office hours will take place on Monday morning (from 11:00 to 12:00). It is possible to arrange dedicated Skype calls sending an email to andrea.baiocchi@uniroma1.it

Prof. Paolo Di Lorenzo - Office hours will take place on Thursday from 3 pm to 4 pm. It is possible to fix videocalls in different days sending an email to paolo.dilorenzo@uniroma1.it

 

Course Object

The first aim of the Networking for Big Data and Laboratory course is to provide students the principles of Internet networking. The course also focuses on recent advances in networking protocols to efficiently support Distributed Data Centers infrastructures. Finally, Data Center architecture is presented and two major issues are investiaged with some detail, namely scheduling and congestion control. The course has also a practical part, mainly devoted to IP and SDN networks configuration and troubleshooting.

 

Prerequisite

A basic understanding of programming logic. Basics of probability and statistics. Basics of matrix algebra.

 

Outline of the Course

Part 1 - Networking fundamentals  (Prof. Antonio Cianfrani)

TCP/IP protocol stack

Transport layer: TCP and UDP

IP layer: addressing and routing

Link layer: forwarding

References:  [xx]-[xx]

 

Part 2 - Signal Processing and Digital Communications  (Prof. Paolo Di Lorenzo)

Signals and data

Fourier analysis

Linear Processing

Fundamentals of digital communications: Source coding, digital modulation 

References:  [1], [7]-[8]

 

Part 3 - Data Centers  (Prof. Andrea Baiocchi)

Outline of cloud computing.

Data centers architectures, topologies, addressing, routing.

Job scheduling and load balancing.

Congestion control (QCN, DCTCP).

References:  [1]-[6]

 

Textbooks and resources:

[1]  Slides, notes, and codes

[2] Liu, Y., Muppala, J. K., Veeraraghavan, M., Lin, D., & Hamdi, M. (2013). Data center networks: Topologies, architectures and fault-tolerance characteristics. Springer Science & Business Media. 

[3] Da Fonseca, Nelson LS, and Raouf Boutaba, eds. Cloud services, networking, and management. John Wiley & Sons, 2015. 

[4] Zhou, X., Wu, F., Tan, J., Sun, Y., & Shroff, N. (2017). Designing low-complexity heavy-traffic delay-optimal load balancing schemes: Theory to algorithms. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 1(2), 1-30. 

[5] Alizadeh, M., Atikoglu, B., Kabbani, A., Lakshmikantha, A., Pan, R., Prabhakar, B., & Seaman, M. (2008, September). Data center transport mechanisms: Congestion control theory and IEEE standardization. In 2008 46th Annual Allerton Conference on Communication, Control, and Computing (pp. 1270-1277). IEEE. 

[6] Alizadeh, M., Greenberg, A., Maltz, D. A., Padhye, J., Patel, P., Prabhakar, B., ... & Sridharan, M. (2010, August). Data center tcp (dctcp). In Proceedings of the ACM SIGCOMM 2010 Conference (pp. 63-74). 

[7] Prandoni, P., & Vetterli, M. (2008). Signal processing for communications. EPFL press. 

[8] Gonzales, Rafael C., and Paul Wintz. Digital image processing. Addison-Wesley Longman Publishing Co., Inc., 1987. 

 

 

----------------------------------------------------------------------------------------------------------------------

Course reference wensite for academic year 2023/2024: 

https://sites.google.com/site/paolodilorenzohp/teaching/networking-for-b...

----------------------------------------------------------------------------------------------------------------------

 

Bibliography

James F. Kurose and Keith Ross, “Computer Networking: A Top-Down Approach Featuring the Internet”, Pearson

Netkit Emulator, Netkit Documentation page

Additional material provided by the lecturer on the Data Centers

VXLAN Technical Report

LISP protocol (pages 23-36)

Software Defined Networking - a survey

Course website Moodle (Prof. Cianfrani): https://elearning.uniroma1.it/course/view.php?id=14634

Bibliographic material and all references required for the Data Center module of NBD can be found here.