Difference between revisions of "Datacenter and ISP Networks"

From NMSL
Welcome to our cutting-edge suite of network technologies. We are at the forefront of developing new approaches in task scheduling for datacenters, advancing multicast forwarding techniques, and enhancing the efficiency of both datacenter and ISP network infrastructures.
 
 
 
Our current problems include task scheduling, datacenter multicast, ISP multicast, and load balancing.
 
 
 
 
 
Recent large-scale Internet applications have introduced a renewed interest in scalable multicast services. Examples of such applications include live Internet broadcast (e.g., Facebook Live), stock market applications, cloud-based enterprise applications, IPTV, webinars and video conferencing, and massive multiplayer games. The scale of these applications is unprecedented. For instance, Facebook Live aims to stream millions of live sessions to millions of concurrent users. Traditional multicast approaches do not scale, because of their substantial state and communication overheads. In addition, most traditional multicast approaches do not enable controlling the network paths chosen for the multicast sessions (usually referred to as traffic engineering), nor do they offer efficient methods to direct multicast traffic through various network functions (e.g., firewalls, IDS, video transcoding), which is known as service chaining.
 
 
 
In this project, we design scalable multicast systems for general ISP networks and datacenter networks.  
 
  
 
== People ==
 
  
 
'''Horus: In-Network Task Scheduling for Datacenters'''  
 
[[File:Horus.png|thumb|Design of Horus (Datacenter Task Scheduler). |right]]
  
 
Horus is an in-network task scheduler for datacenters that assigns short-lived tasks across distributed workers. It keeps state and communication overhead low, adapts dynamically to the workload, and does not buffer tasks, which enables high throughput and low tail response times. Unlike traditional schedulers that are confined to a single rack, Horus schedules tasks in parallel, at line rate, across switches. Testing shows that Horus scales to large datacenters and outperforms current in-network schedulers in both speed and capacity.
 
== Code and Datasets ==  
 
  
* [https://github.com/horus-scheduler Horus Source Code].
  
 
* [https://oktopus-project.org/ Oktopus Source Code].
 
  
 
== Publications ==  
 
* P. Yassini, K. Diab, S. Zanganeh, and M. Hefeeda, Horus: Granular In-Network Task Scheduler for Cloud Datacenters, In Proc. of USENIX Symposium on Networked Systems Design and Implementation (NSDI'24), Santa Clara, CA, April 2024.
  
 
* K. Diab, P. Yassini, and M. Hefeeda, [https://www2.cs.sfu.ca/~mhefeeda/Papers/nsdi22_orca.pdf Orca: Server-assisted Multicast for Datacenter Networks], In Proc. of USENIX Symposium on Networked Systems Design and Implementation (NSDI'22), Renton, WA, April 2022.

Revision as of 11:18, 8 November 2023

Welcome to our cutting-edge suite of network technologies. We are at the forefront of developing new approaches in task scheduling for datacenters, advancing multicast forwarding techniques, and enhancing the efficiency of both datacenter and ISP network infrastructures.

People


Horus: In-Network Task Scheduling for Datacenters

Design of Horus (Datacenter Task Scheduler).

Horus is an in-network task scheduler for datacenters that assigns short-lived tasks across distributed workers. It keeps state and communication overhead low, adapts dynamically to the workload, and does not buffer tasks, which enables high throughput and low tail response times. Unlike traditional schedulers that are confined to a single rack, Horus schedules tasks in parallel, at line rate, across switches. Testing shows that Horus scales to large datacenters and outperforms current in-network schedulers in both speed and capacity.
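
As a rough illustration of bufferless, load-aware task assignment, the sketch below keeps one load counter per worker and sends each arriving task straight to the less loaded of two randomly sampled workers. This is not Horus's data-plane logic, which this page does not describe: the class name, the two-choice sampling policy, and the counters are all assumptions made for illustration.

```python
import random


class LoadAwareScheduler:
    """Toy scheduler: every task is assigned on arrival and never queued."""

    def __init__(self, num_workers, seed=0):
        if num_workers < 2:
            raise ValueError("need at least two workers to sample from")
        self.load = [0] * num_workers    # outstanding tasks per worker
        self.rng = random.Random(seed)   # seeded so runs are repeatable

    def assign(self):
        """Send the task to the less loaded of two sampled workers (assumed policy)."""
        a, b = self.rng.sample(range(len(self.load)), 2)
        worker = a if self.load[a] <= self.load[b] else b
        self.load[worker] += 1
        return worker

    def complete(self, worker):
        """Record that a worker finished one task."""
        self.load[worker] -= 1


sched = LoadAwareScheduler(num_workers=8)
assigned = [sched.assign() for _ in range(800)]
sched.complete(assigned[0])
print(sched.load)
```

Sampling two workers instead of scanning all of them keeps the per-task work constant, which is the kind of budget a switch pipeline would impose; a real in-network design would also have to keep the load counters fresh, which this sketch ignores.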


Stateless Multicast For ISP Networks

Current multicast forwarding systems suffer from large state requirements at routers and high communication overhead. In addition, these systems do not support generalized multicast forwarding, where traffic passes through traffic-engineered paths or requires service chaining. We propose a new system that completely eliminates the state at routers and reduces communication overhead.

Scalable Multicast for Datacenter Networks

Group communications appear in many recent datacenter applications. These applications, however, do not benefit from multicast due to the absence of efficient systems that support large-scale multicast sessions, minimize state at switches and reduce bandwidth overheads. We propose a new architecture that addresses the challenges of multicast in datacenter networks. The proposed approach carefully divides the state and tasks of the data plane among switches and servers in order to realize efficient multicast services in datacenters, by partially offloading the management of multicast sessions to servers.

Oktopus: Service Chaining for Multicast Traffic

Multicast service chaining refers to the orchestration of network services for multicast traffic. The paths of a multicast session that span the source, the destinations, and the required services form a complex structure that we refer to as the multicast distribution graph. We propose a new path-based algorithm, called Oktopus, that runs in the control plane of the ISP network and calculates the multicast distribution graph for a given session. Oktopus aims to minimize the routing cost of each multicast session while satisfying all service chaining requirements. It consists of two steps: the first generates a set of segments from the given ISP network topology, and the second uses these segments to efficiently calculate the multicast distribution graph. Oktopus has fine-grained control over which links are selected for the distribution graph, which leads to significant improvements. The code of Oktopus is open source.
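
To make the notion of a multicast distribution graph concrete, here is a naive baseline, not the segment-based Oktopus algorithm: it routes the session along shortest paths from the source through the services in order, then from the last service to every destination, and takes the union of the links used. The topology, the node names, and the function names are hypothetical.

```python
import heapq


def shortest_path(graph, src, dst):
    """Dijkstra over graph = {node: {neighbor: cost}}; returns (cost, [nodes])."""
    dist = {src: 0}
    prev = {}
    heap = [(0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == dst:
            break
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if dst not in dist:
        raise ValueError(f"no path from {src} to {dst}")
    path = [dst]
    while path[-1] != src:
        path.append(prev[path[-1]])
    return dist[dst], path[::-1]


def distribution_graph(graph, source, service_nodes, destinations):
    """Union of directed links on source -> services (in order) -> each destination."""
    links = set()
    hops = [source] + list(service_nodes)
    for a, b in zip(hops, hops[1:]):
        _, path = shortest_path(graph, a, b)
        links.update(zip(path, path[1:]))
    for dst in destinations:
        _, path = shortest_path(graph, hops[-1], dst)
        links.update(zip(path, path[1:]))
    cost = sum(graph[u][v] for u, v in links)
    return links, cost


# Hypothetical topology: source "s", one firewall "fw", two receivers.
topology = {
    "s": {"a": 1, "b": 4},
    "a": {"s": 1, "fw": 1, "b": 1},
    "b": {"s": 4, "a": 1, "d1": 1},
    "fw": {"a": 1, "d1": 2, "d2": 1},
    "d1": {"fw": 2, "b": 1},
    "d2": {"fw": 1},
}
links, cost = distribution_graph(topology, "s", ["fw"], ["d1", "d2"])
print(sorted(links), cost)  # total link cost: 5
```

Because each path is chosen independently, this baseline can miss cheaper graphs in which branches share links; that per-link choice is where the fine-grained control described above would matter.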


Traffic-Engineered Distribution of Adaptive Multimedia Content in Telco-CDNs

Telco-CDNs are content distribution networks deployed and managed by Internet Service Providers (ISPs). They are becoming popular among major ISPs because they offer new revenue streams and can potentially perform better than traditional CDNs. Managing a telco-CDN is, however, a complex problem: it requires jointly managing the network resources (links and switches) and the caching resources (processing and storage capacities), while supporting the adaptive nature and skewed popularity of multimedia content. To address this problem, we present a new algorithm called CAD (Cooperative Active Distribution). CAD strives to serve as many of the requested multimedia objects as possible within the ISP while carefully engineering the traffic paths through the network. It does so by enabling caches within the ISP to cooperate, not only to serve the various representations of multimedia objects but also to create them on demand using the often under-utilized processing capacity of the caches.
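
The cooperation idea can be sketched as a lookup order for one request. This is a simplified illustration under stated assumptions, not the CAD algorithm: it ignores traffic engineering entirely, and the `Cache` fields, the `serve` function, and the rule that a representation can only be created from a higher-bitrate copy are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Cache:
    name: str
    stored: set = field(default_factory=set)  # {(object_id, bitrate_kbps)}
    spare_cpu: int = 0                        # transcoding jobs it can still accept


def serve(request, local, peers):
    """Return (how, cache_name) describing where the request is served from."""
    obj, bitrate = request
    caches = [local] + list(peers)
    # 1) Any cache inside the ISP that already stores this representation.
    for c in caches:
        if (obj, bitrate) in c.stored:
            return ("hit", c.name)
    # 2) A cache that holds a higher-bitrate copy and has spare CPU creates
    #    the requested representation on demand and keeps it.
    for c in caches:
        has_higher = any(o == obj and b > bitrate for o, b in c.stored)
        if has_higher and c.spare_cpu > 0:
            c.spare_cpu -= 1
            c.stored.add((obj, bitrate))
            return ("transcoded", c.name)
    # 3) Otherwise the request has to leave the ISP.
    return ("origin", None)


edge = Cache("edge", {("v1", 1000)}, spare_cpu=0)
core = Cache("core", {("v1", 4000)}, spare_cpu=1)
results = [
    serve(("v1", 1000), edge, [core]),  # stored at the local cache
    serve(("v1", 2000), edge, [core]),  # created on demand at a peer
    serve(("v1", 2000), edge, [core]),  # now stored at that peer
    serve(("v2", 1000), edge, [core]),  # not available inside the ISP
]
print(results)
```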

Code and Datasets

Publications

  • P. Yassini, K. Diab, S. Zanganeh, and M. Hefeeda, Horus: Granular In-Network Task Scheduler for Cloud Datacenters, In Proc. of USENIX Symposium on Networked Systems Design and Implementation (NSDI'24), Santa Clara, CA, April 2024.