Traffic-Aware DNN Inference Task Offloading in the Mobile Device-Edge Continuum

IRIS

The rapid growth of mobile devices and machine learning (ML)-based applications is driving a surge in data traffic. Even when inference tasks are considered, a huge amount of data needs to be transferred into the network, e.g., large Deep Neural Network (DNN) models retrieved for on-device inference or streams of input data sent from the device to the edge if the task is offloaded. To address the resulting potential network congestion, we formulate a novel optimization problem aimed at deciding where to execute streams of DNN inference tasks from multiple devices across the mobile device-edge continuum in order to minimize the amount of exchanged data traffic, while satisfying accuracy, latency, and battery constraints. The formulated problem also selects the model variant (in terms of size and accuracy) that best suits the placement decision (device, edge). Results, collected under a wide variety of different settings, showcase the validity of our proposal and its supremacy over the considered benchmark schemes, with gains in terms of saved bandwidth up to 98%.

Traffic-Aware DNN Inference Task Offloading in the Mobile Device-Edge Continuum / Chukhno, O., Singh, G., Campolo, C., Chiasserini, C.F., Molinaro, A.. - (2025), pp. 1-6. (2025 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025 gbr 2025) [10.1109/infocomwkshps65812.2025.11152885].

Traffic-Aware DNN Inference Task Offloading in the Mobile Device-Edge Continuum

Chukhno, Olga;Singh, Gurtaj;Campolo, Claudia;Chiasserini, Carla Fabiana;Molinaro, Antonella

2025-01-01

Abstract

The rapid growth of mobile devices and machine learning (ML)-based applications is driving a surge in data traffic. Even when inference tasks are considered, a huge amount of data needs to be transferred into the network, e.g., large Deep Neural Network (DNN) models retrieved for on-device inference or streams of input data sent from the device to the edge if the task is offloaded. To address the resulting potential network congestion, we formulate a novel optimization problem aimed at deciding where to execute streams of DNN inference tasks from multiple devices across the mobile device-edge continuum in order to minimize the amount of exchanged data traffic, while satisfying accuracy, latency, and battery constraints. The formulated problem also selects the model variant (in terms of size and accuracy) that best suits the placement decision (device, edge). Results, collected under a wide variety of different settings, showcase the validity of our proposal and its supremacy over the considered benchmark schemes, with gains in terms of saved bandwidth up to 98%.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Lingua/e
	
				Inglese
			
	Titolo del Volume
	
				IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025
			
	Serie
	
				IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS
			
	Titolo del convegno
	
				2025 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025
			
	Da pagina
	
				1
			
	A pagina
	
				6
			
	Numero di pagine
	
				6
			
	Codice DOI
	
				https://dx.doi.org/10.1109/infocomwkshps65812.2025.11152885
			
	Nome Editore
	
				Institute of Electrical and Electronics Engineers Inc.
			
	Città Editore
	
				345 E 47TH ST, NEW YORK, NY 10017 USA
			
	Periodo del Convegno
	
				2025
			
	Luogo del Convegno
	
				gbr
			
	Parole chiave
	
				DNN model compression
edge computing
inference
offloading
			
	Codice Scopus
	
				2-s2.0-105017959321
			
	Codice Web of Science
	
				WOS:001591523800127
			
	Presenza di coautori internazionali
	
				No
			
	Tipologia
	
				4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
			
	Tutti gli autori
	
						Chukhno, Olga; Singh, Gurtaj; Campolo, Claudia; Chiasserini, Carla Fabiana; Molinaro, Antonella
					
	Tipologia sito docente
	
				273
			
	Citazione
	
				Traffic-Aware DNN Inference Task Offloading in the Mobile Device-Edge Continuum / Chukhno, O., Singh, G., Campolo, C., Chiasserini, C.F., Molinaro, A.. - (2025), pp. 1-6. (2025 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2025 gbr 2025) [10.1109/infocomwkshps65812.2025.11152885].
			
	Numero autori
	
				5
			
	Fulltext
	
				none
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12318/167866

Citazioni

ND

0

0

social impact