From the edge to the cloud: Exploring AI inference (yes, including generative AI) across the computing continuum

Morabito, Roberto
Invited talk at Aalto University, 2 April 2025, Aalto, Finland

This talk explores recent research on AI inference across the computing continuum, from the edge to the cloud, with a focus on the challenges and opportunities brought by hardware and software heterogeneity, as well as automation requirements. Roberto will present insights into collaborative, edge-centric generative AI inference, including efforts to benchmark language models on constrained devices and to route queries efficiently across distributed nodes. He will also discuss recent work on automating the lifecycle of extreme edge devices using LLMs, demonstrating how these models can support code generation, adaptation, and deployment under

tight resource constraints.


Type:
Talk
City:
Aalto
Date:
2025-04-02
Department:
Communication systems
Eurecom Ref:
8163
Copyright:
© EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in Invited talk at Aalto University, 2 April 2025, Aalto, Finland and is available at :
See also:

PERMALINK : https://www.eurecom.fr/publication/8163