Supporting the development of Machine Learning for fundamental science in a federated Cloud with the AI_INFN platform
Lucio Anderlini, Matteo Barbetti, Giulio Bianchini, Diego Ciangottini, Stefano Dal Pra, Diego Michelotto, Carmelo Pellegrino, Rosa Petrini, Alessandro Pascolini, Daniele Spiga
Abstract: Machine Learning (ML) is driving a revolution in the way scientists design, develop, and deploy data-intensive software. However, the adoption of ML presents new challenges for the computing infrastructure, particularly in provisioning and orchestrating access to hardware accelerators for development, testing, and production. The INFN-funded project AI_INFN (“Artificial Intelligence at INFN”) aims to foster the adoption of ML techniques within INFN use cases by providing support on multiple aspects, including the provision of AI-tailored computing resources. It leverages cloud-native solutions in the context of INFN Cloud to share hardware accelerators as effectively as possible, ensuring that the diversity of the Institute’s research activities is not compromised. In this contribution, we provide an update on the commissioning of a Kubernetes platform designed to ease the development of GPU-powered data analysis workflows and their scaling across heterogeneous, distributed computing resources, possibly federated as Virtual Kubelets with the interLink provider.
Submission history
From: Matteo Barbetti
[v1] Fri, 28 Feb 2025 17:42:58 UTC (209 KB)
[v2] Tue, 17 Jun 2025 23:23:33 UTC (195 KB)