Loading…
AI DevSummit 2025 + DeveloperWeek Leadership 2025
Type: 4. Hackathon clear filter
arrow_back View All Dates
Wednesday, June 4
 

12:00pm PDT

[Virtual] OPEN Session: Efficient Serverless Inferencing: Scaling and Optimization
Wednesday June 4, 2025 12:00pm - 12:25pm PDT
Yann Léger, Koyeb, Co-Founder

Today, AI Infrastructure doesn’t rhyme with efficiency. Massive investments are made in GPUs sold mainly by a single vendor and these GPUs end up underused due to poor software solutions.

This is not a fatality and we, as an industry, are working on increasing average utilization and increasing diversity of accelerators. I’ll walk you through the different technical solutions to implement Serverless Inferencing and the trade-offs, from the chips to the virtualization software through the storage layers.
Speakers
avatar for Yann Léger

Yann Léger

Co-Founder, Koyeb
Yann Leger is co-founder of Koyeb, a serverless platform for AI workloads, and spent the last 12 years building large-scale cloud service providers from scratch.Passionate about cloud computing, he has a deep understanding of the underlying infrastructure, from data centers to the... Read More →
Wednesday June 4, 2025 12:00pm - 12:25pm PDT
VIRTUAL DeveloperWeek Leadership Main Stage
 

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -