Talks and presentations

FMOps/LLMOps — Managing the Generative AI Lifecycle on AWS

March 15, 2025

Conference, Cloud Conference, University of Applied Sciences Burgenland, Burgenland, Austria

Presented a 30-minute session on Generative AI lifecycle tooling to an audience of approximately 200 attendees at a cloud computing conference. Covered the end-to-end lifecycle of foundation models, from experimentation and evaluation to deployment and monitoring, using AWS services. The session led to a follow-up conference with business decision makers.

Accelerate FM pre-training on Amazon SageMaker HyperPod

September 04, 2024

Conference, AWS Summit Zurich, Zurich, Switzerland

Co-presented with Ankit Anand and Matt Nightingale, this session explored the challenges of training foundation models at scale and how Amazon SageMaker HyperPod addresses them. The talk covered the generative AI landscape and the growing computational demands of FM development, from prompt engineering and RAG to full pre-training. We introduced SageMaker HyperPod as a resilient, performant, and customizable environment for large-scale distributed training — featuring self-healing clusters that automatically detect hardware failures, replace faulty instances, and resume training jobs from checkpoints, reducing training time by up to 20%. The session went under the hood of HyperPod, covering cluster architecture, instance groups, lifecycle scripts, Elastic Fabric Adapter (EFA) for high-speed inter-node communication, distributed training software stacks for both GPU and Trainium, and job scheduling with auto-healing. Customer stories from Stability AI, Perplexity AI, and Hugging Face illustrated real-world benefits.

LLMOps and FMOps — Evaluating and Operating Foundation Models

March 15, 2024

Conference, AWS Builder Stream, Vienna, Austria

Presented LLMOps and FMOps concepts to approximately 100 attendees at an AWS builder-focused event, covering the operational lifecycle of large language models including fine-tuning, evaluation, deployment, and monitoring strategies.

Productionize ML workloads using Amazon SageMaker MLOps

September 26, 2023

Conference, AWS Cloud Day Zurich, Zurich, Switzerland

This talk presented a comprehensive overview of MLOps on AWS, covering the journey from experimental notebooks to production-ready ML systems using Amazon SageMaker. Starting from the premise that ML code is only a small fraction of a real-world ML system, the session walked through an MLOps maturity framework across four phases — Initial, Repeatable, Reliable, and Scalable — mapping each to specific AWS services and capabilities. Topics included SageMaker Studio for experimentation, SageMaker Experiments for tracking, SageMaker Pipelines for workflow automation, Model Registry for versioning and promotion, SageMaker Projects for one-click CI/CD provisioning, shadow testing and deployment guardrails, Model Monitor for drift detection, and Model Cards and Dashboard for governance. The talk also covered team structures, multi-account strategies, and custom project templates for enterprise-scale MLOps.

Foundation Model Hosting and RAG on AWS

August 24, 2023

Meetup, Vienna Data Science Tools Meetup, Vienna, Austria

Presented on foundation model hosting options and demonstrated a Retrieval-Augmented Generation (RAG) application at the Vienna Data Science Tools Meetup, with approximately 60 attendees from the local data science community.

AWS DeepRacer Workshop

June 22, 2023

Workshop, AWS Summit Milan, Milan, Italy

Delivered two hands-on AWS DeepRacer workshop sessions (theory and practice) at AWS Summit Milan, covering reinforcement learning fundamentals and autonomous racing. Both sessions were at full capacity with 30+ attendees each. Delivered in Italian.

MLOps on AWS — Italian Track

February 23, 2023

Conference, AWS Innovate — AI/ML Edition, Online

Delivered an MLOps session as part of the Italian-language track at AWS Innovate, covering end-to-end machine learning workflows on Amazon SageMaker. Also participated in an Ask The Expert panel and live Q&A session. Approximately 200 attendees.

MLOps — Automazione per il Machine Learning

October 01, 2022

Podcast, AWS Italy Podcast, Online

Recorded a 40-minute podcast episode on MLOps with Alex Casalboni for the official AWS Italy Podcast, discussing automation strategies for machine learning workflows, SageMaker Pipelines, experiment tracking, and best practices for production ML systems.

MLflow on AWS and SageMaker Integration

September 22, 2022

Meetup, AWS Vienna Meetup, Vienna, Austria

Organized and hosted the first AWS Vienna Meetup at the AWS Vienna office, presenting on MLflow integration with Amazon SageMaker and open-source ML tooling on AWS. The event reached full capacity with 35 attendees.

Paolo Di Francesco

Talks and presentations