Deploying Hugging Face models with Amazon SageMaker and AWS Inferentia2

107K subscribers

9,006 views

About
Share

Published On Mar 29, 2024

In this video, I walk you through the simple process of deploying a Hugging Face large language model on AWS, with Amazon SageMaker and the AWS Inferentia2 accelerator.

⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at https://julsimon.substack.com. ⭐️⭐️⭐️

Notebook:
https://gitlab.com/juliensimon/huggin...

Deep Dive: Hugging Face models on AWS AI Accelerators
• Deep Dive: Hugging Face models on AWS...

Blog posts:
https://huggingface.co/blog/how-to-ge...
https://aws.amazon.com/blogs/machine-...

Published On Mar 29, 2024

Share/Embed

Video Link