Deploying Hugging Face models with Amazon SageMaker and AWS Inferentia2
Julien Simon

Published on Mar 29, 2024

In this video, I walk you through the simple process of deploying a Hugging Face large language model on AWS, with Amazon SageMaker and the AWS Inferentia2 accelerator.

⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at https://julsimon.substack.com. ⭐️⭐️⭐️

Notebook:
https://gitlab.com/juliensimon/huggin...

Deep Dive: Hugging Face models on AWS AI Accelerators

Blog posts:
https://huggingface.co/blog/how-to-ge...
https://aws.amazon.com/blogs/machine-...
