How does ChatGPT run on Azure? Inside Microsoft’s AI supercomputer

Get an inside look at the AI supercomputer infrastructure built to run ChatGPT and other large language models, and see how to leverage it for your workloads in Azure, at any scale.

Go behind the scenes in this video is from Microsoft Mechanics:

  • For how we collaborated with NVIDIA to deliver purpose-built AI infrastructure with NVIDIA GPUs
  • How Project Forge checkpointing works to restore job states if a long training job fails or needs to be migrated
  • How we used LoRA fine-tuning to update a fraction of the base model for more training throughput and smaller checkpoints
  • How UK-based company, Wayve, is using Azure’s AI supercomputer infrastructure for self-driving cars
  • And how Confidential Computing works with Azure AI to combine datasets without sharing personally identifiable information for secure multiparty collaborations.

Mark Russinovich, Azure CTO, joins Jeremy Chapman to break it down.

Frank

#DataScientist, #DataEngineer, Blogger, Vlogger, Podcaster at http://DataDriven.tv . Back @Microsoft to help customers leverage #AI Opinions mine. #武當派 fan. I blog to help you become a better data scientist/ML engineer Opinions are mine. All mine.