Faster inference for PyTorch models with OpenVINO Integration with Torch-ORT 

4 min read

Many developers opt to use popular AI Frameworks like PyTorch, which simplifies the process of analyzing predictions, training models, leveraging data, and refining future results. Read more

Join Microsoft at KubeCon + CloudNativeCon Europe 2022 

3 min read

Welcome to KubeCon Europe 2022. While I am unfortunately stuck in rainy Seattle (coldest start to May in 20 years), I’m excited that many of our cloud-native Azure folks will be able to experience sunny Valencia. It’s especially exciting for KubeCon to be the first chance for parts of the Azure Kubernetes Service (AKS) team Read more

Optimizing and deploying transformer INT8 inference with ONNX Runtime-TensorRT on NVIDIA GPUs 

5 min read

Mohit Ayani, Solutions Architect, NVIDIA Shang Zhang, Senior AI Developer Technology Engineer, NVIDIA Jay Rodge, Product Marketing Manager-AI, NVIDIA Transformer-based models have revolutionized the natural language processing (NLP) domain. Ever since its inception, transformer architecture has been integrated into models like Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformer (GPT) for performing tasks Read more

Scaling-up PyTorch inference: Serving billions of daily NLP inferences with ONNX Runtime 

8 min read

Scale, performance, and efficient deployment of state-of-the-art Deep Learning models are ubiquitous challenges as applied machine learning grows across the industry. We’re happy to see that the ONNX Runtime Machine Learning model inferencing solution we’ve built and use in high-volume Microsoft products and services also resonates with our open source community, enabling new capabilities that Read more

Supporting efficient large model training on AMD Instinct™ GPUs with DeepSpeed 

6 min read

This post was co-authored by Jithun Nair and Aswin Mathews, members of technical staff at AMD. In recent years, large-scale deep learning models have demonstrated impressive capabilities, excelling at tasks across natural language processing, computer vision, and speech domains. Companies now use these models to power novel AI-driven user experiences across a whole spectrum of Read more

Join us at the Build in the Open Happy Hour to connect and collaborate with Open Source enthusiasts 

1 min read

Open source has forever changed software development for the better. It has allowed developers from around the world to connect, collaborate, and build tomorrow’s technology. Microsoft culturally adopted an open mindset under Satya Nadella and internally changed the way software was developed. Microsoft is continually learning how to be a better open-source partner and support Read more

Announcing Azure Active Directory (Azure AD) workload identity for Kubernetes 

1 min read

Today, we are excited to announce an open-source project called Azure AD workload identity for Kubernetes. It leverages the public preview capability of Azure AD workload identity federation. With this project, developers can use native Kubernetes concepts of service accounts and federation to access Azure AD protected resources, such as Azure and Microsoft Graph, without needing Read more

Microsoft’s OSPO year in review—what we’ve learned 

7 min read

“Tech companies born with an open source mentality get it. It’s our ability to work together that makes our dreams believable and, ultimately, achievable. We must learn to build on the ideas of others”—Satya Nadella Whenever I teach open source at Microsoft, I open with this quote from Satya. Whether new to Microsoft, or learning Read more

Add AI to mobile applications with Xamarin and ONNX Runtime 

2 min read

ONNX Runtime now supports building mobile applications in C# with Xamarin. Support for Android and iOS is included in the ONNX Runtime release 1.10 NuGet package. This enables C# developers to build AI applications for Android and iOS to execute ONNX models on mobile devices with ONNX Runtime. ONNX Runtime is the open source project Read more

Ratify container supply chain in Kubernetes 

4 min read

Securing the software supply chain and verifying that chain is hard for any software, and containers running in Kubernetes are no exception. Operational best practices like image signing, scanning, provenance verification, and ensuring these operations have been properly completed with signed software bill of materials (SBoMs) are all required, and tons of tools are appearing Read more