Announcing accelerated training with ONNX Runtime—train models up to 45% faster 


ONNX Runtime is an open source project that is designed to accelerate machine learning across a wide range of frameworks, operating systems, and hardware platforms. It is used extensively in Microsoft products, like Office 365 and Bing, delivering over 20 billion inferences every day and up to 17 times faster inferencing. Today we are introducing…


Microsoft open sources breakthrough optimizations for transformer inference on GPU and CPU 


This post is co-authored by Emma Ning, Azure Machine Learning; Nathan Yan, Azure Machine Learning; Jeffrey Zhu, Bing; Jason Li, Bing. One of the most popular deep learning models used for natural language processing is BERT (Bidirectional Encoder Representations from Transformers). Due to the significant computation required, inferencing BERT at high scale can be extremely…
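
For readers who want to try the general workflow the post describes, the sketch below exports a Hugging Face BERT checkpoint to ONNX and runs it through ONNX Runtime. It is a minimal illustration, not the optimized pipeline from the post; the checkpoint name, output path, and opset version are arbitrary example choices.

```python
# Minimal sketch: export a BERT model to ONNX, then run it with ONNX Runtime.
# The checkpoint name, output path, and opset version are example choices,
# not values taken from the post.
import torch
import onnxruntime as ort
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_name = "bert-base-uncased"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).eval()

# Trace and export the model once with a sample batch.
sample = tokenizer("ONNX Runtime speeds up BERT inference.", return_tensors="pt")
torch.onnx.export(
    model,
    (sample["input_ids"], sample["attention_mask"]),
    "bert.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=["logits"],
    dynamic_axes={"input_ids": {0: "batch", 1: "seq"},
                  "attention_mask": {0: "batch", 1: "seq"}},
    opset_version=14,
)

# Serve the exported graph with ONNX Runtime instead of PyTorch.
session = ort.InferenceSession("bert.onnx", providers=["CPUExecutionProvider"])
logits = session.run(
    ["logits"],
    {"input_ids": sample["input_ids"].numpy(),
     "attention_mask": sample["attention_mask"].numpy()},
)[0]
print(logits.shape)
```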


ONNX joins Linux Foundation 


Today the Open Neural Network eXchange (ONNX) is joining the LF AI Foundation, an umbrella foundation of the Linux Foundation supporting open source innovation in artificial intelligence, machine learning, and deep learning. ONNX was co-founded by Microsoft in 2017 to make it easier to create and deploy machine learning applications. In the past few years…

Announcing ONNX Runtime 1.0 


One year after ONNX Runtime’s initial preview release, we’re excited to announce v1.0 of the high-performance machine learning model inferencing engine. This release marks our commitment to API stability for the cross-platform, multi-language APIs, and introduces a breadth of performance optimizations, broad operator coverage, and pluggable accelerators to take advantage of new and exciting hardware…
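
The “pluggable accelerators” mentioned above are surfaced as execution providers that you select when creating a session. The sketch below is a hedged illustration of that pattern; the model path is a placeholder, and it assumes a model with a single float32 input.

```python
# Minimal sketch: pick execution providers ("pluggable accelerators") for an
# ONNX Runtime session. "model.onnx" is a placeholder path, and the model is
# assumed to take a single float32 input.
import numpy as np
import onnxruntime as ort

# Prefer the CUDA provider when it is available, otherwise fall back to CPU.
preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
providers = [p for p in preferred if p in ort.get_available_providers()]

session = ort.InferenceSession("model.onnx", providers=providers)
print("running with:", session.get_providers())

# Build a dummy input that matches the model's first declared input.
meta = session.get_inputs()[0]
shape = [d if isinstance(d, int) else 1 for d in meta.shape]  # resolve dynamic dims
outputs = session.run(None, {meta.name: np.random.rand(*shape).astype(np.float32)})
```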

Now available: ONNX Runtime 0.5 with support for edge hardware acceleration 


ONNX Runtime 0.5, the latest update to the open source high performance inference engine for ONNX models, is now available. This release improves the customer experience and supports inferencing optimizations across hardware platforms. Since the last release in May, Microsoft teams have deployed an additional 45+ models that leverage ONNX Runtime for inferencing. These models…

ONNX Runtime: a one-stop shop for machine learning inferencing 


Organizations that want to leverage AI at scale must overcome a number of challenges around model training and model inferencing. Today, there are a plethora of tools and frameworks that accelerate model training, but inferencing remains a tough nut to crack due to the variety of environments that models need to run in. For example, the same…
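
As a concrete, hedged illustration of that idea, the sketch below trains a small scikit-learn model, converts it to ONNX with skl2onnx, and serves it through the same ONNX Runtime session API used for models exported from any other framework. The dataset and model choice are arbitrary examples.

```python
# Minimal sketch: convert a scikit-learn model to ONNX and run it with the same
# ONNX Runtime API used for models from other frameworks. Dataset and model are
# arbitrary example choices.
import numpy as np
import onnxruntime as ort
from skl2onnx import to_onnx
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
X = X.astype(np.float32)
clf = LogisticRegression(max_iter=1000).fit(X, y)

# The sample row tells the converter the input type and shape.
onnx_model = to_onnx(clf, X[:1])

# The serving side only sees an ONNX graph, regardless of the training framework.
sess = ort.InferenceSession(onnx_model.SerializeToString(),
                            providers=["CPUExecutionProvider"])
input_name = sess.get_inputs()[0].name
predicted_labels = sess.run(None, {input_name: X[:5]})[0]
print(predicted_labels)
```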


Open Source Weekly #6 


This week’s Microsoft Connect(); event has been a demo-packed few days, highlighting Microsoft’s continuing commitment to delivering open technologies and contributing to and partnering with open source communities. From joining the MariaDB Foundation to launching a new Apache Spark-based analytics platform and previewing Visual Studio Code Live Share, there’s a ton of open source goodness…