ONNX

Join Microsoft at Open Source Summit North America 2024

April 9, 2024 4 min read

By Henry Yan, Senior Product Marketing Manager, Microsoft

Join Microsoft at Open Source Summit North America 2024, taking place in Seattle, Washington, from April 16 to 18, 2024. The event brings together open source developers, technologists, and community leaders to collaborate, share information, solve problems, and gain knowledge. Read more

Introducing ONNX Script: Authoring ONNX with the ease of Python

August 1, 2023 6 min read

By Aaron Bockover, Principal Engineer, Microsoft
Maanav Dalal, Program Manager, Microsoft
Ganesan Ramalingam, Principal Architect, Microsoft
Justin Chu, Software Engineer, Microsoft

ONNX Script is a new open-source library for directly authoring ONNX models in Python with a focus on clean, idiomatic Python syntax and composability through ONNX-native functions. Read more

Olive: A user-friendly toolchain for hardware-aware model optimization

June 26, 2023 4 min read

By Emma Ning, Principal Program Manager, AI Frameworks
Devang Patel, Principal Architect, AI Frameworks
Guoliang Hua, Principal Software Engineer Manager, Microsoft

Introducing Olive, an easy-to-use toolchain for optimizing models with hardware awareness. With Olive, you don’t need to be an expert to explore diverse hardware optimization toolchains. Read more

Automate optimization techniques for transformer models

June 26, 2023 3 min read

By Emma Ning, Principal Program Manager, AI Frameworks
Feng Tian, AI Architect, Intel
Yuwen Zhou, AI Engineer, Intel
Haihao Shen, Leading AI Architect, Intel
Saurabh Tangri, Principal AI Engineer, Intel

Intel has collaborated with Microsoft to integrate Intel® Neural Compressor into Olive, enabling developers to easily take advantage of model compression techniques in their deployment platform, including Intel processors and accelerators. Read more

Performant on-device inferencing with ONNX Runtime

February 8, 2023 6 min read

By Faith Xu, Principal Program Manager, Machine Learning Platform
Brian Lambert, Machine Learning Engineer, Pieces.app

The team at Pieces shares the problems and solutions evaluated for their on-device model serving stack and how ONNX Runtime enables their success. Read more

Live demos of machine learning models with ONNX and Hugging Face Spaces

June 6, 2022 5 min read

By Jacky Chen, Software Engineer, AI Frameworks

Choosing which machine learning model to use, sharing a model with a colleague, and quickly trying out a model are all reasons why you may find yourself wanting to quickly run inference on a model. You can configure your environment and download Jupyter notebooks, but it would be nicer if there was a way to Read more

Scaling-up PyTorch inference: Serving billions of daily NLP inferences with ONNX Runtime

April 19, 2022 8 min read

By Faith Xu, Principal Program Manager, Machine Learning Platform

Scale, performance, and efficient deployment of state-of-the-art Deep Learning models are ubiquitous challenges as applied machine learning grows across the industry. We’re happy to see that the ONNX Runtime Machine Learning model inferencing solution we’ve built and use in high-volume Microsoft products and services also resonates with our open source community, enabling new capabilities that Read more

Accelerate and simplify Scikit-learn model inference with ONNX Runtime

December 17, 2020 5 min read

By Xavier Dupre, Data Scientist at Microsoft
Olivier Grisel, Software engineer at Inria and core contributor to scikit-learn

Scikit-learn is one of the most useful libraries for general machine learning in Python. To minimize the cost of deployment and avoid discrepancies, deploying scikit-learn models to production usually leverages Docker containers and pickle, the object serialization module of the Python standard library. Docker is a good way to create consistent environments and pickle saves Read more

ONNX Runtime scenario highlight: Vespa.ai integration

December 14, 2020 1 min read

By Faith Xu, Principal Program Manager, Machine Learning Platform

Since its open source debut two years ago, ONNX Runtime has seen strong growth with performance improvements, expanded platform and device compatibility, hardware accelerator support, an extension to training acceleration, and more. We are excited by its broad usage in production, powering more than a hundred models across Microsoft products and services and bringing concrete Read more

Adding RoBERTa NLP to the ONNX model zoo for natural language predictions

November 24, 2020 3 min read

By Kundana Pillari, Student at the University of California, Irvine, Computer Science

In summer 2019, I worked as a high school intern for the ONNX AI team at Microsoft and loved working on various projects with the team, including the BERT text classification model. However, due to Covid-19, the Microsoft Internship Program for high school students was canceled in the summer of 2020. This led two other Read more

Introducing ONNX Runtime mobile – a reduced size, high performance package for edge devices

October 12, 2020 2 min read

By Scott McKay, Principal Software Engineer
Manash Goswami, Principal Program Manager, Machine Learning Platform

ONNX Runtime is an open source project that is designed to accelerate machine learning across a wide range of frameworks, operating systems, and hardware platforms. Today, we are excited to announce ONNX Runtime release v1.5 as part of our AI at Scale initiative. This release includes ONNX Runtime mobile, a new feature targeting smartphones and other Read more

GPT-2 fine-tuning with ONNX Runtime – a 34% speedup in training time

August 24, 2020 4 min read

By Aishwarya Bhandare, Software Engineer
Tianju Xu, Senior Software Engineer
Kshama Pawar, Senior Program Manager

Model training is an important step when developing and deploying large scale Artificial Intelligence (AI) models. Training typically utilizes a large amount of compute resources to tune the model based on the input dataset. Transformer models, with millions and billions of parameters, are especially compute-intensive and training costs increase with model size and fine-tuning steps Read more
