Aishwarya Bhandare, Author at Microsoft Open Source Blog

GPT-2 fine-tuning with ONNX Runtime – a 34% speedup in training time

August 24, 2020 4 min read

By Aishwarya BhandareSoftware Engineer
Tianju XuSenior Software Engineer
Kshama PawarSenior Program Manager

Model training is an important step when developing and deploying large scale Artificial Intelligence (AI) models. Training typically utilizes a large amount of compute resources to tune the model based on the input dataset. Transformer models, with millions and billions of parameters, are especially compute-intensive and training costs increase with model size and fine-tuning steps Read more

Posts by Aishwarya Bhandare, Software Engineer

Follow OpenAtMicrosoft