Trained a GPT-2 model on a custom dataset to generate love quotes. Running the trained model with ONNX Runtime makes inference about 4x faster. The model is then quantized from FP32 to INT8, which reduces its size by about 3x without much loss in accuracy.
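
To make the FP32-to-INT8 step concrete, here is a minimal pure-Python sketch of symmetric per-tensor quantization. It is an illustration only and not this repository's code: the weight values and the helper names `quantize_int8` and `dequantize_int8` are hypothetical, and ONNX Runtime's own quantization tooling works on whole model graphs rather than plain lists.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to int8 values in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # One shared scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from int8 codes and the shared scale."""
    return [v * scale for v in quantized]


# Hypothetical weights, chosen only for illustration.
weights = [0.5, -1.27, 0.0, 0.33, 1.0]

fp32 = array("f", weights)          # 4 bytes per value
q, scale = quantize_int8(weights)   # 1 byte per value
restored = dequantize_int8(q, scale)

fp32_bytes = len(fp32) * fp32.itemsize
int8_bytes = len(q) * q.itemsize
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"FP32: {fp32_bytes} bytes, INT8: {int8_bytes} bytes")
print(f"largest round-trip error: {max_error:.6f}")
```

Each quantized weight takes one byte instead of four, and the round-trip error stays within half a quantization step. A real model usually shrinks by less than 4x, since some tensors typically stay in FP32 and the scales have to be stored too, which is in line with the roughly 3x reduction reported above.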