Learn the basic mathematics behind quantization. Use following techniques to convert FP32 to INT8/INT4
Open the notebook locally or in Google colab
Run through the code to see Affine quantization at work
(Optional) Try out the Uniform quantization technique in notebook