The example for 8-bit quantization aware training runs perfectly. I am looking for 4-bit quantization. Unfortunately, I could not find it in the documentation. Please point me in the right direction.
Thanks
The example for 8-bit quantization aware training runs perfectly. I am looking for 4-bit quantization. Unfortunately, I could not find it in the documentation. Please point me in the right direction.
Thanks
You can find an example of a 4-bit dense layer at: