We need to support channel-wise INT16 quantization in one-quantize.
I reopened this issue to support int64 bias.
Things to do
one-quantize with int16 option
one-quantize can use int16 quantization by passing int16 to --quantized_dtype and channel to --granularity.
one-quantize \
--input_dtype float32 \
--quantized_dtype int16 \
--granularity channel \
--input_path ./inception_v3.circle \
--input_data ./inception_v3_test_data.h5 \
--output_path ./inception_v3.quantized.circle
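For readers unfamiliar with the term, here is a rough sketch of what channel granularity means for int16 weights. It is not the one-quantize implementation. It assumes symmetric quantization (zero point 0) over the range [-32767, 32767], and the function name `quantize_channelwise_int16` is hypothetical. The point is only that each channel gets its own scale rather than one scale for the whole tensor.

```python
INT16_MAX = 32767  # assumed symmetric range: [-32767, 32767]


def quantize_channelwise_int16(weights):
    """Quantize each channel with its own symmetric scale.

    `weights` is a list of channels, each a list of floats.
    Returns (quantized, scales), one scale per channel.
    Hypothetical helper for illustration only.
    """
    quantized, scales = [], []
    for channel in weights:
        max_abs = max(abs(w) for w in channel)
        # Fall back to a scale of 1.0 for an all-zero channel to avoid dividing by zero.
        scale = max_abs / INT16_MAX if max_abs > 0 else 1.0
        # Round to the nearest integer, then clamp into the int16 range.
        q = [max(-INT16_MAX, min(INT16_MAX, round(w / scale))) for w in channel]
        quantized.append(q)
        scales.append(scale)
    return quantized, scales
```

With layer granularity a single scale would be shared by all channels, so a channel with small values would use only a small part of the integer range.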
We only support channel-wise quantization (CWQ) for int16. If you run int16 quantization with layer granularity (the default), the following error message is printed.
one-quantize \
--input_dtype float32 \
--quantized_dtype int16 \
--input_path ./inception_v3.circle \
--input_data ./inception_v3_test_data.h5 \
--output_path ./inception_v3.quantized.circle
ERROR: Layer-wise quantization only supports uint8 dtype.
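If you drive one-quantize from a script, you may want to catch this combination before launching the tool. Below is a minimal sketch under that assumption. The helper name `build_one_quantize_cmd` is hypothetical and not part of ONE; it only assembles the flags shown in this issue and rejects int16 with any granularity other than channel.

```python
def build_one_quantize_cmd(input_path, input_data, output_path,
                           quantized_dtype, granularity="layer"):
    """Build the one-quantize argument list, rejecting unsupported combos.

    Hypothetical helper: the flags are the ones used in this issue,
    and "layer" is the default granularity described above.
    """
    if quantized_dtype == "int16" and granularity != "channel":
        # Fail early instead of waiting for one-quantize to report the error.
        raise ValueError("int16 quantization requires --granularity channel")
    return [
        "one-quantize",
        "--input_dtype", "float32",
        "--quantized_dtype", quantized_dtype,
        "--granularity", granularity,
        "--input_path", input_path,
        "--input_data", input_data,
        "--output_path", output_path,
    ]
```

The returned list can be passed to `subprocess.run(cmd, check=True)` on a machine where one-quantize is installed.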
CC @seanshpark
I am closing this issue because all the work required for int64 bias is done.