Using different quantization modes in Tensorflow?

Asked Jul 24 '17 at 17:18

Active Jul 31 '17 at 15:22

Viewed 271 times

TF's documentation on their quantization script appears to only include --mode=eightbit (July 24, 2017).

What are each of the modes used for? i.e. as indicated in quantize_graph.py (round, quantize, weights, weights_rounded):

flags.DEFINE_string("mode", "round",
                """What transformation to apply (round, quantize,"""
                """ eightbit, weights, or weights_rounded).""")

As a complementary question, I tried to trace through the script and noticed that eightbit doesn't actually call any of the functions that compute min/max values (ex. quantize_weight_eightbit). So what is mode=eightbit doing exactly?

edited Jul 31 '17 at 15:22

asked Jul 24 '17 at 17:18

jaydee713

Using different quantization modes in Tensorflow?

0 Answers0