<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: int8 Quantization for TFLite 32 model in i.MX Processors</title>
    <link>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277647#M174142</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, I can attach my .pb model and my .tflite files. The float32 and float16 models run fine; I only get the error for int8.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;P.S. I used version 2.4, which I think is the most recent one. I hope this works.&lt;/P&gt;</description>
    <pubDate>Mon, 17 May 2021 14:42:05 GMT</pubDate>
    <dc:creator>BrunoSenzio</dc:creator>
    <dc:date>2021-05-17T14:42:05Z</dc:date>
    <item>
      <title>int8 Quantization for TFLite 32 model</title>
      <link>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277185#M174078</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I tried to quantize my float32 TF Lite model to int8, but when I benchmark the int8 model on the GPU/NPU I get the following error:&lt;/P&gt;&lt;P&gt;STARTING!&lt;BR /&gt;Duplicate flags: num_threads&lt;BR /&gt;Min num runs: [50]&lt;BR /&gt;Min runs duration (seconds): [1]&lt;BR /&gt;Max runs duration (seconds): [150]&lt;BR /&gt;Inter-run delay (seconds): [-1]&lt;BR /&gt;Num threads: [1]&lt;BR /&gt;Use caching: [0]&lt;BR /&gt;Benchmark name: []&lt;BR /&gt;Output prefix: []&lt;BR /&gt;Min warmup runs: [1]&lt;BR /&gt;Min warmup runs duration (seconds): [0.5]&lt;BR /&gt;Graph: [FishDetectModel_1k_int8.tflite]&lt;BR /&gt;Input layers: []&lt;BR /&gt;Input shapes: []&lt;BR /&gt;Input value ranges: []&lt;BR /&gt;Input layer values files: []&lt;BR /&gt;Allow fp16 : [0]&lt;BR /&gt;Require full delegation : [0]&lt;BR /&gt;Enable op profiling: [0]&lt;BR /&gt;Max profiling buffer entries: [1024]&lt;BR /&gt;CSV File to export profiling data to: []&lt;BR /&gt;Enable platform-wide tracing: [0]&lt;BR /&gt;#threads used for CPU inference: [1]&lt;BR /&gt;Max number of delegated partitions : [0]&lt;BR /&gt;Min nodes per partition : [0]&lt;BR /&gt;Loaded model FishDetectModel_1k_int8.tflite&lt;BR /&gt;ERROR: Didn't find op for builtin opcode 'CONV_2D' version '5'&lt;/P&gt;&lt;P&gt;ERROR: Registration failed.&lt;/P&gt;&lt;P&gt;Failed to initialize the interpreter&lt;BR /&gt;Benchmarking failed.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;What could be the problem?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Sat, 15 May 2021 15:32:43 GMT</pubDate>
      <guid>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277185#M174078</guid>
      <dc:creator>BrunoSenzio</dc:creator>
      <dc:date>2021-05-15T15:32:43Z</dc:date>
    </item>
    <item>
      <title>Re: int8 Quantization for TFLite 32 model</title>
      <link>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277616#M174138</link>
      <description>&lt;P&gt;Hello Bruno,&lt;/P&gt;
&lt;P&gt;The saved model is incomplete. Can you share a SavedModel directory that we can load directly? Also, it looks like the TensorFlow version in use does not have CONV_2D version '5'. On the i.MX8 you need to use a recent version of the TF runtime, or at least the same TF version that was used for conversion.&lt;/P&gt;
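&lt;P&gt;As a rough illustration of the version rule above, the following Python sketch compares the TF version used for conversion with the TF runtime version on the board. The helper names and the example version strings are hypothetical, and passing this check does not by itself guarantee that every op version is supported; it only flags the obvious case of a runtime older than the converter.&lt;/P&gt;
&lt;PRE&gt;
```python
import re


def version_tuple(version):
    """Turn a version string such as '2.4' or '2.4.0-rc1' into a (major, minor, patch) tuple."""
    # Keep the first three numeric fields and pad missing ones with zeros.
    numbers = [int(n) for n in re.findall(r"\d+", version)[:3]]
    while len(numbers) != 3:
        numbers.append(0)
    return tuple(numbers)


def runtime_is_new_enough(converter_version, runtime_version):
    """Return True when the runtime is at least as new as the converter (hypothetical helper)."""
    return version_tuple(runtime_version) >= version_tuple(converter_version)


if __name__ == "__main__":
    # Placeholder values: substitute the versions from your host and your board image.
    converter = "2.4.0"
    runtime = "2.3.1"
    if runtime_is_new_enough(converter, runtime):
        print("Runtime is at least as new as the converter.")
    else:
        print("Runtime is older than the converter; newer op versions may be missing.")
```
&lt;/PRE&gt;
&lt;P&gt;If the check fails, either update the runtime on the board or redo the conversion with the TF version that matches it.&lt;/P&gt;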
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Regards&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 17 May 2021 13:00:11 GMT</pubDate>
      <guid>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277616#M174138</guid>
      <dc:creator>Bio_TICFSL</dc:creator>
      <dc:date>2021-05-17T13:00:11Z</dc:date>
    </item>
    <item>
      <title>Re: int8 Quantization for TFLite 32 model</title>
      <link>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277647#M174142</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, I can attach my .pb model and my .tflite files. The float32 and float16 models run fine; I only get the error for int8.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;P.S. I used version 2.4, which I think is the most recent one. I hope this works.&lt;/P&gt;</description>
      <pubDate>Mon, 17 May 2021 14:42:05 GMT</pubDate>
      <guid>https://community.nxp.com/t5/i-MX-Processors/int8-Quantization-for-TFLite-32-model/m-p/1277647#M174142</guid>
      <dc:creator>BrunoSenzio</dc:creator>
      <dc:date>2021-05-17T14:42:05Z</dc:date>
    </item>
  </channel>
</rss>

