<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>MCX MicrocontrollersのトピックIMX95 npu</title>
    <link>https://community.nxp.com/t5/MCX-Microcontrollers/IMX95-npu/m-p/2325561#M4920</link>
    <description>&lt;P&gt;Dear NXP Support Team,&lt;/P&gt;&lt;P&gt;I am currently working with the NXP i.MX95 SoC using the following setup:&lt;/P&gt;&lt;P&gt;SoC: i.MX95&lt;/P&gt;&lt;P&gt;OS: Yocto 6.6 (Scarthgap)&lt;/P&gt;&lt;P&gt;# uname -a&lt;BR /&gt;Linux imx95 6.6.52&lt;BR /&gt;&lt;BR /&gt;Reference Document: Machine Learning User Guide (UG10166.pdf)&lt;/P&gt;&lt;P&gt;I have followed the steps outlined in the user guide. I am able to successfully execute the benchmark commands for CPU, GPU, and NPU, and the benchmarking results are generated as expected.&lt;/P&gt;&lt;P&gt;However, we are facing an issue with the image classification NPU example using label_image.&lt;/P&gt;&lt;P&gt;Location:&lt;/P&gt;&lt;P&gt;/usr/bin/tensorflow-lite-2.16.2/examples&lt;BR /&gt;CPU Execution (Working as Expected)&lt;/P&gt;&lt;P&gt;Command:&lt;/P&gt;&lt;P&gt;./label_image -m mobilenet_v1_1.0_224_quant.tflite -i grace_hopper.bmp -l labels.txt&lt;/P&gt;&lt;P&gt;Output:&lt;/P&gt;&lt;P&gt;INFO: Loaded model mobilenet_v1_1.0_224_quant.tflite&lt;BR /&gt;INFO: resolved reporter&lt;BR /&gt;INFO: Created TensorFlow Lite XNNPACK delegate for CPU.&lt;BR /&gt;INFO: invoked&lt;BR /&gt;INFO: average time: 12.897 ms&lt;BR /&gt;INFO: 0.768627: 653 military uniform&lt;BR /&gt;INFO: 0.105882: 907 Windsor tie&lt;BR /&gt;INFO: 0.0196078: 458 bow tie&lt;BR /&gt;INFO: 0.0117647: 466 bulletproof vest&lt;BR /&gt;INFO: 0.00784314: 835 suit&lt;/P&gt;&lt;P&gt;The classification results are correct.&lt;/P&gt;&lt;P&gt;NPU Execution (Issue Observed)&lt;/P&gt;&lt;P&gt;Command:&lt;/P&gt;&lt;P&gt;./label_image -m mobilenet_v1_1.0_224_quant.tflite -i grace_hopper.bmp -l labels.txt \&lt;BR /&gt;--external_delegate_path=/usr/lib/libneutron_delegate.so&lt;/P&gt;&lt;P&gt;Output:&lt;/P&gt;&lt;P&gt;INFO: Loaded model mobilenet_v1_1.0_224_quant.tflite&lt;BR /&gt;INFO: resolved reporter&lt;BR /&gt;INFO: EXTERNAL delegate created.&lt;BR /&gt;INFO: NeutronDelegate delegate: 29 nodes delegated out of 31 nodes with 1 partitions.&lt;BR /&gt;INFO: Applied EXTERNAL delegate.&lt;BR /&gt;INFO: Created TensorFlow Lite XNNPACK delegate for CPU.&lt;BR /&gt;INFO: invoked&lt;BR /&gt;INFO: average time: 0.207 ms&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Observation:&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The delegate reports that 29 out of 31 nodes are delegated.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;However, no image classification results are printed during NPU execution.&lt;BR /&gt;&lt;BR /&gt;The reported inference time (~0.207 ms) appears unusually low compared to expected NPU execution time.&lt;BR /&gt;&lt;BR /&gt;In contrast, CPU execution produces correct classification output.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;Could you please advise if this behavior is expected for Yocto 6.6 (Scarthgap)&lt;BR /&gt;&lt;BR /&gt;share steps for&amp;nbsp; Image calassifcation demo using NPU in 6.6.52&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;LI-PRODUCT title="iMX95" id="iMX95"&gt;&lt;/LI-PRODUCT&gt;&amp;nbsp;#NPU&lt;/P&gt;</description>
    <pubDate>Tue, 03 Mar 2026 12:09:11 GMT</pubDate>
    <dc:creator>shiva141</dc:creator>
    <dc:date>2026-03-03T12:09:11Z</dc:date>
    <item>
      <title>IMX95 npu</title>
      <link>https://community.nxp.com/t5/MCX-Microcontrollers/IMX95-npu/m-p/2325561#M4920</link>
      <description>&lt;P&gt;Dear NXP Support Team,&lt;/P&gt;&lt;P&gt;I am currently working with the NXP i.MX95 SoC using the following setup:&lt;/P&gt;&lt;P&gt;SoC: i.MX95&lt;/P&gt;&lt;P&gt;OS: Yocto 6.6 (Scarthgap)&lt;/P&gt;&lt;P&gt;# uname -a&lt;BR /&gt;Linux imx95 6.6.52&lt;BR /&gt;&lt;BR /&gt;Reference Document: Machine Learning User Guide (UG10166.pdf)&lt;/P&gt;&lt;P&gt;I have followed the steps outlined in the user guide. I am able to successfully execute the benchmark commands for CPU, GPU, and NPU, and the benchmarking results are generated as expected.&lt;/P&gt;&lt;P&gt;However, we are facing an issue with the image classification NPU example using label_image.&lt;/P&gt;&lt;P&gt;Location:&lt;/P&gt;&lt;P&gt;/usr/bin/tensorflow-lite-2.16.2/examples&lt;BR /&gt;CPU Execution (Working as Expected)&lt;/P&gt;&lt;P&gt;Command:&lt;/P&gt;&lt;P&gt;./label_image -m mobilenet_v1_1.0_224_quant.tflite -i grace_hopper.bmp -l labels.txt&lt;/P&gt;&lt;P&gt;Output:&lt;/P&gt;&lt;P&gt;INFO: Loaded model mobilenet_v1_1.0_224_quant.tflite&lt;BR /&gt;INFO: resolved reporter&lt;BR /&gt;INFO: Created TensorFlow Lite XNNPACK delegate for CPU.&lt;BR /&gt;INFO: invoked&lt;BR /&gt;INFO: average time: 12.897 ms&lt;BR /&gt;INFO: 0.768627: 653 military uniform&lt;BR /&gt;INFO: 0.105882: 907 Windsor tie&lt;BR /&gt;INFO: 0.0196078: 458 bow tie&lt;BR /&gt;INFO: 0.0117647: 466 bulletproof vest&lt;BR /&gt;INFO: 0.00784314: 835 suit&lt;/P&gt;&lt;P&gt;The classification results are correct.&lt;/P&gt;&lt;P&gt;NPU Execution (Issue Observed)&lt;/P&gt;&lt;P&gt;Command:&lt;/P&gt;&lt;P&gt;./label_image -m mobilenet_v1_1.0_224_quant.tflite -i grace_hopper.bmp -l labels.txt \&lt;BR /&gt;--external_delegate_path=/usr/lib/libneutron_delegate.so&lt;/P&gt;&lt;P&gt;Output:&lt;/P&gt;&lt;P&gt;INFO: Loaded model mobilenet_v1_1.0_224_quant.tflite&lt;BR /&gt;INFO: resolved reporter&lt;BR /&gt;INFO: EXTERNAL delegate created.&lt;BR /&gt;INFO: NeutronDelegate delegate: 29 nodes delegated out of 31 nodes with 1 partitions.&lt;BR /&gt;INFO: Applied EXTERNAL delegate.&lt;BR /&gt;INFO: Created TensorFlow Lite XNNPACK delegate for CPU.&lt;BR /&gt;INFO: invoked&lt;BR /&gt;INFO: average time: 0.207 ms&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;Observation:&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;The delegate reports that 29 out of 31 nodes are delegated.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;However, no image classification results are printed during NPU execution.&lt;BR /&gt;&lt;BR /&gt;The reported inference time (~0.207 ms) appears unusually low compared to expected NPU execution time.&lt;BR /&gt;&lt;BR /&gt;In contrast, CPU execution produces correct classification output.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;BR /&gt;Could you please advise if this behavior is expected for Yocto 6.6 (Scarthgap)&lt;BR /&gt;&lt;BR /&gt;share steps for&amp;nbsp; Image calassifcation demo using NPU in 6.6.52&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;LI-PRODUCT title="iMX95" id="iMX95"&gt;&lt;/LI-PRODUCT&gt;&amp;nbsp;#NPU&lt;/P&gt;</description>
      <pubDate>Tue, 03 Mar 2026 12:09:11 GMT</pubDate>
      <guid>https://community.nxp.com/t5/MCX-Microcontrollers/IMX95-npu/m-p/2325561#M4920</guid>
      <dc:creator>shiva141</dc:creator>
      <dc:date>2026-03-03T12:09:11Z</dc:date>
    </item>
    <item>
      <title>Re: IMX95 npu</title>
      <link>https://community.nxp.com/t5/MCX-Microcontrollers/IMX95-npu/m-p/2326641#M4951</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.nxp.com/t5/user/viewprofilepage/user-id/230279"&gt;@shiva141&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Continue in SFDC.&lt;BR /&gt;&lt;BR /&gt;Best Regards,&lt;BR /&gt;Zhiming&lt;/P&gt;</description>
      <pubDate>Thu, 05 Mar 2026 00:50:04 GMT</pubDate>
      <guid>https://community.nxp.com/t5/MCX-Microcontrollers/IMX95-npu/m-p/2326641#M4951</guid>
      <dc:creator>Zhiming_Liu</dc:creator>
      <dc:date>2026-03-05T00:50:04Z</dc:date>
    </item>
  </channel>
</rss>

