<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic 8M Plus Capabilities : LLM &amp;amp; CV Models in i.MX Processors</title>
    <link>https://community.nxp.com/t5/i-MX-Processors/8M-Plus-Capabilities-LLM-amp-CV-Models/m-p/2104486#M237603</link>
    <description>&lt;P&gt;Hi There,&lt;BR /&gt;&lt;BR /&gt;I have few questions regarding using 8M Plus for running LLMs &amp;amp; CV Models.&lt;/P&gt;&lt;P&gt;1. Since 8M Plus offers 2.3 TOPS of AI performance which generally is just enough to run CNN models or smaller models like Bert or CV models like &lt;STRONG&gt;MobileNet&lt;/STRONG&gt;. Just looking at the TOPS is it really possible to run a Q4 quantized model like &lt;STRONG&gt;DeepSeek R1 1.5B&lt;/STRONG&gt; which is almost 1 GB in Q4 GGUF format? (even tflite conversion will be quite heavy size, I believe)&lt;BR /&gt;&lt;BR /&gt;2. On the other hand the conversion process of LLM models like &lt;STRONG&gt;DeepSeek r1 1.5B&lt;/STRONG&gt; is not straight forward, gives errors. Makes me hard to believe this could be converted even successfully, has someone did that before?&lt;BR /&gt;&lt;BR /&gt;3. Looks like the devices which can give 50+ TOPS could be considered only for running these models in order to have a normal inference performance.&lt;BR /&gt;&lt;BR /&gt;Please help me on this.&lt;BR /&gt;&lt;BR /&gt;&lt;LI-PRODUCT title="IMX8MPLUS" id="IMX8MPLUS"&gt;&lt;/LI-PRODUCT&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 26 May 2025 13:09:17 GMT</pubDate>
    <dc:creator>ankushdineshrana</dc:creator>
    <dc:date>2025-05-26T13:09:17Z</dc:date>
    <item>
      <title>8M Plus Capabilities : LLM &amp; CV Models</title>
      <link>https://community.nxp.com/t5/i-MX-Processors/8M-Plus-Capabilities-LLM-amp-CV-Models/m-p/2104486#M237603</link>
      <description>&lt;P&gt;Hi There,&lt;BR /&gt;&lt;BR /&gt;I have few questions regarding using 8M Plus for running LLMs &amp;amp; CV Models.&lt;/P&gt;&lt;P&gt;1. Since 8M Plus offers 2.3 TOPS of AI performance which generally is just enough to run CNN models or smaller models like Bert or CV models like &lt;STRONG&gt;MobileNet&lt;/STRONG&gt;. Just looking at the TOPS is it really possible to run a Q4 quantized model like &lt;STRONG&gt;DeepSeek R1 1.5B&lt;/STRONG&gt; which is almost 1 GB in Q4 GGUF format? (even tflite conversion will be quite heavy size, I believe)&lt;BR /&gt;&lt;BR /&gt;2. On the other hand the conversion process of LLM models like &lt;STRONG&gt;DeepSeek r1 1.5B&lt;/STRONG&gt; is not straight forward, gives errors. Makes me hard to believe this could be converted even successfully, has someone did that before?&lt;BR /&gt;&lt;BR /&gt;3. Looks like the devices which can give 50+ TOPS could be considered only for running these models in order to have a normal inference performance.&lt;BR /&gt;&lt;BR /&gt;Please help me on this.&lt;BR /&gt;&lt;BR /&gt;&lt;LI-PRODUCT title="IMX8MPLUS" id="IMX8MPLUS"&gt;&lt;/LI-PRODUCT&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 26 May 2025 13:09:17 GMT</pubDate>
      <guid>https://community.nxp.com/t5/i-MX-Processors/8M-Plus-Capabilities-LLM-amp-CV-Models/m-p/2104486#M237603</guid>
      <dc:creator>ankushdineshrana</dc:creator>
      <dc:date>2025-05-26T13:09:17Z</dc:date>
    </item>
    <item>
      <title>Re: 8M Plus Capabilities : LLM &amp; CV Models</title>
      <link>https://community.nxp.com/t5/i-MX-Processors/8M-Plus-Capabilities-LLM-amp-CV-Models/m-p/2105494#M237689</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.nxp.com/t5/user/viewprofilepage/user-id/250903"&gt;@ankushdineshrana&lt;/a&gt;!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;At the moment you can refer to this &lt;A href="https://github.com/nxp-appcodehub/dm-eiq-genai-flow-demonstrator?tab=readme-ov-file" target="_self"&gt;demonstration&lt;/A&gt;&amp;nbsp;from now it is working for iMX95 only but we are working to run this demos in iMX8MP this year.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards!&lt;/P&gt;
&lt;P&gt;Chavira&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jun 2025 14:30:20 GMT</pubDate>
      <guid>https://community.nxp.com/t5/i-MX-Processors/8M-Plus-Capabilities-LLM-amp-CV-Models/m-p/2105494#M237689</guid>
      <dc:creator>Chavira</dc:creator>
      <dc:date>2025-06-03T14:30:20Z</dc:date>
    </item>
  </channel>
</rss>

