<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Recommendations for ASR, TTS and Transformer in Voice Technology</title>
    <link>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2185428#M87</link>
    <description>&lt;P&gt;Hey &lt;SPAN class=""&gt;&lt;A href="https://community.nxp.com/t5/user/viewprofilepage/user-id/172287" target="_self"&gt;&lt;SPAN class=""&gt;Laurent_P&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&amp;nbsp;, can you share how you implemented the Whisper tiny.en TFLite model on the NPU of i.MX93? I’ve been looking for this for ages, and it would really help me in development. I was able to convert the model to TFLite INT8, but the NPU doesn’t fully support all Whisper operations, so I have to use the float32 model on the CPU. Is it even possible to convert it and use it on the NPU?&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;thank you&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 14 Oct 2025 07:27:40 GMT</pubDate>
    <dc:creator>boopathi12</dc:creator>
    <dc:date>2025-10-14T07:27:40Z</dc:date>
    <item>
      <title>Recommendations for ASR, TTS and Transformer</title>
      <link>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2025255#M74</link>
      <description>&lt;P&gt;Hello, I am developing an application on i.MX 93 EVK.&amp;nbsp; I would appreciate recommendations for the following which have relatively low compute requirements while running on-device.&amp;nbsp; I am using a i.MX 93 Dual core 1.7GHz.&lt;BR /&gt;&lt;BR /&gt;I am looking for recommendations for&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;ASR engine with reasonable accuracy while running on-device.&amp;nbsp; Eg: Whisper tiny&lt;/LI&gt;&lt;LI&gt;TTS model&lt;/LI&gt;&lt;LI&gt;On-device transformer model such as Llama 3B&lt;BR /&gt;&lt;BR /&gt;Any information such as performance comparisons and recommendations would be appreciated.&amp;nbsp; Thank you&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Fri, 10 Jan 2025 00:41:16 GMT</pubDate>
      <guid>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2025255#M74</guid>
      <dc:creator>QuantumPath</dc:creator>
      <dc:date>2025-01-10T00:41:16Z</dc:date>
    </item>
    <item>
      <title>Re: Recommendations for ASR, TTS and Transformer</title>
      <link>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2109653#M81</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Hello&amp;nbsp;&amp;nbsp;&lt;a href="https://community.nxp.com/t5/user/viewprofilepage/user-id/245395"&gt;@QuantumPath&lt;/a&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;On i.MX93, we have enabled Whisper ASR (tiny, base, small) and Moonshine ASR (tiny and base).&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;We will deliver first the Whisper ASR as a Voice plugin through GStreamer by mid-July.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;For TTS, we have enabled ViTS TTS. For LLM, we can run small LLM like Danube 0.5B.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;In parallel, we have a complete eIQ Gen Al flow pipeline (Wake word, ASR, LLM, RAG, TTS) running on &amp;nbsp;i.MX95 here :&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;A href="https://github.com/nxp-appcodehub/dm-eiq-genai-flow-demonstrator?tab=readme-ov-file" target="_blank" rel="noopener"&gt;https://github.com/nxp-appcodehub/dm-eiq-genai-flow-demonstrator?tab=readme-ov-file&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jun 2025 16:04:22 GMT</pubDate>
      <guid>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2109653#M81</guid>
      <dc:creator>Laurent_P</dc:creator>
      <dc:date>2025-06-03T16:04:22Z</dc:date>
    </item>
    <item>
      <title>Re: Recommendations for ASR, TTS and Transformer</title>
      <link>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2185428#M87</link>
      <description>&lt;P&gt;Hey &lt;SPAN class=""&gt;&lt;A href="https://community.nxp.com/t5/user/viewprofilepage/user-id/172287" target="_self"&gt;&lt;SPAN class=""&gt;Laurent_P&lt;/SPAN&gt;&lt;/A&gt;&lt;/SPAN&gt;&amp;nbsp;, can you share how you implemented the Whisper tiny.en TFLite model on the NPU of i.MX93? I’ve been looking for this for ages, and it would really help me in development. I was able to convert the model to TFLite INT8, but the NPU doesn’t fully support all Whisper operations, so I have to use the float32 model on the CPU. Is it even possible to convert it and use it on the NPU?&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;thank you&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Oct 2025 07:27:40 GMT</pubDate>
      <guid>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2185428#M87</guid>
      <dc:creator>boopathi12</dc:creator>
      <dc:date>2025-10-14T07:27:40Z</dc:date>
    </item>
    <item>
      <title>Re: Recommendations for ASR, TTS and Transformer</title>
      <link>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2256903#M90</link>
      <description>&lt;P&gt;Hello,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can run Whisper ASR on i.MX93 CPU. We did not enabled NPU. On i.MX95, we can run Whisper ASR both on CPU and NPU.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Dec 2025 07:59:02 GMT</pubDate>
      <guid>https://community.nxp.com/t5/Voice-Technology/Recommendations-for-ASR-TTS-and-Transformer/m-p/2256903#M90</guid>
      <dc:creator>Laurent_P</dc:creator>
      <dc:date>2025-12-08T07:59:02Z</dc:date>
    </item>
  </channel>
</rss>

