VoiceSpot FAQ

cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

VoiceSpot FAQ

VoiceSpot FAQ

MeganHansen_0-1649271773569.png

 

VoiceSpot FAQ 

 

Q: What is VoiceSpot? 

A: VoiceSpot is a high performance wake word engine with very low memory and MHz footprint. It is highly suited for ultra-low power always-on applications where high recognition rates in noisy environments is important. 

 

Q: Can multiple wake words be supported? 

A: Yes. 

 

Q: Are custom wake words supported? 

A: Yes, custom wake words can be created as per customer request. A dataset is needed to create a custom wake word, which can either be provided by the customer or collected by NXP at additional cost. 

 

Q: What is the total memory and MHz footprint? 

A: The exact numbers depend on the configuration and target platform. A typical implementation uses a 33 kB model size requiring a total of 30 - 90 kB RAM and 33 – 65 kB of flash, depending on where the model and code are placed. Depending on the target platform, 5 – 20 MHz is required for processing. 

 

Q: What is the latency? 

A: The wake word is typically detected within +/- 200 ms of the subjective end of the wake word, which is faster than many other solutions. This helps reduce the latency perceived by the end-user. 

 

Q: What is the difference between VoiceSpot and VIT? 

A: VoiceSpot relies on a dataset for specifying the wake word whereas VIT uses text input to specify the wake word / commands. This makes VoiceSpot less flexible, but more efficient in terms of resources and with higher performance. VoiceSpot is therefore a great choice when high performance and efficiency is important and a dataset is already available or can be made available with reasonable effort. VIT is a great choice if flexibility is important and the additional platform resources needed is not an issue. 

%3CLINGO-SUB%20id%3D%22lingo-sub-1439914%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3EVoiceSpot%20FAQ%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-1439914%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3E%3CSPAN%20class%3D%22lia-inline-image-display-wrapper%20lia-image-align-inline%22%20image-alt%3D%22MeganHansen_0-1649271773569.png%22%20style%3D%22width%3A%20400px%3B%22%3E%3Cspan%20class%3D%22lia-inline-image-display-wrapper%22%20image-alt%3D%22MeganHansen_0-1649271773569.png%22%20style%3D%22width%3A%20400px%3B%22%3E%3Cimg%20src%3D%22https%3A%2F%2Fcommunity.nxp.com%2Ft5%2Fimage%2Fserverpage%2Fimage-id%2F176097i8D1E39CCF58C83C4%2Fimage-size%2Fmedium%3Fv%3Dv2%26amp%3Bpx%3D400%22%20role%3D%22button%22%20title%3D%22MeganHansen_0-1649271773569.png%22%20alt%3D%22MeganHansen_0-1649271773569.png%22%20%2F%3E%3C%2Fspan%3E%3C%2FSPAN%3E%0A%3CBR%20%2F%3E%0A%3CH2%20id%3D%22toc-hId--1327531565%22%20id%3D%22toc-hId--499285898%22%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EVoiceSpot%20FAQ%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FH2%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20What%20is%20VoiceSpot%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20VoiceSpot%20is%20a%20high%20performance%20wake%20word%20engine%20with%20very%20low%20memory%20and%20MHz%20footprint.%20It%20is%20highly%20suited%20for%20ultra-low%20power%20always-on%20applications%20where%20high%20recognition%20rates%20in%20noisy%20environments%20is%20important.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20Can%20multiple%20wake%20words%20be%20supported%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20Yes.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20Are%20custom%20wake%20words%20supported%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20Yes%2C%20custom%20wake%20words%20can%20be%20created%20as%20per%20customer%20request.%20A%20dataset%20is%20needed%20to%20create%20a%20custom%20wake%20word%2C%20which%20can%20either%20be%20provided%20by%20the%20customer%20or%20collected%20by%20NXP%20at%20additional%20cost.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20What%20is%20the%20total%20memory%20and%20MHz%20footprint%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20The%20exact%20numbers%20depend%20on%20the%20configuration%20and%20target%20platform.%20A%20typical%20implementation%20uses%20a%2033%20kB%20model%20size%20requiring%20a%20total%20of%2030%20-%2090%20kB%20RAM%20and%2033%20%E2%80%93%2065%20kB%20of%20flash%2C%20depending%20on%20where%20the%20model%20and%20code%20are%20placed.%20Depending%20on%20the%20target%20platform%2C%205%20%E2%80%93%2020%20MHz%20is%20required%20for%20processing.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20What%20is%20the%20latency%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20The%20wake%20word%20is%20typically%20detected%20within%20%2B%2F-%20200%20ms%20of%20the%20subjective%20end%20of%20the%20wake%20word%2C%20which%20is%20faster%20than%20many%20other%20solutions.%20This%20helps%20reduce%20the%20latency%20perceived%20by%20the%20end-user.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSTRONG%3E%3CSPAN%20data-contrast%3D%22auto%22%3EQ%3A%20What%20is%20the%20difference%20between%20VoiceSpot%20and%20VIT%3F%3C%2FSPAN%3E%3C%2FSTRONG%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3CP%3E%3CSPAN%20data-contrast%3D%22auto%22%3EA%3A%20VoiceSpot%20relies%20on%20a%20dataset%20for%20specifying%20the%20wake%20word%20whereas%20VIT%20uses%20text%20input%20to%20specify%20the%20wake%20word%20%2F%20commands.%20This%20makes%20VoiceSpot%20less%20flexible%2C%20but%20more%20efficient%20in%20terms%20of%20resources%20and%20with%20higher%20performance.%20VoiceSpot%20is%20therefore%20a%20great%20choice%20when%20high%20performance%20and%20efficiency%20is%20important%20and%20a%20dataset%20is%20already%20available%20or%20can%20be%20made%20available%20with%20reasonable%20effort.%20VIT%20is%20a%20great%20choice%20if%20flexibility%20is%20important%20and%20the%20additional%20platform%20resources%20needed%20is%20not%20an%20issue.%3C%2FSPAN%3E%3CSPAN%20data-ccp-props%3D%22%7B%22%20201341983%3D%22%22%3E%26nbsp%3B%3C%2FSPAN%3E%3C%2FP%3E%0A%3C%2FLINGO-BODY%3E%3CLINGO-TEASER%20id%3D%22lingo-teaser-1439914%22%20slang%3D%22en-US%22%20mode%3D%22CREATE%22%3E%3CP%3EInterested%20in%20learning%20more%20about%20NXP's%20VoiceSpot%20wake%20word%20engine%3F%20Read%20this%20FAQ%20for%20more%20info.%3C%2FP%3E%3C%2FLINGO-TEASER%3E
No ratings
Version history
Last update:
‎04-06-2022 12:04 PM
Updated by: