8M Plus Capabilities : LLM & CV Models

取消
显示结果 
显示  仅  | 搜索替代 
您的意思是: 

8M Plus Capabilities : LLM & CV Models

1,296 次查看
ankushdineshrana
Contributor I

Hi There,

I have few questions regarding using 8M Plus for running LLMs & CV Models.

1. Since 8M Plus offers 2.3 TOPS of AI performance which generally is just enough to run CNN models or smaller models like Bert or CV models like MobileNet. Just looking at the TOPS is it really possible to run a Q4 quantized model like DeepSeek R1 1.5B which is almost 1 GB in Q4 GGUF format? (even tflite conversion will be quite heavy size, I believe)

2. On the other hand the conversion process of LLM models like DeepSeek r1 1.5B is not straight forward, gives errors. Makes me hard to believe this could be converted even successfully, has someone did that before?

3. Looks like the devices which can give 50+ TOPS could be considered only for running these models in order to have a normal inference performance.

Please help me on this.

IMX8MPLUS 

0 项奖励
回复
1 回复

1,259 次查看
Chavira
NXP TechSupport
NXP TechSupport

Hi @ankushdineshrana!

 

At the moment you can refer to this demonstration from now it is working for iMX95 only but we are working to run this demos in iMX8MP this year.

 

Best Regards!

Chavira

0 项奖励
回复