VLM Edge Studio

In this post, I want to share a quick walkthrough of VLM Edge Studio, an NXP launcher application designed to interact with supported Vision-Language Models running locally on FRDM i.MX platforms with Ara240 DNPU acceleration.

VLM Edge Studio provides a Qt/QML-based GUI for model selection, prompt input, and visual interaction with locally running VLMs at the edge. It communicates with the Ara240 Runtime SDK through the eIQ AAF Connector using a REST-based interface and streaming token responses.

 

Key Features

  • Local Vision-Language Model inference on supported i.MX platforms
  • Ara240 DNPU acceleration
  • GUI-based model selection and prompt input
  • Streaming token output
  • Integration with eIQ AAF Connector and Ara240 Runtime SDK
  • Support for camera-based visual input using a USB-C HD camera

 

Supported Model

  • Qwen2.5-VL-7B-Instruct-Ara240

This model is provided as an Ara240-compatible model.dvm file and is intended for local execution on the target platform.

 

Basic Installation

 

Make sure the Ara240 Runtime SDK is installed on the target board, then copy the Debian package from the host to the board (the command below places it in root's home directory):

scp vlm-edge-studio.deb root@<ip_addr>:

Then, on the board, install it with:

dpkg -i vlm-edge-studio.deb

The installation may take a few minutes because the model needs to be extracted during setup.
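Once dpkg finishes, it can be useful to confirm that the package really ended up installed. The following is a minimal sketch, not part of VLM Edge Studio: the helper name deb_is_installed is hypothetical, and it assumes the .deb file is still present on the board. It reads the package name from the archive itself rather than guessing it from the file name.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with VLM Edge Studio):
# succeed only if the package contained in the given .deb is fully installed.
deb_is_installed() {
    deb_file="$1"

    # Read the "Package" control field from the archive.
    pkg=$(dpkg-deb -f "$deb_file" Package) || return 2

    # Ask dpkg for the installation status of that package.
    # dpkg-query fails if the package is unknown to dpkg.
    status=$(dpkg-query -W -f='${Status}' "$pkg" 2>/dev/null) || return 1

    # A fully installed package reports exactly this status string.
    [ "$status" = "install ok installed" ]
}

# Example call on the board:
#   deb_is_installed vlm-edge-studio.deb && echo "package installed"
```

A non-zero return value means either the archive could not be read or dpkg does not report the package as fully installed, in which case rerunning dpkg -i and reading its output is the next step.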

 

Running VLM Edge Studio

 

Before launching, make sure the Ara240 runtime service is running:

systemctl status rt-sdk-ara2.service --no-pager -l

Then start the application with:

run_vlm_edge_studio

Once the GUI appears, click LOAD to load the model. After the model is ready, enter a prompt and submit it to interact with the VLM locally on the i.MX platform.
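The pre-launch check can also be scripted so that the application starts only when the runtime service is active. This is a minimal sketch under stated assumptions: the service name and launcher command are the ones used in this post, while the wrapper function launch_vlm_edge_studio is a hypothetical name introduced here for illustration.

```shell
#!/bin/sh
# Service and launcher names are taken from this post.
SERVICE="rt-sdk-ara2.service"
APP="run_vlm_edge_studio"

# Hypothetical wrapper: start the GUI only if the Ara240 runtime service is active.
launch_vlm_edge_studio() {
    # "is-active --quiet" prints nothing and exits 0 only when the unit is active.
    if systemctl is-active --quiet "$SERVICE"; then
        "$APP"
    else
        # Point the user at the detailed status command instead of launching.
        echo "$SERVICE is not active; inspect it with:" >&2
        echo "  systemctl status $SERVICE --no-pager -l" >&2
        return 1
    fi
}

# Example call on the board:
#   launch_vlm_edge_studio
```

The wrapper deliberately does not try to start or restart the service. It only reports the problem, so the state of the runtime stays under the user's control.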

 

Walkthrough Video

 

In the attached video, I show how to launch VLM Edge Studio, load the supported Vision-Language Model, submit a prompt, and interact with the model running locally with Ara240 DNPU acceleration.

Video: https://community.nxp.com/t5/video/gallerypage/video-id/6396694743112

Summary

 

VLM Edge Studio is a useful tool for evaluating local Vision-Language Model inference on NXP i.MX platforms using Ara240 DNPU acceleration. It provides a simple workflow for loading the model, entering prompts, and interacting with visual-language AI directly at the edge.

 

Link

 

  • VLM Edge Studio repository: https://github.com/nxp-imx-support/vlm-edge-studio