I recently received my FRDM iMX93 board and am interested in running a minimal LLM or Transformer on it. I found a document by NXP (GEN-AI-RAG-WHITEPAPER) that states TinyLlama has successfully run on iMX93, but there is no more technical information or resources available.

I know I need to ensure the model is compatible with TFLite, then quantize and compile it using eIQ and the Vela compiler, and finally run it on the iMX93 board. However, this seems like a long and complex process. I would prefer to have some kind of tutorial or at least a reliable reference to guide me through the steps.