I am new to opencl, I am working on opencl on Imx6 dual board, and I ran vector computation
example given in white paper for imx6, and I am profiling each and every stage of it using CL_QUEUE_PROFILING_ENABLE while creating command queue
and profiling EnqueNDRangekernel using event and CL_PROFILING_COMMAND_START and CL_PROFILING_COMMAND_END, it shows 18ms for
EnqueNDRangekernel, I want to understand how it is taking 18ms, I am following below link for vector compute example:
Is there any other profiling tool for opencl where I can see and validate my profiling.
Please note I am using poky toolcahin from yocto.