I am using OpenCL to run object detection with YOLO. When I call the API "clCreateKernelsInProgram", it gets stuck.
My program has 15 kernels. What can I do to solve this?
By testing the kernels one by one, I found that my kernel function "softmax" is the problem: I cannot create it with clCreateKernel. If I delete some lines, such as "vstore16(localInput[i], 0, &x[i * 16]);", the kernel is created successfully. Could you give me some help?
Thank you very much.
Looking forward to your reply!
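When creating the kernels one by one as described above, the error code that clCreateKernel writes to its errcode_ret argument can narrow down why "softmax" is rejected. Below is a minimal sketch in C that turns the error codes documented for clCreateKernel into readable names. The function names create_kernel_error_name and report_create_kernel are hypothetical helpers, not part of OpenCL, and the numeric values are copied by hand from the Khronos cl.h header so that the sketch compiles without an OpenCL SDK. Real host code would include <CL/cl.h> and use the named constants instead.

```c
#include <stdio.h>

/* Hypothetical helper: maps the error codes that the OpenCL specification
   lists for clCreateKernel to their names. The numeric values mirror the
   constants in the Khronos cl.h header. */
const char *create_kernel_error_name(int err)
{
    switch (err) {
    case 0:   return "CL_SUCCESS";
    case -5:  return "CL_OUT_OF_RESOURCES";
    case -6:  return "CL_OUT_OF_HOST_MEMORY";
    case -30: return "CL_INVALID_VALUE";
    case -44: return "CL_INVALID_PROGRAM";
    case -45: return "CL_INVALID_PROGRAM_EXECUTABLE";
    case -46: return "CL_INVALID_KERNEL_NAME";
    case -47: return "CL_INVALID_KERNEL_DEFINITION";
    default:  return "unlisted error code";
    }
}

/* Hypothetical helper: prints one line per kernel so the failing one
   stands out when the kernels are created one at a time. */
void report_create_kernel(const char *kernel_name, int err)
{
    printf("clCreateKernel(\"%s\") -> %d (%s)\n",
           kernel_name, err, create_kernel_error_name(err));
}
```

In host code this would be called right after each clCreateKernel call, passing the kernel name and the value written to errcode_ret. If the program build itself is the suspect, the build log returned by clGetProgramBuildInfo with CL_PROGRAM_BUILD_LOG is usually the next thing to look at.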
OpenCL is version 2.1, and you have to use OpenCV 4.2. Keep in mind that OpenCL 3.1 doesn't have the Softmax function.
I am using OpenCL 1.2, and I don't use OpenCV.
I have fixed the kernel API problem. When I run it with yolov3-tiny, it takes about 170 s.
Why is it so slow? Is the cause my code or the GPU?
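One way to find out whether the time goes into the code or the GPU is to time each stage of the run separately (program build, kernel creation, data transfer, kernel execution) instead of only the total. Below is a minimal host-side sketch in C, assuming each stage can be wrapped in a function. The names stage_fn, time_stage_ms and dummy_stage are hypothetical, and dummy_stage only stands in for a real stage. Because OpenCL enqueue calls return before the work is done, a real stage function would need to call clFinish on its command queue before returning, otherwise the measured time would be too short.

```c
#include <stdio.h>
#include <time.h>

/* Hypothetical signature for one stage of the pipeline. */
typedef void (*stage_fn)(void *ctx);

/* Runs one stage, prints its wall-clock duration and returns it in ms.
   Uses C11 timespec_get, which reads the calendar clock (not monotonic). */
double time_stage_ms(const char *label, stage_fn fn, void *ctx)
{
    struct timespec t0, t1;
    timespec_get(&t0, TIME_UTC);
    fn(ctx);
    timespec_get(&t1, TIME_UTC);
    double ms = (double)(t1.tv_sec - t0.tv_sec) * 1e3
              + (double)(t1.tv_nsec - t0.tv_nsec) / 1e6;
    printf("%-20s %10.3f ms\n", label, ms);
    return ms;
}

/* Placeholder stage: busy loop standing in for real work such as
   building the program or running one network layer. */
void dummy_stage(void *ctx)
{
    unsigned long n = *(unsigned long *)ctx;
    volatile unsigned long sum = 0;
    for (unsigned long i = 0; i < n; i++) {
        sum += i;
    }
}
```

For example, time_stage_ms("dummy", dummy_stage, &iterations) prints one line for the placeholder stage. If most of the 170 s turns out to be in the program build or kernel creation, the inference itself may be much faster than the total suggests.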