Hello All,
I need to iterate over an image captured from the camera subsystem on the i.MX8M Plus. The image is 1920×1200 greyscale, and copying one frame buffer currently takes about 15 ms, which suggests that the buffer memory is mapped as non-cacheable.
On previous ARM systems I have worked with, DMA buffers could be mapped as cache-coherent (for example using the dma-coherent device-tree property or similar mechanisms), which reduced a similar copy operation to roughly 2 ms.
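To put the two timings in perspective, here is a rough back-of-the-envelope sketch of the effective copy bandwidth they imply. It assumes 8 bits per pixel (the pixel depth is my assumption here), and the function name is purely illustrative.

```python
def copy_bandwidth_mb_s(width, height, bytes_per_pixel, copy_ms):
    """Effective bandwidth in MB/s (1 MB = 1e6 bytes) for copying one frame."""
    frame_bytes = width * height * bytes_per_pixel
    return (frame_bytes / 1e6) / (copy_ms / 1e3)


WIDTH, HEIGHT = 1920, 1200
BYTES_PER_PIXEL = 1  # assumption: 8-bit greyscale

# Timings quoted in this post: ~15 ms on the i.MX8M Plus,
# ~2 ms on earlier systems with cache-coherent buffers.
slow = copy_bandwidth_mb_s(WIDTH, HEIGHT, BYTES_PER_PIXEL, 15.0)
fast = copy_bandwidth_mb_s(WIDTH, HEIGHT, BYTES_PER_PIXEL, 2.0)

print(f"frame size:  {WIDTH * HEIGHT * BYTES_PER_PIXEL} bytes")
print(f"15 ms copy:  {slow:.1f} MB/s")
print(f" 2 ms copy:  {fast:.1f} MB/s")
print(f"ratio:       {fast / slow:.1f}x")
```

Under that assumption a frame is about 2.3 MB, so the 15 ms copy works out to roughly 154 MB/s, against roughly 1.15 GB/s for the 2 ms case, a difference of about 7.5x.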
Currently the V4L2 capture buffers appear to be uncached, so both copying the buffer and iterating over it directly (zero-copy processing) are quite slow.
Is there a mechanism on the i.MX8M Plus to enable cache-coherent mappings for these buffers (for example via a device-tree configuration), or another recommended approach to improve CPU access performance?
Thanks