Well I think there are 2 types right ? Tensor cores (which afaik AMD dont have) which are better for matrix ops, and CUDO which are better for general parallel ops.
Maybe someone more clever than me can go into the specifics, I only understand the minimum of the low lvl GPU details.
Maybe someone more clever than me can go into the specifics, I only understand the minimum of the low lvl GPU details.
Nice high lvl document
[0] https://www.acecloudhosting.com/blog/cuda-cores-vs-tensor-co...