Well AMD really needs to work on its software stack to support AI.

Indeed, are you aware of the current state of AMD's equivalent to CUDA? And how far behind would be, and what would need to be done to near equivalence? It seems like the opportunity and differentiator. I wonder how Apple silicon software is doing...

Instead of trying to integrate the whole stack of, say, pytorch, Apple's primary approach has been converting models to work with Apple's stack.

https://github.com/apple/coremltools

Clearly no one is going to be doing training or even fine tuning on Apple hardware at any scale (it competes at the low end, but at scale you invariably will be using nvidia hardware), but once you have a decent model it's a robust way of using it on Apple devices.