@Alex
For 3rd party LLM prompt processing (pp) speed only the iGPU gaming performance matters.
-> So, if M5' iGPU perf increased by "up to 30%"² vs M4, then the 3rd party AI/LLM prompt processing speed will also increase by roughly the same amount.
For LLM token generation (tg), only the memory bandwidth matters (assuming one has enough unified memory to fit a LLM in the first place (so, the memory size matters too)).
-> So, if the memory bandwidth increased by 12.5% (=307.2 GB/s (M5 Pro)/273 GB/s (M4 Pro))¹, then the AI/LLM token generation speed will also increase by roughly the same amount.
¹ en.wikipedia.org/wiki/MacBook_Pro_(Apple_silicon)
² en.wikipedia.org/wiki/Apple_M5#Performance
APPLE's claim of
Quote from: en.wikipedia.org/wiki/Apple_M5#PerformancePeak GPU AI compute: over 4× faster
is going to be (or rather: will be) relevant only to their own, tightly integrated, aka 1st party, solutions, not 3rd party LLMs. But, ofc, I have nothing against if the 3rd party LLM pp and tg speeds are tested.