Quote from: davidm on January 29, 2025, 21:19:53
Strix Halo will be able to run larger models using shared RAM, but a lot more slowly than if they were running in VRAM, to the degree they won't be pleasant to use for applications that require responsiveness.

Maybe, but Halo has a bit of a leg up, especially considering the quad-channel 256-bit memory bus that gives it bandwidth similar to a 4060. It may not be able to churn out the same raw it/s as a 4090, but I can pretty much guarantee it will be as fast as or faster than a single 4090 when running a 70b model, which forces the card to swap out into system memory and slows it down to a couple of it/s or less. And it will do so in a 150W power envelope, vs close to 1000W.
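
For a rough sense of the numbers, here is a back-of-envelope sketch of the bandwidth argument. It assumes token generation is memory-bandwidth bound, so each generated token reads all the weights once. The LPDDR5X-8000 transfer rate and the 4-bit quantization are my assumptions, not figures from the quoted post, and the function names are just illustrative. Treat the result as a theoretical ceiling, not a benchmark.

```python
def peak_bandwidth_gbs(bus_width_bits: int, transfer_rate_mts: float) -> float:
    """Theoretical peak memory bandwidth in GB/s (bytes per transfer * MT/s)."""
    return bus_width_bits / 8 * transfer_rate_mts / 1000


def model_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GB, ignoring KV cache and overhead."""
    return params_billion * bits_per_weight / 8


def max_tokens_per_s(bandwidth_gbs: float, size_gb: float) -> float:
    """Upper bound on generation speed if every token reads all weights once."""
    return bandwidth_gbs / size_gb


# Assumed: 256-bit bus at 8000 MT/s (LPDDR5X-8000), 70b model at 4 bits/weight.
bandwidth = peak_bandwidth_gbs(256, 8000)
size = model_size_gb(70, 4)

print(f"peak bandwidth: {bandwidth:.0f} GB/s")
print(f"70b @ 4-bit:    {size:.0f} GB of weights")
print(f"ceiling:        {max_tokens_per_s(bandwidth, size):.1f} tokens/s")
```

Under those assumptions the weights come to about 35 GB, which is more than the 24 GB of VRAM on a 4090, so the card has to pull part of the model from system memory on every token. Real-world speeds will land below the ceiling printed here.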