Quote: "While GMKtec's test shows Nvidia's setup remains the hardware to beat for large-model, high-throughput operations"
Where? The largest model they tested was the 70B one, and the Strix Halo won on generation speed. It only lost on time to first token, which isn't really that important.
The DGX has been a complete flop.
Also, I am pretty sure I saw a YouTube video of this; the models tested and the results seem to match that video. Except it seems they didn't even use the video itself, but rather the data from a YouTube commenter who summarized it. You can tell because on the 70B model the Strix Halo scored 4.97/sec, not 4.9/sec, and the commenter left out the 7 at the end by accident.
The YouTube video is called:
NVIDIA DGX Spark – A Non-Sponsored Review (Strix Halo Comparison, Pros & Cons)
While it is nice to have this in text format rather than video, they should still credit whoever they got their data from; otherwise it is plagiarism. Or they could have just rented a DGX and done their own testing with more models, and not just language models but other workloads as well.