9523 MT/s is nice and all, but it's 2800 bucks for soldered only 32 GB RAM.. One can get a used/refurb 4060 Laptop for 800 (I get that this is not a gaming laptop) and it will be faster in games and the 5600 MT/s RAM will at least be upgradable to up to 64 GB and the 8 GB VRAM will allow for offloading AI LLM models partially or fully to.
Running AI LLM models locally and privately:But why would one need more than 32 GB RAM you may ask? 32 GB RAM allow only a limited context of ~32000 tokens (= ~24000 words) for a quant of the new 35B MoE SOTA LLM Qwen3.6-35B-A3B-UD-Q4_K_M. And agentic workflows often require more than 100000 tokens, so it will not work.
Quote from: reddit.com/r/LocalLLaMA/comments/1sq94qx/is_anyone_getting_real_coding_work_done_with.. I've come to the conclusion that (1) 32768 is the biggest context I can get away with in an adequately smart model, and (2) it just ain't enough.