
Intel releases new graphics card with up to 32 GB VRAM

Started by Redaktion, Today at 14:00:23


Redaktion

Intel has released a new graphics card with up to 32 GB of VRAM. Also leveraging up to 32 Xe2 cores, the Arc Pro B70 will soon be joined by the cheaper Arc Pro B65, which Intel claims are 'cost-effective' yet 'high-performance solutions'.

https://www.notebookcheck.net/Intel-releases-new-graphics-card-with-up-to-32-GB-VRAM.1258638.0.html

48 GB VRAM when

Why not use GDDR7, which offers 3 GB per-chip density? Then this GPU could have:
256-bit bus / 32-bit per chip = 8 chips; 8 chips * 3 GB per chip = 24 GB VRAM; 24 GB * 2 (clamshell, i.e. chips on both sides of the PCB) = 48 GB VRAM. The memory bandwidth would also be about 30% higher, because it's GDDR7 rather than GDDR6.
Alternative calculation: 32 GB VRAM * 1.5 (3 GB per chip instead of the current 2 GB) = 48 GB VRAM.
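
For anyone who wants to verify the arithmetic, here is a minimal sketch of the calculation above (the 3 GB per-chip density and the clamshell layout are this post's assumptions, not confirmed Intel specs):

```python
# Sketch of the GDDR7 VRAM math from the post above.
# Assumptions (not Intel specs): 32-bit chips, 3 GB per chip, clamshell layout.

BUS_WIDTH_BITS = 256   # Arc Pro B70 memory bus
BITS_PER_CHIP = 32     # one GDDR7 chip per 32-bit channel
GB_PER_CHIP = 3        # assumed high-density GDDR7 module
CLAMSHELL = 2          # assumed chips on both sides of the PCB

chips = BUS_WIDTH_BITS // BITS_PER_CHIP    # 256 / 32 = 8 chips
vram_gb = chips * GB_PER_CHIP * CLAMSHELL  # 8 * 3 * 2 = 48 GB
print(vram_gb)                             # 48
```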
Frankly, when it comes to AI/LLMs, I'm not interested in 32 GB VRAM GPUs... and I said this over a year ago.
Let's see if NVIDIA gives us consumer 48 GB VRAM GPUs in the RTX 60 series (probably not, but NVIDIA did give us the RTX PRO 6000 Blackwell with 96 GB VRAM, which I also didn't expect).

Quote: The ECC memory is essential here, as a single bit flip could ruin a long render or an AI training run.
This GPU is not capable of training an LLM big enough for ECC to matter (even with runs of several days, it is unlikely that anything would happen). Neither is ECC necessary for fine-tuning with this GPU. Also, there are checkpoints, so one doesn't lose everything, and even if one did, on this GPU the cost of losing a run is a few bucks over a few days... in that ballpark. How do I know? I asked the big LLMs over at arena.ai (aka lmarena.ai).
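
To put a rough number on "a few bucks", here is a back-of-the-envelope sketch; the board power and electricity price are purely assumed illustration values, not measured figures for this card:

```python
# Rough electricity cost of losing a multi-day run on a card in this class.
# Both inputs are assumptions for illustration only.
BOARD_POWER_KW = 0.2   # assumed ~200 W sustained board power
EUR_PER_KWH = 0.30     # assumed electricity price
HOURS = 3 * 24         # a three-day run

cost_eur = BOARD_POWER_KW * HOURS * EUR_PER_KWH
print(f"{cost_eur:.2f} EUR")   # 4.32 EUR -- "a few bucks" indeed
```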

(en.wikipedia.org/wiki/GeForce_RTX_50_series#Desktop, en.wikipedia.org/wiki/Intel_Arc#Workstation_2)

  • RTX 5090: 1792 GB/s = 512-bit * 28 Gb/s / 8.
  • B70 Pro / B65 Pro: 608 GB/s = 256-bit * 19 Gb/s / 8.
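
The same formula in code, using the figures from the Wikipedia pages cited above:

```python
# Memory bandwidth = bus width (bits) * per-pin data rate (Gb/s) / 8 bits per byte.
def bandwidth_gb_s(bus_width_bits: int, data_rate_gbps: float) -> float:
    return bus_width_bits * data_rate_gbps / 8

rtx_5090 = bandwidth_gb_s(512, 28)   # 1792.0 GB/s
arc_b70 = bandwidth_gb_s(256, 19)    # 608.0 GB/s
print(round(rtx_5090 / arc_b70, 2))  # 2.95 -- roughly 3x
```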

So a consumer 5090 has roughly 3 times the memory bandwidth, and CUDA, of course. The only things these B70/B65 could have going for them are 48 GB of VRAM (if Intel went the GDDR7 route above) and people trying to make them work without CUDA.

For inference these are probably fine (NVIDIA cards also work just fine via Vulkan, only a bit slower than with CUDA), but then again, a 5090 has 3x the token generation speed and much faster prompt processing.

Do you agree or disagree?

Quote from: 48 GB VRAM when on Today at 15:37:40
Why

Found this to be pertinent to the discussion at hand:

reddit.com/r/LocalLLaMA/comments/1rzaz7r/my_experience_spending_2k_and_experimenting_on_a/

Interested in hearing your thoughts.

RobertJasiek

It seems you are asking whether to run AI locally or in the cloud. The answer is... surprise, surprise... it depends! E.g., if you use AI 24/7 over a long period, you are better off running it locally. If you have just a few quick queries per day but need more than cheap local hardware can handle, then pay for a cloud service.

However, cloud comes in two forms: a) already offered the way you want it (maybe a standard LLM), or b) you first need to program and upload. The latter is also sometimes used for short-term research projects (such as a mathematical proof whose maximum complexity is already established), for which the runtime is predictable, so you know in advance that a solution will be generated within the time and cost frame.

I have used deep neural net inference for about 7 hours per day, for years. For such a purpose, local hardware is by far more cost-efficient and always available; cloud would be astronomically expensive. Local computing cost depends on wattage and the resulting power bill: make a greedy hardware choice and soon you pay more for electricity than for the hardware. Your own solar park helps.
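
As a sketch of that break-even logic, here is a toy comparison; every number (hardware price, system draw, electricity price, cloud rate) is an assumed illustration value, not a quote from any vendor:

```python
# Toy break-even: buying a local GPU + paying electricity vs. renting in the cloud.
# All inputs are illustrative assumptions.
LOCAL_GPU_EUR = 1200       # assumed one-time hardware cost
SYSTEM_DRAW_KW = 0.3       # assumed power draw under load
EUR_PER_KWH = 0.30         # assumed electricity price
CLOUD_EUR_PER_HOUR = 1.00  # assumed hourly rate for a comparable cloud GPU
HOURS_PER_DAY = 7          # usage pattern from the post above

def costs_after(days: int) -> tuple[float, float]:
    hours = days * HOURS_PER_DAY
    local = LOCAL_GPU_EUR + hours * SYSTEM_DRAW_KW * EUR_PER_KWH
    cloud = hours * CLOUD_EUR_PER_HOUR
    return local, cloud

for days in (30, 180, 365):
    local, cloud = costs_after(days)
    print(days, round(local), round(cloud))
# 30   1219   210
# 180  1313   1260
# 365  1430   2555  -> with these assumptions, cloud overtakes local within ~half a year
```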

If you train deep neural nets, it depends. Local hardware may be enough but can already become expensive - from €500 to €100,000 everything is possible, not to mention the electricity bill. If you need giant hardware resources, cloud together with your own programming might be necessary, but then we are talking industrial-scale, open-ended expenses. There is, however, a third option: distributed computing via a worldwide community of enthusiasts. This has been done for some research projects and, e.g., for training KataGo (the neural nets for the game of Go).
