Topic summary

Posted by Redaktion - Today at 08:55:18
A security researcher spent $1,500 running 13+ AI models against a deliberately vulnerable app. GPT-5.5 led with a 70% solve rate, DeepSeek V4 Pro solved it for $0.62 per attempt, and Gemini refused to engage almost entirely.

https://www.notebookcheck.net/GPT-5-5-dominates-1-500-LLM-hacking-test-while-Gemini-refuses-to-even-try.1315097.0.html