Post reply

July 09, 2026, 17:08:14

News:

Willkommen im Notebookcheck.com Forum! Hier können Sie über alle unsere Artikel und allgemein über notebookrelevante Dinge diskutieren. Viel Spass!

Main Menu

Home
Search

NotebookCHECK - Notebook Forum
► English
► Miscellaneous
► Post reply ( Re: A beginner’s guide to AI jailbreaks — Using Gandalf to learn safely )

Post reply

The message has the following error or errors that must be corrected before continuing:: Warning: this topic has not been posted in for at least 120 days.
Unless you're sure you want to reply, please consider starting a new topic.

Name
Email
Subject
Message icon

Other options

Return to this topic
Don't use smileys

Verification:

Please leave this box empty:

Shortcuts: ALT+S post or ALT+P preview

Topic summary

Posted by Redaktion

- December 08, 2025, 13:08:16

Chatbots come with built-in safeguards designed to prevent them from producing harmful, offensive, or otherwise inappropriate content. But researchers and hackers have shown that, even with multiple patches, AIs can still be vulnerable to certain inputs that bypass those guardrails. One way to explore the basics is through an online game called Gandalf.

https://www.notebookcheck.net/A-beginner-s-guide-to-AI-jailbreaks-Using-Gandalf-to-learn-safely.1180639.0.html