ChatGPT easily bypasses its own guardrails; all LLMs are inherently unsafe (opens in new tab)
One of the most important components of LLMs that tools like ChatGPT use are the so-called guardrails. These are boundaries that a model is not allowed to cross, and cannot cross. At least, that’s how it should be. However, hacker Kevin Zwaan and his team from Q-Cyber and the Hackers Love community demonstrate that LLMs […]
Read the original article