GenAI vulnerable to prompt injection attacks

By Ian Barker
Published 3 months ago

New research shows that one in 10 prompt injection atempts against GenAI systems manage to bypass basic guardrails. Their non-deterministic nature also means failed attempts can suddenly succeed, even with identical content.

AI security company Pangea ran a Prompt Injection Challenge in March this year. The month-long initiative attracted more than 800 participants from 85 countries who attempted to bypass AI security guardrails across three virtual rooms with increasing levels of difficulty.

The challenge generated nearly 330,000 prompt injection attempts using more than 300 million tokens, creating a comprehensive dataset that reveals blindspots in how organizations are currently securing their AI applications.

"This challenge has given us unprecedented visibility into real-world tactics attackers are using against AI applications today," says Oliver Friedrichs, co-founder and CEO of Pangea. "The scale and sophistication of attacks we observed reveal the vast and rapidly evolving nature of AI security threats. Defending against these threats must be a core consideration for security teams, not a checkbox or afterthought."

Challenge participants successfully manipulated LLMs to reveal sensitive information, particularly when models had access to confidential data via RAG (retrieval-augmented generation) systems or plugins. These attacks extracted internal instructions, customer data, and secrets embedded in system prompts.

LLMs with tool access presented heightened risks, as attackers embedded malicious instructions into innocent-looking inputs, causing systems to execute unauthorized actions like sending emails, modifying files, and accessing restricted functions.

Attackers also demonstrated multiple methods to bypass content safety protections, including embedded malicious prompts in external data sources and encoding harmful instructions to evade detection, resulting in generation of otherwise restricted content.

Friedrichs adds, "The industry is not paying enough attention to this risk and is underestimating its impact in many cases, playing a dangerous wait-and-see game. The rate of change and adoption in AI is astounding -- moving faster than any technology transformation in the past few decades. With organizations rapidly deploying new AI capabilities and increasing their dependence on these systems for critical operations, the security gap is widening daily. The time to get ahead of these concerns is now."

You can get the full report from the Pangea site.

Image credit: Tero Vesalainen/Dreamstime.com

No Comments

Comments are closed.

GenAI vulnerable to prompt injection attacks

Recent Headlines

Cloud accounts come under attack as identity threats rise

Microsoft to disable features in outdated Office apps

Spotify is raising its prices yet again

75 percent of cybersecurity leaders don’t trust their own data

Attackers exploit old vulnerabilities as zero-day exploits surge

Hackers weaponize GenAI to boost cyberattacks

Microsoft Recall is bad at filtering sensitive information

Most Commented Stories

Windows 11 25H2 has a new option to remove all unwanted Microsoft apps

This new Windows 11 clone is actually Linux and runs faster on your old PC -- get it now

Half of Americans think AI is a threat, the other half don't. Who's right?

This ergonomic AI mechanical keyboard is built for modern productivity

UpDownTool lets you move from Windows 11 to Windows 10 in just 5 clicks -- without losing any data

Saying no to Windows 11 just got easier -- Operese automatically transfers your Windows 10 files and settings to Linux

Google makes cheaper YouTube Premium Lite available more widely

IObit Software Updater 8 makes app updates faster and safer -- download it now