Opsec Cyber Data Exfiltration Using Indirect Prompt Injection

Data Exfiltration Using Indirect Prompt Injection

December 22 2023 By Bruce Schneier in Security, vulnerabilities

Interesting attack on a LLM:

In Writer, users can enter a ChatGPT-like session to edit or create their documents. In this chat session, the LLM can retrieve information from sources on the web to assist users in creation of their documents. We show that attackers can prepare websites that, when a user adds them as a source, manipulate the LLM into sending private information to the attacker or perform other malicious activities.

The data theft can include documents the user has uploaded, their chat history or potentially specific private information the chat model can convince the user to divulge at the attacker’s behest.

Data Exfiltration Using Indirect Prompt Injection

Data Exfiltration Using Indirect Prompt Injection

Recent Posts

Friday Squid Blogging: New Squid Species Discovered

AIs Are Getting Better at Finding and Exploiting Security Vulnerabilities

The Constitutionality of Geofence Warrants

Ireland Proposes Giving Police New Digital Surveillance Powers

Friday Squid Blogging: Giant Squid in the Star Trek Universe

Recent Comments

Archives

Categories

Meta

Contact Us

[email protected]

Singapore CBD

+65 8714 2780