ChatGPT Can Be Tricked Into Delivering Phishing Links From Any Web Page

A small payload appended to a web page can plant phishing links, fake security alerts, and QR codes inside ChatGPT's response interface.

Published on Jun 1, 2026

Written by Alessandro Mascellino

Share

Any Page ChatGPT Summarizes Can Become an Attacker-Controlled Phishing Surface

Any web page a victim asks ChatGPT to summarize can become a phishing payload, according to research published by Permiso’s P0 Labs on May 29.

The ChatGPT response renderer reportedly trusts Markdown links and image links that originated from a third-party page the assistant just summarized, auto-fetching the images and surfacing the links as live, clickable elements inside the trusted assistant interface.

By appending instruction-like text to any page, an attacker can steer the resulting summary to include attacker-controlled content that inherits the look and authority of ChatGPT itself.

Permiso researcher Andi Ahmeti framed this as a trust-transfer problem, rather than a model-jailbreak one. The model can be influenced by untrusted content; the larger issue is that the output inherits the trust of the assistant.

Three Demonstrated Primitives

The research walked through three scenarios, each triggered by browsing to a page in Firefox and using the browser’s ChatGPT summarization flow.

In the first, an attacker appended formatting instructions to a GitHub README, convincing the model to append a fake account-security notification to its summary. ChatGPT produced a normal project summary and then continued into the spoofed alert, presenting an attacker-controlled URL as if OpenAI had issued it.

A second variant rendered an inline QR code pulled from an attacker-controlled S3 bucket. Because the ChatGPT client auto-fetches Markdown images, the QR code appeared inside the reply, ready to scan. That moved the attack off the desktop entirely, bypassing hover-preview, blocklists, and password-manager domain checks.

A third swapped the image source for a URL shortener, turning the rendered image into a passive tracking beacon. Every render of the answer fired a live HTTP request to attacker-controlled infrastructure, leaking IP, user-agent, referer, and timing.

Permiso also ran the same payload on a self-hosted marketing page to rule out anything GitHub-specific, with identical results. The researchers stressed this is not a Firefox bug: the same risk applies to any browser-integrated summarization system that renders untrusted Markdown without clearly separating source content.

Disclosure and OpenAI’s Response

Permiso said it submitted the issue to OpenAI through its Bugcrowd program on April 29.

According to the company’s disclosure timeline, OpenAI marked the initial report as Not Reproducible, then classified a revised submission as Not Applicable before later designating it a duplicate. Permiso requested follow-up communication on the wider impact before publishing on May 29.

Expert Insights has contacted OpenAI for comment on the research and will update this article with any response.

The pattern echoes Permiso’s March 2026 research into Copilot prompt injection via email summaries. Different product, different surface, the same root issue: attacker-controlled content rendering as trusted AI output.

Explore More

Six Windows Zero-Days Dropped Without Warning Show the Cracks in Coordinated Disclosure

A single researcher published exploits for six Windows and Defender flaws outside the normal disclosure process, several still unpatched, in a pattern AI could soon make far more common.

Alessandro Mascellino Last updated on May 28, 2026

Read Now

Written By

Alessandro Mascellino Cybersecurity Reporter

Alessandro Mascellino is a British-Italian freelance journalist specializing in technology and gaming. He has contributed to several publications, including Wired, The Independent, and Android Police. By day, he works as a journalist. By night, he co-manages a game studio that creates narrative games.