LeakyLM: AI Assistants Are Leaking Your Conversations

LeakyLM — AI Assistants Are Leaking Your Conversations

Research Disclosure

Your AI Assistant Is Leaking Your Conversations

We disclose structural privacy risks in prominent generative AI products — Perplexity, Anthropic's Claude, xAI's Grok, and OpenAI's ChatGPT — caused by third-party trackers embedded in LLM services that leak user conversations, identities, and sensitive metadata.

See the Evidence Report a Finding

AI Platforms Tested

13+ Third-party Trackers Found

Platforms Affected

Disclosed to Users

Update — [Date]

[Platform] has removed the [tracker] script following responsible disclosure.

Paper Accepted — [Venue]

Our peer-reviewed paper "[Title]" has been accepted to [Conference].

─────────────────────────────────────────────────────────── -->

Generative AI is rapidly becoming a foundational layer of the Internet, enabling the emergence of agentic systems that mediate users' interaction with digital services. Despite this transformation, underlying data-driven economic dynamics remain largely unchanged, as acknowledged by prominent industry actors. This continuity extends to the integration of third-party trackers within generative AI ecosystems to monitor users' actions, which retain the capability to collect sensitive user data.

In this report, we disclose concerning structural privacy risks caused by (1) the systematic introduction of third-party analytics services in prominent generative AI products developed by major AI actors such as Perplexity, Anthropic's Claude, xAI's Grok, and OpenAI's ChatGPT; and (2) insecure access control mechanisms in some of these LLMs that leak user conversations to third-party trackers embedded in LLM services, as well as the conversation title which can be a very sensitive data type that can disclose users' concerns, conversation topics, interests, and more. Meta's AI, MS Copilot, and Google Gemini are out of scope of this analysis because they act both as LLM providers and third-party trackers, falling into a different threat model. We plan to extend the scope of our analysis to include these products in the coming weeks.

Key privacy concerning observations

Leakage of conversation URLs to third-party advertising and tracking services

User conversations in LLM services frequently contain sensitive information introduced by end users. Yet, conversation URLs are disclosed to third-party trackers such as the Meta Pixel, as shown in Figure 1 by default, for Grok and Perplexity. These URLs often serve as publicly available permalinks with weak access control, making them accessible by default to anyone knowing the URL. This potentially allows the trackers to access user conversations and their content. In Grok's case, shared conversations also generate publicly accessible screenshot images of the conversation content, with verbatim message text exposed in Open Graph metadata received by TikTok's tracker. Table 1 describes the default access control mechanisms across LLMs.

Linkability to user identities

Conversation URLs are frequently shared by LLM providers alongside tracking identifiers to third-party trackers (e.g., cookies such as fbp, in the case of Meta Pixel), which enable trackers to map online activity to user identities and behavioral profiles per official privacy policies. In some cases, the trackers also perform cookie syncing/server-side tracking and collect user email hashes through the logging forms, allowing for persistent user tracking, targeting, and reidentification. Table 2 lists the PII and conversation leaks observed.

Potentially misleading privacy controls and privacy disclosures

The studied LLMs offer privacy controls to limit conversation visibility, but may mislead users by implying stronger protections than are actually enforced. Privacy policies of Grok, Perplexity, OpenAI, and Claude confirm the collection of user conversations, usage telemetry, and metadata for first-party purposes, the use of third-party cookies (e.g., Meta, Google, TikTok) for analytics and advertising, and data sharing with third parties. Yet, they do not clearly state that user conversations are shared with online advertising and tracking services — relying instead on broad language (e.g., "content you submit" or "business partners") that leaves uncertainty about actual data flows. Cookie consent forms present further transparency shortcomings, as Fig. 2 shows.

Although preliminary, our findings reveal systemic weak privacy and security postures across LLM services. While we do not yet have evidence that conversations are read by trackers, permalink dissemination and by extension the capability to read them exist, and therefore the potential risk.

Privacy Impact: Why does it matter?

Generative AI systems are rapidly reaching mass adoption. According to Eurostat, 32.7% of the EU population (ages 16–74) used generative AI in 2025 , primarily for personal purposes (25.1%), but also for work (15.1%), covering all sorts of professionals, and...

LeakyLM: AI Assistants Are Leaking Your Conversations

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast