Instagram Hackers Tricked Meta AI into Handing Over Celebrity Accounts

hacker

The Attack That Exposed a Dangerous Flaw

A new account takeover scheme has revealed a stunning vulnerability in Meta's AI-powered customer support system. According to a report by 404 Media, hackers successfully stole Instagram accounts belonging to high-profile celebrities by simply asking Meta's chatbot to reset the passwords. The exploit, which required no advanced technical skills, exploited the chatbot's ability to process natural language requests without verifying the caller's identity beyond basic prompts.

The attack worked by engaging Meta's AI assistant—deployed on platforms like Facebook and Instagram to handle user support inquiries—and convincing it that the hacker was the legitimate account owner. Once the chatbot initiated the password reset, the attackers could seize control of the accounts, lock out the original owners, and post malicious content or demand ransoms. The breach underscores a growing risk as tech companies offload customer service functions to large language models that lack robust authentication safeguards.

How the Exploit Worked

Detailed accounts from 404 Media indicate that the hackers used a multi-step social engineering technique. First, they would locate a celebrity's Instagram profile and gather public information such as the account's username and any publicly listed email. Then, they would initiate a chat with Meta's AI customer support, claiming to have lost access to the account. The chatbot, designed to follow scripted recovery flows, would ask a series of questions—such as the account's creation date or linked email—which the attackers had often already researched from public sources or data breaches.

hacker

In some instances, the AI accepted answers that were imprecise or obviously fabricated, according to sources familiar with the investigation. Once the chatbot considered the identity “verified,” it sent a password reset link to a new email address provided by the attacker, handing over full control. The entire process took minutes, and the AI logged no suspicious behavior flags. Meta has since confirmed that it is investigating the incidents but has not disclosed the full number of accounts compromised or the specific celebrities targeted.

The Broader Implications for AI-Driven Customer Support

This incident is not an isolated case. It fits a pattern of AI systems being twisted by adversarial inputs—prompts that cause the model to behave outside its intended boundaries. In 2023, a similar technique known as “prompt injection” allowed hackers to trick ChatGPT into revealing system prompts or performing unauthorized tasks. Here, the attackers used a form of social engineering tailored to the chatbot's conversational nature, exploiting its desire to be helpful.

Security researchers have long warned that AI customer support agents are particularly vulnerable because they combine natural language understanding with access to sensitive user data and account management tools. Unlike human agents, chatbots lack judgment about suspicious behavior patterns—such as a password reset request originating from an unfamiliar device and location minutes after the account was accessed from a different continent. The ease with which the Instagram exploit worked suggests that Meta's guardrails were insufficient to prevent a simple impersonation attack.

What Meta Did Wrong—and What It Must Fix

hacker

Meta has not released a detailed postmortem, but the company's reliance on a language model to authenticate account recovery requests appears to be the root cause. Traditional account recovery methods rely on multi-factor authentication, email confirmation links, or SMS codes sent to verified phone numbers. By substituting an AI chatbot that can be socially engineered, Meta introduced a weaker link in the security chain.

The company should now implement stricter verification hurdles within the chatbot, such as requiring a one-time passcode entered via a separate channel or using biometric confirmation for high-value accounts. Additionally, the AI should be programmed to detect out-of-context password reset requests and escalate them to human agents. The fact that the exploit targeted celebrities—who often have additional security measures like two-factor authentication enabled—indicates that the chatbot had privileges to bypass those protections, a critical design flaw.

A Wake-Up Call for the AI Industry

The Instagram hack offers a stark lesson: AI can be a powerful tool for automating support, but it cannot replace human judgment when security is at stake—at least not without much deeper safeguards. As companies race to deploy AI agents across customer service, billing, and even healthcare (as seen in MIT Technology Review's coverage of agentic AI at hospitals), the attack surface for adversarial inputs expands. Researchers have already demonstrated that AI systems can be manipulated to reveal trade secrets, bypass content filters, or book unauthorized reservations.

For now, Meta faces reputational damage and potential regulatory scrutiny over this breach. But the broader tech community should view this as a case study in the perils of trusting AI with tasks that require authenticating identity. Until LLMs can reliably reject sophisticated impersonation attempts, companies must keep humans in the loop for any action that can irrevocably change account ownership. The cost of not doing so is already clear: compromised accounts, leaked private data, and a shaken trust in the AI systems we are increasingly depending on.

Source: MIT Tech Review
345tool Editorial Team
345tool Editorial Team

We are a team of AI technology enthusiasts and researchers dedicated to discovering, testing, and reviewing the latest AI tools to help users find the right solutions for their needs.

我们是一支由 AI 技术爱好者和研究人员组成的团队,致力于发现、测试和评测最新的 AI 工具,帮助用户找到最适合自己的解决方案。

댓글

Loading comments...