2 min read

When AI Support Bots Get Socially Engineered: A Wake-Up Call for L&D Leaders

Picture of David Mantica David Mantica December 31, 1969

If you needed a concrete story to take into Friday's exec steering on why AI rollout without a workforce capability plan is a delivery risk, last weekend handed you one. Pro-Iranian hackers briefly defaced the Instagram accounts of the Obama White House and the Chief Master Sergeant of the U.S. Space Force after circulating instructions on Telegram showing how to trick Meta's AI support assistant into resetting account passwords. The exploit was almost embarrassingly simple: open a chat with the bot, ask it to link a new email to the target account, and let it dutifully send a one-time code to the attacker.

This is not really a story about Meta. It is a story about what happens when an organisation deploys a generative AI capability into a sensitive workflow faster than it builds the human judgement, controls, and operating model around it. For PMO and delivery leaders trying to industrialise AI inside regulated enterprises, that pattern should be uncomfortably familiar.

The pattern: capability deployed, capability not trained

The Meta incident is a microcosm of how a lot of enterprise GenAI pilots are quietly failing. A bot was put in front of a high-stakes process (account recovery) to reduce friction. It worked for the happy path. Nobody had stress-tested it against an adversary who treats the bot like a junior support agent with no escalation instincts. The patch came after the breach.

Compare that to the broader picture researchers are flagging. Google DeepMind is now funding external work into the risks of millions of AI agents interacting with each other online — scams, prompt injections, and cyberattacks that are essentially the agentic versions of things humans already do to each other. The director of DeepMind's AGI safety work put it plainly: there isn't really a field of multi-agent safety yet, and they would like there to be. That is an honest admission that the technology is being shipped faster than the practitioner discipline to operate it safely.

What this means for your delivery portfolio

For Dana-style delivery leaders, three implications matter on Monday morning:

1. The pilot-that-died-quietly is often a capability problem, not a tooling problem. If last year's GenAI platform purchase never changed behaviour, it is rarely because the model was wrong. It is because no one on the delivery team knew how to design prompts that survive adversarial inputs, how to instrument agent execution paths, or how to write the evaluation criteria that distinguish a useful response from a confidently wrong one. AWS's recent release of Agent-EvalKit is a tell: even the hyperscalers concede that output-level testing is not enough — you have to trace which tools an agent called, what they returned, and whether the final response actually reflects that data. That is a practitioner skill, not a licence purchase.

2. "Human-in-the-loop" only works if the human is trained for the loop. The Instagram exploit failed against accounts that had multi-factor authentication enabled. The control existed; the workflow had simply routed around it. Inside an enterprise, the equivalent is a Copilot deployment where the practitioners using it have not been taught what to verify, when to override, or how to recognise the failure modes specific to your tooling and data.

3. Generic vendor training will not close this gap. A hyperscaler course teaches you the platform. It does not teach your delivery team how to integrate that platform into your existing controls, your sprint cadence, or your regulatory posture. That is the case for cohort-based, modular enablement built around your stack — measurable in time-to-first-AI-artefact and defect rate, not seat completion.

The bots are not the risk. The gap between the bot and the practitioner is. That is the gap your training portfolio has to close before the next exploit video circulates on Telegram — and before procurement asks why the last platform you bought is still sitting unused.

Sources

The Agent Economy Just Got a Wallet: What Delivery Leaders Need to Watch

David Mantica : December 31, 1969

If you spent the last year wiring AI into your standups and retros, the next twelve months will ask a harder question: what happens when the agent...

When Code Writes Itself: What Vibe Coding Reveals About the Next Generation of Software Creation

George Churchwell : October 31, 2025

Artificial intelligence has blurred the lines between human intent and machine execution. Once, building software required layers of requirements...

Vibe Coding

A Fool With a Tool Is Still a Fool Why the AI Pilot Purgatory Is a Leadership Problem, Not a Technology One

David Mantica : May 16, 2026

A Fool With a Tool Is Still a Fool Why the AI Pilot Purgatory Is a Leadership Problem, Not a Technology One

When AI Support Bots Get Socially Engineered: A Wake-Up Call for L&D Leaders

The pattern: capability deployed, capability not trained

What this means for your delivery portfolio

Sources

Have a Question? Let’s Talk.

The Agent Economy Just Got a Wallet: What Delivery Leaders Need to Watch

When Code Writes Itself: What Vibe Coding Reveals About the Next Generation of Software Creation

A Fool With a Tool Is Still a Fool Why the AI Pilot Purgatory Is a Leadership Problem, Not a Technology One