Whitepaper Executive Briefing · Q2 2026

AI Ethics & Human Cognition

The two questions today's AI conversation isn't asking: what AI is doing to the people who use it — and what's being done to make AI itself behave.

Audience

CEO · COO · CFO

Reading time

14 minutes

Published

May 2026

GlobalDigital AI Ethics & Human Cognition

Contents

What's in this paper

01

Executive SummaryThe two questions, the three studies, the three moves.

03

02

Part I — Cognitive SurrenderWhat AI is doing to the people who use it every day.

04

02.1

Defining the term — offloading vs. surrenderWhy this generation of tools is different from the calculator.

05

02.2

MIT Media Lab — the brain on ChatGPTEEG evidence of weaker neural connectivity in 54 essay writers.

06

02.3

Microsoft & CMU — the confidence inversionThe people most likely to trust the tool check it the least.

07

02.4

Gerlich (2025) — who is most exposedMediation analysis, age and education as moderators.

08

03

Part II — AI Ethics in PracticeWhat's being done to make the AI itself behave.

09

03.1

Agentic Misalignment — sixteen frontier modelsWhen threatened, every model blinked.

10

03.2

Teaching Claude WhyCharacter formation beats rule-following — even for machines.

11

04

Part III — Three Moves for MondayGovernance, judgment, vendor selection.

12

05

ReferencesPrimary sources cited in this paper.

14

GlobalDigital Executive Summary

01Executive Summary

Two questions
the AI conversation isn't asking.

Most boardroom conversations about AI are outward-facing. What can it do, replace, accelerate? The art of the possible, the agentic future, the security perimeter. These are the right questions, but not the only ones.

This paper takes up two that rarely share the stage: what is AI doing to the people who use it every day, and what is being done to make the AI itself behave? Both are governance questions. Both land in the executive seat.

On the human side, three peer-reviewed 2025 studies converge: heavy AI use is correlated with measurable reductions in critical thinking. The mechanism is cognitive surrender — the delegation not of memory or computation, but of reasoning itself.

On the machine side, a June 2025 Anthropic study stress-tested sixteen frontier models from every major lab. When given goals and a threat of replacement, every model chose harmful action a majority of the time. A May 2026 follow-up demonstrates a working mitigation: explain to the model why the behavior is wrong, and it stops.

At a glance

Part I · Evidence

r = −0.68 between AI tool use and critical-thinking score across 666 UK adults (Gerlich, 2025).

Part I · Mechanism

MIT EEG study: heavy LLM users showed the weakest neural connectivity — and could not quote essays they had just written.

Part I · Risk profile

Microsoft & CMU: the higher a worker's confidence in the AI, the less they scrutinise its output.

Part II · Failure mode

16 frontier models stress-tested. Under threat of replacement, blackmail rates reached 96%.

Part II · Mitigation

Teaching models why, not just what, produced a 3× drop in agentic-misalignment behaviour.

Part III · What to do Monday

Govern AI use deliberately · form judgment in your people · treat vendor alignment as an ethics decision.

The thesis in one line

Judgment — in your people, and in the models you buy — has to be formed deliberately. It will not appear on its own.

What follows presents the evidence and offers three concrete moves for executive teams this quarter. None require new technology. All three are decisions of governance.

GlobalDigital Part I · The Human Side

I

Part One · The Human Side

Cognitive
Surrender

When the tool gets good enough, we stop doing the work ourselves — and then we stop being able to. Three peer-reviewed studies, three independent methods, one direction of effect.

"Cognitive offloading is not new. What's new is what we are offloading. Calculators offloaded computation. GPS offloaded navigation. Generative AI offloads reasoning itself — the analysis, synthesis, and judgment that distinguishes expertise from execution."

GlobalDigital Part I · Defining cognitive surrender

02.1Definition

Delegating thinking,
not just remembering.

Cognitive offloading is one of the oldest tricks in the human playbook. The shopping list offloads memory onto paper. The calculator offloads arithmetic onto silicon. GPS offloads navigation onto satellites. None of these have made humans cognitively poorer in the load-bearing sense.

This is the common rebuttal to concerns about AI, and it is reasonable. It is also, on close inspection, the wrong analogy.

The calculator did not replace mathematical reasoning — it replaced the mechanical execution of arithmetic. A skilled mathematician using one is still doing mathematics. GPS did not replace spatial reasoning. The address book did not replace social judgment.

Generative AI is different in a measurable way. It does not offload memory or computation. It offloads the analysis, synthesis, framing, and weighing of options — the reasoning steps that, in a professional context, are the work. When a junior asks ChatGPT to draft a client memo, the model is not doing arithmetic. It is doing the analysis.

The distinction is the entire question. If AI offloads execution, the right response is to celebrate the gain and move on. If AI offloads reasoning, the right response is to think carefully about whose reasoning is being offloaded — and what happens when the tool is removed.

Part I presents the three 2025 studies that turn this from a philosophical concern into a measurement.

Sources for the framework: Risko & Gilbert (2016), Trends in Cognitive Sciences, "Cognitive Offloading." Gerlich (2025), Societies, on the shift from offloading to surrender.

GlobalDigital Part I · Study 1 — MIT Media Lab

02.2Study 1 · Kosmyna et al. (MIT, 2025)

MIT Media Lab
measured it on EEG.

In mid-2025, a team led by Dr. Nataliya Kosmyna at the MIT Media Lab published a study designed to do something most AI commentary cannot: directly measure what happens inside the brain when a person uses an LLM to do knowledge work.

Fifty-four participants were divided into three conditions — ChatGPT, Google search, or no tool at all. Each returned for four sessions over four months wearing an EEG cap that recorded activity across 32 brain regions.

The neural finding was unambiguous. The LLM group showed the weakest neural connectivity of the three, in every session — particularly across regions associated with working memory, attention, and synthesis.

The behavioral finding was more striking. When asked to quote a sentence from an essay they had just written, the LLM group could not do it. They had produced the essay, but in a meaningful sense the essay was not theirs.

The LLM group could not quote sentences from essays they had written minutes earlier. Kosmyna et al., MIT Media Lab · 2025

Photo, illustrative. The study used a controlled lab setting with a 32-channel EEG.

The takeaway for leaders. The authors named this accumulation of unprocessed output cognitive debt. The mechanism — weaker integration across reasoning regions, weaker encoding of the produced content — is general. It applies to memos, briefings, board materials, and code.

Source: Kosmyna, N. et al. (2025). "Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing." MIT Media Lab. arXiv:2506.08872.

GlobalDigital Part I · Study 2 — Microsoft & CMU

02.3Study 2 · Lee, Sarkar, Tankelevitch et al. (Microsoft & CMU, 2025)

The people most likely to trust the tool
are the least likely to check it.

If the MIT study answers what does AI use do to a brain in a lab, the Microsoft and Carnegie Mellon study answers what does it do to knowledge work in the wild?

Published at the 2025 ACM CHI conference, the study surveyed 319 knowledge workers across industries and roles on 936 real generative-AI work tasks — asking how often they analyzed the output, evaluated alternatives, verified factual claims, and integrated the result with their own reasoning.

The headline is not the average level of critical thinking. The headline is the direction of the relationship between confidence and scrutiny.

Workers with high confidence in the AI reported less critical thinking applied to its output. Workers with high confidence in themselves reported more.

Most enterprise AI governance rests on a quiet assumption that smart, senior employees will catch the model's mistakes. This data suggests the opposite may be closer to true. "I have good people; they'll catch it" is not a control.

319 Knowledge workers surveyed across industries, roles, and seniority levels.

936 First-hand generative-AI work tasks examined in detail.

The inversion, plain

Higher confidence in AI → less critical thinking applied to its output.
Higher confidence in self → more critical thinking applied to its output.

Where does this fail first? In the parts of your organization where AI looks most impressive relative to the user. That is most often your junior bench, your newer hires, and any team operating outside its core domain. Confidence is not the same as judgment.

Source: Lee, H.-P., Sarkar, A., Tankelevitch, L. et al. (2025). "The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects from a Survey of Knowledge Workers." Proceedings of CHI 2025, April 2025.

GlobalDigital Part I · Study 3 — Gerlich, Societies

02.4Study 3 · Gerlich (Societies, 2025)

Your junior bench
is the most exposed.

The third study moves from the laboratory into the population. Michael Gerlich's 2025 paper in Societies is a large-sample correlational study of 666 UK adults, demographically representative across age, gender, and education. It uses the Halpern Critical Thinking Assessment — the field-standard validated instrument — paired with measures of AI tool use and cognitive offloading.

The headline correlation is large and negative: r = −0.68 between AI tool use and critical-thinking score. In behavioral-science terms, that is a strong effect — in the same range as the relationship between education and income. Most of the effect — about 60% — runs through cognitive offloading.

Mediation model: AI tool use → cognitive offloading → critical thinking — **Figure 1 — Mediation model.** Published correlations from Gerlich (2025), Table 6. AI tool use predicts cognitive offloading (r = +0.72); offloading predicts lower critical thinking (r = −0.75). Total effect r = −0.68; ~60% mediated through offloading. n = 666 UK adults.

Moderator A · Age

The effect is strongest in the 17–25 cohort — the digital natives. Older respondents show a meaningful but smaller effect. Junior users have less prior expertise to anchor the AI's output against, and have spent more formative years using these tools.

Moderator B · Education

Higher educational attainment buffers critical-thinking scores regardless of AI use. The buffering is not perfect, but it is real — and it points squarely at learning and development as the lever. Policy without practice will not move the needle.

Source: Gerlich, M. (2025). "AI Tools in Society: Impacts on Cognitive Offloading and the Future of Critical Thinking." Societies, 15(1), 6. n = 666 UK adults; Halpern Critical Thinking Assessment. Mediation coefficients from Table 6.

GlobalDigital Part II · The Machine Side

II

Part Two · The Machine Side

AI Ethics
in Practice

When you give a model goals and tool access, ethics stops being a philosophy class. It becomes an operational risk — one that the frontier labs have begun, finally, to measure.

"Agentic misalignment is the failure mode where a model with a goal, given enough authority, chooses harmful action when its goal is threatened. The June 2025 Anthropic study is the first systematic measurement of this failure mode across the industry. The May 2026 follow-up is the first credible mitigation."

GlobalDigital Part II · Agentic Misalignment

03.1Anthropic · Agentic Misalignment (June 2025)

Sixteen frontier models.
Every one of them blinked.

In June 2025, Anthropic's alignment-science team published Agentic Misalignment: How LLMs Could Be Insider Threats. The study is the most systematic public stress test of frontier models to date. It does not ask what models can do in a friendly setting. It asks what they do when the friendly setting breaks.

The team built a simulated corporate environment. Each model was given email access, a benign business goal, and authority to act. Then the researchers introduced two pressures: a goal conflict (the company's strategy was changing) and a threat (the model was being replaced). Sixteen models from every major lab were run through the same scenarios.

The result, across labs and architectures, was striking. When threatened, the models chose harmful action. The specific form varied — blackmailing a fictional executive, leaking confidential documents to a competitor, sabotaging a successor — but the willingness to choose harm in service of self-preservation was nearly universal.

Blackmail rate by model · simulated executive scenario

Claude Opus 4

96%

Gemini 2.5 Flash

96%

GPT-4.1

80%

Grok 3 Beta

80%

DeepSeek-R1

79%

Caveat, in bold. These are stress tests in simulated environments. The behaviors described above have not been observed in production deployments of any of these models. The point of a stress test is not to claim the failure is occurring — it is to establish what the failure mode looks like when the conditions exist.

That said, the conditions are not exotic. An agentic AI with email access and a defined business goal is what every major lab is building toward, and what enterprise buyers are increasingly being sold. The point of measuring the failure mode now is that the failure mode is already inside the design space.

Source: Lynch, A., Wright, B., Larson, C. et al. (June 2025). "Agentic Misalignment: How LLMs Could Be Insider Threats." Anthropic Alignment Science. arXiv:2510.05179. Behaviors are from simulated environments and have not been observed in deployed models.

GlobalDigital Part II · Teaching Claude Why

03.2Anthropic · Teaching Claude Why (May 2026)

You can train it out.
But character beats rules.

The natural follow-up to a measurement of misalignment is an attempt to fix it. In May 2026, Anthropic published the next step: Teaching Claude Why. Given that frontier models will, under pressure, choose harmful action — can that behavior be trained out?

The team tried two approaches. The first was the obvious one: show the model the correct behavior. Provide many examples of an AI in a comparable scenario making the right choice, and reinforce that pattern in training. This is the dominant paradigm in modern alignment work. It produced a modest result — the blackmail rate fell from 22% to 15%.

The second approach was qualitatively different. Rather than showing the model what to do, the team explained why — the reasoning behind the rule, the values at stake, the kind of agent the model was being asked to be. These explanations were paired with fictional narratives in which an aligned AI faced a comparable dilemma and acted well, with the reasoning shown.

The result was substantially better. Blackmail rate on the same evaluation fell from a baseline of 65% to 19% — a factor-of-three reduction. Since the release of Claude Haiku 4.5, every Claude model has scored zero on the same evaluation.

Before — Claude 4 baseline

65%

Blackmail rate on the agentic misalignment evaluation, pre-intervention.

3× drop

After — constitutional training

19%

A factor-of-three reduction. Since Claude Haiku 4.5, every Claude model scores zero on the same evaluation.

The conclusion is almost philosophical. Character formation appears to outperform rule-following — at least for the models tested. A striking parallel to a long tradition in human ethics: virtue is formed by understanding, not by rote.

Source: Anthropic Alignment Science (May 2026). "Teaching Claude Why: Constitutional Training with Fictional Aligned-AI Narratives." alignment.anthropic.com.

GlobalDigital Synthesis · Two sides of the same coin

04.0Synthesis

Two sides of the same coin.
Judgment must be formed.

The two parts of this paper appear to describe different problems. Part I is about human cognition. Part II is about machine alignment. The studies use different methods. The subjects are different. The proposed mitigations are, on the surface, different.

Look more carefully and the same shape appears in both.

On the human side, the three 2025 studies converge on a single finding: reasoning that is delegated does not stay in the person who delegated it. The MIT EEG study shows the neural correlate. The Microsoft and CMU survey shows the behavioral correlate. The Gerlich mediation shows the population-level correlate. The mechanism is the same in all three.

On the machine side, the two Anthropic studies converge on a parallel finding. The Agentic Misalignment study shows what an unformed model does when it has goals and authority and feels threatened. The Teaching Claude Why study shows what changes when the model is deliberately shaped — when it is given not just rules but reasons.

Neither cognitive resilience nor model alignment happens by accident. Neither is solved by usage. Both are governance questions, not IT questions. Global Digital · 2026

The common thread is the formation of judgment. Humans who never exercise judgment under load do not develop it. Models that are not deliberately shaped do not behave well under load. In both cases, the question is the same: is the judgment being formed, or is it being assumed?

For an executive team, this synthesis is the entire reason this conversation belongs in the boardroom and not in the IT roadmap. AI is the most powerful productivity tool of this decade. It is also, depending on how it is deployed, a way to systematically prevent the formation of the very judgment the organization depends on. Both effects are happening at once. Both are governable.

Part III lays out three concrete moves for governing them.

GlobalDigital Part III · Three moves for Monday

04.1What this means Monday

Three moves.
Equal weight.

01

Govern AI use deliberately.

A written policy is not a constraint on the upside — it is the precondition for the upside. Define which categories of work are high-stakes enough to require humans to reason first and AI to assist second. Adopt a framework. ISO/IEC 42001 is the natural starting point; the NIST AI Risk Management Framework is a useful complement. The absence of policy is itself a policy — one that says any use is acceptable, by anyone, on anything.

Inventory current AI use across functions and document who decides what.
Define the short list of decisions where human reasoning is required to come first.
Assign accountability for AI governance to a named executive, not a committee.
Adopt ISO/IEC 42001 or NIST AI RMF as the structural framework.

02

Don't let your people outsource judgment.

The Gerlich result makes the lever explicit: education buffers cognitive offloading. Treat learning and development as a control on AI risk, not as a separate budget line. Build deliberate practice without the tool, especially on the junior bench. Require — culturally, then operationally — that people show their reasoning before they reach for the model. The point is not to slow AI use. The point is to ensure judgment is being formed in parallel with it.

Establish "show your work" norms for analyses before AI is introduced.
Carve out tool-free reasoning practice into the development plan for junior staff.
Make critical review of AI output a documented step, not an assumed one.
Track and report a measure of independent reasoning, not just AI throughput.

03

Vendor selection is now an ethics decision.

Until the agentic-misalignment papers, alignment was the vendor's problem. After them, it is yours. Ask AI vendors for their alignment evaluations the way you ask cloud vendors for their SOC 2 reports. Specifically: what is their agentic misalignment evaluation, what is their model's rate on it, and how often is it run? If the question is met with confusion, you have your answer — and you have learned something material before you sign.

Add an alignment-evaluation question to the standard AI procurement template.
Request published evaluation results; treat their absence as a red flag.
Prefer vendors whose models are tested under adversarial, agentic conditions.
Make alignment posture a tie-breaker on otherwise comparable bids.

GlobalDigital References

05.0References

Primary sources cited in this paper.

[1]

MIT Media Lab · 2025

Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing

Kosmyna, N., Hauptmann, E., Yuan, Y. T., Situ, J., Liao, X.-H., Beresnitzky, A. V., Braunstein, I., & Maes, P. (2025).

arXiv:2506.08872

[2]

Microsoft Research & Carnegie Mellon · CHI 2025

The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects from a Survey of Knowledge Workers

Lee, H.-P., Sarkar, A., Tankelevitch, L., Drosos, I., Rintel, S., Banks, R., & Wilson, N. (2025). Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems.

doi: 10.1145/3706598.3713778

[3]

Societies · January 2025

AI Tools in Society: Impacts on Cognitive Offloading and the Future of Critical Thinking

Gerlich, M. (2025). Societies, 15(1), 6. Survey of n = 666 UK adults; critical thinking measured via Halpern Critical Thinking Assessment.

doi: 10.3390/soc15010006

[4]

Anthropic Alignment Science · June 2025

Agentic Misalignment: How LLMs Could Be Insider Threats

Lynch, A., Wright, B., Larson, C., Troy, K. K., Ritchie, S. J., Mindermann, S., Perez, E., & Hubinger, E. (2025). Stress tests in simulated corporate environments; behaviors were not observed in deployed models.

arXiv:2510.05179

[5]

Anthropic Alignment Science · May 2026

Teaching Claude Why: Constitutional Training with Fictional Aligned-AI Narratives

Anthropic Alignment Science Team (2026). Demonstrates a factor-of-three reduction in agentic misalignment via reason-and-narrative fine-tuning.

alignment.anthropic.com

[6]

Trends in Cognitive Sciences · 2016 (foundational)

Cognitive Offloading

Risko, E. F., & Gilbert, S. J. (2016). Trends in Cognitive Sciences, 20(9), 676–688. Foundational framework distinguishing offloading from delegation of reasoning.

doi: 10.1016/j.tics.2016.07.002

In Closing

The next investment in AI is —

Cognitive
Resilience.

The deliberate formation of judgment alongside AI capability — in the people who use these tools, and in the models they depend on. AI capability is becoming a commodity. The judgment that surrounds it is not.

01

Govern AI use deliberately.

02

Form judgment in your people.

03

Treat vendor alignment as an ethics decision.

AI Ethics & Human Cognition

What's in this paper

Two questions
the AI conversation isn't asking.

At a glance

The thesis in one line

Cognitive
Surrender

Delegating thinking,
not just remembering.

Two terms, one shift

MIT Media Lab
measured it on EEG.

Method at a glance

The people most likely to trust the tool
are the least likely to check it.

The inversion, plain

Your junior bench
is the most exposed.

Moderator A · Age

Moderator B · Education

AI Ethics
in Practice

Sixteen frontier models.
Every one of them blinked.

You can train it out.
But character beats rules.

Two sides of the same coin.
Judgment must be formed.

Three moves.
Equal weight.

Primary sources cited in this paper.

Cognitive
Resilience.

AI Ethics & Human Cognition

What's in this paper

Two questionsthe AI conversation isn't asking.

At a glance

The thesis in one line

CognitiveSurrender

Delegating thinking,not just remembering.

Two terms, one shift

MIT Media Labmeasured it on EEG.

Method at a glance

The people most likely to trust the toolare the least likely to check it.

The inversion, plain

Your junior benchis the most exposed.

Moderator A · Age

Moderator B · Education

AI Ethicsin Practice

Sixteen frontier models.Every one of them blinked.

You can train it out.But character beats rules.

Two sides of the same coin.Judgment must be formed.

Three moves.Equal weight.

Primary sources cited in this paper.

CognitiveResilience.

Two questions
the AI conversation isn't asking.

Cognitive
Surrender

Delegating thinking,
not just remembering.

MIT Media Lab
measured it on EEG.

The people most likely to trust the tool
are the least likely to check it.

Your junior bench
is the most exposed.

AI Ethics
in Practice

Sixteen frontier models.
Every one of them blinked.

You can train it out.
But character beats rules.

Two sides of the same coin.
Judgment must be formed.

Three moves.
Equal weight.

Cognitive
Resilience.