The AI-Daptive Organisation | Connexxo - Agile Coaching & Training

Why Adaptivity Is the Defining Capability of the AI Era

In 2025 and 2026, a substantial body of evidence accumulated on how AI is changing software engineering organisations. The findings do not tell a simple story of transformation and progress. They tell a more complex one: of amplification, of paradoxes, and of structural choices that will compound in importance over the years to come.

The underlying pattern is consistent across the evidence. AI changes the operating environment of software organisations faster than any single restructuring can absorb. The tools, the techniques, the failure modes, and the governance demands all keep moving. Organisations that try to reach a stable new configuration and settle there are out of step with the substrate they are working on. The organisations that compound advantage are those that build the capacity to keep adapting, deliberately and continuously. This is the central thesis of the article, and the reason the AI-daptive organisation is not a metaphor but an operational requirement.

This article synthesises the key findings and their implications for leaders who must make decisions now, in conditions of genuine uncertainty, about how their organisations should adapt. It draws on research from DORA (approximately 5,000 technology professionals), METR, GitHub Octoverse, LinearB, Cortex, the World Economic Forum, a range of peer-reviewed studies from 2025 and 2026, and, where no comparable alternatives are available, industry analyses from consulting companies. The research landscape is moving faster than academic publication cycles, where individual findings rest on limited samples, and that limitation is noted.

The Central Finding: Amplification, Not Transformation

The most consistent finding across the evidence base is not the one most organisations expect.

AI does not uniformly improve software teams. It amplifies them. High-performing organisations (those with strong engineering foundations, clear feedback loops, psychological safety, and disciplined quality practices) see meaningful productivity gains when they introduce AI tools. Organisations that lack these foundations see AI accelerate their existing dysfunction.

The DORA (2025) research, covering approximately 5,000 technology professionals, frames AI as an amplifier of existing capability. The pattern is structurally consistent: AI rewards preparation.

Figure 1. AI does not transform organisations uniformly; it amplifies the direction the organisation is already travelling in.

This has an uncomfortable implication. Many organisations are deploying AI tools before the organisational foundations are in place to benefit from them. The tools arrive first. The investment in what makes the tools work, in training, role design, governance, and the relational substrate of the team, follows, if it follows at all. The ratio of spend is approximately the inverse of what the evidence in this article suggests is needed.

Six Findings That Change the Calculation

Finding 1: The Productivity Paradox. A 2025 randomised controlled trial by METR (2025) gave AI tools to 16 experienced developers working on their own familiar open-source projects. The participants became 19% slower.¹ More striking: even after completing the tasks, the same participants believed they had been 20% faster. The gap between subjective perception and objective measurement was 39 percentage points. This divergence matters for how organisations make AI adoption decisions, most of which rely on anecdotal evidence and self-report.

Figure 2. The METR (2025) trial: perceived versus measured productivity change.

Finding 2: Individual Output Up, Organisational Metrics Flat. Faros AI (2025) telemetry across more than 10,000 developers found individual output metrics increasing while organisational-level delivery metrics remained flat. Developers were touching 47% more pull requests per day. But cycle time, incident rates, and change failure rates were not improving at the same rate. Individual-level gains were not translating into system-level improvement.

Finding 3: The Trust Decline. The 2025 developer survey by Stack Overflow (2025), with 49,000+ respondents, found that favourable sentiment towards AI tools fell from 72% in 2024 to 60% in 2025. Trust in AI accuracy fell from 40% to 29%. 46% of respondents actively distrust AI accuracy. Adoption continued to rise regardless; 84% use or plan to use AI tools. Developers are using tools they increasingly distrust.

Finding 4: The Review Bottleneck. GitHub (2025) reported 43 million pull requests per month, up 23% year-on-year, with commits becoming smaller and more frequent. A LinearB (2026) analysis of 8.1 million pull requests across 4,800 teams found that AI-generated pull requests wait 4.6 times longer for review than human-authored ones, and have an acceptance rate of 32.7% compared to 84.4% for human-authored code. Code generation is outpacing review capacity. The constraint has moved.

Finding 5: Quality Costs Concealed in Speed. Cortex (2025) data spanning Q3 2024 to Q3 2025 found that teams accelerating AI-generated output saw pull requests per author increase 20%, cycle time increase 9%, incidents per pull request increase 23.5%, and change failure rate increase 30%. Speed and quality were at odds rather than reinforcing each other in teams lacking sufficient quality infrastructure.

Finding 6: Cognitive and Comprehension Costs. A 2026 randomised controlled trial by Shen and Tamkin (2026), involving 52 developers, found a 17% decline in comprehension scores when developers worked primarily with AI-generated code rather than writing their own, with the largest decline in debugging.² Storey (2026), in From Technical Debt to Cognitive and Intent Debt: Rethinking Software Health in the Age of AI, identifies two new forms of accumulated liability: cognitive debt (the growing gap between code in the system and human ability to understand it) and intent debt (the loss of the reasoning behind design decisions that AI never captured). Practitioner analyses, building on this work, suggest that AI generates code at 140 to 200 lines per minute, while humans read and comprehend at 20 to 40 lines per minute. The maths of that gap compounds over time.

Taken together, these findings point in one direction. AI does not produce a stable new operating mode that organisations can settle into. It produces continuous change at a pace faster than most organisational structures can absorb, and it pulls quality, comprehension, trust, and review capacity in directions that compound if left unaddressed. The implication is that adaptivity, the disciplined ability to keep adjusting in light of evidence, becomes the defining organisational capability of the AI era. The rest of this article works through what that means structurally, technically, and humanly.

Two Structural Paths

Organisations that have moved beyond initial experimentation are arriving at one of two structural configurations. Each is viable under different conditions; neither is universally optimal.

Model A: The Amplified Small Team. A pod of 3 to 5 senior engineers uses AI to produce the output previously requiring a larger mixed-seniority team. AI handles generation; humans provide judgment, architecture, and oversight. The constraint is senior talent: this model concentrates expertise requirements rather than reducing them. It works well for product companies with a focused scope and the ability to attract and retain experienced engineers. It fails when the organisation lacks the senior depth to maintain oversight at scale.

Model B: The AI-Orchestrated Enterprise. Large organisations (200+ engineers) use AI as a coordination layer alongside flatter management structures. Meta has reported a 50:1 engineering-to-manager ratio in some AI-assisted contexts, compared to traditional ratios of 8:1 to 12:1. AI agents handle routine coordination, status tracking, and integration monitoring that previously required management layers. This model is emerging; its evidence base is thinner than that of Model A. It requires substantial investment in agentic infrastructure and the governance structures to operate safely at that level of AI autonomy.

Not all organisations are building software. Organisations whose primary function is delivering services or managing operations may apply these models to their technology functions without the same structural imperatives applying to the whole organisation.

The Engineering Foundations Question

A consistent pattern in the evidence is that AI amplifies both productive and dysfunctional engineering practices. The technical substrate determines the direction of amplification.

The foundations that matter most:

Testing investment. Veracode (2025) found that 45% of AI-generated code introduces OWASP Top 10 vulnerabilities (72% for Java). A CodeRabbit (2025) analysis of 470 open-source pull requests found that AI-generated code has 1.7 times more issues than human-written code. When AI writes the tests as well as the code, tests validate the AI’s implementation rather than the system’s intended behaviour. The investment in testing must increase when AI generates code, not decrease. Mutation testing, which verifies that tests can actually catch bugs, becomes essential. Chaos engineering becomes more important as intuition about failure modes decreases.

Architecture and observability. AI agents working on systems they cannot observe and cannot modify safely are dangerous. Fitness functions, automated checks that verify architectural properties, allow teams to evolve systems with AI assistance while enforcing the constraints that keep systems coherent. Modular architectures with clear seams give AI-generated changes a bounded impact.

Security disciplines. AI-generated code requires automated security scanning at the pipeline level. JetBrains (2025) found that 73% of developers are not using AI in their CI/CD workflows. The gap between AI capability and AI safety practice is significant and consequential.

Specification quality. Across both organisational models, specification quality, the ability to articulate precisely what the system should do and why, is the limiting factor for AI-generated output quality. AI generates code for the specification it receives. Vague specifications produce plausible-looking code that does the wrong thing. Investing in specification discipline is investing in the quality of AI output.

The Human Dimension

The evidence on AI’s effects on people within software organisations is more consistent than the evidence on productivity and more concerning.

The identity shift from maker to orchestrator is real and not adequately acknowledged. For many developers, the craft of writing code is not incidental to their professional identity; it is central. When AI handles the generation and human work shifts towards reviewing, directing, and evaluating, the experience of work changes. The finding by Shen and Tamkin (2026) that comprehension declines when reviewing AI-generated code rather than writing one’s own is partly a skills maintenance problem: the cognitive muscles built through creation atrophy when creation is delegated.

The junior developer pipeline is at structural risk. Data from Brynjolfsson et al. (2025) show that employment of 22- to 25-year-olds in the most AI-exposed occupations has fallen by approximately 16% relative to less AI-exposed occupations from 2022 to mid-2025.³ A Harvard Business School working paper by Hoffmann et al. (2025), using GitHub Copilot rollout as a natural experiment, shows that senior developers retain a larger share of the more interesting tasks under AI adoption, with corresponding implications for the learning opportunities available to junior contributors. AWS CEO Matt Garman described the “don’t hire juniors” strategy as “one of the dumbest things I’ve ever heard.” The succession paradox is this: organisations that optimise headcount now will face a senior talent shortage in 3 to 5 years, in a market where industry estimates suggest roughly three open AI-capable roles per qualified candidate, with reported fill times exceeding 100 days.

Cognitive load is rising, not falling. Research involving nearly 1,500 knowledge workers (Bedard et al., 2026) found that high-oversight workers reported 14% more mental effort and 12% more mental fatigue than low-oversight workers. Supervising AI output is cognitively expensive: it requires sustained, effortful attention of a different character from the attention involved in creating. A survey of 48,000 workers across 47 countries (KPMG & University of Melbourne, 2025) found that 57% concealed AI usage from their employers, a finding that signals, among other things, low psychological safety around AI disclosure.

−16%

developer jobs aged 22 to 25 (2022 to 2025)

57%

of workers conceal AI usage from their employer

+12%

fatigue in oversight roles vs non-oversight roles

3.2×

open AI-capable roles per qualified candidate

Figure 3. Four signals on the human dimension. Junior pipeline contraction, concealment, oversight fatigue, and senior talent scarcity all point in the same direction.

Psychological safety is not a cultural nicety in this context. It is an operational requirement. Organisations where people do not feel safe being honest about AI cannot learn from their own AI experience. The 57% concealment figure means that most organisations are making AI strategy decisions based on a heavily filtered view of reality.

What Leadership Must Do Differently

The evidence clearly distinguishes between leadership approaches that produce durable results and those that deliver short-term metrics at long-term cost.

The most common failure mode is what I call the Productivity Extractionist: a leader who sees AI primarily as a mechanism to extract more output from the same headcount, focuses relentlessly on throughput metrics, and treats every evidence-based concern about quality, cognitive load, or talent pipeline as a reason to accelerate, not to examine the system. The Cortex (2025) data show what happens: incidents rise, change failure rates increase, and the team moves fast towards a wall.

Effective AI-era leadership has three distinguishing characteristics.

First, it treats cognitive load as a strategic indicator. Leaders who track only output will miss the leading indicators of system degradation. Cognitive load, how sustainable the team’s work is, predicts quality decline and attrition before those outcomes manifest in the metrics.

Second, it protects the talent pipeline. This means maintaining junior hiring and mentoring investment even when the immediate productivity model doesn’t require it, and investing in the six AI engineering skills that I see distinguishing high-performing AI-augmented engineers, drawing on the broader 2025 to 2026 practitioner and research literature on AI engineering: intent specification, quality focus, architectural judgement, AI harness and model fluency,⁴ security rigour, and continuous learning.

Third, it deliberately runs a five-phase transition. Effective AI adoption does not happen by deploying tools and waiting for productivity to rise. It requires: building the technical and cultural foundations first; running contained pilots with honest measurement; codifying what works and deliberately spreading it; restructuring team design around the new capabilities; and establishing continuous learning loops that sustain improvement as AI continues to evolve.

Figure 4. The five-phase transition. Each phase precedes and informs the next; skipping phases tends to compound earlier shortcuts into later structural problems.

Dell’Acqua et al. (2025), in a field experiment at P&G involving 776 professionals, found that AI-augmented individuals completed 12% more tasks and worked 25% faster on average, with significantly improved solution quality. The specific finding reinforces the broader pattern: AI-enabled gains are real but conditional on maintaining the human expertise within which AI operates.

The Strategic Stakes

AI changes the economics of software development in ways that demand strategic reconsideration, not just operational adaptation.

The build-versus-buy calculation has shifted. Peer-reviewed evidence on AI-augmented professional work shows that individuals complete 12% more tasks and work 25% faster on average, with improved solution quality (Dell’Acqua et al., 2025). The gains are real but conditional on the human expertise and engineering foundations described earlier; they are not a free multiplier. Even so, decisions that previously favoured commercial off-the-shelf solutions deserve re-examination. Custom solutions that were previously impractical due to prohibitive development costs and timelines may now be viable strategic investments.

Business model pressure is real for software companies. If AI makes it possible to build in four months what previously took eighteen, the competitive advantage of large, established development teams diminishes. Product companies, custom development firms, and enterprise software vendors face different versions of this challenge, but none is immune. The premium service in software is no longer about building software; it is about judgment on what to build, understanding why customers need it, and the organisational capacity to keep improving it.

Ethics and accountability require deliberate structural treatment, not aspiration. When AI-generated code causes harm, through a security vulnerability, a biased decision system, or a failure that injures someone, the question of accountability cannot be answered by pointing to the AI. The responsibility remains with the humans and organisations that deployed it. Legal frameworks are catching up; the EU AI Act (European Union, 2024) establishes explicit risk tiers and conformity requirements for AI systems, including high-risk applications in hiring, credit, education, and critical infrastructure. Building the accountability structures that the regulatory environment will require is not optional work to be deferred. It is foundational work that is easier to build now than to retrofit later.

The Adaptive Capacity Distinction

The organisations that will compound their advantage over time are not those that acquire the best AI tools fastest. They are those who build the adaptive capacity to keep learning and adjusting as AI continues to evolve.

AI capabilities are becoming table stakes. Every serious technology organisation will achieve a functional level of AI-augmented delivery within 2 to 3 years. The competitive question is not who gets there first. It is who builds the organisational capacity to keep adapting after they get there, and after the tools change again.

Adaptive capacity has five dimensions that I see consistently distinguishing organisations that sustain advantage from those that achieve a moment of advantage and then plateau.

Solution orientation. Genuine, ongoing focus on customer and user outcomes, not on tool capabilities as goals in themselves. Organisations oriented to solutions keep asking whether AI is producing better outcomes for real people. Organisations oriented to capabilities keep acquiring new tools and measuring their own sophistication.

Fast improvement cycles. The organisational discipline of learning from experience quickly and adjusting deliberately. In the AI context, this means running experiments with honest measurement, capturing what works, intentionally spreading it, and not waiting for perfect evidence before updating practices.

Organisational multidexterity. The structural ability to operate at three tempos simultaneously: stable operations (keeping current systems reliable), AI capability development (building the competence to operate at higher levels of AI integration), and horizon scanning (watching where AI is heading and making early bets on what matters). Organisations that collapse these tempos, letting the urgent crowd out the important or letting the experimental destabilise the operational, lose the capacity to adapt before they lose the competitive position.

Adaptive leadership. Leaders who treat uncertainty as the permanent operating condition rather than a temporary disruption, who invest in learning infrastructure rather than directing from certainty, and who protect the human foundations (psychological safety, cognitive sustainability, talent pipeline) that AI quality ultimately depends on.

Healthy relationships. The trust, psychological safety, and open communication that make honest learning possible. An organisation in which 57% of employees conceal their use of AI cannot learn from its own experience. The relational substrate is not a soft concern in the AI transformation; it is the medium through which organisational learning travels.

Figure 5. The five dimensions of adaptive capacity. Each dimension feeds the whole and reinforces its neighbours; weakness in any one degrades the system.

These five dimensions are not independent. They are a system. When any one of them is absent, the others are weakened. When all five are present and sustained, the organisation compounds its advantage: each iteration of AI capabilities finds a team that is better able to use it well, learns from it faster, and adjusts more deliberately than the previous iteration.

This is what I mean by the adaptive organisation in the AI era. Not an organisation that has completed the transformation. An organisation that has built the capacity to keep transforming, with each wave of change finding it more capable, not less.

References

Bedard, J., Kropp, M., Hsu, M., Karaman, O. T., Hawes, J., & Kellerman, G. R. (2026, March 5). When using AI leads to “brain fry”. Harvard Business Review. https://hbr.org/2026/03/when-using-ai-leads-to-brain-fry
Brynjolfsson, E., Chandar, B., & Chen, R. (2025). Canaries in the coal mine? Six facts about the recent employment effects of artificial intelligence. Stanford Digital Economy Lab. https://digitaleconomy.stanford.edu/publication/canaries-in-the-coal-mine-six-facts-about-the-recent-employment-effects-of-artificial-intelligence/
CodeRabbit. (2025). State of AI vs. human code generation [Industry report]. CodeRabbit. https://www.coderabbit.ai/whitepapers/state-of-AI-vs-human-code-generation-report
Cortex. (2026). Engineering in the age of AI: 2026 benchmark report [Industry report]. Cortex. https://www.cortex.io/report/engineering-in-the-age-of-ai-2026-benchmark-report
Dell’Acqua, F., Ayoubi, C., Lifshitz-Assaf, H., Sadun, R., Mollick, E. R., Mollick, L., Han, Y., Goldman, J., Nair, H., Taub, S., & Lakhani, K. R. (2025). The cybernetic teammate: A field experiment on generative AI reshaping teamwork and expertise at Procter & Gamble [Working paper No. 33641]. National Bureau of Economic Research / Harvard Business School.
DORA. (2025). 2025 DORA report: State of AI-assisted software development. Google Cloud / DevOps Research and Assessment. https://cloud.google.com/resources/content/2025-dora-ai-assisted-software-development-report
European Union. (2024). Regulation (EU) 2024/1689 of the European Parliament and of the Council laying down harmonised rules on artificial intelligence (Artificial Intelligence Act). Official Journal of the European Union.
Faros AI. (2025). The AI productivity paradox: Research report [Industry report]. Faros AI. https://www.faros.ai/ai-productivity-paradox
GitHub. (2025). Octoverse 2025: The state of open source software. GitHub.
Hoffmann, M., Boysel, S., Nagle, F., Peng, S., & Xu, K. (2025). Generative AI and the nature of work (Working Paper No. 25-021). Harvard Business School. https://www.hbs.edu/ris/download.aspx?name=25-021.pdf
JetBrains. (2025). The state of developer ecosystem 2025. JetBrains.
KPMG & University of Melbourne. (2025). Trust, attitudes and use of artificial intelligence: A global study. KPMG International / University of Melbourne.
LinearB. (2026). 2026 software engineering benchmarks report [Industry report]. LinearB. https://linearb.io/resources/software-engineering-benchmarks-report
METR. (2025). Measuring the impact of Early-2025 AI on experienced open-source developer productivity. Model Evaluation and Threat Research. https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/
Shen, J. H., & Tamkin, A. (2026). How AI impacts skill formation (arXiv:2601.20245). arXiv. https://arxiv.org/abs/2601.20245
Stack Overflow. (2025). 2025 Stack Overflow developer survey. Stack Overflow.
Storey, M.-A. (2026). From technical debt to cognitive and intent debt: Rethinking software health in the age of AI. arXiv. https://arxiv.org/abs/2603.22106
Veracode. (2025). 2025 GenAI code security report. Veracode. https://www.veracode.com/resources/analyst-reports/2025-genai-code-security-report/
World Economic Forum. (2025). Future of jobs report 2025. World Economic Forum.

METR (2025). Measuring the impact of Early-2025 AI on experienced open-source developer productivity (arXiv:2507.09089). Note: single RCT, 16 participants; METR is conducting a larger follow-on study. The finding warrants attention without overstatement. ↩︎
Small sample; treat as indicative pending replication. The finding is directionally consistent with Storey’s (2026) theoretical model of cognitive and intent debt. ↩︎
The 16% is a relative decline within the 22–25 cohort (most AI-exposed vs least-AI-exposed occupations), not an absolute drop across all young developers; see Brynjolfsson et al., Canaries in the Coal Mine?, Stanford Digital Economy Lab, 2025. ↩︎
The harness is the scaffolding around a language model: the prompts, tools, retrieval mechanisms, context window, evaluation hooks, and orchestration logic through which a developer interacts with the model. Harness fluency is distinct from understanding the model itself; the same model behaves very differently depending on how it is harnessed, and an engineer needs both literacies to direct AI work effectively. ↩︎

The Central Finding: Amplification, Not Transformation

Six Findings That Change the Calculation

Two Structural Paths

The Engineering Foundations Question

The Human Dimension

What Leadership Must Do Differently

The Strategic Stakes

The Adaptive Capacity Distinction

References

Digital Tools for Product Backlog

Cookie Settings

Manage Cookie Settings

Essential Cookies