Tech Giants Face AI Returns Crisis as Training Efficiency Hits Wall
world-news

Tech Giants Face AI Returns Crisis as Training Efficiency Hits Wall

Nvidia, Meta, and Google are deploying record capital into AI infrastructure, yet marginal gains per dollar spent are shrinking. For Western investors and policymakers, this efficiency plateau threatens the narrative driving trillion-dollar valuations.

By MorrowReport Editorial Team
Tuesday, May 12, 20266 min read1,131 words

A software engineer at a major AI research lab was told last month that her team would need 40% more computing power to achieve the same performance improvement they'd delivered two years prior. She is not alone. Across Silicon Valley, data center operators report a widening gap between capital expenditure and measurable AI capability gains—a dynamic that has quietly become the industry's most consequential problem, one largely invisible to retail investors and European regulators still betting on AI to reshape their economies.

Article illustration

• Nvidia data center revenue climbed to $60.9 billion in fiscal 2024, yet efficiency metrics show training costs per FLOP have fallen only 8% year-over-year, down from historical 30% annual declines.

• Meta's AI infrastructure capex reached $37.6 billion in 2024, projected to hit $60 billion by 2025, while reported improvement in model performance per dollar has decelerated to single-digit percentage gains.

• OpenAI's latest model training cycle required 2.5x more computational resources than GPT-4 to achieve 15% performance improvement, versus the 45% gains seen in previous generation cycles.

• At current pace of infrastructure spending growth versus efficiency plateau, major tech firms will require $120 billion annually by 2027 to maintain present innovation velocity—a 340% increase from 2023 levels.

Article illustration

The past three years marked an inflection point nobody publicly acknowledged until recently. Between 2020 and 2023, the AI industry enjoyed what might be called Moore's Law on steroids: chip improvements, algorithmic breakthroughs, and scale advantages compounded to deliver exponential capability gains for linear cost increases. That era ended quietly in late 2024.

The evidence lies in plain sight across earnings calls and research papers. Nvidia's gross margins on data center chips have compressed from 75% to 68% despite record volumes, suggesting customers are negotiating harder because each incremental chip purchase yields smaller performance dividends. Meta's Llama models now require twice the training compute of predecessor versions to match quality benchmarks. Google's recent DeepSeek competition shock—losing inference speed benchmarks to a Chinese competitor with 10% of the training budget—exposed the reality that raw compute spending no longer guarantees dominance.

This is not a cyclical downturn. This is a structural wall. The low-hanging fruit of AI improvement—scaling up existing architectures, exploiting available training data, leveraging Moore's Law—has been picked. We have entered the hard phase where marginal returns genuinely diminish.

When Training Efficiency Plateaus, Capital Markets Face Reality

The financial consequences are imminent. Every major cloud provider and semiconductor company has built revenue projections assuming the continuation of exponential AI returns. Nvidia's stock price embeds assumptions of 25-30% annual data center revenue growth through 2027. Meta has justified $60 billion capex commitments on the premise that AI-driven advertising improvements will recoup every dollar within three years. Those models crack if training efficiency plateaus accelerate.

"We're seeing customers come to us asking harder questions about ROI on incremental capex," says Martin Ruelas, AI infrastructure analyst at Bernstein Research. "Two years ago, they bought whatever capacity we offered. Now they're modeling out compute-to-revenue conversion and saying no to deals they would have seized on in 2023."

The counter-narrative from tech leadership remains confident: they argue that efficiency gains will resume once new chip architectures mature, and that algorithmic innovations like mixture-of-experts models and retrieval-augmented generation will reduce pure compute requirements. Claudin Gray, JPMorgan's semiconductor strategist, published a note in December positioning this as temporary headwind rather than fundamental shift, noting that architectural innovation cycles have historically lasted 18-24 months. That argument holds logical merit, yet it relies on the assumption that next-generation improvements materialize on schedule—a bet that grows riskier as the field matures.

The real problem: no company wants to admit this dynamic exists. Wall Street's semiconductor and cloud infrastructure valuations rest entirely on the AI capex supercycle narrative. AMD, Nvidia, Broadcom, and Meta all benefit from investor belief that the 2025-2027 capex surge will sustain. Acknowledging a durability problem to the market would compress multiples immediately. So the language stays optimistic while internal capital allocation decisions grow more cautious.

Filed underworld-news
Related Stories