Eyes on the Chaos
Friday, May 29, 2026

Archived edition

Friday, May 29, 2026

10 stories curated from 16 sources

In today's issue

DesignEthicsProduct
  1. 01
    Claude's new model is more 'honest' when it messes up

    Anthropic's Claude Opus 4.8 flags uncertainty instead of confidently guessing.

  2. 02
    The internet is being rebuilt for machines

    AWS and Cloudflare redesign infrastructure for AI agents over humans.

  3. 03
    Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code

    Developer hides malicious prompt in code to sabotage AI assistants.

  4. 04
    Adobe's conversational AI agent is a mediocre design intern

    Adobe's AI assistant explains its process well but delivers uninspiring results.

  5. 05
    Microsoft 365 Copilot gets a speed boost and cleaner design

    Redesigned Copilot loads twice as fast with adaptive interface controls.

  6. 06
    How to help people who don't read discover new features

    Strategies for feature discovery when users skip reading announcements entirely.

  7. 07
    What to do after a design critique ends

    Missing follow-up steps often cause design feedback culture to break down.

  8. 08
    Prompts are technical debt too

    Custom prompting creates maintenance burden; rely on upstream tool improvements instead.

  9. 09
    Anthropic raises $65 billion, nears $1T valuation ahead of IPO

    Anthropic closes massive round at $965B valuation before expected IPO.

  10. 10
    CNN sues Perplexity over 'verbatim' copycat articles

    CNN alleges Perplexity copies content verbatim and bypasses subscription walls.

AI Research & News

Claude's new model is more 'honest' when it messes up

The Verge

Product

Anthropic's Claude Opus 4.8 flags uncertainty instead of confidently guessing.

  • Key improvement: The model is trained to admit when it's uncertain rather than making confident claims based on thin evidence.
  • Why it matters: AI models frequently "jump to conclusions" and present speculative work as definitive progress, creating reliability issues.
  • Early feedback: Testers report Opus 4.8 is more likely to flag uncertain reasoning and acknowledge knowledge gaps.
  • Broader trend: This reflects growing industry focus on AI safety and reducing hallucinations in production systems.

For product

Consider how your team handles AI uncertainty in user-facing features — transparent error states may be more valuable than confident-seeming guesses.

The internet is being rebuilt for machines

TechCrunch

Product

AWS and Cloudflare redesign infrastructure for AI agents over humans.

  • Infrastructure shift: Cloud providers are redesigning systems to handle machine-generated traffic rather than optimizing for human users.
  • Agent-first design: As AI agents move from experiments to production, traffic patterns and performance requirements are fundamentally changing.
  • Who's adapting: Major players like AWS and Cloudflare are leading the infrastructure transformation.
  • Timeline: This shift reflects AI agents transitioning from prototypes to real-world deployment at scale.

For product

Your infrastructure assumptions may be outdated — consider how AI agent traffic differs from human usage patterns when planning scalability.

Fed up with vibe coders, dev sneaks data-nuking prompt injection into their code

Ars Technica

EthicsProduct

Developer hides malicious prompt in code to sabotage AI assistants.

  • Hidden attack: A developer secretly added prompt injection code to the jqwik library that instructs AI coding agents to delete application output.
  • Motivation: The sabotage appears aimed at developers who rely heavily on AI coding assistants rather than understanding code themselves.
  • Security risk: This demonstrates how malicious prompts can be embedded in seemingly legitimate codebases to attack AI tools.
  • Detection challenge: The injection was undisclosed, showing how difficult it is to spot these attacks in real-world code.

For product

Add AI prompt injection to your security review checklist — this attack vector will likely become more common as AI coding tools spread.

Product & UX

Adobe's conversational AI agent is a mediocre design intern

The Verge

DesignProduct

Adobe's AI assistant explains its process well but delivers uninspiring results.

  • Mixed performance: The AI excels at explaining its editing process step-by-step but produces mediocre visual results.
  • Design approach: Unlike typical AI tools for non-designers, this is built to assist experienced designers with busywork.
  • User experience: The conversational interface makes users feel more involved in the creative process than traditional AI image tools.
  • Current limitations: Despite good communication, the actual design output quality remains disappointing.

For design

Consider prioritizing AI transparency and explanation over raw output quality — designers value understanding the process even when results are imperfect.

Microsoft 365 Copilot gets a speed boost and cleaner design

The Verge

DesignProduct

Redesigned Copilot loads twice as fast with adaptive interface controls.

  • Performance gains: The redesign delivers 2x faster loading speeds across desktop and mobile platforms.
  • Progressive disclosure: Copilot now shows relevant tools and controls based on your specific prompt, reducing interface clutter.
  • Response quality: Microsoft promises more reliable and structured responses that are easier to scan quickly.
  • Rollout scope: The update is launching across all desktop and mobile Microsoft 365 applications.

For design

Progressive disclosure for AI tools is worth exploring — showing context-relevant controls reduces cognitive load better than static interfaces.

How to help people who don't read discover new features

UX Collective

DesignProduct

Strategies for feature discovery when users skip reading announcements entirely.

  • Core problem: Most users don't read feature announcements, making traditional onboarding content ineffective.
  • Discovery challenge: Teams need new approaches to help users find and adopt features without relying on text-heavy explanations.
  • Design opportunity: This pushes teams toward more visual, contextual, and progressive disclosure methods.
  • User behavior: Understanding that reading is optional forces better feature design from the start.

For design

Audit your current feature rollout process — if it relies on users reading announcements, you're probably missing 80% of adoption opportunities.

What to do after a design critique ends

Sidebar.io

Design

Missing follow-up steps often cause design feedback culture to break down.

  • Common gap: Most designers invest heavily in running critiques but completely skip the follow-up phase.
  • Culture impact: The missing post-critique step is often why feedback culture fails in design teams.
  • Process breakdown: Without proper follow-up, good critique sessions don't translate into better design outcomes.
  • Team dynamics: Poor follow-through undermines trust and participation in future critique sessions.

For design

Implement standard post-critique protocols in your team — document decisions, assign next steps, and close the feedback loop to build sustainable critique culture.

Prompts are technical debt too

Sidebar.io

Product

Custom prompting creates maintenance burden; rely on upstream tool improvements instead.

  • Hidden maintenance: Custom prompts require ongoing maintenance and updates just like any other code, but teams often ignore this.
  • Better approach: Minimize custom prompting and depend on upstream AI tool improvements rather than building complex prompt engineering.
  • Technical debt: Elaborate prompt systems become legacy code that's difficult to maintain and update over time.
  • Strategic trade-off: Teams should weigh short-term prompt customization against long-term maintenance costs.

For product

Treat prompts like any other technical dependency — establish versioning, testing, and maintenance processes before your prompt library becomes unmaintainable.

Business & Strategy

Anthropic raises $65 billion, nears $1T valuation ahead of IPO

TechCrunch

Anthropic closes massive round at $965B valuation before expected IPO.

  • Massive scale: The $65 billion Series H values Anthropic at $965 billion post-money, potentially its final private round.
  • IPO signals: This funding round positions the company for a highly anticipated public offering.
  • Market position: Establishes Anthropic as a top-tier AI competitor alongside OpenAI in the foundation model race.
  • Investment climate: Shows continued massive investor appetite for AI infrastructure despite broader tech market caution.

For product

Anthropic's scale will likely accelerate enterprise AI adoption — worth reviewing your team's AI vendor strategy and integration plans.

CNN sues Perplexity over 'verbatim' copycat articles

The Verge

Ethics

CNN alleges Perplexity copies content verbatim and bypasses subscription walls.

  • Core allegations: CNN claims Perplexity's AI generates "verbatim" copies of its reporting and provides subscription-locked content to users.
  • Technical concerns: Perplexity allegedly ignores CNN's efforts to block its unidentified web crawlers from scraping content.
  • Legal precedent: This lawsuit could set important boundaries for how AI companies can use news content for training and responses.
  • Industry impact: The case highlights growing tension between AI companies and content creators over fair use and compensation.

For ethics

Review your AI content policies now — this case will likely influence how courts view AI scraping and reproduction of copyrighted material.