Wyecliff
All editions

AI Gets A Sales Team | Wyecliff Weekly

May 8, 20265 min read
The AI stories that actually matter this week | Edition #22 | May 8, 2026
This was a quieter week than the last few, but every big story pointed at the same thing. The biggest AI labs and the legacy tech giants have started selling more than the technology itself. They are now selling the people, services, and packaging that put AI to work inside an existing business. Anthropic teamed up with Wall Street to launch a $1.5 billion AI services firm. IBM made the coding tool it built for itself generally available. ChatGPT became measurably more reliable. And a Harvard study put OpenAI's reasoning model in the emergency room next to two real doctors. Let's get to it.

Now Available: The Wyecliff Weekly Podcast

Prefer to listen? Every edition of the Wyecliff Weekly is now generated as a podcast. Same stories, same analysis, same lessons. Just press here.
Listen to this edition

Top Stories

+ Anthropic, Goldman, And Blackstone Just Started An AI Consulting Firm

On Monday, Anthropic announced a $1.5 billion joint venture with Blackstone, Goldman Sachs, Hellman & Friedman, and General Atlantic to launch a new firm focused on getting Claude into mid-sized companies. The target is Blackstone's portfolio first, which alone covers more than 250 mid-sized companies, and then expansion to other private-equity-owned businesses. Blackstone President Jon Gray said the new firm is meant to break what he called the biggest bottleneck in enterprise AI right now, which is the shortage of engineers who can actually implement advanced AI inside an existing business.
What it all means: Some of the most sophisticated capital in the world just put $1.5 billion behind a single bet, and the bet is not a model or a piece of software. It is the engineering hours and applied AI staff required to actually deploy AI inside an existing business. The other major AI labs will follow with their own services arms inside six months, and the traditional consulting firms will respond fast.

+ A Harvard Study Found AI Beat Two Doctors On Emergency Room Diagnoses

A study published in Science on Sunday, led by researchers at Harvard Medical School, Beth Israel Deaconess Medical Center, and Stanford, ran 76 real emergency room cases through OpenAI's o1 reasoning model and compared the results to two attending physicians. The model offered the exact or very close diagnosis 67% of the time. The two doctors hit that mark 50% and 55% of the time. The study also tested whether the model could recommend appropriate diagnostic tests and triage decisions, and found it kept pace with or outperformed the doctors there too. The authors were careful with the framing. The two doctors were internal medicine attendings, not ER specialists. The cases were text-based, not real patients in front of a clinician. And the paper called for controlled trials before any of this gets used in actual treatment. One of the authors told the press: "We are already at the ceiling on what we can learn from these comparison studies. We need to see how this works in real care."
What it all means: A peer-reviewed study just put a paid AI model ahead of two trained physicians on a real-world clinical task. The doctor's appointment, the ER visit, and the second opinion are all going to start including an AI somewhere in the loop within the next two years.

+ ChatGPT Just Got A Lot More Reliable

On Tuesday, OpenAI rolled out GPT-5.5 Instant as the new default model in ChatGPT, replacing GPT-5.3. According to OpenAI's own numbers, the new default produces 52.5% fewer made-up claims on high-stakes prompts in medicine, law, and finance, and 37.3% fewer factual errors on the kinds of conversations users had previously flagged as wrong. The response style is also tighter, with fewer filler emojis and less padding around answers. The bigger change is a new feature called memory sources. When ChatGPT uses something it remembers about you to shape an answer, you can now click to see exactly which saved memory was used, and edit or delete that memory directly. The setting is in the same Personalization menu where memory has lived for the past year.
What it all means: The model people use most just became more accurate by default and easier to correct when it remembers something wrong. Anyone who turned ChatGPT memory off because it felt opaque should give it another look. Anyone using ChatGPT for high-stakes work just got a free upgrade.

+ IBM Released The Coding Agent It Built For Itself

At its annual Think conference on Tuesday, IBM announced general availability of Bob, an AI development partner the company has been using internally since June 2025. Bob started inside IBM with 100 developers and grew to more than 80,000 IBM employees in less than a year. Self-reported productivity gains across that group averaged 45%. The headline customer example, Blue Pearl, used Bob to complete a Java upgrade in three days that normally takes about 30. Bob is built to handle more than code generation. It plans, codes, tests, deploys, and modernizes legacy systems, with security and compliance controls layered on top. It works across multiple AI models, not just IBM's own. A 30-day free trial is open to any company. Pricing for the paid plans was not published with the launch.
What it all means: IBM is the first major tech vendor to ship a developer tool that takes action on its own across the full software lifecycle, with proof from inside its own 270,000-person company. Every other enterprise software vendor will now have to put a comparable tool in front of buyers, with productivity numbers attached, or explain why their own engineers are not using one yet.

More Stories

Microsoft Published Its Annual Global AI Diffusion Report.

Released Thursday, the report says global AI use rose to 17.8% of the world's working-age population in Q1 2026, up from 16.3%. The United States moved up to 21st place at 31.3% adoption. The UAE remains in first at 70.1%. South Korea, Thailand, and Japan saw the biggest jumps in the quarter.

OpenAI Is Fast-Tracking Its First Phone For Early 2027.

Veteran Apple analyst Ming-Chi Kuo reported on Monday that OpenAI's first AI phone, designed with Jony Ive, is now targeting mass production in the first half of 2027. Earlier reports had pointed to late 2027 or 2028. The phone will run on chips from MediaTek and Qualcomm. The pull-in is reportedly tied to OpenAI's planned IPO and rising competition in AI agent hardware.

The Musk vs. Altman Trial Closed Its Second Week.

Judge Yvonne Gonzalez Rogers split the case into a liability phase, expected to wrap by May 21, and a separate remedies phase later. Musk's testimony from week one continued to dominate week two coverage, including new emails about a Tesla board seat he allegedly offered Altman in 2017. Most legal analysts watching the trial expect Musk to lose the liability phase.

Why It Matters

In every top story this week, the news was about deployment, reliability, and people, not new capability. The capability is widely available now. The work of putting it safely inside a real business, hospital, or daily software product is where this week's biggest moves were made.

For You

The GPT-5.5 update is the change you will notice without doing anything. The default model is more accurate, less padded, and easier to correct when it gets something wrong about you. Anyone who turned memory off because it felt opaque has a reason to try again. The Harvard ER paper is the slower-moving but more important one. The model performed at or above two trained physicians on real diagnostic tasks, and the same study called for controlled trials before any of it gets used on patients. The capability is there. The work of putting it safely inside a real hospital is the same gap that ran through the rest of the week's news.

For Your Work

The capital pool behind the new Anthropic investment includes Blackstone, Goldman Sachs, Hellman & Friedman, General Atlantic, Apollo, Leonard Green, GIC, and Sequoia. All going to engineering hours, applied AI staff, and multi-year customer support inside mid-sized companies. The IBM Bob launch reinforces the point. The headline number IBM led with was a 45% productivity gain measured across its own engineers, not a benchmark score.

One Thing To Try This Week

How To Get AI To Do A Research Project For You

Last week we covered how to connect AI to the apps you already use. This week builds on that. Once your AI can read your work, the next step is sending it out into the world to do real research on your behalf. All three major platforms now have a research mode that runs for several minutes (sometimes longer), reads dozens of sources, and returns a structured report with citations. Here is how to use each one in under five minutes.

Claude (via Cowork or claude.ai)

  1. Open Claude on the web, desktop, or mobile. Research is included with paid plans (Pro, Max, Team, or Enterprise).
  2. Look for the Research button in the chat input area, on the bottom left. Click it. The button turns blue when on.
  3. Make sure web search is also enabled. Settings > Features > Web search. Research uses web search to find sources.
  4. Type your research question. Be specific about the question, the depth you need, and any constraints. Claude will plan multiple searches, run them in sequence, and synthesize the findings.
  5. Wait one to a few minutes for shorter queries, longer for deeper ones. The response includes inline citations you can click to verify each claim.

ChatGPT

  1. Open ChatGPT. Deep Research is available on the Free tier (5 queries per month), Plus or Team (about 25 per month), and Pro (about 250 per month).
  2. In the message composer, click the tools menu (the icon next to the text box) and select Deep research.
  3. Write a clear prompt that includes the question, the output format you want, and any constraints. Attach PDFs or spreadsheets if they add useful context.
  4. ChatGPT will sometimes ask one or two clarifying questions. Answer them. A sidebar then opens showing each step the agent takes and which sources it is reading.
  5. To restrict the search to trusted sources, click Sites in the prompt area and choose Manage sites. You can require certain domains or simply prioritize them while still allowing the open web.
  6. Wait five to thirty minutes. You will get a notification when the report is finished. Export it as Markdown, Word, or PDF.

Microsoft Copilot

  1. Open the Microsoft 365 Copilot app. Researcher requires a Microsoft 365 Premium subscription or a Microsoft 365 Copilot add-on license. Each licensed user gets up to 25 Researcher queries per month.
  2. In the left sidebar, click Agents and select Researcher.
  3. Type your question. Researcher pulls from your work files (Outlook, OneDrive, SharePoint, Teams) and the public web at the same time.
  4. Researcher will sometimes ask follow-up questions to scope the project. Answer them. The agent then runs for several minutes and returns a structured summary with key insights and source citations.
  5. Behind the scenes, Researcher uses both OpenAI's reasoning models and Anthropic's Claude. Microsoft pairs them so one model produces the research and the other reviews it before delivery, which tends to reduce errors on long reports.
Try This Prompt
I am evaluating [vendor / decision / market]. Run a research project that covers (1) the three biggest competitors and how they price, (2) the two most-cited risks customers report, (3) any regulatory or major news changes in this space in the past six months, and (4) the names of three independent experts who have written about this area in the past year. Cite every claim with a working link. Format the output as a one-page brief with an executive summary at the top and a sources list at the bottom.

Spread The Word

Love the Wyecliff Weekly? Refer a friend and earn rewards on our referral program.

See the referral tiers
Wyecliff Weekly

Never miss an edition.

Wyecliff Weekly lands every Friday at 6 AM Mountain. One short email with the moves that matter, a prompt to run on Monday, and one playbook from the field.

One email a week. No spam. Unsubscribe anytime.

Common Questions

When does Wyecliff Weekly arrive?
Every Friday morning at 6 AM Mountain. One email a week with the biggest AI stories, what they mean for your work, and one thing to try before Monday.
Is Wyecliff Weekly free?
Yes. Subscribe with your work email and read every edition at no cost. We write for operators who want plain takes, not a sales pitch.
What is inside each edition?
Top AI news with context, a short take on why it matters for your business, and one practical prompt or workflow you can run this week.
How do I unsubscribe?
Every email includes an unsubscribe link at the bottom. One click and you are off the list. No hoops.
/newsletter/wyecliff-weekly-edition-22