groundy
industry & business

OpenAI Pushes ChatGPT Into Compensation Data, Pressuring Mercer and Radford

OpenAI's 3M daily compensation queries push ChatGPT into salary benchmarking, but the model lacks the proprietary employer panels behind Radford and Mercer's moat.

6 min · · · 4 sources ↓

OpenAI published a compensation-insights study and launched a personal-finance feature in ChatGPT on June 4, 2026. The following Monday, it confidentially filed its S-1. The two moves point in the same direction: ChatGPT is becoming a conversational front end for salary benchmarking, a market built on proprietary employer-submitted pay panels sold by the likes of Aon’s Radford McLagan, Mercer, and Payscale. The question is whether a model trained on public data can erode a moat that depends on data employers voluntarily submit under NDA.

What OpenAI announced

OpenAI reported that roughly 3 million messages per day in the US ask ChatGPT about wages, compensation, or earnings, based on a privacy-preserving analysis using automated classifiers, per its June 2026 compensation report. That volume is a signal in itself: workers are already treating a general-purpose chatbot as a salary-lookup tool, without any compensation-specific product existing until now.

Alongside the usage data, OpenAI introduced WorkerBench, a new evaluation framework for measuring how well ChatGPT handles labor-market tasks valuable to workers. The specific methodology, benchmark dataset, and detailed results were not fully disclosed in the published announcement.

According to OpenAI’s analysis, among labeled wage-benchmarking messages, pay-calculation queries account for 26%, questions about a specific role for 19%, entrepreneurship for 18%, specific-role-at-a-company questions for 11%, and broader occupation-and-career questions for 11%. The queries over-index in higher-skill, less transparent occupations: creative fields, management, healthcare, and computer and mathematical roles, where pay ranges are harder to benchmark and more negotiable.

The incumbents’ moat

Aon’s Radford McLagan Compensation Database covers over 30 million employees across 115 countries and 150 job functions, built on data from 8,500-plus participating organizations. The data is employer-submitted, validated, and sold behind enterprise-priced subscriptions. That is the structural moat: not the analysis layer, but the panel itself. Employers contribute compensation data they would prefer competitors not see, on the condition that it is aggregated and anonymized.

In March 2026, Aon launched enhancements to the Radford McLagan database specifically targeting AI-era roles: AI-specific job families (head of AI, ML engineer, AI ethics), an AI-enabled job-matching agent, an AI Compensation Assistant, and real-time labor market signals alongside traditional survey data. Aon’s own press release acknowledges that “traditional frameworks are struggling to keep up” and that “pay premiums for AI-driven skills are raising the stakes.”

The timing is not coincidental. Aon is fortifying the panel against the exact use case ChatGPT now addresses from the worker side: fast, conversational, free compensation lookups.

The provenance gap

The structural difference between ChatGPT’s salary output and a Radford report is not accuracy on public benchmarks. It is provenance.

OEWS data, published by the Bureau of Labor Statistics, is derived from employer surveys. It publishes median wages for standard occupation codes at national, state, and metro levels. It is rigorous and it is public. An LLM that reproduces OEWS numbers is doing useful work, but it is doing the same work a BLS website query does, with a conversational wrapper.

What Radford and Mercer sell is different: employer-submitted, company-level pay data for specific roles, broken out by geography, company size, revenue band, and tenure. That data does not exist in any public dataset. An LLM cannot synthesize it from web text because it was never published. The gap is not in the model’s capability but in its training corpus.

This matters for workers negotiating offers. ChatGPT can tell a software engineer that the national median for their role is approximately $X, per OEWS. It cannot tell them what Company Y pays for that role at that level, because that information lives in a proprietary panel the model has no access to. The conversational layer adds speed and accessibility. It does not add data that was not already public.

The personal finance launch

The same day, OpenAI launched a personal-finance experience for Pro US users, connecting 12,000-plus financial institutions via Plaid. An Intuit partnership is forthcoming for action-oriented workflows like credit card applications and tax estimates inside ChatGPT.

The feature has hard boundaries: per OpenAI’s documentation, ChatGPT cannot move money, pay bills, place trades, or file taxes. It is an advisory and synthesis layer, not a financial application. The Plaid integration lets it read account data and transaction history; the Intuit integration would extend that into product recommendations and tax estimation. Combined with the compensation-insights work, the trajectory is clear: OpenAI is building a personal-financial-advisory surface that includes income benchmarking, spending analysis, and product recommendations, without becoming a registered financial institution itself.

What changes for workers and HR vendors

For workers, the practical effect is straightforward. A free tool that surfaces public salary benchmarks, explains compensation structures, and answers role-specific pay questions at the speed of conversation is now available to anyone with a ChatGPT account. The negotiating asymmetry that favors employers, who subscribe to Radford or Mercer data and arrive at offer conversations with precise comp benchmarks, has a partial counterweight on the worker side.

For HR vendors, the threat is not that ChatGPT replaces their panels. It cannot, for the provenance reasons described above. The threat is that the free tier of salary information becomes good enough that fewer workers seek out the paid products, and fewer employers feel pressure to subscribe when candidates arrive with their own data. The classic incumbent pattern applies: a free product that covers most use cases tends to win against an expensive one that covers all of them, except for the buyers who genuinely need complete coverage.

Aon’s March 2026 move to add AI-specific job families and real-time labor signals to Radford suggests the incumbents see this clearly. Their bet is that employer-submitted panel data remains defensible because it cannot be replicated from public sources. That bet has been correct for decades. Whether it stays correct depends on whether enough compensation information leaks into public channels, through job postings, Glassdoor-style self-reporting, and regulatory pay-transparency mandates, to make the LLM-synthesized version competitive with the surveyed version.

For now, the two products coexist on different sides of the table. ChatGPT tells workers what the market looks like. Radford tells employers what they should pay. The data asymmetry persists. What has changed is that one side no longer needs a premium-priced subscription to participate.

Frequently Asked Questions

Does WorkerBench evaluate against proprietary employer data or only public statistics?

WorkerBench evaluates GPT-5.4 against 2024 OEWS median wages at national and metro levels. OpenAI characterizes results as having high coverage, small bias, and most estimates close to the benchmark, but the evaluation is confined to publicly available BLS figures. It does not test against employer-submitted panel data, which is the dataset incumbents sell.

How did GPT-5.5 score on personal-finance tasks?

On an internal benchmark graded by over 50 finance professionals from leading institutions, GPT-5.5 Thinking scored 79 out of 100 and GPT-5.5 Pro scored 82.5 out of 100. The scoring rubric, question set, and professional-selection criteria were not disclosed, so the numbers cannot be independently reproduced or compared against other financial-advisory tools.

What would have to change for LLM salary estimates to match proprietary panel granularity?

Three forces could compress the gap: pay-transparency mandates that force employers to publish ranges, growth of self-reported salary platforms such as Levels.fyi and Glassdoor, and job postings that increasingly include compensation figures. Even if those forces converge, senior and niche roles would improve last because fewer public data points exist for those positions.

Is the personal finance feature available to all ChatGPT users?

As of the June 4, 2026 launch, it is restricted to Pro subscribers in the US, with conversations defaulting to the GPT-5.5 Thinking model. OpenAI has not announced a timeline for free-tier access or international rollout, and the Intuit partnership for action-oriented workflows (credit card applications, tax estimates) is listed as forthcoming with no ship date.

sources · 4 cited

  1. Equipping workers with insights about compensation primary accessed 2026-06-09
  2. A new personal finance experience in ChatGPT primary accessed 2026-06-09
  3. Radford McLagan Compensation Database vendor accessed 2026-06-09
  4. Aon Launches Radford McLagan Compensation Database Enhancements as AI Redefines Workforce Skills and Compensation vendor accessed 2026-06-09