GitHub’s fixed-rate era for premium inference ends June 1, 2026. GitHub announced on April 271 that Premium Request Units (PRUs) across all Copilot plans are replaced by GitHub AI Credits billed against actual token consumption (input, output, and cached) at per-model API rates. Monthly plan prices are unchanged. The per-request subsidies are not.
What Changed and When
The PRU system gave every Copilot request a fixed cost expressed as a multiplier of a base rate. According to GitHub’s premium request documentation2, Claude Opus 4.7 and GPT-5.5 cost 7.5x, Claude Opus 4.5 and 4.6 cost 3x, Claude Sonnet 4.6 cost 1x, and Claude Opus 4.6 in fast mode (preview) cost 30x. That table expires June 1.
After June 1, a request to Opus 4.7 costs whatever its token footprint resolves to against GitHub’s published API rates for that model. The 7.5x multiplier was a fixed cost regardless of context window size. A request that sends 200k tokens of context to Opus 4.7 now costs proportionally more than a 10k-token request. The PRU system couldn’t express that distinction.
GitHub has not yet published the per-model API rates that will govern the new system as of April 28, 2026. Until those rates are posted, teams cannot calculate exact breakeven points.
The New Credit Economics by Plan
Monthly prices are unchanged. Included AI Credits match existing plan prices dollar-for-dollar:
| Plan | Price | Monthly AI Credits | Promotional Credits (June-August 2026) |
|---|---|---|---|
| Copilot Pro | $10/month | $10 | N/A |
| Copilot Pro+ | $39/month | $39 | N/A |
| Copilot Business | $19/user/month | $19 | $30/user |
| Copilot Enterprise | $39/user/month | $39 | $70/user |
Business and Enterprise plans receive bonus credits through August 20261. That buffer disappears September 1.
Credits don’t roll over. Unused credits in a billing cycle reset at the next renewal.
Code completions and Next Edit Suggestions remain included in all plans and consume no credits. The cost exposure is in chat, agent mode, and code review.
How Agent Workflows Get Repriced
The structural change hits hardest in agentic contexts. Under PRUs, a team running an agent loop with multiple Opus 4.7 calls paid a fixed 7.5x per request regardless of context size. Token-metered billing removes that ceiling. An agent that accumulates tool-call outputs across a long session and re-sends the full context window on each turn now pays for the accumulation.
Groundy covered the 7.5x multiplier and the April 20 plan changes as a separate story; the June 1 change retires the multiplier table entirely rather than adjusting it. The economic bet the old system implicitly made was that average context sizes would be small enough to absorb. Evidently they weren’t.
GitHub also introduced session and weekly limits based on token consumption that operate separately from premium request entitlements and can block usage even when request budgets haven’t been hit. Teams relying on agent mode need to account for both the per-token cost and the throughput ceiling.
The Hacker News discussion of the announcement3 drew considerable attention, with threads covering cost shock on the Pro+ tier, the absence of rollover, and comparisons to direct OpenRouter access at API rates without the subscription overhead.
Annual Subscriber Gotchas
Annual Pro and Pro+ subscribers don’t convert to credit billing on June 1. According to GitHub’s announcement, they stay on PRU-based pricing until their plan expires, but their model multipliers increase on June 1 regardless. At renewal, they drop to Copilot Free unless they convert early, in which case they receive prorated credits.
An annual subscriber who renewed in January 2026 keeps PRU billing until January 2027 but pays higher multipliers from June 1 onward. At renewal they either convert before the expiration date or land on the free tier with no premium model access.
New sign-ups for Copilot Pro, Pro+, and Student plans were paused April 20, ahead of the billing change. Opus models were removed from Pro plans at the same time; Opus 4.7 remains in Pro+.
The Hidden Code-Review Tax
Copilot code review picks up a second charge on June 1. GitHub’s changelog states that code review on private repositories will consume both AI Credits and GitHub Actions minutes starting June 1. Public repositories are unchanged.
Teams running automated review on private repos (particularly those with high PR volume or large changesets) face a compound cost that doesn’t appear in the monthly plan pricing. The Actions-minute component charges against each team’s existing Actions quota, separately from the AI Credits component.
What to Audit Before June 1
The credit amounts included in monthly plans are modest relative to heavy agentic usage. The Pro+ monthly allocation covers materially different workloads depending on which models and context sizes are in use. Without published per-model rates, exact projections are impossible.
Four areas to audit before June 1:
Context window hygiene. Agent loops that send full conversation history on each turn inflate token counts per call. Audit tool-call output accumulation and context pruning strategies. What looked cheap at a fixed 7.5x per request may be expensive at per-token billing.
Model selection. Under PRUs, choosing between Opus 4.7 (7.5x) and Sonnet 4.6 (1x) was a fixed cost tradeoff. Under token-metered billing, that tradeoff becomes continuous and depends on actual input/output size per request.
Code review on private repos. Teams that added automated Copilot code review without tracking Actions-minute consumption should audit expected PR volume before June.
Annual subscription dates. Subscribers renewing after June 1 face higher multipliers before their plan expires. Those close to renewal should decide whether to convert early for prorated credits or accept the Free tier drop.
The promotional credit buffer for Business and Enterprise runs through August 2026. September is when steady-state costs become visible.
Frequently Asked Questions
How long could a heavy Copilot Pro+ user realistically run agent mode before exhausting the monthly credit?
Hacker News commenters on the announcement noted that the $39 monthly AI Credit allocation can be burned through in roughly a single hour of intensive agentic coding with long context windows and repeated Opus 4.7 calls. This makes the credit buffer a safety net for light usage rather than a workable floor for power users.
How does the total cost of Copilot under token billing differ from buying model access through OpenRouter?
OpenRouter sells API access at published per-model rates without a subscription wrapper, whereas Copilot bundles subscription fees plus metered credits. Discussion in the community threads flagged OpenRouter as a direct escape hatch: teams doing mostly agentic work may pay less by routing model calls through OpenRouter while keeping only a minimal Copilot subscription for IDE completions, which remain included and credit-free.
Do the new token-based session and weekly limits apply to Copilot Business and Enterprise, or only to individual Pro plans?
GitHub introduced those throughput ceilings on April 20 as part of the individual-plan changes, and the announcement tied them specifically to Pro, Pro+, and Student tiers. Business and Enterprise plans are not explicitly mentioned as receiving the same session or weekly caps, though they remain subject to the AI Credit budget itself.
What is the cost risk for teams running automated Copilot code review on private monorepos with high PR volume?
The compound charge is steeper than the headline suggests because large changesets in monorepos trigger longer review jobs, consuming more GitHub Actions minutes per PR in addition to the AI Credits for the model call. Teams that previously treated Copilot review as a fixed subscription feature now face a variable cost scaled by both PR frequency and changeset size, with no promotional buffer covering the Actions-minute side of the bill.
When will teams be able to calculate exact breakeven points for their agent workflows?
GitHub plans to launch a preview bill in early May 2026, but the per-model API rates that govern credit consumption were still unpublished as of April 28. Until the rate card goes live, even the preview bill will only offer directional guidance, meaning exact cost modeling for high-volume agent loops remains blocked.