Claude Code Usage Limits

Claude Code usage limits explained: subscription vs API metering, session and weekly limits, context, /usage, /cost, /clear, /compact, and cost control.

Claude Code usage limits are the account, plan, context, or billing boundaries that determine how long you can keep working before you need to wait, clear context, change model, buy extra usage, upgrade, or switch to API billing.

They are not simple message counts. A short prompt in a huge session can cost more than a long prompt in a clean session because Claude Code carries conversation history, CLAUDE.md, files it has read, tool output, and model reasoning into later turns.

Last checked on May 24, 2026. Exact allowances, reset windows, model names, usage-credit rules, and subscription handling can change. Trust the reset message inside Claude Code and your account dashboard over any third-party table.

Short Answer

If you use Claude Code through a Claude subscription, usage is tied to your plan and can be shared with other Claude surfaces. If you use an API key or cloud provider, usage is metered by tokens and provider billing rules.

Question	Practical answer
Are limits message counts?	No. Context size, model, tools, files, and session length matter.
Does Claude Code share Claude app limits?	For paid subscription plans, official support says Claude product surfaces share usage limits.
What if I use an API key?	You are billed by token/provider usage rather than a subscription reset window.
Is context full the same as usage limit?	No. Context length is about how much the model can hold in one session.
Fastest way to save usage?	Use `/clear` between unrelated tasks and `/compact` during long tasks.

The Four Different "Limits"

Users often mix these up:

Limit type	What it means	What to do
Usage limit	Your subscription, seat, or org allowance has been used for the current window.	Wait for reset, use `/usage`, switch model, enable extra usage if available, or upgrade.
Context or length limit	The session is carrying too much conversation, file content, tool output, or MCP output.	Use `/compact`, `/clear`, trim logs, and split the task.
Rate limit	Requests or tokens are arriving faster than your account/provider allows.	Reduce parallel runs, slow scripts, retry later, or request appropriate limits.
Credit or spend limit	API credits, auto-reload, cloud quota, or workspace spend cap stops the run.	Check the Console or cloud billing dashboard before retrying.

When someone says "Claude Code limit", first identify which of the four it is. The fix is different.

Subscription vs API Metering

Login path	How usage works	What "running out" looks like	Where to check
Pro / Max via `/login`	Included subscription usage, shared across Claude surfaces.	Session, weekly, or model-specific reset message.	Claude Code usage message, Claude account settings, `/usage`.
Team / Enterprise via `/login`	Organization seat, pool, or policy rules.	Team/org limit message or admin-managed restriction.	Admin console, org analytics, policy settings, `/usage`.
`ANTHROPIC_API_KEY`	Pay-as-you-go token billing through Console.	Usually no subscription-style reset; spend continues until credit/spend/rate limit.	Claude Console usage, billing, spend limits, `/cost`.
Bedrock / Vertex / Microsoft Foundry	Provider-side token billing and quotas.	Provider quota, rate, or billing behavior.	Cloud console, quota page, provider billing.

Important: if an ANTHROPIC_API_KEY environment variable is active, a terminal session may use API billing instead of your subscription path. Check authentication before assuming which bucket you are spending.

What Consumes Usage

Every turn can include:

Conversation history from the current session.
CLAUDE.md, local memory, settings, rules, and loaded skills.
Files Claude Code has read.
Tool results, command output, search output, MCP output, and diffs.
Your new prompt.
The selected model and reasoning effort.

The multiplier is accumulated context. A session that already explored twenty files, printed large logs, and generated several diffs gets heavier on later turns.

Commands That Help

Command	Use it for
`/usage`	See plan limits and reset information when available.
`/cost`	See token and dollar usage for API-billed sessions.
`/model`	Inspect available models and switch to a lighter model when appropriate.
`/clear`	Start a fresh conversation; best when switching tasks.
`/compact`	Summarize history and keep going on the same task.
`/context`	Inspect what is loaded and find bloated context.
`/status`	Confirm the active account, project, and authentication path when available.

If a command is not visible in your build or plan, use the UI message, account dashboard, or provider dashboard as the source of truth.

What To Do When You Hit A Limit

Situation	Best next step
"Session limit" or "weekly limit" with reset time	Wait for the reset, switch model, enable extra usage if your plan supports it, or pause high-intensity work.
"Opus limit"	Switch to Sonnet for execution, keep Opus for planning only, or wait for reset.
Context window is full	Run `/compact` if continuing the same task; run `/clear` if starting a different task.
API spend is rising quickly	Stop the script, check active credentials, inspect Console/provider usage, then add spend limits.
Rate limit error	Reduce concurrency, pause automation, retry later, or use provider/admin limit controls.
Wrong account or billing path	Use `/status`, `/login`, environment checks, and provider dashboards before continuing.

Do not blindly retry after a limit. Repeated retries can waste more quota or create duplicate API spend.

Five Habits That Stretch Usage

1. Clear between unrelated tasks

Use /clear when the next task would make sense in a new terminal. Project memory remains available, but old conversation history stops riding along.

2. Compact long single-task sessions

Use /compact when you are mid-task and still need the important context. It preserves a summary instead of carrying the full transcript.

3. Match model to difficulty

Use Sonnet as the default for most coding. Use Opus for hard planning, architecture decisions, and difficult debugging. Use Haiku for simple lookups or mechanical high-volume work when available.

4. Point to files instead of pasting files

Large pasted files stay in context. Refer to paths and let Claude Code read only what matters. For logs, paste the relevant 20-30 lines instead of the whole output.

5. Ask for a plan before big edits

A plan is cheaper than a wrong multi-file diff. Use Plan Mode before migrations, pricing pages, auth, database, redirects, and MCP permission work.

Cost-Saving Workflow

For a large task, use this pattern:

Start clean with /clear.
Ask for an exploration-only plan.
Use Opus only if the planning problem is hard.
Execute with Sonnet after the plan is clear.
Run build/tests at checkpoints instead of asking Claude to inspect everything repeatedly.
Use /compact only when the same task must continue.
Clear before switching to the next task.

This preserves quality while avoiding the most common usage drain: one long session doing several unrelated jobs.

Billing Boundary Note

Official Claude Code legal materials separate ordinary subscription use from developer/product use. OAuth login is intended for Free, Pro, Max, Team, and Enterprise subscription access, while developers building products or services with Claude capabilities should use API key authentication through Console or a supported cloud provider.

For automation-heavy, headless, SDK, or productized workflows, do not assume subscription limits are the right boundary. Re-check the current legal, pricing, and billing docs before planning usage.