Token counting - Claude Docs

Token counting enables you to determine the number of tokens in a message before sending it to Claude, helping you make informed decisions about your prompts and usage. With token counting, you can:

Proactively manage rate limits and costs
Make smart model routing decisions
Optimize prompts to be a specific length
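
As an illustration of the routing use case, the sketch below picks a model from a token count. The threshold and the two model names passed in are hypothetical placeholders, not recommendations from this page; in practice the count would come from the token counting endpoint.

```python
def choose_model(input_tokens: int, small_model: str, large_model: str,
                 threshold: int = 2000) -> str:
    """Route short prompts to small_model and longer ones to large_model.

    The threshold is an assumed, application-specific cutoff.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return small_model if input_tokens <= threshold else large_model


# Hypothetical model names, used only for illustration.
print(choose_model(14, "small-model", "large-model"))    # small-model
print(choose_model(2188, "small-model", "large-model"))  # large-model
```

Whether shorter prompts should go to a smaller model depends on your workload; the point is only that the decision can be made before any message is created.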

How to count message tokens

The token counting endpoint accepts the same structured list of inputs used to create a message, including system prompts, tools, images, and PDFs. The response contains the total number of input tokens.

The token count should be considered an estimate. In some cases, the actual number of input tokens used when creating a message may differ by a small amount.

Token counts may include tokens added automatically by Anthropic for system optimizations. You are not billed for system-added tokens. Billing reflects only your content.
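
Because the count is an estimate, it can help to leave some headroom when checking a prompt against a token budget. The following is a minimal sketch; the 5% default margin and the function name are assumptions chosen for illustration, since this page does not quantify how far the actual count can differ.

```python
import math


def fits_budget(estimated_tokens: int, token_limit: int, margin: float = 0.05) -> bool:
    """Return True if the estimate, padded by a safety margin, fits within the limit.

    margin is an assumed fraction (0.05 means 5% headroom), not a documented value.
    """
    padded = math.ceil(estimated_tokens * (1 + margin))
    return padded <= token_limit


# estimated_tokens would normally be the "input_tokens" value returned by the endpoint.
if fits_budget(estimated_tokens=2188, token_limit=4000):
    print("Prompt fits within the budget, including headroom")
```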

Supported models

The token counting endpoint supports the following models:

Claude Opus 4.1
Claude Opus 4
Claude Sonnet 4.5
Claude Sonnet 4
Claude Sonnet 3.7
Claude Sonnet 3.5 (deprecated)
Claude Haiku 3.5
Claude Haiku 3
Claude Opus 3 (deprecated)

Count tokens in basic messages

import anthropic

client = anthropic.Anthropic()

response = client.messages.count_tokens(
    model="claude-sonnet-4-5",
    system="You are a scientist",
    messages=[{
        "role": "user",
        "content": "Hello, Claude"
    }],
)

print(response.json())

JSON

{ "input_tokens": 14 }
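
If you are not using the SDK, the same count can be requested over HTTP. The sketch below builds, but does not send, a request with Python's standard library, reusing the endpoint URL and headers from the curl examples on this page. The helper name `build_count_tokens_request` is hypothetical, and the API key shown is a placeholder.

```python
import json
import urllib.request

COUNT_TOKENS_URL = "https://api.anthropic.com/v1/messages/count_tokens"


def build_count_tokens_request(api_key, model, messages, system=None):
    """Build (but do not send) a POST request for the token counting endpoint."""
    payload = {"model": model, "messages": messages}
    if system is not None:
        payload["system"] = system
    return urllib.request.Request(
        COUNT_TOKENS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "x-api-key": api_key,
            "anthropic-version": "2023-06-01",
            "content-type": "application/json",
        },
        method="POST",
    )


request = build_count_tokens_request(
    api_key="YOUR_API_KEY",  # placeholder, not a real key
    model="claude-sonnet-4-5",
    system="You are a scientist",
    messages=[{"role": "user", "content": "Hello, Claude"}],
)
print(request.get_method(), request.full_url)
```

To send the request, pass it to `urllib.request.urlopen` and read `input_tokens` from the JSON response body.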

Count tokens in messages with tools

Server tool token counts only apply to the first sampling call.

import anthropic

client = anthropic.Anthropic()

response = client.messages.count_tokens(
    model="claude-sonnet-4-5",
    tools=[
        {
            "name": "get_weather",
            "description": "Get the current weather in a given location",
            "input_schema": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city and state, e.g. San Francisco, CA",
                    }
                },
                "required": ["location"],
            },
        }
    ],
    messages=[{"role": "user", "content": "What's the weather like in San Francisco?"}]
)

print(response.json())

JSON

{ "input_tokens": 403 }

Count tokens in messages with images

#!/bin/sh

IMAGE_URL="https://upload.wikimedia.org/wikipedia/commons/a/a7/Camponotus_flavomarginatus_ant.jpg"
IMAGE_MEDIA_TYPE="image/jpeg"
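# -s hides curl's progress meter; tr removes the line wraps that some base64
# implementations insert, which would otherwise break the JSON string below.
IMAGE_BASE64=$(curl -s "$IMAGE_URL" | base64 | tr -d '\n')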

curl https://api.anthropic.com/v1/messages/count_tokens \
     --header "x-api-key: $ANTHROPIC_API_KEY" \
     --header "anthropic-version: 2023-06-01" \
     --header "content-type: application/json" \
     --data \
'{
    "model": "claude-sonnet-4-5",
    "messages": [
        {"role": "user", "content": [
            {"type": "image", "source": {
                "type": "base64",
                "media_type": "'$IMAGE_MEDIA_TYPE'",
                "data": "'$IMAGE_BASE64'"
            }},
            {"type": "text", "text": "Describe this image"}
        ]}
    ]
}'

JSON

{ "input_tokens": 1551 }
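
The image content block in the request above can also be assembled in Python, which avoids shell quoting and line-wrapping issues. This is a sketch under the assumption that you already have the image bytes in memory; the helper names are hypothetical, and the bytes in the usage lines are a placeholder rather than a real image.

```python
import base64


def image_block(image_bytes: bytes, media_type: str = "image/jpeg") -> dict:
    """Return a base64 image content block in the shape used by the request above."""
    data = base64.b64encode(image_bytes).decode("ascii")  # no line wraps are added
    return {
        "type": "image",
        "source": {"type": "base64", "media_type": media_type, "data": data},
    }


def image_message(image_bytes: bytes, prompt: str, media_type: str = "image/jpeg") -> dict:
    """Return a user message containing the image followed by a text prompt."""
    return {
        "role": "user",
        "content": [
            image_block(image_bytes, media_type),
            {"type": "text", "text": prompt},
        ],
    }


# Placeholder bytes for illustration only; read a real file with open(path, "rb").
message = image_message(b"\xff\xd8\xff", "Describe this image")
print(message["content"][0]["source"]["media_type"])
```

The resulting dictionary can be placed in the `messages` list of a token counting request.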

Count tokens in messages with extended thinking

For more details about how the context window is calculated with extended thinking, see the extended thinking documentation.

Thinking blocks from previous assistant turns are ignored and do not count toward your input tokens
Thinking in the current assistant turn does count toward your input tokens

curl https://api.anthropic.com/v1/messages/count_tokens \
    --header "x-api-key: $ANTHROPIC_API_KEY" \
    --header "content-type: application/json" \
    --header "anthropic-version: 2023-06-01" \
    --data '{
      "model": "claude-sonnet-4-5",
      "thinking": {
        "type": "enabled",
        "budget_tokens": 16000
      },
      "messages": [
        {
          "role": "user",
          "content": "Are there an infinite number of prime numbers such that n mod 4 == 3?"
        },
        {
          "role": "assistant",
          "content": [
            {
              "type": "thinking",
              "thinking": "This is a nice number theory question. Lets think about it step by step...",
              "signature": "EuYBCkQYAiJAgCs1le6/Pol5Z4/JMomVOouGrWdhYNsH3ukzUECbB6iWrSQtsQuRHJID6lWV..."
            },
            {
              "type": "text",
              "text": "Yes, there are infinitely many prime numbers p such that p mod 4 = 3..."
            }
          ]
        },
        {
          "role": "user",
          "content": "Can you write a formal proof?"
        }
      ]
    }'

JSON

{ "input_tokens": 88 }
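
One way to picture the rule about previous-turn thinking is to remove those blocks yourself and look at what remains. The sketch below is only an illustration, not how the endpoint is implemented. It makes a simplifying assumption: an assistant message is treated as a previous turn whenever it is not the last message in the list, and tool-use loops are not handled. The function name is hypothetical.

```python
def drop_previous_thinking(messages):
    """Return a copy of messages without thinking blocks in earlier assistant turns.

    Simplifying assumption: an assistant message is "previous" when it is not
    the last message in the list. Tool-use loops are not handled.
    """
    result = []
    last_index = len(messages) - 1
    for index, message in enumerate(messages):
        content = message["content"]
        is_previous_assistant = message["role"] == "assistant" and index < last_index
        if is_previous_assistant and isinstance(content, list):
            content = [block for block in content if block.get("type") != "thinking"]
        result.append({**message, "content": content})
    return result


conversation = [
    {"role": "user", "content": "Are there an infinite number of prime numbers such that n mod 4 == 3?"},
    {"role": "assistant", "content": [
        {"type": "thinking", "thinking": "This is a nice number theory question...", "signature": "..."},
        {"type": "text", "text": "Yes, there are infinitely many prime numbers p such that p mod 4 = 3..."},
    ]},
    {"role": "user", "content": "Can you write a formal proof?"},
]
print([block["type"] for block in drop_previous_thinking(conversation)[1]["content"]])
```

In the example conversation, only the text block of the earlier assistant turn remains.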

Count tokens in messages with PDFs

Token counting supports PDFs with the same limitations as the Messages API.

curl https://api.anthropic.com/v1/messages/count_tokens \
    --header "x-api-key: $ANTHROPIC_API_KEY" \
    --header "content-type: application/json" \
    --header "anthropic-version: 2023-06-01" \
    --data '{
      "model": "claude-sonnet-4-5",
      "messages": [{
        "role": "user",
        "content": [
          {
            "type": "document",
            "source": {
              "type": "base64",
              "media_type": "application/pdf",
              "data": "'$(base64 < document.pdf | tr -d '\n')'"
            }
          },
          {
            "type": "text",
            "text": "Please summarize this document."
          }
        ]
      }]
    }'

JSON

{ "input_tokens": 2188 }
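
The same request body can be built in Python, which sidesteps differences between `base64` command-line implementations. This is a minimal sketch: the function name is hypothetical, and the bytes in the usage lines are a placeholder rather than a real PDF.

```python
import base64
import json


def pdf_count_payload(pdf_bytes: bytes, prompt: str, model: str = "claude-sonnet-4-5") -> dict:
    """Return a token counting request body with one PDF document and a text prompt."""
    return {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {
                    "type": "document",
                    "source": {
                        "type": "base64",
                        "media_type": "application/pdf",
                        "data": base64.b64encode(pdf_bytes).decode("ascii"),
                    },
                },
                {"type": "text", "text": prompt},
            ],
        }],
    }


# Placeholder bytes for illustration; read a real file with open("document.pdf", "rb").
payload = pdf_count_payload(b"%PDF-1.4 placeholder", "Please summarize this document.")
body = json.dumps(payload)  # JSON string to send as the request body
print(len(body) > 0)
```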

Pricing and rate limits

Token counting is free to use but is subject to requests-per-minute (RPM) rate limits based on your usage tier. If you need higher limits, contact sales through the Claude Console.

Usage tier	Requests per minute (RPM)
1	100
2	2,000
3	4,000
4	8,000

Token counting and message creation have separate and independent rate limits. Usage of one does not count against the limits of the other.
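
To stay under these limits, you can throttle requests on the client side. The sketch below is a simple sliding-window limiter that uses the tier values from the table above. The class name and the 60-second window logic are assumptions for illustration; this page does not describe how the server enforces the limit.

```python
import time
from collections import deque

# Requests per minute by usage tier, copied from the table above.
TIER_RPM = {1: 100, 2: 2000, 3: 4000, 4: 8000}


class RpmLimiter:
    """Sliding-window limiter: allow at most `rpm` calls in any 60-second window."""

    def __init__(self, rpm, clock=time.monotonic):
        self.rpm = rpm
        self.clock = clock
        self.sent = deque()  # timestamps of recently allowed calls

    def try_acquire(self):
        """Return True and record the call if it fits in the window, else False."""
        now = self.clock()
        # Forget calls that have left the 60-second window.
        while self.sent and now - self.sent[0] >= 60:
            self.sent.popleft()
        if len(self.sent) < self.rpm:
            self.sent.append(now)
            return True
        return False


# Separate limiter for token counting; message creation would get its own instance.
count_limiter = RpmLimiter(TIER_RPM[1])
if count_limiter.try_acquire():
    print("OK to send a token counting request")
```

Because the two rate limits are independent, a limiter for token counting should not share state with one for message creation.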

FAQ

Does token counting use prompt caching?

No, token counting provides an estimate without using caching logic. While you may provide cache_control blocks in your token counting request, prompt caching only occurs during actual message creation.
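
As a sketch of what this means in practice, a request body that contains a `cache_control` block can be sent to the token counting endpoint as-is. The example assumes the `{"type": "ephemeral"}` form of `cache_control` described in the prompt caching documentation; the helper name is hypothetical.

```python
def system_with_cache_control(text: str) -> list:
    """Return a system prompt as a single text block marked with cache_control."""
    return [{
        "type": "text",
        "text": text,
        "cache_control": {"type": "ephemeral"},  # assumed form, see prompt caching docs
    }]


payload = {
    "model": "claude-sonnet-4-5",
    "system": system_with_cache_control("You are a scientist"),
    "messages": [{"role": "user", "content": "Hello, Claude"}],
}
print(payload["system"][0]["cache_control"])
```

Including the block does not trigger caching during counting; caching applies only when the message is actually created.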
