openai-chat-tokens

A TypeScript / JavaScript library for estimating the number of tokens an OpenAI chat completion request will use.

Estimating token usage for chat completions isn't quite as easy as it sounds.

For regular chat messages, you need to consider how the messages are formatted by OpenAI when they're provided to the model, as they don't simply dump the JSON messages they receive via the API into the model.

For function calling, things are even more complex, as the OpenAPI-style function definitions get rewritten into TypeScript type definitions.

This library handles both of those cases, as well as a minor adjustment needed for handling the results of function calling. tiktoken is used to do the tokenization.

Usage

import { promptTokensEstimate } from "openai-chat-tokens";

const estimate = promptTokensEstimate({
  messages: [
    { role: "system", content: "These aren't the droids you're looking for" },
    { role: "user", content: "You can go about your business. Move along." },
  ],
  functions: [
    {
      name: "activate_hyperdrive",
      description: "Activate the hyperdrive",
      parameters: {
        type: "object",
        properties: {
          destination: { type: "string" },
        },
      },
    },
  ],
});

Development and testing

Built in TypeScript, tested with Jest.

$ npm install
$ npm test

When adding new test cases or debugging token count mismatches, it can be helpful to validate the estimated tokens in the tests against the live OpenAI API. To do this:

Set up the OPENAI_API_KEY environment variable with a live API key
Add validate: true to one of the test examples, or set validateAll to true in token-counts.test.ts, then run the tests

References

"Counting tokens for chat completions API calls" in OpenAI's "How to count tokens with tiktoken" notebook
A post about counting function call tokens on the OpenAI forum.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github/workflows		.github/workflows
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

openai-chat-tokens

Usage

Development and testing

References

About

Releases 10

Contributors 7

Languages

License

hmarr/openai-chat-tokens

Folders and files

Latest commit

History

Repository files navigation

openai-chat-tokens

Usage

Development and testing

References

About

Resources

License

Stars

Watchers

Forks

Releases 10

Contributors 7

Languages