Show HN: Tokencap – Token budget enforcement across your AI agents

Show HN (score: 5)

Found: April 04, 2026

ID: 4023

Description

Other

Show HN: Tokencap – Token budget enforcement across your AI agents I built this after hitting the same wall repeatedly — no good way to enforce token budgets in application code. Provider caps are account-level and tell you what happened, not what is happening.

Two ways to add it:

  # Direct client wrapper
  client = tokencap.wrap(anthropic.Anthropic(), limit=50_000)

  # LangChain, CrewAI, AutoGen, etc.
  tokencap.patch(limit=50_000)

Four actions at configurable thresholds: WARN, DEGRADE (transparent model swap), BLOCK, and WEBHOOK. SQLite out of the box, Redis for multi-agent setups.

One design decision worth mentioning: tokencap tracks tokens, not dollars. Token counts come directly from the provider response and never drift with pricing changes.

Happy to answer any questions.

More from Show

Show HN: A primer for non-coders to design software via structured AI dialogue

No other tools from this source yet.

Show HN: Tokencap – Token budget enforcement across your AI agents

Description

More from Show

Show HN: A primer for non-coders to design software via structured AI dialogue

DevTools Assistant