Changelog

What's New in TokenTra

Stay up to date with the latest improvements, new features, and integrations. We ship fast and often.

January 1, 2026New Feature

GPT-5 & Claude Opus 4.5 Support

Full support for the latest flagship models from OpenAI and Anthropic, including GPT-5, GPT-5.1, GPT-5 Pro, and Claude Opus 4.5.

  • Complete cost tracking for GPT-5 family (GPT-5, GPT-5.1, GPT-5 Pro)
  • Claude Opus 4.5 and Claude Sonnet 4 integration
  • Updated pricing models for new tiers
  • Enhanced token counting for extended context windows
  • Automatic model version detection

Results: Stay ahead with day-one support for the latest AI models.

December 28, 2025Integration

Gemini 3 Integration

Full support for Google's Gemini 3 family including Pro, Flash, and Ultra variants.

  • Gemini 3 Pro, Flash, and Ultra model tracking
  • Updated Vertex AI cost calculations
  • Multimodal usage tracking for Gemini 3
  • Enhanced BigQuery export integration
December 15, 2025New Feature

Advanced Cost Forecasting v2

Our most accurate forecasting engine yet, powered by transformer-based time series models.

  • 95% accuracy on 30-day forecasts
  • Automatic seasonality and trend detection
  • Per-team and per-project forecasts
  • Confidence intervals with Monte Carlo simulation
  • Slack and email forecast digests

Results: Customers report 40% fewer budget surprises with our new forecasting.

December 1, 2025New Feature

o3 Reasoning Model Support

Full support for OpenAI's o3 and o3-mini advanced reasoning models.

  • o3 and o3-mini cost tracking
  • Reasoning token accounting
  • Extended thinking time cost attribution
  • Comparison analytics vs standard models
November 20, 2025New Feature

Enterprise SSO & SCIM

Enterprise-grade identity management for large organizations.

  • SAML 2.0 and OIDC single sign-on
  • SCIM 2.0 user provisioning
  • Okta, Azure AD, and Google Workspace integration
  • Just-in-time user provisioning
  • Group-based role mapping
November 10, 2025New Feature

Smart Model Routing GA

Smart Model Routing is now generally available after successful beta with 500+ teams.

  • Automatic complexity detection for incoming requests
  • Configurable routing rules based on cost/quality tradeoffs
  • A/B testing framework to validate quality isn't degraded
  • Fallback routing when primary providers are down
  • Real-time routing analytics dashboard

Results: Teams using Smart Routing see 30-50% savings on routine queries.

October 25, 2025Integration

AWS Bedrock Integration v2

Enhanced AWS Bedrock support with Llama 3 and Mistral Large 2.

  • Llama 3.2 and Llama 3.1 model support
  • Mistral Large 2 integration
  • Amazon Titan v2 tracking
  • Cross-region cost aggregation
  • Reserved capacity cost tracking
October 10, 2025New Feature

Semantic Caching GA

Semantic caching is now generally available with 99.9% uptime SLA.

  • Semantic similarity matching (not just exact match)
  • Configurable TTL per query type
  • Cache hit rate analytics
  • Zero latency for cached responses
  • Automatic invalidation options

Results: Customers see 15-25% cost reduction on FAQ-style and repetitive queries.

September 15, 2025SDK

Python SDK v2.0

Major update to the Python SDK with async-first architecture.

  • Native async/await throughout
  • GPT-5 and Claude Opus 4.5 support
  • LangChain and LlamaIndex integrations
  • Automatic retry with exponential backoff
  • Structured logging with OpenTelemetry
August 20, 2025New Feature

Budget Forecasting

Know when you'll exceed your budget before it happens.

  • Holt-Winters forecasting with seasonality awareness
  • "You'll exceed budget by Tuesday" predictive alerts
  • Confidence intervals on projections
  • Weekly and monthly forecast reports
July 15, 2025New Feature

Anomaly Detection Engine

ML-powered anomaly detection catches unusual spending patterns automatically.

  • Spending spikes (Z-score > 3)
  • Unusual model usage patterns
  • Off-hours activity detection
  • New cost centers appearing
  • Rapid growth detection

Results: We analyze your historical patterns and alert you when something looks unusual—before it becomes a $10K surprise.

June 1, 2025Integration

Google Vertex AI Integration

Full support for Google Cloud's Vertex AI platform.

  • Gemini 2 Pro & Gemini 2 Flash
  • PaLM 2 (legacy)
  • Imagen 2
  • Embeddings
May 15, 2025New Feature

Team Workspaces

Organize your AI spending by team with dedicated workspaces.

  • Separate dashboards per team
  • Team-specific budgets and alerts
  • Role-based access control
  • Cross-team comparison views
April 1, 2025Integration

Slack Integration

Get alerts where your team already works.

  • Real-time alert notifications
  • Weekly digest summaries
  • Slash commands for quick lookups
  • Interactive budget approvals
March 1, 2025SDK

Node.js SDK v1.0

The official TokenTra Node.js SDK is now generally available.

  • Zero-latency wrapper for OpenAI, Anthropic, Google, Azure
  • Automatic token counting and cost calculation
  • Custom attribution tags
  • TypeScript support
  • Batch telemetry (non-blocking)
February 1, 2025Integration

Azure OpenAI Integration

Connect your Azure OpenAI deployments to TokenTra.

  • Service principal authentication
  • Cost data via Azure Cost Management API
  • Usage metrics via Azure Monitor
  • Per-deployment breakdown
January 15, 2025New Feature

Cost Attribution

Finally know who's spending what.

  • Attribution by Team, Project, Feature, User, and Custom tags
  • Chargeback reports for finance
  • Per-user unit economics
  • Feature cost analysis
January 1, 2025Launch

TokenTra Launch 🚀

We're live! TokenTra launches with support for OpenAI and Anthropic.

  • Unified cost dashboard
  • Real-time sync (5-minute refresh)
  • Historical trends and comparison
  • Budget limits and alerts
  • Email notifications
  • CSV export

Subscribe to Updates

Get notified when we ship new features. No spam, just product updates.

Or follow us on Twitter/X for the latest updates.