All changelog entries
Feature0.3.0Rate LimitsAPIAudioEmbeddings

Burst mode + completions, embeddings, transcription, TTS

April 4, 2026

Burst mode

Allow 2× burst above your RPM limit before throttling kicks in. Useful when an agent fires parallel tool calls — the natural bunching no longer trips the rate limiter.

New endpoints

  • /v1/completions for legacy text completions
  • /v1/embeddings for vector embeddings
  • /v1/audio/transcriptions and /v1/audio/speech for STT + TTS

Misc

  • KiloCode added to the integration page.
  • A comprehensive Developer Guide covering burst, retry, tools, JSON mode, streaming, and chaining — copyable for agents.