Feature · 0.3.0 · Rate Limits · API · Audio · Embeddings
Burst mode + completions, embeddings, transcription, TTS
April 4, 2026
Burst mode
Allows bursts of up to 2× your RPM limit before throttling kicks in. This is useful when an agent fires parallel tool calls: the natural bunching no longer trips the rate limiter.
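One way to picture burst behavior is a token bucket whose capacity is twice the per-minute limit. This is only an illustrative sketch: the changelog does not say which algorithm the service uses, and the `TokenBucket` class, its `burst_factor` parameter, and the 60 RPM figure are all assumptions made for the example.

```python
class TokenBucket:
    """Toy rate limiter: refills at rpm/60 tokens per second, holds up to rpm * burst_factor."""

    def __init__(self, rpm: int, burst_factor: float = 2.0) -> None:
        self.rate = rpm / 60.0              # tokens refilled per second
        self.capacity = rpm * burst_factor  # assumed burst ceiling (2x the RPM limit)
        self.tokens = self.capacity         # start with a full bucket
        self.last = 0.0                     # timestamp of the previous call, in seconds

    def allow(self, now: float) -> bool:
        """Return True if a request arriving at time `now` is accepted."""
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


# An agent fires 130 requests at the same instant against a 60 RPM limit.
bucket = TokenBucket(rpm=60)
accepted = sum(bucket.allow(now=0.0) for _ in range(130))
print(accepted)  # requests accepted before the bucket runs dry
```

In this model the sustained rate is still the RPM limit; only short bunches of parallel calls are absorbed by the extra capacity.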
New endpoints
- /v1/completions for legacy text completions
- /v1/embeddings for vector embeddings
- /v1/audio/transcriptions and /v1/audio/speech for STT + TTS
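As a rough sketch of how a client might address the new endpoints, the snippet below builds (but does not send) JSON POST requests with the standard library. Only the paths come from this changelog. The host `api.example.com`, the bearer-token header, the model names, and the `prompt` / `input` body fields are assumptions modeled on common OpenAI-style APIs, so check the API reference for the actual request shapes.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # hypothetical host; substitute the real one
API_KEY = "YOUR_API_KEY"              # placeholder credential


def build_request(path: str, payload: dict) -> urllib.request.Request:
    """Build (without sending) a JSON POST request for one of the new endpoints."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Body fields below are assumed, not confirmed by the changelog.
completion_req = build_request(
    "/v1/completions", {"model": "example-model", "prompt": "Hello"}
)
embedding_req = build_request(
    "/v1/embeddings", {"model": "example-embedding-model", "input": "Hello"}
)

print(completion_req.full_url)
print(embedding_req.full_url)
```

Passing either object to `urllib.request.urlopen` would send it. The audio transcription endpoint is left out here because file uploads typically use multipart form data rather than a JSON body.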
Misc
- KiloCode added to the integration page.
- Added a comprehensive Developer Guide covering burst, retry, tools, JSON mode, streaming, and chaining; copyable for agents.
