Skip to main content

Roadmap

Kith's development is planned in three phases. v0.1 is shipped and published on npm.

v0.1 — Shipped

The foundation. Everything needed to ship a voice companion with natural-sounding TTS.

  • @kithjs/core — 5 adapter contracts, 12-event normalized bus, policy hooks
  • @kithjs/runtime-pipecat — Python Pipecat sidecar with auto-respawn, barge-in, full event emission
  • @kithjs/runtime-livekit — Mock-mode portability stub (proves adapter contract works across runtimes)
  • @kithjs/voice-router — Sentence-aware chunking, 4 default slang dicts, pronunciation overrides, emoji-to-emotion, VoiceCharacter profiles, ElevenLabs v3 laugh tags
  • @kithjs/observability — Span tracing, duplicate-send guards, reconnect metrics
  • @kithjs/avatar-events — Placeholder (events live in core; renderer utilities in v0.2)
  • Reference demo — Working web app with procedural avatar

v0.2 — Next

Production-grade voice quality and real avatar integration.

FeatureDescription
LiveKit production adapterFull WebRTC integration replacing mock mode
Multi-provider TTSCartesia + OpenAI TTS with automatic fallback
VRM avatar utilitiesThree.js VRM loading, phoneme-to-viseme mapping, blend shapes
Mic input / STTFull duplex — user speaks, Kith transcribes, agent responds
React Native supportLiveKit RN SDK path for mobile companions

v0.3+ — Planned

Scale and platform expansion.

FeatureDescription
TelephonySIP/PSTN via LiveKit for phone-based companions
Multi-agent orchestrationMultiple companions in the same voice session
Pluggable memory backendsFirst-party adapters for common vector stores
Streaming LLM integrationvoice.streamText(tokenIterator) for real-time LLM output
Voice cloning pipelineCharacter voice training workflow

Non-Goals

These are intentionally out of scope for Kith at any version:

  • Replacing your agent runtime. Kith sits beside your agent (LangGraph, custom orchestrator, etc.), not inside it.
  • Reimplementing memory/RAG. The MemoryAdapter is pass-through. You own your storage.
  • Owning the avatar renderer. Kith emits events; your app renders however it wants (Three.js, Unity, 2D canvas, whatever).

Contributing

v0.1 API surface is stable. Issues and PRs welcome, especially for:

  • Provider adapters (Cartesia, OpenAI TTS, Deepgram)
  • LiveKit production adapter
  • VRM avatar utilities
  • Bug reports from real integrations