Back to agent index
LiveKit Agents

LiveKit Agents

Agent framework by LiveKit Agents

Empower your AI innovations with LiveKit Agents, the open-source framework that seamlessly integrates real-time multimodal capabilities, enabling developers to create intelligent agents that communicate via voice, video, and text.

docs.livekit.io/agents

LiveKit Agents is an open-source, end-to-end framework designed to facilitate the development of real-time, multimodal AI applications. This framework empowers developers to create programmable, intelligent AI agents that can interact with users through various modalities, including voice, video, and text. LiveKit Agents seamlessly integrates with large language models (LLMs) and other AI models, making it an ideal solution for a wide range of applications.

Features

The LiveKit Agents framework encompasses a variety of features that enhance its usability and functionality in developing multimodal applications. Below is an overview of the specific features available within the LiveKit Agents framework:

FeatureDescription
Multimodal InteractionSupports voice, video, and text exchanges for comprehensive interactions.
Stateful ProcessesOperates as long-running processes for intuitive user interaction management.
Low-Latency Media TransportUtilizes WebRTC for real-time media transport with minimal delay.
Centralized Business LogicMaintains business logic within the agent for consistent service across platforms.
Extensive Plugin EcosystemIncludes plugins for STT, TTS, and VAD, with options for custom integrations.
Agent Lifecycle ManagementEfficiently manages agent lifecycle and can handle multiple instances.
Orchestration and ScalingFacilitates load balancing and scaling by adding more servers.
Open Source and Edge OptimizationApache 2.0 licensed, with optimized performance over edge servers.

Use Cases

LiveKit Agents can be utilized in various scenarios, showcasing their versatility in real-time interactions:

  • AI Voice Assistants: Engage users in natural voice conversations for personalized assistance.
  • Call Centers: Automate customer service processes by handling incoming and outbound calls.
  • Transcription Services: Provide real-time voice-to-text transcription for meetings, lectures, or interviews.
  • Object Detection/Recognition: Identify objects in real-time video streams for surveillance and monitoring.
  • AI-Driven Avatars: Create interactive avatars using prompts for personalized user experiences.
  • Translation Services: Facilitate real-time communication across languages with instant translation.
  • Video Manipulation: Apply real-time video filters and transforms for enhanced video conferencing.

How to get started

Getting started with LiveKit Agents involves several steps to develop and deploy your AI applications:

  1. Agent Code Development: Write a Python or Node.js application defining the entrypoint function for connections and utilize included or custom plugins.
  2. Frontend Integration: Create a frontend using LiveKit’s SDKs for WebRTC transport and media device management, utilizing the Agents Playground for testing.
  3. Environment Setup: Configure environment variables such as LIVEKIT_URL, LIVEKIT_API_KEY, and LIVEKIT_API_SECRET, along with any necessary API keys for integrations.
  4. Running the Agent: Start the worker using commands like node my_agent.js start or node my_agent.ts connect --room <my-room> to join an active room.

</section>
<section>

LiveKit Agents Pricing

The pricing for LiveKit Agents is structured across several plans:

  • Build Plan: $0/month (free)
  • Ship Plan: $50/month
  • Scale Plan: $500/month
  • Enterprise Plan: Custom pricing, contact sales

Additional Fees

LiveKit also has a connection fee starting at $0.0005/minute, which decreases with volume, and a lower bandwidth fee starting at $0.12 per GB, which also decreases with volume. There is no charge for upstream bandwidth.