Core Concepts

Call Initiation Responsibility

The SDK supports two approaches for call initiation, differing in who is responsible for creating the call:

Client-Side Call Creation
Responsibility: The SDK creates calls directly with Hidoba Research
Process: The SDK uses your API key to create the call and manage its entire lifecycle
Use Case: Development, prototyping, simple applications

Server-Side Call Creation
Responsibility: Your backend creates calls; the SDK connects using the credentials it is given
Process: Your server creates the call and returns a signed URL for the SDK to use
Use Case: Production applications, enhanced security, user management
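
The difference between the two approaches can be sketched as a small configuration type. This is an illustration only: `CallSetup`, `CallTransport`, `resolveConnection`, and every field name below are hypothetical and are not taken from the SDK. The network calls are injected so the sketch assumes no real endpoint.

```typescript
// Hypothetical names throughout: this is not the SDK's actual API.
type CallSetup =
  | { mode: "client"; apiKey: string } // the SDK creates the call itself
  | { mode: "server"; backendEndpoint: string }; // your backend creates the call

interface CallConnection {
  url: string; // URL the audio connection would be opened against
}

// Injected so the sketch assumes no real endpoint or SDK method.
interface CallTransport {
  createWithApiKey(apiKey: string): Promise<CallConnection>;
  fetchSignedUrl(backendEndpoint: string): Promise<CallConnection>;
}

async function resolveConnection(
  setup: CallSetup,
  transport: CallTransport,
): Promise<CallConnection> {
  if (setup.mode === "client") {
    // Client-side: the API key is present in the browser.
    return transport.createWithApiKey(setup.apiKey);
  }
  // Server-side: the key stays on your server; only a signed URL reaches the client.
  return transport.fetchSignedUrl(setup.backendEndpoint);
}
```

The point of the split is where the API key lives: in client mode it ships to the browser, which is convenient for prototyping, while in server mode the browser only ever sees the signed URL.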

Understanding the typical flow of an AI voice call helps you implement proper event handling:

Call Initiation: User clicks call button → onCallStart triggered
Call Creation: Either the SDK creates the call directly (client-side) or your backend creates it (server-side)
Permission Request: Browser requests microphone access automatically
Permission Granted: onCallStarting callback triggered
WebSocket Connection: SDK establishes audio streaming connection
Connected: onConnected callback triggered → conversation can begin
Active Conversation: Real-time audio streaming and optional transcript display
Call End: User hangs up → onHangUp callback triggered
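
The sequence above can be modeled as a small state machine that fires the four callbacks in order. The callback names (`onCallStart`, `onCallStarting`, `onConnected`, `onHangUp`) come from the flow described here; the `CallFlow` class, its method names, and the state names are hypothetical and only illustrate the ordering, not how the SDK is implemented.

```typescript
// Callback names match the flow above; everything else is a hypothetical sketch.
interface CallCallbacks {
  onCallStart?: () => void;
  onCallStarting?: () => void;
  onConnected?: () => void;
  onHangUp?: () => void;
}

type CallState = "idle" | "creating" | "connecting" | "connected" | "ended";

class CallFlow {
  state: CallState = "idle";
  callbacks: CallCallbacks;

  constructor(callbacks: CallCallbacks) {
    this.callbacks = callbacks;
  }

  // User clicks the call button; call creation and the permission request follow.
  userClickedCall(): void {
    this.advance("idle", "creating");
    this.callbacks.onCallStart?.();
  }

  // The browser granted microphone access.
  microphoneGranted(): void {
    this.advance("creating", "connecting");
    this.callbacks.onCallStarting?.();
  }

  // The audio streaming connection is open; the conversation can begin.
  socketConnected(): void {
    this.advance("connecting", "connected");
    this.callbacks.onConnected?.();
  }

  // Hanging up is allowed from any in-progress state and fires at most once.
  hangUp(): void {
    if (this.state === "idle" || this.state === "ended") return;
    this.state = "ended";
    this.callbacks.onHangUp?.();
  }

  // Rejects out-of-order transitions so handler bugs surface early.
  advance(from: CallState, to: CallState): void {
    if (this.state !== from) {
      throw new Error(`Cannot move to "${to}" from "${this.state}"`);
    }
    this.state = to;
  }
}
```

A UI would typically use these callbacks to drive its own state, for example showing a spinner from `onCallStart` until `onConnected` and resetting the button on `onHangUp`.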

The SDK uses an event-driven architecture: callbacks such as onCallStart, onCallStarting, onConnected, and onHangUp signal each change of call state.

Microphone access is handled automatically by the SDK:

Automatic Request: Permission is requested immediately after backend call creation
Early Optimization: The audio stream is acquired while the backend is still being polled, which reduces perceived latency
No Manual Setup: You do not need to request permission separately; the SDK handles it
HTTPS Required: Browsers only allow microphone access in a secure context
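
The early-acquisition idea can be sketched as running the microphone request and the backend polling concurrently rather than one after the other. This is a minimal sketch under stated assumptions: `prepareCall`, `PrepareDeps`, and `pollCallReady` are hypothetical names, and the SDK's real internals may differ. Dependencies are injected so the function also runs outside a browser.

```typescript
// Hypothetical helper: illustrates the overlap, not the SDK's internals.
interface PrepareDeps<Stream> {
  isSecureContext: boolean;
  requestMicrophone: () => Promise<Stream>;
  pollCallReady: () => Promise<string>; // resolves with the connection URL
}

async function prepareCall<Stream>(
  deps: PrepareDeps<Stream>,
): Promise<{ stream: Stream; url: string }> {
  // Browsers refuse microphone access outside a secure context, so fail early.
  if (!deps.isSecureContext) {
    throw new Error("Microphone access requires a secure context (HTTPS)");
  }
  // Start both operations before awaiting either, so they overlap.
  const micPromise = deps.requestMicrophone();
  const readyPromise = deps.pollCallReady();
  const [stream, url] = await Promise.all([micPromise, readyPromise]);
  return { stream, url };
}

// In a browser the dependencies could be wired like this:
//   prepareCall({
//     isSecureContext: window.isSecureContext,
//     requestMicrophone: () => navigator.mediaDevices.getUserMedia({ audio: true }),
//     pollCallReady: () => pollMyBackend(), // hypothetical helper
//   });
```

Because both promises are created before the `await`, the time the user spends answering the permission prompt overlaps with the backend wait instead of adding to it.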

The SDK includes advanced audio processing capabilities: