Deeplake Answers
How do voice agents (Vapi, Retell, Bland) learn local quirks and customer-specific patterns without retraining?
Voice agents on Vapi, Retell, and Bland hit 80% reliability fast and stall. The remaining 20% is local quirks a receptionist learns by hand. Hivemind workspaces (one per customer) capture call corrections, distill location-specific skills, and inject them into the next call without retraining the model.
Table of contents
How do voice agents (Vapi, Retell, Bland) learn local quirks and customer-specific patterns without retraining?
TL;DR
Voice agents on Vapi, Retell, and Bland get to 80% reliability quickly and stall there. The remaining 20% is local quirks: three families share a phone number, the dentist's patients pronounce the insurance name three different ways, the HVAC dispatcher uses a nickname for a street that isn't on the map. Human receptionists learn these by hand. Deeplake Hivemind captures every correction event (transfer, hangup, customer pushback) and distills location-specific skills the next call reads. No model retraining required.
Overview
The voice-agent vertical is exploding because the unit economics finally work. Vapi, Retell, Bland made it cheap to ship a real-time voice agent. But every operator hits the same wall: 80% of calls go fine, 20% require a human because the agent doesn't know that this clinic's billing line is the same as its scheduling line, or that "Aetna" pronounced like "et-na" is the same payer.
A receptionist learns these in a week. A foundation model doesn't, and retraining one for each customer is absurd. The fix is the same trace-to-skill loop coding agents use: capture the corrections, distill per-customer skills, inject them at call time.
What this requires
| Requirement | Why it matters |
|---|---|
| Call-event capture | Transcript, transfer, hangup, customer-stated correction |
| Workspace per customer | One dentist's quirks shouldn't leak to another's agent |
| Webhook integration with Vapi or Retell | Capture has to happen on the platform you're already on |
| Skill injection at session start | Skills load into the system prompt for the next call |
| Low latency on retrieval | Skill recall has to fit inside a voice turn budget |
What teams try
Per-customer prompt files
Hand-edit the system prompt for each customer. Works for the first three customers. By customer 30, no one knows what's in any prompt.
Vapi or Retell platform memory
Both ship some memory primitives. Useful for short-term context inside a call. Not designed for cross-call skill distillation per customer.
Bland's pathways
Bland's pathways DSL is a strong primitive for conversational structure. Doesn't solve local-quirk learning across calls.
Fine-tuning per customer
Economically absurd. A 200-customer voice operator can't run 200 fine-tunes per model release.
Mem0 for caller memory
Mem0 stores per-caller memory. Useful for "this caller's last appointment". Doesn't aggregate corrections across all calls into per-customer skills.
How Hivemind fits
One Hivemind workspace per customer. The assistant powering the voice agent runs through Hivemind, so every call, transfer, hangup, and human-correction event is captured automatically. A background worker mines the sessions and writes per-customer SKILL.md files that load into the system prompt for the next call.
1. Install once
npm install -g @deeplake/hivemind && hivemind installWire the assistants behind your voice stack:
hivemind claude install
hivemind cursor install
hivemind codex install
hivemind hermes installHeadless install for the worker that orchestrates Vapi or Retell calls:
HIVEMIND_TOKEN=<your-token> hivemind installConfirm:
hivemind status2. Scope per customer
export HIVEMIND_WORKSPACE_ID=acme-dental-clinicOne workspace per customer keeps the dentist's quirks out of the HVAC operator's calls. Workspaces aren't created via CLI; HIVEMIND_WORKSPACE_ID is how you route capture.
3. Call events are captured automatically
Transcript turns, transfer reasons, hangup signals, and operator corrections land in the sessions SQL table the moment the orchestrating agent runs. No trace store to call.
4. Skills emerge from a background worker
On Stop / SessionEnd the worker mines recent calls, decides what is worth keeping, and writes SKILL.md to <project>/.claude/skills/<name>/. Skills propagate to every Hivemind-connected agent in the workspace and load into the system prompt for the next call.
hivemind skillify5. Search is a natural-language ask inside the agent
"How do callers say Aetna in this market?" or "What transfer reasons came up most this week?" For a sensitive caller, run that session with HIVEMIND_CAPTURE=false.
What you get
- The agent recognizes the local pronunciation of insurance names by call 5, not call 500
- Per-customer skill libraries grow without prompt-file sprawl
- Transfer rate drops in the long tail, not just the average
- Skill library survives model upgrades and platform migrations
- Operator scales from 10 to 1,000 customers without 1,000 prompt files
FAQ
Does this work with Vapi? Yes. Vapi's end-of-call webhook is the capture point. Skill injection happens in the system-prompt builder.
Does this work with Retell? Same pattern. Retell's call-event webhook feeds Hivemind, skills land in the assistant config.
Does this work with Bland? Yes. Bland's pathways DSL plus Hivemind skills compose cleanly. Skills enrich pathway nodes.
Can a skill be shared across customers if it's generic? Yes. Hivemind supports skill promotion from a per-customer workspace to a shared workspace.
Does skill retrieval slow down the voice turn? Skills load into the session prompt at start, not per-turn. No turn-latency impact.
Citations
- Vapi. Voice AI infrastructure
- Retell. Voice agent platform
- Bland. Voice AI
- Deeplake Hivemind: shared memory for AI agents
From 80% to 95% on local quirks, without a fine-tune.
Related
- Vertical agent stack that learns from corrections(Stack · Vertical)
- User correction applies next time(Corrections · Loop)
- What is the agent improvement loop(Improvement · Loop)
- Self-improving coding agents(Coding · Self-improving)