The Per-Minute Model
You do not receive separate invoices for the LLM tokens used, the seconds of audio synthesized, or the transcription compute. DialNexa handles all provider relationships and presents you with a single per-minute rate for the selected pipeline. What the per-minute rate includes:| Component | Description |
|---|---|
| Telephony (PSTN) | Carrier charges for connecting the call. Outbound calls have higher carrier costs than inbound. |
| Transcription (STT) | Speech-to-text processing for the full call duration |
| LLM inference | Token costs for all turns in the conversation |
| TTS synthesis | Voice generation for all agent turns |
| Platform infrastructure | DialNexa’s orchestration, storage, and monitoring |
- Phone number rental (monthly, per number)
- Overage charges for exceeding plan concurrency limits
- Add-on features such as custom voice cloning (Enterprise)
What Affects Your Rate
Your effective per-minute cost varies based on several factors:LLM model selection
LLM model selection
The LLM model has the largest impact on per-minute cost. More powerful models (GPT-4o) cost significantly more per token than faster, lighter models (GPT-4o Mini, Groq Llama 4, DeepSeek V3). For most voice agent use cases, GPT-4o Mini provides excellent quality at the lowest cost.Cost relative to GPT-4o Mini (approximate):
- GPT-4o Mini: 1x (baseline)
- Groq Llama 4: comparable to GPT-4o Mini (varies)
- DeepSeek V3: low cost, comparable to or below GPT-4o Mini
- GPT-4o: ~4-6x
TTS provider
TTS provider
ElevenLabs and Cartesia have slightly different underlying synthesis costs. The difference is modest - typically less than 10% of the total per-minute rate.
Call direction
Call direction
Outbound calls cost more than inbound calls because of PSTN termination charges (the carrier fee for connecting a call to a destination number). Inbound calls have lower carrier costs since the originating carrier pays the connection fee.
Destination geography
Destination geography
International calls have higher telephony costs than domestic Indian calls. Some international destinations (specific country codes) have premium carrier rates. The per-minute rate for international calls is higher than for domestic.
Plan tier
Plan tier
Higher-tier plans have lower per-minute base rates. Enterprise accounts with committed monthly volume get the lowest rates through custom pricing.
Viewing Exact Pricing
Exact per-minute rates are shown in the dashboard under Settings → Billing → Pricing. The pricing page shows:- Your current per-minute rate by call direction (inbound/outbound)
- Your current per-minute rate by LLM model
- Phone number rental costs
- Any plan-level discounts applied to your account
Pricing shown in the dashboard is the most accurate source - it reflects your specific plan tier and any negotiated rates. The public pricing page shows standard rates and may differ from your actual rate.
How Costs Are Calculated for a Call
DialNexa first rounds the total connected minutes up to the next full minute, then applies the billable components for the agent pipeline:Cost Optimization Tips
Use GPT-4o Mini as your default
GPT-4o Mini is the fastest and most cost-efficient model for voice agents. Only upgrade to GPT-4o or Claude when you have specific quality requirements that GPT-4o Mini cannot meet.
Enable Audio Cache
Pre-generating static phrases with Audio Cache eliminates TTS synthesis costs for those phrases. For agents with common greetings and closings, this can reduce TTS costs by 20-40%.
Keep system prompts concise
Longer system prompts consume more LLM tokens per turn. Trim your prompts to essentials - every turn in the conversation includes the full system prompt in the LLM context.
Set max duration limits
Configure a maximum call duration on your agents. A 15-minute cap prevents runaway calls (e.g., an agent stuck in a loop) from accumulating large charges.
Configure appropriate retry limits
For outbound batch campaigns, setting too many retries increases costs without proportionally increasing connection rates. Start with 2 retries and measure whether additional retries are yielding connections.
Monitor per-agent costs
The usage breakdown in Settings → Billing → Usage shows cost per agent. Identify which agents have high cost-per-call and investigate whether they have unusually long calls or are using expensive models unnecessarily.
Free Trial Credits
New accounts receive free trial credits that can be used for real calls. Trial credits cover a defined number of minutes and expire based on the terms shown at signup. Trial calls are subject to the same per-minute rate structure as paid calls - the credits simply absorb the cost.Estimating Costs Before Running a Campaign
Use this formula to estimate campaign cost before launching:Related Pages
- Billing - wallet, auto-recharge, and invoices
- Concurrency Tiers - plan-level limits
- Latency - model selection for latency vs. cost tradeoffs