Time estimate: ~5 minutes
What is BYOK?
Most chatbot platforms bundle AI costs into their pricing and charge you per message or per conversation. ReplyBubble works differently:- You create an account with an AI provider (OpenAI or Anthropic)
- You generate an API key and paste it into ReplyBubble
- When a visitor asks a question, ReplyBubble sends it to your AI provider and returns the answer
- The AI provider bills you directly at their standard rates
Setting up OpenAI (recommended)
OpenAI is the most popular choice. Their GPT-4o-mini model offers a great balance of quality and cost for most use cases.Get your API key
- Go to platform.openai.com/api-keys
- Sign in or create an account
- Click Create new secret key
- Name it “ReplyBubble” (helps you identify it later)
- Click Create secret key
- Copy the key immediately — it starts with
sk-proj-and won’t be shown again
Important: Add billing to your OpenAI account at platform.openai.com/account/billing. Without billing, API calls will fail.
Connect to ReplyBubble
- In your ReplyBubble dashboard, go to Settings > AI Provider
- Select OpenAI
- Paste your API key
- Click Save
Setting up Anthropic
Anthropic makes the Claude family of models, which are known for careful, nuanced responses.Get your API key
- Go to console.anthropic.com
- Sign in or create an account
- Go to API Keys
- Click Create Key
- Name it “ReplyBubble”
- Copy the key — it starts with
sk-ant-
Connect to ReplyBubble
- In your ReplyBubble dashboard, go to Settings > AI Provider
- Select Anthropic
- Paste your API key
- Click Save
Choosing a model
After connecting your provider, select which AI model to use. Here’s a simple comparison:OpenAI models
| Model | Best for | Speed | Quality | Relative cost |
|---|---|---|---|---|
| GPT-4o-mini | Most use cases | Fast | Great | Low |
| GPT-4o | Complex topics, nuanced answers | Medium | Excellent | Higher |
Anthropic models
| Model | Best for | Speed | Quality | Relative cost |
|---|---|---|---|---|
| Claude Sonnet (e.g., Claude 3.5 Sonnet) | General use, thoughtful responses | Medium | Excellent | Medium |
| Claude Haiku (e.g., Claude 3.5 Haiku) | Speed-sensitive, simple Q&A | Fast | Good | Low |
Note: Anthropic updates model versions regularly. Your ReplyBubble dashboard shows the specific models available — pick from there.Recommendation: Start with GPT-4o-mini (OpenAI) or Claude Haiku (Anthropic). They handle most support and sales conversations well at the lowest cost. Upgrade to a more capable model if you find answers need more depth.
Understanding AI costs
AI providers charge per token (roughly 1 token = 0.75 words). A typical chatbot conversation uses 500–2,000 tokens.Realistic cost estimates
| Monthly conversations | Model | Estimated AI cost |
|---|---|---|
| 100 | GPT-4o-mini | $0.50–2 |
| 500 | GPT-4o-mini | $3–8 |
| 1,000 | GPT-4o-mini | $5–15 |
| 500 | GPT-4o | $10–25 |
How ReplyBubble keeps costs low
ReplyBubble automatically reduces your AI spend in several ways:- Response caching — repeat questions get instant answers without calling the AI
- Spam filtering — spam and unserious messages are blocked before using any AI credits
- Efficient knowledge base search — only relevant content is sent to the AI, not your entire knowledge base
- Scripted responses — common questions can use pre-written answers at zero AI cost (see Customization)
- Daily spending caps — set a maximum daily spend in Dashboard > Settings > AI Provider to prevent surprise bills
Switching providers
You can switch between OpenAI and Anthropic at any time:- Go to Settings > AI Provider
- Select the new provider
- Paste the new API key
- Click Save
Troubleshooting
”API key invalid” error
- Make sure you copied the full key (no extra spaces)
- Verify billing is set up on your AI provider account
- Check that the key hasn’t been revoked in your provider’s dashboard
Bot not responding
- Confirm your API key is active and has billing enabled
- Check your AI provider’s status page for outages
- Verify you haven’t hit your provider’s rate limits or spending caps
Responses are slow
- Try switching to a faster model (GPT-4o-mini or Claude Haiku)
- Check your AI provider’s status page — response times can vary during peak usage

