67 lines
2.0 KiB
Markdown
67 lines
2.0 KiB
Markdown
# Provider Setup Guide
|
|
|
|
## Free Tier Providers
|
|
|
|
### Groq (Fastest)
|
|
- **URL**: https://console.groq.com
|
|
- **Free Tier**: 20 RPM, variable TPM
|
|
- **Models**: Llama 3.3 70B, Llama 3.1 8B
|
|
- **Best For**: Speed, quick coding tasks
|
|
- **Tip**: Create multiple accounts with different phones for load balancing
|
|
|
|
### Mistral (High Volume)
|
|
- **URL**: https://console.mistral.ai
|
|
- **Free Tier**: 1 billion tokens/month
|
|
- **Models**: Mistral Small, Medium
|
|
- **Best For**: High-volume processing, chatbots
|
|
|
|
### OpenRouter (Universal Access)
|
|
- **URL**: https://openrouter.ai
|
|
- **Free Tier**: 50 requests/day
|
|
- **Access**: Kimi K2:free, Gemini Flash:free
|
|
- **Best For**: Testing, fallback access
|
|
|
|
### Cohere (Embeddings)
|
|
- **URL**: https://cohere.com
|
|
- **Free Tier**: 1,000 calls/month
|
|
- **Best For**: Embeddings, RAG systems
|
|
|
|
## Trial/Cheap Providers
|
|
|
|
### Anthropic Claude (Highest Quality)
|
|
- **URL**: https://console.anthropic.com
|
|
- **Trial**: $5 free credits (new users)
|
|
- **Student**: $500 credits (apply with .edu)
|
|
- **Cost**: $3/M input (Sonnet), $0.25/M (Haiku)
|
|
- **Best For**: Complex reasoning, analysis, code review
|
|
|
|
### Moonshot Kimi (Best Value)
|
|
- **URL**: https://platform.moonshot.ai
|
|
- **Bonus**: $5 signup credit
|
|
- **Cost**: $0.60/M input, $2.50/M output
|
|
- **Context**: 128K tokens
|
|
- **Best For**: Coding, long documents, Chinese content
|
|
|
|
### DeepSeek (Cheapest Reasoning)
|
|
- **URL**: https://platform.deepseek.com
|
|
- **Cost**: $0.14/M input, $0.28/M output
|
|
- **Best For**: Reasoning tasks, math, code
|
|
|
|
## Configuration Priority
|
|
|
|
The system routes requests in this priority:
|
|
|
|
1. **Fast tasks** → Groq (free, instant)
|
|
2. **High volume** → Mistral (1B tokens)
|
|
3. **Complex coding** → Kimi (cheap, 128K context)
|
|
4. **Quality critical** → Claude (expensive but best)
|
|
5. **Fallback** → OpenRouter free tier
|
|
|
|
## Rate Limit Management
|
|
|
|
The router automatically:
|
|
- Tracks RPM/TPM across all providers
|
|
- Distributes load (multiple Groq accounts)
|
|
- Falls back when limits approached
|
|
- Caches responses to reduce API calls
|