Models
Available models on Axerity Chat & API
Free Access
All models are available for free on our Chat interface with daily message limits.
Chat Models & Limits
| Name | Provider | ID | Free | Pro |
|---|---|---|---|---|
| GPT 5.1 | OpenAI | openai/gpt-5.1 | 5/day | 75/day |
| GPT 4.1 | OpenAI | openai/gpt-4.1 | 10/day | 100/day |
| GPT OSS 20B | OpenAI | openai/gpt-oss-20b | 100/day | 1,000/day |
| GPT OSS 120B | OpenAI | openai/gpt-oss-120b | 50/day | 500/day |
| Claude Opus 4.5 | Anthropic | anthropic/claude-opus-4.5 | 3/day | 50/day |
| Claude Sonnet 4.5 | Anthropic | anthropic/claude-sonnet-4.5 | 5/day | 75/day |
| Claude Haiku 4.5 | Anthropic | anthropic/claude-haiku-4.5 | 30/day | 300/day |
| Gemini 3 Pro Preview | google/gemini-3-pro-preview | 5/day | 75/day | |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 10/day | 150/day | |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 30/day | 300/day | |
| Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 100/day | 1,000/day | |
| Grok 4.1 Fast | xAI | x-ai/grok-4.1-fast | 30/day | 300/day |
| Grok 4 Fast | xAI | x-ai/grok-4-fast | 30/day | 300/day |
| Kimi K2 Thinking | Moonshot AI | moonshotai/kimi-k2-thinking | 30/day | 300/day |
| GLM 4.6 | Zhipu AI | z-ai/glm-4.6 | 30/day | 300/day |
| Minimax M2 | MiniMax | minimax/minimax-m2 | 50/day | 500/day |
| Intellect 3 | Prime Intellect | prime-intellect/intellect-3 | 50/day | 500/day |
API Models & Pricing
Free Credits
New users receive 1,000 credits (worth $10) upon signup. API credits are separate from Chat limits.
| Model | ID | Credits/Message |
|---|---|---|
| GPT OSS 20B | openai/gpt-oss-20b | 0.1 |
| GPT OSS 120B | openai/gpt-oss-120b | 0.25 |
| Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 0.15 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 3 |
| Gemini 3 Pro Preview | google/gemini-3-pro-preview | 4 |
| Kimi K2 Thinking | moonshotai/kimi-k2-thinking | 1 |
| GLM 4.6 | z-ai/glm-4.6 | 1 |
| Minimax M2 | minimax/minimax-m2 | 0.5 |
| Intellect 3 | prime-intellect/intellect-3 | 0.5 |
How Credits Are Calculated
Note
Including conversation history increases credit usage. Only include previous messages when context is needed.
Credits are charged per message in your conversation array. This includes both user messages and assistant responses.
Example: A conversation with 4 messages using GPT OSS 20B:
[
{ "role": "user", "content": "How are you?" },
{ "role": "assistant", "content": "I'm well, thank you!" },
{ "role": "user", "content": "What's the weather like?" },
{ "role": "assistant", "content": "I don't have access to weather data." }
]Cost: 4 messages × 0.1 credits = 0.4 credits
Reducing Costs
- Summarize long chats: For conversations over 10 messages, summarize the history and start fresh with the summary as context.
- Use cheaper models for simple tasks: GPT OSS 20B (0.1 credits) handles most tasks well. Save premium models for complex reasoning.
- Cache frequent responses: Store and reuse responses for common queries instead of making repeated API calls.
- Set lower
maxTokens: Limit response length when you only need short answers. - Skip unnecessary history: Only include previous messages when the model actually needs context.
Tools
Chat Only
Tools are only available for Chat users and are not accessible via the API.
| Tool | Description | Free | Pro |
|---|---|---|---|
web_search | Search the web for information | Unlimited | Unlimited |
news_search | Search for news articles | Unlimited | Unlimited |
memories | Store and recall user memories | Unlimited | Unlimited |
image_generation | Generate images with AI | 10/day | 100/day |