Overview
Reasoning models provide step-by-step thinking processes, making them ideal for complex problem-solving, math, coding, and analysis tasks. RedPill supports all major reasoning models with TEE protection.Reasoning tokens show the model’s “thinking process” before generating the final answer, leading to more accurate and well-reasoned responses.
Supported Reasoning Models
OpenAI O-Series
Model | Reasoning Capability | Best For |
---|---|---|
openai/o1 | Very High | Complex problem-solving |
openai/o1-mini | High | Faster reasoning tasks |
openai/o3-mini | Very High | Latest reasoning model |
Anthropic Claude
Model | Reasoning Capability | Best For |
---|---|---|
anthropic/claude-sonnet-4 | Very High | Analysis, research |
anthropic/claude-4.1 | Highest | Complex reasoning |
anthropic/claude-3.7-sonnet | High | Balanced performance |
Google Gemini
Model | Reasoning Capability | Best For |
---|---|---|
google/gemini-2.0-flash-thinking | High | Fast thinking |
Other Thinking Models
Model | Reasoning Capability |
---|---|
qwen/qwq-32b-preview | High |
alibaba/qwen-plus-latest | Medium-High |
Basic Usage
Simple Reasoning Request
Response with Reasoning
Controlling Reasoning Effort
Some models support controlling how much they “think”:Effort Levels
Level | Description | Use Case |
---|---|---|
low | Quick thinking | Simple problems |
medium | Balanced | Most tasks |
high | Deep reasoning | Complex problems |
Limiting Reasoning Tokens
Control cost by limiting reasoning tokens:Excluding Reasoning from Response
Get only the final answer, not the thinking process:Use Case: Math & Science
Use Case: Code Debugging
Use Case: Logic Puzzles
Use Case: Research & Analysis
Use Case: Strategic Planning
Chain of Thought Prompting
Enhance reasoning with explicit prompting:Multi-Step Reasoning
For complex multi-step problems:Injecting Reasoning
Use one model’s reasoning to improve another:Cost Considerations
Reasoning tokens are charged as output tokens:Model | Input Price | Reasoning Price | Output Price |
---|---|---|---|
openai/o1 | $15/1M | $60/1M | $60/1M |
anthropic/claude-4.1 | $3/1M | $15/1M | $15/1M |
google/gemini-2.0-flash-thinking | Free | Free | Free |
- Use
max_tokens
to limit reasoning - Set
effort: "low"
for simple tasks - Use cheaper models for initial exploration
- Cache results for repeated questions
Best Practices
When to Use Reasoning Models
When to Use Reasoning Models
Best for:
- Complex math and logic problems
- Strategic decision-making
- Code debugging and optimization
- Research and analysis
- Multi-step problem solving
- Simple Q&A
- Creative writing
- Basic information retrieval
- Casual conversation
Prompt Engineering
Prompt Engineering
- Be specific about what you want
- Ask for step-by-step solutions
- Provide all necessary context
- Use “Let’s think step by step” prompts
- Request verification of answers
Model Selection
Model Selection
- o1: Best for math and science
- Claude 4: Best for analysis and strategy
- o3-mini: Good balance of speed and reasoning
- DeepSeek: Best for code-related reasoning
Cost Optimization
Cost Optimization
- Start with lower effort levels
- Increase only if needed
- Use cheaper models for testing
- Cache expensive results
- Monitor token usage
Comparison with Regular Models
Feature | Regular Models | Reasoning Models |
---|---|---|
Speed | Fast | Slower |
Cost | Lower | Higher |
Accuracy | Good | Better |
Explainability | Limited | Detailed |
Best For | General tasks | Complex problems |