Confidential AI Models

RedPill exclusively uses Confidential AI models: models that run entirely inside GPU Trusted Execution Environments (TEEs). Your prompts and responses never leave the secure hardware enclave.
Unlike other AI platforms, we don’t route requests to OpenAI, Anthropic, or other cloud providers. Every model runs on verified TEE infrastructure.

Available Model Providers

RedPill sources Confidential AI models from four verified TEE providers:

Chutes

New GLM, Kimi, Qwen, MiniMax, MiMo, and DeepSeek models

Near AI

GLM-5, DeepSeek V3.1, GPT-OSS, and Qwen

Phala Network

Qwen, Gemma, GPT-OSS, GLM, and embeddings

Tinfoil

Qwen Coder, Kimi Thinking, DeepSeek R1, and Llama

Model Catalog

Chutes Models

| Model | Parameters | Best For |
| --- | --- | --- |
| GLM 5.1 | Large | Systems engineering, agent workflows |
| Kimi K2.6 | Large MoE | Visual coding, multimodal work |
| Qwen3.5 397B | 397B MoE | High-quality reasoning |
| Qwen3 Coder Next | Large | Code generation, review |
| MiniMax M2.5 | Large | General purpose |
| MiMo V2 Flash | Large | Fast responses |
| DeepSeek V3.2 | 685B MoE | Latest DeepSeek reasoning |
| Kimi K2.5 | Large MoE | Visual coding |

Near AI Models

| Model | Parameters | Best For |
| --- | --- | --- |
| GLM-5 | Large | Systems engineering |
| DeepSeek V3.1 | 671B MoE | Hybrid reasoning |
| GPT-OSS 120B | 117B MoE | OpenAI-style open-weight reasoning |
| Qwen3 30B | 30B MoE | Balanced performance |
| GLM-4.7 | 130B | Bilingual CN/EN tasks |

Phala Network Models

| Model | Parameters | Best For |
| --- | --- | --- |
| Qwen3.5 27B | 27B | General purpose |
| Qwen3 VL 30B | 30B MoE | Vision + language |
| Qwen3 Embedding 8B | 8B | Confidential embeddings |
| Gemma 3 27B | 27B | Multilingual multimodal work |
| GLM 4.7 Flash | ~30B | Fast agentic coding |
| GPT-OSS 20B | 21B MoE | Efficient OpenAI-style reasoning |
| Qwen 2.5 7B | 7B | Budget-friendly chat |

Tinfoil Models

| Model | Parameters | Best For |
| --- | --- | --- |
| Qwen3 Coder 480B | 480B MoE | Large-scale coding |
| Kimi K2 Thinking | 1T MoE | Agentic reasoning |
| DeepSeek R1 | 685B MoE | Advanced reasoning |
| Llama 3.3 70B | 70B | General purpose chat |

Choosing the Right Model

Recommended: GLM 5.1, GLM-5, or Llama 3.3 70B. These models handle everyday questions, brainstorming, and general assistance well.
Recommended: GLM 5.1, Qwen3.5 397B, DeepSeek R1, or DeepSeek V3.2. Best for multi-step problems, analysis, and tasks requiring deep thinking.
Recommended: Qwen3 Coder Next, Qwen3 Coder 480B, or Kimi K2.6. Optimized for code generation, debugging, and technical documentation.
Recommended: Qwen3.5, GLM-5, or GLM-4.7. Strong performance in non-English languages, especially Chinese.
Recommended: MiMo V2 Flash, Qwen 2.5 7B, or GPT-OSS 20B. Smaller models that respond quickly for simple queries.
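
The recommendations above can be captured as a small lookup table, for example when scripting model selection. This is a minimal sketch in Python: the task keys (`general`, `reasoning`, `coding`, `multilingual`, `fast`) and the `recommend` helper are hypothetical labels introduced here for illustration, and the strings are the display names used on this page, not confirmed API model IDs.

```python
# Recommendations copied from the list above.
# The task keys are illustrative labels, not official RedPill categories.
RECOMMENDED = {
    "general": ["GLM 5.1", "GLM-5", "Llama 3.3 70B"],
    "reasoning": ["GLM 5.1", "Qwen3.5 397B", "DeepSeek R1", "DeepSeek V3.2"],
    "coding": ["Qwen3 Coder Next", "Qwen3 Coder 480B", "Kimi K2.6"],
    "multilingual": ["Qwen3.5", "GLM-5", "GLM-4.7"],
    "fast": ["MiMo V2 Flash", "Qwen 2.5 7B", "GPT-OSS 20B"],
}


def recommend(task: str) -> list[str]:
    """Return a copy of the recommended models for a task key."""
    if task not in RECOMMENDED:
        raise ValueError(
            f"unknown task {task!r}; expected one of {sorted(RECOMMENDED)}"
        )
    return list(RECOMMENDED[task])
```

Calling `recommend("coding")` returns the three coding models in the order listed above; an unrecognized key raises `ValueError` rather than silently falling back to a default.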

Model Usage by Plan

Feature | Free | Pro | Enterprise
Basic models (7B-27B)
Large models (70B+): Limited
Massive models (480B+)
Model switching
Priority access
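
The size bands named in the table (7B-27B, 70B+, 480B+) can be expressed as a simple classifier. The sketch below rests on two assumptions: the `plan_tier` function and its return labels are hypothetical, and the 480B+ band is taken to override the 70B+ band where they overlap. Sizes the table does not cover, such as the 30B models, return `None`. Which plans unlock each band is not encoded here.

```python
from typing import Optional


def plan_tier(params_billions: float) -> Optional[str]:
    """Map a parameter count (in billions) to the size band from the table."""
    if params_billions >= 480:
        return "massive"  # "Massive models (480B+)"
    if params_billions >= 70:
        return "large"    # "Large models (70B+)"
    if 7 <= params_billions <= 27:
        return "basic"    # "Basic models (7B-27B)"
    return None           # band not defined by the table (e.g. 30B)
```

Under these assumptions, Llama 3.3 70B falls in the large band and Qwen3 Coder 480B in the massive band.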

Why Confidential AI Only?

RedPill is designed for true privacy. Here’s why we only use Confidential AI models:
  1. No third-party exposure - Your data doesn’t go to OpenAI, Anthropic, or Google
  2. Hardware isolation - The TEE ensures that even the hosting provider can’t see your data
  3. Verifiable execution - Cryptographic attestation proves the model runs in a genuine TEE
  4. Consistent privacy - Every model meets the same security standard
Want access to 50+ models including OpenAI GPT and Claude? Use the RedPill API for development - it offers broader model selection with TEE-protected routing.

Attestation & Verification

Every Confidential AI model comes with cryptographic attestation proving it runs in genuine TEE hardware.

Learn about verification

Verify model execution yourself →