Platform pricing. Compute on your terms.
Forge charges for the platform — routing, memory, security, orchestration. Compute is always separate: bring your own keys, use free providers, or buy Forge Credits.
What Forge charges for
- Routing intelligence
- Memory and context
- Security pipeline
- Agent orchestration
- Team features & observability
Fixed monthly cost. Near-zero marginal cost to deliver.
Always separate, always yours
- Bring your own API keys
- Connect free tier providers
- Buy Forge Credits (FCC)
Variable. Scales with usage. Forge's optimization stack reduces it by up to 90%.
Free
Build and explore at zero cost. Platform features included.
Compute: BYOK · Free tiers · FCC credits
- 2 agents
- Basic semantic cache
- Standard routing (single provider)
- Basic security (layers 1-3)
- 1 seat
- Community support
- 3 Forge Connect accounts
Includes semantic cache + smart routing
Pro
The full platform for builders and small teams.
Compute: BYOK · Free tiers · FCC credits
- 15 agents
- Advanced 3-layer memory
- Intelligent multi-provider routing
- Full 7-layer security pipeline
- All 124+ MCP tools
- 30-day observability
- Priority routing queue
- 5 seats
- Email support (48h)
All 6 savings layers active. Pays for itself at ~$200/mo AI spend.
Ultimate
Every product. Maximum capability. For serious teams.
Compute: BYOK · Free tiers · FCC credits
- Unlimited agents
- Full FMF memory (1yr retention)
- All routing modes + Evolver
- ForgeGuard + Extended + compliance
- Forge Foundry + Forge Mesh
- Forge Swarm agent spawning
- 1-year observability + alerting
- 10 seats
- Priority support (4h)
Maximum compression + priority routing. Built for high-volume workloads.
Enterprise
Dedicated infrastructure. Custom SLAs. Full compliance.
Compute: Custom FCC rates · BYOK · Managed key pools
- Everything in Ultimate
- SSO / SAML / SCIM
- SOC 2 + HIPAA BAA
- Data residency routing
- Custom SLAs (99.99% uptime)
- Dedicated account team
- Unlimited seats
- On-premise deployment
7-Day Ultimate Trial
Every new account gets full Ultimate access for 7 days, triggered on your first API request. No credit card required. Explore all 29 layers, 55 MCP modules, and unlimited ForgeBots.
Start Your Free TrialHow to power your agents
Choose any combination. All three work on every platform tier.
Your API Keys
Connect keys from any of 340+ models. Forge optimizes every request — you keep 100% of the savings.
$0 platform fee on compute
Connect keysFree Tiers
Groq, Google, Mistral, Cohere, and more. Rate limits reset automatically. ~$150–250/month in free capacity available.
$0
Connect free providersForge Credits
No accounts. No setup. Buy credits, use any model instantly. Effective cost lower than going direct.
From $0.001/credit
Buy creditsForge routes intelligently across all connected sources — always using the most cost-effective option for each request.
Pay as you go — no subscription required
Forge Credits work on every model, every feature. No account setup. Credits never expire.
Why Forge Credits cost less than going direct
| Model | Direct Rate | Forge Rate | With Caching |
|---|---|---|---|
| GPT-4o | $2.50 / 1M input | $2.50 / 1M input | ~$0.38 / 1M input |
| Gemini 2.5 Pro | $1.25 / 1M input | $1.25 / 1M input | ~$0.19 / 1M input |
| Gemini 2.0 Flash | $0.01 / 1M input | $0.01 / 1M input | ~$0.002 / 1M input |
| Llama 3.3 70B (Groq) | $0.059 / 1M input | $0.059 / 1M input | ~$0.009 / 1M input |
Forge's optimization stack runs on every request. Semantic caching, smart routing, and context compression mean most requests cost a fraction of the base rate.
Feature Comparison
Detailed breakdown of what is included in each tier.
| Feature | Free | Pro | Ultimate | Enterprise |
|---|---|---|---|---|
| Routing & Intelligence | ||||
| LLM Providers | 3 | 14+ | 14+ | 14+ custom |
| Rate Limit | 60/min | 600/min | Unlimited | Unlimited |
| Quality Routing (ELO) | ||||
| Intent Cascade Classifier | Basic | |||
| Parallel Ensemble Routing | ||||
| Context Sharding (RAPTOR) | ||||
| Custom Routing Rules | ||||
| Memory & Storage | ||||
| Semantic Cache | Basic | Full | Full | Full |
| Vector Memory (Qdrant) | ||||
| Graph Memory (Neo4j) | ||||
| State CRDTs (Redis) | ||||
| IPFS/Filecoin Storage | ||||
| Memory Retention | 7 days | 90 days | Unlimited | Unlimited |
| Security | ||||
| Input Scanning | Basic | Full 7-layer | Full 7-layer | Full 7-layer |
| LlamaFirewall | ||||
| PII Detection (Presidio) | ||||
| OWASP Agentic Top 10 | ||||
| SpiceDB Permissions | ||||
| OPA Policy Engine | ||||
| SOC 2 / HIPAA | ||||
| Agents & Bots | ||||
| ForgeBots | 1 | 5 | Unlimited | Unlimited |
| Bot Templates | 3 | 20 | 20+ | Custom |
| ForgeOps Agents | 8 agents | 8+ custom | ||
| Forge Foundry | ||||
| Forge Mesh | ||||
| Forge Link | ||||
| Connectivity & Integrations | ||||
| Forge Connect Accounts | 3 | 10 | 50 | Unlimited |
| Connect Actions/mo | 50 | 500 | 5,000 | Unlimited |
| Available Integrations | 3,000+ | 3,000+ | 3,000+ | 3,000+ custom |
| Event-Driven Triggers | ||||
| Forge Ingest Connectors | 3 | 6 | Unlimited | |
| Compute (always separate) | ||||
| Bring Your Own Keys | ||||
| Free Tier Routing | ||||
| Forge Credits (FCC) | Custom rates | |||
| Support & Compliance | ||||
| Support | Community | Priority | Dedicated | 24/7 + TAM |
| SLA Uptime | Best effort | 99.9% | 99.95% | 99.99% |
| SSO / SAML | ||||
| Data Residency | ||||
| Custom DPA | ||||
| Audit Logs | 30 days | 1 year | Unlimited | |
What you actually pay
Already have LLM API keys? Good — connect them to Forge. Most teams using Forge spend less on AI than before.
Why Forge costs less
- Repeated queries hit Forge's cache — you pay once, not every time
- Long conversations are compressed — fewer tokens sent on every turn
- Simple tasks route to cheaper models automatically
- You get 10+ providers without managing 10+ billing accounts
Our markup is 20–30% depending on your tier. Your total bill is typically 40–60% lower than going direct. The math works.
| Without Forge | With Forge |
|---|---|
| Pay full price every call | 30–40% cached at near-zero cost |
| One provider relationship | 10+ providers, best price routed |
| Context grows, tokens explode | ICE compresses long sessions |
| Manage billing for each key | One invoice, one dashboard |
| No visibility into cost per call | X-Forge-Savings on every response |
Cost Calculator
Estimates based on platform averages. Actual savings depend on query patterns, session length, and model mix. Real savings are recorded on every call.
A La Carte MCP Modules
Access any of our 55 MCP modules individually via x402 micropayments. No subscription upgrade required.
Pay Per Use
Each MCP module has transparent per-request pricing, from $0.001 to $0.01 per call. Only pay for what you use.
55 Modules Available
Covering routing, memory, security, compliance, evaluation, replay, FinOps, connectivity, lifecycle management, and more.
x402 Micropayments
Built on the x402 protocol for frictionless, per-request billing. Purchase credit packs for volume discounts or pay as you go.
Credits are deducted automatically on each MCP module call. Unused credits roll over for 12 months.
Credit Packs
Buy credits in bulk and save up to 50%. Credits work across all 55 MCP modules.
Themed Bundles
Curated sets of MCP modules at 20-30% off individual pricing. Pick the capabilities you need.
Memory Pack
Full 3-layer memory system: vector, graph, and state CRDTs for persistent agent context.
Security Pack
Complete 7-layer ForgeGuard pipeline, OWASP scanning, and MCP supply chain security.
Orchestration Pack
Parallel ensemble routing, circuit breakers, and agent lifecycle management.
Observability Pack
Full tracing, session replay, and continuous evaluation with guardrail pipelines.
Foundry Pack
Multi-agent orchestration studio and ephemeral compute environments.
Developer Pack
Prompt lifecycle management, A/B testing, evaluation suites, and session replay.
Compliance Pack
Automated regulatory evidence, MCP supply chain verification, and FinOps cost intelligence.
From single tool to complete platform
Start with one tool. Upgrade as you grow.
One capability. No platform required.
All 8 tools at Pro tier. 1 seat.
All tools + agents + orchestration + platform features.
Everything + team seats + maximum limits.
Dedicated infrastructure. SSO. Compliance. SLA.
The $10 gap between the bundle ($39) and Platform Pro ($49) is the easiest upgrade decision in software.
Frequently Asked Questions
Need a custom solution?
Enterprise plans include dedicated infrastructure, custom SLAs, compliance certifications, and a named account team. Let us build the right package for your organization.
enterprise@optima-forge.com