From prototype
to production scale.
Pay-as-you-go from $10. Committed plans from $499/mo. Dedicated GPU instances from $5,000/mo. All 200+ models, one API.
Pay-as-you-go from $10. Committed plans from $499/mo. Dedicated GPU instances from $5,000/mo. All 200+ models, one API.
All tiers include
Provision dedicated GPU instances for inference, fine-tuning, and training. Available on Dedicated and Enterprise tiers.
| GPU | VRAM | Price / hr | Best For |
|---|---|---|---|
| NVIDIA H100 SXM | 80 GB | $3.49/hr | Large models, fine-tuning, training |
| NVIDIA A100 SXM | 80 GB | $2.09/hr | Training, high-throughput inference |
| NVIDIA L40S | 48 GB | $1.19/hr | Inference, cost-efficient production |
| NVIDIA A10G | 24 GB | $0.75/hr | Lightweight inference, experimentation |
GPU instances billed per hour. Minimum 1-hour commitment. Multi-GPU clusters available on Enterprise. Pricing may vary by availability and region.
From shared multi-tenant pools to fully isolated on-premises deployments. Pick the tenancy model that matches your compliance and performance needs.
Every model has clear input and output pricing. Use our Vedika models for faith-domain tasks, or route to any open-source model at competitive rates.
| Model | Input (per 1M tokens) | Output (per 1M tokens) | Context |
|---|---|---|---|
| Vedika Standard XALEN | $2.00 | $4.00 | 128K |
| Vedika Fast XALEN | $2.80 | $5.60 | 128K |
| Vedika Voice XALEN | $0.02/sec | incl. | 31 langs |
| Claude Opus 4.7 Anthropic | $5.00 | $25.00 | 1M |
| Claude Sonnet 4.6 Anthropic | $3.00 | $15.00 | 1M |
| Llama 4 Maverick Meta | $0.27 | $0.85 | 1M |
| Llama 4 Scout Meta | $0.18 | $0.30 | 512K |
| DeepSeek R1 DeepSeek | $0.55 | $2.19 | 128K |
| DeepSeek V3 DeepSeek | $0.30 | $0.90 | 128K |
| Qwen 3 235B Qwen | $0.30 | $0.90 | 128K |
| Gemma 2 27B Google | $0.20 | $0.60 | 8K |
| Command R+ Cohere | $2.50 | $7.50 | 128K |
| +190 more models | Full pricing in docs | ||
Batch processing pricing is 50% of the rates shown above. All prices in USD. Custom pricing available for Enterprise plans.
Set hard caps, get alerts before overages, and auto-pause when limits are hit. No surprise bills.
Set alerts at 50%, 80%, 90%, and 100% of your monthly budget. Get notified via email and dashboard before you hit your limit.
Set a hard monthly cap in your dashboard. When reached, API calls return 402 Payment Required. No overage charges. Ever.
Enable auto-pause to halt all API traffic when your budget is exhausted. Resume instantly by increasing your cap or adding funds.
Monitor spend in real-time via dashboard or API. Per-model, per-day, per-key breakdowns. Export anytime as CSV or JSON.
Scale and Enterprise tiers can require manager approval before exceeding budget. Overage requests routed through your approval chain.
All tiers reconcile monthly, not quarterly. Invoices generated within 5 business days of cycle end. No 90-day lag.
Special pricing for students, researchers, and hackathon participants.
50% bonus credits on first deposit. Valid .edu email required.
$10 in free credits for verified hackathon participants. Expires 48 hours after activation.
Researchers affiliated with universities also qualify for the student plan. Email [email protected] with your institution details.
See exactly what XALEN costs for your use case, and what it saves you versus building in-house.
Add a minimum of $10 to your wallet via Razorpay (UPI, cards, net banking). Use any of the 200+ models and pay per token consumed. Credits are valid for 1 year from purchase. No monthly fees, no commitments. When your balance hits zero, API requests return 402 until you top up.
Growth ($499/mo) gives you 300 req/min, 500K tokens/min, 5 API keys, priority support, and usage analytics. Scale ($2,499/mo) upgrades to 1,000 req/min, 2M tokens/min, 10 API keys, priority inference queue, dedicated account manager, and 99.9% SLA. Both tiers include all 200+ models at standard token pricing.
Dedicated tier starts at $5,000/mo. You get isolated GPU instances billed per GPU-hour: H100 at ~$3.49/hr, A100 at ~$2.09/hr, L40S at ~$1.19/hr. Includes custom fine-tuning, 99.99% SLA, multi-tenant isolation, and private endpoints. Contact sales for exact pricing based on your configuration.
Upgrades are immediate: your new rate limits apply instantly. Downgrades take effect at the end of your current billing cycle. You can always fall back to Pay As You Go with no penalty. Contact [email protected] for tier changes.
Yes. Verified religious organizations and registered nonprofits receive 30% off all token pricing on any tier. Contact [email protected] with your organization verification documents and we will apply the discount within 48 hours.
Enterprise tier includes SOC 2 compliance and HIPAA support for health-faith applications. We also offer SSO/SAML integration, data residency options, and custom compliance documentation. Contact our enterprise team for specific certification requirements.
Processing millions of tokens monthly? Need dedicated infrastructure, custom SLAs, or on-premise deployment? Let us build a plan for your organization.
From pay-as-you-go prototyping to dedicated GPU clusters. Pick the tier that fits your stage.