Simple, Transparent Pricing
No hidden fees, no cap. Choose the plan that's right for your vibe and scale as you grow.
Free
Perfect for getting started
Free
What's included:
- Host unlimited public models and datasets
- Create unlimited orgs with no member limits
- Access the latest ML tools and open-source software
- Community support
- 5,000 API calls per month
- Basic compute with free CPUs
Limitations:
- No private models
- Limited compute resources
- Standard response times
MOST POPULAR
Pro
Unlock advanced features
$9/month
What's included:
- Everything in Free tier
- ZeroGPU and Dev Mode for Spaces
- Free credits across all Inference Providers
- Early access to upcoming features
- Pro badge on your profile
- 100,000 API calls per month
- Pay-as-you-go option for additional usage
Limitations:
- No enterprise features
- Standard SLA
Enterprise Hub
Accelerate your AI roadmap
$20/month
What's included:
- Everything in Pro tier
- SSO and SAML support
- Select data location with Storage Regions
- Detailed action reviews with Audit logs
- Granular access control with Resource groups
- Centralized token control and approval
- Dataset Viewer for private datasets
- Advanced compute options for Spaces
- 5x more ZeroGPU quota for members
- Deploy Inference on your own Infra
- Managed billing with yearly commits
- Priority support
- Unlimited API calls
Plan Comparison
Features | Free Tier | Pro | Enterprise |
---|---|---|---|
Core Features | | | |
Public Models Access | | | |
API Access | | | |
Storage | 1 GB | 15 GB | Unlimited |
Advanced Features | | | |
Custom Model Hosting | Up to 5 | Unlimited | |
Private Models | | | |
Custom Training | | | |
Model Versioning | | | |
Support | | | |
Support Level | Community | 24/7 Priority | |
SLA Guarantee | | | |
Dedicated Account Manager | | | |
Additional Computing Options
Spaces Hardware
Upgrade your Space compute
Starting at $0/hour
- Free CPUs
- Build more advanced Spaces
- 7 optimized hardware options
- From CPU to GPU to Accelerators
Inference Endpoints
Deploy models on fully managed infrastructure
Starting at $0.032/hour
- Deploy dedicated Endpoints in seconds
- Keep your costs low
- Fully-managed autoscaling
- Enterprise security
API Usage
Pay only for what you use
Starting at $0.001 per 1,000 tokens
- Ultra low per-token pricing
- Volume discounts available
- No minimum commitments
- Transparent usage dashboard
GPU Turbo Scaling
Auto-scaling GPU clusters on demand
Starting at $0.50/hour
- On-demand NVIDIA GPUs
- Automatic cluster scaling
- Real-time performance metrics
- Zero wait time provisioning
Custom ASIC Support
Specialized hardware acceleration
Starting at $1.20/hour
- Dedicated TPU/ASIC hardware
- 10x faster inference speed
- Optimized for large models
- Hardware-specific optimizations
Edge Deployment
Push models to IoT devices
Starting at $0.10/device/month
- Ultra-low latency inference
- Model compression technology
- Optimized for IoT and mobile
- Remote updates and monitoring
Multi-Region Deployment
Global low-latency inference
Starting at $0.25/region/hour
- Deploy to 15+ global regions
- Traffic-based auto-routing
- Regional data compliance
- Geo-redundant failover
Quantum Processing Units
Next-gen quantum acceleration
$5.00/hour (preview pricing)
- Experimental QPU access
- Quantum ML algorithm library
- Specialized problem acceleration
- Academic research priority
Serverless Inference
Pay-per-use compute scaling
Starting at $0.15 per million predictions
- Zero infrastructure management
- Infinite scale potential
- Cold-start optimization
- Cost-effective for variable loads
Ready to level up your AI game?
Start building with our platform today. No credit card required for the free tier.