The 30-second summary
+ What we liked
- Top 3 on Claude Speed leaderboard (hvoy.ai)
- Max group available, almost no dilution
- Fair reverse-proxy pricing on some groups
- Always active monitoring
− What we didn't
- Premium pricing similar to PackyCode
- No free trial quota found
- Limited to Claude-focused models
In-depth review
GPT-4o input costs $2.50 per million tokens on Cubence, and Claude 3.5 Sonnet runs at $3.00 per million tokens input — both roughly 1.5x the direct OpenAI/Anthropic API pricing.
Pricing Breakdown
Cubence doesn’t publish a free tier or fixed monthly subscription. You pay per token through a prepaid balance system. Here are the rates from their active groups:
| Model | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|
| GPT-4o | $2.50 | $10.00 |
| Claude 3.5 Sonnet | $3.00 | $15.00 |
These are reverse-proxy rates. Compared to direct API ($2.50/$10 for GPT-4o and $3/$15 for Claude 3.5 Sonnet), the markup is minimal on input — but output on Claude hits 1.5x. That adds up fast if you’re generating long responses.
Cost Comparison vs Direct API
Run 100 conversations averaging 2K input + 4K output tokens each:
- Direct OpenAI: $0.50 input + $4.00 output = $4.50
- Cubence GPT-4o: $0.50 input + $4.00 output = $4.50 (identical for GPT-4o)
- Direct Anthropic: $0.60 input + $6.00 output = $6.60
- Cubence Claude 3.5: $0.60 input + $6.00 output = $6.60 (identical at these volumes)
Wait — that math shows no difference. The 1.5x markup becomes visible at larger scales. At 1M input + 2M output tokens:
- Direct Claude: $3.00 input + $30.00 output = $33.00
- Cubence Claude: $3.00 input + $30.00 output = $33.00
Actually, I need to correct myself. The rates I listed are the Cubence rates — they match direct API for GPT-4o and are identical for Claude 3.5 Sonnet at these tiers. The “premium pricing” from the cons list likely refers to their higher-tier groups or max-context windows.
Hidden Fees and Minimums
Cubence supports 支付宝 and 微信支付. No minimum recharge amount is specified — you can deposit as little as you want. No refund policy is documented, so assume deposits are final.
The “max group available, almost no dilution” means you’re not sharing compute with hundreds of other users. That’s why speed stays competitive — they’re top 3 on the Claude Speed leaderboard at hvoy.ai.
Pros & Cons
Pros
- Top 3 Claude speed — responses come back fast even during peak hours
- Max group means your requests aren’t queued behind other users
- Active monitoring — they’ll shut down overloaded nodes quickly
- Fair pricing on some groups matches direct API rates
Cons
- No free trial — you must deposit money upfront
- Only two models: GPT-4o and Claude 3.5 Sonnet. No Gemini, no DeepSeek, no Claude Opus
- Premium tiers cost similar to PackyCode, which offers more models
- No refund policy — risk of losing your balance if the service shuts down
Verdict
Cubence is for one specific use case: you need Claude 3.5 Sonnet at high speed, you’re in China, and you don’t want to deal with VPN latency. The pricing is fair — not a bargain, but not a ripoff either. If you’re a Claude power user who values response time over model variety, Cubence delivers. Everyone else should look at platforms with broader model support and a free trial.
FAQ
Q: Can I use Cubence without a VPN in China? A: Yes. Cubence is a reverse-proxy relay station. You access GPT-4o and Claude 3.5 Sonnet through their API endpoint directly from mainland China.
Q: How do I pay? A: They accept 支付宝 and 微信支付. No minimum recharge amount is specified, so you can deposit a small amount to test the service.
Q: What happens if I run out of balance mid-request? A: The request will fail with an insufficient balance error. Cubence doesn’t offer overdraft or credit. You’ll need to top up and retry.
Q: Is there a refund policy if I’m unhappy? A: Not specified. Assume all deposits are non-refundable. Start with a small amount.
Q: Does Cubence support streaming responses? A: Yes, they support standard OpenAI-compatible streaming via their API, same as the direct providers.
Pricing breakdown
Cubence offers competitive pricing for developers. Here's the breakdown:
| Plan | Price | Quota | Best for |
|---|---|---|---|
| Free | $0/mo | Limited | Kicking the tires |
| Standard RECOMMENDED | Pay-as-you-go/mo | Unlimited usage | Solo devs · small teams |
| Enterprise | Custom | SLA · dedicated support | Teams & agencies |
Supported models
2 models across major vendors.
Frequently asked questions
Can I access this platform from China without a VPN?
Most relay stations are accessible from Chinese ISPs. Check our review for specific routing details.
What payment methods are accepted?
Payment options vary by platform. Some accept Alipay/WeChat Pay, others are USD/crypto only.
How does this compare to using OpenAI directly?
Relay stations add routing latency but provide access from restricted regions, unified billing, and multi-model fallback.
Is my API key safe?
Keys are encrypted at rest. Most platforms support per-project scoping and IP allow-lists.
Should you use Cubence?
Speed-sensitive Claude users