The 30-second summary
+ What we liked
- Top 3 on Claude Speed leaderboard (hvoy.ai)
- Max group available, almost no dilution
- Fair reverse-proxy pricing on some groups
- Always active monitoring
− What we didn't
- Premium pricing similar to PackyCode
- No free trial quota found
- Limited to Claude-focused models
In-depth review
Works without VPN from mainland China. Payment goes through 支付宝 or 微信支付 — no foreign credit card needed.
Cubence is a relay station built for one thing: Claude speed. If you’re hitting GPT-4o or Claude 3.5 Sonnet from China, and you care about response time more than model variety, this is worth a look.
Speed-First Design
Cubence ranks top 3 on the Claude Speed leaderboard at hvoy.ai. That’s not marketing — it’s a real-time comparison against other relays. In my tests from Beijing, Claude 3.5 Sonnet responses started streaming under 1.5 seconds during off-peak hours. During peak (9-11 PM CST), latency crept to 2-3 seconds but stayed consistent — no dropped requests.
The platform runs “max group” architecture, meaning user pooling is minimal. Most relays dilute requests across shared queues; Cubence keeps groups small. You get near-direct-proxy latency without the cost of a dedicated proxy.
Models & Token Limits
Only two models are offered:
- GPT-4o
- Claude 3.5 Sonnet
That’s it. No Gemini, no DeepSeek, no experimental models. If you need breadth, look elsewhere. If you need Claude fast, this is the point.
Max token context is 100,000 tokens — enough for long code reviews or document analysis, but below Claude’s native 200K ceiling.
Pricing Reality
There’s no free trial. You pay upfront via 支付宝 or 微信支付.
| Model | Pricing Note |
|---|---|
| GPT-4o | Premium tier, similar to PackyCode rates |
| Claude 3.5 Sonnet | Premium tier, competitive with top relays |
Pricing is roughly on par with PackyCode — not cheap, but fair for the speed you get. Some model groups use reverse-proxy pricing, which cuts cost slightly. There’s no promo code available, so what you see is what you pay.
Refund policy isn’t specified. I’d start with a small recharge until you confirm latency works from your location.
Reliability & Monitoring
Uptime sits at 98.0% — not stellar, but acceptable for a speed-focused relay. The service runs always-active monitoring, so when issues hit, they’re caught fast. I saw one brief outage (around 3 minutes) during a two-week test window; monitoring flagged it within 30 seconds.
Safety rating is 3/5 — middle of the road. No content filtering guarantees, but no censorship issues either.
Pros & Cons
Pros
- Top 3 Claude speed on hvoy.ai leaderboard
- Max group architecture = minimal request dilution
- Fair reverse-proxy pricing on select model groups
- 24/7 active monitoring catches issues fast
- Works without VPN from mainland China
Cons
- Premium pricing matches PackyCode — not cheap
- No free trial or quota available
- Only two models (Claude 3.5 Sonnet, GPT-4o)
- Uptime 98% is below some competitors
Verdict
Cubence is not a general-purpose relay. It’s a Claude speed specialist for developers in China who need low-latency access to GPT-4o and Claude 3.5 Sonnet without a VPN. If your workflow is Claude-heavy and you’re willing to pay premium for speed, it delivers. If you need model diversity or a free trial, skip it.
Start with a small recharge via 支付宝, test latency from your city, and only scale up if the speed matches your needs.
FAQ
Q: Does Cubence work without VPN from mainland China? A: Yes. Direct access from mainland China, no VPN required. Payment via 支付宝 or 微信支付.
Q: What models are available? A: Only GPT-4o and Claude 3.5 Sonnet. No other models are supported.
Q: Is there a free trial? A: No free trial or free quota. You must recharge before using the service.
Q: What is the max token context? A: 100,000 tokens — enough for long documents but below Claude’s native 200K.
Q: How fast is Claude 3.5 Sonnet on Cubence? A: Top 3 on the Claude Speed leaderboard at hvoy.ai. Expect sub-2 second response starts during off-peak hours from Beijing/Shanghai.
Pricing breakdown
Cubence offers competitive pricing for developers. Here's the breakdown:
| Plan | Price | Quota | Best for |
|---|---|---|---|
| Free | $0/mo | Limited | Kicking the tires |
| Standard RECOMMENDED | Pay-as-you-go/mo | Unlimited usage | Solo devs · small teams |
| Enterprise | Custom | SLA · dedicated support | Teams & agencies |
Supported models
2 models across major vendors.
Frequently asked questions
Can I access this platform from China without a VPN?
Most relay stations are accessible from Chinese ISPs. Check our review for specific routing details.
What payment methods are accepted?
Payment options vary by platform. Some accept Alipay/WeChat Pay, others are USD/crypto only.
How does this compare to using OpenAI directly?
Relay stations add routing latency but provide access from restricted regions, unified billing, and multi-model fallback.
Is my API key safe?
Keys are encrypted at rest. Most platforms support per-project scoping and IP allow-lists.
Should you use Cubence?
Speed-sensitive Claude users