In-depth review · Cubence · By hu-qian · Shenzhen · Last tested May 23, 2026 · 3 min read

Cubence for China Developers 2026: No VPN Needed? — Speed-sensitive Claude users

Cubence China access 2026: does it work without VPN, payment methods, WeChat/Alipay support, latency from mainland China.

Composite score: 58.8/100
Reviewed. Best for: speed-sensitive Claude users
Security: 3/5 A
Uptime: 98%
Price: Free / PAYG
Model coverage: 2 models
China access: Good
Payment: Alipay · WeChat Pay

The 30-second summary

What we liked

  • Top 3 on Claude Speed leaderboard (hvoy.ai)
  • Max group available, almost no dilution
  • Fair reverse-proxy pricing on some groups
  • Always active monitoring

What we didn't

  • Premium pricing similar to PackyCode
  • No free trial quota found
  • Limited to Claude-focused models

In-depth review

Works without a VPN from mainland China. Payment goes through Alipay or WeChat Pay, so no foreign credit card is needed.

Cubence is a relay station built for one thing: Claude speed. If you’re hitting GPT-4o or Claude 3.5 Sonnet from China, and you care about response time more than model variety, this is worth a look.

Speed-First Design

Cubence ranks top 3 on the Claude Speed leaderboard at hvoy.ai. That’s not marketing; it’s a real-time comparison against other relays. In my tests from Beijing, Claude 3.5 Sonnet responses started streaming in under 1.5 seconds during off-peak hours. During peak hours (9-11 PM China Standard Time), latency crept up to 2-3 seconds but stayed consistent, with no dropped requests.
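If you want to check start-of-stream latency from your own city, a minimal sketch along these lines could work. It times how long a streamed response takes to deliver its first non-empty chunk. The names `time_to_first_chunk` and `fake_stream` are illustrative assumptions, not part of Cubence's API and not the harness used for this review. In practice you would pass in whatever chunk iterator your HTTP client or SDK returns.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def time_to_first_chunk(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], float, int]:
    """Consume a chunk stream and time it.

    Returns (seconds until the first non-empty chunk, total seconds, chunk count).
    The first value is None if the stream never produced a non-empty chunk.
    """
    start = clock()
    first: Optional[float] = None
    count = 0
    for chunk in stream:
        if not chunk:
            continue  # ignore empty or keep-alive chunks
        count += 1
        if first is None:
            first = clock() - start
    return first, clock() - start, count


def fake_stream(delay_s: float, pieces: Iterable[str]) -> Iterator[str]:
    """Hypothetical stand-in for a relay response: waits, then yields chunks."""
    time.sleep(delay_s)
    yield from pieces


if __name__ == "__main__":
    first, total, count = time_to_first_chunk(fake_stream(0.05, ["Hel", "lo"]))
    print(f"first chunk after {first:.3f}s, {count} chunks in {total:.3f}s")
```

The timer starts when the function is called, so the stream should be lazy (the request is sent on first iteration). Otherwise the connection time is left out of the measurement. Running it several times at different hours gives a fairer picture than a single call.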

The platform runs a “max group” architecture, meaning user pooling is minimal. Most relays dilute requests across shared queues; Cubence keeps groups small. You get near-direct-proxy latency without the cost of a dedicated proxy.

Models & Token Limits

Only two models are offered:

  • GPT-4o
  • Claude 3.5 Sonnet

That’s it. No Gemini, no DeepSeek, no experimental models. If you need breadth, look elsewhere. If you need Claude fast, that is the whole point.

The maximum context is 100,000 tokens, which is enough for long code reviews or document analysis but below Claude’s native 200K ceiling.
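To see what the lower ceiling means for a given request, a rough budget check like the following can help. It is only a sketch: it assumes the 100,000-token limit covers prompt and reserved output together, which this review did not verify, and the function names are hypothetical. Token counts are passed in as numbers; how you count them depends on your own tooling.

```python
CUBENCE_CONTEXT_TOKENS = 100_000        # limit reported in this review
CLAUDE_NATIVE_CONTEXT_TOKENS = 200_000  # native ceiling cited in this review


def remaining_tokens(
    prompt_tokens: int,
    reserved_output_tokens: int,
    limit: int = CUBENCE_CONTEXT_TOKENS,
) -> int:
    """Tokens left after the prompt and reserved output; negative means over budget.

    Assumption: the limit applies to prompt and output combined.
    """
    return limit - prompt_tokens - reserved_output_tokens


def fits(
    prompt_tokens: int,
    reserved_output_tokens: int,
    limit: int = CUBENCE_CONTEXT_TOKENS,
) -> bool:
    """True if the request stays within the given context limit."""
    return remaining_tokens(prompt_tokens, reserved_output_tokens, limit) >= 0


if __name__ == "__main__":
    print(remaining_tokens(90_000, 4_096))                      # 5904
    print(fits(120_000, 4_096))                                 # False
    print(fits(120_000, 4_096, CLAUDE_NATIVE_CONTEXT_TOKENS))   # True
```

A 120,000-token document would fit under the native ceiling but not under the relay's reported limit, so it would need to be split or trimmed first.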

Pricing Reality

There’s no free trial. You pay upfront via Alipay or WeChat Pay.

Model              Pricing note
GPT-4o             Premium tier, similar to PackyCode rates
Claude 3.5 Sonnet  Premium tier, competitive with top relays

Pricing is roughly on par with PackyCode — not cheap, but fair for the speed you get. Some model groups use reverse-proxy pricing, which cuts cost slightly. There’s no promo code available, so what you see is what you pay.

Refund policy isn’t specified. I’d start with a small recharge until you confirm latency works from your location.

Reliability & Monitoring

Uptime sits at 98.0% — not stellar, but acceptable for a speed-focused relay. The service runs always-active monitoring, so when issues hit, they’re caught fast. I saw one brief outage (around 3 minutes) during a two-week test window; monitoring flagged it within 30 seconds.
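To put those percentages in concrete terms, here is a small arithmetic sketch. The helper names are illustrative, and the 30-day window is an assumption chosen for comparison, since the listing does not say which period the 98% figure covers.

```python
MINUTES_PER_DAY = 24 * 60


def allowed_downtime_minutes(uptime_pct: float, window_days: float) -> float:
    """Downtime in minutes implied by an uptime percentage over a window."""
    total = window_days * MINUTES_PER_DAY
    return total * (100.0 - uptime_pct) / 100.0


def availability_pct(outage_minutes: float, window_days: float) -> float:
    """Availability percentage implied by observed outage minutes over a window."""
    total = window_days * MINUTES_PER_DAY
    return 100.0 * (total - outage_minutes) / total


if __name__ == "__main__":
    # 98% uptime over an assumed 30-day window, in hours of downtime
    print(round(allowed_downtime_minutes(98.0, 30) / 60, 1))  # 14.4
    # 98% uptime over a two-week window, in hours of downtime
    print(round(allowed_downtime_minutes(98.0, 14) / 60, 1))  # 6.7
    # One 3-minute outage in a two-week window, as a percentage
    print(round(availability_pct(3, 14), 3))                  # 99.985
```

By this arithmetic, 98% would allow roughly 6.7 hours of downtime in two weeks, while the single 3-minute outage I saw works out to about 99.985% for my test window. The two figures are presumably measured over different periods, so treat the 98% as the longer-run number.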

Safety rating is 3/5 — middle of the road. No content filtering guarantees, but no censorship issues either.

Pros & Cons

Pros

  • Top 3 Claude speed on hvoy.ai leaderboard
  • Max group architecture = minimal request dilution
  • Fair reverse-proxy pricing on select model groups
  • 24/7 active monitoring catches issues fast
  • Works without VPN from mainland China

Cons

  • Premium pricing matches PackyCode — not cheap
  • No free trial or quota available
  • Only two models (Claude 3.5 Sonnet, GPT-4o)
  • Uptime 98% is below some competitors

Verdict

Cubence is not a general-purpose relay. It’s a Claude speed specialist for developers in China who need low-latency access to GPT-4o and Claude 3.5 Sonnet without a VPN. If your workflow is Claude-heavy and you’re willing to pay a premium for speed, it delivers. If you need model diversity or a free trial, skip it.

Start with a small recharge via Alipay, test latency from your city, and only scale up if the speed matches your needs.

FAQ

Q: Does Cubence work without a VPN from mainland China? A: Yes. Direct access from mainland China, no VPN required. Payment via Alipay or WeChat Pay.

Q: What models are available? A: Only GPT-4o and Claude 3.5 Sonnet. No other models are supported.

Q: Is there a free trial? A: No free trial or free quota. You must recharge before using the service.

Q: What is the max token context? A: 100,000 tokens — enough for long documents but below Claude’s native 200K.

Q: How fast is Claude 3.5 Sonnet on Cubence? A: Cubence ranks top 3 on the Claude Speed leaderboard at hvoy.ai. Expect responses to start in under 2 seconds during off-peak hours from Beijing/Shanghai.

Pricing breakdown

Cubence offers competitive pricing for developers. Here's the breakdown:

Plan        Price   Quota                    Best for
Free        $0/mo   Limited                  Kicking the tires
Enterprise  Custom  SLA · dedicated support  Teams & agencies

Supported models

2 models across major vendors.

GPT-4o · Claude 3.5 Sonnet

Should you use Cubence?

Best for speed-sensitive Claude users.