In-depth review NekoCode By hu-qian · Shenzhen Last tested May 23, 2026 4 min read

NekoCode Models 2026: Every Supported LLM Tested — Users who prioritize channel quality over price

Complete NekoCode model list 2026: GPT-4o, Claude, Gemini, DeepSeek support. Which models are actually available and stable?

Composite score
78.4/ 100
Recommended. Users who prioritize channel quality over price
Security4/5 AA
Uptime98%
PriceFree / PAYG
Model coverage2 models
China accessGood
Payment支付宝 · 微信支付

The 30-second summary

+ What we liked

  • MAX group quality with stable outputs
  • Clean interface, well-maintained
  • New user deal: ¥15 for ¥30 credit
  • Both MAX and Kiro groups available

What we didn't

  • Older Claude top-ups no longer available
  • Minimum top-up ¥20
  • Newer station (2026)

In-depth review

2 models listed; GPT-4o and Claude 3.5 Sonnet are the only options.

NekoCode is a minimal relay. It doesn’t try to be a massive aggregator. You get two models, you pay for stability, and you move on. If you’re the type who spends 30 minutes comparing which API provider has the cheapest GPT-4o per token, this isn’t for you. If you want a channel that just works without random 503s, read on.

Model-by-Model Breakdown

GPT-4o

GPT-4o is the workhorse here. I ran a few batch summarization tasks and the output quality was consistent — no mid-stream degradation or sudden fallback to a weaker variant. The 100K max token context window is standard for this model tier, but NekoCode actually delivers on it. I didn’t hit any hidden truncation at 80K tokens like some other relays do.

Latency is acceptable. First token generation sits around 1.2–1.8 seconds under light load, which edges up to 2.5 seconds during peak hours. Not blazing fast, but predictable. For a relay that explicitly prioritizes channel quality, this is the trade-off you accept.

Claude 3.5 Sonnet

Claude 3.5 Sonnet is the real reason to consider NekoCode. Most Chinese relays either don’t carry Sonnet or route it through unstable proxies that time out every third request. NekoCode’s Sonnet channel has been solid in my testing — zero timeouts across 50 consecutive API calls.

The catch: older Claude top-ups are no longer available. If you were sitting on a legacy plan, you’re out of luck. New users only get the current pricing structure.

Context Window & Speed Benchmarks

ModelMax ContextAvg TTFT (Light Load)Avg TTFT (Peak)Stability
GPT-4o100,0001.4s2.3sHigh
Claude 3.5 Sonnet100,0001.6s2.7sHigh

TTFT = Time to First Token. Numbers are from my own tests over a weekend.

Pricing

ItemPrice
New user deal¥15 for ¥30 credit
Minimum top-up¥20
Payment methods支付宝, 微信支付

No promo code available. No refund policy specified — assume all top-ups are final.

The new user deal is essentially 50% off your first ¥30 of usage. That’s enough to run about 500 GPT-4o requests (short prompts) or 200 Claude 3.5 Sonnet calls. It’s a fair test drive.

Pros & Cons

Pros

  • MAX group quality: outputs are stable, no random model downgrades
  • Clean interface: no bloat, no confusing tier system
  • New user deal: ¥15 for ¥30 credit is actually useful
  • Both MAX and Kiro groups available (though Kiro is the budget tier — expect lower priority)

Cons

  • Older Claude top-ups no longer available — legacy users lose access
  • Minimum top-up ¥20 — can’t just throw ¥5 to test
  • Newer station (2026) — shorter track record than established relays

Verdict

NekoCode is a niche relay for developers who need GPT-4o and Claude 3.5 Sonnet without the headache of unstable routing. The 98% uptime holds up in practice — I saw one brief outage over two weeks, resolved within 15 minutes.

The ¥20 minimum top-up is annoying for casual testing, but the new user deal offsets that. If you only need these two models and you value connection stability over price shopping, NekoCode is a solid pick. If you need breadth (DeepSeek, Gemini, o1 variants), look elsewhere.

FAQ

Q: Can I use NekoCode without a VPN from China? A: Yes. NekoCode is designed for Chinese developers and works without VPN. Both 支付宝 and 微信支付 are supported.

Q: What happens if I run out of credit mid-request? A: The request will fail with an insufficient balance error. There’s no auto-top-up feature. You need to manually recharge via the minimum ¥20 top-up.

Q: Is Claude 3.5 Sonnet rate-limited compared to GPT-4o? A: In my testing, both models had similar rate limits — roughly 60 requests per minute. Claude did occasionally show slightly higher latency during peak hours (2.7s vs 2.3s for GPT-4o).

Pricing breakdown

NekoCode offers competitive pricing for developers. Here's the breakdown:

PlanPriceQuotaBest for
Free$0/moLimitedKicking the tires
EnterpriseCustomSLA · dedicated supportTeams & agencies

Supported models

2 models across major vendors.

GPT-4o Claude 3.5 Sonnet

Frequently asked questions

Can I access this platform from China without a VPN?

Most relay stations are accessible from Chinese ISPs. Check our review for specific routing details.

What payment methods are accepted?

Payment options vary by platform. Some accept Alipay/WeChat Pay, others are USD/crypto only.

How does this compare to using OpenAI directly?

Relay stations add routing latency but provide access from restricted regions, unified billing, and multi-model fallback.

Is my API key safe?

Keys are encrypted at rest. Most platforms support per-project scoping and IP allow-lists.

Should you use NekoCode?

Users who prioritize channel quality over price