Penny-Pincher Provider

Curated list of affordable and free LLM API providers — Claude Code guest passes, coding-plan subscriptions, and free-tier APIs. Maintained for developers who want capable models without premium pricing.

Contributions welcome — open a pull request or issue to add or update an entry.


Claude Code Guest Passes

One week of free Claude Pro (includes Claude Code). New users only; requires payment info but can be cancelled before the trial ends.

My passes:

Friend’s passes:

Have a spare pass? Open a PR adding your link under Friend’s passes, or open an issue.

Claude AI Ecosystem

ClaudeKit

ClaudeKit provides tools and resources to work with Claude AI more effectively — including prompt templates, workflows, and extensions.

Referral: 20% off your first purchase via https://claudekit.cc/?ref=BWA910UK (code: BWA910UK).

Checked Jun 5, 2026.


Providers with Coding Plans

Monthly subscriptions with request-based (not token-based) quotas, compatible with Claude Code, Cursor, Cline, and similar tools.

Z.ai

Plans from $18/month (Lite/Pro/Max). OpenAI-compatible + Anthropic-compatible endpoint. Models: GLM-5.1, GLM-5-Turbo, GLM-4.7, GLM-4.5-Air.

Referral: https://z.ai/subscribe?ic=PLKIAYEIPW

Checked Jun 5, 2026.

MiniMax

Token Plan — priced per API call, not token. $20–$120/month.

Models: M3 (frontier multimodal coding, 1M context), M2.7 (language), M2.7-highspeed, speech-2.8-hd/turbo, Music-2.6, Hailuo 2.3 (video). Yearly plans ~17% off.

Referral (10% off) until Jul 1, 2026For Referred Users: 10% off subscription + become a dev ambassador. For Referrers: 10% back in API voucher per paid referral, usable across all MiniMax models, plus priority access to events and model previews. View details

Checked Jun 5, 2026.

Kimi Code

Moonshot’s coding perk bundled with Kimi membership. Models: Kimi K-series. Rolling 5-hour quota window.

Tiers: Adagio (free), Andante, Presto. Pay-as-you-go also at platform.moonshot.ai.

Checked Jun 5, 2026.

Alibaba Cloud Model Studio

Referral: Up to $1,700 in free trial credits via https://www.alibabacloud.com/campaign/benefits?referral_code=A92LU5 (code: A92LU5).

Token Plan (Team Edition)

Credit-based, per-seat subscription (Singapore region only):

Models: qwen3.6-plus, glm-5, MiniMax-M2.5, deepseek-v3.2 (text); qwen-image-2.0/2.0-pro, wan2.7-image/image-pro (image).

Coding Plan

Pro plan: $50/month — 6,000 req/5-hour, 45,000 req/week, 90,000 req/month.

Models: qwen3.5-plus, kimi-k2.5, glm-5, MiniMax-M2.5. Tools: Claude Code, Cursor, Cline, Codex, and more. Lite plan no longer accepting new subscribers.

Note (as of Jun 5, 2026): effectively unbuyable — perpetually out of stock. Claimed restock at 00:00 GMT+8, but repeated attempts still fail to purchase.

Checked Jun 5, 2026.

opencode — Go

Open-source opencode CLI subscription. $5 first month, $10/month thereafter.

Models: GLM-5/5.1, Kimi K2.5/K2.6, MiniMax M2.7/M3, Qwen3.5/3.6/3.7 Plus, MiMo-V2.5(-Pro), DeepSeek V4 Flash/Pro. Per-5-hour limits vary (200–10,200 req). API key portable — works with Claude Code via LiteLLM proxy or oc-go-cc. Model format: opencode-go/<model-id>.

My referral:

Invite friends to OpenCode Go. Earn $5 when a friend subscribes, and they’ll get $5 too. Share your referral link; your friend joins and subscribes to Go; you both get a $5 usage credit to apply toward your Go usage limits.

Referral link: https://opencode.ai/go?ref=HE42WGS8BM

Checked Jun 5, 2026.

Synthetic

Privacy-first inference (no training on prompts/responses). $30/month subscription or pay-as-you-go.

Models: Kimi K2.6, MiniMax M2.5, GLM 5.1, GLM 4.7 Flash, vLLM-compatible open-source models. OpenAI-compatible — works with Roo, Cline, Octofriend.

Referral: $10.00 in subscription credit via https://synthetic.new/?referral=CNBFyw28zF0dZoj

Checked Jun 5, 2026.

BigModel.cn — GLM Coding Plan

The Chinese (mainland) counterpart of Z.ai’s GLM Coding Plan — same underlying Zhipu AI models, but billed in CNY through bigmodel.cn. Suited for users who can pay via Alipay / WeChat Pay or already have a 智谱 AI account.

Plans (monthly, after the 2026 price adjustment):

All tiers support GLM-5.1, GLM-5-Turbo, GLM-4.7, and GLM-4.5-Air. Compatible with Claude Code, Cline, and 20+ coding tools via OpenAI-compatible API plus an Anthropic-compatible endpoint.

Referral program (challenge-based, resets every 30 invitees):

Source: https://www.bigmodel.cn/glm-coding, https://docs.bigmodel.cn/cn/coding-plan/overview 1

Homepage: https://www.bigmodel.cn/glm-coding

My referral:

🚀 Join the GLM Coding Plan via my link — get 5% off your first order. Subscribe at https://www.bigmodel.cn/glm-coding?ic=VGRZKHKNKW (invitation code: VGRZKHKNKW).

BytePlus ModelArk — Coding Plan

ByteDance. Lite: $15/month, Pro: $35/month (intro promo $5/$25 ended early 2026).

Models: ByteDance-Seed-2.0, DeepSeek-V3.2, GLM-5.1, Kimi-K2.5. Tools: Claude Code, Cursor, Cline, Roo Code, OpenCode.

Referral: https://www.byteplus.com/activity/codingplan?ac=MMAUCIS9NT1S&rc=2739UWRE

Checked Jun 5, 2026.

Xiaomi MiMo Open Platform

I’m on Xiaomi MiMo Open Platform — running Xiaomi’s flagship MiMo V2.5 and the rest of the lineup. Sign up with my code and you’ll instantly get $2 in API credits.

After signup, enter the code at the bottom-left of the console. Credits valid 40 days.

Token Plan (monthly, launched May 2026): Lite ¥39 (60M credits), Standard ¥99 (200M), Pro ¥329 (700M), Max ¥659 (1,600M). Models: MiMo-V2.5, MiMo-V2.5-Pro (2× credit cost). Annual plans discounted.

Referral: Code T8ESAY · https://platform.xiaomimimo.com?ref=T8ESAY

Checked Jun 5, 2026.


Free Providers

OpenRouter

Free models (:free suffix): 20 RPM, 50 req/day (free accounts); 1,000 req/day after $10 top-up.

Checked Jun 5, 2026.

NVIDIA NIM

Free for NVIDIA Developer Program members, no credit card. ~40 RPM. 1,000 inference credits at signup (consumption-based, not a fixed monthly request cap).

Models: Kimi K2.5, GPT-OSS, DeepSeek-V3.2, Llama 3.x, Mistral, Phi, Nemotron. OpenAI-compatible.

Checked Jun 5, 2026.

OpenCode Zen

Hand-picked free models that change periodically. Optimized for coding agents.

Model Status
Big Pickle Free
DeepSeek V4 Flash Free Free
MiMo-V2.5 Free Free
Nemotron 3 Ultra Free Free

OpenAI/Anthropic compatible endpoints at https://opencode.ai/zen/v1/.

Checked Jun 5, 2026.

Google AI Studio

Google’s developer platform for Gemini models. Generous free tier with pay-as-you-go available.

Free-tier limits vary by model (check the AI Studio dashboard for your project):

Model (Free) RPM RPD
Gemini 2.5 Pro 5 100
Gemini 2.5 Flash 10 250
Gemini 2.5 Flash-Lite 15 1,000

Paid tier: higher limits, usage-based billing.

Models: Gemini 2.5 Pro/Flash/Flash-Lite. OpenAI-compatible endpoint available.

Warning: In the Free tier, Google may use your prompts and responses to improve their products. Use the Paid tier or Vertex AI for privacy.

Checked Jun 5, 2026.

Kilo Code — Gateway

VS Code + JetBrains coding extension with built-in gateway.

Free: :free models, 200 req/hour per IP. First top-up: $20 bonus credits (60-day expiry). BYOK supported — no Kilo subscription required for your own provider keys.

Checked Jun 5, 2026.

Cloudflare Workers AI

10,000 Neurons/day (resets 00:00 UTC). ~150 LLM responses/day on Llama 3.3 70B.

Models: Llama 3.3 70B, Gemma 3, Qwen2.5-Coder, DeepSeek R1, BGE embeddings, Whisper.

Checked Jun 5, 2026.

DeepSeek Platform

5M free tokens at signup (no code needed). OpenAI + Anthropic compatible.

PAYG: V4 Flash $0.14/M in, $0.28/M out; V4 Pro $0.435/M in, $0.87/M out (cached: $0.03/M).

Checked Jun 5, 2026.

Groq

LPU inference. Free tier, no credit card.

Model RPM RPD TPM TPD
llama-3.1-8b-instant 30 14,400 6K 500K
llama-3.3-70b-versatile 30 1,000 12K 100K
whisper-large-v3 20 2,000

Checked Jun 5, 2026.

GitHub Models

Free for all GitHub accounts. OpenAI-compatible at https://models.github.ai/inference.

Covers OpenAI, Anthropic, Llama, Mistral, DeepSeek, Grok, Phi. Rate limits vary per model (e.g. GPT-4o: 10 RPM/50 RPD; DeepSeek-R1: 15 RPM/150 RPD). Requires PAT with models:read.

Checked Jun 5, 2026.

xAI Grok API

$25 signup credits + $150/month via Data Sharing Program (eligible countries).

Models: Grok 4, Grok 4.1 Fast (2M ctx), Grok Code Fast. OpenAI + Anthropic compatible.

Warning: Data Sharing opt-in is irreversible and lets xAI train on your prompts.

Checked Jun 5, 2026.

Mistral La Plateforme

Free Experiment plan — up to ~1B tokens/month, no credit card (phone verification only).

Models: Mistral Large 3, Medium 3, Small 3.1, Codestral, Pixtral, embeddings. OpenAI-compatible.

Checked Jun 5, 2026.

Google Cloud Vertex AI

$300 free credits for 90 days (new GCP customers). Not a recurring free tier.

Models: Gemini 3 Pro/Flash, Anthropic Claude on Vertex, DeepSeek, GLM, Qwen via MaaS. Express Mode available without billing for limited evaluation quotas.

Checked Jun 5, 2026.

Hugging Face Inference Providers

Routes across Together, Fireworks, Novita, Cerebras, Replicate, DeepInfra, Scaleway.

Free: 100K monthly credits. PRO ($9/month): 2M credits. OpenAI-compatible at https://router.huggingface.co/v1.

Checked Jun 5, 2026.

Cerebras Cloud

Wafer-scale chip inference. Free tier, no credit card.

Model RPM TPD
gpt-oss-120b 30
llama3.1-8b-instant 30 500K
qwen-3-235b-a22b-instruct-2507 30
zai-glm-4.7 10

1M tokens/day shared cap. 8K context limit on free tier.

Checked Jun 5, 2026.

BigModel.cn

Zhipu AI (智谱 AI). 25M free tokens for new users to explore the API, playground, and AGI apps. GLM-4.7-Flash and GLM-4.5-Flash are permanently free.

Referral: https://www.bigmodel.cn/invite?icode=rIX6uZrLYfy8fQ6Urca4xf2gad6AKpjZefIo3dVEyA%3D

Checked Jun 5, 2026.

Fireworks AI

$1 starter credits. 50+ models. Function calling, MCP support. OpenAI-compatible.

Checked Jun 5, 2026.

Scaleway Generative APIs

EU/GDPR, Paris. 1M free tokens for new customers (no time limit advertised).

Models: Qwen3 (235B/397B/coder), Llama 3.3 70B, Mistral Small 3.2, DeepSeek R1 distill, Pixtral.

Checked Jun 5, 2026.


License

Apache 2.0

  1. Checked on Jun 5, 2026