freetiermodels

Access free inference without locking
into paid model subscriptions.

You don't need
a paid subscription
to use AI effectively

Stop spending $100+ on model subscriptions and hitting the rate limits within 5 requests. Some tasks can be outsourced to free models and still get the outputs that you want. Here's how to get started:

BYOK interface

Connect and access models

Choose a provider that lets you bring your own API keys and easily switch between models. I personally use OpenCode and Droid CLIs.

Free Model API Key

Get free tier access

API keys let you connect a model provider with free tiers to the BYOK interface. All keys are OpenAI-compatible and are categorised below.

Direct Providers

Google
Google
gemma-4-26b
4/5
RPM15
RPD1.5K
Context1M tok

Why This Model

Gemma 4 has generous daily rate limits, while Gemini 3.1 Flash-Lite has 500 RPD.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Google and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Google and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "google/gemini-3.1-flash-lite-latest",
  "provider": {
    "google": {
      "options": {
        "apiKey": "{env:GOOGLE_API_KEY}"
      }
    }
  }
}

All Free Models (6)

gemini-2.5-pro
gemini-2.5-flash
gemini-2.5-flash-lite
gemini-3.1-flash-lite
gemma-4-26b
gemma-3-27b
Mistral
Mistral
mistral-small-4
4/5
RPM60
RPD1B tok/mo
Context128K tok

Why This Model

Very generous free tier, though subject to RPM rate limits.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Mistral and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Mistral and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "mistral/mistral-small-3.1",
  "provider": {
    "mistral": {
      "options": {
        "apiKey": "{env:MISTRAL_API_KEY}"
      }
    }
  }
}

All Free Models (5)

mistral-large-3
mistral-small-3.1
ministral-8b
codestral
mistral-embed
Cohere
Cohere
command-a
4/5
RPM20
RPD1K
Context128K tok

Why This Model

Good rate limits for this lesser-known model.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Cohere and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Cohere and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "cohere/command-a",
  "provider": {
    "cohere": {
      "options": {
        "apiKey": "{env:COHERE_API_KEY}"
      }
    }
  }
}

All Free Models (6)

command-a
command-r
command-r7b
command-light
aya-expanse-32b
embed-v3

Aggregators

NVIDIA NIM
NVIDIA NIM
kimi-k2.5
4/5
RPM40
RPD?
Context128K tok

Why This Model

Long signup process but generous limits. Unsure of the current RPD limits for the free tier.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for NVIDIA NIM and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for NVIDIA NIM and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "nvidia/meta/llama-3.1-8b-instruct",
  "provider": {
    "nvidia": {
      "options": {
        "apiKey": "{env:NVIDIA_API_KEY}"
      }
    }
  }
}

All Free Models (5)

meta/llama-3.1-8b-instruct
meta/llama-3.3-70b
mistralai/mistral-large
qwen/qwen3-235b
deepseek/deepseek-r1
OpenRouter
OpenRouter
nvidia/nemotron-3-super-120b-a12b:free
4/5
RPM20
RPD50
Context200K tok

Why This Model

Good starting point for free inference, I recommend topping up $10.50 to get 1,000 free requests per day (from 50).

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for OpenRouter and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for OpenRouter and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "nvidia/nemotron-3-super-120b-a12b:free",
  "provider": {
    "openrouter": {
      "options": {
        "apiKey": "{env:OPENROUTER_API_KEY}"
      }
    }
  }
}

All Free Models (29)

qwen/qwen3.6-plus:free
stepfun/step-3.5-flash:free
nvidia/nemotron-3-super-120b-a12b:free
arcee-ai/trinity-large-preview:free
z-ai/glm-4.5-air:free
nvidia/nemotron-3-nano-30b-a3b:free
minimax/minimax-m2.5:free
arcee-ai/trinity-mini:free
nvidia/nemotron-nano-12b-v2-vl:free
openai/gpt-oss-120b:free
nvidia/nemotron-nano-9b-v2:free
nvidia/llama-nemotron-embed-vl-1b-v2:free
openai/gpt-oss-20b:free
qwen/qwen3-coder:free
qwen/qwen3-next-80b-a3b-instruct:free
meta-llama/llama-3.3-70b-instruct:free
liquid/lfm-2.5-1.2b-thinking:free
liquid/lfm-2.5-1.2b-instruct:free
cognitivecomputations/dolphin-mistral-24b-venice-edition:free
google/gemma-3-27b-it:free
nousresearch/hermes-3-llama-3.1-405b:free
meta-llama/llama-3.2-3b-instruct:free
google/gemma-3-4b-it:free
google/gemma-3n-e4b-it:free
google/gemma-3n-e2b-it:free
google/gemma-3-12b-it:free
google/gemma-4-31b-it:free
google/gemma-4-26b-a4b-it:free
openrouter/free
OpenCode Zen
OpenCode Zen
minimax-m2.5-free
3/5
RPM∞
RPDUnlimited (feedback collection period)
ContextVaries by model

Why This Model

Free models available for limited time while teams collect feedback. Rate limits may be hit very quickly if congested.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for OpenCode Zen and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for OpenCode Zen and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "opencode/minimax-m2.5-free",
  "provider": {
    "opencode": {
      "options": {
        "apiKey": "{env:OPENCODE_ZEN_API_KEY}"
      }
    }
  }
}

All Free Models (4)

minimax-m2.5-free
qwen3.6-plus-free
nemotron-3-super-free
big-pickle
Kilo Gateway
Kilo Gateway
nvidia/nemotron-3-super-120b-a12b:free
3/5
RPM∞
RPD200/hour
Context128K tok

Why This Model

Multiple free models available with decent usage, but responses have been slow for me.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Kilo Gateway and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Kilo Gateway and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "kilo/nvidia/nemotron-3-super-120b-a12b:free",
  "provider": {
    "kilo": {
      "options": {
        "apiKey": "{env:KILO_API_KEY}"
      }
    }
  }
}

All Free Models (5)

nvidia/nemotron-3-super-120b-a12b:free
arcee-ai/trinity-large-thinking:free
bytedance-seed/dola-seed-2.0-pro:free
x-ai/grok-code-fast-1:optimized:free
openrouter/free
Ollama Cloud
Ollama Cloud
glm-4.7:cloud
3/5
RPM1
RPDLight
Context128K tok

Why This Model

Good variety of models, but rate limits get used up quickly.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Ollama Cloud and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Ollama Cloud and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "provider": {
    "ollama": {
      "npm": "@ai-sdk/openai-compatible",
      "name": "Ollama Cloud",
      "options": {
        "baseURL": "https://ollama.com/v1"
      },
      "models": {
        "glm-4.7:cloud": {
          "name": "glm-4.7:cloud"
        }
      }
    }
  }
}

All Free Models (5)

glm-4.7:cloud
deepseek-v3.2
qwen3.5
kimi-k2.5
llama3.2
GitHub Copilot
GitHub Copilot
claude-haiku-4.5
2/5
RPM-
RPD50/month
Context128K tok

Why This Model

50 free requests per month is very limited, but we get access to Claude Haiku and GPT-5 mini

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for GitHub Copilot and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for GitHub Copilot and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "github/claude-haiku-4.5",
  "provider": {
    "github": {
      "options": {
        "token": "{env:GITHUB_TOKEN}"
      }
    }
  }
}

All Free Models (2)

claude-haiku-4.5
claude-haiku-3.5
Groq
Groq
llama-3.1-8b-instant
2/5
RPM30
RPD1K
Context8K tok

Why This Model

While rate limits are very generous, I constantly get 'Reasoning not supported with this model' errors when using with OpenCode.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Groq and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Groq and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "groq/llama-3.1-8b-instant",
  "provider": {
    "groq": {
      "options": {
        "apiKey": "{env:GROQ_API_KEY}"
      }
    }
  }
}

All Free Models (9)

llama-3.3-70b
llama-3.1-8b-instant
llama-3.1-70b
llama-4-scout
kimi-k2
mistral-saba
gemma2-9b-it
qwen-qwq-32b
deepseek-r1
Cerebras
Cerebras
llama-3.3-70b
2/5
RPM30
RPD14.4K
Context8K tok

Why This Model

Limits nerfed for GLM-4.7 and GPT-OSS, I constantly get 'Session too large to compact - context exceeds model limit even after stripping media' errors when using with OpenCode.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Cerebras and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Cerebras and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "cerebras/llama-3.3-70b",
  "provider": {
    "cerebras": {
      "options": {
        "apiKey": "{env:CEREBRAS_API_KEY}"
      }
    }
  }
}

All Free Models (5)

llama-3.3-70b
llama-3.1-8b
qwen3-235b
gpt-oss-120b
qwen-2.5-coder-32b
GitHub Models
GitHub Models
gpt-4o-mini
2/5
RPM15
RPD150
Context128K tok

Why This Model

Rate limits get used up very quickly.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for GitHub Models and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for GitHub Models and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "github/gpt-4o-mini",
  "provider": {
    "github": {
      "options": {
        "token": "{env:GITHUB_TOKEN}"
      }
    }
  }
}

All Free Models (5)

gpt-4o
gpt-4o-mini
llama-3.3-70b
deepseek-r1
phi-3.5-mini
Cloudflare Workers AI
Cloudflare Workers AI
@cf/meta/llama-3.1-8b-instruct
2/5
RPM?
RPD10K neu
Context8K tok

Why This Model

Long signup process, Cloudflare uses 'neurons' instead of credits and they get used up very quickly with the free tier.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Cloudflare Workers AI and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Cloudflare Workers AI and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "cloudflare/@cf/meta/llama-3.1-8b-instruct",
  "provider": {
    "cloudflare": {
      "options": {
        "accountId": "{env:CF_ACCOUNT_ID}",
        "apiToken": "{env:CF_API_TOKEN}"
      }
    }
  }
}

All Free Models (5)

@cf/meta/llama-3.1-8b-instruct
@cf/meta/llama-3.3-70b
@cf/qwen/qwq-32b
@cf/mistral/mistral-7b-instruct
@cf/deepseek-ai/deepseek-r1-distill-llama-8b
Hugging Face
Hugging Face
meta-llama/Llama-3.2-3B-Instruct
1/5
RPM?
RPD$0.10/month
Context128K tok

Why This Model

$0.10/mo in free credits is extremely limited.

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Hugging Face and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Hugging Face and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "huggingface/meta-llama/Llama-3.2-3B-Instruct",
  "provider": {
    "huggingface": {
      "options": {
        "token": "{env:HF_TOKEN}"
      }
    }
  }
}

All Free Models (5)

meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-3.3-70B
Qwen/Qwen2.5-72B
mistralai/Mistral-7B-Instruct
deepseek-ai/DeepSeek-R1

Promotional Credits

KiloCode
KiloCode
nvidia/nemotron-3-super-120b-a12b:free
3/5
Amount$2.50
Context200K tok

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for KiloCode and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for KiloCode and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "kilocode/KiloCode-Claude-Sonnet",
  "provider": {
    "kilocode": {
      "options": {
        "apiKey": "{env:KILOCODE_API_KEY}"
      }
    }
  }
}
Fireworks AI
Fireworks AI
llama-v3p2-3b-instruct
3/5
Amount$6
Context128K tok

Desktop Setup

  1. Download OpenCode
  2. Click Models, then Manage Models, then Connect Provider
  3. Search for Fireworks AI and enter your API key

Terminal Setup

  1. Install OpenCode
  2. Type opencode in Terminal
  3. Type /models, press ctrl+a to connect
  4. Search for Fireworks AI and enter your API key

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "fireworks/llama-v3p2-3b-instruct",
  "provider": {
    "fireworks": {
      "options": {
        "apiKey": "{env:FIREWORKS_API_KEY}"
      }
    }
  }
}

Other Providers

Crof
Crof
crof/glm-4.7-flash
3/5
RPMNA
RPD?
Context128K tok

Why This Model

Free tier with no request limits, but I had to continuously prompt it to continue.

Setup via Add-Provider Skill

  1. Download OpenCode
  2. Install the add-provider skill from the repo
  3. Run add provider and choose Crof
  4. Enter your API key when prompted

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "crof/glm-4.7-flash",
  "provider": {
    "crof": {
      "options": {
        "apiKey": "{env:CROF_API_KEY}"
      }
    }
  }
}

All Free Models (2)

qwen3.5-9b
glm-4.7-flash
Nous Portal
Nous Portal
hermes-3-llama-3.1-405b
2/5
RPM?
RPD$10/mo credits
ContextVaries by model

Why This Model

Requires a one-time $10 top-up to access the free tier, models are powered by OpenRouter.

Setup via Add-Provider Skill

  1. Download OpenCode
  2. Install the add-provider skill from the repo
  3. Run add provider and choose Nous Portal
  4. Enter your API key when prompted

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "nousresearch/hermes-3-llama-3.1-405b",
  "provider": {
    "openai-compatible": {
      "options": {
        "baseURL": "https://gateway.nous.chat/v1",
        "apiKey": "{env:NOUS_API_KEY}"
      }
    }
  }
}

All Free Models (3)

hermes-3-llama-3.1-405b
deepseek-ai/DeepSeek-V3
qwen/Qwen2.5-72B-Instruct
routing.run
routing.run
route/kimi-k2.5
3/5
RPM-
RPD50
Context128K tok

Why This Model

Lite plan gives 400 requests/day. Requires MCP installation via opencode-add-provider skill.

Setup via Add-Provider Skill

  1. Download OpenCode
  2. Install the add-provider skill from the repo
  3. Run add provider and choose routing.run
  4. Enter your API key when prompted

OpenCode Config

{
  "$schema": "https://opencode.ai/config.json",
  "model": "routing/openrouter/free",
  "provider": {
    "routing": {
      "options": {
        "apiKey": "{env:ROUTING_API_KEY}"
      }
    }
  }
}

All Free Models (5)

route/kimi-k2.5
route/glm-5
route/qwen3.5-397b-a17b
route/deepseek-v3.2
route/gemma-4-31b-it

Escape the AI cost war

Build Effective Skills
that work with any model.

Turn soul-sucking tasks into simple slash commands
that give you the same outputs with any model.