What is the ThumbGate tech stack?

SQLite+FTS5 lesson DB, MemAlign-inspired dual recall, Thompson Sampling for adaptive checks, LanceDB vector search with Hugging Face embeddings, ContextFS context assembly, Bayesian belief updates, and PreToolUse hook enforcement.

What AI agents does ThumbGate work with?

Claude Code, Cursor, Codex, Gemini CLI, Amp, Cline, OpenCode, and any MCP-compatible agent.

How does ThumbGate reduce host blast radius for high-risk local runs?

ThumbGate combines pre-action checks with execution guidance. Workflow Sentinel predicts risky local actions before they execute, and high-risk runs can be routed into Docker Sandboxes instead of running directly on the host. Team workflows also have a signed hosted sandbox lane for isolated automation dispatch.

How does ThumbGate prevent AI slop and protect brand authenticity?

AI slop happens when agents act without human judgment as a hard check — generating repetitive, generic outputs that erode trust and dilute your brand. ThumbGate inserts human thumbs-up/down between AI intent and execution. Every thumbs-down becomes a prevention rule that blocks the bad pattern permanently. Every thumbs-up reinforces what 'good' looks like for your specific context. Your agent's outputs reflect your actual standards, not generic AI patterns. This is authenticity enforcement at the tool-call level.

How is ThumbGate different from SpecLock?

SpecLock requires manually writing constraints or compiling them from a PRD. ThumbGate learns automatically from thumbs-up/down feedback and auto-generates prevention rules from repeated failures. SpecLock locks files from modification; ThumbGate blocks specific actions (like force-push) before they execute.

How is ThumbGate different from Mem0?

Mem0 is cloud-hosted memory for AI apps. ThumbGate is local-first enforcement. Mem0 remembers context but cannot block actions. ThumbGate captures feedback, promotes it to prevention rules, and physically blocks tool calls that match known failure patterns via PreToolUse hooks.

👍👎

● Machine-speed pre-action defense for coding agents

Stop paying $ for the same AI mistake.

Every retry loop, every hallucinated import, every "let me try a different approach" — those are billable tokens on every LLM vendor's bill. ThumbGate is machine-speed pre-action defense: thumbs-down once, block that exact mistake on every future call, surface the next highest-ROI remediation, and show which agent surfaces are actually active before rollout. Across Claude Code, Cursor, Codex, Gemini, Amp, Cline, OpenCode — any MCP-compatible agent, forever, including fast-moving vibe coding workflows.

As desktop agents move into parallel sessions, terminals, and production workflows, ThumbGate checks the thing benchmarks miss: is this next action a known workflow, an open-ended agent, a costly fan-out, or a blind tool call with no way to verify it worked?

Start 7-day free trial

Go Pro — one correction, every agent, every session.

Personal local dashboard · DPO export from real corrections · founder support on risky flows.

$19/mo

Monthly Pro

Choose monthly →

$149/yr SAVE 35%

Annual Pro

Choose annual →

No credit card for 7-day trial · cancel anytime · your rules and captures stay local. Prefer free? Install CLI →

Your dashboard · Sample enforcing

💸 Tokens saved — since install (Sonnet-blended, conservative)

$0.00

✅ check:no-force-push — blocked 12×

✅ check:no-hallucinated-import — blocked 8×

❌ check:no-drop-prod — FIRED · saved ~$3.40

Sample shown. Your own dashboard tracks live feedback log, actionable remediations, and agent surface inventory from day one. Open dashboard →

Block repeat hallucinations before the model sees them Thumbs-down once, blocked forever Actionable remediations + agent surface inventory CLI-first workflow governance with a live tokens-saved counter

$ npx thumbgate init click to copy

Install Free CLI Start 7-day Pro trial — $19/mo →

Install Claude Extension ⭐ Star on GitHub Install Codex plugin → Install Cursor plugin → Open ThumbGate GPT

🔓 MIT Open Source ⭐ 14 GitHub Stars 🛡️ Local-first — no cloud required 🔌 6 agent integrations

No, you do not have to chat inside the GPT forever. The GPT is advice and checkpointing; local hooks do the hard blocking for Claude Code, Cursor, Codex, Gemini, Amp, Cline, OpenCode, and MCP-compatible agents.

▶ 90-second demo · force-push → 👎 → blocked

→ Start 7-day Pro trial

Your agent has no memory. Every session, the same wrong pattern runs. ThumbGate turns a single correction into a permanent block — before the next tool call fires. See all plans →

First-Dollar Activation Path

Block your first repeated AI mistake in 5 minutes.

Prove one blocked repeat before asking anyone to buy. The fastest path to value: one person, one repeated mistake, one check that blocks it permanently.

1. Install ThumbGate

Run npx thumbgate init in your repo. Or install the Claude Extension, Codex plugin, Cursor plugin, or open the GPT. Native ChatGPT rating buttons are not the ThumbGate capture path.

2. Give feedback

Give thumbs up when the agent follows your standards, or thumbs down when it misses. ThumbGate captures the context and distills a lesson from up to 8 prior entries.

3. The check blocks the repeat

Next time the agent tries the same mistake, the PreToolUse hook fires and physically blocks it. Upgrade after one real blocked repeat when you need the dashboard and exports.

thumbs up: this review named exact files, commands, and tests; repeat this evidence-first format.

thumbs down: the answer ignored my request for exact files and tests; next time include file paths, commands, and verification evidence.

Claude Code · Claude Desktop · Claude Extension

The fastest path for Claude users: install the extension and start blocking mistakes.

ThumbGate ships a published Claude Desktop extension bundle (.mcpb) you can install today. Claude Code users can also add the repo marketplace plugin immediately. No waiting for directory approval.

1. Install for Claude Code

Run npx thumbgate init --agent claude-code or add via claude mcp add thumbgate -- npx --yes --package thumbgate thumbgate serve

2. Or install the Claude Extension

Download the .mcpb bundle for Claude Desktop, or use the repo marketplace: /plugin marketplace add IgorGanapolsky/ThumbGate

3. Give feedback, checks auto-generate

Type thumbs down when Claude makes a mistake. ThumbGate distills a lesson from up to 8 prior entries and blocks the pattern permanently via PreToolUse hooks.

Download Claude Extension (.mcpb) Claude Desktop setup guide Claude plugin docs

Claude Code Skill: Type /thumbgate in any Claude Code session. Auto-triggers on “check”, “feedback”, “block mistake”. Free skill on top of the same local gateway.

ChatGPT Entry Point · Live ThumbGate GPT for ChatGPT

Use the GPT as a preflight desk for risky commands, refunds, deploys, and PR actions.

ThumbGate should meet users where they already ask AI for help. The live GPT is the fastest way to preflight a risky action, capture a typed thumbs-up/down lesson, and prove the enforcement loop before installing anything. As ChatGPT ads roll out, this matters more: ChatGPT can stay the discovery and checkpointing layer, while ThumbGate remains the hard execution boundary after npx thumbgate init.

1. Open the live GPT

Paste a proposed command, file edit, merge, deploy, refund, invoice, or API call and ask whether to allow, block, or checkpoint it.

2. Save the typed signal

Reply in chat with thumbs up: or thumbs down: plus one concrete sentence. Do not rely on ChatGPT's native rating buttons for ThumbGate memory.

3. Enforce locally

Run npx thumbgate init in the repo so Pre-Action Checks block repeated mistakes before the coding agent executes them.

Open ThumbGate GPT ChatGPT Actions setup Why ChatGPT ads need checks

Find it fast: if the direct link does not open, go to Explore GPTs, search ThumbGate, and choose the GPT by Igor Ganapolsky in Programming. Plain English rule: ChatGPT is the discovery and memory surface for advice, checkpointing, and typed feedback capture. One typed signal becomes one remembered rule. The hard Reliability Gateway still runs in the local agent or CI lane.

Positioning

Enforcement is the missing layer in AI orchestration.

Big orchestration suites unify data, routes, and decisions. ThumbGate sits closer to the moment of execution: the point where an agent is about to run a shell command, ship a PR, approve a release, or repeat a mistake you already corrected. That is where workflow trust is won or lost.

Broad orchestration platforms

Good at customer journeys, routing, and cross-system context. Weak when you need a coding agent or automation to stop before a destructive or low-trust action actually runs.

ThumbGate

Turns operator feedback into Pre-Action Checks. It does not just remember the mistake. It blocks the repeat at the tool-call boundary across Claude Code, Cursor, Codex, Gemini, Amp, Cline, and OpenCode.

The stack that makes sense

Use orchestration to decide what should happen next. Use ThumbGate to decide what is allowed to execute. That is the control layer enterprises actually need once agents touch repos, terminals, CI, or production workflows.

Compare orchestration vs enforcement → Platform team rollout → Regulated workflow pattern →

Autoresearch Safety Pack

Stop self-improving coding loops from hacking the benchmark.

Autoresearch loops run experiments, inspect metrics, and accept better variants. ThumbGate gives those loops a Reliability Gateway: Pre-Action Checks for skipped holdout tests, fake proof, reward hacking, unsafe edits, and promotion without verification evidence.

Guard the metric

Require primary and holdout checks before an agent can call a variant better. Block cherry-picked runs and missing baselines.

Preserve proof trails

Promotion needs commands, logs, changed files, and verification evidence so the win survives review instead of becoming a vague claim.

Ship into CI

Start with templates for npm test, Playwright duration, bundle size, lint, and CI failures, then add Team checks for shared workflows.

Read the Autoresearch guide Start Pro trial

New in v1.16.4

Three steps to stop repeated AI failures

Feedback

Give 👍 or 👎 on your AI agent's actions. Feedback is stored in a SQLite+FTS5 lesson DB. In the current Claude auto-capture hook, a vague thumbs-down can distill from up to 8 prior recorded entries and the failed tool call before promotion, then stay linked to a 60-second feedback session. Example: you 👎 a risky migration → it auto-promotes to a "never run DROP on prod" check.

Distill + Rules

Repeated failures auto-promote into prevention rules. Thompson Sampling adapts which rules fire, and the reflector lane can propose a reusable rule from the same transcript so high-risk patterns get stricter enforcement while low-risk ones stay relaxed.

Checks

Rules become Pre-Action Checks that block your agent before it repeats the same mistake. Your agent can't force-push, skip tests, or repeat a refactor you already rejected. No more fix-loops.

Enforcement

Checks block. They don't ask nicely.

Don't trust — verify

Every block shows why: pattern match, evidence, confidence score.

Real tools, not wishes

Checks physically block tool calls. Not prompt tricks. Not "please don't."

Force models to show work

Reasoning chains on every check decision. Thompson Sampling confidence tiers.

Log everything, learn automatically

Repeated failures auto-promote to checks. Org dashboard shows all agents.

Keep risky runs off the host

When Workflow Sentinel predicts a risky local action, ThumbGate can recommend Docker Sandboxes before the agent touches the host filesystem or broader credentials.

Ship with versioned proof

Changesets, SemVer, sync checks, and verification evidence make new package releases inspectable before a buyer trusts the next rollout.

Install for Your Agent

Claude Code

npx thumbgate init --agent claude-code

Wires PreToolUse hooks automatically

Cursor

npx thumbgate init --agent cursor

4 skills: feedback, rules, search, recall

Codex

npx thumbgate init --agent codex

6 skills including adversarial review

Gemini CLI

npx thumbgate init --agent gemini

Gemini CLI integration

Amp

npx thumbgate init --agent amp

Amp agent integration

Any MCP Client

npx thumbgate serve

MCP stdio server for any compatible client

Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "thumbgate": {
      "command": "npx",
      "args": ["--yes", "--package", "thumbgate", "thumbgate", "serve"]
    }
  }
}

Official directory review is separate. Claude Code users can install immediately with /plugin marketplace add IgorGanapolsky/ThumbGate and /plugin install thumbgate@thumbgate-marketplace.

Pricing

Stop paying for agent mistakes you already fixed.

Free

See how it works. Hit the wall. Then decide.

3 captures, 1 rule, 1 agent. Enough to prove the enforcement loop works. When you need more, you will know.

3 feedback captures total (not per day)
1 auto-promoted prevention rule
No recall or lesson search
No exports (DPO, Databricks, HuggingFace)
All MCP integrations (Claude Code, Cursor, Codex, Gemini, Amp, any MCP agent)
PreToolUse hook blocking with built-in safety checks (force-push, destructive SQL, secrets)
Setup guide for all agents →

$ npx thumbgate init click to copy

Install Free

Solo Pro

$19/mo

or $149/yr (save 35%) · Personal dashboard + enforcement proof

Unlimited captures, unlimited rules, full recall. $19/mo costs less than 20 minutes of re-fixing a mistake your agent already learned to avoid.

No credit card required. 7-day free trial. Cancel anytime. Your rules and captures stay local.

What your Pro dashboard looks like

✅ check:no-force-push — blocked 12 times
✅ check:require-tests — blocked 8 times
❌ check:no-drop-prod — FIRED (blocked DROP TABLE)
DPO pairs exported: 47 | Lessons: 23 active

Everything in Free, plus:
Visual check debugger → see every blocked action and the check that fired so you can trust the system in minutes
Auto-connect — activate once with your license key, then your running agents appear automatically on your local dashboard
DPO training data export → turn real thumbs-downs into ready-to-use preference pairs for fine-tuning (LoRA / JSONL)
HuggingFace dataset export — share PII-redacted agent traces as open training datasets (npm run export:hf)
Model Hardening Advisor — get recommendations on when and how to fine-tune your model to natively avoid recurring failures
Personal local dashboard — every Pro user gets a localhost dashboard without extra cloud setup
Review-ready workflow support — we help you wire the riskiest flows first: migrations, force-pushes, deploys, and CI

7-DAY FREE TRIAL

Start Free Trial

Upgrade to Pro — $19/mo

No credit card required. Cancel anytime. Your rules and captures stay local.

Team

$49/seat/mo

3-seat minimum · One engineer's correction protects the whole team

When one engineer teaches the agent not to delete staging data, that lesson applies to every agent on the team. Stop paying the same mistake tax across different developers.

Start with one repo, one workflow, one repeat failure at $49/seat.

Workflow hardening sprint — map one painful workflow, one repeated failure, and one buyer proof review before wider rollout
Shared enforcement memory — a shared lesson database where one developer's 👎 on a bad migration protects every agent on the team
Team lesson export/import — export lessons from one project, import into another. Deduplication, provenance tracking, and team-import tagging built in. One team's hard-won lessons become every team's prevention rules
Org dashboard — active agents, check hit rates, risk agents, and proof-backed team metrics in one place
Hosted review views — constrained cards, lists, and callouts for rollout, incident, and audit visibility
Check template library — pre-built guardrails for force-pushes, skipped tests, destructive SQL, and evidence-before-done
Docker Sandboxes guidance — route risky local autonomy into an isolated microVM-backed lane instead of running it directly on a shared host
Signed hosted sandbox dispatch — isolated execution path for team automations that do not need repo-bound local access
Release confidence story — Changesets, SemVer, version sync, and verification evidence keep publishes and rollout claims inspectable
Proof pack — attach verification evidence and rollout diagnostics so the buyer does not have to trust a demo

Start Workflow Hardening Sprint

$49/seat/mo with a 3-seat minimum. Start with a 30-minute intake around one real blocker.

FAQ

Common questions

Does ThumbGate support model fine-tuning?

Yes. ThumbGate Pro includes a Model Hardening Advisor and LoRA JSONL export. Pro users can export their episodic memory as DPO (Direct Preference Optimization) pairs to fine-tune local models (like Llama 3 or Mistral) so they natively avoid repeating known mistakes.

How is ThumbGate different from model-training feedback loops?

ThumbGate's intelligence is context, not weights. It doesn't touch the model — it injects past feedback into context so your agent is conditioned by your corrections. Think of it as a behavioral immune system, not a training pipeline. The check blocks are hard enforcement, not soft suggestions.

What's the tech stack?

SQLite+FTS5 lesson DB for fast full-text search. MemAlign-inspired dual recall (principle-based rules + episodic context). Thompson Sampling for adaptive check sensitivity per failure domain. LanceDB + Apache Arrow for local vector search with Hugging Face embeddings. ContextFS for context assembly. Bayesian belief updates on each memory. PreToolUse hook enforcement blocks known-bad actions before execution. All local-first — no cloud required.

What AI agents and editors does this work with?

ThumbGate works with Claude Code, Cursor, Codex, Gemini CLI, Amp, Cline, OpenCode, and any other MCP-compatible agent. Cursor ships with a plugin bundle in this repo. Codex now ships both a standalone plugin bundle and a repo-local app plugin profile, and the published download is linked directly from this page. VS Code works when you run an MCP-compatible agent inside it, but this repo does not ship a standalone VS Code extension today.

Do I have to chat inside the ThumbGate GPT for enforcement?

No. The ThumbGate GPT is the ChatGPT entrypoint for checking proposed actions, capturing thumbs-up/down lessons, and getting setup help. Use it for advice and checkpointing; hard enforcement still runs locally where the agent executes after npx thumbgate init.

Why does the ChatGPT ads rollout matter to ThumbGate?

OpenAI began testing ads in ChatGPT in the US on February 9, 2026, and Digiday reported CPC bidding on April 21, 2026. That makes trust and measurement more important around AI-assisted decisions. ThumbGate gives teams a hard boundary between conversational discovery and risky local execution, so a suggested action still has to pass a real check before it runs. Read the full positioning guide.

How do we keep high-risk autonomous runs off the host?

ThumbGate is the control plane, not just a prompt layer. Workflow Sentinel predicts blast radius before execution, and risky local autonomy can be routed into Docker Sandboxes instead of running directly on the host. Team workflows also have a signed hosted sandbox lane for isolated dispatch when local repo access is not required.

How do we trust a new package release?

ThumbGate does not rely on vibes. Release-relevant PRs need a Changeset, SemVer rules keep version bumps honest, sync checks keep manifests aligned, proof lanes run before merge, and the exact main-branch merge commit is verified before the work is called done.

Do I need a cloud account?

No. Free keeps local enforcement on your machine with 3 daily feedback captures, 5 lesson searches, unlimited recall, checks, and hook blocking. No cloud account is required. The business starts when a team wants shared rules, approval boundaries, hosted review views, org dashboard visibility, and proof that survives handoffs. Pro is the optional solo side lane for a personal dashboard, DPO export, and team lesson export/import — share lessons across projects so one team's mistakes become every team's prevention rules.

What if my thumbs-down is vague?

For the current Claude auto-capture hook, ThumbGate can reuse up to 8 prior recorded entries and the failed tool call for a vague thumbs-down, then keep a linked 60-second feedback session open for later clarification. The timer resets when more context arrives, so the lesson stays attached to one feedback record instead of fragmenting into duplicates.

How are pre-action checks different from prompt rules?

Prompt rules are a starting point, not a finish line. Without prompt evaluation you do not know whether they still hold up under real tool use. ThumbGate adds the human-in-the-loop measurement loop and the enforcement layer: proof lanes, ThumbGate Bench, and self-heal checks show whether behavior improved, and Pre-Action Checks block the action before execution when it did not.

What does Pro cost?

Pro is $19/mo or $149/yr for individual operators. Team is $49/seat/mo with a 3-seat minimum. Both start with a 7-day free trial, no credit card required.

Stop paying $ for the same AI mistake.

Go Pro — one correction, every agent, every session.

Block your first repeated AI mistake in 5 minutes.

See the footer before you ship the next repeat.

The fastest path for Claude users: install the extension and start blocking mistakes.

Use the GPT as a preflight desk for risky commands, refunds, deploys, and PR actions.

One gateway across the agent surfaces you already use

🧩 Claude Desktop Extension

⚡ Claude Code Skill

🤖 AI CLIs

☁️ Google Data Agent Kit

🧩 Codex plugin

🎯 Cursor plugin

✏️ Editor workflows

🗂️ MCP Server Directory

💬 ChatGPT GPT Actions

See the enforcement before you buy anything

🔍 Live Dashboard Demo

⛔ Check Reasoning Chains

📊 Org Dashboard (Team)

🧱 Isolated Execution Lanes

🧪 Thompson Sampling

🪞 History-Aware Lessons

Enforcement is the missing layer in AI orchestration.

Broad orchestration platforms

ThumbGate

The stack that makes sense

How buyers discover ThumbGate in search and AI answers

AI Orchestration vs Enforcement

ThumbGate for Platform Teams

ThumbGate for Regulated Workflows

ThumbGate vs SpecLock

ThumbGate vs Mem0

What Are Pre-Action Checks?

AI Agent Harness Optimization

AI Search Topical Presence

Best Tools to Stop AI Agents From Breaking Production

ChatGPT Ads Need Pre-Action Checks

Relational Knowledge in AI Recommendations

Browser Automation Safety for AI Agents

Native Messaging Host Security

Claude Code Feedback Memory That Enforces

How to Stop AI Agents From Repeating Mistakes

Cursor Guardrails That Block Repeat Failures

Codex CLI Guardrails That Actually Enforce

Gemini CLI Memory That Leads to Enforcement

Autoresearch Safety for Self-Improving Agents