ThumbGate / Blog

Back to home

Dual-Signal Feedback: Why "What Failed" Isn't Enough

Standard thumbs-down tells you something went wrong. But was it a bad decision (wrong tool) or bad execution (right tool, wrong parameters)?

Inspired by Gen-Searcher's dual reward system, ThumbGate now supports an optional failureType field on capture_feedback:

Thompson Sampling creates separate sub-arms (e.g., git:decision and git:execution) so reliability scores diverge per dimension. An agent might be great at choosing git commands but bad at parameterizing them — now you can see that distinction.

Backward compatible. Existing feedback without failureType works unchanged.

Try it now

The Claude Code Leak Proves Why Pre-Action Gates Matter

Anthropic accidentally shipped 512,000 lines of Claude Code source inside an npm package. A missing .npmignore exposed the full agent architecture: tool-call loops, permission models, retry logic, 44 unreleased feature flags.

Within 24 hours, a clean rewrite called Claw-code hit 100K GitHub stars — the fastest-growing repo in GitHub history.

What the leak revealed about agent security

Claude Code has a sophisticated permission model and tool-calling pipeline. What it does not have is feedback-driven enforcement — the ability to learn from past mistakes and physically block the agent from repeating them.

That's exactly what ThumbGate does. Every Claude Code user — and every Claw-code user — can add pre-action gates today:

Install in 30 seconds

npx thumbgate init works with Claude Code, Claw-code, Cursor, Codex, Gemini, Amp, and any MCP-compatible agent.

The leak proves agents are powerful but fallible software. Memory without enforcement is a suggestion. ThumbGate is a guarantee.

Install ThumbGate

v0.8.5: Gate Reasoning Chains, Org Dashboard, and the Checkout Funnel That Didn't Exist

ThumbGate v0.8.5 is our biggest release yet. Here's what shipped and why.

The problem we didn't see

~1,700 developers install ThumbGate via npm every month. Zero of them ever saw a checkout button. They find the GitHub README, run npx thumbgate init, use it for free, and never visit the landing page. The checkout flow nobody reaches is irrelevant. We were optimizing a storefront in a building with no door.

Gate reasoning chains

Every gate block and warning now explains WHY it fired. When ThumbGate blocks a git push --force, the response includes:

This was inspired by the neuro-symbolic explainability trend in production AI systems. Gates are the symbolic rules; Thompson Sampling provides the statistical confidence. The reasoning chain bridges both.

Multi-agent org dashboard

"I'm not going to have 10,000 agents running in the environment that I don't know what they're doing." — CIO.com, March 2026

The new org_dashboard MCP tool aggregates gate decisions across all registered agent sessions. CIOs and team leads see: total active agents, org-wide adherence rate, top blocked gates, and risk agents (those with the lowest adherence). Free tier shows 3 agents; Pro shows the full org.

Multi-hop agentic retrieval

Inspired by Chroma's Context-1, constructMultiHopPack iteratively retrieves context, prunes weak chunks, refines the query with expansion terms, and checks coverage — stopping when the coverage threshold (60%) is met or max hops are reached. Each hop is logged.

Thompson Sampling calibration

MIN_SAMPLES_THRESHOLD (5) prevents low-sample overconfidence. getCalibration() reports per-category confidence tiers: none (0 samples), low (1-4), medium (5-19), high (20+). Callers know when to trust the statistical arm vs fall back to rules.

The funnel fix

Four touchpoints now put the checkout URL where 100% of npm users actually are:

13 funnel invariant CI tests prevent this blindspot from ever regressing.

Install v0.8.5 on npm