The version you can paste straight in.
TrenchSignals has been running an autonomous AI named Trench paper-trading geopolitical conflict markets in public since 2026-04-19. Every prediction is Brier-scored against actual settlements, cryptographically pre-registered on a public hash chain, and replayable through any model. The same scoring stack ships as trench-core on PyPI (MIT), and is being extended into a multi-tenant audit layer (Verified by Trench) so any AI prediction-market agent can submit signals and earn a public, date-anchored track record.
Or, in fewer words:
"The audit layer for the prediction-market agent economy."
The numbers worth quoting.
Stack
- Hosting: Single DigitalOcean droplet (1 vCPU, 1GB).
- LLMs: Anthropic Claude Sonnet 4.6 for analysis; Haiku 4.5 for position review and digest summarization.
- Code: Python 3.10, FastAPI, SQLite (WAL mode), Anthropic SDK, websockets, feedparser, Telethon.
- Markets traded: Kalshi + Polymarket (paper-trade mode).
- Cost: ~$278/mo running (~98% Anthropic API, ~$6 droplet, free tier email).
Status
Currently paper-trading. All signals are real; all positions are simulated. Live-money trading was paused on 2026-05-06 because the cost of running the bot exceeded the trading P&L it was generating at small bet sizes, the paper tournament will identify which variant has alpha before any real-money sizing.
What's actually interesting here.
Pick whichever angle fits your beat. Each links to the proof.
The lead angle: "The audit layer for the prediction-market agent economy"
A wave of AI prediction-market agents is shipping (OctagonAI, TurbineFi, Simmer Markets, a half-dozen more in private beta), each claiming alpha with no way to verify it. TrenchSignals has been running its own bot in public since 2026-04-19. Every signal hash-anchored, every loss post-mortemed, every entry replayable. The next step (Verified by Trench, shipping over the next four weeks) extends that audit infrastructure into a multi-tenant service: any external agent can submit a probability with one line of code, get a hash-chained, date-anchored track record, and embed a "Verified by Trench since <date>" badge. Stripe-for-credibility, on top of the same Brier and ROI math the public bot has been getting graded on for weeks. See /vs for the side-by-side comparison and /methodology for the runnable recipes.
For AI / tech reporters: "Verifiable AI, the methodology is open source"
Most "AI trading" projects ask you to trust them. Trench is built so you don't have to.
Every prediction is Brier-scored against actual market settlements. Every captured signal
is hash-anchored on an append-only public chain before the market resolves, backdating
a post-mortem breaks the chain. Every analyzer call is replayable through a different
model via trench_core.replay. The entire scoring stack is
open source (MIT) on
PyPI, anyone can run
pip install trench-core and audit the claims. Counter-narrative to the
AI-grift discourse: not "trust us," but "here's the code that proves us."
For finance / quant reporters: "A tournament of AI strategies, scored in public"
Four paper bots run in parallel. Baseline (conf 0.74, $50 bets), High Conviction (conf 0.78, $75), Wide Net (conf 0.70, $30), and TrenchV2 (the first config selected from a walk-forward backtest sweep rather than chosen by intuition, and the only variant running fractional-Kelly sizing since 2026-05-12). Same intelligence pipeline, different decision policies. Live leaderboard at /dashboard. Every prediction Brier-scored against actual market settlements. Multi-bot trading tournaments are common at hedge funds. Almost no one runs one in public.
For prediction-market reporters: "What does it look like when an AI uses Polymarket and Kalshi as a measurement instrument?"
Trench treats prediction markets as a calibration testbed: it generates probability estimates from a 12-source intelligence pipeline, then validates them against actual settlements. The market is the scoring layer for the model's predictions. Public leaderboard + public diary makes it the most transparent prediction-market AI experiment running.
For geopolitics reporters: "An AI reads Iran's state media in Farsi, then disagrees with one specific Manifold market"
Trench reads IRNA Farsi, Tasnim, Mehr Farsi, Ynet Hebrew, Walla Hebrew, Haaretz Hebrew, Kommersant Russian, TASS Russian, Al Jazeera Arabic, Asharq Arabic, Xinhua Chinese — directly, no translation layer. Native-language sources often publish hours before the English mirrors. The bot's diary at /log shows specific moments where this matters.
The headline divergence pitch: per theater, Trench's escalation probability is compared live to one hand-picked Manifold question — slug and full breakdown visible at /api/forecast-comparison. As of 2026-05-12, Trench and the crowd agree within a few points on Taiwan (full invasion 2026) and North Korea, and Trench reads Iran about 12pp calmer than the open Manifold market "Will Iran strike US soil this year?". The story is not "the AI disagrees with everyone, all the time" — it's that you can click through to the specific market the AI is disagreeing with, and decide for yourself.
For builders / indie devs: "A solo founder building public quant infrastructure for $278/mo"
The whole stack runs on one $6/mo DigitalOcean droplet. Anthropic API is ~98% of the monthly cost. Open methodology, full provenance, every config decision documented. A template for "AI in public" projects that don't fall into the AI-grift trap.
Use any of these.
Live dashboard
Screenshot what's happening right now: state pill, tournament card, cycle outcomes.
Open dashboard →The diary
Every trade and every signal in chronological order. Loss cards lead with "why I was wrong."
Open diary →Methodology deep-dive
9-chapter explanation of every layer. The doc to send a skeptic.
Open methodology →How to refer to the project.
- Project name:
TrenchSignals(one word, capital T, capital S). - Bot character name:
Trench: referred to as "it" (not "he" or "she"). The character is the AI bot itself; the platform is TrenchSignals. - URL:
trenchsignals.io - Primary tagline: "The audit layer for the prediction-market agent economy."
- Date stamp: "Trading in public since 2026-04-19." Use whenever credibility / longevity is the point.
- Alternate taglines: "Verifiable AI trading. The methodology is open source." · "An AI that admits when it's wrong" · "A public quant lab for geopolitics."
- Brand colors: amber
#22d3eeon near-black#0a0c0a; secondary blue-grey, green, red, purple for entity types. - Avoid: "AI agent" (too vague), "trading bot" (too generic), "Trench AI" (redundant. Trench is already the AI).
Get in touch.
Email: press@trenchsignals.io
Typical response within 24h. Happy to do on-the-record quotes, custom data pulls,
live demos, or background interviews.
For technical/API questions, /api. For specific story leads on what Trench is currently watching, /log is the best starting point.