snitchbot · notifications · telemetry for python

Not Sentry. snitchbot.
Crashes, load, anomalies
delivered to Telegram.

Drop-in telemetry for any Python service. Exceptions, slow calls, watchdog stalls, RSS / CPU / FD anomalies, and your own notify() calls — streamed to a Telegram chat you already have open. No DSN, no dashboards to host, no blocking calls in your hot path.

$ uv add snitchbot
30-second setup ->
in your process
<10MB
The client — thin, synchronous-safe, fire-and-forget. One dependency (msgpack). Sits inside your Python service, adds no blocking calls.
one per service, isolated
~43MB
An async sidecar spawned alongside your app. Handles Telegram, dedup, rate-limit, anomaly detection — in its own process, so a bug in observability never crashes production.

Every signal,
with the code that triggers it.

Snippets pulled from examples/ — real working scripts, not pseudo-code. Pick a case to see the setup and the verbatim alerts it produces.

orders_api.py
import snitchbot

snitchbot.init("orders-api")

# Unhandled exceptions are captured automatically,
# including stack, thread, and origin.

async def list_orders(user_id: int) -> list:
    # svc is your own service layer; the failure below originates there
    return await svc.fetch_all(user_id)

# Somewhere down the stack:
#   raise DatabaseConnectionError("refused to ...")
# -> snitchbot captures and sends the alert below.
watch_slow_demo.py
import asyncio
import time
import snitchbot


@snitchbot.watch_slow(threshold_ms=100)
async def fetch_user_profile(user_id: int) -> dict:
    await asyncio.sleep(0.25)  # 250 ms > threshold
    return {"name": "Alice"}


@snitchbot.watch_slow(threshold_ms=500)
def generate_report() -> str:
    time.sleep(0.6)  # sync, also captured
    return "report-data"


async def main():
    snitchbot.init("slow-demo")
    await fetch_user_profile(42)
    generate_report()


if __name__ == "__main__":
    asyncio.run(main())
worker.py
import snitchbot
from snitchbot import AnomalyConfig, WatchdogConfig

snitchbot.init("watchdog-demo")

# Zero-config: watchdog is on, threshold 500 ms,
# auto-escalates to error at 2 s, critical at 5 s.

# Or, replacing the call above, full config with custom thresholds:
snitchbot.init(
    "watchdog-demo",
    anomaly=AnomalyConfig(
        watchdog=WatchdogConfig(
            threshold_ms=500,            # 🟠 warning
            error_threshold_ms=2000,      # 🔴 error
            critical_threshold_ms=5000,   # 🟣 critical
            escalation_window="1m",
            cooldown_sec=5,
        ),
    ),
)
vitals_config.py
import snitchbot
from snitchbot import (
    AnomalyConfig,
    RssAnomalyConfig,
    CpuAnomalyConfig,
    FdAnomalyConfig,
    ThreadAnomalyConfig,
)

snitchbot.init(
    "anomaly-demo",
    sample_interval_sec=5,
    anomaly=AnomalyConfig(
        rss=RssAnomalyConfig(
            duration="1m", baseline_duration="30m",
            max_mb=450,          # 🔴 ceiling
            spike_ratio=1.5,      # 🟠 +50% vs baseline
            min_spike_mb=50,      # and ≥ 50 MB absolute
        ),
        cpu=CpuAnomalyConfig(
            duration="2m", baseline_duration="20m",
            max_percent=90,       # 🔴 ceiling
            spike_ratio=2.5,       # 🟠 spike
            min_spike_delta=30,    # ≥ 30 pp
        ),
        fds=FdAnomalyConfig(
            max_fds=800,          # 🔴 ulimit guard
            spike_ratio=1.5,       # 🔴 fd leak
            drop_ratio=0.5,        # 🟠 pool collapse
        ),
        threads=ThreadAnomalyConfig(
            max_threads=100,
            spike_ratio=1.5,
        ),
    ),
)
app.py
import snitchbot

snitchbot.init("orders-api")

# ▶ lifecycle("startup", reason="init")  — sent immediately

# ...your service does its thing...

# On any of these paths, a shutdown event is emitted:
#  · Clean exit      -> reason="clean_exit", exit_code=0
#  · SIGTERM / SIGINT -> reason="sigterm" / "sigint"
#  · Uncaught crash   -> reason="crash" (+ traceback)
#  · Thread crash     -> reason="thread_crash"

# Nothing to call, nothing to decorate.
checkout.py
import snitchbot

snitchbot.init("notify-demo")

# Warning with extras — renders as a meta-table
snitchbot.notify(
    "Starting checkout process",
    severity="warning",
    extras={"cart_size": 3, "user": "Alice"},
)

# Error with live traceback
try:
    _ = 1 / 0
except ZeroDivisionError:
    snitchbot.notify(
        "Division failed in payment calculator",
        severity="error",
        exc_info=True,
    )
handler.py
import asyncio
import snitchbot


@snitchbot.watch_slow(threshold_ms=100)
async def call_payment_api(amount: float) -> str:
    await asyncio.sleep(0.2)
    return "txn-12345"


async def handle_request(request_id: str, user_id: int):
    with snitchbot.request_context(
        trace_id=request_id,
        user_id=user_id,
        action="checkout",
    ):
        snitchbot.notify(
            "User started checkout",
            extras={"cart_size": 3},
        )
        await call_payment_api(99.99)  # inherits ctx
app.py
import logging
import snitchbot

snitchbot.init("log-demo")
snitchbot.setup_logging()  # WARNING+ -> Telegram

# Or, for structlog users:
# processor = snitchbot.setup_structlog()
# structlog.configure(processors=[..., processor])

logger = logging.getLogger("myapp")

# Extras become a meta-table in the alert
logger.warning(
    "Cache miss rate too high",
    extra={"miss_pct": 42},
)

# exc_info=True attaches the traceback
try:
    _ = 1 / 0
except ZeroDivisionError:
    logger.error("Calculation failed", exc_info=True)

# Inside request_context — trace_id attached automatically
with snitchbot.request_context(trace_id="req-abc-123"):
    logger.warning("Slow DB query in checkout")
main.py
import snitchbot

snitchbot.init("web-demo")
snitchbot.setup_logging()

# ── FastAPI ───────────────────────────────────
from fastapi import FastAPI
from snitchbot.integrations.fastapi import install

app = FastAPI()
install(app)

# ── Flask ─────────────────────────────────────
# from flask import Flask
# from snitchbot.integrations.flask import install
# app = Flask(__name__); install(app)

# ── Litestar ──────────────────────────────────
# from litestar import Litestar
# from snitchbot.integrations.litestar import install
# app = Litestar(route_handlers=[...]); install(app)


@app.post("/checkout")
async def checkout(cart_value: int = 100):
    snitchbot.notify("Large checkout",
                     extras={"cart_value": cart_value})
    return {"status": "processing"}


@app.get("/search")
async def search(query: str):
    raise ValueError("Unknown search backend")
snitchbot · orders-api
bot · private
Apr 17
🔴 crash · orders-api · a1b2c3 × 2
DatabaseConnectionError: connection refused to 10.0.0.5:5432
Details
first 14:18:42 UTC (24m ago) last 14:42:10 UTC (just now) pid 101 thread MainThread origin sys_excepthook
Stack (top 3 user frames)
app/db/pool.py:47 in acquire()
  conn = await self._pool.get()
app/services/orders.py:88 in fetch_all()
  return await db.fetch(q)
app/routes/orders.py:12 in list_orders()
  return await svc.fetch_all()
📋 full trace · 🔕 mute 1h
14:42
Message
/status /last /mute
snitchbot · slow-demo
bot · private
Apr 17
🟠 slow call · slow-demo · ee48d4
__main__.fetch_user_profile took 250 ms (threshold 100 ms)
Details
time 12:57:18 UTC pid 1738 is_async true location watch_slow_demo.py:8
12:57
🟠 slow call · slow-demo · f9c2a1
__main__.generate_report took 612 ms (threshold 500 ms)
Details
time 12:57:20 UTC pid 1738 is_async false location watch_slow_demo.py:14
12:57
Message
/status /last /mute
snitchbot · watchdog-demo
bot · private
Apr 17
🟠 watchdog · watchdog-demo · 7c6497
Event loop blocked for 588 ms (threshold 500 ms)
Details
time 11:25:10 UTC pid 1580 loop main
11:25
🔴 watchdog · watchdog-demo · 7c6497 × 2
Event loop blocked for 2 690 ms (threshold 500 ms)
Details
first 11:25:10 UTC (40s ago) last 11:25:20 UTC (just now) pid 1580 loop main
📋 full trace
11:25
🟣 watchdog · watchdog-demo · 732334
Event loop blocked for 5 699 ms (threshold 500 ms)
Details
time 11:25:24 UTC pid 1580 loop main
Stuck tasks (2)
Innocent-Worker · background_task
  worker.py:55 in background_task()
Task-1 · main
  worker.py:97 in main()
📋 full trace
11:25
Message
/status /last /mute
snitchbot · anomaly-demo
bot · private
Apr 17
🟠 anomaly · anomaly-demo · a7af9c × 2
RSS spike: 183 MB (baseline 70 MB, +160%)
Details
time 11:17:40 UTC pid 1550 type rss_spike window 1m baseline 70 MB current 183 MB
11:17
🔴 anomaly · anomaly-demo · c82b14
CPU ceiling: 94% (limit 90%)
Details
time 11:21:02 UTC pid 1550 type cpu_ceiling window 2m baseline 18% current 94%
11:21
🔴 anomaly · anomaly-demo · 91e7da
FD leak: 40 -> 820 (+780)
Details
time 11:24:55 UTC pid 1550 type fds_spike window 5m baseline 40 current 820
11:24
🟠 anomaly · anomaly-demo · 44c2fb
Thread spike: 8 -> 45 (+462%)
Details
time 11:28:11 UTC pid 1550 type threads_spike window 1m baseline 8 current 45
11:28
Message
/status /last /mute
snitchbot · orders-api
bot · private
Apr 17
orders-api started
━━━━━━━━━━━━━━━━━━
pid 101 time 10:00:14 UTC
10:00
orders-api stopped
━━━━━━━━━━━━━━━━━━
pid 101 reason clean_exit exit_code 0 time 10:42:55 UTC
10:42
orders-api (worker) started
━━━━━━━━━━━━━━━━━━
pid 198 time 11:01:02 UTC
11:01
orders-api (worker) stopped
━━━━━━━━━━━━━━━━━━
pid 198 reason sigterm exit_code 0 time 11:30:41 UTC
11:30
orders-api crashed
━━━━━━━━━━━━━━━━━━
pid 254 reason crash time 13:02:18 UTC
13:02
Message
/status /last /mute
snitchbot · notify-demo
bot · private
Apr 17
🟠 notify · notify-demo · f966e3
Starting checkout process
Details
time 12:42:47 UTC pid 1718 caller checkout.py:6 in main()
Extras
cart_size 3 user Alice
12:42
🔴 notify · notify-demo · 2eec9c
Division failed in payment calculator
Details
time 12:52:35 UTC pid 1732 caller checkout.py:14 in main()
Exception: ZeroDivisionError: division by zero
Traceback (most recent call last):
  File "checkout.py", line 13, in main
    _ = 1 / 0
ZeroDivisionError: division by zero
12:52
Message
/status /last /mute
snitchbot · context-demo
bot · private
Apr 17
🟠 notify · context-demo · 156afe
User started checkout
Details
time 12:53:41 UTC pid 1733 caller handler.py:15 in handle_request()
Extras
cart_size 3
Context
trace_id req-abc-123 user_id 42 action checkout
12:53
🟠 slow call · context-demo · ee48d4
__main__.call_payment_api took 201 ms (threshold 100 ms)
Details
time 12:57:18 UTC pid 1733 is_async true location handler.py:6
Context
trace_id req-abc-123 user_id 42 action checkout
12:57
Message
/status /last /mute
snitchbot · log-demo
bot · private
Apr 17
🟠 log.warning · log-demo · c1b4e8
Cache miss rate too high
Details
time 14:02:11 UTC pid 2104 caller app.py:15 in <module>()
Extras
miss_pct 42 logger myapp level WARNING
14:02
🔴 log.error · log-demo · d9a3f1
Calculation failed
Exception: ZeroDivisionError: division by zero
Traceback (most recent call last):
  File "app.py", line 23, in <module>
    _ = 1 / 0
ZeroDivisionError: division by zero
14:02
🟠 log.warning · log-demo · 7b2c8d
Slow DB query in checkout
Context
trace_id req-abc-123
14:02
Message
/status /last /mute
snitchbot · web-demo
bot · private
Apr 17
🟠 notify · web-demo · 5ec8a2
Large checkout
Details
time 15:12:04 UTC pid 2211 caller main.py:22 in checkout()
Extras
cart_value 500
Context
http_method POST path /checkout client_ip 10.0.0.14 trace_id a3f7-b2c9
15:12
🔴 crash · web-demo · bb9d31
ValueError: Unknown search backend
Details
time 15:13:22 UTC pid 2211 origin fastapi_middleware
Context
http_method GET path /search client_ip 10.0.0.14 trace_id d2e8-c4a1 query {"query":"test"}
Stack (top 2 user frames)
main.py:28 in search()
  raise ValueError("Unknown search backend")
📋 full trace · 🔕 mute 1h
15:13
Message
/status /last /mute
Uncaught exceptions — captured automatically.
Zero user code beyond snitchbot.init(). Hooks into sys.excepthook, threading.excepthook, and the asyncio exception handler. Works in main thread, worker threads, async tasks. Fork-safe.
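Hooking uncaught exceptions at the interpreter level looks roughly like this — a toy sketch of the sys.excepthook mechanism only, not snitchbot's actual client code (which also covers threads and asyncio):

```python
import sys

captured: list[str] = []
previous_hook = sys.excepthook


def snitch_hook(exc_type, exc, tb):
    # Record the crash (snitchbot would serialize and send it to the sidecar)...
    captured.append(f"{exc_type.__name__}: {exc}")
    # ...then chain to the previous hook so default behaviour survives.
    previous_hook(exc_type, exc, tb)


sys.excepthook = snitch_hook

try:
    1 / 0
except ZeroDivisionError as exc:
    # Simulate an uncaught exception reaching the hook.
    sys.excepthook(type(exc), exc, exc.__traceback__)

print(captured)
```

Chaining to the previous hook is the important part: installing a reporter must not swallow the interpreter's own traceback output.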
Functions that crossed the threshold — sync or async.
Decorate with @snitchbot.watch_slow(threshold_ms=...). Fast path untouched — the alert fires only when duration exceeds the threshold. Works for sync functions too.
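A threshold decorator like this is simple to sketch — what follows is an illustrative reimplementation of the idea, not snitchbot's code (which reports to the sidecar instead of a list):

```python
import asyncio
import functools
import time

alerts: list[str] = []


def report(name: str, took_ms: float, threshold_ms: float) -> None:
    # In snitchbot this would be shipped to the sidecar; here we just record it.
    alerts.append(f"{name} took {took_ms:.0f} ms (threshold {threshold_ms:.0f} ms)")


def watch_slow(threshold_ms: float):
    """Alert when a call exceeds threshold_ms; sync and async both supported."""
    def decorator(fn):
        if asyncio.iscoroutinefunction(fn):
            @functools.wraps(fn)
            async def async_wrapper(*args, **kwargs):
                start = time.perf_counter()
                try:
                    return await fn(*args, **kwargs)
                finally:
                    took = (time.perf_counter() - start) * 1000
                    if took > threshold_ms:  # fast calls never reach report()
                        report(fn.__qualname__, took, threshold_ms)
            return async_wrapper

        @functools.wraps(fn)
        def sync_wrapper(*args, **kwargs):
            start = time.perf_counter()
            try:
                return fn(*args, **kwargs)
            finally:
                took = (time.perf_counter() - start) * 1000
                if took > threshold_ms:
                    report(fn.__qualname__, took, threshold_ms)
        return sync_wrapper
    return decorator


@watch_slow(threshold_ms=50)
def slow_report() -> str:
    time.sleep(0.08)  # 80 ms, over the 50 ms threshold
    return "report-data"


slow_report()
print(alerts)
```

The fast path pays only one clock read on entry and one on exit; nothing else happens unless the threshold is crossed.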
Event loop latency, measured by a configurable watchdog coroutine. Alerts past 500 ms by default.
A lightweight pinger coroutine updates a monotonic timestamp every 100 ms; a separate observer checks the gap. Any stall past threshold_ms is reported with every stuck task's stack. Multi-threshold severity: threshold_ms -> 🟠 warning, error_threshold_ms -> 🔴 error, critical_threshold_ms -> 🟣 critical. All defaults sensible — zero-config works, full-config unlocks the 3-tier escalation.
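The pinger/observer split described above fits in a few stdlib lines — a toy version of the mechanism only (snitchbot's real watchdog adds the three-tier escalation and stack capture):

```python
import asyncio
import threading
import time

state = {"last_ping": time.monotonic()}
stalls: list[float] = []


async def pinger(interval: float = 0.02):
    # Runs on the event loop: if the loop is blocked, this stops ticking.
    while True:
        state["last_ping"] = time.monotonic()
        await asyncio.sleep(interval)


def observer(threshold: float, stop: threading.Event):
    # Runs in its own thread, so a blocked loop can't silence it.
    while not stop.is_set():
        gap = time.monotonic() - state["last_ping"]
        if gap > threshold:
            stalls.append(gap)
        time.sleep(0.02)


async def main():
    stop = threading.Event()
    threading.Thread(target=observer, args=(0.2, stop), daemon=True).start()
    ping = asyncio.create_task(pinger())
    await asyncio.sleep(0.1)  # healthy: pinger ticks, observer sees no gap
    time.sleep(0.5)           # simulate blocking the event loop
    await asyncio.sleep(0.1)  # loop is live again, pinger resumes
    stop.set()
    ping.cancel()


asyncio.run(main())
print(f"worst stall: {max(stalls):.2f}s")
```

The key design point is that the observer lives off the loop: an event loop cannot report its own freeze, but a thread watching a monotonic timestamp can.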
A richly-configurable vitals detector — RSS, CPU, FDs, threads.
Every metric gets three independent modes: ceiling (hard limit), spike (relative growth vs baseline), drop (relative decline). Windows and baselines are time-based ("15s", "1m", "1h"). The sidecar samples psutil every 5 s (tunable via sample_interval_sec). Turn any mode off by passing None — or skip the config entirely for sensible defaults.
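Per the above, each mode switches off independently by passing None — a config fragment sketched from the field names shown in vitals_config.py on this page:

```python
import snitchbot
from snitchbot import AnomalyConfig, CpuAnomalyConfig, RssAnomalyConfig

snitchbot.init(
    "anomaly-demo",
    sample_interval_sec=10,     # sample psutil every 10 s instead of the default 5
    anomaly=AnomalyConfig(
        rss=RssAnomalyConfig(
            max_mb=450,         # keep the hard ceiling...
            spike_ratio=None,   # ...but turn spike detection off
        ),
        cpu=CpuAnomalyConfig(
            max_percent=None,   # no ceiling: only spike detection for CPU
            spike_ratio=2.5,
        ),
    ),
)
```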
Know when a service starts, stops, or dies.
Emitted automatically by snitchbot.init() and registered atexit / signal handlers. You see startup, clean exits, graceful shutdowns (SIGTERM / SIGINT), and crashes — with pid, role (worker / standalone), reason, and exit code. Multiworker-aware: gunicorn / uvicorn workers get their own role suffix.
Send anything — warnings, errors, business events.
Call snitchbot.notify(text, severity, extras, exc_info). Severity drives the icon (🟠/🔴/🟣) and rate-limit bucket. exc_info=True attaches the current traceback.
Attach trace_id + metadata to every alert in scope.
Wrap code in with snitchbot.request_context(trace_id=..., **extras). Everything inside — notify(), @watch_slow, crash reports — inherits the context. Propagates across await, create_task, and nested calls. Frameworks (FastAPI / Flask / Litestar) set this automatically per request.
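Context that survives await and create_task is exactly what Python's contextvars module provides — asyncio snapshots the current context into every new task — and snitchbot presumably builds on the same mechanism. A stdlib-only sketch of that propagation:

```python
import asyncio
import contextvars

trace_id: contextvars.ContextVar[str] = contextvars.ContextVar("trace_id", default="-")
captured: list[str] = []


async def call_payment_api():
    # No arguments threaded through: the value rides in the task's context.
    captured.append(trace_id.get())


async def handle_request(request_id: str):
    token = trace_id.set(request_id)
    try:
        await call_payment_api()                       # plain await inherits it
        await asyncio.create_task(call_payment_api())  # create_task snapshots it
    finally:
        trace_id.reset(token)


asyncio.run(handle_request("req-abc-123"))
print(captured)
```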
stdlib logging and structlog — both bridged to Telegram.
One line — snitchbot.setup_logging() — attaches a handler to Python's logging. WARNING+ records become notifications, keeping level, message, extras, and exc_info. For structlog, call snitchbot.setup_structlog() and add the returned processor to your chain. Inside a request_context, trace_id is attached automatically.
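Under the hood, a bridge like this is just a logging.Handler with a level floor — a toy sketch of the mechanism, forwarding to a list where snitchbot would forward to the sidecar:

```python
import logging

forwarded: list[str] = []


class TelegramHandler(logging.Handler):
    # Toy bridge: snitchbot's handler would serialize the record
    # (level, message, extras, exc_info) and send it to the sidecar.
    def emit(self, record: logging.LogRecord) -> None:
        forwarded.append(f"{record.levelname}: {record.getMessage()}")


log = logging.getLogger("bridge-demo")
log.addHandler(TelegramHandler(level=logging.WARNING))  # WARNING+ only

log.warning("Cache miss rate too high")
log.info("ignored: below WARNING")
print(forwarded)
```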
FastAPI, Flask, and Litestar — one-line integration each.
install(app) from the matching integration module. Middleware attaches per-request context (http_method, path, client_ip, trace_id). 5xx errors auto-captured with safe headers and query params. Response gets an X-Snitchbot-Trace-Id header. Logging bridge picks up the same context inside request scope.

Charts where you look.
Data for when you dig.

The sidecar samples psutil and keeps a rolling buffer. Ask for a chart in any window from 1m to 1d, pin a live dashboard that updates itself, or pull the full history as a CSV and drop it into your notebook.

snitchbot · orders-api
bot · private
Apr 17
/chart all 5m 12:17
CPU (5m) cur=99.2% min=0.0 max=100.2
  100.20  ┼╮ ╭─╮ ╭─╮  ╭╮  ╭╮  ╭╮  ╭
   80.16  ┤│ │ │ │ │ ╭╯│ ╭╯│  ││  │
   60.12  ┤│ │ │ │ │ │ │ │ │ ╭╯│ ╭╯
   40.08  ┤│ │ │ │ │ │ │ │ ╰╮│ ╰╮│
   20.04  ┤│ │ │ │ ╰╮│ ╰╮│  ││  ││
    0.00  ┤╰─╯ ╰─╯  ╰╯  ╰╯  ╰╯  ╰╯
RSS (5m) cur=80.2MB min=43.5 max=83.5
   83.52  ┤
   76.85  ┤ ╭─╮    ╭───╮  ╭───╮   ╭
   70.18  ┤╭╯ │   ╭╯   │  │   ╰╮ ╭╯
   63.51  ┼╯  │   │    │ ╭╯    │ │
   56.84  ┤   │  ╭╯    ╰─╯     │╭╯
   50.17  ┤   │ ╭╯             ╰╯
   43.50  ┤   ╰─╯
FDs (5m) cur=26 min=14 max=34
   34.00  ┤
   30.67  ┤          ╭───────────╮
   27.33  ┤          │           │
   24.00  ┤          │           ╰─
   20.67  ┤          │
   17.33  ┼──────────╯
   14.00  ┤
Threads (5m) cur=12 min=6 max=13
   13.00  ┤
   11.83  ┤╭──╮   ╭──╮    ╭╮     ╭─
   10.67  ┼╯  ╰╮ ╭╯  │   ╭╯╰╮    │
    9.50  ┤    │ │   ╰╮ ╭╯  │   ╭╯
    8.33  ┤    ╰─╯    ╰╮│   ╰╮  │
    7.17  ┤            ╰╯    ╰╮╭╯
    6.00  ┤                   ╰╯
2026-04-17 12:16:34 UTC -> 2026-04-17 12:17:35 UTC
12:17
/export 12:18
CSV orders-api_vitals.csv 12.4 KB · Document
Vitals export: 128 samples
12:18
Message
/status /last /mute
orders-api_vitals.csv
128 samples · 5-second interval · 2026-04-17
CSV export
sampled_at cpu_pct rss_mb fds threads
12:16:34 4.1 43.5 14 6
12:16:39 62.4 71.2 18 11
12:16:44 88.6 76.8 22 12
12:16:49 100.2 83.5 26 13
12:16:54 41.0 78.3 30 11
12:16:59 79.8 82.1 34 10
12:17:04 22.7 77.9 31 9
12:17:09 94.3 80.6 28 12
12:17:14 55.1 79.1 26 11
12:17:19 99.2 80.2 26 12
...............
Drop into pandas, duckdb, or any notebook. CSV · UTF-8
/chart on demand
ASCII charts for any metric, any window.
Metrics: cpu · mem · fds · threads · all.
Windows: 1m · 5m · 15m · 1h · 6h · 1d.
Usage: /chart all 1h
live dashboard pinned · auto-updating
A single pinned message, always current.
Set live_dashboard=True at init(). The sidecar pins one message to the chat and rewrites it every sample_interval_sec with fresh vitals, so it never clutters the history.
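Per the description above, enabling it is one keyword at init — a config fragment sketched from this page's API:

```python
import snitchbot

snitchbot.init(
    "orders-api",
    live_dashboard=True,    # sidecar pins one message and edits it in place
    sample_interval_sec=5,  # each sample rewrites the pinned vitals
)
```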
/export download
Full vitals history, as a CSV.
The sidecar streams the entire rolling buffer as <service>_vitals.csv via Telegram's sendDocument. Columns: sampled_at, cpu_pct, rss_mb, fds, threads.

Three processes,
two wires.

Your service stays thin. A detached sidecar carries the weight — HTTP, dedup, rate-limit, vitals, interactive commands. They talk over an AF_UNIX datagram socket in microseconds, so nothing you write ever waits on Telegram.

snitchbot architecture · three processes connected by two wires.

01 · host process (pid 101) · your Python service
  · your code: fastapi · litestar · flask · cli · worker — decorate with @watch_slow, call notify(), wrap scope in request_context()
  · snitchbot.client: thin · synchronous-safe · fire-and-forget
  · hooks: sys.excepthook · threading.excepthook · asyncio exception handler · signal (SIGTERM/SIGINT) · atexit
  · watchdog: pinger coroutine + observer thread
  · transport: AF_UNIX SOCK_DGRAM, non-blocking
  · footprint: < 10 MB added to your process · 1 dep (msgpack) · no httpx, no psutil

  ⇵ AF_UNIX SOCK_DGRAM · msgpack · ~µs

02 · sidecar process (pid 102) · async · isolated · does the work
  · ingest: recv loop · client registry · handshake
  · vitals: psutil sampler · anomaly detectors
  · pipeline, single event flow stage by stage: dedup (5 min · ×N) -> queue (priority) -> rate-limit (30 tok/s) -> render (HTML) -> scrub (secrets)
  · secret scrubbing sits between render and send — tokens, keys, passwords never reach Telegram
  · interactive: /status /chart /last /export /mute /test
  · live dashboard: a single pinned message, edited in place
  · footprint: ~43 MB RSS, isolated from your process · one per service · crashes here don't crash you

  ⇵ HTTPS · bot api · commands via long-poll

03 · telegram · bot api · your chat: crash, slow-call, and anomaly alerts, plus a pinned live dashboard (cpu 42% · rss 82 MB · fds 26 · threads 12, updated every 5 s) and the /status /last /mute /chart commands.
why a sidecar
Observability should never crash production.
Telegram's HTTPS can hang for seconds during degradation. Anomaly detectors need psutil. Rate-limit state needs memory. None of that belongs in your hot path — so we put it in its own process. If the sidecar dies, your app doesn't.
why AF_UNIX · SOCK_DGRAM
Fastest IPC that can't block.
SOCK_DGRAM = kernel-level datagram. A send() takes microseconds, buffered atomically by the kernel. No TCP handshake, no TLS, no user-space queue. msgpack keeps the payload compact — pydantic, httpx, psutil stay out of your process.
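The wire itself is plain stdlib — a minimal sketch of a non-blocking AF_UNIX datagram hop, using a socketpair in place of the sidecar's bound path and a JSON payload standing in for msgpack:

```python
import socket

# A connected pair of AF_UNIX datagram sockets stands in for the wire;
# the real sidecar binds a filesystem path and the client connects to it.
client, sidecar = socket.socketpair(socket.AF_UNIX, socket.SOCK_DGRAM)
client.setblocking(False)  # never wait: if the kernel buffer is full, drop

payload = b'{"event": "notify", "text": "payment failed"}'  # msgpack in reality
try:
    client.send(payload)   # one syscall, microseconds, no handshake, no TLS
except BlockingIOError:
    pass                   # sidecar backlogged: losing one alert beats stalling

msg = sidecar.recv(65536)  # datagrams arrive whole, never half a message
print(msg)
```

Dropping on BlockingIOError is the whole contract: the hot path is allowed to lose an alert but never allowed to block on observability.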
why long-polling
Your service doesn't need a public endpoint.
The sidecar calls Telegram's getUpdates — commands come back in the response. Works behind NAT, inside Docker, on a laptop. No webhook, no reverse proxy, no TLS cert. /status /chart /export /mute all travel this way.

00:30
from empty terminal to
your first alert.

No account, no DSN, no dashboards. Three steps: a Telegram bot, a chat id, and one uv add. The rest is already wired.

step :00

Talk to @BotFather.

Send /newbot, pick a name, copy the token. Takes 10 seconds, always has.
B
BotFather 17:02
/newbot
Alright, a new bot. How are we going to call it?
orders-api-alerts
Done! Use this token to access the HTTP API:
7824……:AAH…Dg
Keep your token secure — paste it into .env.
step :10

Get your chat id.

Message @userinfobot — it replies with your numeric id. For groups, add your new bot and use the group id instead.
userinfobot 17:03
/start
@your_handle
Id: 1387261905
Language: en
step :20

Install. Initialise.

Drop the token and chat id into .env, add one line to your Python — done. init() reads env vars and spawns the sidecar.
$ uv add snitchbot
# .env
SNITCHBOT_TOKEN="7824…:AAH…Dg"
SNITCHBOT_CHAT_ID="1387261905"

# app.py
import snitchbot
snitchbot.init("orders-api")
:30
and then —
orders-api started
━━━━━━━━━━━━━━━━━━
pid 101 time 2026-04-19 17:03:42 UTC
17:03
Your service just introduced itself. Everything else is already watching.