Configuration¶

sembr uses pydantic-settings with a four-level priority chain (highest wins):

shell environment variables
    │
.env file
    │
sembr.yaml file (optional, in CWD)
    │
built-in defaults

There is no secrets_dir= support today — Docker secrets land via shell env vars on the container side, which is the same precedence level as plain env vars.

Warning

Do not hardcode settings fields in docker-compose.yml's environment: block. The block sits at the same precedence level as a host-side shell export, so pinning a field in compose silently masks any later .env change and breaks the runtime settings editor's apply-and-restart flow.

Per-intent and per-feed values (similarity threshold, scan interval, lookback, poll cadence, …) live on the Intent and Feed rows themselves and are managed via the REST API or the dashboard, not via environment variables. If a knob you want isn't here, look for it in the api reference.

Required¶

Variable	Description
`EMBEDDER_API_KEY`	SiliconFlow (or any OpenAI-compatible) API key for the `/v1/embeddings` endpoint. The container exits non-zero at startup if absent or blank.

The same key is reused as LLM_API_KEY by default — SiliconFlow hosts both BGE-M3 and DeepSeek-V4-Flash, so one key is usually enough.

Storage¶

Variable	Default	Description
`QDRANT_URL`	`http://qdrant:6333`	Qdrant server URL. The bundled `docker-compose.yml` provisions this address
`SQLITE_PATH`	`/app/data/sembr.db`	SQLite database path inside the container. The host maps `./data/` here via the compose bind mount
`SEMBR_HOST_PORT`	`8000`	Host port exposed by Docker Compose. The in-container bind port is hardcoded to `8000` in the Dockerfile CMD; override the host side here

Embedder¶

Variable	Default	Description
`EMBEDDER_BACKEND`	`siliconflow`	Embedding backend. Only `siliconflow` is shipped today
`EMBEDDER_API_BASE_URL`	`https://api.siliconflow.cn/v1`	OpenAI-compatible `/v1/embeddings` endpoint. Point at any provider that speaks the same protocol to swap
`EMBEDDER_MODEL`	`BAAI/bge-m3`	Model name passed to the endpoint
`EMBEDDER_TIMEOUT_SECONDS`	`30`	HTTP timeout for the startup probe and the httpx client default. Batch embed calls compute a dynamic timeout `max(30s, total_chars / 1500)`, so values below 30 do not tighten the batch path

LLM (summarizer)¶

Variable	Default	Description
`LLM_API_BASE_URL`	`https://api.siliconflow.cn/v1`	OpenAI-compatible `/v1/chat/completions` endpoint
`LLM_API_KEY`	—	API key. Default-shares the SiliconFlow embedder key when left blank
`LLM_MODEL`	`deepseek-ai/DeepSeek-V4-Flash`	Model name passed to the chat completions endpoint
`LLM_TIMEOUT_SECONDS`	`60`	Per-request HTTP timeout
`LLM_MAX_PROMPT_CHARS`	`1_500_000`	Total prompt-side character budget (system + instruction + assembled articles). The pipeline reserves ~15 % for the response, then water-fills article bodies into the remainder — short articles stay whole, only the longest get truncated. Tune to your model's context window. Characters, not tokens — Chinese ≈ 1–1.7 chars/token, English ≈ 4 chars/token. Defaults: `1_500_000` for a 1 M-token ctx model (DeepSeek-V4-Flash) on mixed-language news; bump to `3_000_000` for English-only feeds, drop to `1_000_000` for pure-Chinese feeds with very long articles, `~200_000` for 128 K ctx, `~16_000` for an 8 K-token local model. Lower bound `2_000`

Only the API-style backend (any /v1/chat/completions endpoint) ships today.

Structured extraction & reduce (map-reduce)¶

Opt-in per intent (a toggle in the dashboard, not a global env var). When on, the pipeline extracts each recalled article into a structured, spec-defined record and reduces those facts into the digest instead of raw article bodies; when off, it uses raw bodies as before. The models below default to the summarization model, so the feature needs no extra keys.

Variable	Default	Description
`REDUCE_MODEL`	— (reuses `LLM_MODEL`)	Model for per-article structured extraction and the reduce step. Leave blank to reuse the summarization model
`META_EXTRACTION_MODEL`	— (reuses `LLM_MODEL`)	Model the spec auto-generator uses to draft a per-intent extraction spec. Leave blank to reuse the summarization model
`REDUCE_CONCURRENCY`	`16`	How many source articles to extract in parallel. Higher clears large digests faster but bursts the provider; lower is gentler on rate limits (1–256)

Email notifier¶

Email is the only built-in notification channel today. Leave SMTP_HOST empty to disable email delivery; the rest of the app still runs.

Variable	Default	Description
`SMTP_HOST`	`""`	SMTP server hostname (e.g. `smtp.gmail.com`, `smtp.sendgrid.net`). Empty disables email
`SMTP_PORT`	`587`	SMTP port. `587` for STARTTLS (default), `465` for `SMTP_SSL`
`SMTP_USERNAME`	`""`	SMTP login username. Empty skips `AUTH`
`SMTP_PASSWORD`	`""`	SMTP login password (`SecretStr`; never logged)
`SMTP_FROM`	`""`	`From:` address. Falls back to `SMTP_USERNAME` if empty
`SMTP_USE_STARTTLS`	`true`	Run `STARTTLS` after the plain SMTP connect
`SMTP_USE_SSL`	`false`	Use `SMTP_SSL` directly (port 465 style). When `true`, `SMTP_USE_STARTTLS` is ignored

The per-intent timezone (Intent.timezone) is what the email template uses to render published_at; DISPLAY_TIMEZONE below is consulted by the dashboard, not by email rendering.

Dashboard & logs¶

Variable	Default	Description
`DASHBOARD_TOKEN`	`""`	Optional shared secret gating `/dashboard` and `/api/dashboard/*`. Empty disables auth — set this whenever the host is reachable beyond `localhost`, since feed URLs and dead-article error messages would otherwise be public
`DASHBOARD_POLL_INTERVAL_SECONDS`	`10`	Frontend snapshot polling cadence. Bounded `[2, 120]`. Surfaced via `/api/dashboard/config` to the bundled JS
`DASHBOARD_LOG_RETENTION_DAYS`	`7`	Maximum age of rows kept in `feed_fetch_log` and `embed_call_log`. Bounded `[1, 90]`
`DASHBOARD_LOG_MAX_PER_FEED`	`1000`	Per-feed FIFO cap on `feed_fetch_log` rows. Bounded `[10, 100000]`
`DASHBOARD_LOG_LEVEL`	`INFO`	Default level applied to all seven LogBus tags on startup. One of `DEBUG / INFO / WARNING / ERROR`. The dashboard's `PUT /api/dashboard/logs/level` can adjust each tag at runtime; runtime changes are process-memory only and reset on restart
`DASHBOARD_LOG_BUFFER_PER_TAG`	`1000`	Ring buffer capacity per log tag. Bounded `[100, 10000]`. Memory cost ≈ `7 × buffer × ~500 B`, so the max sits around 35 MB

Display¶

Variable	Default	Description
`DISPLAY_TIMEZONE`	`Asia/Shanghai`	IANA timezone surfaced to the dashboard for timestamp rendering. Not consulted by the email notifier — that uses each intent's own `timezone` field

Prompts¶

The prompts root is fixed at /app/prompts inside the container — it is not configurable. The bundled docker-compose.yml bind-mounts the host's ./prompts directory there in read-write mode so the dashboard's Templates tab can create, edit, rename, and delete files via POST/PUT/DELETE/POST-rename against /api/prompts/templates. Two subdirectories live under it:

system/ — system prompts. Allowed placeholder: {language}.
instruction/ — user instruction templates. Allowed placeholders: {intent_text}, {articles}.

Templates are read on every digest tick (no caching) — host-side edits take effect on the next summary, no restart needed. The reserved name default exists in both subdirectories, ships with the project, and is read-only via the API (HTTP 403 on PUT/DELETE/rename, HTTP 422 if you try to create or rename to it). Per-file size cap is 64 KiB (HTTP 422 on oversize PUT). Empty content is also rejected (HTTP 422). For container-UID / host-permission considerations on production deployments, see the README "Production deployment" section.

The legacy SEMBR_PROMPTS_DIR environment variable was removed in the template-management refactor. Tests can redirect the prompts root via monkeypatch.setattr("sembr.summarizer.templates.PROMPTS_DIR", tmp_path) instead.

Lifespan / shutdown¶

Variable	Default	Description
`LIFESPAN_SHUTDOWN_TIMEOUT`	`8.0`	Maximum seconds allowed for graceful lifespan shutdown before forcing exit. Set below docker-stop's SIGKILL deadline (default 10 s). Only applies to self-restart paths (e.g. settings save → SIGTERM); a normal `docker compose down` is not affected

Collector / RSSHub¶

Variable	Default	Description
`PROXY_HOSTS`	`rsshub:1200`	Comma-separated `host[:port]` entries that front many backends (the bundled RSSHub instance is the canonical example). For these hosts the per-host concurrency limiter additionally segments by the first URL path segment, so backends behind one proxy don't share a single semaphore
`WISBURG_API_KEY`	—	Bearer key for `wisburg-report` feeds (Wisburg developer console). Empty disables Wisburg feeds: fetches fail without advancing the cursor and recover on the next poll once the key is set

RSSHub passthrough variables¶

These environment variables are forwarded as-is to the bundled RSSHub container — they are read by RSSHub itself, not by sembr code. The settings editor accepts new keys that match ^[A-Z][A-Z0-9_]*$ and begin with one of the allowed prefixes (TWITTER_, TELEGRAM_, GITHUB_, RSSHUB_, SOCIAL_, OPENAI_).

Variable	Used by	Notes
`TWITTER_AUTH_TOKEN`	RSSHub Twitter routes	The `auth_token` cookie value (40-char hex) from a logged-in `x.com` session — DevTools → Application → Cookies. Comma-separate multiple values to rotate between accounts
`TELEGRAM_TOKEN`	RSSHub Telegram routes	Bot token from BotFather, for public channel feeds
`TELEGRAM_SESSION`	RSSHub Telegram routes	User session string (Telethon / Pyrogram) for restricted channels
`GITHUB_ACCESS_TOKEN`	RSSHub GitHub routes	PAT — raises the API rate limit from 60 to 5000 req/h