docs/architecture/initiative.md

Ambient Text Cycle

Scope: Current shipped ambient text delivery cycle.
Shared attention model: presence-and-attention.md
Activity model overview: activity.md
Voice pipeline reference: ../voice/voice-provider-abstraction.md

This document describes the text-side ambient delivery system in src/bot/initiativeEngine.ts.

Core Model

One ambient text cycle handles the text spoke of AMBIENT attention:

conversational chime-ins in active channels
standalone proactive posts
optional sharing of passive discovery feed items
optional active curiosity through tools

This is not a second personality and not a separate “initiative mind.” It is the current text transport for ambient thoughts.

The discovery service is now feed infrastructure, not a separate delivery engine. It gathers candidates that the initiative prompt may consider alongside channel context and memory.

Design Principles

The model decides whether to post, what to say, which eligible channel fits, and whether links or media feel natural.
Infrastructure decides when to consult the model and what context to provide.
Discovery candidates are optional context, not assignments.
initiative.text.eagerness is a probability gate before fresh ambient-thought synthesis, not a content rule.
initiative.text.eagerness only shapes AMBIENT text behavior. Direct-addressed and live ACTIVE text turns should use immediate reply admission instead of waiting for the initiative tick.
The model can post now, hold a thought for later, or drop it.
Shared attention may be informed by voice context, but initiative delivery is still a text action.

Runtime Flow

The runtime flow in src/bot/initiativeEngine.ts is:

60s initiative tick
Deterministic gates
Pending-thought revisit or fresh-thought admission
Context assembly
Bounded tool loop
Post now, hold, or drop

1. Tick

The bot runs one in-process ambient text tick every 60s.

There is no replay catchup after downtime. If the process is offline, skipped initiative opportunities are simply missed.

2. Deterministic Gates

Before the model is consulted for an ambient text thought, the runtime checks:

initiative.text.enabled
eligible channel pool is non-empty
daily cap has not been reached
for fresh thoughts: minimum gap since the last initiative consideration has passed
for fresh thoughts: the eagerness probability roll passed

Queued ambient thoughts are revisited before the normal fresh-thought cooldown and probability gates. This lets a held thought be refined, redirected, or dropped instead of starving behind the spacing rules that apply to fresh posts.
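The gate order above can be sketched roughly as follows. This is an illustration only, not the code in src/bot/initiativeEngine.ts: the `GateInput` shape, the `checkGates` name, the reason strings, and the assumption that eagerness is a 0 to 100 percentage are all hypothetical. The random roll is injected so the sketch stays deterministic under test.

```typescript
// Hypothetical input shape; field names are illustrative, not the shipped types.
interface GateInput {
  enabled: boolean;                      // initiative.text.enabled
  eligibleChannelCount: number;          // size of the eligible channel pool
  postsToday: number;                    // initiative_post actions counted so far today
  maxPostsPerDay: number;                // initiative.text.maxPostsPerDay
  minutesSinceLastConsideration: number;
  minMinutesBetweenPosts: number;        // initiative.text.minMinutesBetweenPosts
  eagerness: number;                     // ASSUMPTION: a 0-100 percentage
  hasPendingThought: boolean;
}

type GateResult =
  | { consult: true; mode: "revisit_pending" | "fresh" }
  | { consult: false; reason: string };

function checkGates(input: GateInput, roll: () => number = Math.random): GateResult {
  // Gates that apply to every ambient consultation.
  if (!input.enabled) return { consult: false, reason: "disabled" };
  if (input.eligibleChannelCount === 0) return { consult: false, reason: "no_eligible_channels" };
  if (input.postsToday >= input.maxPostsPerDay) return { consult: false, reason: "daily_cap" };

  // A held thought is revisited before the fresh-thought cooldown and eagerness gates.
  if (input.hasPendingThought) return { consult: true, mode: "revisit_pending" };

  // Gates that apply to fresh thoughts only.
  if (input.minutesSinceLastConsideration < input.minMinutesBetweenPosts) {
    return { consult: false, reason: "cooldown" };
  }
  // roll() yields a value in [0, 1); eagerness 0 never passes, 100 always passes.
  if (roll() * 100 >= input.eagerness) return { consult: false, reason: "eagerness_roll" };

  return { consult: true, mode: "fresh" };
}
```

Keeping the roll last means the cheaper deterministic checks short-circuit first, and a pending thought never depends on a lucky roll to be reconsidered.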

Canonical settings:

initiative.text.enabled
initiative.text.eagerness
initiative.text.minMinutesBetweenPosts
initiative.text.maxPostsPerDay

The canonical persisted action history for this cycle is the initiative action log:

initiative_post counts as a surfaced ambient text thought
initiative_skip records ambient quiet decisions and pending-thought lifecycle events

3. Pending Thought Queue

The shipped runtime keeps at most one pending ambient text thought per guild in memory.

That pending thought stores:

the current draft text
the originally targeted channel
revision count and timestamps
the last hold/drop reason
the media directive, if any
a bounded expiry that is anchored to the original creation time so a thought cannot be held forever

When a pending thought exists, the next eligible tick asks the model what it is thinking right now, with the current queued thought included for continuity. The model can:

post it now
keep holding a refined or replaced version
drop it

If the held thought includes an image, video, or GIF plan, that media intent is included again on reconsideration so the next pass can refine the whole thought rather than dropping the media continuity.

A pending thought only blocks fresh ambient synthesis for its own guild. Other guilds remain eligible for fresh ambient consideration on normal cooldown and eagerness terms.
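A per-guild, in-memory queue with creation-anchored expiry might look like the sketch below. The class name, field names, and the 30-minute `MAX_HOLD_MS` bound are assumptions for illustration; this document does not state the shipped expiry value or types.

```typescript
// Hypothetical shape of a held ambient thought; field names are illustrative.
interface PendingThought {
  draftText: string;
  targetChannelId: string;
  revisionCount: number;
  createdAtMs: number;          // set once; expiry is anchored here
  updatedAtMs: number;
  lastReason: string | null;    // last hold/drop reason
  mediaDirective: string | null;
}

interface HoldRequest {
  draftText: string;
  targetChannelId: string;
  reason: string | null;
  mediaDirective: string | null;
}

// ASSUMPTION: the real lifetime bound is not given in this document.
const MAX_HOLD_MS = 30 * 60 * 1000;

class PendingThoughtQueue {
  // One slot per guild, so a held thought only blocks its own guild.
  private readonly byGuild = new Map<string, PendingThought>();

  // Returns the live thought for a guild, discarding it if it has expired.
  get(guildId: string, nowMs: number): PendingThought | undefined {
    const thought = this.byGuild.get(guildId);
    if (!thought) return undefined;
    if (nowMs - thought.createdAtMs >= MAX_HOLD_MS) {
      this.byGuild.delete(guildId);
      return undefined;
    }
    return thought;
  }

  // Holding a refined thought keeps the original creation time,
  // so repeated refinement cannot extend the lifetime forever.
  hold(guildId: string, next: HoldRequest, nowMs: number): void {
    const prev = this.get(guildId, nowMs);
    this.byGuild.set(guildId, {
      draftText: next.draftText,
      targetChannelId: next.targetChannelId,
      revisionCount: prev ? prev.revisionCount + 1 : 0,
      createdAtMs: prev ? prev.createdAtMs : nowMs,
      updatedAtMs: nowMs,
      lastReason: next.reason,
      mediaDirective: next.mediaDirective,
    });
  }

  drop(guildId: string): void {
    this.byGuild.delete(guildId);
  }
}
```

Because the state lives only in memory in this sketch, a process restart would lose any held thought, which matches the in-memory description above.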

4. Context Assembly

When the gates pass, the runtime builds a prompt from:

eligible channel summaries
passive discovery feed candidates
feed-source performance
lightweight community-interest facts derived from recent activity
durable memory
behavior guidance and relevant behavioral memory

Channel summaries are built from recent stored messages, recent human activity, and time since the bot last posted. Stored channel history can include linked voice-transcript rows alongside normal typed chat. The prompt labels those lines as [vc] so the model can use them as room context while still understanding that initiative delivery is a normal text-channel post.
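As a rough illustration of the [vc] labeling, the sketch below formats stored rows into prompt lines. The `StoredMessage` shape, the `isVoiceTranscript` flag, and the `formatChannelLines` name are hypothetical; only the [vc] label and the lookback window come from this document.

```typescript
// Hypothetical stored-row shape; the real schema is not described here.
interface StoredMessage {
  author: string;
  content: string;
  isVoiceTranscript: boolean; // true for linked voice-transcript rows
}

// Keeps the most recent `lookbackMessages` rows and tags voice rows with [vc].
function formatChannelLines(messages: StoredMessage[], lookbackMessages: number): string[] {
  if (lookbackMessages <= 0) return []; // slice(-0) would return everything
  return messages
    .slice(-lookbackMessages)
    .map((m) => `${m.isVoiceTranscript ? "[vc] " : ""}${m.author}: ${m.content}`);
}
```

The label only marks provenance for the model. It does not change where the resulting post is delivered.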

Discovery context is fetched from src/services/discovery.ts and exposed as optional feed material. The model may ignore all of it.

5. Bounded Tool Loop

The initiative call uses the same broad tool philosophy as the reply pipeline, but with tighter budgets.

Canonical tool budget settings:

initiative.text.maxToolSteps
initiative.text.maxToolCalls
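One way to read the two budgets is that a step is a model turn that requests tools, and a call is a single tool invocation within a step. That reading is an assumption, as are the `ModelTurn` type and `runBoundedToolLoop` name below. The sketch is synchronous for brevity; a real loop would be asynchronous.

```typescript
interface ToolBudget {
  maxToolSteps: number; // initiative.text.maxToolSteps
  maxToolCalls: number; // initiative.text.maxToolCalls
}

// Hypothetical model turn: either a final answer or a batch of tool requests.
type ModelTurn = { final: string } | { toolCalls: string[] };

function runBoundedToolLoop(
  budget: ToolBudget,
  nextTurn: (toolResults: string[]) => ModelTurn,
  runTool: (call: string) => string,
): { final: string | null; steps: number; calls: number } {
  let steps = 0;
  let calls = 0;
  let results: string[] = [];
  while (true) {
    const turn = nextTurn(results);
    if ("final" in turn) return { final: turn.final, steps, calls };
    // Step budget exhausted: stop without a final answer.
    if (steps >= budget.maxToolSteps) return { final: null, steps, calls };
    steps += 1;
    results = [];
    for (const call of turn.toolCalls) {
      if (calls >= budget.maxToolCalls) break; // call budget exhausted
      calls += 1;
      results.push(runTool(call));
    }
  }
}
```

Every iteration either returns or consumes a step, so the loop always terminates within `maxToolSteps + 1` model turns.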

Tool availability:

web_search, web_scrape, and browser_browse are available only when initiative.text.allowActiveCuriosity is true
memory_search is available when memory is enabled
discovery source-management tools are available only when initiative.discovery.allowSelfCuration is true
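The availability rules above reduce to a small mapping from flags to tool names. In this sketch the `ToolFlags` shape and `memoryEnabled` field are hypothetical stand-ins; the tool names and the two initiative settings are the ones named in this document.

```typescript
interface ToolFlags {
  allowActiveCuriosity: boolean; // initiative.text.allowActiveCuriosity
  memoryEnabled: boolean;        // hypothetical stand-in for "memory is enabled"
  allowSelfCuration: boolean;    // initiative.discovery.allowSelfCuration
}

// Returns the tool names the ambient-text call may use under the given flags.
function availableInitiativeTools(flags: ToolFlags): string[] {
  const tools: string[] = [];
  if (flags.allowActiveCuriosity) {
    tools.push("web_search", "web_scrape", "browser_browse");
  }
  if (flags.memoryEnabled) {
    tools.push("memory_search");
  }
  if (flags.allowSelfCuration) {
    tools.push("discovery_source_add", "discovery_source_remove", "discovery_source_list");
  }
  return tools;
}
```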

6. Delivery

The final output can:

drop the thought and stay ambient
hold a refined thought for later
post text to a selected eligible channel now
optionally request media generation (image, video, or gif)
optionally include discovery links when they feel natural

Media availability is governed by the initiative.discovery.* capability switches, budgets, and allowlists.

This delivery surface stays text-only even when the context that informed the thought included linked VC transcript rows.
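The three outcomes can be modeled as a discriminated union, with each outcome mapped to the action-log kind described earlier. The `AmbientDecision` type and `logActionFor` function are hypothetical names, and treating a post aimed at a non-eligible channel as a skip is an assumption of this sketch rather than documented behavior.

```typescript
type MediaKind = "image" | "video" | "gif";

// Hypothetical decision shape for the final model output.
type AmbientDecision =
  | { action: "drop"; reason: string }
  | { action: "hold"; draftText: string; reason: string; media?: MediaKind }
  | { action: "post"; channelId: string; text: string; media?: MediaKind };

// Maps a decision to the action-log kind it would be recorded under.
function logActionFor(
  decision: AmbientDecision,
  eligibleChannelIds: ReadonlySet<string>,
): "initiative_post" | "initiative_skip" {
  switch (decision.action) {
    case "post":
      // ASSUMPTION: a post targeting a channel outside the pool is not delivered.
      return eligibleChannelIds.has(decision.channelId) ? "initiative_post" : "initiative_skip";
    case "hold":
    case "drop":
      return "initiative_skip";
  }
}
```

Only the `post` branch carries a channel, which reflects that delivery is always a text-channel action.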

Canonical Channel Pool

The initiative pool for proactive posts is:

permissions.replies.discoveryChannelIds

If discoveryChannelIds is empty, the initiative cycle has no eligible channels and does nothing.

replyChannelIds is a separate concept: unsolicited reply channels where the bot gets a conversational vibe bonus (eagerness boost, standalone post eligibility, softer prompt tone). The bot does not proactively start new topics there unless the channel is also in the discovery list.

Other text permission surfaces still apply:

permissions.replies.allowedChannelIds
permissions.replies.blockedChannelIds
permissions.replies.blockedUserIds
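A possible way to combine the pool with the other channel permission surfaces is sketched below. How the shipped code composes these lists is not specified here, so the precedence shown (blocked always wins, and an empty allowed list means no allowlist restriction) is purely an assumption, as is the `eligibleInitiativeChannels` name.

```typescript
// Subset of permissions.replies.* relevant to channel eligibility.
interface ReplyPermissions {
  discoveryChannelIds: string[];
  allowedChannelIds: string[];
  blockedChannelIds: string[];
}

function eligibleInitiativeChannels(p: ReplyPermissions): string[] {
  const blocked = new Set(p.blockedChannelIds);
  const allowed = new Set(p.allowedChannelIds);
  return p.discoveryChannelIds.filter(
    (id) =>
      !blocked.has(id) &&
      // ASSUMPTION: an empty allowlist places no restriction on channels.
      (allowed.size === 0 || allowed.has(id)),
  );
}
```

With an empty `discoveryChannelIds` the result is always empty, which corresponds to the cycle doing nothing.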

Discovery In The Unified Model

Discovery is split into two roles:

delivery: handled by the unified initiative cycle
collection: handled by the discovery service and discovery settings

initiative.discovery.* now owns feed/media infrastructure such as:

enabled source types
subreddit / RSS / YouTube / X source lists
source freshness and dedupe windows
self-curation guardrails
media generation budgets and model allowlists

The model can:

ignore the feed and [SKIP]
react to channel activity with no discovery content at all
share a feed item
use active curiosity tools to look something up before posting
manage its own feed when self-curation is enabled

Prompt Structure

The shipped ambient-text prompt in src/prompts/promptText.ts is structured like this:

=== AMBIENT TEXT MODE ===
=== YOUR CURRENT THOUGHT ===
=== CHANNELS ===
=== YOUR FEED ===
=== FEED SOURCES ===
=== WHAT THIS COMMUNITY IS INTO ===
=== MEMORY ===
=== BEHAVIOR GUIDANCE ===
=== RELEVANT BEHAVIORAL MEMORY ===
=== CAPABILITIES ===
=== TASK ===

Behavior guidance comes from memory-backed guidance and behavioral facts.

The prompt explicitly frames this cycle as an ambient text action. Voice-derived transcript lines may still appear in channel summaries when they were persisted into the linked text channel, but they are context only, not a separate voice-delivery path.

When a pending thought exists, the prompt includes the held draft as "Your current thought: ..." for continuity, carries forward any held media intent, and asks the model what it is thinking right now rather than forcing a binary post/skip decision.
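The section layout can be pictured as an ordered list of named blocks. The sketch below is not the code in src/prompts/promptText.ts; the `renderPrompt` name is hypothetical, and omitting empty sections (for example, no current-thought block when nothing is held) is an assumption.

```typescript
// Section headers in the order listed in this document.
const SECTION_ORDER = [
  "AMBIENT TEXT MODE",
  "YOUR CURRENT THOUGHT",
  "CHANNELS",
  "YOUR FEED",
  "FEED SOURCES",
  "WHAT THIS COMMUNITY IS INTO",
  "MEMORY",
  "BEHAVIOR GUIDANCE",
  "RELEVANT BEHAVIORAL MEMORY",
  "CAPABILITIES",
  "TASK",
] as const;

type SectionName = (typeof SECTION_ORDER)[number];

// Renders sections in canonical order regardless of the order they were supplied in.
function renderPrompt(sections: Partial<Record<SectionName, string>>): string {
  const parts: string[] = [];
  for (const name of SECTION_ORDER) {
    const body = sections[name];
    // ASSUMPTION: sections with no content are skipped entirely.
    if (body === undefined || body.trim() === "") continue;
    parts.push(`=== ${name} ===\n${body.trim()}`);
  }
  return parts.join("\n\n");
}
```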

Relationship To Shared Attention

In the shared attention model:

direct mentions and active exchanges promote Clanker into ACTIVE
when text is ACTIVE, new turns belong to the immediate text reply path rather than this ambient cycle
this ambient text cycle is the current text transport for AMBIENT
the voice thought engine is the corresponding voice transport for AMBIENT

The transports stay different, but the behavioral intent is the same: Clanker is present, occasionally thinks of something worth surfacing, and may still choose silence.

Both transports now ship a lightweight pending-thought queue so an ambient thought can survive long enough to be refined, redirected, or dropped instead of living for only one tick. In voice, room changes such as new speech, joins/leaves, or voice effects can invalidate a queued thought and make the next pass a reconsideration instead of a blind repeat.

Settings Reference

Initiative Text Settings

Setting	Purpose
`initiative.text.enabled`	Master ambient-text toggle
`initiative.text.execution`	Execution policy for the ambient-text LLM call
`initiative.text.eagerness`	Probability gate before consultation
`initiative.text.minMinutesBetweenPosts`	Minimum spacing between ambient-text considerations
`initiative.text.maxPostsPerDay`	Daily ambient-text budget
`initiative.text.lookbackMessages`	Channel context window
`initiative.text.allowActiveCuriosity`	Enables `web_search`, `web_scrape`, and `browser_browse`
`initiative.text.maxToolSteps`	Max ambient-text tool-loop steps
`initiative.text.maxToolCalls`	Max ambient-text tool calls

Discovery Infrastructure Settings

Setting group	Purpose
`initiative.discovery.sources`	Enables source families
`initiative.discovery.redditSubreddits` / `rssFeeds` / `youtubeChannelIds` / `xHandles`	Feed inputs
`initiative.discovery.freshnessHours` / `dedupeHours` / `sourceFetchLimit`	Candidate collection behavior
`initiative.discovery.allowSelfCuration` / `maxSourcesPerType`	Feed-management guardrails
`initiative.discovery.allowImagePosts` / `allowVideoPosts` / `allowReplyGifs` etc.	Media capability switches
`initiative.discovery.maxImagesPerDay` / `maxVideosPerDay` / `maxGifsPerDay`	Media budgets
`initiative.discovery.allowedImageModels` / `allowedVideoModels`	Media allowlists

Memory And Community Context

The initiative prompt can include:

durable memory facts
always-on behavior guidance facts
relevant behavioral memory facts
lightweight interest facts derived from recent guild activity
source performance summaries based on recent initiative outcomes

These inputs keep ambient posts grounded in community context and memory-backed behavioral guidance.

Self-Curation

When initiative.discovery.allowSelfCuration is enabled, the initiative tool loop may use:

discovery_source_add
discovery_source_remove
discovery_source_list

Those tools let the model manage its own feed within operator-owned guardrails.
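A guardrail check for a source-add request might look like the following. The `addSource` function, the reason strings, and the four-way `SourceType` split are hypothetical; the only names taken from this document are `allowSelfCuration` and `maxSourcesPerType`.

```typescript
// ASSUMPTION: one list per source family mentioned in the discovery settings.
type SourceType = "reddit" | "rss" | "youtube" | "x";
type SourceLists = Record<SourceType, string[]>;

interface CurationGuardrails {
  allowSelfCuration: boolean; // initiative.discovery.allowSelfCuration
  maxSourcesPerType: number;  // initiative.discovery.maxSourcesPerType
}

type AddResult = { ok: true } | { ok: false; reason: string };

// Adds a source in place if the operator-owned guardrails permit it.
function addSource(
  lists: SourceLists,
  type: SourceType,
  value: string,
  guard: CurationGuardrails,
): AddResult {
  if (!guard.allowSelfCuration) return { ok: false, reason: "self_curation_disabled" };
  const current = lists[type];
  if (current.indexOf(value) !== -1) return { ok: false, reason: "duplicate" };
  if (current.length >= guard.maxSourcesPerType) return { ok: false, reason: "max_sources_reached" };
  current.push(value);
  return { ok: true };
}
```

Returning a reason instead of throwing lets the tool result be fed back to the model, so it can adjust its feed request on the next step.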

Source Files

src/bot/initiativeEngine.ts
src/services/discovery.ts
src/prompts/promptText.ts
src/store/settingsNormalization.ts
src/bot/memorySlice.ts