Breehavior-Monitor

Author	SHA1	Message	Date
aj	0449c8c30d	feat: give bot full conversation context on @mentions for real engagement When @mentioned, fetch recent messages from ALL users in the channel (up to 15 messages) instead of only the mentioner's messages. This lets the bot understand debates and discussions it's asked to weigh in on. Also update the personality prompt to engage with topics substantively when asked for opinions, rather than deflecting with generic jokes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 14:14:46 -05:00
aj	3d252ee729	feat: classify mention intent before running expensive scan Adds LLM triage on bot @mentions to determine if the user is chatting or reporting bad behavior. Only 'report' intents trigger the 30-message scan; 'chat' intents skip the scan and let ChatCog handle it. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:20:54 -05:00
aj	b918ba51a8	fix: use escalation model and fallback to permanent memories in migration - Use LLM_ESCALATION_* env vars for better profile generation - Fall back to joining permanent memories if profile_update is null Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:14:38 -05:00
aj	efe7f901c2	Merge branch 'worktree-agent-a27a0179'	2026-02-26 13:04:25 -05:00
aj	ca17b6ac61	Merge branch 'worktree-agent-a0b1ccc2'	2026-02-26 13:04:24 -05:00
aj	8a092c720f	Merge branch 'worktree-agent-a78eaee3'	2026-02-26 13:04:18 -05:00
aj	365907a7a0	feat: extract and save memories after chat conversations Merge worktree: adds _extract_and_save_memories() method and fire-and-forget extraction call after each chat reply. Combined with Task 4's memory retrieval and injection. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:04:12 -05:00
aj	e488b2b227	feat: extract and save memories after chat conversations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 13:02:42 -05:00
aj	7ca369b641	feat: add one-time migration script for user notes to profiles Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:59:03 -05:00
aj	305c9bf113	feat: route sentiment note_updates into memory system Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:58:14 -05:00
aj	2054ca7b24	feat: add background memory pruning task Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:58:12 -05:00
aj	d61e85d928	feat: inject persistent memory context into chat responses Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:56:02 -05:00
aj	89fabd85da	feat: add set_user_profile method to DramaTracker Replaces the entire notes field with an LLM-generated profile summary, used by the memory extraction system for permanent facts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:54:05 -05:00
aj	67011535cd	feat: add memory extraction LLM tool and prompt Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:53:18 -05:00
aj	8686f4fdd6	fix: align default limits and parameter names to spec - get_recent_memories: limit default 10 → 5 - get_memories_by_topics: limit default 10 → 5 - prune_excess_memories: rename 'cap' → 'max_memories' Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:50:47 -05:00
aj	75adafefd6	feat: add UserMemory table and CRUD methods for conversational memory Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:48:54 -05:00
aj	333fbb3932	docs: add conversational memory implementation plan 9-task step-by-step plan covering DB schema, LLM extraction tool, memory retrieval/injection in chat, sentiment pipeline routing, background pruning, and migration script. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:44:18 -05:00
aj	d652c32063	docs: add conversational memory design document Outlines persistent memory system for making the bot a real conversational participant that knows people and remembers past interactions. Uses existing UserNotes column for permanent profiles and a new UserMemory table for expiring context with LLM-assigned lifetimes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:41:28 -05:00
aj	196f8c8ae5	fix: remove owner notification on topic drift escalation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 22:29:01 -05:00
aj	c63913cf14	fix: anonymize usernames before LLM analysis to prevent name-based scoring bias Display names like "Calm your tits" were causing the LLM to inflate toxicity scores on completely benign messages. Usernames are now replaced with User1, User2, etc. before sending to the LLM, then mapped back to real names in the results. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 22:20:53 -05:00
aj	cb8ef8542b	fix: guard against malformed LLM findings in conversation validation Filter out non-dict entries from user_findings and handle non-dict result to prevent 'str' object has no attribute 'setdefault' errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 21:38:02 -05:00
aj	f46caf9ac5	fix: tag context messages with [CONTEXT] to prevent LLM from scoring them The triage LLM was blending context message content into its reasoning for new messages (e.g., citing profanity from context when the new message was just "I'll be here"). Added per-message [CONTEXT] tags inline and strengthened the prompt to explicitly forbid referencing context content in reasoning/scores. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 20:08:23 -05:00
aj	660086a500	refactor: extract sentiment cog into package with shared _process_finding Convert cogs/sentiment.py (1050 lines) into cogs/sentiment/ package: - __init__.py (656 lines): core SentimentCog with new _process_finding() that deduplicates the per-user finding loop from _process_buffered and _run_mention_scan (~90 lines each → single shared method) - actions.py: mute_user, warn_user - topic_drift.py: handle_topic_drift - channel_redirect.py: handle_channel_redirect, build_channel_context - coherence.py: handle_coherence_alert - log_utils.py: log_analysis, log_action, score_color - state.py: save_user_state, flush_dirty_states All extracted modules use plain async functions (not methods) receiving bot/config as parameters. Named log_utils.py to avoid shadowing stdlib logging. Also update CLAUDE.md with comprehensive project documentation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 17:06:27 -05:00
aj	188370b1fd	Fix LLM scoring usernames as toxic content The display name "Calm your tits" was being factored into toxicity scores. Updated the analysis prompt to explicitly instruct the LLM to ignore all usernames/display names when scoring messages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 15:51:14 -05:00
aj	7417908142	fix: separate context from new messages so prior-cycle chat doesn't inflate scores The conversation analysis was re-scoring old messages alongside new ones, causing users to get penalized repeatedly for already-scored messages. A "--- NEW MESSAGES ---" separator now marks which messages are new, and the prompt instructs the LLM to score only those. Also fixes bot-mention detection to require an explicit @mention in message text rather than treating reply-pings as scans (so toxic replies to bot warnings aren't silently skipped). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 15:48:02 -05:00
aj	8734f1883b	fix: persist last_offense_time and reset offenses after 24h last_offense_time was in-memory only — lost on restart, so the offense_reset_minutes check never fired after a reboot. Now persisted as LastOffenseAt FLOAT in UserState. On startup hydration, stale offenses (and warned flag) are auto-cleared if the reset window has passed. Bumped offense_reset_minutes from 2h to 24h. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 11:24:38 -05:00
aj	71c7b45e9a	feat: require warning before mute + sustained toxicity escalation Gate mutes behind a prior warning — first offense always gets a warning, mute only fires if warned_since_reset is True. Warned flag is persisted to DB (new Warned column on UserState) and survives restarts. Add post-warning escalation boost to drama_score: each high-scoring message after a warning adds +0.04 (configurable) so sustained bad behavior ramps toward the mute threshold instead of plateauing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 11:07:57 -05:00
aj	f02a4ab49d	Add content fallback for conversation analysis + debug logging When the LLM returns text instead of a tool call for conversation analysis, try parsing the content as JSON before giving up. Also log what the model actually returns on failure for debugging. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 10:16:15 -05:00
aj	90b70cad69	feat: channel-level conversation analysis with compact formatting Switch from per-user message batching to per-channel conversation analysis. The LLM now sees the full interleaved conversation with relative timestamps, reply chains, and consecutive message collapsing instead of isolated flat text per user. Key changes: - Fix gpt-5-nano temperature incompatibility (conditional temp param) - Add mention-triggered scan: users @mention bot to analyze recent chat - Refactor debounce buffer from (channel_id, user_id) to channel_id - Replace per-message analyze_message() with analyze_conversation() returning per-user findings from a single LLM call - Add CONVERSATION_TOOL schema with coherence, topic, and game fields - Compact message format: relative timestamps, reply arrows (→), consecutive same-user message collapsing - Separate mention scan tasks from debounce tasks - Remove _store_context/_get_context (conversation block IS the context) - Escalation timeout config: [30, 60, 120, 240] minutes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 23:13:07 -05:00
aj	943c67cc87	Add Wordle scoring context so LLM knows lower is better Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 18:05:41 -05:00
aj	f457240e62	Add Wordle commentary: bot reacts to Wordle results with mode-appropriate comments Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 17:56:05 -05:00
aj	01b7a6b240	Bump health check max_completion_tokens to 16 gpt-5-nano can't produce output with max_completion_tokens=1. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 17:08:32 -05:00
aj	a0edf90ebd	Switch to max_completion_tokens for newer OpenAI models gpt-5-nano and other newer models require max_completion_tokens instead of max_tokens. The new parameter is backwards compatible with older models. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 17:07:44 -05:00
aj	dd0d18b0f5	Disable topic drift monitoring in general channel Add ignored_channels config to topic_drift section, supporting channel names or IDs. General channel excluded from off-topic warnings while still receiving full moderation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 14:47:03 -05:00
aj	b79d1897f9	Add drunk mode: happy drunk commentating on everything Lovable hammered friend with typos, strong nonsensical opinions, random tangents, and overwhelming affection for everyone in chat. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 20:05:03 -05:00
aj	ac4057b906	Add hype mode: positive/supportive teammate personality New mode that gasses people up for their plays and takes using gaming hype terminology, but reads the room and dials back to genuine encouragement when someone's tilted or frustrated. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 20:02:39 -05:00
aj	8b2091ac38	Tone down roast bot: more positive, less frequent - Add guidance for ~25% genuinely positive/hype responses - Lean toward playful ribbing over pure negativity - Reduce reply_chance from 35% to 20% - Increase proactive_cooldown_messages from 5 to 8 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 19:55:17 -05:00
aj	7db7a4b026	Tell roast prompt not to fabricate leaderboards or stats The model was inventing rankings and scoreboards from the drama score metadata. Explicitly tell it not to make up stats it doesn't have. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 18:43:36 -05:00
aj	c8e7c8c1cf	Trim prompts for gpt-4o-mini, remove disagreement detection Slim down chat_roast.txt — remove anti-repetition rules that were compensating for the local model (gpt-4o-mini handles this natively). Remove disagreement detection from analysis prompt, tool schema, and sentiment handler. Saves ~200 tokens per analysis call. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:26:44 -05:00
aj	c258994a2e	Use gpt-4o-mini for chat/roasts via dedicated LLM_CHAT_MODEL Add a separate llm_chat client so chat responses use a smarter model (gpt-4o-mini) while analysis stays on the cheap local Qwen3-8B. Falls back to llm_heavy if LLM_CHAT_MODEL is not set. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:04:55 -05:00
aj	e4239b25c3	Keep only the last segment after bracketed metadata in LLM responses The model dumps paraphrased context and style labels in [brackets] before its actual roast. Instead of just removing bracket lines (which leaves the preamble text), split on them and keep only the last non-empty segment — the real answer is always last. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 15:31:09 -05:00
aj	02b2870f2b	Strip all standalone bracketed text from LLM responses The model paraphrases injected metadata in unpredictable ways, so targeted regexes can't keep up. Replace them with a single rule: any [bracketed block] on its own line gets removed, since real roasts never use standalone brackets. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 15:24:18 -05:00
aj	942f5ddce7	Fix repetitive roast responses with anti-repetition mechanisms Add frequency_penalty (0.8) and presence_penalty (0.6) to LLM chat calls to discourage repeated tokens. Inject the bot's last 5 responses into the system prompt so the model knows what to avoid. Strengthen the roast prompt with explicit anti-repetition rules and remove example lines the model was copying verbatim ("Real ___ energy", etc.). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 15:15:11 -05:00
aj	534aac5cd7	Enable thinking for chat, diversify roast styles - Remove /no_think override from chat() so Qwen3 reasons before generating responses (fixes incoherent word-salad replies) - Analysis and image calls keep /no_think for speed - Add varied roast style guidance (deadpan, sarcastic, blunt, etc.) - Explicitly ban metaphors/similes in roast prompt - Replace metaphor examples with direct roast examples Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 13:59:16 -05:00
aj	66031cd9f9	Add user notes and recent message history to chat context When the bot replies (proactive or mentioned), it now fetches the user's drama tracker notes and their last ~10 messages in the channel. Gives the LLM real context for personalized replies instead of generic roasts on bare pings. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 13:44:04 -05:00
aj	3261cdd21c	Fix proactive replies appearing before the triggering message Proactive replies used channel.send() which posted standalone messages with no visual link to what triggered them. Now all replies use message.reply() so the response is always attached to the source message. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 13:35:40 -05:00
aj	3f9dfb1e74	Fix reaction clap-backs replying to the bot's own message Send as a channel message instead of message.reply() so it doesn't look like the bot is talking to itself. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:32:08 -05:00
aj	86b23c2b7f	Let users @ the bot on a message to make it respond Reply to any message + @bot to have the bot read and respond to it. Also picks up image attachments from referenced messages so users can reply to a photo with "@bot roast this". Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:24:26 -05:00
aj	8a06ddbd6e	Support hybrid LLM: local Qwen triage + OpenAI escalation Triage analysis runs on Qwen 8B (athena.lan) for free first-pass. Escalation, chat, image roasts, and commands use GPT-4o via OpenAI. Each tier gets its own base URL, API key, and concurrency settings. Local models get /no_think and serialized requests automatically. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:20:07 -05:00
aj	b5e401f036	Generalize image roast to handle selfies, memes, and any image The prompt was scoreboard-only, so selfies got nonsensical stat-based roasts. Now the LLM identifies what's in the image and roasts accordingly. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:15:22 -05:00
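
Note on c63913cf14 (username anonymization): the commit message describes replacing display names with User1, User2, etc. before the LLM call and mapping them back afterwards. A minimal sketch of that idea follows. The function names (anonymize, restore), the (author, text) tuple shape, and the "username" key on findings are assumptions made for illustration. They are not taken from the repository.

```python
def anonymize(messages):
    """Replace display names with User1, User2, ... in order of first appearance.

    `messages` is a list of (display_name, text) tuples (hypothetical shape).
    Returns the anonymized list plus a reverse map from alias to real name.
    """
    alias_for = {}
    anonymized = []
    for author, text in messages:
        # setdefault only stores the new alias the first time an author is seen.
        alias = alias_for.setdefault(author, f"User{len(alias_for) + 1}")
        anonymized.append((alias, text))
    real_for = {alias: name for name, alias in alias_for.items()}
    return anonymized, real_for


def restore(findings, real_for):
    """Map aliases in LLM findings back to real display names.

    Each finding is assumed to be a dict with a "username" key (hypothetical).
    Unknown aliases are left unchanged.
    """
    return [
        {**finding, "username": real_for.get(finding["username"], finding["username"])}
        for finding in findings
    ]


if __name__ == "__main__":
    chat = [("Calm your tits", "I'll be here"), ("aj", "ok"), ("Calm your tits", "brb")]
    anon, reverse_map = anonymize(chat)
    print(anon)
    print(restore([{"username": "User1", "score": 0.1}], reverse_map))
```

Because the model only ever sees the neutral aliases, a provocative display name cannot leak into the score. The reverse map is kept on the caller's side.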

82 Commits
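
Note on e4239b25c3 (bracketed metadata): the commit message describes splitting an LLM response on standalone [bracketed] lines and keeping only the last non-empty segment. The sketch below is one way that could look. The regex and the name keep_last_segment are assumptions for illustration and may differ from the actual code.

```python
import re

# A line consisting only of one [bracketed block], with optional whitespace.
# This pattern is an assumption; the repository's actual rule may differ.
BRACKET_LINE = re.compile(r"^\s*\[[^\]]*\]\s*$")


def keep_last_segment(response: str) -> str:
    """Split on standalone bracket lines and return the last non-empty segment."""
    segments = [[]]
    for line in response.splitlines():
        if BRACKET_LINE.match(line):
            # A bracket-only line ends the current segment and starts a new one.
            segments.append([])
        else:
            segments[-1].append(line)
    # Walk backwards so a trailing bracket line does not yield an empty reply.
    for segment in reversed(segments):
        text = "\n".join(segment).strip()
        if text:
            return text
    return ""


if __name__ == "__main__":
    raw = "So the user said hi\n[style: deadpan]\nNice aim, champ."
    print(keep_last_segment(raw))
```

Inline brackets inside a sentence are left alone, since only lines that are entirely bracketed act as separators.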