36 Commits

f7dfb7931a feat: add redirect channel to topic drift messages
Topic drift reminders and nudges now direct users to a specific
channel (configurable via redirect_channel). Both static templates
and LLM-generated redirects include the clickable channel mention.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-05 17:44:25 -05:00
9872c36b97 improve chat_personality prompt with better structure and guidance
- Fix metadata description to match actual code behavior (optional fields)
- Add texting cadence guidance (lowercase, fragments, casual punctuation)
- Add multi-user conversation handling, conversation exit, deflection, and
  genuine-upset guidance
- Expand examples from 3 to 7 covering varied response styles
- Organize into VOICE/ENGAGEMENT sections for clarity
- Trim over-explained AFTERTHOUGHTS section

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-03 19:23:31 -05:00
f75a3ca3f4 fix: instruct LLM to never quote toxic content in note_updates
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-02 22:03:59 -05:00
09f83f8c2f fix: move slutty prompt to personalities/ dir, match reply chance
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-02 10:11:46 -05:00
20e4e7a985 feat: add slutty mode — flirty, thirsty, full of innuendos
New personality mode with 25% reply chance, very relaxed moderation
thresholds (0.85/0.90), suggestive but not explicit personality.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-02 10:11:21 -05:00
6866ca8adf feat: add afterthoughts, memory callbacks, and callback-worthy extraction
Add triple-pipe afterthought splitting to chat replies so the bot can
send a follow-up message 2-5 seconds later, mimicking natural Discord
typing behavior. Update all 6 personality prompts with afterthought
instructions (~1 in 5 replies) and memory callback guidance so the bot
actively references what it knows about users. Enhance memory extraction
prompt to flag bold claims, contradictions, and embarrassing moments as
high-importance callback-worthy memories with a "callback" topic tag.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-01 11:30:16 -05:00
bf32a9536a feat: add server rule violation detection and compress prompts
- LLM now evaluates messages against numbered server rules and reports
  violated_rules in analysis output
- Warnings and mutes cite the specific rule(s) broken
- Rules extracted to prompts/rules.txt for prompt injection
- Personality prompts moved to prompts/personalities/ and compressed
  (~63% reduction across all prompt files)
- All prompt files tightened: removed redundancy, consolidated Do NOT
  sections, trimmed examples while preserving behavioral instructions

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-27 22:14:35 -05:00
ed51db527c fix: stop bot from starting every message with "Oh,"
Removed "Oh," from example lines that the model was mimicking, added
explicit DO NOT rule against "Oh" openers, and added more varied examples.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-27 20:45:16 -05:00
bf5051dfc1 fix: steer default chat personality away from southern aunt tone
The LLM was interpreting "sassy hall monitor" as warm/motherly with pet
names like "oh sweetheart" and "bless your heart". Added explicit guidance
for deadpan, dry Discord mod energy instead.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-27 17:25:06 -05:00
0ff962c95e feat: generate topic drift redirects via LLM with full conversation context
Replace static random templates with LLM-generated redirect messages that
reference what the user actually said and why it's off-topic. Sass escalates
with higher strike counts. Falls back to static templates if LLM fails or
use_llm is disabled in config.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-27 15:28:36 -05:00
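The fallback behavior described above is a standard try/except pattern. A sketch under assumed names (the `generate` callable, template strings, and signature are illustrative; `use_llm` is the config flag named in the commit):

```python
import random

STATIC_TEMPLATES = [
    "hey, wrong channel for that. back on topic please.",
    "off-topic again? reel it in.",
]

def build_redirect(generate, use_llm: bool, strike_count: int) -> str:
    """Prefer an LLM-generated redirect when enabled; fall back to a
    static template if the LLM is disabled or the call raises."""
    if use_llm:
        try:
            return generate(strike_count)
        except Exception:
            pass  # LLM failed; fall through to static templates
    return random.choice(STATIC_TEMPLATES)
```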
a73d2505d9 feat: add jealousy/possessiveness detection as toxicity category
LLM can now flag possessive name-dropping, territorial behavior, and
jealousy signals when users mention others not in the conversation.
Scores feed into existing drama pipeline for warnings/mutes.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-27 10:07:45 -05:00
0449c8c30d feat: give bot full conversation context on @mentions for real engagement
When @mentioned, fetch recent messages from ALL users in the channel
(up to 15 messages) instead of only the mentioner's messages. This lets
the bot understand debates and discussions it's asked to weigh in on.

Also update the personality prompt to engage with topics substantively
when asked for opinions, rather than deflecting with generic jokes.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-26 14:14:46 -05:00
67011535cd feat: add memory extraction LLM tool and prompt
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-26 12:53:18 -05:00
c63913cf14 fix: anonymize usernames before LLM analysis to prevent name-based scoring bias
Display names like "Calm your tits" were causing the LLM to inflate toxicity
scores on completely benign messages. Usernames are now replaced with User1,
User2, etc. before sending to the LLM, then mapped back to real names in the
results.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 22:20:53 -05:00
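The anonymization step could be as small as the sketch below; the `User1`, `User2` alias scheme is from the commit, while the function shape is an assumption:

```python
def anonymize(messages):
    """Replace display names with stable User1, User2, ... aliases before
    LLM analysis, returning a reverse map so findings can be mapped back."""
    alias = {}
    out = []
    for name, text in messages:
        if name not in alias:
            alias[name] = f"User{len(alias) + 1}"
        out.append((alias[name], text))
    # Invert the map so LLM findings keyed by alias resolve to real names.
    reverse = {v: k for k, v in alias.items()}
    return out, reverse
```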
f46caf9ac5 fix: tag context messages with [CONTEXT] to prevent LLM from scoring them
The triage LLM was blending context message content into its reasoning
for new messages (e.g., citing profanity from context when the new
message was just "I'll be here"). Added per-message [CONTEXT] tags
inline and strengthened the prompt to explicitly forbid referencing
context content in reasoning/scores.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 20:08:23 -05:00
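The per-message tagging might be formatted like this; the `[CONTEXT]` tag is from the commit, while the line layout is an assumption:

```python
def format_for_triage(context, new):
    """Prefix prior-cycle messages with [CONTEXT] so the prompt can forbid
    scoring or citing them; new messages are left untagged."""
    lines = [f"[CONTEXT] {user}: {text}" for user, text in context]
    lines += [f"{user}: {text}" for user, text in new]
    return "\n".join(lines)
```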
188370b1fd Fix LLM scoring usernames as toxic content
The display name "Calm your tits" was being factored into toxicity
scores. Updated the analysis prompt to explicitly instruct the LLM
to ignore all usernames/display names when scoring messages.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 15:51:14 -05:00
7417908142 fix: separate context from new messages so prior-cycle chat doesn't inflate scores
The conversation analysis was re-scoring old messages alongside new ones,
causing users to get penalized repeatedly for already-scored messages.
A "--- NEW MESSAGES ---" separator now marks which messages are new, and
the prompt instructs the LLM to score only those. Also fixes bot-mention
detection to require an explicit @mention in message text rather than
treating reply-pings as scans (so toxic replies to bot warnings aren't
silently skipped).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-25 15:48:02 -05:00
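The mention-detection fix turns on a real Discord detail: an @mention appears in the message content as a `<@id>` (or legacy `<@!id>`) token, whereas a bare reply-ping does not. A sketch (function name is an assumption):

```python
def is_explicit_mention(content: str, bot_id: int) -> bool:
    """True only when the bot is @mentioned in the message text itself;
    a reply-ping notifies the bot but leaves no mention token in content."""
    return f"<@{bot_id}>" in content or f"<@!{bot_id}>" in content
```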
90b70cad69 feat: channel-level conversation analysis with compact formatting
Switch from per-user message batching to per-channel conversation
analysis. The LLM now sees the full interleaved conversation with
relative timestamps, reply chains, and consecutive message collapsing
instead of isolated flat text per user.

Key changes:
- Fix gpt-5-nano temperature incompatibility (conditional temp param)
- Add mention-triggered scan: users @mention bot to analyze recent chat
- Refactor debounce buffer from (channel_id, user_id) to channel_id
- Replace per-message analyze_message() with analyze_conversation()
  returning per-user findings from a single LLM call
- Add CONVERSATION_TOOL schema with coherence, topic, and game fields
- Compact message format: relative timestamps, reply arrows (→),
  consecutive same-user message collapsing
- Separate mention scan tasks from debounce tasks
- Remove _store_context/_get_context (conversation block IS the context)
- Escalation timeout config: [30, 60, 120, 240] minutes

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-24 23:13:07 -05:00
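The compact format above (relative timestamps, reply arrows, consecutive same-user collapsing) might render roughly like this; the dict field names and exact layout are assumptions:

```python
def compact_format(messages, now):
    """Render a channel conversation compactly: relative timestamps,
    → reply arrows, and consecutive same-user messages collapsed."""
    lines = []
    prev_user = None
    for m in messages:
        mins = int((now - m["ts"]) // 60)
        reply = f" → {m['reply_to']}" if m.get("reply_to") else ""
        if m["user"] == prev_user and not reply:
            lines[-1] += f" | {m['text']}"  # collapse consecutive messages
        else:
            lines.append(f"[{mins}m ago] {m['user']}{reply}: {m['text']}")
        prev_user = m["user"]
    return "\n".join(lines)
```

A single LLM call over this block can then return per-user findings instead of one call per user.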
b79d1897f9 Add drunk mode: happy drunk commentating on everything
Lovable hammered friend with typos, strong nonsensical opinions,
random tangents, and overwhelming affection for everyone in chat.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 20:05:03 -05:00
ac4057b906 Add hype mode: positive/supportive teammate personality
New mode that gasses people up for their plays and takes using
gaming hype terminology, but reads the room and dials back to
genuine encouragement when someone's tilted or frustrated.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 20:02:39 -05:00
8b2091ac38 Tone down roast bot: more positive, less frequent
- Add guidance for ~25% genuinely positive/hype responses
- Lean toward playful ribbing over pure negativity
- Reduce reply_chance from 35% to 20%
- Increase proactive_cooldown_messages from 5 to 8

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 19:55:17 -05:00
7db7a4b026 Tell roast prompt not to fabricate leaderboards or stats
The model was inventing rankings and scoreboards from the drama score
metadata. Explicitly tell it not to make up stats it doesn't have.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 18:43:36 -05:00
c8e7c8c1cf Trim prompts for gpt-4o-mini, remove disagreement detection
Slim down chat_roast.txt — remove anti-repetition rules that were
compensating for the local model (gpt-4o-mini handles this natively).
Remove disagreement detection from analysis prompt, tool schema, and
sentiment handler. Saves ~200 tokens per analysis call.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 16:26:44 -05:00
942f5ddce7 Fix repetitive roast responses with anti-repetition mechanisms
Add frequency_penalty (0.8) and presence_penalty (0.6) to LLM chat
calls to discourage repeated tokens. Inject the bot's last 5 responses
into the system prompt so the model knows what to avoid. Strengthen
the roast prompt with explicit anti-repetition rules and remove example
lines the model was copying verbatim ("Real ___ energy", etc.).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 15:15:11 -05:00
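The penalty values and last-5 injection could be assembled like this; `frequency_penalty` and `presence_penalty` are real OpenAI chat-completion parameters and the 0.8/0.6 values come from the commit, while the prompt wording and function shape are assumptions:

```python
def build_chat_request(system_prompt, user_msg, recent_bot_replies):
    """Assemble a chat-completion payload with repetition penalties and the
    bot's last 5 replies injected so the model knows what to avoid."""
    avoid = "\n".join(f"- {r}" for r in recent_bot_replies[-5:])
    system = f"{system_prompt}\n\nYour recent replies (do not repeat these):\n{avoid}"
    return {
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user_msg},
        ],
        "frequency_penalty": 0.8,  # discourage repeated tokens
        "presence_penalty": 0.6,   # discourage revisiting the same topics
    }
```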
534aac5cd7 Enable thinking for chat, diversify roast styles
- Remove /no_think override from chat() so Qwen3 reasons before
  generating responses (fixes incoherent word-salad replies)
- Analysis and image calls keep /no_think for speed
- Add varied roast style guidance (deadpan, sarcastic, blunt, etc.)
- Explicitly ban metaphors/similes in roast prompt
- Replace metaphor examples with direct roast examples

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 13:59:16 -05:00
b5e401f036 Generalize image roast to handle selfies, memes, and any image
The prompt was scoreboard-only, so selfies got nonsensical stat-based
roasts. Now the LLM identifies what's in the image and roasts accordingly.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 12:15:22 -05:00
a9bc24e48e Tune English teacher to catch more errors, bump roast reply chance
- Raised sentence limit from 3 to 5 for English teacher mode
- Added instruction to list multiple corrections rapid-fire
- Roast mode reply chance: 10% -> 35%

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 11:03:03 -05:00
4283078e23 Add English teacher mode
Insufferable grammar nerd that corrects spelling, translates slang
into proper English, and overanalyzes messages like literary essays.
20% proactive reply chance with relaxed moderation.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 10:06:31 -05:00
66ca97760b Add context format explanation to chat prompts
LLM was misinterpreting usernames as channel names because
the [Server context: ...] metadata format was never explained
in the system prompts. This caused nonsensical replies like
treating username "thelimitations" as "the limitations channel".

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 09:54:08 -05:00
622f0a325b Add auto-polls to settle disagreements between users
LLM analysis now detects when two users are in a genuine
disagreement. When detected, the bot creates a native Discord
poll with each user's position as an option.

- Disagreement detection added to LLM analysis tool schema
- Polls last 4 hours with 1 hour per-channel cooldown
- LLM extracts topic, both positions, and usernames
- Configurable via polls section in config.yaml

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 09:22:32 -05:00
13a2030021 Add switchable bot modes: default, chatty, and roast
Adds a server-wide mode system with /bcs-mode command.
- Default: current hall-monitor behavior unchanged
- Chatty: friendly chat participant with proactive replies (~10% chance)
- Roast: savage roast mode with proactive replies
- Chatty/roast use relaxed moderation thresholds
- 5-message cooldown between proactive replies per channel
- Bot status updates to reflect active mode
- /bcs-status shows current mode and effective thresholds

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 08:59:51 -05:00
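The mode table and effective thresholds might be structured as below. The ~10% chatty reply chance is from this commit and the relaxed 0.85/0.90 pair appears in a later one; the roast chance, the base threshold pair, and all names are assumptions:

```python
MODES = {
    "default": {"reply_chance": 0.00, "relaxed": False},  # hall monitor
    "chatty":  {"reply_chance": 0.10, "relaxed": True},
    "roast":   {"reply_chance": 0.20, "relaxed": True},
}

def effective_thresholds(mode, base=(0.60, 0.75), relaxed=(0.85, 0.90)):
    """Return the (warn, mute) toxicity thresholds for the active mode,
    as surfaced by /bcs-status."""
    return relaxed if MODES[mode]["relaxed"] else base
```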
d41873230d Reduce repetitive drama score mentions in chat replies
Only inject drama score/offense context when values are noteworthy
(score >= 0.2 or offenses > 0). Update personality prompt to avoid
harping on zero scores and vary responses more.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-22 22:57:25 -05:00
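The noteworthiness gate is a one-liner; the thresholds come from the commit, while the bracketed string format is an assumption:

```python
def drama_context(score: float, offenses: int) -> str:
    """Only surface drama stats when noteworthy (score >= 0.2 or any
    offenses), so the bot stops harping on zero scores."""
    if score >= 0.2 or offenses > 0:
        return f"[drama score: {score:.2f}, offenses: {offenses}]"
    return ""
```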
e2404d052c Improve LLM context with full timestamped channel history
Send last ~8 messages from all users (not just others) as a
multi-line chat log with relative timestamps so the LLM can
better understand conversation flow and escalation patterns.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-22 14:04:30 -05:00
fee3e3e1bd Add game channel redirect feature and sexual_vulgar detection
Detect when users discuss a game in the wrong channel (e.g. GTA talk
in #warzone) and send a friendly redirect to the correct channel.
Also add sexual_vulgar category and scoring rules so crude sexual
remarks directed at someone aren't softened by "lmao".

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-21 17:02:59 -05:00
e41845de02 Add scoreboard roast feature via image analysis
When @mentioned with an image attachment, the bot now roasts players
based on scoreboard screenshots using the vision model. Text-only
mentions continue to work as before.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-21 16:30:26 -05:00
645b924011 Extract LLM prompts to separate text files and fix quoting penalty
Move the analysis and chat personality system prompts from inline Python
strings to prompts/analysis.txt and prompts/chat_personality.txt for
easier editing. Also add a rule so users quoting/reporting what someone
else said are not penalized for the quoted words.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-21 12:19:28 -05:00
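Loading the extracted prompts is straightforward; the filenames `analysis.txt` and `chat_personality.txt` are from the commit, while the loader's shape is an assumption:

```python
from pathlib import Path

def load_prompt(prompt_dir: str, name: str) -> str:
    """Read a system prompt (e.g. 'analysis', 'chat_personality') from its
    text file so prompts can be edited without touching Python code."""
    return (Path(prompt_dir) / f"{name}.txt").read_text(encoding="utf-8")
```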