Breehavior-Monitor

Author	SHA1	Message	Date
aj	89fabd85da	feat: add set_user_profile method to DramaTracker Replaces the entire notes field with an LLM-generated profile summary, used by the memory extraction system for permanent facts. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:54:05 -05:00
aj	67011535cd	feat: add memory extraction LLM tool and prompt Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:53:18 -05:00
aj	8686f4fdd6	fix: align default limits and parameter names to spec - get_recent_memories: limit default 10 → 5 - get_memories_by_topics: limit default 10 → 5 - prune_excess_memories: rename 'cap' → 'max_memories' Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:50:47 -05:00
aj	75adafefd6	feat: add UserMemory table and CRUD methods for conversational memory Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 12:48:54 -05:00
aj	cb8ef8542b	fix: guard against malformed LLM findings in conversation validation Filter out non-dict entries from user_findings and handle non-dict result to prevent 'str' object has no attribute 'setdefault' errors. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 21:38:02 -05:00
aj	f46caf9ac5	fix: tag context messages with [CONTEXT] to prevent LLM from scoring them The triage LLM was blending context message content into its reasoning for new messages (e.g., citing profanity from context when the new message was just "I'll be here"). Added per-message [CONTEXT] tags inline and strengthened the prompt to explicitly forbid referencing context content in reasoning/scores. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 20:08:23 -05:00
aj	7417908142	fix: separate context from new messages so prior-cycle chat doesn't inflate scores The conversation analysis was re-scoring old messages alongside new ones, causing users to get penalized repeatedly for already-scored messages. A "--- NEW MESSAGES ---" separator now marks which messages are new, and the prompt instructs the LLM to score only those. Also fixes bot-mention detection to require an explicit @mention in message text rather than treating reply-pings as scans (so toxic replies to bot warnings aren't silently skipped). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 15:48:02 -05:00
aj	8734f1883b	fix: persist last_offense_time and reset offenses after 24h last_offense_time was in-memory only — lost on restart, so the offense_reset_minutes check never fired after a reboot. Now persisted as LastOffenseAt FLOAT in UserState. On startup hydration, stale offenses (and warned flag) are auto-cleared if the reset window has passed. Bumped offense_reset_minutes from 2h to 24h. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 11:24:38 -05:00
aj	71c7b45e9a	feat: require warning before mute + sustained toxicity escalation Gate mutes behind a prior warning — first offense always gets a warning, mute only fires if warned_since_reset is True. Warned flag is persisted to DB (new Warned column on UserState) and survives restarts. Add post-warning escalation boost to drama_score: each high-scoring message after a warning adds +0.04 (configurable) so sustained bad behavior ramps toward the mute threshold instead of plateauing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 11:07:57 -05:00
aj	f02a4ab49d	Add content fallback for conversation analysis + debug logging When the LLM returns text instead of a tool call for conversation analysis, try parsing the content as JSON before giving up. Also log what the model actually returns on failure for debugging. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 10:16:15 -05:00
aj	90b70cad69	feat: channel-level conversation analysis with compact formatting Switch from per-user message batching to per-channel conversation analysis. The LLM now sees the full interleaved conversation with relative timestamps, reply chains, and consecutive message collapsing instead of isolated flat text per user. Key changes: - Fix gpt-5-nano temperature incompatibility (conditional temp param) - Add mention-triggered scan: users @mention bot to analyze recent chat - Refactor debounce buffer from (channel_id, user_id) to channel_id - Replace per-message analyze_message() with analyze_conversation() returning per-user findings from a single LLM call - Add CONVERSATION_TOOL schema with coherence, topic, and game fields - Compact message format: relative timestamps, reply arrows (→), consecutive same-user message collapsing - Separate mention scan tasks from debounce tasks - Remove _store_context/_get_context (conversation block IS the context) - Escalation timeout config: [30, 60, 120, 240] minutes Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 23:13:07 -05:00
aj	a0edf90ebd	Switch to max_completion_tokens for newer OpenAI models gpt-5-nano and other newer models require max_completion_tokens instead of max_tokens. The new parameter is backwards compatible with older models. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-24 17:07:44 -05:00
aj	c8e7c8c1cf	Trim prompts for gpt-4o-mini, remove disagreement detection Slim down chat_roast.txt — remove anti-repetition rules that were compensating for the local model (gpt-4o-mini handles this natively). Remove disagreement detection from analysis prompt, tool schema, and sentiment handler. Saves ~200 tokens per analysis call. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 16:26:44 -05:00
aj	942f5ddce7	Fix repetitive roast responses with anti-repetition mechanisms Add frequency_penalty (0.8) and presence_penalty (0.6) to LLM chat calls to discourage repeated tokens. Inject the bot's last 5 responses into the system prompt so the model knows what to avoid. Strengthen the roast prompt with explicit anti-repetition rules and remove example lines the model was copying verbatim ("Real ___ energy", etc.). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 15:15:11 -05:00
aj	534aac5cd7	Enable thinking for chat, diversify roast styles - Remove /no_think override from chat() so Qwen3 reasons before generating responses (fixes incoherent word-salad replies) - Analysis and image calls keep /no_think for speed - Add varied roast style guidance (deadpan, sarcastic, blunt, etc.) - Explicitly ban metaphors/similes in roast prompt - Replace metaphor examples with direct roast examples Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 13:59:16 -05:00
aj	8a06ddbd6e	Support hybrid LLM: local Qwen triage + OpenAI escalation Triage analysis runs on Qwen 8B (athena.lan) for free first-pass. Escalation, chat, image roasts, and commands use GPT-4o via OpenAI. Each tier gets its own base URL, API key, and concurrency settings. Local models get /no_think and serialized requests automatically. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:20:07 -05:00
aj	28fb66d5f9	Switch LLM backend from llama.cpp/Qwen to OpenAI - Default models: gpt-4o-mini (triage), gpt-4o (escalation) - Remove Qwen-specific /no_think hacks - Reduce timeout from 600s to 120s, increase concurrency semaphore to 4 - Support empty LLM_BASE_URL to use OpenAI directly Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 12:07:53 -05:00
aj	86aacfb84f	Add 120s timeout to image analysis streaming The vision model request was hanging indefinitely, freezing the bot. The streaming loop had no timeout so if the model never returned chunks, the bot would wait forever. Now times out after 2 minutes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 10:37:37 -05:00
aj	6e1a73847d	Persist bot mode across restarts via database Adds a BotSettings key-value table. The active mode is saved when changed via /bcs-mode and restored on startup. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 09:26:00 -05:00
aj	622f0a325b	Add auto-polls to settle disagreements between users LLM analysis now detects when two users are in a genuine disagreement. When detected, the bot creates a native Discord poll with each user's position as an option. - Disagreement detection added to LLM analysis tool schema - Polls last 4 hours with 1 hour per-channel cooldown - LLM extracts topic, both positions, and usernames - Configurable via polls section in config.yaml Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 09:22:32 -05:00
aj	3f56982a83	Simplify user notes trimming to keep last 10 lines Replace character-based truncation loop with a simple line count cap. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 23:43:36 -05:00
aj	b04d3da2bf	Add LLM request/response logging to database Log every LLM call (analysis, chat, image, raw_analyze) to a new LlmLog table with request type, model, token counts, duration, success/failure, and truncated request/response payloads. Enables debugging prompt issues and tracking usage. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 22:55:19 -05:00
aj	e2404d052c	Improve LLM context with full timestamped channel history Send last ~8 messages from all users (not just others) as a multi-line chat log with relative timestamps so the LLM can better understand conversation flow and escalation patterns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 14:04:30 -05:00
aj	fee3e3e1bd	Add game channel redirect feature and sexual_vulgar detection Detect when users discuss a game in the wrong channel (e.g. GTA talk in #warzone) and send a friendly redirect to the correct channel. Also add sexual_vulgar category and scoring rules so crude sexual remarks directed at someone aren't softened by "lmao". Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 17:02:59 -05:00
aj	e41845de02	Add scoreboard roast feature via image analysis When @mentioned with an image attachment, the bot now roasts players based on scoreboard screenshots using the vision model. Text-only mentions continue to work as before. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 16:30:26 -05:00
aj	b410200146	Add max_tokens=1024 to LLM analysis calls The analyze_message and raw_analyze methods had no max_tokens limit, causing thinking models (Qwen3-VL-32B-Thinking) to generate unlimited reasoning tokens before responding — taking 5+ minutes per message. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 14:17:59 -05:00
aj	1151b705c0	Add LLM request queue, streaming chat, and rename ollama_client to llm_client - Serialize all LLM requests through an asyncio semaphore to prevent overloading athena with concurrent requests - Switch chat() to streaming so the typing indicator only appears once the model starts generating (not during thinking/loading) - Increase LLM timeout from 5 to 10 minutes for slow first loads - Rename ollama_client.py to llm_client.py and self.ollama to self.llm since the bot uses a generic OpenAI-compatible API - Update embed labels from "Ollama" to "LLM" Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 13:45:12 -05:00
aj	645b924011	Extract LLM prompts to separate text files and fix quoting penalty Move the analysis and chat personality system prompts from inline Python strings to prompts/analysis.txt and prompts/chat_personality.txt for easier editing. Also add a rule so users quoting/reporting what someone else said are not penalized for the quoted words. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 12:19:28 -05:00
aj	a35705d3f1	Initial commit: Breehavior Monitor Discord bot Discord bot for monitoring chat sentiment and tracking drama using Ollama LLM on athena.lan. Includes sentiment analysis, slash commands, drama tracking, and SQL Server persistence via Docker Compose. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 22:39:40 -05:00

29 Commits