Breehavior-Monitor

T

aj b410200146 Add max_tokens=1024 to LLM analysis calls

The analyze_message and raw_analyze methods had no max_tokens limit,
causing thinking models (Qwen3-VL-32B-Thinking) to generate unlimited
reasoning tokens before responding — taking 5+ minutes per message.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-21 14:17:59 -05:00

cogs

Add LLM request queue, streaming chat, and rename ollama_client to llm_client

2026-02-21 13:45:12 -05:00

prompts

Extract LLM prompts to separate text files and fix quoting penalty

2026-02-21 12:19:28 -05:00

utils

Add max_tokens=1024 to LLM analysis calls

2026-02-21 14:17:59 -05:00

.env.example

Initial commit: Breehavior Monitor Discord bot