Breehavior-Monitor

aj/Breehavior-Monitor

Fork 0

Commit Graph

Author	SHA1	Message	Date
AJ Isaacs	cf88f003ba	Add LLM warm-up request at startup to preload model into VRAM Sends a minimal 1-token completion during setup_hook so the model is ready before Discord messages start arriving, avoiding connection errors and slow first responses after a restart. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 15:16:52 -05:00
AJ Isaacs	1151b705c0	Add LLM request queue, streaming chat, and rename ollama_client to llm_client - Serialize all LLM requests through an asyncio semaphore to prevent overloading athena with concurrent requests - Switch chat() to streaming so the typing indicator only appears once the model starts generating (not during thinking/loading) - Increase LLM timeout from 5 to 10 minutes for slow first loads - Rename ollama_client.py to llm_client.py and self.ollama to self.llm since the bot uses a generic OpenAI-compatible API - Update embed labels from "Ollama" to "LLM" Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-21 13:45:12 -05:00
AJ Isaacs	a35705d3f1	Initial commit: Breehavior Monitor Discord bot Discord bot for monitoring chat sentiment and tracking drama using Ollama LLM on athena.lan. Includes sentiment analysis, slash commands, drama tracking, and SQL Server persistence via Docker Compose. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-20 22:39:40 -05:00

Author

SHA1

Message

Date

AJ Isaacs

cf88f003ba

Add LLM warm-up request at startup to preload model into VRAM

Sends a minimal 1-token completion during setup_hook so the model is
ready before Discord messages start arriving, avoiding connection
errors and slow first responses after a restart.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-21 15:16:52 -05:00

AJ Isaacs

1151b705c0

Add LLM request queue, streaming chat, and rename ollama_client to llm_client

- Serialize all LLM requests through an asyncio semaphore to prevent
  overloading athena with concurrent requests
- Switch chat() to streaming so the typing indicator only appears once
  the model starts generating (not during thinking/loading)
- Increase LLM timeout from 5 to 10 minutes for slow first loads
- Rename ollama_client.py to llm_client.py and self.ollama to self.llm
  since the bot uses a generic OpenAI-compatible API
- Update embed labels from "Ollama" to "LLM"

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-21 13:45:12 -05:00

AJ Isaacs

a35705d3f1

Initial commit: Breehavior Monitor Discord bot

Discord bot for monitoring chat sentiment and tracking drama using
Ollama LLM on athena.lan. Includes sentiment analysis, slash commands,
drama tracking, and SQL Server persistence via Docker Compose.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-02-20 22:39:40 -05:00

3 Commits