1151b705c0bbc666df5a8beba9d37c6a2f714970
- Serialize all LLM requests through an asyncio semaphore to prevent overloading athena with concurrent requests
- Switch chat() to streaming so the typing indicator only appears once the model starts generating (not during thinking/loading)
- Increase LLM timeout from 5 to 10 minutes for slow first loads
- Rename ollama_client.py to llm_client.py and self.ollama to self.llm since the bot uses a generic OpenAI-compatible API
- Update embed labels from "Ollama" to "LLM"

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
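The first two changes can be sketched together: a semaphore of 1 makes requests strictly sequential, and a streaming consumer fires a callback on the first token so the typing indicator starts only when generation begins. This is a minimal illustration, not the bot's actual code; the `LLMClient` class, `chat()` signature, and `on_first_token` callback are hypothetical.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable, Optional

class LLMClient:
    """Hypothetical sketch: serialize LLM requests and stream responses."""

    def __init__(self, timeout: float = 600.0):
        # Semaphore of 1 serializes requests so concurrent Discord
        # messages never hit the backend at the same time.
        self._semaphore = asyncio.Semaphore(1)
        self.timeout = timeout  # 10 minutes, to tolerate slow first loads

    async def chat(
        self,
        stream: Callable[[], AsyncIterator[str]],
        on_first_token: Optional[Callable[[], Awaitable[None]]] = None,
    ) -> str:
        async with self._semaphore:
            chunks: list[str] = []

            async def consume() -> None:
                first = True
                async for token in stream():
                    if first and on_first_token is not None:
                        # Fires once generation actually starts,
                        # e.g. to trigger the typing indicator.
                        await on_first_token()
                        first = False
                    chunks.append(token)

            # Timeout covers the whole streamed response.
            await asyncio.wait_for(consume(), timeout=self.timeout)
            return "".join(chunks)
```

With this shape, two overlapping `chat()` calls are processed one after the other, and the indicator callback runs per request only after the model emits its first token.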