Commit Graph

14 Commits

Author SHA1 Message Date
Danny 77260c3e08 feat: yaml files for llm reasoning 2026-01-30 10:37:00 -06:00
Danny b0211b944d feat: replace Redis with in-memory conversation storage
- Remove Redis dependency and redis_client.py
- Implement ConversationMemory with module-level dictionary
- Add TTL support via timestamp checking
- Remove redis_connected from health endpoint
- Add embeddings, intent classification, and RAG prompt modules

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 10:34:47 -06:00
Danny 72778b65b5 feat: yaml files for llm reasoning 2026-01-29 12:39:41 -06:00
Danny Garcia 50599d7cee
Merge pull request #5 from dannyjosephgarcia/WOOL-25
feat: add GCP service-to-service authentication
2026-01-20 11:42:10 -06:00
Danny c9336d1d84 feat: add GCP service-to-service authentication
Implement identity token verification for Cloud Run deployments:
- Add auth module with GCP identity token verification
- Add configurable auth settings (AUTH_ENABLED, AUTH_AUDIENCE)
- Add service account allowlist for access control
- Protect /chat and /chat/stream endpoints with auth dependency
- Add google-auth dependency for token verification

Auth can be disabled for local development via AUTH_ENABLED=false.
2026-01-19 11:06:59 -06:00
Danny Garcia e3c4680108
Merge pull request #4 from dannyjosephgarcia/WOOL-18
feat: add CORS middleware and SSE streaming endpoint
2026-01-16 12:44:26 -06:00
Danny 6c1cf0655a feat: add CORS middleware and SSE streaming endpoint
Add CORS support for frontend development with configurable origins via
CORS_ORIGINS environment variable. Add /chat/stream endpoint for
Server-Sent Events streaming with true streaming support for OpenAI
adapter and fallback single-chunk behavior for other adapters.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-16 12:43:21 -06:00
Danny Garcia f497fde153
Merge pull request #3 from dannyjosephgarcia/WOOL-21
feat: add AskSage LLM provider integration
2026-01-16 12:18:14 -06:00
Danny 4c084d7668 feat: add AskSage LLM provider integration
Implement AskSageAdapter using the official asksageclient SDK to support
AskSage as an LLM provider option. This enables users to leverage
AskSage's API with configurable email, API key, and model settings.

- Add AskSageAdapter class with async support via thread pool
- Update Settings to include asksage_email, asksage_api_key, asksage_model
- Extend llm_mode literal to include "asksage" option
- Update dependency injection to instantiate AskSageAdapter when configured
- Remove completed OPENAI_INTEGRATION_PLAN.md
- Update requirements.txt with full dependency list including asksageclient
2026-01-16 12:16:34 -06:00
Danny Garcia ad5f8ef798
Merge pull request #2 from dannyjosephgarcia/WOOL-17
feat: add OpenAI integration with dependency injection support
2026-01-13 15:20:31 -06:00
Danny 3324b6ac12 feat: add OpenAI integration with dependency injection support
- Add OpenAIAdapter class using official OpenAI SDK with async support
- Create custom exception hierarchy for LLM errors (authentication,
  rate limit, connection, configuration, response errors)
- Refactor adapter factory to use FastAPI Depends() for dependency injection
- Update configuration to support 'openai' mode with API key and model settings
- Add proper HTTP error mapping for all LLM exception types
- Update Dockerfile with default OPENAI_MODEL environment variable
- Update .env.example with OpenAI configuration options
2026-01-13 15:17:44 -06:00
Danny Garcia d5525b12b2
Merge pull request #1 from dannyjosephgarcia/WOOL-9
feat: add FastAPI skeleton for LLM chat service
2026-01-13 12:51:12 -06:00
Danny Garcia 84de9a02c8 feat: add FastAPI skeleton for LLM chat service
- POST /chat endpoint with message and conversation_id support
- GET /health endpoint for Cloud Run health checks
- Local and Remote LLM adapters with async httpx
- Pydantic schemas and environment-based config
- Dockerfile configured for Cloud Run deployment

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-07 19:32:57 -06:00
Danny 11e7675c52 init: initial commit 2026-01-05 16:23:12 -06:00