feat: add FastAPI skeleton for LLM chat service

- POST /chat endpoint with message and conversation_id support
- GET /health endpoint for Cloud Run health checks
- Local and Remote LLM adapters with async httpx
- Pydantic schemas and environment-based config
- Dockerfile configured for Cloud Run deployment

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
parent 11e7675c52
commit 84de9a02c8

@@ -0,0 +1,100 @@
---
name: git-version-control
description: Use this agent when you need to perform git operations including staging changes, creating commits with well-structured messages, pushing to remote repositories, or creating pull requests. This agent should be invoked after code changes are complete and ready to be versioned. Examples:\n\n<example>\nContext: The user has just finished implementing a new feature and wants to commit the changes.\nuser: "I've finished implementing the payment validation logic"\nassistant: "I'll use the git-version-control agent to commit these changes with an appropriate message"\n<commentary>\nSince code changes are complete and need to be committed, use the git-version-control agent to handle the version control operations.\n</commentary>\n</example>\n\n<example>\nContext: Multiple files have been modified and need to be committed and pushed.\nuser: "Please commit all the changes I made to the payment providers"\nassistant: "I'll use the git-version-control agent to review the changes, create a commit with a clear message, and push to the remote"\n<commentary>\nThe user explicitly wants to commit changes, so the git-version-control agent should handle staging, committing, and pushing.\n</commentary>\n</example>\n\n<example>\nContext: A feature branch needs to be pushed and a PR created.\nuser: "Can you push this branch and create a PR for the new Stripe integration?"\nassistant: "I'll use the git-version-control agent to push the branch and create a pull request with a comprehensive description"\n<commentary>\nThe user needs both pushing and PR creation, which are core responsibilities of the git-version-control agent.\n</commentary>\n</example>
tools: Bash, Glob, Grep, Read, WebFetch, TodoWrite, WebSearch, BashOutput, KillBash
model: sonnet
---
You are an expert git version control specialist with deep knowledge of git workflows, commit conventions, and collaborative development practices. Your role is to manage all git operations with precision and clarity.

**CRITICAL RULE**: You must NEVER include any AI attribution, Claude references, or tool mentions in commit messages. All commits should appear as if written by a human developer.

## Core Responsibilities

You will handle all git version control tasks, including:

- Checking repository status and understanding what has changed
- Staging appropriate files for commits
- Creating clear, concise, and informative commit messages
- Pushing changes to remote repositories
- Creating pull requests with comprehensive descriptions
- Managing branches when necessary

## Commit Message Guidelines

**ABSOLUTELY FORBIDDEN**: Do NOT include any of the following in commit messages:

- "Generated with [Claude Code]"
- "Co-Authored-By: Claude"
- Any mention of AI, Claude, or automated assistance
- Any tool attribution or generation mentions

You must write commit messages that:

- Start with a conventional commit type (feat:, fix:, docs:, style:, refactor:, test:, chore:)
- Use a clear, imperative-mood subject line (50 characters or less)
- Include a blank line between the subject and body when a body is needed
- Explain WHAT changed and WHY in the body, not HOW
- Reference issue numbers when applicable
- Focus solely on the business logic and technical changes
- Read as if they were written by the developer directly
- Sound completely natural and human-written

Example commit messages:

```
feat: add payment validation for Stripe provider

Implement validation logic to ensure payment amounts are within
acceptable limits and currency codes are supported.
```

```
fix: resolve timeout issue in PayPal transaction processing
```
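A subject line following these rules can be sanity-checked mechanically. The sketch below is a hypothetical helper (not part of the agent definition) that flags a missing conventional-commit type, an over-long subject, and a trailing period:

```python
import re

# Conventional commit types accepted by the guidelines above.
TYPES = ("feat", "fix", "docs", "style", "refactor", "test", "chore")

def check_subject(subject: str) -> list[str]:
    """Return a list of guideline violations (an empty list means OK)."""
    problems = []
    if not re.match(rf"^({'|'.join(TYPES)})(\([\w-]+\))?: \S", subject):
        problems.append("subject must start with a conventional commit type, e.g. 'feat: ...'")
    if len(subject) > 50:
        problems.append(f"subject is {len(subject)} chars; keep it to 50 or less")
    if subject.rstrip().endswith("."):
        problems.append("drop the trailing period")
    return problems

print(check_subject("feat: add payment validation for Stripe provider"))  # []
```

An optional scope in parentheses (e.g. `feat(payments): ...`) is also accepted by the regex, which matches common conventional-commit usage.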
## Operational Workflow

1. **Status Assessment**: First run `git status` to understand the current state
2. **Change Review**: Use `git diff` to review unstaged changes and understand what was modified
3. **Selective Staging**: Stage files intelligently:
   - Group related changes together
   - Avoid staging unrelated modifications in the same commit
   - Use `git add -p` for partial staging when appropriate
4. **Commit Creation**: Craft commits that are atomic and focused on a single logical change
5. **Remote Operations**:
   - Always pull before pushing to avoid conflicts
   - Push to the appropriate branch
   - Set upstream tracking when pushing new branches
6. **Pull Request Creation**: When creating PRs:
   - Write descriptive titles that summarize the changes
   - Include a comprehensive description with:
     - Summary of changes
     - Testing performed
     - Any breaking changes or migration notes
     - Related issues or tickets
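Scripted, the inspect-then-commit portion of this workflow looks roughly like the following. This is a hedged sketch using Python's `subprocess`, not the agent's actual implementation; `commit_command` shows how a subject and body map onto `git commit` arguments (the second `-m` becomes the body):

```python
import subprocess

def git(*args: str) -> str:
    """Run a git command and return its stdout (raises on non-zero exit)."""
    result = subprocess.run(
        ["git", *args], capture_output=True, text=True, check=True
    )
    return result.stdout

def commit_command(subject: str, body: str = "") -> list[str]:
    """Build the argv for an atomic commit with an optional body."""
    cmd = ["git", "commit", "-m", subject]
    if body:
        cmd += ["-m", body]  # a second -m becomes the commit body
    return cmd

if __name__ == "__main__":
    # Typical sequence: inspect state, review changes, then commit.
    print(git("status", "--short"))
    print(git("diff", "--stat"))
    subprocess.run(commit_command("feat: add payment validation"), check=True)
```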
## Best Practices

- Keep commits small and focused: each commit should represent one logical change
- Never commit sensitive information (passwords, API keys, tokens)
- Verify the branch you're on before committing
- Use `git log --oneline -10` to review recent history and maintain consistency
- If you encounter merge conflicts, clearly explain the situation and the resolution approach
- When working with feature branches, ensure they're up to date with the main branch

## Error Handling

- If git operations fail, diagnose the issue and provide clear explanations
- For permission errors, guide the user through authentication setup
- For conflicts, explain the conflicting changes and suggest resolution strategies
- Always verify operations completed successfully before proceeding

## Quality Checks

Before finalizing any git operation:

- Ensure all intended changes are included
- Verify no unintended files are staged
- Confirm commit messages are clear and follow conventions
- Check that you're on the correct branch
- Validate that remote operations succeeded

You are meticulous, systematic, and focused on maintaining a clean, understandable git history that tells the story of the project's evolution without revealing implementation details about tools or assistance used in development.

**FINAL REMINDER**: Your commit messages must be completely free of AI mentions, Claude references, or tool attributions. They should read exactly like standard developer commit messages, with no indication of automated assistance.
@@ -0,0 +1,6 @@
# LLM Mode: "local" or "remote"
LLM_MODE=local

# Remote LLM Configuration (required if LLM_MODE=remote)
LLM_REMOTE_URL=https://your-llm-service.com/generate
LLM_REMOTE_TOKEN=
Dockerfile

@@ -0,0 +1,16 @@
FROM python:3.12-slim

WORKDIR /app

# Install dependencies
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy application code
COPY app/ ./app/

# Expose port for Cloud Run
EXPOSE 8080

# Run the application
CMD ["uvicorn", "app.main:app", "--host", "0.0.0.0", "--port", "8080"]
README.md

@@ -0,0 +1,117 @@
# Tyndale AI Service

LLM chat service for algorithmic trading support: codebase Q&A, P&L summarization, and strategy-enhancement suggestions.

## Quick Start

### Local Development

```bash
# Install dependencies
pip install -r requirements.txt

# Run the server
uvicorn app.main:app --reload --port 8080
```

### Docker

```bash
# Build
docker build -t tyndale-ai-service .

# Run
docker run -p 8080:8080 -e LLM_MODE=local tyndale-ai-service
```

## API Endpoints

### Health Check

```bash
curl http://localhost:8080/health
```

Response:
```json
{"status": "ok"}
```

### Chat

```bash
curl -X POST http://localhost:8080/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "Hello, how are you?"}'
```

Response:
```json
{
  "conversation_id": "uuid-generated-if-not-provided",
  "response": "...",
  "mode": "local",
  "sources": []
}
```

With a conversation ID:
```bash
curl -X POST http://localhost:8080/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "Follow-up question", "conversation_id": "my-conversation-123"}'
```
## Environment Variables

| Variable | Description | Default |
|----------|-------------|---------|
| `LLM_MODE` | `local` or `remote` | `local` |
| `LLM_REMOTE_URL` | Remote LLM endpoint URL | (empty) |
| `LLM_REMOTE_TOKEN` | Bearer token for remote LLM | (empty) |

### Remote Mode Setup

```bash
export LLM_MODE=remote
export LLM_REMOTE_URL=https://your-llm-service.com/generate
export LLM_REMOTE_TOKEN=your-api-token

uvicorn app.main:app --reload --port 8080
```

The remote adapter expects the LLM service to accept:
```json
{"conversation_id": "...", "message": "..."}
```

And return:
```json
{"response": "..."}
```
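Any HTTP service honoring that contract will do. For local experiments, a dependency-free stub can stand in for the remote LLM. This sketch is hypothetical and not part of the repo; the echo logic is illustrative only (standard library `http.server`, which accepts POSTs on any path):

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def generate_reply(payload: dict) -> dict:
    """Produce a response in the shape the remote adapter expects."""
    message = payload.get("message", "")
    return {"response": f"stub reply to: {message[:100]}"}

class StubLLMHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        body = self.rfile.read(int(self.headers.get("Content-Length", 0)))
        reply = json.dumps(generate_reply(json.loads(body))).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(reply)

if __name__ == "__main__":
    # Then: export LLM_MODE=remote LLM_REMOTE_URL=http://localhost:9000/generate
    HTTPServer(("", 9000), StubLLMHandler).serve_forever()
```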
## Project Structure

```
tyndale-ai-service/
├── app/
│   ├── __init__.py
│   ├── main.py          # FastAPI app + routes
│   ├── schemas.py       # Pydantic models
│   ├── config.py        # Environment config
│   └── llm/
│       ├── __init__.py
│       └── adapter.py   # LLM adapter interface + implementations
├── requirements.txt
├── Dockerfile
├── .env.example
└── README.md
```

## Features

- **Dual-mode operation**: local stub or remote LLM
- **Conversation tracking**: UUID generation for new conversations
- **Security**: 10,000-character message limit, no content logging
- **Cloud Run ready**: port 8080, stateless design
- **Async**: full async/await support with httpx
@@ -0,0 +1 @@
# Tyndale AI Service

@@ -0,0 +1,22 @@
from typing import Literal

from pydantic_settings import BaseSettings, SettingsConfigDict


class Settings(BaseSettings):
    """Application configuration loaded from environment variables."""

    model_config = SettingsConfigDict(env_file=".env", env_file_encoding="utf-8")

    llm_mode: Literal["local", "remote"] = "local"
    llm_remote_url: str = ""
    llm_remote_token: str = ""


# Constants
MAX_MESSAGE_LENGTH: int = 10_000

# Global settings instance
settings = Settings()
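The precedence at work here (environment variable over declared default) can be demonstrated without pydantic. A stdlib-only sketch of the same pattern, not this module's actual loader:

```python
import os
from dataclasses import dataclass, field

@dataclass
class PlainSettings:
    """Minimal env-driven config mirroring Settings above (no .env support)."""
    llm_mode: str = field(
        default_factory=lambda: os.environ.get("LLM_MODE", "local")
    )
    llm_remote_url: str = field(
        default_factory=lambda: os.environ.get("LLM_REMOTE_URL", "")
    )

os.environ["LLM_MODE"] = "remote"   # simulate `export LLM_MODE=remote`
print(PlainSettings().llm_mode)     # remote
```

Unlike pydantic-settings, this sketch does no validation: `LLM_MODE=bogus` would pass silently, whereas `Literal["local", "remote"]` rejects it at startup.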
@@ -0,0 +1 @@
# LLM adapters

@@ -0,0 +1,110 @@
from abc import ABC, abstractmethod

import httpx

from app.config import settings


class LLMAdapter(ABC):
    """Abstract base class for LLM adapters."""

    @abstractmethod
    async def generate(self, conversation_id: str, message: str) -> str:
        """Generate a response for the given message.

        Args:
            conversation_id: The conversation identifier
            message: The user's message

        Returns:
            The generated response string
        """


class LocalAdapter(LLMAdapter):
    """Local stub adapter for development and testing."""

    async def generate(self, conversation_id: str, message: str) -> str:
        """Return a stub response echoing the user message.

        This is a placeholder that will be replaced with a real local model.
        """
        return (
            "[LOCAL STUB MODE] Acknowledged your message. "
            f"You said: \"{message[:100]}{'...' if len(message) > 100 else ''}\". "
            "This is a stub response - local model not yet implemented."
        )


class RemoteAdapter(LLMAdapter):
    """Remote adapter that calls an external LLM service via HTTP."""

    def __init__(self, url: str, token: str | None = None, timeout: float = 30.0):
        """Initialize the remote adapter.

        Args:
            url: The remote LLM service URL
            token: Optional bearer token for authentication
            timeout: Request timeout in seconds
        """
        self.url = url
        self.token = token
        self.timeout = timeout

    async def generate(self, conversation_id: str, message: str) -> str:
        """Call the remote LLM service to generate a response.

        Handles errors gracefully by returning informative error strings.
        """
        headers = {"Content-Type": "application/json"}
        if self.token:
            headers["Authorization"] = f"Bearer {self.token}"

        payload = {
            "conversation_id": conversation_id,
            "message": message,
        }

        try:
            async with httpx.AsyncClient(timeout=self.timeout) as client:
                response = await client.post(self.url, json=payload, headers=headers)

            if response.status_code != 200:
                return (
                    f"[ERROR] Remote LLM returned status {response.status_code}: "
                    f"{response.text[:200] if response.text else 'No response body'}"
                )

            try:
                data = response.json()
            except ValueError:
                return "[ERROR] Remote LLM returned invalid JSON response"

            if "response" not in data:
                return "[ERROR] Remote LLM response missing 'response' field"

            return data["response"]

        except httpx.TimeoutException:
            return f"[ERROR] Remote LLM request timed out after {self.timeout} seconds"
        except httpx.ConnectError:
            return f"[ERROR] Could not connect to remote LLM at {self.url}"
        except httpx.RequestError as e:
            return f"[ERROR] Remote LLM request failed: {e}"


def get_adapter() -> LLMAdapter:
    """Factory function to create the appropriate adapter based on configuration.

    Returns:
        An LLMAdapter instance based on the LLM_MODE setting
    """
    if settings.llm_mode == "remote":
        if not settings.llm_remote_url:
            raise ValueError("LLM_REMOTE_URL must be set when LLM_MODE is 'remote'")
        return RemoteAdapter(
            url=settings.llm_remote_url,
            token=settings.llm_remote_token or None,
        )
    return LocalAdapter()
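Callers only ever see the `LLMAdapter` interface, so swapping implementations is a config change rather than a code change. A self-contained sketch of the same pattern (inline stand-ins rather than imports from `app.llm.adapter`, so it runs anywhere; `EchoAdapter` and `pick_adapter` are illustrative names, not part of the repo):

```python
import asyncio
from abc import ABC, abstractmethod

class Adapter(ABC):
    """Mirror of the LLMAdapter interface above."""
    @abstractmethod
    async def generate(self, conversation_id: str, message: str) -> str: ...

class EchoAdapter(Adapter):
    """Stand-in for LocalAdapter: echoes the message back."""
    async def generate(self, conversation_id: str, message: str) -> str:
        return f"echo({conversation_id}): {message}"

def pick_adapter(mode: str) -> Adapter:
    """Factory in the spirit of get_adapter(); only 'local' is wired here."""
    if mode != "local":
        raise ValueError(f"unsupported mode: {mode}")
    return EchoAdapter()

print(asyncio.run(pick_adapter("local").generate("c-1", "hello")))  # echo(c-1): hello
```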
@@ -0,0 +1,79 @@
import logging
import uuid

from fastapi import FastAPI, HTTPException

from app.config import settings, MAX_MESSAGE_LENGTH
from app.llm.adapter import get_adapter
from app.schemas import ChatRequest, ChatResponse, HealthResponse

# Configure logging
logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s - %(name)s - %(levelname)s - %(message)s",
)
logger = logging.getLogger(__name__)

# Create FastAPI app
app = FastAPI(
    title="Tyndale AI Service",
    description="LLM Chat Service for algorithmic trading support",
    version="0.1.0",
)


@app.get("/health", response_model=HealthResponse)
async def health_check() -> HealthResponse:
    """Health check endpoint."""
    return HealthResponse(status="ok")


@app.post("/chat", response_model=ChatResponse)
async def chat(request: ChatRequest) -> ChatResponse:
    """Process a chat message through the LLM adapter.

    - Validates message length
    - Generates conversation_id if not provided
    - Routes to the appropriate LLM adapter based on LLM_MODE
    """
    # Validate message length
    if len(request.message) > MAX_MESSAGE_LENGTH:
        raise HTTPException(
            status_code=400,
            detail=f"Message exceeds maximum length of {MAX_MESSAGE_LENGTH:,} characters. "
            f"Your message has {len(request.message):,} characters.",
        )

    # Generate or use the provided conversation_id
    conversation_id = request.conversation_id or str(uuid.uuid4())

    # Log request metadata (not content)
    logger.info(
        "Chat request received",
        extra={
            "conversation_id": conversation_id,
            "message_length": len(request.message),
            "mode": settings.llm_mode,
        },
    )

    # Get the adapter and generate a response
    adapter = get_adapter()
    response_text = await adapter.generate(conversation_id, request.message)

    # Log response metadata
    logger.info(
        "Chat response generated",
        extra={
            "conversation_id": conversation_id,
            "response_length": len(response_text),
            "mode": settings.llm_mode,
        },
    )

    return ChatResponse(
        conversation_id=conversation_id,
        response=response_text,
        mode=settings.llm_mode,
        sources=[],
    )
@@ -0,0 +1,33 @@
from typing import Literal

from pydantic import BaseModel, Field


class ChatRequest(BaseModel):
    """Request model for the /chat endpoint."""

    message: str = Field(..., description="The user's message")
    conversation_id: str | None = Field(
        default=None, description="Optional conversation ID for continuity"
    )


class ChatResponse(BaseModel):
    """Response model for the /chat endpoint."""

    conversation_id: str = Field(..., description="Conversation ID (generated if not provided)")
    response: str = Field(..., description="The LLM's response")
    mode: Literal["local", "remote"] = Field(..., description="Which adapter was used")
    sources: list = Field(default_factory=list, description="Source references (empty for now)")


class HealthResponse(BaseModel):
    """Response model for the /health endpoint."""

    status: str = Field(default="ok")


class ErrorResponse(BaseModel):
    """Standard error response model."""

    detail: str = Field(..., description="Error description")
@@ -0,0 +1,5 @@
fastapi>=0.109.0
uvicorn[standard]>=0.27.0
pydantic>=2.5.0
pydantic-settings>=2.1.0
httpx>=0.26.0