osint-llm / QUICKSTART.md
Tom
Add complete RAG-powered OSINT investigation assistant
6466c00
# OSINT RAG App Quickstart
## Stack
- **Frontend**: Gradio 4.0+ (ChatInterface with auto API endpoints)
- **Database**: Supabase PGVector (768-dim embeddings, HNSW index)
- **LLM**: HuggingFace Inference API (Llama-3.1-8B-Instruct)
- **Embeddings**: HuggingFace Inference API (all-mpnet-base-v2, 768-dim)
- **Client**: Supabase Python client + InferenceClient (huggingface_hub)
## Key Parameters
- **Temperature**: 0.2 (low hallucination)
- **Max Tokens**: 600 (short responses)
- **Retrieval K**: 5 tools
- **Match Threshold**: 0.5 (cosine similarity)
- **Connection**: Transaction Pooler (port 6543)