OmniRAG
ProductionAn enterprise retrieval-augmented-generation system over technical PDF manuals — four LLM providers (Phi-4 and Llama local; Mistral and GPT cloud), built on Clean Architecture.
- 4 language models, 5 embedding strategies, 4 chunking algorithms, 4 retrieval strategies
- Sub-50ms vector search on ChromaDB; under 8s end-to-end query time
- Enterprise resilience patterns: circuit breaker, retry with backoff, timeout protection
- 66/66 tests passing with 100% coverage on critical paths