Block 03 — deep dive
RAG knowledge assistant — platform-independent, EU-hosted.
When Microsoft Copilot doesn't fit — because sources sit outside M365, because EU hosting is mandatory, or because user licensing doesn't work economically — we build a dedicated RAG knowledge assistant. Document chat with your content, on European infrastructure.
What RAG means
Retrieval-Augmented Generation (RAG) is the established architecture for AI knowledge assistants: your documents are indexed in a vector database, an embedding model creates semantic representations, and a large language model formulates the answer based on the retrieved evidence. Unlike "bare" LLMs, a RAG system answers on your data — with source citation and without hallucinating into the blue.
EU-hosted LLMs
We rely on Mistral and Aleph Alpha as European LLM providers with EU hosting and clear GDPR commitments. Over the past 24 months, both providers have largely closed the quality gap to OpenAI — for German-language knowledge answers, they are on par.
Vector DB setup
We use Qdrant, Weaviate, or pgvector (PostgreSQL extension), depending on the requirement. Embedding strategy, chunking, re-ranking, and index partitioning per permission boundary are part of the pilot package.
Hybrid sources
A RAG assistant can integrate sources that Microsoft Copilot cannot reach: file shares, Confluence, older SQL databases, scanned PDFs (with OCR), industry databases. That makes it the choice when your knowledge is heterogeneously distributed.
Quality metrics
Every RAG pilot at arades delivers metrics from day one: recall (which relevant documents are found?), precision (how many of the found documents are actually relevant?), and hallucination rate (how often does the LLM formulate answers without evidence?). Without these metrics, a RAG assistant remains a feeling — we make it measurable.