When should you choose open AI instead of Microsoft Copilot?

Three typical reasons: sovereignty (Mistral or Aleph Alpha for strict EU requirements), cost control (at many thousands of requests per day, Copilot becomes more expensive than a self-orchestrated setup), and special model requirements (embedding models for search, vision models for document processing, audio transcription).

Which AI models do you use?

OpenAI (GPT-4o, GPT-4.1) for general tasks. Anthropic Claude for complex reasoning and coding tasks. Mistral Large/Small (EU-hosted at Mistral or via AWS Bedrock EU) for sovereign setups. Aleph Alpha for public-sector-oriented projects in Germany. Local LLMs (Llama, Qwen, Mistral) on Ollama for fully isolated environments.
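The model choices above can be read as a simple routing rule. The following is a minimal sketch, assuming that data-residency constraints take priority over task fit; the Request type, the sovereignty labels, and the function name are hypothetical and only illustrate the idea.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Request:
    task: str         # e.g. "general", "reasoning", "coding"
    sovereignty: str  # "none", "eu", "public_sector_de", or "isolated"


def choose_model_family(req: Request) -> str:
    """Pick a model family; sovereignty constraints are checked first."""
    if req.sovereignty == "isolated":
        return "Local LLM on Ollama"
    if req.sovereignty == "public_sector_de":
        return "Aleph Alpha"
    if req.sovereignty == "eu":
        return "Mistral (EU-hosted)"
    # No residency constraint: choose by task type.
    if req.task in {"reasoning", "coding"}:
        return "Anthropic Claude"
    return "OpenAI GPT"


if __name__ == "__main__":
    print(choose_model_family(Request(task="coding", sovereignty="none")))
    print(choose_model_family(Request(task="coding", sovereignty="eu")))
```

In a real project the rule set would likely also weigh cost and context length; this sketch only covers the two dimensions named in the answer.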

What is RAG, and do I need it?

Retrieval-augmented generation: you combine an LLM with your own documents so the model generates answers from your knowledge base, not from generic internet knowledge. For internal knowledge bases, contract search, and customer-service bots, RAG is today's standard architecture. We typically use PostgreSQL with pgvector or Qdrant for vector search.
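To make the retrieval step concrete, here is a minimal sketch in plain Python. It assumes a toy word-count "embedding" and an in-memory document list as stand-ins; a production pipeline would use a real embedding model and a vector store such as pgvector or Qdrant. All function names and sample documents are illustrative.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the question."""
    q = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, passages: list[str]) -> str:
    """Combine retrieved passages and the question into one LLM prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )


# Hypothetical sample corpus for illustration only.
DOCS = [
    "The notice period for the maintenance contract is three months.",
    "Support tickets are answered within one business day.",
    "Invoices are payable within 30 days of receipt.",
]

if __name__ == "__main__":
    question = "What is the notice period for the maintenance contract?"
    print(build_prompt(question, retrieve(question, DOCS, k=1)))
```

The prompt that comes out of build_prompt is what gets sent to the LLM, which is how the answer stays grounded in your documents.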

What does an AI integration cost?

We scope and price a discovery spike (2–4 weeks) together with you. A production RAG pipeline over your document corpus is offered within a fixed-price range. Ongoing LLM provider API costs are separate, typically €200 to €4,000 per month, depending on volume and model choice.
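A rough way to sanity-check the API cost range is to multiply request volume by token usage and the per-token price. The sketch below covers input tokens only and uses the $2.50 per 1M input tokens indication from the comparison table; the request volume and prompt size are assumed example values, and output-token pricing and currency conversion are deliberately left out.

```python
def monthly_input_cost_usd(
    requests_per_day: int,
    input_tokens_per_request: int,
    usd_per_million_input_tokens: float,
    days_per_month: int = 30,
) -> float:
    """Estimate monthly spend on input tokens only (output tokens not included)."""
    tokens = requests_per_day * input_tokens_per_request * days_per_month
    return tokens / 1_000_000 * usd_per_million_input_tokens


if __name__ == "__main__":
    # Assumed example: 5,000 requests per day with 2,000 input tokens each.
    cost = monthly_input_cost_usd(5_000, 2_000, 2.50)
    print(f"approx. ${cost:,.2f} per month for input tokens")
```

With these assumed numbers the input side alone comes to 300 million tokens per month, or about $750; RAG prompts that carry retrieved passages tend to push the token count per request upward.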

What does the EU AI Act mean for my company?

Since February 2025, Article 4 of the EU AI Act has applied: a training duty for all employees who use AI systems at work. From August 2026, a fine framework takes effect: up to €35 million or 7% of global annual revenue, whichever is higher. We help with inventory, training design, and documentation.

Can you also host LLMs on-prem?

Yes, with caveats. Local models such as Llama 3, Qwen, or Mistral can run on your own hardware (ideally with a GPU) via Ollama or vLLM. Quality, however, is significantly below GPT-4 or Claude. For highly sensitive use cases that need full data isolation, it's a valid option; for general knowledge work, we don't recommend it.
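As an illustration of what "local" means in practice, the sketch below talks to an Ollama server over its local HTTP API using only the Python standard library. It assumes Ollama is running on its default port 11434 on the same machine and that a model tagged llama3 has already been pulled; the helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "llama3") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "llama3", timeout: float = 120.0) -> str:
    """Send the prompt to the local model and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # Requires a running Ollama instance; no data leaves the machine.
    print(generate("Summarize our data-retention policy in one sentence."))
```

Because the request only ever goes to localhost, prompts and documents stay inside your own infrastructure, which is the point of the isolated setup.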

How do Microsoft Copilot and open AI combine?

They aren't mutually exclusive. Microsoft Copilot covers the standard Office world (email summaries, Teams notes, Word/Excel). Open AI integrations extend that with use cases Copilot doesn't serve — industry-specific RAG applications, customer-service bots, code assistance with specific models. In many mid-sized companies, both run in parallel.

Open AI Integrations — when Microsoft Copilot doesn't fit

Provider / Model	Strengths	Hosting	Price indication
OpenAI · GPT-4o, GPT-4.1	All-rounder, excellent multi-modal support, huge ecosystem	USA, EU region via Azure OpenAI	from approx. $2.50 / 1M input tokens
Anthropic Claude · Sonnet, Opus	Reasoning, coding, long contexts (1M tokens), safety tuning	USA, AWS Bedrock EU	from approx. $3 / 1M input tokens
Mistral · Large, Small	EU provider, good multilingual support, competitive open-source models	France (Mistral), AWS Bedrock EU	from approx. $2 / 1M input tokens
Aleph Alpha · Pharia	German provider, public-sector-oriented, focus on EU compliance	Germany (Heidelberg)	individual, license-based
Local LLMs · Llama, Qwen, Mistral	Fully isolated operation, no external API costs	Own infrastructure, ideally with GPU	Only hardware cost, from approx. €800 per month (Hetzner GPU)

The EU AI Act has been in force since August 2024, and its obligations apply in stages:

February 2025: Article 4 applies, a training duty for all employees who use AI systems at work. It's not about box-ticking slide presentations, but about demonstrable AI competence per role.
August 2025: Obligations for general-purpose AI providers (OpenAI, Anthropic, Mistral) apply; they affect you indirectly through your contracts with those providers.
August 2026: Obligations for high-risk AI systems apply fully. The fine framework takes effect: up to €35 million or 7% of global annual revenue, whichever is higher.

We build compliance into every AI integration:

Inventory of all AI systems with use-case description and risk classification
Training concept for affected roles (in collaboration with your HR/compliance)
Audit logs at the API request level
Model datasheets with clear notes on model origin, training, and limitations
Data protection impact assessment under GDPR Art. 35, where required
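Audit logging at the API request level can be as simple as wrapping every model call. The following is a minimal sketch, assuming a JSON Lines file as the log target and a SHA-256 hash in place of the raw prompt so the log itself holds no sensitive text; the wrapper name, field names, and file path are hypothetical.

```python
import hashlib
import json
import time
from datetime import datetime, timezone
from typing import Callable


def audited(
    call_model: Callable[[str], str],
    model: str,
    log_path: str = "ai_audit.jsonl",
) -> Callable[[str, str], str]:
    """Wrap a model call so that each request appends one audit record."""

    def wrapper(user: str, prompt: str) -> str:
        started = time.perf_counter()
        answer = call_model(prompt)
        record = {
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "user": user,
            "model": model,
            # Store a hash instead of the prompt text itself.
            "prompt_sha256": hashlib.sha256(prompt.encode("utf-8")).hexdigest(),
            "response_chars": len(answer),
            "latency_ms": round((time.perf_counter() - started) * 1000, 1),
        }
        with open(log_path, "a", encoding="utf-8") as f:
            f.write(json.dumps(record) + "\n")
        return answer

    return wrapper


if __name__ == "__main__":
    # Stub model for demonstration; replace with a real provider call.
    ask = audited(lambda p: p.upper(), model="stub-model")
    print(ask("alice", "hello"))
```

Whether to log hashes or full prompts is a design decision to settle with your data protection officer; hashes prove that a request happened without duplicating its content.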

This work belongs before implementation. For many mid-sized companies, a combined inventory and training concept is the first sensible step, even without a new implementation. More under AI Governance & EU AI Act.

What typically runs alongside this engineering work.

Engineering projects rarely stand alone — license logic, architecture clarification, quality gates, knowledge transfer, and follow-on operations usually run in parallel. Below are the most common accompanying services we add via discovery spikes, fixed-price sprints, or application-care contracts.

When to choose open AI instead of Microsoft Copilot.

Sovereignty

Cost control at scale

Specialized model choice

Which model for which task — an honest overview.

Where open AI creates value for mid-sized companies today.

RAG with your own documents

Customer-service bots

Code assistance

Content pipelines

Compliance is part of the architecture, not a retrofitted PDF.

Where AI integrations dock into the Microsoft ecosystem and your own.

Custom Software & Web Platforms →

Advisory & Architecture →

AI & Microsoft Copilot →

AI Governance & EU AI Act →

What clients ask before the architecture call.

Book an architecture conversation.

What typically runs alongside this engineering work.

Advisory & Architecture

License Advisory & CSP

Project Assurance

Training & learning program

Application Care

Knowledge Recovery