Confidential
Confidential·AI and back-end architect·2026 · 3 months

An agent that queries the internal knowledge base and answers with citations.

Automated ingestion of internal documentation, Qdrant vector store, hybrid search combining semantic similarity and BM25 lexical, a LangGraph-orchestrated agent that synthesises an answer and systematically cites its sources. No unsourced answer is tolerated.

Teams don't lose time for lack of knowledge, they lose it for lack of access.

In a thirty to eighty-person company, the critical knowledge exists: it's scattered across Notion, Drive, old threads, email. The operational cost of searching is never accounted for, but it easily exceeds the cost of a full-time hire beyond a certain headcount. The client knew this, but didn't have the number.

Upstream audit, report delivered before the engineering go-ahead.

Exhaustive mapping of knowledge sources, a user survey to quantify time lost in search, classification of query types by frequency and criticality. An eighteen-page scoping report was delivered before any development: document volumetrics, prioritised use cases, estimated annual cost of the status quo. Explicit sign-off before any code was written.

Ingestion pipeline, vector store, citation-mandatory agent.

Automated ingestion from Notion, Drive and email exports with parsing, chunking and embedding. Dedicated Qdrant vector store. Hybrid search combining semantic similarity and BM25 lexical to catch short queries. LangGraph-orchestrated agent for retrieval and synthesis, constrained to cite its sources on every answer. Docker containerisation, deployment on European infrastructure, ops documentation handed over to the internal team.

What it produces.

<3 s
median response time
92 %
answers rated usable in internal eval
100 %
answers sourced, citation enforced
Let's talk in 30 minutes.
Got a similar project?
Agent RAG sur connaissances internes | Ceres Broker