Structured Semantic RAG with Access Control
Retrieval-Augmented Generation (RAG) improves language models by letting them pull in external context — but traditional implementations fall short for enterprise use. Aegis extends this pattern with Structured Semantic RAG: a secure, metadata-aware, graph-enabled retrieval system that supports fine-grained access control.
This guide explains what makes Structured Semantic RAG (SSRAG) different, why it’s needed, and how it enables enterprise-grade retrieval for intelligent agents.
Why Simple Semantic Search Falls Short
Imagine building an assistant for a multi-tenant healthcare platform. A physician asks:
“Summarize the patient’s respiratory history.”
Standard RAG will:
- Embed the query
- Run approximate nearest neighbor search
- Return the top-k closest document chunks
Problems:
- May return data from other patients or departments
- Doesn’t filter by user role or visibility
- Doesn’t understand document structure or relationships
In enterprise settings, this leads to:
- Data leakage across tenants
- Inconsistent or irrelevant answers
- Lack of control and traceability
What SSRAG Adds
Structured Semantic RAG enhances traditional RAG with:
1. Structured Metadata Filtering
Every document and chunk in Aegis is tagged with metadata like:
tenant_id
,document_type
,access_visibility
,tags
,owner_id
At query time, we filter based on:
- User’s tenant and role
- Required tags or document types
- Visibility constraints
Example:
Only return chunks from
physician-notes
taggedrespiratory
, authored within the user’s department, and markedprivate
.
2. Semantic Graph Relationships (GraphRAG)
Chunks are not isolated — they’re linked semantically:
- “Patient History” → “Treatment Plan” → “Follow-up Notes”
- “Clause” → “Cited Case” → “Commentary”
Aegis supports traversing this graph:
- Multi-hop reasoning
- Re-ranking based on neighborhood
- Expansion from anchor concepts to supporting context
Example:
A legal agent asks for “termination clause context.” SSRAG expands from the clause to related cases and commentary, returning a richer, more precise summary.
3. Access-Aware Retrieval
Access control is not applied post-retrieval — it’s embedded in the retrieval process:
- Chunk access policies are enforced in the Vespa query layer
- Users only see what they’re permitted to see, at index time
Example:
An intern from Hospital A cannot retrieve notes from Hospital B, even if embeddings are close.
Why Vespa?
Vespa is uniquely suited to power SSRAG:
- Hybrid Retrieval: Lexical (BM25) + Semantic (ANN)
- Structured Filtering: First-class support for metadata-based filters
- Ranking Expressions: Re-rank using recency, semantic similarity, access score, etc.
- High Performance: Scales across large, multi-tenant deployments
Other vector databases lack this combination of structured filters, hybrid scoring, and runtime ranking flexibility.
Use Cases Enabled
Multi-Tenant AI Workflows
Guarantee data boundaries across organizations with chunk-level access enforcement.
Scoped Retrieval with Reasoning
Traverse related context across a semantic document graph.
Domain-Specific Control
Retrieve based on structured metadata: department, doc type, access, version.
What’s Next
- Learn about document and chunk schemas
- Explore the Vespa setup
- Read about access control models
SSRAG turns your RAG pipeline into a permission-aware, structured knowledge base — unlocking the next generation of enterprise-grade AI agents.