Stop guessing. Cite the source.
Drop in your docs, help center, PDFs, or websites. Saaya builds a vector-grounded knowledge base that your agents query before every answer — with citations and a strict-citation default.
Hallucinations are a grounding problem. We fixed the grounding.
Every agent worth shipping is grounded. Without a knowledge base, an LLM answers from its training data — confidently, beautifully, and often wrong about your product. With Saaya Knowledge Bases, your agents answer only from the sources you connect, with citations the user can click to verify.
You can ingest help centers, PDFs, websites, Notion spaces, Confluence docs, and raw markdown. Saaya chunks, embeds, and stores everything in a managed, per-tenant vector store. The agent retrieves the most relevant passages on every turn, summarizes, and cites; it never hallucinates a feature or invents a policy.
The strict-citation default refuses to answer if the retrieval confidence is low. Instead, it escalates to a human and logs the gap so your team can fill it. No more "the bot promised something we don't actually do".
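The gate itself is simple to reason about. Here is a minimal sketch of the answer-or-escalate decision; the type names and the 0.75 threshold are illustrative assumptions, not Saaya's actual API:

```typescript
// Illustrative strict-citation gate. Names and threshold are assumptions,
// not Saaya's real SDK surface.
type Passage = { text: string; sourceUrl: string; score: number }

type Decision =
  | { action: 'answer'; citations: Passage[] }
  | { action: 'escalate'; reason: string }

const CONFIDENCE_THRESHOLD = 0.75 // assumed default; configurable in practice

function strictCitationGate(retrieved: Passage[]): Decision {
  const confident = retrieved.filter(p => p.score >= CONFIDENCE_THRESHOLD)
  if (confident.length === 0) {
    // Low confidence: hand off to a human and log the KB gap.
    return { action: 'escalate', reason: 'no passage above confidence threshold' }
  }
  return { action: 'answer', citations: confident }
}
```

The key design choice: the agent never answers from passages below threshold, so every reply it does give carries at least one citation a user can verify.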
What's under the hood.
A knowledge base that production support and sales teams can actually trust.
Multi-source ingest
Help centers (Zendesk, Intercom, Notion, Confluence), PDFs, websites, Markdown, raw text. URL-crawl with auto-refresh, or drop-and-go uploads.
Smart chunking
Heading-aware chunking that respects your doc structure. Code blocks stay intact. Tables stay readable. Cross-references survive embedding.
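To make "heading-aware" concrete, here is a sketch of the core idea: split on markdown headings, but never inside a fenced code block. This is an illustration of the technique, not Saaya's implementation:

```typescript
// Illustrative heading-aware chunker: new chunk at each heading, but code
// fences stay intact. A sketch of the idea, not Saaya's actual chunker.
function chunkByHeadings(markdown: string): string[] {
  const chunks: string[] = []
  let current: string[] = []
  let inCodeFence = false
  for (const line of markdown.split('\n')) {
    if (line.trimStart().startsWith('```')) inCodeFence = !inCodeFence
    // Start a new chunk at a heading, unless we're inside a code block.
    if (!inCodeFence && /^#{1,6} /.test(line) && current.length > 0) {
      chunks.push(current.join('\n'))
      current = []
    }
    current.push(line)
  }
  if (current.length > 0) chunks.push(current.join('\n'))
  return chunks
}
```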
Citations on every answer
Every agent reply includes the source passage and a click-through link. On voice, citations are spoken in summary form ("according to our refund policy…").
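The same citation renders differently per channel. A hypothetical shape and two renderers (field names are illustrative):

```typescript
// Hypothetical citation shape; field names are illustrative assumptions.
type Citation = { title: string; url: string; passage: string }

// Chat: show the passage with a click-through link.
function renderChatCitation(c: Citation): string {
  return `${c.passage} [${c.title}](${c.url})`
}

// Voice: speak a short attribution instead of reading a URL aloud.
function renderVoiceCitation(c: Citation): string {
  return `according to our ${c.title.toLowerCase()}, ${c.passage}`
}
```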
Freshness controls
Per-source refresh schedule (hourly, daily, on-demand). Invalidate by URL or by tag. Old answers don't survive a doc update.
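Invalidation by URL or tag amounts to filtering stale entries out of the store so they're re-embedded on the next refresh. A sketch under assumed entry shapes; Saaya's actual invalidation API may differ:

```typescript
// Illustrative invalidation: drop indexed entries by exact URL or by tag.
// Entry shape and function name are assumptions, not Saaya's API.
type Entry = { url: string; tags: string[]; embeddedAt: Date }

function invalidate(
  entries: Entry[],
  by: { url?: string; tag?: string },
): Entry[] {
  return entries.filter(e => {
    if (by.url && e.url === by.url) return false        // stale: re-embed next refresh
    if (by.tag && e.tags.includes(by.tag)) return false // stale: whole tag group
    return true
  })
}
```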
Scoped access
Tag KB entries by audience (public, internal, support-only). The agent retrieves only what the current user is allowed to see.
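Scoping works by filtering before retrieval, so a restricted passage can never reach the model's context. A minimal sketch, assuming each entry carries an explicit audience list:

```typescript
// Illustrative audience scoping; the retriever only ever sees entries the
// current user may read. Names and shapes are assumptions.
type Audience = 'public' | 'internal' | 'support-only'

function visibleTo<T extends { audiences: Audience[] }>(
  entries: T[],
  user: Audience,
): T[] {
  // Filter first, retrieve second: restricted passages never enter context.
  return entries.filter(e => e.audiences.includes(user))
}
```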
Strict-citation mode
On by default. The agent refuses to answer when retrieval confidence is below threshold — and escalates instead. No confident wrong answers.
Four steps from raw docs to grounded answers.
Connect a source
Point Saaya at your help center, Notion space, S3 bucket, or upload PDFs directly. Auto-detect structure and tags.
Chunk & embed
Saaya applies heading-aware chunking, embeds with the model you choose (OpenAI, Cohere, BGE), and stores the result in your isolated tenant vector store.
Query at runtime
On every agent turn, Saaya retrieves the top-K passages, scores them, and feeds the agent only what's relevant. Latency budget: under 80 ms.
Cite or escalate
The agent answers with citations. If confidence is low, strict-citation mode escalates to a human and flags the KB gap.
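The retrieve-and-score step in the flow above can be sketched as cosine similarity over pre-computed embeddings. Saaya manages this internally; this is only an illustration of what "retrieves the top-K passages and scores them" means:

```typescript
// Cosine similarity between two embedding vectors of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    na += a[i] * a[i]
    nb += b[i] * b[i]
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb))
}

// Score every passage against the query embedding, keep the best k.
function topK(
  query: number[],
  passages: { text: string; embedding: number[] }[],
  k: number,
): { text: string; score: number }[] {
  return passages
    .map(p => ({ text: p.text, score: cosine(query, p.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
}
```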
A working KB, ready to ship.
import { createKnowledgeBase } from '@saaya/sdk'

export const productDocs = await createKnowledgeBase({
  id: 'kb_product_docs',
  sources: [
    { type: 'url', url: 'https://docs.acme.com', refresh: 'daily' },
    { type: 'notion', spaceId: process.env.NOTION_SPACE },
    { type: 'pdf', url: 's3://acme-public/handbook.pdf' },
  ],
  embedding: { provider: 'openai', model: 'text-embedding-3-large' },
  chunking: { strategy: 'heading-aware', maxTokens: 800 },
  access: { audience: 'public' },
})

What you get on every tier.
1 KB · 50 docs · daily refresh · OpenAI embeddings only · 100K queries/month included.
Unlimited KBs · 10K docs each · hourly refresh · all embedding providers · 1M queries/month · scoped access · strict-citation analytics.
Self-hosted vector store option · BYO embedding model · custom chunking strategies · audit log export · 99.95% SLA on retrieval.
Pair this with the right Solution.
Frequently Asked Questions.
Saaya KBs ship the production layer most teams underestimate: heading-aware chunking, per-source refresh schedules, scoped access, citation rendering for both voice and chat, strict-citation gating, KB-gap analytics, and per-tenant isolation. You can roll your own — most teams burn 6+ weeks doing it.
Ground your agents.
Connect your first KB in under five minutes — your agents stop guessing and start citing. Free tier covers 100K queries a month.