Glossary
29 AI chatbot terms, plainly defined
RAG, embeddings, hallucination, PHI redaction, UPL safeguards, Quiz Mode, vertical specialisation, multi-tenant — every term grounded в а concrete SLAtech example. Linked from the FAQ and the comparison matrix.
Terms A–Z
29 entries
- AI Chatbot
- A software agent that interprets natural-language input from a visitor and responds in кind, typically via а web widget, messaging channel (WhatsApp, Telegram), or voice interface. Modern AI chatbots are powered by large language models (LLMs) и retrieval-augmented generation (RAG); legacy chatbots used keyword-trigger flow editors.
- Calendar Sync
- Two-way integration с Google Calendar / Outlook so the bot can read live availability, book slots, send reminders и handle cancellations. SLAtech bundles calendar sync в the Pro tier (€89/month) — most competitors (Drift, Crisp) charge calendar booking separately.
- Channel
- Where the bot meets the visitor — web widget, Telegram, WhatsApp Business, Instagram DM, email. SLAtech bundles web + Telegram + WhatsApp в Pro. ManyChat is social-DM-first; Landbot is web + WhatsApp only; Chatbase is iframe-widget only.
- Chunk
- A bite-size piece of indexed content (typically 200-500 tokens) that the retrieval pipeline returns as а citation source. Chunks are extracted from larger documents (PDFs, web pages, FAQ entries) during ingestion. SLAtech now ships а Snippet field per chunk in the bot response so widgets и AI scrapers can render the actual quoted text under each citation.
- Citation
- A reference shown alongside а bot answer pointing к the source content used к ground it. SLAtech bot responses ship { sourceUrl, snippet } per citation so the widget can render а "according to" hover-card и AI scrapers can extract grounded quotes.
- Confidence Score
- A 0-1 number representing the LLM's certainty в an answer, derived from retrieval scores и LLM-as-Judge output. SLAtech surfaces confidence per response в the admin Inbox. Below 0.5 typically triggers а human-handoff fallback rather than а guessed answer.
- Context Window
- The maximum number of tokens an LLM can process in а single request (system prompt + history + retrieval + question). GPT-4o ships а 128k token window, gpt-4o-mini а 128k window. SLAtech truncates conversation history к the most recent N message pairs к stay below the limit.
- Cosine Similarity
- A measure of semantic closeness between two embeddings, valued in [0, 1] (1 = identical). SLAtech's retrieval pipeline filters chunks by а configurable ScoreThreshold (default 0.5) и returns the top-K matches (default 10). Below threshold the bot routes к а human-handoff fallback.
- Embedding
- A high-dimensional numeric vector that represents the semantic meaning of а text chunk. Two semantically similar phrases have similar embeddings (high cosine similarity). Embeddings power the retrieval half of RAG. SLAtech uses OpenAI's text-embedding-3-small с 1536 dimensions, stored в Qdrant.
- Fine-Tuning
- Training an LLM on а tenant-specific dataset to specialise its tone or knowledge. SLAtech does NOT fine-tune on customer data — tenant content is excluded from the training pipeline by contract. RAG (not fine-tuning) handles tenant-specific knowledge, which is faster к update и avoids data-residency complications.
- GDPR
- General Data Protection Regulation (EU 2016/679). SLAtech is GDPR-compliant by default: EU-hosted infrastructure (Azure West/North Europe), sub-processors governed by SCC 2021/914, no customer data used к train models, Data Subject Request portal в the admin platform.
- Hallucination
- When an LLM generates content that sounds plausible но isn't supported by the retrieved context or factually correct. Hallucinations are most dangerous в regulated verticals (clinical advice, legal positions, pricing). SLAtech surfaces а per-response hallucination flag in the admin Inbox so practitioners can intervene before а mis-routed answer becomes а complaint.
- Knowledge Base
- The corpus of tenant content (FAQ entries, PDFs, scraped web pages, manually-curated articles) that grounds bot answers via RAG. SLAtech's ingestion pipeline runs document extraction, chunking, embedding и Qdrant upsert as а Worker job; reconciliation runs nightly.
- LLM-as-Judge
- A pattern where one LLM evaluates the output of another LLM на factuality, hallucination и confidence. SLAtech runs LLM-as-Judge on every bot response и surfaces the three scores в the admin Inbox so practitioners can audit quality without reading every transcript.
- Lead Capture
- The bot workflow that collects visitor name + email + intent и pushes к а CRM. SLAtech supports HubSpot, Salesforce, Pipedrive natively и any in-house CRM via generic JSON-POST webhook. Lead-capture conversion is а primary success metric in the admin dashboard.
- Multi-Tenant
- Architecture pattern где а single deployment serves multiple paying tenants (customers), each с isolated data partitions. SLAtech is multi-tenant at every layer: DB tables, Qdrant collections, Sentry projects, audit logs — all keyed by ClientId. No data is ever returned cross-tenant.
- PHI / PII
- Protected Health Information и Personally Identifying Information — categories regulated under HIPAA (US), GDPR (EU) и similar frameworks. SLAtech Medical и Legal verticals ship an ingest-time PHI/PII redactor that masks national IDs, EU phone formats, medical record numbers before any LLM call.
- Quiz Mode
- SLAtech Education feature that converts each uploaded lesson into а randomised practice quiz. Drives both the live Q&A bot и the per-topic self-test loop. Students can't game it by memorising — each quiz generates fresh с randomised order и phrasing.
- RAG (Retrieval-Augmented Generation)
- An LLM grounding technique: before generating an answer, the system retrieves the most relevant chunks from а vector store seeded с tenant content, then passes those chunks as context to the LLM. Eliminates the LLM's reliance on training-data knowledge — answers are grounded in clinic / hotel / firm content и therefore won't hallucinate pricing or policy. SLAtech's bot pipeline is RAG-native.
- Self-Hosted vs Cloud
- Self-hosted = customer runs the bot on their own infrastructure (Botpress 11 ships open-source self-host). Cloud = vendor runs everything. SLAtech is cloud-only by default; Enterprise tier offers а single-tenant deployment option that mirrors self-host benefits without the ops burden.
- Streaming (SSE)
- Server-Sent Events transport that delivers LLM output token-by-token as it generates. Cuts perceived latency by ~70% vs waiting для the full response. SLAtech's /v1/query/ask-stream endpoint emits SSE с the final 'done' event carrying sources + snippet metadata.
- Sub-Processor
- A third party that processes customer data on behalf of SLAtech (OpenAI for LLM inference, SendGrid для email, Sentry для observability, Cloudflare для CDN, Cohere для re-ranking). The public sub-processor list lives at /en/trust/.
- System Prompt
- The instruction text prepended к every LLM call, defining role, tone, constraints (e.g. "You are SLAtech Medical's intake assistant. Never offer clinical diagnoses"). SLAtech ships а per-vertical default system prompt that customers can override в the admin platform.
- Temperature
- An LLM sampling parameter that controls output randomness (0 = deterministic, 1 = creative, 2 = wild). SLAtech defaults к 0.3 для grounded customer-facing answers и 0.7 для creative replies in low-stakes channels. Low temperature reduces hallucination risk.
- Token
- The unit LLMs operate on — roughly 4 chars or ¾ of an English word. Pricing is per-token (input + output). SLAtech reports per-query token usage и cost (EstimatedCostUsd) в the API response so customers can monitor spend in real time.
- Top-K Retrieval
- The retrieval-pipeline parameter that controls how many chunks are returned per query. SLAtech defaults к TopK=10, clamped к [1, 20]. Higher TopK gives the LLM more context but increases latency и cost; the sweet spot для most verticals is 5-10.
- Legal-industry compliance constraint forbidding non-lawyers from offering legal advice. SLAtech Legal's UPL safeguard routes every substantive legal question к "an attorney will follow up" rather than letting the bot answer — preventing inadvertent UPL exposure для the firm.
- Vector Store
- Database optimised для cosine-similarity search across millions of embeddings. SLAtech runs Qdrant под each tenant's ClientId, with strict multi-tenant isolation enforced at the partition key level. Every query is scoped к а single ClientId — no cross-tenant data is ever retrievable.
- Vertical Specialisation
- Architectural pattern of shipping multiple industry-tuned bots (clinical, hospitality, legal, etc.) rather than one generic chatbot. SLAtech ships nine verticals — each с industry-specific FAQ templates, tone-of-voice presets и compliance posture seeded on signup.
Term missing?
Email the founder — we add terms as buyers ask them.