Ask a generic AI about your business and you'll get generic advice. The gap isn't intelligence — it's context.

Every leader who's pasted a company question into a public chat tool has felt it: plausible, fluent, wrong for your reality. Earlier articles in this series covered context in each prompt and team practices. This one steps up a level: context windows, retrieval (RAG), and your own business data — the layer that turns AI into an operational tool.

At a glance

  • Context window = how much text (measured in tokens, units of text) the model can "see" in one request — with real limits
  • RAG (retrieval-augmented generation) = fetch relevant docs from your corpus, then generate an answer
  • Without curated data, AI guesses from public training — risky for operations and compliance
  • Context strategy connects to governance and data safety

Why context windows matter

LLMs (large language models) process a finite amount of text per call — policies, emails, manuals, chat history all compete for space. When you exceed limits, something gets truncated — often silently.

SymptomLikely cause
Answer ignores part of your uploadTruncation or poor prioritization
Inconsistent answers same dayDifferent chunks retrieved or session drift
"Forgot" earlier instructionsWindow filled; earlier context dropped

Practical rule: send what matters for this task, not everything you have. Summarize long material; link to canonical sources. A manager who pastes 80 pages of project notes and asks for an executive summary often gets output that ignores the budget section — not because the model is "dumb," but because the window is saturated.

RAG in plain language

Retrieval-augmented generation follows four steps:

  1. User asks a question
  2. System searches your approved knowledge base (policies, wikis, project files)
  3. Top relevant passages are injected into the prompt
  4. Model answers using those passages — ideally with citations

Done well, RAG turns AI from "smart stranger" into "well-briefed colleague." Done poorly, it retrieves wrong chunks and confidently misquotes your own policies — often a stale version still indexed.

Building the corpus is operational work, not a standalone IT project. Article 8 in this series details how to build a knowledge base.

Business data: the asset and the risk

Your data is why AI could be valuable for you specifically:

  • Pricing history, SOPs, past proposals, ticket resolutions
  • Meeting archives, inspection templates, regulatory checklists

It's also why privacy and security matter. Classify before indexing:

  • Green — internal, low sensitivity, good for early RAG
  • Yellow — personal or confidential — strict access and logging
  • Red — excluded until legal and security sign off

Never index "everything in the drive" as v1. Noise drowns signal — and multiplies risk.

A professional firm I advised indexed HR policies before client-facing SOPs — the wrong priority for their pilot. Staff kept asking operational questions; the system kept returning vacation policy excerpts. Reordering the corpus around the top 30 weekly questions fixed retrieval quality faster than switching models.

Context for agents and workflows

Autonomous agents multiply context needs — they pull from CRM, email, and tools in one run. Without:

  • Clear source of truth per data type
  • Permissions matching human roles
  • Logging of what was retrieved

…agents amplify confusion faster than chat. System context isn't optional when AI acts, not just answers.

How to improve context without a big project

  1. One canonical folder or wiki for the pilot domain
  2. Remove duplicates and obsolete versions — RAG hates stale PDFs
  3. Metadata — owner, date, language (fr/en), status
  4. Test questions — 20 real queries your team asks weekly; score answer quality
  5. Human review loop — "wrong doc retrieved" is a product signal, not user error

Context and prompting together

Even with RAG, good prompts specify:

  • Which sources to prefer
  • What to do if evidence is missing ("say you don't know")
  • Output format and review flags

"Use only the provided documents" is a minimum — enforce it in workflow design, not hope.

Where you are

You've standardized team prompting practices; this article explains why your data and history matter more than this week's model. Next: Multimodal AI: text, audio, images, and video — operational use cases beyond text alone.

Before you buy another chat tool, Let's sanity-check your data readiness. We'll sanity-check whether your data is ready for RAG — or what to fix first.