—°F Boise, ID
Boise Standard · Document Intelligence
◈ BM25 Keyword Search ◈ Semantic Vector Search ◈ Compliance Intelligence ◈ AI Context Export

Every word of
every document.
Under a second.

Your organization is sitting on years of accumulated documents. Contracts, specs, permits, reports, compliance records. The information is in there. Finding it is the problem. Document Search solves that — permanently, for your documents, built around your industry's language.

< 1s
Search response
across full corpus
2
Search modes —
keyword + semantic
Any
Industry —
built for your terms
Private
Your documents
nobody else's
The Problem

Your documents are a library with no index.

Every organization accumulates documents. The information inside them is valuable. Finding it is expensive — in time, in frustration, in decisions made without the right data because nobody could locate it fast enough.

Opening files one by one
Scanning page by page, hoping you remember which document had that clause, that number, that date. Real hours, every week, paid out of your payroll.
Generic search that doesn't know your field
Standard search tools don't understand prevailing wage, DIR registration, scope of work, abatement specs, or whatever your industry calls the things that matter most.
Information that can't reach AI tools
Your team uses Claude, ChatGPT, or other AI tools but can't get their own document data in front of them. The intelligence is locked in files nobody can search fast enough to use.
◈ The Fix
A searchable intelligence layer over everything you own.

Document Search builds a complete indexed intelligence layer over your entire document archive. Every word of every page becomes retrievable in under a second — by exact term or by plain English question.

We start with a conversation. We ask what you search for every day, what you find easily, and what you can never find fast enough. Then we bake your industry's exact terminology into your system. The result is a search engine that speaks your language — not a generic tool you have to learn to work around.

Your documents stay private. Nobody else's data is in your system. No vendor has access to your files. This is built for you and nobody else.

Currently accepting new engagements.

We work with a limited number of clients at a time to ensure every system gets built right. If your organization has a document search problem, reach out now. Early clients receive hands-on setup and direct support from the team that built it.
How It Works

Built for your documents. Around your language. In days — not months.

This is not off-the-shelf software you configure yourself. We build it with you. Three steps from conversation to live system.

Step 01
The Conversation
We talk first. What do you search for every day? What takes too long to find? What do you never find at all? What does your team call things that a generic search engine would never recognize? This conversation is the foundation of your system.
Step 02
Ingest & Build
You send us your documents. We run the full pipeline — extract, normalize, index, and embed every page. We bake your industry terminology into the search vocabulary. Every document type you own, indexed and retrievable. PDFs, Word files, scanned pages, emails, plain text — all of it.
Step 03
Live System
Your team gets access to a live search interface — keyword search, semantic search, compliance flags, full document viewer, notes, and one-click AI export. Results in under a second. Page-precise. Industry-aware. Ready for your team from day one.
What's Inside

Enterprise document intelligence. Every capability included.

No add-ons. No per-seat licensing. No usage caps. Everything below is part of every engagement.

§
BM25 Keyword Search
Type any term — clause, name, date, number, requirement — and see every mention across your entire archive with surrounding context and exact page number. Results in under a second regardless of corpus size. Industry vocabulary baked in so your terms return your results.
Semantic Vector Search
Ask a question in plain English. The system surfaces the right clause or section even when your exact words never appear in the document. Ask about liquidated damages and find the right section even if the document calls it something different. Intelligence over vocabulary — not just string matching.
Compliance Intelligence
Every document is automatically analyzed for compliance flags relevant to your industry — prevailing wage requirements, regulatory certifications, key dates, flagged conditions. The Intelligence tab surfaces extraction quality, confidence score, and compliance status for every document in your corpus without a single manual review.
Context Pod — AI Export
Collect passages from multiple documents in a single session. When you have what you need, copy everything — with full source attribution — directly into Claude, ChatGPT, or any AI assistant. Research that takes an hour takes minutes. Your document intelligence, inside your AI tools.
Notes Ledger
Notes written against a document stay attached to that document permanently — timestamped, searchable, tied to the source. Session notes for free-form thinking. Document notes for specific findings. A master notes log per job or project. Every observation your team records becomes part of the intelligence layer.
Full Document Workspace
Click any result to open the source document directly in the interface — PDF viewer, full extracted text with in-page search, intelligence tab, notes tab. Page-precise navigation. Full-screen PDF mode. Your documents are readable, searchable, and annotatable without leaving the system.
◈ Under the Hood
Built on the same pipeline that powers the Boise Standard directory.

Every document goes through a deterministic ingestion pipeline — extract, normalize, topology fingerprint, semantic index, compliance flag, enrich. The same pipeline that processes thousands of entities for the Boise Standard directory is what processes your documents.

The result is provenance on every result. You know exactly which document the answer came from, which page, with what confidence score, and what the extraction quality was. No black boxes. No mystery results.

When a document can't be fully extracted — scanned images, degraded PDFs — the system flags it explicitly and tells you why. You always know what you can trust and what needs manual review.

◈ Request a Demo
Document Search — Live System
§ search "liquidated damages" mode:keyword
────────────────────────────────────
✓ 7 results · 0.04s · BM25 · 312 docs indexed
[01] Section 8.4 — Liquidated Damages
Contract_Final_Executed.pdf · p.34 ↑ 0.94
[02] Exhibit A — Scope of Work
Addendum_One_Signed.pdf · p.7 ↑ 0.87
+ 5 more results
ask "what happens if contractor misses deadline"
────────────────────────────────────
✓ Semantic match · 0.09s · Vector search
[01] Section 8.4 — Liquidated Damages
Contract_Final_Executed.pdf · p.34 ↑ 0.91
compliance flags — this document
────────────────────────────────────
⚑ Prevailing Wage: REQUIRED
⚑ Bid Due: 2026-07-15
✓ Extraction quality: clean · conf: 97%
────────────────────────────────────
§ _
Any Industry

If your work produces documents, Document Search works for you.

The system is built around your vocabulary. Whatever your industry calls the things that matter — we bake that in before you ever run your first search.

⚖️
Law Firms
Case files · Contracts · Precedents · Motions · Discovery
🏗️
Construction
Bid packages · Specs · Permits · Submittals · RFIs
🏥
Medical Practices
Compliance records · Policies · Insurance · Protocols
🏛️
Government & Civic
Public records · Ordinances · Reports · Meeting minutes
🏢
Real Estate
Leases · Due diligence · Title docs · Appraisals
📊
Finance & Accounting
Invoices · Contracts · Audits · Compliance filings
🔬
Research & Science
Papers · Lab reports · Grant docs · Regulatory filings
⚙️
Any Industry
If it has words, numbers, or characters — it works
The Engagement

Enterprise document intelligence. Built for your organization. Priced to actually make sense.

◈ Every engagement includes
Discovery conversation — we learn your documents, your vocabulary, your search needs
Full document ingestion — every format, every page, every character indexed
Custom industry vocabulary baked into your search engine
BM25 keyword search and semantic vector search — both modes live
Compliance intelligence flags tuned to your regulatory environment
Context Pod — collect and export passages directly to any AI tool
Notes Ledger — session notes and per-document notes, permanently attached
Full document workspace — PDF viewer, full text, intelligence tab
Your documents stay private — no third-party cloud, no vendor access
Team onboarding and direct support from the team that built it
◈ Get Access
Request a free demo. See it working on your documents.
We set up a demo environment using a sample of your actual documents — not generic test data. You search your own content on day one. If it solves your problem, we build the full system. If it doesn't, you've lost nothing.
◈ What to expect
We respond within one business day
Demo uses your actual documents — not generic samples
No sales call required to see it working
Pricing is by engagement — no subscriptions, no per-seat fees
Your documents never leave your environment
Your business is already being talked about by AI.
Verify your entity in the Boise Standard directory. Permanent. Machine-readable. $25.
Verify My Business — $25
Provenance Chain
Your Documents Ingest Pipeline boisestandard.org/document-search/ Results in < 1 second