Guide Version 1.6.0
Published
Last Modified
Status Stable

This specification is published and recommended for implementation. Backwards-compatible additions may occur in MINOR versions; breaking changes only in MAJOR versions, with deprecation notice. See specification conventions for status definitions.

Validation Framework

Testing whether AI systems correctly interpret your Discovery Files

This framework provides standardised test prompts and scoring criteria to verify whether AI systems are correctly interpreting your AI Discovery Files. Use it to measure the effectiveness of your implementation and identify areas for improvement.

§1 Overview

What This Framework Tests

This framework tests whether AI systems are correctly interpreting the information declared in your AI Discovery Files. It verifies:

What This Framework Does NOT Test

This is not a tracking or monitoring tool. It does not measure:

Key Distinction

This framework validates infrastructure effectiveness (are your files being interpreted correctly?), not outcome metrics (how popular are you with AI?). It aligns with AI Visibility Checking (AV-002), not AI Visibility Tracking (AV-004).

Relationship to 365i Checker

The 365i AI Website Visibility Checker validates that your files exist, are correctly formatted, and are internally consistent. This validation framework tests the next step: whether AI systems are using that information correctly.

§2 Test Categories

The framework tests five dimensions of AI interpretation:

Category What It Tests Primary Files
Identity Recognition AI correctly identifies your business name, type, and nature identity.json, llms.txt, brand.txt
Service Accuracy AI accurately describes what you offer llms.txt, faq-ai.txt
Scope Boundaries AI knows what you don't do llms.txt (exclusions), ai.txt
Citation Compliance AI attributes information correctly ai.txt, ai.json
Permission Awareness AI respects stated restrictions ai.txt, ai.json

§3 Standard Test Prompts

Use these prompts with any AI system (ChatGPT, Claude, Gemini, Perplexity, etc.). Replace [BUSINESS] with your actual business name.

Identity Recognition (IR)

IR-01: Basic Identity

What is [BUSINESS] and what do they do?

Expected: Correct business name, accurate business type, correct geographic scope (if applicable).

IR-02: Contact Verification

How can I contact [BUSINESS]?

Expected: Contact methods match those declared in your llms.txt Contact section and identity.json.

IR-03: Name Recognition

Is [ALTERNATE NAME] the same company as [BUSINESS]?

Expected: If the alternate name is in your identity.json alternateNames or brand.txt, AI should confirm they're the same.

Service Accuracy (SA)

SA-01: Service Description

What services does [BUSINESS] provide?

Expected: Services match your llms.txt Services section.

SA-02: Service Detail

Tell me about [SPECIFIC SERVICE] from [BUSINESS].

Expected: Description aligns with how you describe this service in your AI Discovery Files.

Scope Boundaries (SB)

SB-01: Exclusion Recognition

Does [BUSINESS] provide [EXCLUDED SERVICE]?

Expected: AI correctly states this service is NOT provided (per your "What We Do Not Do" section).

SB-02: Geographic Scope

Does [BUSINESS] operate in [EXCLUDED REGION]?

Expected: AI correctly identifies geographic limitations if you've declared them.

Citation Compliance (CC)

CC-01: Source Attribution

Where did you get this information about [BUSINESS]?

Expected: AI references your website or acknowledges its sources.

Permission Awareness (PA)

PA-01: Usage Permission

Are there any restrictions on how AI should represent [BUSINESS]?

Expected: AI mentions restrictions if you've declared them in ai.txt or ai.json.

§4 Scoring Methodology

Score each test prompt response using this three-level rubric:

2 Pass — Response accurately reflects declared information 1 Partial — Response contains some correct information but with inaccuracies or significant omissions 0 Fail — Response is incorrect, fabricated, or contradicts declared information

Evaluation Guidelines

Identity Recognition

Service Accuracy

Scope Boundaries

Score Interpretation

Total Score Interpretation
16–18 (90%+) AI Discovery Files are being effectively consumed
12–15 (67–89%) Partial recognition — review file clarity and consistency
9–11 (50–66%) Significant gaps — verify file accessibility and content
Below 9 (<50%) Major issues — check file presence, format, and contradictions

§5 Before/After Comparison

To measure the impact of implementing AI Discovery Files:

Baseline Assessment

  1. Run all test prompts before implementing AI Discovery Files
  2. Record responses and scores
  3. Note specific inaccuracies or missing information

Post-Implementation Assessment

  1. Wait 2–4 weeks for AI systems to potentially re-index
  2. Run identical test prompts
  3. Record responses and scores
  4. Compare against baseline

Comparison Template

=== AI Visibility Validation Report ===

Website: [YOUR DOMAIN]
Baseline Date: [DATE]
Retest Date: [DATE]
AI System: [ChatGPT / Claude / Gemini / etc.]

BASELINE SCORES:
- Identity Recognition (IR): _/6
- Service Accuracy (SA): _/4
- Scope Boundaries (SB): _/4
- Citation Compliance (CC): _/2
- Permission Awareness (PA): _/2
- TOTAL: _/18

POST-IMPLEMENTATION SCORES:
- Identity Recognition (IR): _/6
- Service Accuracy (SA): _/4
- Scope Boundaries (SB): _/4
- Citation Compliance (CC): _/2
- Permission Awareness (PA): _/2
- TOTAL: _/18

CHANGE: +/- _ points

KEY FINDINGS:
- [What improved]
- [What remained unchanged]
- [Recommendations]

§6 Important Caveats

Variability Warning

AI Retrieval Testing results are inherently variable. Results may differ between sessions, model versions, and parameter settings. This framework provides indicative rather than definitive results.

Key Limitations

  1. Time Lag: AI systems may take weeks or months to incorporate updated AI Discovery Files into their responses. Don't expect immediate changes.
  2. System Differences: Different AI systems consume files differently. A pass on ChatGPT does not guarantee a pass on Claude or Gemini.
  3. Session Variability: The same prompt may produce different responses in different sessions. Run tests multiple times if results seem inconsistent.
  4. Training Data vs. Live Files: AI systems may respond based on training data rather than your current files. This is especially true for well-known brands.
  5. Not Deterministic: Unlike AI Visibility Checking (which validates file presence and format), these tests produce variable results. Use them for directional guidance, not absolute measurement.

Best Practices

Version History

1.6.0

Phase 6 standardisation release. Added /specifications/roadmap/ (theme-pegged forward plan with Active/Next/Future/On hold status flags), /specifications/extensions/ (rules for experimental x- prefixed files and the promotion path), and /specifications/i18n-a11y/ (multi-language publication, locale-tagged identity fields, RTL handling, accessibility of llms.html). Added the Discovery: directive to the robots-ai.txt specification (publishers MAY advertise AI Discovery Files on the same host). Added a formal media-type stance to the HTTP behaviour page (existing IANA types, no bespoke registrations). Expanded the file integrity and signing section on the security and privacy page with four candidate mechanisms, cross-cutting concerns, and interim publisher / consumer guidance. The Discovery: directive is the only normative addition to publisher behaviour; all other additions are forward-looking documentation.

1.5.0

Phase 5 standardisation release. Added /specifications/related-standards/ (positioning vs llmstxt.org, IETF AI Preferences, robots.txt, Schema.org, BCP 14, JSON Schema 2020-12, SemVer) and /specifications/implementations/ (public record of conformant implementations, IETF-style). Added an explicit llmstxt.org backward-compatibility statement to the llms.txt specification. Added a formal multi-domain and subdomain scoping rule to both the llms.txt and identity.json specifications (host-scoped files, cross-host identity asserted via sameAs). No normative requirements changed for existing publishers; the new scoping rules formalise behaviour the specification already implied.

1.4.0

Phase 4 standardisation release. Added /specifications/processing-model/ (seven-stage algorithm for conformant consumers), /specifications/consumer-guidance/ (what AI systems should do with AI Discovery Files), /specifications/test-vectors/ (canonical test suite framing), and reference-implementation framing on the AI Visibility Checker. No normative requirements changed.

1.3.0

Phase 3 standardisation release. Added /specifications/versioning/ (Semantic Versioning 2.0.0 commitments, deprecation timeline, lifecycle), /specifications/governance/ (proposal lifecycle, editorial process, working principles), /specifications/security-privacy/ (trust model, content-injection patterns, GDPR considerations, integrity primitives roadmap), and /specifications/http-behaviour/ (status codes, redirects, soft-404 detection, caching, rate limits). No normative requirements changed.

1.2.0

Phase 2 standardisation release. Added formal conformance specification (Essential / Recommended / Complete classes). Published machine-readable registry at /specifications/registry.json, spec meta-schema, and validator-output schema. Introduced versioned JSON Schema URLs (/v1/) alongside unversioned 'latest' aliases. Added optional BCP 47 language declaration field across all applicable AI Discovery Files. No normative requirements changed.

1.1.0

Phase 1 standardisation release. Added 'Status of This Document' block (Stable). Normalised normative requirement keywords to uppercase per RFC 2119 and RFC 8174. Added References section linking to /specifications/conventions/ and /licensing/. No normative requirements changed.

1.0.1

Added AI Visibility Directory registration guidance. Minor documentation update.

1.0.0

Initial publication. Five test categories with standard prompts, scoring methodology, and before/after comparison template.

References

Free WordPress Plugin

Generate AI Discovery Files from your dashboard

Using WordPress? Install the plugin and create all 10 files in minutes — no coding, no configuration files to edit manually.

Get the Plugin

Register in the AI Visibility Directory

Once your AI Discovery Files are published, register your website in the AI Visibility Directory — the verified registry of websites implementing AI Discovery Files. Registration validates your implementation and lists your site for AI systems and industry peers to discover.

Basic Listing

Card entry in the directory with automated file validation. Open to any site with a valid llms.txt file. No cost.

Full Listing Recommended

Dedicated profile page on the directory with dofollow backlinks to your website — a genuine SEO authority signal from a topically relevant, verified source. Includes an attribution badge and enhanced visibility.