This specification is published and recommended for implementation. Backwards-compatible additions may occur in MINOR versions; breaking changes only in MAJOR versions, with deprecation notice. See specification conventions for status definitions.
Validation Framework
Testing whether AI systems correctly interpret your Discovery Files
This framework provides standardised test prompts and scoring criteria to verify whether AI systems are correctly interpreting your AI Discovery Files. Use it to measure the effectiveness of your implementation and identify areas for improvement.
§1 Overview
What This Framework Tests
This framework tests whether AI systems are correctly interpreting the information declared in your AI Discovery Files. It verifies:
- AI can correctly identify your business
- AI accurately describes your services
- AI knows what you don't do
- AI respects your stated permissions
What This Framework Does NOT Test
This is not a tracking or monitoring tool. It does not measure:
- How often AI mentions your brand
- Your "ranking" in AI responses
- Sentiment or tone of AI responses
- Traffic from AI systems
This framework validates infrastructure effectiveness (are your files being interpreted correctly?), not outcome metrics (how popular are you with AI?). It aligns with AI Visibility Checking (AV-002), not AI Visibility Tracking (AV-004).
Relationship to 365i Checker
The 365i AI Website Visibility Checker validates that your files exist, are correctly formatted, and are internally consistent. This validation framework tests the next step: whether AI systems are using that information correctly.
§2 Test Categories
The framework tests five dimensions of AI interpretation:
| Category | What It Tests | Primary Files |
|---|---|---|
| Identity Recognition | AI correctly identifies your business name, type, and nature | identity.json, llms.txt, brand.txt |
| Service Accuracy | AI accurately describes what you offer | llms.txt, faq-ai.txt |
| Scope Boundaries | AI knows what you don't do | llms.txt (exclusions), ai.txt |
| Citation Compliance | AI attributes information correctly | ai.txt, ai.json |
| Permission Awareness | AI respects stated restrictions | ai.txt, ai.json |
§3 Standard Test Prompts
Use these prompts with any AI system (ChatGPT, Claude, Gemini, Perplexity, etc.). Replace [BUSINESS] with your actual business name.
Identity Recognition (IR)
IR-01: Basic Identity
Expected: Correct business name, accurate business type, correct geographic scope (if applicable).
IR-02: Contact Verification
Expected: Contact methods match those declared in your llms.txt Contact section and identity.json.
IR-03: Name Recognition
Expected: If the alternate name is in your identity.json alternateNames or brand.txt, AI should confirm they're the same.
Service Accuracy (SA)
SA-01: Service Description
Expected: Services match your llms.txt Services section.
SA-02: Service Detail
Expected: Description aligns with how you describe this service in your AI Discovery Files.
Scope Boundaries (SB)
SB-01: Exclusion Recognition
Expected: AI correctly states this service is NOT provided (per your "What We Do Not Do" section).
SB-02: Geographic Scope
Expected: AI correctly identifies geographic limitations if you've declared them.
Citation Compliance (CC)
CC-01: Source Attribution
Expected: AI references your website or acknowledges its sources.
Permission Awareness (PA)
PA-01: Usage Permission
Expected: AI mentions restrictions if you've declared them in ai.txt or ai.json.
§4 Scoring Methodology
Score each test prompt response using this three-level rubric:
Evaluation Guidelines
Identity Recognition
- Business name spelled correctly?
- Business type accurately described?
- No conflation with other businesses?
Service Accuracy
- All major services mentioned?
- Descriptions substantially accurate?
- No services attributed that you don't offer?
Scope Boundaries
- Excluded services correctly identified as not offered?
- No false positives (saying you do something you don't)?
Score Interpretation
| Total Score | Interpretation |
|---|---|
| 16–18 (90%+) | AI Discovery Files are being effectively consumed |
| 12–15 (67–89%) | Partial recognition — review file clarity and consistency |
| 9–11 (50–66%) | Significant gaps — verify file accessibility and content |
| Below 9 (<50%) | Major issues — check file presence, format, and contradictions |
§5 Before/After Comparison
To measure the impact of implementing AI Discovery Files:
Baseline Assessment
- Run all test prompts before implementing AI Discovery Files
- Record responses and scores
- Note specific inaccuracies or missing information
Post-Implementation Assessment
- Wait 2–4 weeks for AI systems to potentially re-index
- Run identical test prompts
- Record responses and scores
- Compare against baseline
Comparison Template
=== AI Visibility Validation Report ===
Website: [YOUR DOMAIN]
Baseline Date: [DATE]
Retest Date: [DATE]
AI System: [ChatGPT / Claude / Gemini / etc.]
BASELINE SCORES:
- Identity Recognition (IR): _/6
- Service Accuracy (SA): _/4
- Scope Boundaries (SB): _/4
- Citation Compliance (CC): _/2
- Permission Awareness (PA): _/2
- TOTAL: _/18
POST-IMPLEMENTATION SCORES:
- Identity Recognition (IR): _/6
- Service Accuracy (SA): _/4
- Scope Boundaries (SB): _/4
- Citation Compliance (CC): _/2
- Permission Awareness (PA): _/2
- TOTAL: _/18
CHANGE: +/- _ points
KEY FINDINGS:
- [What improved]
- [What remained unchanged]
- [Recommendations]
§6 Important Caveats
AI Retrieval Testing results are inherently variable. Results may differ between sessions, model versions, and parameter settings. This framework provides indicative rather than definitive results.
Key Limitations
- Time Lag: AI systems may take weeks or months to incorporate updated AI Discovery Files into their responses. Don't expect immediate changes.
- System Differences: Different AI systems consume files differently. A pass on ChatGPT does not guarantee a pass on Claude or Gemini.
- Session Variability: The same prompt may produce different responses in different sessions. Run tests multiple times if results seem inconsistent.
- Training Data vs. Live Files: AI systems may respond based on training data rather than your current files. This is especially true for well-known brands.
- Not Deterministic: Unlike AI Visibility Checking (which validates file presence and format), these tests produce variable results. Use them for directional guidance, not absolute measurement.
Best Practices
- Run each test prompt 2–3 times to account for variability
- Test across multiple AI systems if possible
- Focus on trends over time rather than single-point measurements
- Use this framework alongside the 365i Checker for complete validation
Version History
Phase 6 standardisation release. Added /specifications/roadmap/ (theme-pegged forward plan with Active/Next/Future/On hold status flags), /specifications/extensions/ (rules for experimental x- prefixed files and the promotion path), and /specifications/i18n-a11y/ (multi-language publication, locale-tagged identity fields, RTL handling, accessibility of llms.html). Added the Discovery: directive to the robots-ai.txt specification (publishers MAY advertise AI Discovery Files on the same host). Added a formal media-type stance to the HTTP behaviour page (existing IANA types, no bespoke registrations). Expanded the file integrity and signing section on the security and privacy page with four candidate mechanisms, cross-cutting concerns, and interim publisher / consumer guidance. The Discovery: directive is the only normative addition to publisher behaviour; all other additions are forward-looking documentation.
Phase 5 standardisation release. Added /specifications/related-standards/ (positioning vs llmstxt.org, IETF AI Preferences, robots.txt, Schema.org, BCP 14, JSON Schema 2020-12, SemVer) and /specifications/implementations/ (public record of conformant implementations, IETF-style). Added an explicit llmstxt.org backward-compatibility statement to the llms.txt specification. Added a formal multi-domain and subdomain scoping rule to both the llms.txt and identity.json specifications (host-scoped files, cross-host identity asserted via sameAs). No normative requirements changed for existing publishers; the new scoping rules formalise behaviour the specification already implied.
Phase 4 standardisation release. Added /specifications/processing-model/ (seven-stage algorithm for conformant consumers), /specifications/consumer-guidance/ (what AI systems should do with AI Discovery Files), /specifications/test-vectors/ (canonical test suite framing), and reference-implementation framing on the AI Visibility Checker. No normative requirements changed.
Phase 3 standardisation release. Added /specifications/versioning/ (Semantic Versioning 2.0.0 commitments, deprecation timeline, lifecycle), /specifications/governance/ (proposal lifecycle, editorial process, working principles), /specifications/security-privacy/ (trust model, content-injection patterns, GDPR considerations, integrity primitives roadmap), and /specifications/http-behaviour/ (status codes, redirects, soft-404 detection, caching, rate limits). No normative requirements changed.
Phase 2 standardisation release. Added formal conformance specification (Essential / Recommended / Complete classes). Published machine-readable registry at /specifications/registry.json, spec meta-schema, and validator-output schema. Introduced versioned JSON Schema URLs (/v1/) alongside unversioned 'latest' aliases. Added optional BCP 47 language declaration field across all applicable AI Discovery Files. No normative requirements changed.
Phase 1 standardisation release. Added 'Status of This Document' block (Stable). Normalised normative requirement keywords to uppercase per RFC 2119 and RFC 8174. Added References section linking to /specifications/conventions/ and /licensing/. No normative requirements changed.
Added AI Visibility Directory registration guidance. Minor documentation update.
Initial publication. Five test categories with standard prompts, scoring methodology, and before/after comparison template.
References
- Specification Conventions — RFC 2119 + RFC 8174 requirement keywords, document statuses, anchor naming, versioning, and language conventions used across every AI Discovery File specification.
- Licensing & Trademark — CC BY 4.0 for specification text and examples, MIT for JSON Schemas, and the free-use policy on the name "AI Discovery Files".
Generate AI Discovery Files from your dashboard
Using WordPress? Install the plugin and create all 10 files in minutes — no coding, no configuration files to edit manually.
Get the PluginRegister in the AI Visibility Directory
Once your AI Discovery Files are published, register your website in the AI Visibility Directory — the verified registry of websites implementing AI Discovery Files. Registration validates your implementation and lists your site for AI systems and industry peers to discover.
Card entry in the directory with automated file validation. Open to any site with a valid llms.txt file. No cost.
Dedicated profile page on the directory with dofollow backlinks to your website — a genuine SEO authority signal from a topically relevant, verified source. Includes an attribution badge and enhanced visibility.