We check if AI can find you

35.7% of the top 1,000 websites accidentally block AI crawlers. We analyze 16 factors across 4 categories to find what's working and what's not.

Step 1

AI Visibility

Can AI platforms even access your site? We check your robots.txt to see which AI bots you're allowing — and which you're accidentally blocking.

What we check:

  • Search bot access — Can GPTBot, PerplexityBot, and other AI search engines reach your content?
  • Training consent — Your choice on whether to allow AI model training (blocking is valid)
  • Site speed — Slow sites get skipped. We check if you're fast enough.

AI-friendly robots.txt

User-agent: GPTBot
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: PerplexityBot
Allow: /

Step 2

Content Structure

AI systems extract content in chunks. If your paragraphs depend on surrounding context to make sense, AI quotes you poorly — or not at all.

What we check:

  • Island Test — Are paragraphs self-contained? Starting with "it" or "this" hurts extraction.
  • Data density — Tables and lists are easier for AI to parse than prose.
  • Server-side content — Most AI crawlers can't run JavaScript.
  • Semantic HTML — Proper tags help AI understand structure.

❌ Context-dependent

"It was founded in 2020. This approach is recommended."

✅ Self-contained

"OpenAI was founded in 2020. Schema markup is recommended."

Step 3

Technical Infrastructure

The unsexy stuff that makes everything work. Sitemaps, security certificates, and platform-specific requirements that determine if crawlers find your pages.

What we check:

  • Sitemap — Your roadmap for crawlers to discover all pages.
  • HTTPS — Insecure sites may be skipped entirely.
  • Bing access — Microsoft Copilot uses Bing's index.
  • llms.txt — Emerging standard for AI-specific guidance.
  • Twitter Cards — Required for Grok visibility on X.

Technical requirements

/sitemap.xml present
HTTPS enabled
Bingbot allowed
/llms.txt (optional)

Step 4

Structured Data

Schema markup tells AI exactly what your content means. Without it, AI has to guess — and guesses lead to being ignored or misquoted.

What we check:

  • FAQ Schema — Marks Q&A pairs for easier AI extraction and citation.
  • HowTo Schema — Perfect for tutorials and guides.
  • Schema quality — Invalid JSON-LD hurts more than helps.
  • Freshness signals — AI prefers citing recent content.

FAQ Schema example

{
  "@type": "FAQPage",
  "mainEntity": [{
    "@type": "Question",
    "name": "How do I improve AI visibility?",
    "acceptedAnswer": {
      "@type": "Answer",
      "text": "Allow AI bots, add schema markup, write self-contained content."
    }
  }]
}

See where you stand

Get your AI visibility score in 30 seconds. We'll tell you exactly what to fix.

Check Your Site - Free
GEO v2 Methodology • Updated February 2026