What actually gets you cited in Perplexity AI
We analyzed crawling patterns, referral traffic, and optimization data from 539 WordPress customer websites. Perplexity AI cited content from some sites within days of launch. Others were crawled regularly and never cited once. The gap comes down to three factors: schema quality, answer-first structure, and content depth. This guide covers how to improve all three on a WordPress site.
How Perplexity AI reads your WordPress site
Perplexity is an answer engine. When a user asks a question, Perplexity does two things simultaneously: it draws on its trained knowledge and it browses the live web in real time. The live browsing part is done by PerplexityBot and PerplexityBot-User � crawlers that fetch pages on demand when a query matches your content.
This is different from how Google works. Google pre-indexes your pages and queries that index later. Perplexity crawls some pages live, in the moment a user is asking. That means your site needs to be fast, crawlable, and immediately readable by a machine � not just keyword-optimized.
In our dataset, AI-related bots now generate more crawl traffic than traditional search engines:
| Bot Category | Share of crawls |
|---|---|
| AI Training bots (GPTBot, ClaudeBot, Amazonbot) | 37.1% |
| Traditional search engines (Googlebot, Bingbot) | 35.7% |
| AI On-Demand (PerplexityBot-User, OAI-SearchBot, ChatGPT-User) | 7.1% |
| SEO tools | 14.0% |
| Social media | 6.0% |
The 7.1% "AI On Demand" category is the most commercially valuable. These bots fire when a user asks a live question. OAI-SearchBot was detected on 457 of our 539 sites. PerplexityBot-User operates the same way for Perplexity queries.
The citation gap: why most crawled sites never get recommended
Of the 539 sites in our dataset, 88.5% were being crawled by at least one AI bot. Only 34.9% received any AI referral traffic back.
That 53.6% gap � crawled but never cited � is where Perplexity AI optimization actually lives.
| Stage | Sites | % of Total |
|---|---|---|
| Total sites analyzed | 539 | 100% |
| Crawled by any AI bot | 477 | 88.5% |
| Received AI referral traffic | 188 | 34.9% |
| Crawled but never cited | 289 | 53.6% |
The sites stuck in the "crawled but never cited" column are not being penalized � they are simply not being chosen. Perplexity reads their content and passes them over in favor of a page that answered more directly, had cleaner structured data, or covered the topic in more depth.
Getting crawled is essentially automatic. 86% of new sites in our data were crawled by an AI bot on their first day live. Getting cited requires deliberate action.
Five optimizations that move the needle for Perplexity
1. Fix your schema markup first
Perplexity uses structured data to confirm what your page is about before deciding whether to cite it. When two pages cover the same topic and one has clean JSON-LD schema, the structured page has an interpretive advantage � the AI gets machine-readable confirmation instead of having to guess from raw HTML.
In our data, sites that went through structured data optimization improved from an average schema quality score of 29 out of 100 to 71 out of 100. That improvement correlated directly with increased AI citation frequency.
For a WordPress site, the highest-priority schema types for Perplexity are:
- Organization � Confirms your business name, URL, description, and contact info. Perplexity uses this when generating business recommendations.
- Article / BlogPosting � Confirms the author, publication date, and topic of each page. Critical for news and informational content.
- FAQPage � Directly maps questions to answers in a format Perplexity can extract and display verbatim.
- Service / Product � Required if your site promotes specific services or products. Perplexity will often cite service pages for transactional queries.
Use the LovedByAI GEO Checker to run a free scan of your schema implementation before manually auditing. It flags missing schema types and shows exactly which properties are incomplete.
2. Write answer-first (BLUF format)
The single strongest content change you can make is putting the answer at the top of every section � not at the bottom.
Perplexity scans pages looking for content it can confidently extract a response from. If your page buries the main point after 500 words of background, a competitor who leads with the answer will win that citation. The AI is not reading for enjoyment. It is looking for extractable facts.
The pattern that gets cited:
H2: How much does it cost to rank in Perplexity AI? There is no paid placement in Perplexity AI. Citations are earned by producing the most relevant, clearly structured content for a given query. [Supporting detail, examples, nuance follow]
The pattern that gets passed over:
H2: Understanding the Cost of AI Search Visibility The digital landscape is evolving rapidly. Search engines have transformed the way we find information. Perplexity AI is one example of this change... [Answer buried after several paragraphs]
Rewrite your top 10 pages for BLUF structure. Check the Google Search Console queries report to find which questions are sending impressions but not clicks � those are your highest-leverage pages to fix first. For lovedby.ai, the query "how to optimize a website for perplexity ai" drives 120 impressions at no clicks because the page currently buries the answer.
3. Add FAQ sections with FAQPage schema
FAQ sections are the highest-return content format for AI citation. The reason: every FAQ item is a pre-packaged question-answer pair. Perplexity can pull the question, pull the answer, and cite your page as the source � without needing to parse paragraphs of surrounding context.
FAQPage schema in JSON-LD makes this even easier by giving Perplexity a machine-readable version of every FAQ on your page. Add five to eight questions at the end of any service or guide page covering the most common questions your customers ask.
The questions should match real search queries. Pull them from:
- Your own customer support inbox
- The "People Also Ask" section in Google for your target keyword
- Perplexity itself � ask it your topic and read the follow-up questions it generates
4. Build content depth on a focused niche
The median site receiving AI referral traffic had 217 indexed pages. The median site receiving no AI traffic had 52 indexed pages � a 4� gap.
This does not mean publish 200 thin pages. It means that AI systems match user queries against every page on your site. A site with 200 specific pages about local plumbing services has 200 chances to match someone asking Perplexity "who does pipe relining in [city]?" A site with 5 generic service pages has 5 chances.
For WordPress sites specifically:
- Publish one detailed page per service type, per location you serve, and per question your customers commonly ask
- Each page should fully answer one specific question � not try to cover everything
- Use internal links between related pages so Perplexity can navigate from a general topic to a specific one
One site in our dataset � a personal finance blog with only 229 indexed pages � received 503 ChatGPT referral visits by owning a single niche (ISA savings rates) with exceptional depth. Volume creates surface area. Depth creates authority.
5. Optimize meta titles for AI query matching
Perplexity uses meta titles as a signal for relevance when deciding which pages to fetch in response to a query. In our dataset, pages with optimized meta titles scored 9 out of 10 on average versus 5 out of 10 for unoptimized pages.
A strong meta title for AI citation:
- Starts with the primary question or topic, not your brand name
- Matches the natural language pattern of a question someone would ask Perplexity
- Is under 60 characters so it is not truncated in standard SERP display
Instead of: LovedByAI | GEO Optimization for WordPress
Use: WordPress GEO Plugin for ChatGPT & Perplexity Citations
Instead of: Our Perplexity AI Services
Use: How to Rank in Perplexity AI: WordPress Optimization Guide
How to check if Perplexity is actively crawling your site
The most direct method is reviewing your server access logs. Look for these user agent strings:
PerplexityBot� the main crawler (pre-indexes content)PerplexityBot-User� the real-time crawler (fires when a user asks a question)
If you run WordPress with Apache or Nginx, you can search your access log with:
grep "Perplexity" /var/log/apache2/access.log | tail -50
If you use LovedByAI, the crawl monitoring dashboard shows all AI bot activity with timestamps and crawled URLs � no log access required.
One thing to confirm immediately: make sure your robots.txt is not blocking Perplexity. A wildcard allow rule (allow: / for all user-agent: *) is the safest configuration. Never add individual disallow rules for paths you want AI to access.
Common WordPress mistakes that block Perplexity citations
Stale XML sitemaps. If your sitemap is not pinged automatically on publish, Perplexity may not know new pages exist. Configure your WordPress sitemap to auto-ping on every post publication.
Slow TTFB. Perplexity fetches pages in real time. A Time to First Byte above 1 second risks timeout on real-time crawls. Use a caching plugin and a CDN. Eliminate render-blocking Javascript on content pages.
Thin content on key pages. Service pages that list what you do in 100 words are invisible to AI. Rewrite your most important pages to answer the 5"�8 questions a potential client would ask before hiring you.
Missing or broken Organization schema. This is the single most common schema error in our dataset. Without a clean Organization block telling Perplexity your business name, location, and description, you are invisible for any branded or local query.
Broken internal links. Perplexity follows internal links when crawling. A broken link stops the crawl path. Run a link audit monthly and fix any 404s that appear in your crawl reports.
How LovedByAI handles Perplexity optimization automatically
LovedByAI is built specifically for AI search engines. When you install it on WordPress:
- It auto-injects nested JSON-LD for Organization, Article, FAQPage, HowTo, and Service schema types
- It generates an
llms.txtfile � a machine-readable summary of your site that Perplexity and other AI crawlers check before parsing individual pages - It rewrites page headings to match natural-language query patterns
- Its crawl monitoring dashboard tracks PerplexityBot activity in real time, showing which pages are being fetched and how often
The free GEO Checker runs a full Perplexity-readiness scan in about 30 seconds � no signup required. It shows your current schema score, content structure issues, and the three highest-impact fixes for your specific site.

