GEO Research: Princeton Study & Optimization Methods
Paper Overview
Title: GEO: Generative Engine Optimization Authors: Princeton University, IIT Delhi, Georgia Tech, Allen Institute for AI Published: November 2023 (arXiv:2311.09735) Accepted: KDD 2024 (30th ACM SIGKDD Conference) Link: https://arxiv.org/abs/2311.09735
Key Findings
What is GEO?
Generative Engine Optimization (GEO) is a novel framework to help content creators improve their visibility in generative engine responses (AI search engines like ChatGPT, Perplexity, Google SGE).
Unlike traditional SEO (ranking in search results), GEO focuses on being cited by AI systems.
Core Results
| Metric | Result |
|---|---|
| Maximum Visibility Boost | Up to 40% |
| Low-ranking Sites Boost | Up to 115% with citations |
| GEO-bench Dataset | 10,000+ queries across domains |
| Tested on | Perplexity.ai (commercial GE) |
Why GEO Matters
- Traditional search engines are being replaced by generative engines
- Content creators have little control over how their content appears in AI responses
- GEO provides systematic methods to improve visibility
- Different domains need different strategies - no one-size-fits-all
The 9 Optimization Methods
Method 1: Cite Sources (+40%)
What: Add authoritative citations and references to your content.
Why it works: AI systems prefer content that appears well-researched and credible. Citations signal authority.
How to apply:
# Before
"Studies show that AI improves productivity."
# After
"According to a 2024 Stanford University study, AI tools improve developer
productivity by 55% on average (Chen et al., 2024)."Best for: All content types, especially academic/professional topics.
Method 2: Statistics Addition (+37%)
What: Include specific numbers, data points, and quantitative information.
Why it works: AI systems prioritize factual, verifiable information. Numbers make content more quotable.
How to apply:
# Before
"Many companies use AI for customer service."
# After
"67% of Fortune 500 companies now use AI chatbots for customer service,
handling an average of 85% of routine inquiries without human intervention."Best for: Business, finance, technology, health content.
Method 3: Quotation Addition (+30%)
What: Add expert quotes with proper attribution.
Why it works: Quotes from recognized experts boost credibility and provide extractable content for AI.
How to apply:
# Before
"AI will transform the workforce."
# After
"'We're likely to see the first one-person billion-dollar company in the
next few years,' predicts Sam Altman, CEO of OpenAI. 'AI will be the
great equalizer for small businesses.'"Best for: Law, academic, news, thought leadership content.
Method 4: Authoritative Tone (+25%)
What: Write with confidence and expertise.
Why it works: AI systems assess content quality partly through linguistic signals of authority.
How to apply:
# Before
"This might help with SEO, I think."
# After
"This strategy demonstrably improves SEO performance. Based on our
analysis of 10,000 websites, implementing structured data increases
organic traffic by an average of 30%."Best for: Business, professional services, technical documentation.
Method 5: Easy-to-Understand (+20%)
What: Simplify complex concepts for broader accessibility.
Why it works: AI aims to provide helpful answers to users of all knowledge levels.
How to apply:
# Before
"The RAG architecture utilizes vector embeddings for semantic retrieval
in conjunction with LLM-based generation."
# After
"RAG (Retrieval-Augmented Generation) works like a research assistant:
it first searches for relevant information, then uses AI to write a
coherent answer based on what it found."Best for: Health, education, general consumer content.
Method 6: Technical Terms (+18%)
What: Include domain-specific terminology appropriately.
Why it works: Technical terms signal expertise and help AI match content to specialized queries.
How to apply:
# Before
"The website loads slowly."
# After
"The website suffers from poor Core Web Vitals: LCP (Largest Contentful
Paint) exceeds 4 seconds, and CLS (Cumulative Layout Shift) scores 0.3,
indicating significant layout instability."Best for: Technology, science, medical, legal content.
Method 7: Unique Words (+15%)
What: Increase vocabulary diversity and use distinctive phrasing.
Why it works: Diverse vocabulary indicates depth of knowledge and makes content more distinguishable.
How to apply:
- Use synonyms and varied terminology
- Avoid repetitive phrasing
- Include industry-specific jargon where appropriate
- Add contextual variations
Best for: All content types.
Method 8: Fluency Optimization (+15-30%)
What: Improve readability, flow, and grammatical quality.
Why it works: Well-written content is easier for AI to parse and more likely to be selected as authoritative.
How to apply:
- Use clear sentence structure
- Maintain logical flow between paragraphs
- Eliminate redundancy
- Use transition words
- Keep paragraphs focused (2-3 sentences)
Best for: All content types.
Method 9: Keyword Stuffing (-10%) ⚠️
What: Overloading content with target keywords.
Why it HURTS: Unlike traditional SEO, keyword stuffing actively decreases AI visibility.
Avoid:
# BAD - Keyword stuffing
"SEO optimization for SEO is the best SEO strategy. Our SEO experts
provide SEO services for all your SEO needs. SEO is important for SEO."
# GOOD - Natural writing
"Search engine optimization is essential for online visibility. Our
experts help businesses improve their search rankings through strategic
content development and technical improvements."Best Combinations
The Princeton research found that combining methods yields better results:
| Combination | Effectiveness |
|---|---|
| Fluency + Statistics | Highest overall boost |
| Citations + Authoritative Tone | Best for professional content |
| Easy Language + Statistics | Best for consumer content |
| Technical Terms + Citations | Best for academic/scientific |
Domain-Specific Recommendations
| Domain | Best Methods | Avoid |
|---|---|---|
| Technology | Technical Terms, Citations, Statistics | Oversimplification |
| Business/Finance | Statistics, Authoritative Tone, Citations | Vague claims |
| Healthcare | Easy Language, Statistics, Citations | Jargon overload |
| Legal | Citations, Quotations, Authoritative Tone | Informal language |
| Education | Easy Language, Examples, Structure | Complexity |
| E-commerce | Statistics, Social Proof, Clear Benefits | Feature dumps |
GEO Metrics
The paper introduces visibility metrics for evaluating GEO effectiveness:
Position-Adjusted Word Count
Measures how much of your content appears in AI responses, weighted by position (earlier = better).
Subjective Impression Score
Human evaluation of how prominently your content is featured in responses.
Word Count Share
Percentage of AI response that comes from your content vs. competitors.
Implementation Checklist
Content Optimization
- [ ] Add 2-3 authoritative citations per major section
- [ ] Include at least 5 relevant statistics with sources
- [ ] Add 1-2 expert quotes with attribution
- [ ] Use confident, authoritative language
- [ ] Ensure content is accessible to general audience
- [ ] Include appropriate technical terminology
- [ ] Vary vocabulary throughout
- [ ] Improve overall fluency and readability
- [ ] Remove any keyword stuffing
Structure Optimization
- [ ] Use "answer-first" format (direct answer at top)
- [ ] Clear H1 > H2 > H3 hierarchy
- [ ] Bullet points for lists
- [ ] Tables for comparisons
- [ ] Short paragraphs (2-3 sentences)
- [ ] Logical flow between sections
Schema Optimization
- [ ] Implement FAQPage schema (+40% AI visibility)
- [ ] Add Article schema with author info
- [ ] Include datePublished and dateModified
- [ ] Add SpeakableSpecification for voice search
Validation on Commercial Platforms
The researchers validated GEO methods on Perplexity.ai, a commercial generative engine:
| Method | Visibility Increase |
|---|---|
| Cite Sources | Up to 37% |
| Statistics | Up to 35% |
| Quotations | Up to 28% |
| Combined methods | Up to 40% |
These results confirm that GEO methods work on real-world AI search engines, not just research benchmarks.
Future of GEO
The paper concludes that GEO is essential for:
- Content creators - Maintain visibility in AI-dominated search
- Businesses - Ensure brand presence in AI responses
- Publishers - Adapt to the citation economy
- SEO professionals - Evolve practices for generative search
As AI search engines become more prevalent, GEO will become as important as traditional SEO.