Turbine AI SEO Teardown: 201 Pages, Zero Schema, 123 Broken Links to One Dead Profile

15 min read
Turbine AI SEO crawl analysis showing site structure, broken links, orphaned pages, and metadata coverage

Crawl data as of February 25, 2026. Analysis powered by redCacti.


Turbine AI is building something genuinely ambitious: AI-powered virtual cell simulations to accelerate cancer drug discovery. They’ve raised over €50M, partnered with AstraZeneca, MSD, Ono Pharmaceutical, and Cancer Research UK, and published research in peer-reviewed journals.

Their web presence tells a different story. The main site has a 47-word homepage. The news section is filled with 40+ press mention stubs averaging 30 words each. There’s no schema anywhere on turbine.ai - not on the homepage, not on any of the 147 pages. And a single abandoned Bluesky profile creates 123 broken external links spread across the site.

The science is serious. The SEO infrastructure hasn’t kept up.


What We Crawled

Turbine AI runs across six distinct domains, each serving a different function:

Crawl summary:

  • turbine.ai - 147 pages (main marketing site, WordPress)
  • blog.turbine.ai - 45 pages (Substack publication)
  • documents.turbine.ai - 4 pages (PDF document viewer)
  • www.turbine.ai - 3 pages (partial www mirror)
  • benchmark.turbine.ai - 1 page (benchmark tool)
  • platform.turbine.ai - 1 page (product platform)
  • Total: 201 pages crawled

The main turbine.ai site is the heaviest, but its content is unevenly distributed. A 68-page news section and a 29-page science section carry most of the URL count. The actual content depth behind those pages is a different question.


The crawl found 134 broken links total, all returning 404. That number is striking on its own. The breakdown makes it more striking:

Turbine AI SEO crawl analysis showing 134 broken links
  • 123 of the 134 broken links point to a single URL: https://bsky.app/profile/turbine-ai.bsky.social
  • 132 out of 134 are external links
  • 2 are internal broken links

Turbine at some point added a Bluesky social link to a sitewide element - likely the footer or a social sharing block - and that profile no longer exists. Because it’s in a template, it now appears on page after page: news articles, science pages, blog posts. Every page that uses that template inherits the dead link.

This is the canonical broken link antipattern: one bad template link multiplied across an entire site.

The remaining external broken links include:

  • A Ginkgo Bioworks blog post (ginkgobioworks.com/blog/) that no longer exists
  • A Substack commenter profile linked from a blog post

The 2 internal broken links:

  • turbine.ai/news/turbine-announces-new-appointments-to-its-board-of-directors/ links to turbine.ai/news/mmv_ind_announcement_2023 - a page that returns 404
  • turbine.ai/reengineered/ links to turbine.ai/news/turbine-achieves-key-milestone-with-ono - another 404

How to fix it: Remove or update the Bluesky social link from the template. That single change eliminates 123 broken links at once. Then redirect or remove the two dead internal news pages, or update the linking pages to point to valid URLs.


Section 2: Zero Schema Across 147 Pages

The crawl found no schema markup on any page across the entire turbine.ai domain.

Not on the homepage. Not on the news articles. Not on the science section with 29 research pages. Not on the approach page, the benchmark page, or the press kit. Zero structured data, zero schema types, across all 147 pages.

For a company doing cutting-edge science and announcing major pharma partnerships, this is a significant missed opportunity:

  • Organization schema on the homepage would give Google a clean, authoritative signal about who Turbine is - name, URL, logo, social profiles, funding stage. This matters for knowledge graph inclusion and brand queries.
  • NewsArticle schema belongs on every /news/ page. Turbine has 68 news pages covering funding rounds, partnerships, and industry recognitions. NewsArticle schema is what enables rich snippets in Google News and structured treatment in search results.
  • Article or TechArticle schema belongs on the research blog posts. Pages like “Assessing a Virtual Cell’s Utility” (3,023 words) and “How did we get a regression model to the top of the Virtual Cell Challenge” (1,502 words) are substantive content that would benefit from structured markup.
  • FAQPage schema would work on the approach page or any page structured around questions about virtual cell technology.
  • SoftwareApplication schema on the platform would help Google understand the product.

The Substack blog at blog.turbine.ai has schema - Substack adds it automatically. That’s 34 out of 45 blog pages with NewsArticle or WebSite schema. But that’s the platform doing the work, not Turbine. On the domain they control directly, there’s nothing.

Takeaway: Organization schema on the homepage and NewsArticle schema on news articles are the highest-priority additions. Both are template-level changes in WordPress - add them once, they apply everywhere.


Section 3: 115 Orphaned Pages - No Internal Linking Strategy

The crawl found 115 orphaned pages, all on turbine.ai. Every single one of them has zero incoming internal links from the rest of the site.

Turbine AI SEO crawl analysis showing 115 orphaned pages

That’s 78% of the 147-page turbine.ai crawl with no internal links pointing to it.

What’s in those 115 pages? The breakdown is telling:

  • Most of the 68 news pages are orphaned
  • Most of the 29 science pages are orphaned
  • Multiple substantive blog posts on the root domain are orphaned
  • Tag pages, category pages, and author pages - all orphaned

Orphaned pages have two compounding problems. First, they receive no PageRank from internal links, which weakens their ability to rank. Second, they’re harder for both users and crawlers to discover - a page that nothing links to is effectively invisible unless you know the direct URL.

For Turbine’s science content specifically, this is a significant waste. A page like “In defense of RNASeq” (1,086 words) or “The Patient Puzzle” (996 words) contains the kind of original, technical content that could rank for niche scientific queries. If nothing on the site links to it, it’s competing blind.

The structural fix: Turbine needs an internal linking strategy. The homepage should link to key content sections. News pages should cross-link to related announcements. Science pages should link to related research and the approach page. A well-linked site concentrates authority where it matters.


Section 4: The News Section - 68 Pages Averaging 30 Words

Turbine has 68 pages under /news/. On paper, that’s a large content section. In practice, the median word count tells a different story.

A significant portion of these pages are press mention stubs - pages that acknowledge a media mention with a headline and 25–40 words of context before linking out to the original article. Here’s what that looks like:

News PageWord Count
Turbine Announces €20M Series A29 words
Cancer Research UK Partnership29 words
Roche Named Turbine Disruptive31 words
Forbes 30 Under 3030 words
Johnson & Johnson Top AI Startups35 words
MIT Technology Review Innovator28 words
Turbine is the Startup of the Year27 words

These pages are indexed, they occupy crawl budget, and they almost certainly contribute nothing to organic search. A page with 28 words and a link to the original article is not competing for any query.

Contrast this with the substantive news pages - the Accenture investment announcement (954 words), the Series A announcement (1,039 words), the Upsized Series A (1,388 words), the AstraZeneca collaboration (385 words). These are real content assets that could rank for brand and funding queries if they were internally linked and had schema.

Two paths forward for the thin news pages:

  1. Expand them: Each press mention could become a 300–500 word post explaining the significance of the coverage and what it means for Turbine’s mission. One time investment per page.
  2. Consolidate them: Group press mentions by year or theme into single roundup pages. Fewer, stronger pages beat a larger count of thin ones.

Neither approach is wrong. The wrong approach is the current one: dozens of thin pages competing for nothing.


Section 5: The Homepage - 47 Words

The turbine.ai homepage has 47 words of crawlable text.

For context: this is a B2B SaaS platform targeting pharmaceutical companies and research institutions. The buying decision is complex, the audience is sophisticated, and the stakes (oncology drug development) are high.

The homepage metadata is technically in place: title tag, meta description, OG tags, Twitter card, canonical. But with 47 words of indexable content, there’s almost nothing for search engines to evaluate. The homepage is essentially invisible for any informational or navigational query beyond the brand name itself.

What the homepage is missing:

  • A clear articulation of the platform’s mechanism and differentiators in crawlable text
  • Descriptions of the use cases (ADC design, target discovery, combination therapy)
  • Any signals that would help Google rank the page for queries like “virtual cell simulation platform” or “AI drug discovery software”

Benchmark context: Homepage word counts below 100 are unusual even for minimalist SaaS sites. Many effective homepage designs carry 400–800 words of crawlable copy while maintaining a clean visual experience. The issue isn’t minimal design - it’s that the text rendering doesn’t reach crawlers.

Image alt coverage on the homepage is also low at 34.5% - 38 of 58 homepage images have no alt text. This affects accessibility and removes descriptive signals from images that are likely central to communicating what the platform does.


Section 6: Duplicate URLs and Alias Pairs

The crawl found 6 URL pairs where one URL has a canonical pointing to another within the same domain. These are alias pages - different URLs that serve the same content, with one designated as the canonical version:

Alias URLCanonical
/adc-payload/adc-payload-selector/
/target-selection/target-discovery-de-risking/
/scientific/science/
/news/accenture-invests-in-turbine/news/accenture-invests-in-turbine-to-accelerate-use-of.../
/announcements/turbine-announces-collaboration-.../news/turbine-announces-collaboration-...-1/
/news/ai-powered-cancer-cell-simulations-.../news/ai-powered-cancer-cell-simulations-...-1/

The canonical tags are doing their job here - Google should follow them to the correct version. But the alias URLs still exist as accessible pages, which means they can still accumulate links and send confused signals.

Additionally, www.turbine.ai has 3 pages, two of which have canonicals pointing to the non-www version. The third - www.turbine.ai/demo - has no canonical at all. That’s a small gap worth closing.

Fix: The /adc-payload, /target-selection, and /scientific slugs are short aliases for longer canonical URLs. These should ideally 301 redirect to the canonical URL rather than serving content with a canonical tag. A redirect is a stronger signal than a canonical tag and prevents link equity from splitting.


Section 7: What Turbine Gets Right

This teardown has focused on gaps. A few things stand out positively:

Substantive long-form content on the main domain. Turbine publishes real research content directly on turbine.ai: “Assessing a Virtual Cell’s Utility” (3,023 words), “So How Do You Benchmark Biology?” (1,739 words), “What Can Virtual Cells Do For You Today?” (1,752 words), “How Did We Get a Regression Model to the Top of the Virtual Cell Challenge?” (1,502 words). These are the kind of original, expert-level posts that can genuinely rank for niche scientific queries. They exist. They just need to be linked and structured.

The Substack blog has solid metadata. blog.turbine.ai has OG tags on 44/45 pages and schema on 34/45. That’s Substack doing the work, but the result is a well-structured content section.

No sitewide crawl errors. Aside from the broken links, the site returns 200 OK across 197 of 201 pages. The 2 pages returning 404 are internal pages linked from within the site - fixable issues, not systemic infrastructure problems.

Canonical strategy is partially in place. Most turbine.ai pages have self-canonicals. The www.turbine.ai redirects appropriately for 2 out of 3 pages. The alias canonical pairs are correctly pointing to preferred versions.

The foundation isn’t broken. The issues are about structure, linkage, and schema - not about a site that’s technically failing.


How Turbine Can Think About Optimization

Quick Wins

  • Remove or update the dead Bluesky link from the sitewide template - eliminates 123 broken links in one change
  • Add Organization schema to the homepage - 30-minute implementation, permanent signal
  • Add NewsArticle schema to all /news/ pages - one WordPress template change covers 68 pages
  • Add canonical tag to www.turbine.ai/demo
  • 301 redirect /adc-payload, /target-selection, and /scientific to their canonical targets

Next Phase

  • Build internal linking across news, science, and blog content - start with the homepage linking to key sections
  • Add Article or TechArticle schema to long-form research posts
  • Fix homepage image alt coverage (currently 34.5% - 38 images without alt text)
  • Decide on a strategy for the 40+ thin news stub pages: expand or consolidate

Final Action Items

  • Evaluate whether thin press mention pages can be expanded into meaningful posts with 300+ words of original context
  • Audit orphaned pages to identify which have organic ranking potential and prioritize internal linking to those first
  • Add descriptive copy to the homepage - 47 words isn’t enough for Google to understand what the platform does
  • Fix image alt text on blog.turbine.ai (currently 57.5% coverage - 117 images without alt text)

Key Takeaways

  • 201 pages across 6 subdomains - a distributed architecture that needs a coordinated SEO strategy, not isolated fixes
  • 123 of 134 broken links trace to a single abandoned Bluesky profile in a sitewide template - one removal fixes 92% of the broken link problem
  • Zero schema on turbine.ai - no Organization, no NewsArticle, no Article markup on 147 pages
  • 115 orphaned pages - 78% of the main site has no incoming internal links, including pages with substantive long-form content
  • 68 news pages, most under 50 words - a large content section with little search value in its current form
  • 47-word homepage - essentially invisible to search engines for anything beyond the brand name
  • Strong long-form content exists - multiple 1,000+ word research posts that could rank for niche queries if linked and structured

Turbine is doing serious science. The content assets are there - the long-form posts, the research pages, the partnership announcements. What’s missing is the structural layer that connects them: internal links, schema, and enough homepage copy for Google to understand what the platform actually does. None of this requires rebuilding. It requires a systematic pass through the infrastructure they’ve already built.


Want to run a crawl like this on your own site? Try redCacti →

Newsletter

Weekly SEO teardowns

Internal linking, broken links & orphan pages — straight to your inbox, every week.

Subscribe free

redCacti Team

The team behind redCacti - helping websites improve their SEO through better internal linking.

Related Posts