How to Find Broken External Links That Hurt Your SEO
External links break quietly and constantly. You publish an article linking to a research paper, a product announcement, or a news story. Six months later, the target site restructures, goes offline, or paywalls the content. Your link now leads nowhere.
Unlike broken internal links - which interrupt PageRank flow and create dead ends in your own site’s crawl graph - broken external links are less directly damaging to your rankings. But they are not harmless. They waste crawl budget, they erode reader trust when a cited source leads to a 404, and at scale they are a reliable signal of poor content maintenance.
This guide covers how to find them efficiently and decide what to do about each one.
What Counts as a Broken External Link
An external link is any link on your site that points to a different domain. A broken external link is one where the destination returns an error status when checked:
- 404 Not Found - the specific page no longer exists at that URL
- 410 Gone - the page has been permanently removed
- 500 Server Error - the target server is failing (may be temporary)
- DNS failure - the entire domain no longer resolves
- SSL/TLS error - the target site has a certificate problem
- Timeout - the target server took too long to respond
Note: some sites block automated requests and return 403 Forbidden for crawler checks even when the page is live. These need manual verification before being treated as broken.
How Broken External Links Affect SEO
The impact is different from broken internal links and worth understanding precisely.
Crawl budget: When Googlebot crawls your pages, it follows both internal and external links. External links that return errors add to the number of failed requests in a crawl session. On large sites with many external links this compounds, though for most sites the impact is modest compared to broken internal links.
User experience and bounce signals: A reader clicking a cited source and hitting a 404 is a bad experience. If the linked resource was part of your argument or was the primary reason a user clicked a link, the broken destination creates frustration. This shows up as engagement signals.
Content credibility: For content that cites research, data, or primary sources, broken citations undermine credibility with the reader. In content-heavy niches like health, finance, or technical writing, outdated or dead external links are a known content quality signal.
Link rot over time: Studies on link longevity consistently find that roughly half of all external links on the web break within 5-7 years. Long-form content with many external citations will reliably accumulate broken links over time without active maintenance.
Step 1: Run a Crawl That Checks External Links
Most crawlers check internal links by default and either skip or do not verify external links. You need to specifically enable external link checking.
Using redCacti:
Register a free account, add your site, and run a crawl. The broken links report includes external links alongside internal ones, with the source page and error type for each.
Using Screaming Frog:
By default, Screaming Frog crawls internal links and lists external links but does not check their status. To verify external links:
- Configuration -> Spider -> Crawl External Links (tick this box)
- Run the crawl
- Filter the Links tab by “External” and sort by Status Code
Note: checking external links significantly increases crawl time on sites with many outbound links. Consider running this as a separate audit from your standard internal link check.
Using Google Search Console:
GSC does not check external links you point outward - it only reports on links pointing inward to your site. You need a crawler for external link audits.
Using command line tools:
# linkinator checks both internal and external links
npx linkinator https://yoursite.com --recurse
# Filter to only show external link errors
npx linkinator https://yoursite.com --recurse --skip "yoursite.com"
Step 2: Triage Broken External Links by Impact
After the crawl, you will likely have a list of varying length. Before editing anything, sort the list by priority.
Fix first - high traffic pages: A broken external link on a page that receives 5,000 visits per month is worth fixing quickly. Use Google Analytics or GSC Performance data to identify which pages with broken external links are your highest-traffic pages.
Fix second - cited sources and footnotes: Links that are explicitly cited as evidence or sources carry more credibility weight than decorative links. A broken citation in a research-backed article is more harmful than a broken link in a roundup sidebar.
Fix third - links on cornerstone content: Your pillar posts, comparison pages, and evergreen guides tend to stay live longer and attract more links. Keep these maintained.
Low priority - links on old, low-traffic posts: A broken external link on a 4-year-old post with 12 monthly visitors is not worth the same urgency. Batch-fix these monthly rather than treating each one as an immediate task.
Special case - check before removing: Some external link errors are false positives. Sites with aggressive bot protection (Cloudflare, Akamai, etc.) return 403 or timeout errors to automated crawlers even when the page is live. Before removing or replacing a link, open the URL manually in a browser to confirm it is actually broken.
Step 3: Find the Correct Replacement
When a linked resource is gone, check two things before deciding the link is dead:
Check the Wayback Machine:
Paste the broken URL into web.archive.org. If the content was archived, you can link directly to the archive version (https://web.archive.org/web/[date]/[url]). This preserves your citation even though the original is gone, and the archived version is permanent.
Check if the domain has moved:
Sometimes entire sites migrate to new domains. A site that was at old-research.com may now be at institute.org. Search for the title of the page or a key phrase from it to find whether the content exists at a new location.
Check for alternative sources: If the original source is genuinely gone and not archived, find an equivalent source that covers the same information. For statistics or data points, the original study may have been superseded by a more recent one anyway - use the newer source.
Step 4: Update or Remove the Broken External Links
For each broken external link after triage:
If a replacement URL exists: Update the link to point to the working URL. If you found a Wayback Machine archive, link to that. If you found the content at a new domain, update to the new URL.
If no replacement exists: Remove the link. This means either:
- Unlinking the text (keeping the text but removing the hyperlink)
- Rewriting the surrounding sentence if the link was integral to the claim
- Removing the reference entirely if it was a citation with no equivalent
If the link was to a tool or product that no longer exists: Update the surrounding content to reflect current alternatives. An article from 2019 citing a now-defunct tool is a content maintenance issue, not just a link issue.
Batch editing workflow for teams:
If multiple people manage content, the most efficient approach is:
- Export the broken external links report to CSV
- Assign rows by source page owner
- Set a one-week deadline for each person to review and fix their pages
- Re-run the crawl after the deadline to verify
Step 5: Set Up Monitoring for External Link Breakage
External links break on schedules you cannot control. A source you linked to last week could be gone next month. The only way to stay ahead of this is automated monitoring.
With redCacti, weekly crawls check both internal and external links and flag new breakage since the last scan. This means newly broken external links appear in your report within a week of breaking - not six months later when someone finally notices.
Set a lightweight monthly triage habit: 20 minutes reviewing the external link portion of your broken links report, prioritising by page traffic.
A Note on Linking Practices That Reduce Future Breakage
A few habits reduce the rate at which your external links break over time:
Link to stable, authoritative sources. A government dataset or academic journal article is far less likely to move or disappear than a startup blog post or a tweet. When choosing between a primary source and a secondary commentary on that source, link to the primary.
Link to canonical resource pages rather than specific articles. Linking to a company’s main resources page rather than a specific blog post means the link stays valid even if the post gets reorganised.
Archive important citations. For any external link that is central to a factual claim in your content, save a copy at archive.org or use a service like Perma.cc (designed specifically for academic and legal citations). If the original goes down, your cited source remains accessible.
Summary Checklist
- Run full-site crawl with external link checking enabled
- Export broken external links report
- Sort by page traffic (highest priority first)
- Manually verify 403/timeout errors in browser before treating as broken
- For each broken link: check Wayback Machine, then search for content at new URL
- Update links to working URLs where replacements exist
- Remove or rewrite links where no replacement exists
- Set up weekly automated monitoring for future breakage
- Schedule monthly 20-minute external link triage
External link rot is a slow, quiet problem on most sites. It rarely causes sudden traffic drops the way a broken internal link structure might. But left unmanaged for a few years, it accumulates into a maintenance problem that takes real effort to unpick.
A simple automated monitoring setup and a monthly triage habit keeps it from ever getting to that point.
Check your site for broken external links ->
The free audit covers both internal and external broken links - no account required.
Also in this series: How to Find Broken Links on Your Website - How to Automatically Monitor for Broken Links
Newsletter
Weekly SEO teardowns
Internal linking, broken links & orphan pages — straight to your inbox, every week.