How RankForge calculates the Structural Health Score

The Structural Health Score is a single 0–100 number summarising how well your site's architecture routes discovery and authority to the pages that matter. This article explains the philosophy behind it — what it measures, why it's a weighted composite of visible dimensions rather than a black box, what good and bad scores look like, and just as importantly what it deliberately is not. We won't publish the exact weights (they're tuned and they evolve); we will be completely transparent about how the number is constructed and why.

link

Run the Internal Link Checker on your site — free, no account.

Get your structural score free

Why reduce structure to one number at all

A structural audit produces dozens of metrics: orphan count, depth distribution, cluster cohesion, authority concentration, anchor quality, cannibalization. Handed all of them at once, most people can't tell whether their site is in good shape or bad, or whether last month's work helped. A single composite score solves two real problems: it gives a defensible at-a-glance verdict, and it gives a trend line you can track across crawls so 'did the restructure work?' has an answer.

The risk of any single score is that it hides its reasoning. RankForge's answer to that is a hard rule: the headline number is always the weighted composite of the dimension scores you can see on the cards below it. There are no hidden caps, secret penalties, or overrides that move the headline away from the visible parts. If a problem deserves to move the score, it does so through a visible dimension — never through an invisible thumb on the scale.

lightbulb

Design principle: You should always be able to reconstruct the headline from the cards. A score that says 40 next to dimension cards averaging 85 would be a bug, not a feature — we removed exactly that kind of hidden cap once and it had destroyed report credibility.

The scored dimensions

The composite is built from a set of independently scored dimensions, each capturing one structural concern. They map directly to the concepts in the rest of this Academy.

Dimensions that make up the composite

DIMENSION              WHAT IT JUDGES
─────────────────────  ──────────────────────────────────
Authority distribution  Is internal equity concentrated on
                        the right pages, or pooled/leaking?
Crawlability & depth    Can pages be reached, and how deep
                        do important ones sit?
Orphans & connectivity  How many pages get no internal links?
Topical clustering      Are related pages wired into pillars
                        & clusters, or isolated?
Anchor text quality     Do internal links describe their
                        targets, or waste the signal?
Cannibalization &       Do pages compete with each other or
  consolidation         consolidate cleanly?

Each dimension is scored on its own, shown on its own card, and contributes to the composite by its weight. The exact dimension set evolves as the product does; the principle that the headline is their weighted blend does not.

Each of these has a deeper article: authority flow, crawl depth, orphan pages, topic clusters, anchor text, and cannibalization.

Why the weights differ

The dimensions are not weighted equally, because they don't have equal impact on whether a site's structure works. Weighting reflects leverage: how much a problem in that dimension actually suppresses rankings and how directly fixing it helps.

chevron_rightAuthority distribution and connectivity carry more weight because they're upstream — if equity never reaches a page, nothing else about that page matters. A perfectly written, perfectly clustered page that's orphaned still can't rank.
chevron_rightAnchor text quality carries real but smaller weight: it sharpens an already-existing link rather than determining whether authority arrives at all.
chevron_rightDimensions only present on some sites are renormalized, not penalized. If a site is too small to have meaningful clusters, the score is computed over the dimensions that do apply — an absent dimension never silently drags the number down.

info

On renormalization: When a dimension doesn't apply, the remaining weights are rescaled to sum to 100%. This is why two sites can be scored fairly even when one has no cluster structure to evaluate — you're never scored on a dimension that's irrelevant to your site.

What good and bad scores look like

Reading the score

85–100   Strong. Authority reaches priority pages, few
         orphans, shallow important pages, real clusters.
         Work is fine-tuning.

70–84    Healthy with gaps. A handful of orphans or some
         depth/anchor issues. Targeted fixes, not a rebuild.

50–69    Structurally compromised. Authority pooling, many
         deep pages, weak/absent clusters. Real plateaus
         usually live here.

below 50 Architecture is working against the content.
         Orphans at scale, leaks, near-zero clustering.
         Restructure before producing more content.

These are interpretation bands, not pass/fail lines. A 78 isn't a failure — it's a site with specific, fixable gaps. The value is in the trend and the per-dimension breakdown, not the single digit.

Common reasons for a poor score cluster into a few patterns: authority trapped on the homepage and never distributed; a large tail of pages at depth 4+; orphans created by a migration or by faceted navigation; and content published as isolated articles that were never wired into clusters. Notably, none of these are content-quality problems — which is exactly why they're invisible to a conventional audit and why the score surfaces them.

What improves the score

chevron_rightReconnect orphans with contextual links from relevant, already-linked pages — usually the fastest single improvement.
chevron_rightFlatten the path to priority pages so they sit within ~3 clicks of a strong entry point, raising both crawl frequency and authority.
chevron_rightWire isolated content into clusters: supporting pages link up to pillars, pillars link down to key supporting pages.
chevron_rightConcentrate homepage and hub outbound links on genuinely high-priority destinations instead of spraying them across everything.
chevron_rightFix the leaks — redirect chains, conflicting canonicals, and nofollow traps that bleed equity out of the graph.
chevron_rightReplace generic anchors ('click here', 'read more') on important internal links with descriptive ones.

info

Expect a one-time jump when we reweight: Adding or reweighting a dimension shifts every site's score at once. When that happens it shows up as a single discontinuity in your Crawl History, and the 'what changed' narrative explains it — it's a change in the ruler, not in your site.

What the score is not

This is the most important section, because the score is easy to over-read.

It is not a ranking prediction

Rankings depend on content quality, search intent, backlinks, competition, and dozens of factors that have nothing to do with internal structure. The Structural Health Score measures one thing well: whether your architecture routes discovery and authority sensibly. A high score removes structure as the bottleneck; it does not promise position one, and a site can score 90 and still rank poorly because its content or backlinks are weak.

It is not a Google score

No search engine publishes a structural health figure. This is RankForge's model, built to be useful and honest, not a leaked Google metric. Treat it as a well-instrumented opinion about your architecture, not a verdict from the search engine.

It is bounded by what was crawled

The score reflects the pages actually crawled. If the crawl hit its page budget, or the site is largely client-rendered, or a mega-menu links everything to everything (which makes depth and authority metrics uninformative), RankForge caveats the affected dimensions rather than reporting confident noise. A score computed over a partial or degenerate crawl says so — honesty under degenerate input is part of the methodology, not an afterthought.

Limitations, stated plainly

chevron_rightIt judges structure, not prose. It can't tell you your content is thin or off-intent — only that the page is or isn't well-connected.
chevron_rightIt sees internal links, not the live link graph Google has built over years; very new or very large sites are approximations.
chevron_rightWeights are informed judgement, not ground truth. They're tuned against what correlates with healthy sites and will keep evolving.
chevron_rightA single crawl is a snapshot. The signal compounds across crawls — the trend in your Crawl History is more meaningful than any one number.

Used as intended — a prioritization and tracking tool for the structural layer — the score earns its keep. Start with the pillar guide on structural SEO for the concepts, then run an audit to see your own dimensions broken out.

FAQ

Does a high Structural Health Score mean I'll rank well?expand_more

No. It means structure isn't holding you back. Rankings also depend on content quality, search intent, backlinks, and competition. A high score removes architecture as the bottleneck; it doesn't guarantee a position.

Why won't you publish the exact formula and weights?expand_more

The weights are tuned and they evolve as the product improves, so a published formula would be wrong by next release. What doesn't change is the principle: the headline is always the weighted composite of the visible dimension cards, with no hidden caps or overrides.

My score dropped but I didn't change anything — why?expand_more

Usually one of two things: a new crawl found an issue the previous one missed (e.g. orphans from a recent publish), or RankForge reweighted or added a dimension, which shifts every site at once. The latter shows as a single discontinuity in Crawl History with an explanation.

What's a good score to aim for?expand_more

85+ is strong; 70–84 is healthy with specific gaps; below 50 means structure is actively working against your content. But the trend across crawls and the per-dimension breakdown are more actionable than the single number.