October 14, 2024 • 20 min read
Information gain in SEO is a bigger deal than you probably think.
Copycat content is a massive problem for SEOs right now, and understanding how to leverage information gain can give you a competitive edge and help you to rank at the top of the SERPs amongst a sea of pages that offer nothing new.
For years now, SEOs have let themselves believe (I’ve been guilty of this at times in the past, we all have) that the best route to creating content that ranks is to take the skyscraper approach; to take what’s already ranking and make it a bit better. Often, that means creating what we could call Frankenstein content… taking best bits of the top few ranking pages and using them to create something ‘new.’
And that’s where the problem begins.
Doing this doesn’t actually create something new. It just mashes up content (including ideas and topics) from pages that already rank in the hope that Google sees them as some kind of ‘super’ page.
And for a while, this worked. If it didn’t, the approach wouldn’t have become so widely used.
But times have changed, and if this is still your approach, you’re going to want to switch things up as soon as you can. As tough as it is to hear, no one wants to read another piece of content that’s nothing more than a reworded version of the one they read a few minutes before. And in turn, this also means Google doesn’t want to rank it.
It’s time to level up your approach to content creation and start producing content that your audience actually wants to read and that actually deserves to rank.
And in this guide, I’m not just going to explain exactly what information gain in SEO is, but help you to understand why it’s such an important tactic to be using when creating or optimising content, as well as walking you through how to go about actually leverage information gain to win on the SERPs.
Things might get a little bit nerdy at times, but I’ll offer simple explanations too, where necessary.
Keep reading to learn why information gain is something that goes hand-in-hand with E-E-A-T and with “creating helpful, reliable, people-first content” and why it might just prompt the most important shift you make in your SEO strategy for years.
You could be forgiven for thinking that information gain is just another SEO buzzword given how it seems everyone is all of a sudden talking about it.
But it’s not. It’s something you need to understand and use as a fundamental part of content creation.
And, for once, it’s a concept that has come straight from Google, rather than being a phrase coined by SEOs. In fact, it’s taken straight from a patent that was filed by Google in 2018 and published in 2020; “Contextual estimation of link information gain.”
But before I take a look at what this patent tells us, let’s look at what information gain means for SEOs in a more general and tactical sense.
Information gain is all about how unique your content is in comparison to other content that’s already out there.
More specifically, we can think about it as a way to reward content that adds new information to any particular topic, rather than just curating other content information that already ranks.
It’s content that’s more valuable to searchers because it offers something that other, existing content doesn’t.
And it’s deeply rooted in the basis that Google wants to return content that’s useful to the searcher and provide a varied set of results; not just the same information presented in 10 or more different ways.
But just to clarify; it’s absolutely not about just creating longer content. Far from it. More words doesn’t mean more (or new) information.
Information gain in SEO is all about:
To fully understand what information gain means as an SEO, we need to look at some of the wording Google’s used around, in particular, the Helpful Content System.
We’ve seen Google talk again and again about the need to publish content that’s ‘helpful,’ and how “Google’s automated ranking systems are designed to present helpful, reliable information that’s primarily created to benefit people, not to gain search engine rankings, in the top Search results.”
And something we need to pay close attention to here is the reference to rewarding content that’s primarily created to benefit people, not to gain search engine rankings.
It’s a line that’s been used by Danny Sullivan, John Mueller and many more Googlers when responding to questions from sites whose rankings and traffic were wiped out by the HCU, and one that has frustrated a good number of SEOs and site owners at times due to the ambiguity around it.
And, it’s a claim that’s often misunderstood, in my opinion, but one which we need to pick apart in order to understand why information gain is such an important part of SEO right now.
Google has given insight into what is meant by “search engine-first content,” sharing the following warning signs that the content approach being followed needs to be reevaluated.
I’m going to bold the points here that I believe we need to consider carefully when talking about information gain.
There’s more to creating helpful content than information gain. Of course there is. But one of the reasons why it’s becoming such a fundamental concept to SEO success is because it forms a key part of it.
If a page adds nothing new or unique to a topic when compared to other content that’s out there, can it really be considered as helpful? It’s doubtful.
And this leads nicely onto the concept of ‘consensus content.’
This is another concept that’s come from Google, yet one that in many ways seems to oppose the idea of information gain. But that’s not quite the case, at least not when they’re considered together and how Google likely uses them as part of its ranking systems.
I’m going to look at this before we dive deep into Google’s information gain score, largely because of my belief that consensus content is what you need to consider as a topic’s baseline information, before you work to improve a page’s information gain.
Knowing how to use both consensus and information gain is going to be a key part of SEO success going forwards, as far as I’m concerned.
But what do we mean by ‘consensus content’?
It’s all about making sure your content aligns with the general consensus on a topic. This means your content needs to:
David Denton writes on the topic of consensus, stating that, “In the context of Google’s search algorithms, consensus refers to the trustworthiness and factual accuracy of a site’s content. High-quality content that reflects the consensus on a topic is more likely to be considered authoritative and reliable by both users and search engines.”
In simple terms, you can’t go telling your audience that bananas are blue. They’re not. They’re yellow. The widely accepted consensus is that they’re yellow. Imagine the uproar if someone searched Google for ‘bananas’ and was met with the top result telling them that bananas are a blue fruit. Consensus content reflects the truth that users expect to find, reinforcing both the trust and reliability of the page and publisher.
This doesn’t mean directly copying content that’s already out there, though. It’s about the finer topics and information within a piece of content and making sure that what you say matches the consensus.
Consensus comes from multiple high quality sources, not just any old sites. It’s about trusted consensus, with this post explaining more about what Google determines a high quality site.
We also see Google’s Quality Rater Guidelines state that, “for informational pages and pages on
YMYL topics, accuracy and consistency with well established expert consensus is important,” when talking about content quality.
If consensus is about aligning with the widely accepted facts on a topic and information gain is about adding new, extra value content that goes beyond what others say, doesn’t that mean these two ideas conflict?
No, it doesn’t, despite initially appearing that way.
You see consensus should be thought about as being the baseline for your content to be considered eligible to rank. If you’re not aligning with the factually correct things in a topic, why would Google rank your content? It wouldn’t.
But that’s not enough. Many sites’ content will align with consensus, and information gain is the value and information that’s added beyond this.
Think about consensus as this baseline requirement and information gain as your competitive advantage and it starts to make a little more sense.
These aren’t competing concepts at all, they’re both part of any successful content strategy and must be used together. Overlook one, and you’ll struggle to rank.
Now we’ve cleared up what information gain means in the tactical sense of SEO and content marketing, let’s dive a little deeper into understanding Google’s ‘information gain score,’ the key concept discussed in the search engine’s “Contextual estimation of link information gain” patent.
Filed in 2018 and published in 2022, this patent demonstrates methods of calculating an ‘information gain score’ for documents (in this context, web pages), then using this score to rank them.
In simple terms, the patent describes using information gain scores to rank web pages as search results. And based on what we’ve already learned about information gain, it’s perhaps no surprise to read that, “An information gain score for a given document is indicative of additional information that is included in the document beyond information contained in documents that were previously viewed by the user.”
When SEOs talk about information gain, what they’re really talking about is Google’s information gain score.
Here’s how information gain score calculation works:
The patent gives us a wealth of information into why the information gain score exists, and how it works. Here’s a top-line summary of the things you need to know:
Perhaps the best summary of Google’s information gain score was written by Bill Slawski, who wrote that, “this may mean that boosting some pages in rankings based on how much information they would add to a searcher and demoting them if they don’t add much information to a searcher.”
I recommend reading Bill’s article to learn more about the patent and how information gain scores are calculated.
As an SEO or content marketer, that’s the main thing you need to know. Content that adds new or additional information to a topic will be rewarded on the SERPs, whilst content that offers less than what’s already out there will perform worse.
Of course, high quality content (I think we can safely refer to both consensus and information gain as parts of this) isn’t the only ranking factor that Google uses, hence why it’s not quite as simple as information gain guaranteeing better rankings. But all else being equal, a page that offers additional or new information on a topic can be expected to rank higher.
Creating content that ranks at the top of the SERPs shouldn’t be easy.
Let that sink in.
Creating top-performance content should involve a lot more than reworking what already exists and that ranks on Google. After all, if every piece of content out there is just a copycat version of the others, where’s the value to the reader?
Where’s the incentive for a user to browse through more than one result?
It’s simple. There isn’t. And that’s the whole point with information gain.
If Google’s top ranked results all say the same thing, that might as well be summarised and returned in an AI Overview.
Surely?
Improving information gain should be one of the biggest priorities for SEOs and content marketers right now, and here’s why:
AI Overviews and Google’s Helpful Content System aren’t going anywhere, neither is the ease of being able to create consensus content on pretty much any topic in minutes with an LLM.
This means that information gain is now your fast-tracked route to outranking your competitors. Many SEOs are still to make the shift to prioritising information gain as part of their content creation and optimisation process, leaving an opportunity for those that do.
So then, how do you go about actually creating content that offers information gain?
If you want to create content that performs on the SERPs, you need to not only understand what information gain is, but adapt your content production and optimisation processes to include it.
Here’s how…
Subject matter experts within your business are a goldmine when it comes to information that can create information gain, and using them should form part of your process.
That doesn’t mean they need to create the content, though.
Subject matter experts can be used to:
Using experts is often the difference between producing consensus content and content that contains information gain. Why? Because those ‘extra bits of information’ that make up information gain are the things inside a subject matter expert’s head; if it’s widely available information, that’s consensus content.
If your business doesn’t have subject matter experts? Find one and collaborate with them.
One guaranteed way to create information gain is by using something that no one else has; original research and/or first-party data.
If you’re able to undertake your own research, you’re instantly including something that adds value when working this into your content. Yes, it takes time and resources to conduct your own research, but no one ever said content creation should be easy.
Go the extra mile to create something of value.
Many businesses also have their own data that can be used to add value to content. Stats from their own customers, for example.
Ask yourself what data you collect and how this could be used to add something extra to your content. In many cases, you’ll find lots of ways this can help you to do stand out.
It’s easy to attempt to produce content within a marketing team. It’s usually quicker and requires less explanation on what you’re trying to do than working with other teams.
But this is the easy route. And remember what we said about content creation not being easy?
Work with other teams and you’ll instantly find yourself going beyond creating consensus content.
Sit down with other teams and use them to help you really understand the challenges and problems of your clients and customers.
Ask teams such as customer service and sales things like:
Understanding your customers in extreme detail means you’re able to tailor content to these people. And this isn’t something you can do by looking at existing content on the SERPs.
I want to show you a smart way to use LLMs and AI such as ChatGPT, Claude or Google Gemini to help you to improve your content’s information gain.
And it’s based on the simple fact that LLMs output answers and responses based on consensus.
In other words, they can’t add anything new to a topic beyond what’s already out there. They’re useless at information gain.
So use this to your advantage.
Ask your chosen LLM to write a piece of content with the same intent and topic as what you’re trying to rank and use that as a perfect example of what consensus-led content looks like.
Compare this to your own, and ask yourself what you’ve added that the consensus content from the LLM doesn’t include.
This is information gain. It’s the opinions, insights, quotes and more than an LLM could never come up with because they’re the things unique to your business.
If you removed the content from your piece that matched what was output from the LLM, then the content that improves your information gain score is what’s left.
LLMs are great for producing consensus content, so leverage them for this, rather than as a writer. If you want a quick and easy way to ensure you’re producing content that goes beyond consensus and includes some level of information gain, then AI is your best friend.
Using AI in SEO is all about figuring how you can use it as part of your workflow to make certain tasks easier (or even possible), and this is one of them where it’s a perfect fit.
SEO has changed in the last few years, and one of the biggest changes is information gain.
In fact, I’d go so far as to say it’s one of the new foundations of SEO success.
If you want to rank at the top of the SERPs in 2024 and beyond, you need to understand that a technically strong site that can be crawled and indexed is a must, as is meeting ‘SEO best practices.’
But beyond that?
Intent + Information Gain + Reputation = SEO Success
Don’t overlook the importance of information gain in SEO. It’s one of the biggest opportunities you have to own the SERPs.