New Study: Is Google Biased Towards Particular News Sites?
Recently, Google CEO Sundar Pichai was called to testify in front of Congress about potential bias in Google’s algorithms. This isn’t the first time Google has been accused of bias and likely will not be the last time. Google alleges there is no bias, yet many Conservatives argue that Google is biased against them.
With our expert knowledge of search engine optimization (SEO) and Ahrefs’ massive amounts of data, we wanted to see if we could identify any bias from Google by looking at data for popular Conservative and Liberal news sites. If you’re not familiar with Ahrefs, we’re one of the top SEO tools with seriously big data about the web.
Conservative news sites:
- New York Post
- The Last Refuge
- Drudge Report
- The Federalist
- Orange County Register
- The Epoch Times
- Washington Times
- Christian Broadcasting Network
- National Review
- Townhall
- The Mark Levin Show
- The Rush Limbaugh Show
- Breitbart
- Newsmax
- The National Interest
- The Gateway Pundit
- RedState
- PJ Media
- Washington Examiner
- Fox News
- Christian Today
- Zero Hedge
- The Daily Caller
- TheBlaze
- The Daily Wire
Liberal news sites:
- Vox
- U.S. News & World Report
- The Washington Post
- CNN
- Bustle
- NBC News
- Hollywood Reporter
- Los Angeles Times
- Yahoo News
- Al Jazeera
- Rolling Stone
- HuffPost
- The Verge
- The New York Times
- ABC News
- TIME
- CBS Local
- The Guardian
- Bloomberg
- NPR
- CBS News
- The Atlantic
- Politico
- Univision
Before we get to the results, I should cover a bit about Ahrefs data. We have hundreds of millions of search terms and large amounts of clickstream data. We use this data to estimate organic traffic by looking at all the different queries people search for, the positions that websites occupy in the search results, and where users click. For the Core Updates, we decided to look at traffic at the start of the Google Core Updates and traffic 14 days later. This is to give Google time to roll out the changes to their different data centers. It also gives us time for our data to reflect the changes.
Our data is normalized in the sense that volumes are averaged over 12 months, so it should mostly account for seasonality. Elections are an exception, since they don't happen every year. We're also not going to see newer stories or search topics early on, but we should pick up any popular searches and related clickstream data later.
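The comparison described above (estimated traffic at the start of an update versus 14 days later) comes down to a percentage change. Here is a minimal sketch, assuming invented traffic figures and a simple sum across sites. The site names and numbers are hypothetical, not Ahrefs data, and the exact aggregation used in the study is not stated.

```shell
# Hypothetical input: site name, estimated traffic at update start, traffic 14 days later.
# These figures are invented for illustration only.
traffic_data='site-a 1000 950
site-b 2000 2100'

# Sum both columns across all sites, then report the total percentage change.
total_change=$(printf '%s\n' "$traffic_data" |
  awk '{ start += $2; after += $3 }
       END { printf "%.2f%%", (after - start) / start * 100 }')

echo "Total traffic change: $total_change"
```

A negative result would indicate a decline over the 14-day window, as in the totals reported for both categories.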
From 2015 to the present, we see a decline in average traffic for the top news sites in each category during Google Core Update periods.
Conservative total traffic decline: -2.65%
Liberal total traffic decline: -1.78%
These numbers are very similar, and the difference between them is not statistically significant, considering we're taking into account the traffic of 50 websites over a period of 6 years. Leading up to the 2016 election, the impact on both categories was roughly equal. Leading up to the 2020 election, the results from the previous year or so show a roughly equal impact on both categories, with the most recent update seeming to be better for Conservative websites.
If we look at the individual data points, both Conservative and Liberal news sites saw positive and negative impacts during every one of these Google Core Updates. Each box plot below represents the top websites in each category, and I’ll reiterate that every single update had winners and losers for both categories. Typically, whether a site wins or loses in a core update is related more to its quality than anything else.
While we can’t conclude from this data that there is no bias in Google search results, we can say that within the last 6 years, we don’t see any new bias introduced during Google Core Updates.
Is there a traffic bias?
One of the things that stood out to us is that Liberal websites definitely get more traffic than Conservative websites.
Now the question is, why is that the case? Does this show a potential bias that predates our keyword data set? Let’s find out if we can explain the traffic difference.
Amount of content
When looking at the number of indexed pages, Liberal news sources have over 8x more pages indexed than Conservative news sources. In fact, the chart is almost identical to the one above for traffic share. As a result, the top Liberal news sites generally have more chances to rank for different things than the top Conservative news sites.
Branded vs. unbranded traffic
The branded traffic for CNN and Fox News is roughly the same, meaning that a similar number of people are specifically seeking them out in organic search. However, branded traffic makes up a smaller percentage of CNN’s overall traffic, likely because they simply have more content. CNN has ~2.5 times the number of indexed pages as Fox News, so they have more chances to rank for different things.
Other explanations for traffic differences
With only Google search data, it’s hard to definitively determine why Liberal sites get more traffic than Conservative sites. It may be that Conservatives tend to use Google less. It’s also possible that Conservatives get more news from TV, apps, or social media than Google. All of this may be true, but without additional data, these statements can’t be confirmed and are merely conjecture.
More ways we could have looked for bias
If we looked at specific examples of queries, I’m sure we would have found what appeared to be examples of bias. The problem with using individual examples is there is an inherent bias from the person doing the analysis. The data is complex and you’d need a good way to determine if the sites and content are relevant to specific queries. Unfortunately, this would be difficult and extremely subjective, which is why we chose not to pursue this route.
We could have also looked at link data or links from other websites to these news sites. Again, I have no doubt that this data is biased as the more popular and high-ranking websites with more pages would tend to get more links naturally. We already proved this in our backlink growth study.
We could have looked at specific examples of search terms suggested by Google via their autocomplete system, but we already know that they remove many negative terms from these results. Again, this is already biased, and it would be difficult to prove any malicious bias here.
Final thoughts
There’s an inherent bias in everything. While our data shows that Google Core Updates didn’t seem biased one way or another over the past few years, we still can’t confidently say there is no bias elsewhere in Google’s system.
Got questions about this data? Ping me on Twitter.
Mastering Text Manipulation With the Sed Command
The Linux command line interface provides a wealth of tools for text processing, and one of the most powerful among them is the sed command. Sed, an abbreviation for Stream EDitor, is a versatile tool that allows for complex manipulation of text files and streams.
What is Sed?
Sed is a non-interactive text editor that operates on piped input or text files. By providing it with instructions, you can make it modify and process text in files or streams. The most common use cases of sed include operations like selecting text, substituting text, modifying an original file, adding lines to text, or deleting lines from the text. It can be used from the command line in Bash and other command-line shells.
Sed Command Syntax
The syntax of the sed command comprises three main parts:
- Options: These control the output of the command.
- Script: This contains a list of commands to run.
- Input file: This is the file you’re using the sed command on.
In the absence of a filename, the script operates on the standard input data. You can also run the sed command without any options. The basic syntax looks like this:
sed OPTIONS [SCRIPT] [INPUTFILENAME]
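As a rough illustration of the three parts, the sketch below runs the same script once on standard input and once on a file. The sample text and the temporary file are assumptions made for the demo; -n and the p command are standard sed features.

```shell
# No input file: sed reads standard input.
# Option -n suppresses automatic printing; the script '2p' prints only line 2.
from_stdin=$(printf 'one\ntwo\nthree\n' | sed -n '2p')
echo "$from_stdin"

# With an input file (a temporary file created just for this demo).
demo_file=$(mktemp)
printf 'one\ntwo\nthree\n' > "$demo_file"
from_file=$(sed -n '2p' "$demo_file")
echo "$from_file"
rm -f "$demo_file"
```

Both invocations should print the same line, since only the source of the input differs.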
Sed Vs. Awk
Sed isn’t the only text processing tool in the Linux ecosystem. Another powerful utility is awk. While both of them work with text, they have some key differences:
- Sed excels at parsing and transforming text with a compact, simple language. It is limited in scope but easy to use.
- Awk, on the other hand, is a tool for text processing that lets you write full programs in the form of statements. It is more versatile and powerful than sed, but also more complicated to use.
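To make the contrast concrete, here is a small sketch with made-up records (the names and numbers are hypothetical). Sed applies a one-line edit to the stream, while awk runs a tiny program that keeps state across lines.

```shell
# Hypothetical sample records: a name and a number per line.
records='alice 30
bob 25'

# sed: a single substitution applied to every line of the stream.
renamed=$(printf '%s\n' "$records" | sed 's/bob/carol/')

# awk: a small program with a variable and an END block, summing the second field.
total=$(printf '%s\n' "$records" | awk '{ sum += $2 } END { print sum }')

echo "$renamed"
echo "$total"
```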
Exploring Sed with Examples
Let's dive deeper into the workings of sed with some practical examples. For all these examples, assume that we have a file called ik.txt. Note that sed does not alter the original file by default. All changes will appear in the output, but the original file will remain unmodified.
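A quick way to confirm this behavior is to compare sed's output with the file on disk. The sketch below assumes a throwaway copy of ik.txt in a temporary directory with made-up content, and uses a simple substitution as the edit.

```shell
# Work in a temporary directory so no real file is touched.
workdir=$(mktemp -d)
printf 'abc abc\n' > "$workdir/ik.txt"

# sed writes the edited text to standard output...
edited=$(sed 's/abc/def/' "$workdir/ik.txt")
# ...while the file on disk still holds the original text.
original=$(cat "$workdir/ik.txt")

echo "sed output:   $edited"
echo "file content: $original"

# To keep the result, redirect the output to a different file.
sed 's/abc/def/' "$workdir/ik.txt" > "$workdir/ik_edited.txt"
saved=$(cat "$workdir/ik_edited.txt")
rm -rf "$workdir"
```

Redirecting to a separate file is the safe choice here. Redirecting back onto the input file itself would truncate it before sed reads it.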
Substituting Text
One of the most common operations in sed is text substitution. This is done using the s command. For example, to replace the first instance of a pattern abc on each line with another pattern def, use:
sed 's/abc/def/' ik.txt
Here, only the first instance of abc in each line is substituted with def. If you want to replace the third instance of a pattern on each line, you can use:
sed 's/abc/z/3' ik.txt
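The difference between the two commands is easiest to see on a single line that contains the pattern several times. This sketch assumes a made-up input line rather than the contents of ik.txt.

```shell
# Hypothetical input line with four occurrences of the pattern.
line='abc abc abc abc'

# No flag: only the first occurrence on the line is replaced.
first_only=$(printf '%s\n' "$line" | sed 's/abc/def/')

# Numeric flag 3: only the third occurrence on the line is replaced.
third_only=$(printf '%s\n' "$line" | sed 's/abc/z/3')

echo "$first_only"   # def abc abc abc
echo "$third_only"   # abc abc z abc
```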
Global Substitutions
In a global substitution, all instances of a pattern are replaced. This is achieved by appending g to the substitute command. To replace all instances of abc with XYZ, use: