New Google Webmaster Tools Feature: Index Status

25 July 2012 BY Barry Adams

Yesterday Google announced a new feature for Webmaster Tools that is genuinely useful and something many website owners have requested: Index Status.

In this new report, found under the Health section in your Webmaster Tools account, Google shows how many pages of your site it has included in its index.

The Advanced tab goes further and tells you:

  • How many pages it has crawled
  • How many pages it has opted not to include in the index
  • How many pages are blocked by robots.txt

From the relevant Google Support page:

Pages Crawled
This number indicates “the cumulative total of URLs on your site that Google has ever crawled. Not all crawled URLs get indexed, and Google may discover some URLs by other means such as inbound links from other sites. This number should increase over time as new pages are added to your site.”

Pages Not Selected
These are the pages on your site that Google has crawled, but “that are not indexed because they are substantially similar to other pages, or that have been redirected to another URL.”

Pages Blocked
These are the pages that Google cannot crawl because they are disallowed in your robots.txt file.

[Screenshot: Index Status, Advanced tab]

This is incredibly useful information that allows webmasters to quickly troubleshoot indexing issues on their websites. For example, the report can reveal that you have inadvertently blocked too many pages in robots.txt, or that you have made a mistake in your attempts to sculpt the indexing of your site (for example, if you have faceted navigation and want to ensure Google focuses on the most semantically relevant facets).
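If the Pages Blocked number looks higher than expected, one way to investigate is to test a handful of URLs against your robots.txt rules yourself. The following is a minimal sketch using Python's standard `urllib.robotparser` module. The function name `blocked_urls`, the sample rules and the example.com URLs are all hypothetical and only there for illustration. Note that the standard library parser does simple prefix matching, so its verdicts may differ from Googlebot's for rules that rely on wildcards.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs from `urls` that the given robots.txt text disallows."""
    parser = RobotFileParser()
    # parse() accepts the robots.txt content as a list of lines,
    # so no network request is needed for this check.
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


# Hypothetical rules: block internal search and one faceted-navigation path.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /search
Disallow: /shop/filter/
"""

# Hypothetical URLs to test against the rules above.
SAMPLE_URLS = [
    "http://example.com/shop/shoes",
    "http://example.com/shop/filter/colour-red",
    "http://example.com/search?q=boots",
]

if __name__ == "__main__":
    for url in blocked_urls(SAMPLE_ROBOTS, SAMPLE_URLS):
        print("Blocked by robots.txt:", url)
```

Running the same check over the URLs in your sitemap gives a rough picture of which sections of the site a given rule takes out of the crawl, which you can then compare with the trend in the Index Status report.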

Google’s Pierre Far has elaborated on the official blog post. His write-up is definitely worth a read, as it gives you more hands-on details on how to use these new reports.

AUTHORED BY:

Barry Adams is one of the editors of State of Digital and is a freelance SEO consultant based in Belfast, delivering specialised SEO services to clients across Europe.
  • Amresh Kumar

    How can I check which pages Google has crawled but “that are not indexed because they are substantially similar to other pages, or that have been redirected to another URL”? Please tell me.
