Top 5 reasons why Google may not crawl Websites

by on November 29, 2009

in SEO

I get lots of emails from people who are having difficulty getting their site crawled and indexed by Google:

Google is not crawling my XYZ website. I don’t know why? Can you help me please? My URL is somedomain.com

If you post such a question on a forum, as IndianSword, a user of Go4Expert.com, did in Google indexing and crawling issue?, people tend to suggest getting more incoming links for a new website. I suggested the same, but after the general answers he still had problems, so I needed to see whether something was missing on his website. Digging deeper into his website’s HTML, I found Meta tag issues caused by misconfigured settings in WordPress.

Now let’s see the possible reasons why your website is not in Google’s index or why Google is not crawling it.

1. Blocked by robots.txt

We should always block certain areas of a website, like the admin area or test areas, but doing it with the right syntax is a must. Done wrongly, it can cause indexing problems in Google. Check whether your robots.txt file contains any rule that blocks Googlebot from crawling your main pages. If you know the syntax you can check this yourself; if you are not sure what the correct syntax should be, add your website to Google Webmaster Tools and see whether any URLs you want indexed are blocked by your robots.txt file.
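
If you prefer to check a robots.txt yourself, here is a minimal sketch using Python's standard urllib.robotparser module. The sample rules, the example.com URLs and the is_blocked helper are my own assumptions for illustration. Also note that Python's parser uses simple prefix matching and does not understand wildcards, so Googlebot may read a complicated file differently; treat this as a rough first check only.

```python
from urllib.robotparser import RobotFileParser


def is_blocked(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text disallows `url` for `user_agent`.

    Hypothetical helper for illustration; it parses text you pass in
    and does not fetch anything from the network.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


# Assumed example: only the admin and test areas are blocked.
GOOD_ROBOTS = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /test/
"""

# Assumed example of a common mistake: a bare slash blocks the whole site.
BAD_ROBOTS = """\
User-agent: *
Disallow: /
"""

if __name__ == "__main__":
    for label, rules in (("good", GOOD_ROBOTS), ("bad", BAD_ROBOTS)):
        for url in ("https://example.com/", "https://example.com/wp-admin/"):
            print(label, url, "blocked" if is_blocked(rules, url) else "allowed")
```

With the first file only the admin URL is reported as blocked; with the second, the homepage is blocked too, which is the kind of mistake that keeps a whole site out of the index.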

2. Meta tags

Many webmasters have a noindex Meta tag on their pages by mistake, like

  • <meta name="robots" content="noindex, nofollow">
  • <meta name="robots" content="noindex">

If you have any such HTML on pages you want in Google’s index, remove it.
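
To look for such a tag without reading the page source by hand, something like the following sketch could help. It uses Python's standard html.parser module; the RobotsMetaFinder class and the blocks_indexing function are hypothetical names of mine, and the sketch assumes you already have the page's HTML as a string. It also treats the "googlebot" Meta name and the "none" value as blocking, since Google documents both.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collects directives from <meta name="robots"> and <meta name="googlebot">."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values may be None for bare attributes, so default to "".
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").strip().lower() in ("robots", "googlebot"):
            for part in attr.get("content", "").split(","):
                if part.strip():
                    self.directives.append(part.strip().lower())


def blocks_indexing(html: str) -> bool:
    """Return True if the HTML carries a noindex (or none) robots Meta."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return "noindex" in finder.directives or "none" in finder.directives


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
    print(blocks_indexing(page))
```

Run it against the HTML of the pages you want indexed; a True result means the page itself is asking search engines to keep it out of the index.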

Apart from noindex and nofollow I have also seen the revisit-after Meta.

  • <meta name="revisit-after" content="1 day">

I haven’t seen any evidence that major search engines support this tag. Even if one did, <meta name="revisit-after" content="1 day"> would not be taken as an instruction for search engine bots to return every day, but rather as an instruction to stay away from your site if it hasn’t been 1 day since the last visit.

3. Site is new

New websites are crawled less often by Google than more established websites, so if your website is new there is a high chance that Google will not crawl and index it very often. Instead of worrying about Google’s index, focus on the core of your business: if it is a forum or blog, adding more content, or if it is an e-commerce site, making more sales. Eventually Google will start crawling your site once you have the right mix of content and incoming links. If you are addicted to checking your site stats or the Google index, read Getting out of Recession for Blogger.

4. Less update frequency

If your website’s pages do not update often, Googlebot has little reason to visit your site and update its cache. So when you do finally update the content, Googlebot may take a while to cache the new version.

If you have a static HTML homepage or website, there is no need for Googlebot to come to your site very often, and when you make design or text changes it will be slow to refresh its cache. To avoid this you can add a blog or forum to your static site, which will be updated more frequently. The blog could be as simple as updates on what you or your company are working on.

5. Low incoming links

When your site is new you are bound to have few or no incoming links, so instead of checking your site in Google’s index, focus on getting good quality links to it. See how to Build Links to Website. You should never be satisfied with the number of links your website has; always think about getting a few more.

The possible reasons why Google may not crawl or index your website are endless, but I have tried to list a few that I have come across over time.

Still have issues? Post them in the comments and I would be more than happy to take a look.

Further Reading:

  1. Site Speed, Google & SEO. What have you done?
  2. Google Instant and Its Impact on SEO and PPC
  3. Check Site Performance with Google Webmaster Tool
  4. Effectively Build Links to New Website
  5. Ways to Make Money From Websites

24 comments

ODOnline December 5, 2009 at 8:55 am

Thank you very much for this wonderful article

I was wondering why my website isn't getting crawled by Google.
Most of the people I asked told me that it was because Google won't crawl pages which are in my language, but now, reading your article, I know that it is because my site isn't getting enough in-links and also it's new: 3 months since I bought the domain and 1 month since I wrote some content

Keep up the good posts!

adorigraphics December 14, 2009 at 6:53 am

Please forgive my ignorance, I am new to the whole SEO thing. I have Sitemeter installed on my blog (http://adorigraphics.blogspot.com) and I used to notice that Googlebot visited my blog maybe 2 or 3 times a day. About 6 weeks ago I stopped seeing it in my list of visitors, but I noticed that I was still getting traffic from Google and that the cache was still being updated. Then on 8th December even the cache stopped being updated. I had a feeling it could be a Blogplay widget that I installed on 8th Dec, so I removed it, and within 10 minutes Googlebot was visiting my page again, and it has done so another 2 or 3 times since. However, I can see that the cache view has still not changed since 8th Dec. So now it seems Googlebot is visiting but not caching my page? Do you have any idea whether I have a problem here, or what I can do about it? When considering your response please bear in mind I'm not knowledgeable about SEO terminology :) Thanks for your thoughts.

Shabbir Bhimani December 14, 2009 at 8:45 am

You have not done anything wrong. It's just that your homepage is not updated in the cache, but your 14th December countdown post is already in Google's cache, and I did not see anything wrong with your site.

adorigraphics December 14, 2009 at 10:11 am

Oh, when I look at the cache my most recent cached post says 8th Dec. You must be finding it a different way to me (I'm typing my site name into google and then clicking on cache). I will take your word for it that nothing is wrong since I don't have a clue. Thanks so much for taking the time to respond and for the reassurance.

Shabbir Bhimani December 14, 2009 at 10:00 pm

No, you are absolutely right: your homepage cache was from 8th December. But there are other pages on your site as well. Apply the same method to your pages and posts of 14th December and you will see that they are cached too, which means there is nothing wrong with the caching of posts. The home page cache not being updated is perfectly normal.

umeshg April 6, 2010 at 6:45 am

Hi, I'm divine light. I read this information and really got a suitable answer. It provided me with some good and accurate results. Thank you.

Petua Terbaik November 7, 2010 at 4:48 pm

I still have a problem: Googlebot is not updating my index at http://petuaterbaik.kaer-media.org. Can you tell me what is wrong?

Shabbir November 8, 2010 at 4:17 am

Google cached your site on 29 Oct 2010. As there is not much activity on it, Google does not visit your site regularly, so waiting is your best option. Apart from that, do submit a sitemap so that Google is notified when you add new content.

Pankaj November 13, 2010 at 12:24 pm

Can you suggest something for robots.txt?
I’m a little bit confused about making a robots.txt
:(

Shabbir November 13, 2010 at 2:31 pm

What do you want to know about robots.txt?

Pankaj November 13, 2010 at 2:55 pm

I mean, what lines should be there?

Shabbir November 13, 2010 at 2:59 pm

Copy it from any other site. For Wordpress blog you can use http://imtips.co/robots.txt

Sreejesh@techgyo February 20, 2011 at 10:07 am

I had sitelinks for my site until 2 days ago. Now I see nothing at all, and my site is not cached in Google. Also, only 165 pages are now indexed, compared to 2600 indexed before. I’m wondering what went wrong; I didn’t make any posts since then and there were no major changes to the site design either. What do you think might have gone wrong? Any advice would be greatly appreciated.

And your share buttons look awesome. Can you share the code or the plugin you’ve used?

Shabbir February 20, 2011 at 10:20 am

It may be one of those Google Dances, and 2 days is too short a time frame to analyze things. Wait for a few days and see if things come back to normal. Make sure you work on the things recommended by Google and on content.

Sreejesh@techgyo February 20, 2011 at 10:22 am

Thanks for the quick reply, I think I should wait. By the way can you name the plugin you are using for those share buttons, It would be great if you could share it with me.

Shabbir February 20, 2011 at 10:37 am

Which Share buttons? On the right side ones or after the post?

Sreejesh@techgyo February 20, 2011 at 10:48 am

I mean the ones after the post.

Sreejesh@techgyo February 20, 2011 at 1:02 pm

Please feel free to mail me the answer :)

Shabbir February 20, 2011 at 1:59 pm

It is not done using any plugin but manually done with AddThis. You can grab the html for the code if you want.

Kiran May 2, 2011 at 6:52 am

Hi,
I need your help. I have a website, http://www.torontoreitement.com/, which is not cached by Google, and I don’t know the reason. Please tell me the exact error in this site.

Shabbir May 2, 2011 at 7:04 am

Kiran, your site does not open for me.

geopvp October 3, 2011 at 2:21 pm

Why is my site geopvp.com not cached in Google?

Shabbir October 3, 2011 at 2:30 pm

I see the site is cached in Google.
