Webmaster Key - Discussion Forums


Welcome, Guest. Please login or register.
Did you miss your activation email?
February 08, 2012, 03:32:25 PM

Login with username, password and session length
Welceome to Forums!

Important information for guests and new members:

In order to understand the full benefits of becoming an active member of this forum, please review the following information on guest and new member restrictions. These forum changes have been prompted by an overwhelming and unreasonable amount of bot postings and incoherent guest spam messages. We wish to prevent these events from happening in the future and make our community a more comfortable place for all of our members.

For guests:

Guests are not allowed to open new topics, polls, or posts attachments.
If you wish to open up new discussions on this forum, we encourage you to register.

For new members:

New members with less than five posts are not allowed to modify additional profile information such as avatars, contact information, biographies, and signatures. However, new members are encouraged to post their own topics or reply to topics initiated by other members. Become active on the forums and 5 posts should be an easy task!

We are a diverse community with members from all over the world. We encourage new ideas and interesting conversation. Do not be afraid to post webmaster/computer-related questions or problems, as our active members are always willing to help when they are able. Interested? Join us.

+ Webmaster Key Forums
|-+ Website Marketing
| |-+ Promotion Ideas and Strategies (Moderator: Andy)
| | |-+ Googlebots and Site Duplication as potential spam!
0 Members and 1 Guest are viewing this topic. « previous next »
Pages: [1] Go Down Stumble Upon! Digg It! del.icio.us! Add to Technorati! ReddIt!  Send this topic Print
Author Topic: Googlebots and Site Duplication as potential spam!  (Read 3171 times)
zarathustra
Guest
« on: July 27, 2004, 10:33:57 PM »

Greetings all. Just signed up.   Smiley I have a couple of 'quick' queries I really hope someone can answer.
I created a new website - www.artgraphica.net - which I registered with google two or three weeks ago. From my logs, the googlebots are taking a peek at my robots.txt and my index page, but then disappear. I have a page ranking of zero. I have a number of sites pointing to mine with PR of 3 or 4 on average, so will I have to be patient waiting for something to come along and deep crawl? In contrast MSN has been very quick to look through all my pages.

My other query is that my page content is largely duplicated through a flash enhanced version of the website that people can visit via a link in the main menu. I also intend to include downloadable PDF's pretty much duplicating my online tutorials. Is there a danger someone like google could look at this and consider it spam because the information is there twice? If this is the case would adding a 'disallow' for this flash folder in my robots.txt stop the bots going here and therefore solve a potential problem?

Thanks in advance!!
Report to moderator   Logged
Hope
Key Keeper
Veteran
*****
Posts: 1 975


P.I.T.A.


WWW
« Reply #1 on: July 28, 2004, 11:35:44 AM »

Greetings all. Just signed up.
Welcome to the forums.
Quote
I created a new website - www.artgraphica.net - which I registered with google two or three weeks ago. From my logs, the googlebots are taking a peek at my robots.txt and my index page, but then disappear. I have a page ranking of zero. I have a number of sites pointing to mine with PR of 3 or 4 on average, so will I have to be patient waiting for something to come along and deep crawl? In contrast MSN has been very quick to look through all my pages.
Lets start with 2-3 weeks ago. This is not nearly enough time for google to do a deep crawl. You will need to give it time. You have a PR of 0 because google hasn't updated PR or their index in that time. They are not showing any backlinks to your site. On top of that, they just got hit with a virus and this could  be effecting the results still.
 
As for MSN crawling all over your site, it is a good thing but doesn't help you at all. MSN hasn't launched their search index yet. The spider is going all over the net getting as much information as it can. The estimated launch is December. I woudn't worry too much or be too excited over the MSN spider.

Remember, when you start playing with the search engines, you need to sit back and wait for them. You can not make them move faster than they want. Patience is a key here. If you can't wait or dont' have the patience to wait, I would suggest Google Adwords and Overture.

Quote
My other query is that my page content is largely duplicated through a flash enhanced version of the website that people can visit via a link in the main menu.
This appears to be in a frame set. This should not be a problem for the search engines in general. They don't follow frames. This might be an issue for people wanting to link to you though. Keep that in mind.

Quote
I also intend to include downloadable PDF's pretty much duplicating my online tutorials.
This should not be an issue. This is a common practice and not frowned upon by search engines. They understand that PDF are used to provide a means of downloading and saving or printing files. Don't worry about this.

Quote
Is there a danger someone like google could look at this and consider it spam because the information is there twice? If this is the case would adding a 'disallow' for this flash folder in my robots.txt stop the bots going here and therefore solve a potential problem?
I really don't see this as being an issue. The back side code is different. The flash site is framed. These should solve the issue of spiders seeing spam.

There really isn't a need to have duplicate content though. You don't have enough flash on those pages to make it a problem for users or spiders. Why do you have both?

One the flash page, there is a lot of scripting inside the pages, you might want to pull that from the page and call it from another file. Makes it more spider friendly.
Report to moderator   Logged

zarathustra
Guest
« Reply #2 on: July 28, 2004, 12:11:08 PM »

Hi Heidi - just wrote a nice lengthy reply, and the browser crashed!!  Shocked
Basically I just wanted to say a big thank you for sharing your time and knowledge.

The site was original flash, but I realised it wasn't search engine friendly, and so made it an optional version of the site. The main site itself is what I believe to be search engine friendly; I stripped out all the scripts and used an externall CSS. I like a little interactivity, animation and music which is why I add flash animations here and there, plus I can put it to good use when I come to do future art tutorials online.

I will wait patiently for Google and the bots to do their thing; I've also submitted to Dmoz and Altavista, and hope Altavista won't take too long, but I suspect Dmoz will if I hear from them at all.

I was using google yesterday when the problem occured - I figured it had to be serious, but caught more on the news about the mydoom virus.

Thank you once again, you've been a great help.

Gavin.
Report to moderator   Logged
Hope
Key Keeper
Veteran
*****
Posts: 1 975


P.I.T.A.


WWW
« Reply #3 on: July 28, 2004, 05:01:52 PM »

Gavin,

Only a full flash site is not indexable right now. Your site has flash elements, but most is nonflash, so it should be ok. Any nav in flash should have a second nav in html. Then it should be fine.

You will not hear from Dmoz. Just keep checking back. If your site is not indexed in 2 months send an email to the editor and request them to review it. Explain that you do not want to resubmit because you don't want it to look like you are trying to spam. This usually motivates an editor.

Report to moderator   Logged

zarathustra
Guest
« Reply #4 on: July 28, 2004, 05:07:38 PM »

Because the main site contains no flash, I don't mind the flash side not being indexed at all - in fact I included a disallow in my robots.txt to stop them going there! I think the navigation and hyperlinks in the non-flash main side of the site should be enough for the bots to crawl through.  Smiley

Thanks for the hint about Dmoz - again thank you, the advice is very useful and much appreciated!
Report to moderator   Logged
Lampwize
Key Keeper
Member
**
Posts: 63


SEO, Webmaster


WWW
« Reply #5 on: August 11, 2004, 10:16:41 PM »

I am no SEO specialist or anything, but when I took a look at your code, you have an improper !DOCTYPE declaration. I don't know if this might cause a problem or not, but think it would be smart to fix it. Here is a page on choosing a correct !DOCTYPE: http://htmlhelp.com/tools/validator/doctype.html. After you have done that I would also validate my code: http://validator.w3.org/. Good luck on your high future page rank!
« Last Edit: August 11, 2004, 10:19:17 PM by HotShot » Report to moderator   Logged

"I have not failed. I've just found 10,000 ways that won't work."
-Thomas Alva Edison

http://www.saratogalakesideacresassociation.org/
Local New York homeowners association located on Saratoga Lake.
spherica
Key Keeper
Sr. Member
****
Posts: 277


Consultant


WWW
« Reply #6 on: August 30, 2004, 02:00:04 PM »

Just an update on this. Google started crawling / parsing flash a while ago.

There have been discussions on this for a while now.

I also know in some circles they were doing contest for ranking flash sites, but can't remember where, so I don't know the outcome.
Report to moderator   Logged

Andy
Administrator
Veteran
*****
Posts: 5 752



« Reply #7 on: September 02, 2004, 11:07:35 PM »

I have noticed MSN being very active in crawling and found Hope's comment on it crawling the net very interesting. In fact it's like they are fighting 1-1 on my new website that isn't yet ranked in Google. If only I new what to do to give it the edge? Maybe have some MS related keywords :-) e.g. pagerank.net LOL!!!
Report to moderator   Logged

Hope
Key Keeper
Veteran
*****
Posts: 1 975


P.I.T.A.


WWW
« Reply #8 on: September 03, 2004, 01:01:44 PM »

We are all sitting and waiting to see how this whole MSN search will work out. I know it is in the top 3 for traffic to all my sites. I can't wait to find out how things will settle down so I can get to work on the algo and rank better.
Report to moderator   Logged

Pages: [1] Go Up Stumble Upon! Digg It! del.icio.us! Add to Technorati! ReddIt!  Send this topic Print 
+ Webmaster Key Forums
|-+ Website Marketing
| |-+ Promotion Ideas and Strategies (Moderator: Andy)
| | |-+ Googlebots and Site Duplication as potential spam!

Jump to:  
« previous next »


Our Partners
RelmaxTOP Ranking System Web Hosting RelmaxTOP Ranking System
Staff Sites
12Noon[12Noon Gallery] Andy[Urgentclick]
Tamuril[Tamuril's Digital Art Exhibit] Sensovision
Powered by MySQL Powered by PHP We are hosted by Relmax Inc. |Our Privacy Policy | Sitemap
Powered by SMF 1.1.9 | SMF © 2006-2009, Simple Machines LLC
Forum design by Tamuril © 2005.
Valid XHTML 1.0! Valid CSS!