Here's a thought I had the other day. I decided to ask the good folks at Google about it:
To: suggestions at google.com
Subject: How to restrict Google indexing to specific portions of a web page?
I would like to know if there is a way to make the Google indexer selectively ignore portions of a web page. In my case, I have a personal web site that lists headlines from another site. Those headlines appear on a lot of pages, and they cause Google searches to yield false matches, where people find a match between some word in the headlines of that off-site content and some word I've used in the real content on my site. I would like to be able to tell Google to ignore those headlines altogether, since they are dynamic and aren't a part of my site per se. I'm thinking of some kind of marker or tag that would instruct Google to ignore a range of HTML code, like "<!--#Google: ignore-on--> ... <!--#Google: ignore-off-->" or something similar.
Please let me know if this is possible. If it isn't, would you consider extending your service to meet this requirement? I think it would improve the quality of search results a good deal if it were put into practice.
The auto-responder sez:
Thanks for writing to Google.
We read all of the email we receive and try to send personal responses to each message.
Wouldn't it be nice if Google were this flexible? We'll see what happens.
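For what it's worth, here is a minimal sketch of how an indexer might honor the markers suggested in the email. The marker strings are only the hypothetical ones I proposed, not anything Google is known to support, and the function name `strip_ignored` is made up for illustration. It simply drops everything between a matching on/off pair before the text would be indexed.

```python
import re

# Hypothetical markers from the email above; not a real Google feature.
IGNORE_BLOCK = re.compile(
    r"<!--#Google: ignore-on-->.*?<!--#Google: ignore-off-->",
    re.DOTALL,  # let an ignored span run across line breaks
)


def strip_ignored(html: str) -> str:
    """Return html with every ignore-on ... ignore-off span removed."""
    return IGNORE_BLOCK.sub("", html)


page = (
    "<p>My own post about gardening.</p>"
    "<!--#Google: ignore-on-->"
    "<ul><li>Headline from another site</li></ul>"
    "<!--#Google: ignore-off-->"
    "<p>More of my own content.</p>"
)

indexable = strip_ignored(page)
print(indexable)
```

The non-greedy match keeps separate ignored spans separate, and a marker that is never closed is left untouched rather than swallowing the rest of the page. That is one possible design choice, not the only sensible one.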