|
|
|
|
The ProjectAs an experiment, we are currently working on a Web crawler which collects RDF data from the public Web and uploads this data into Google Base. MotivationGoogle has started a new service called Google Base (see Announcement). Google Base is a public database into which everybody can upload any kind of structured information. Uploaded information can be searched using a web-interface. We think that Google Base might turn out to be an important step in the development of the Web from a medium for publishing unstructured text into a medium for publishing structured information. But Google Base also raises some questions concerning the Semantic Web, an effort aiming at extending the current Web with structured information. Semantic Web data can be accessed and used by everybody. By publishing information only on Google Base and not on the Web, you kind of donate your information to a single private company, having the role of a gatekeeper that can decide what is done with your information. Thus, we think it is preferable to publish structured information directly on the Web using Semantic Web technologies and to have a crawler which collects this public information and pushes it into Google Base. This setup combines the advantages of both architectures: Everybody can access your information without any gatekeeper and information can still be searched easily using the Google search interface. Current StatusWe are currently experimenting with uploading FOAF profiles into Google Base. We search for profiles using the FOAF bulletin board as a starting point, and crawl rdfs:seeAlso links. We crawl only "hand-crafted" profiles and ignore the large social network sites like LiveJournal, tribe.net and TypePad. We currently don't perform any "smushing" on the profiles, so duplicates are possible.
FeedbackWe are very interested in your opinion about this experiment. Please send feedback to Chris Bizer and Richard Cyganiak and cc the Semantic Web mailing list if your comment is of general interest. Opt-OutIf you don't want us to upload your RDF files into Google Base any more, please send an email to Chris Bizer and we will remove your file.
|
||||
|
Freie Universität Berlin - Fachbereich
Wirtschaftswissenschaft - Institut für Produktion, Wirtschaftsinformatik und
Operations Research
Lehrstuhl für Wirtschaftsinformatik: Chris Bizer
Letzte Aktualisierung:
05.12.2005
Administrator