Institute for Learning and Research Technology

Web Crawling High-Quality Metadata using RDF and Dublin Core

David Beckett
Institute for Learning and Research Technology
University of Bristol

Abstract

This paper describes how combining high-quality resource description with a web crawler, using simple Dublin Core and the RDF model, created a novel search system for the teaching, learning and research communities. The combination gave the major benefits of both approaches: it returned high-quality resources as well as timely and more widespread ones. It also provided new features, such as presenting the provenance of the discovered resources and improving cross-subject-area discovery, and it was capable of integrating well with other emerging semantic web systems such as web page annotations.

Keywords: web searching, RDF, Dublin Core, metadata, subject gateways

This paper has been submitted to WWW2002.

See also the WSE home page