Published: 2007 September
Herausgeber: László Kovács, Norbert Fuhr, Carlo Meghini
Buchtitel: Proceedings of the 11th European Conference of Digital Libraries (ECDL)
Reihe: Lecture Notes in Computer Science
Erscheinungsort: Budapest, Ungarn
The University of Trier maintains the DBLP (Digital Bibliography & Library Project) Computer Science Bibliography which offers bibliographic information about more than 870.000 scientific publications. This paper describes the DBLP WebCrawler, a meta search engine that is able to search for full text publications in PDF format for each DBLP entry on the web. Various search engines such as Google and Yahoo are used as data sources. The retrieved documents are additionally analysed and ranked according to their relevance. The proposed system differs from systems like CiteSeer in so far, that the DBLP Webcrawler builds upon metadata and tries to find relevant full-texts whereas CiteSeer mainly starts with full-texts and extracts metadata.
VG Wort-Seiten: 4