Where academic tradition
meets the exciting future

Characterization of Russian Deep Web

Denis Shestakov, Natalia Vorontsova, Characterization of Russian Deep Web. In: Proceedings of Yandex Research Contest 2005, Yandex company, 2005.

Abstract:

The significant portion of the Web is hidden behind search forms
and not indexed by conventional search engines. This part of
the Web is known as the deep Web. Pages in the deep Web are
dynamically generated in response to queries submitted
via search forms. In this work, we studied the Russian part of
deep Web. Our main goal was to estimate the number of deep Web
sites in the Russian deep Web. The presented study is a first
work devoted to the certain part of deep Web, which is formed on
the basis of some particular language usage.

Files:

Abstract in PDF-format

BibTeX entry:

@INPROCEEDINGS{inpShVo05a,
  title = {Characterization of Russian Deep Web},
  booktitle = {Proceedings of Yandex Research Contest 2005},
  author = {Shestakov, Denis and Vorontsova, Natalia},
  publisher = {Yandex company},
  year = {2005},
  keywords = {deep web, Russian deep web, random IP-sampling, random host-sampling},
}

Belongs to TUCS Research Unit(s): Turku BioNLP Group

Edit publication