International journal
ISSN 2311-0759 (Online)
ISSN 2311-0740 (Print)


Textuality

Web Pages, Text Types, and Linguistic Features: Some Issues

  From a textual point of view, the web is a huge reservoir of documents. On the web, virtually everything can be seen as a ‘document’ or, better, a ‘web page’. The sheer amount of text available is overwhelming. Furthermore, the web is largely wild and uncontrolled. This becomes clear if we compare a ‘tamed’ resource of the paper world, like the British National Library, with the ‘untamed’ English Web. In this empirical study, I investigated text typologies in a random sample of raw web pages rather than in a corpus of pre-selected and pre-processed documents.