The web as a corpus: a resource for translation

Helia Vaezian

doi:10.15388/VertStud.2018.5

Articles

Helia Vaezian

Khatam University

Published 2018-12-20

https://doi.org/10.15388/VertStud.2018.5

PDF

HTML

Keywords

corpora
corpora for translation purposes
webascorpus
translator training

How to Cite

Vaezian, H. (2018) “The web as a corpus: a resource for translation”, Vertimo studijos, 11, pp. 62–75. doi:10.15388/VertStud.2018.5.

Download Citation

Abstract

[full article, abstract in English; abstract in Lithuanian]

Accessing ready-made corpora may not be always easy. This is especially true for less dominant languages such as Persian for which the number of available corpora is very limited. Moreover, most existing corpora are domain specific, which implies that they supply a limited range of genres and text types. They, thus, may not always contain the information the translator is looking for. Drawing on the world wide web as a big corpus, however, is not subject to such limitations. The web, in fact, can be considered as a very large multilingual corpus containing texts in almost all languages and all text types. The present paper reports the results obtained from a collaborative experience in which undergraduate English translation students from the Department of translation Studies of Allameh Tabataba’i University made use of Google search engine and webascorpus web concordancer to extract translationally-relevant data from the web.

PDF

HTML

Downloads

Download data is not yet available.

Most read articles by the same author(s)

Helia Vaezian, Fatemeh Ghaderi Bafti, On Optional Shifts in Translation from Persian into English , Vertimo studijos: Vol. 12 (2019): Vertimo studijos