部分重要概念
Text Corpus In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe.
Brown Corpus The Brown Corpus of Standard American English (or just Brown Corpus) was compiled by Henry Kucera and W. Nelson Francis at Brown University, Providence, RI as a general corpus (text collection) in the field of corpus linguistics.
Bank of English The Bank of English is the name of the COBUILD corpus, a collection of English texts. These are mainly British, but American and Australian data are also included.
Part-of-Speech Tagging Part-of-speech tagging (POS tagging or POST), also called grammatical tagging, is the process of marking up the words in a text as corresponding to a particular part of speech, based on both its definition, as well as its context, i.e. relationship with adjacent and related words in a phrase, sentence, or paragraph.
重要参考文献 何安平,2004,《语料库语言学与英语教学》,北京:外语教学与研究出版社。
杨惠中(编),2002,《语料库语言学导论》,上海:上海外与教育出版社。Gavioli, L. (2005). Exploring corpora for ESP learning. Amsterdam: John Benjamins.
华南师范大学外国语言文化学院编委会(编),2005,《语料库语言学的研究与应用》,长春:东北师范大学出版社。
Kennedy, G. (2000). An introduction to corpus linguistics [语料库语言学入门], 北京:外语教学与研究出版社。
Deignan, A. (2005). Metaphor and corpus linguistics. Amsterdam: John Benjamins.
Dash, N. S. (2005). Corpus linguistics and language technology: With reference to Indian language. New Delhi: Mittal Publications.
Connor, U. & Upton, T. A. (2004). (Eds.)
Applied corpus linguistics: A multidimensional perspective. New York: Rodopi.
Halliday, M.A.K. et al. (2004). Lexicography and corpus linguistics: An introduction. New York: Continuum.
领域前沿 Mark Davies, Brigham Young University http://davies-linguistics./personal/ Susan Hunston, University of Birmingham http://www.english./who/hunston.htm Gary Kennedy, Ohio State University http://www.math./~kennedy/ Wolfgang Teubert, University of Birmingham http://www.english./who/teubert.htm Corpus Linguistics 2007, the fourth Corpus Linguistics conference, the University of Birmingham http://www.corpus./conference2007/ International Journal of Corpus Linguistics http://www./cgi-bin/t_seriesview.cgi?series=IJCL The Inter-Varietal Applied Corpus Studies (IVACS) http://www.mic./ivacs/about.htm British National Corpus http://www.comp./computing/research/ucrel/bnc.html