分享

CALL(8):Corpus Linguistics 语料库语言学

 赛波 2007-05-26
Corpus Linguistics 语料库语言学
2007-04-27 11:22:07
部分重要概念
Text Corpus
 In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (now usually electronically stored and processed). They are used to do statistical analysis, checking occurrences or validating linguistic rules on a specific universe.
Brown Corpus The Brown Corpus of Standard American English (or just Brown Corpus) was compiled by Henry Kucera and W. Nelson Francis at Brown University, Providence, RI as a general corpus (text collection) in the field of corpus linguistics.
Bank of English The Bank of English is the name of the COBUILD corpus, a collection of English texts. These are mainly British, but American and Australian data are also included.

Part-of-Speech Tagging
Part-of-speech tagging (POS tagging or POST), also called grammatical tagging, is the process of marking up the words in a text as corresponding to a particular part of speech, based on both its definition, as well as its context, i.e. relationship with adjacent and related words in a phrase, sentence, or paragraph.

重要参考文献
何安平,2004,《语料库语言学与英语教学》,北京:外语教学与研究出版社。
杨惠中(编),2002,《语料库语言学导论》,上海:上海外与教育出版社。Gavioli, L. (2005). Exploring corpora for ESP learning. Amsterdam: John Benjamins.
华南师范大学外国语言文化学院编委会(编),2005,《语料库语言学的研究与应用》,长春:东北师范大学出版社。
Kennedy, G. (2000). An introduction to corpus linguistics [语料库语言学入门], 北京:外语教学与研究出版社。
Deignan, A. (2005). Metaphor and corpus linguistics. Amsterdam: John Benjamins.
Dash, N. S. (2005). Corpus linguistics and language technology: With reference to Indian language. New Delhi: Mittal Publications.
Connor, U. & Upton, T. A. (2004). (Eds.) Applied corpus linguistics: A multidimensional perspective. New York: Rodopi.
Halliday, M.A.K. et al. (2004). Lexicography and corpus linguistics: An introduction. New York: Continuum.

领域前沿
Mark Davies, Brigham Young University
 http://davies-linguistics./personal/
Susan Hunston, University of Birmingham
 http://www.english./who/hunston.htm
Gary Kennedy, Ohio State University
 http://www.math./~kennedy/
Wolfgang Teubert, University of Birmingham
 http://www.english./who/teubert.htm
Corpus Linguistics 2007, the fourth Corpus Linguistics conference, the University of Birmingham
 http://www.corpus./conference2007/
International Journal of Corpus Linguistics
 http://www./cgi-bin/t_seriesview.cgi?series=IJCL
The Inter-Varietal Applied Corpus Studies (IVACS)
 http://www.mic./ivacs/about.htm
British National Corpus
 http://www.comp./computing/research/ucrel/bnc.html

    本站是提供个人知识管理的网络存储空间,所有内容均由用户发布,不代表本站观点。请注意甄别内容中的联系方式、诱导购买等信息,谨防诈骗。如发现有害或侵权内容,请点击一键举报。
    转藏 分享 献花(0

    0条评论

    发表

    请遵守用户 评论公约

    类似文章 更多