INTRODUCTION TO CORPUS LINGUISTICS AND ITS HISTORY
Ibrohimova Mohichehra
Keywords: Key words: Methods, corpus, linguistics, language, data, patterns, collocation, analysis
Abstract
Abstract
Corpus linguistics is a field within linguistics that utilizes extensive sets of
authentic language data, referred to as corpora, to investigate language patterns.
These corpora can vary in size, ranging from niche collections to colossal databases
housing billions of words. The roots of corpus linguistics can be identified in the
early 1900s when academics started utilizing text collections for language analysis.
Nonetheless, it was with the introduction of computers in the 1960s and 1970s that
corpus linguistics evolved into a defined area of academic research.
Corpus linguistics is a method that uses computers to study language by
analyzing large collections of natural spoken and written texts known as corpora.
Research using this approach has demonstrated that relying solely on speakers'
intuitions may not always fully capture the complexities of language, especially
when exploring less common linguistic patterns like word combinations, grammar
variations, meanings, idioms, and metaphors.
References
References
G. Kennedy, in International Encyclopedia of the Social & Behavioral
Sciences, 2001
Francis, W. Nelson; Kučera, Henry (1 June 1967). Computational Analysis
of Present-Day American English. Providence: Brown University Press.
Anke Lüdeling, Kytö Merja Mouton de Gruyter, Corpus Linguistics 2008
Carlos Assunção, Carla Sofia Araújo Linha D'Água 32 (1), 39-57, 2019 S. Hunston, in Encyclopedia of Language & Linguistics (Second Edition),
https://en.m.wikipedia.org/wiki/Corpus_language
www.studysmarter.co.uk
https://www.ucl.ac.uk/english-usage/resources/ftfs/method.