Corpus linguistics is not just for linguists: Considering the potential of computer-based corpus methods for library and information science research

Keywords: Research methods, Computer applications, Techniques, Software tools, Methods, Languages
Abstract:

Purpose: The purpose of this paper is to generate awareness of and interest in the techniques used in computer-based corpus linguistics, focusing on their methodological implications for research in library and information science (LIS).

Design/methodology/approach: This methodology paper provides an overview of computer-based corpus linguistics, describes the main techniques used in this field, assesses its strengths and weaknesses, and presents examples to illustrate the value of corpus linguistics to LIS research.

Findings: Overall, corpus-based techniques are simple yet powerful, and they support both quantitative and qualitative analyses. While corpus methods alone may not be sufficient for research in LIS, they can be used to complement and to help triangulate the findings of other methods. Corpus linguistics techniques also have the potential to be exploited more fully in LIS research that involves a higher degree of automation (e.g. recommender systems, knowledge discovery systems, and text mining).

Practical implications: Numerous LIS researchers have drawn attention to the lack of diversity in research methods used in this field, and suggested that approaches permitting mixed methods research are needed. If LIS researchers learn about the potential of computer-based corpus methods, they can diversify their approaches.

Originality/value: Over the past quarter century, corpus linguistics has established itself as one of the main methods used in the field of linguistics, but its potential has not yet been realized by researchers in LIS. Corpus linguistics tools are readily available and relatively straightforward to apply. By raising awareness about corpus linguistics, the author hopes to make these techniques available as additional tools in the LIS researcher's methodological toolbox, thus broadening the range of methods applied in this field.

This study emphasizes the application potential of corpus linguistics in LIS research. Corpus techniques can be applied to retrieval and indexing, through automated text analysis and information retrieval, and to collection classification, where they support the automatic classification of texts. The article mainly explores computer-based corpus techniques, especially their methodological significance for LIS research. Corpus linguistics works with large, structured collections of language material, such as texts or voice recordings, and draws on the techniques and methods of natural language processing (NLP).
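To give a concrete sense of what a corpus technique involves, the following is a minimal Python sketch of two operations commonly associated with corpus analysis: a word frequency list and a keyword-in-context (KWIC) concordance. It is an illustration only, not the procedure used in the paper. The sample sentences, the function names (tokenize, frequency_list, concordance) and the simple tokenization rule are all assumptions introduced here.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens.

    Assumption: a deliberately simple rule (letters, optional internal
    hyphen or apostrophe); real corpus tools use richer tokenizers.
    """
    return re.findall(r"[a-z]+(?:[-'][a-z]+)*", text.lower())


def frequency_list(tokens):
    """Return (word, count) pairs, most frequent first."""
    return Counter(tokens).most_common()


def concordance(tokens, node, width=3):
    """Return (left context, node word, right context) for each hit.

    Context windows are counted in tokens and may cross sentence
    boundaries, because the text is treated as one token stream.
    """
    hits = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits


# Hypothetical three-sentence corpus, invented for this illustration.
corpus = (
    "The library catalogue supports subject indexing. "
    "Automatic indexing of the library collection saves time. "
    "Users search the catalogue by subject."
)
tokens = tokenize(corpus)

# Quantitative view: which words occur most often?
for word, count in frequency_list(tokens)[:5]:
    print(word, count)

# Qualitative view: how is "indexing" used in context?
for left, node, right in concordance(tokens, "indexing", width=2):
    print(f"{left:>20} [{node}] {right}")
```

The frequency list supports the quantitative side of an analysis, while the concordance lines let a researcher read each occurrence in context, which is the qualitative side. A real study would use a much larger corpus and, most likely, an established corpus tool rather than hand-written functions.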