Triangulating Methodological Approaches in Corpus Linguistic Research
eBook - ePub

Triangulating Methodological Approaches in Corpus Linguistic Research

  1. 216 pages
  2. English
  3. ePUB (mobile friendly)
  4. Available on iOS & Android
eBook - ePub

Triangulating Methodological Approaches in Corpus Linguistic Research

About this book

Contemporary corpus linguists use a wide variety of methods to study discourse patterns. This volume provides a systematic comparison of various methodological approaches in corpus linguistics through a series of parallel empirical studies that use a single corpus dataset to answer the same overarching research question. Ten contributing experts each use a different method to address the same broadly framed research question: In what ways does language use in online Q+A forum responses differ across four world English varieties (India, Philippines, United Kingdom, and United States)? Contributions will be based on analysis of the same 400,000 word corpus from online Q+A forums, and contributors employ methodologies including corpus-based discourse analysis, audience perceptions, Multi-Dimensional analysis, pragmatic analysis, and keyword analysis.

In their introductory and concluding chapters, the volume editors compare and contrast the findings from each method and assess the degree to which 'triangulating' multiple approaches may provide a more nuanced understanding of a research question, with the aim of identifying a set of complementary approaches which could arguably take into account analytical blind spots. Baker and Egbert also consider the importance of issues such as researcher subjectivity, type of annotation, the limitations and affordances of different corpus tools, the relative strengths of qualitative and quantitative approaches, and the value of considering data or information beyond the corpus. Rather than attempting to find the 'best' approach, the focus of the volume is on how different corpus linguistic methodologies may complement one another, and raises suggestions for further methodological studies which use triangulation to enrich corpus-related research.

Trusted by 375,005 students

Access to over 1.5 million titles for a fair monthly price.

Study more efficiently using our study tools.

Information

Publisher
Routledge
Year
2016
eBook ISBN
9781317532071

1 Introduction

Paul Baker and Jesse Egbert
DOI: 10.4324/9781315724812-1

Introduction

This chapter serves as an introduction to the aims and structure of the book. It begins with an outline of the aim of the project we undertook, as well as a brief discussion of the concepts and central tenets of corpus linguistics and triangulation. This is followed by a survey of the small amount of triangulation-related research in corpus linguistics. Next, we outline how the project was conceived, giving a description of the corpus used in the project, how analysts were contacted, and how they were asked to carry out an analysis. The corpus-driven versus corpus-based distinction is used as a starting point for explaining how the analyses were grouped and ordered in the book. The chapter ends with a short overview of the ten analysis chapters.

All Roads Lead to Rome?

Around 20 bc, Emperor Caesar Augustus erected the Golden Milestone, a monument in the central forum of Ancient Rome. Such was the power of the Empire that all roads were considered to begin from it and distances were measured from that point, resulting in the still-used phrase (or variations upon it) ‘All roads lead to Rome’. Today the phrase is not used literally, but more colloquially it means ‘it doesn’t matter how you do it, you’ll get the same result’.
In this book, we set out to test the applicability of the phrase to corpus linguistics, a method (or collection of principles and procedures) which uses large collections of naturally occurring language texts (written, spoken, or computer mediated) that are sampled and balanced in order to represent a particular variety (e.g. nineteenth century fiction, twentieth century news articles, twenty-first century tweets). In dealing with real, often unpredictable and linguistically ‘messy’ texts, corpus linguistics differs from introspective methods of analysis which can rely on neater but somewhat artificial-looking cases of language such as ‘the duke gave my aunt this teapot’ (Halliday & Matthiessen 2004: 53).1 It also differs from more traditionally human-based qualitative research in that extremely large numbers of texts are analysed with the help of computers which process data and carry out statistical tests in order to identify unexpectedly frequent or salient language patterns. However, it would be unfair to cast corpus linguistics as a merely quantitative form of analysis—the patterns need to be interpreted and explained by human researchers, and this involves close reading of the texts in a corpus, often abetted again by corpus tools which can present the texts or sections of them in ways that make it easier for human eyes to process. As Biber (1998: 4) points out, ‘Association patterns represent quantitative relations, measuring the extent to which features and variants are associated with contextual factors. However functional (qualitative) interpretation is also an essential step in any corpus-based analysis’.
Corpus linguists are thus able to make fairly confident generalisations about the varieties of language they are examining based on the combination of automated and human elements to the analysis. The automated side helps to direct the human researcher to aspects of the corpus that he or she may not have thought interesting to look at (a form of analysis which Tognini-Bonelli (2001) called corpus driven), but it can also help to confirm or refute existing researcher hypotheses (referred to as corpus-based analysis). Partington, Duguid, and Taylor (2013: 9) refer to the serendipity of corpus research as
the chancing upon hitherto unforeseen phenomena or connections … Evidence-driven research is highly likely to take the researcher into uncharted waters because the observations arising from the data will inevitably dictate to a considerable degree which next steps are taken.
Does an uncharted corpus contain a set of discoverable ‘findings’, possibly a finite number, or at least a smallish subset which most people would concur are particularly notable or even unexpected (the opposite of so-called ‘so what’ findings (Baker & McEnery 2015: 9)). And if so, is it the case that, assuming the analyst has a reasonably high degree of experience and expertise, the procedures of corpus linguistics will direct everybody to the same set of salient findings, the same serendipities? In other words, if we vary the procedure and analyst, but keep the research question and the corpus the same, are we likely to obtain similar outcomes? For corpus linguists, do all roads really lead to Rome?
Why would it matter if they don’t? A key advantage of corpus linguistics over other forms of analysis is that the computational procedures are thought to remove human cognitive, social, or political biases which may skew analysis in certain directions or even lead to faulty conclusions. Unlike humans, computers do not care about what they study, so there is no chance that their findings are misguided by conviction (‘I know it’s true; it must be true!’). Nor do computers make errors due to fatigue or boredom. It is tempting to view corpus linguistics in a similar light to ‘hard’ sciences such as biology or chemistry, where phenomena can be objectively measured. Potassium placed in water will always result in potassium hydroxide and hydrogen gas. There is a sense of reassurance about the replicability of that kind of research—facts are facts. And with its reliance on scientific, empirical notions of sampling, balance and representativeness in corpus construction, along with the certainty that our tools and procedures of analysis will not mislead us, corpus linguists might not be blamed if they experience a robust feeling of confidence about the validity of their findings.
But what if this is not the case? What if say, ten people, all with their own favourite way of doing corpus linguistics, all good at what they do, were given the same corpus and research question and asked to produce an analysis. What if they all found different things? Even worse, what if their findings disagreed? Would that render the method unworkable? Or would it tell us something interesting in itself? These are the issues which inspired this edited collection, and we explore them by carrying out a comparison of ten analyses of the same corpus in order to see the extent to which all roads actually do lead to Rome. Such a meta-analysis hopefully provides a clearer picture around questions of analytical objectivity and also should give insight into what individual techniques can achieve and how they may complement one another. A different way of looking at this collection of chapters, though, is that they also work as analyses in themselves of a corpus of an ‘emerging’ register of language. They tell many interesting things about how people who have learnt different varieties of English use this form of language in ways that reflect aspects of their identity and culture.

Triangulation

Triangulation is a term taken from land surveying which uses distance from and direction to two landmarks in order to elicit bearings on the location of a third point (hence completing one triangle). According to Layder (1993: 128), methodological triangulation facilitates validity checks of hypotheses, anchors findings in more robust interpretations and explanations, and allows the researcher to respond flexibly to unforeseen problems and aspects of the research. Such triangulation can involve using multiple methods, analysts, or datasets, and it has been used for decades by social scientists as a means of explaining behaviour by studying it from two or more perspectives (Webb, Campbell, Schwartz & Sechrest, 1966, Glaser & Strauss 1967, Newby 1977, Layder 1993, Cohen & Manion 2000). Most contemporary corpus linguists employ triangulation to an extent in their own research by, for example, using different techniques on their corpora. However, the potential benefits of triangulating the results of two or more corpus-linguistic methods have been largely unexplored.
This book is not the first study of this kind, although it is the largest study of triangulation in corpus linguistics that we are aware of. Prior to our study, there existed a collection by van Dijk and Petofi (1977) which contained multiple analyses of a short story called “The Lover and His Lass” by James Thurber. Grimshaw (1994) contains a collection of analyses of a transcript of a thesis defence, while another collection by van den Berg, Wetherell, and Houtkoop-Steenstra (2004) involves 12 chapters which each analyse the same transcript of interview data about race. Grimshaw’s book is the only one which attempts to synthesize the analyses in a concluding chapter called ‘What Have We Learnt?’, but all three collections consist of qualitative analyses of relatively short texts and do not involve corpus-based methods.
It is worth describing two related pieces of research in more detail before moving further on, as they function as unintended pilot studies to this one, both involving corpus-based critical discourse analysis of newspapers. First, Marchi and Taylor (2009) separately carried out critical analyses of a newspaper corpus, asking the question ‘how do journalists talk about themselves/each other and their profession in a corpus of British media texts?’ In comparing their results, they noted a range of convergent (broadly similar), complementary (different but not contradictory), and dissonant (contradictory) findings. An example of a dissonant finding was that one analyst concluded that journalists tend to talk about themselves, while the other noted that they do not talk about themselves but instead refer to other newsmakers. Both journalists had convergent findings relating to notions of good and bad journalism, while complementary findings were cases where analysts focused on related but different aspects of the corpus data. For example, one analyst noted a number of metaphors which constructed journalists as beasts, e.g. press pack, feeding frenzy, while the other pointed out that journalism is a highly reflexive activity, talking about and to itself. These two findings function together as pieces of a larger picture. Marchi and Taylor (2009: 18) conclude that ‘the implementation of triangulation within a research study in no way guarantees greater validity, nor can it be used to make claims for “scientific” neutrality’.
Second, one of the authors of this chapter carried out a slightly larger pilot study (Baker 2015), giving five analysts a newspaper corpus about foreign doctors and asking ‘How are foreign doctors represented in this corpus?’ While Marchi and Taylor’s meta-analysis was more qualitative and reflective, this study attempted to impose an element of quantification onto the analysis by counting and comparing findings. Of the 26 separate findings identified across the reports, all five analysts agreed on only one: the finding that foreign doctors were criticized (and thus seen as unwanted) for having poor language skills. A further five findings were s...

Table of contents

  1. Cover Page
  2. Half Title Page
  3. Title Page
  4. Copyright Page
  5. Table of Contents
  6. List of Tables
  7. List of Figures
  8. 1 Introduction
  9. 2 Keywords
  10. 3 Lexical Bundles
  11. 4 Semantic Annotation
  12. 5 Multi-Dimensional Analysis
  13. 6 Collocation Networks
  14. 7 Variationist Analysis
  15. 8 Pragmatics
  16. 9 Gendered Discourses
  17. 10 Qualitative Analysis of Stance
  18. 11 Stylistic Perception
  19. 12 Research Synthesis
  20. Contributors
  21. Index

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription
No, books cannot be downloaded as external files, such as PDFs, for use outside of Perlego. However, you can download books within the Perlego app for offline reading on mobile or tablet. Learn how to download books offline
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.5M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1.5 million books across 990+ topics, we’ve got you covered! Learn about our mission
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more about Read Aloud
Yes! You can use the Perlego app on both iOS and Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app
Yes, you can access Triangulating Methodological Approaches in Corpus Linguistic Research by Paul Baker, Jesse Egbert, Paul Baker,Jesse Egbert in PDF and/or ePUB format, as well as other popular books in Languages & Linguistics & Communication Studies. We have over 1.5 million books available in our catalogue for you to explore.