What is a corpus? [Corpus Linguistics]
A principled collection of texts, either spoken or written, stored on a computer and available for analysis using specialized software. [O’Keefe, McCarthy and Carter]
What is required for a collection of texts to be considered a true corpus? [Corpus Design]
It must represent a specific theme or domain, and its validity depends on how representative it is of that target language.
How is a spoken corpus made representative? [Corpus Design]
By including accurate, varied samples from different geographic areas, educational backgrounds, and social classes.
What is the etymology of the word “corpus”? [Corpus Linguistics]
It originates from the Latin word “carpa” (body), meaning a body of text.
What are the three core analytical functions of corpus software applications? [Corpus Tools]
Generating word lists, concordances, and keyword lists.
What does a word and multi-word frequency list provide? [Corpus Analysis]
Quantitative counts detailing which individual words or phrases occur most frequently within the dataset.
What is a concordance? [Corpus Analysis]
A search result displaying an individual word or phrase centered in the middle of a line, flanked by its immediate left and right co-text.
What is a collocation? [Corpus Analysis]
A statistical tendency of specific words to frequently co-occur or follow one another (e.g., “blonde hair” or “bargain with”).
What is an n-gram? [Corpus Analysis]
Contiguous sequences of words (like two-word or four-word clusters) that function together, often carrying specific discourse functions.
What is a keyword list? [Corpus Analysis]
A comparative metric contrasting a target corpus against a larger reference corpus to identify unusually frequent words, revealing the “aboutness” of the text.
Why is the British National Corpus (BNC) frequently used for keyword comparison? [Corpus Reference]
It is a massive, 100-million-word dataset representing a wide variety of spoken and written English, making it an ideal baseline.
How has corpus linguistics changed modern lexicography? [Lexicography]
Dictionaries are now informed by authentic language data, often prioritizing actual frequency and real-world usage over literal definitions (e.g., the figurative use of “fire”).
How can corpus tools distinguish near-synonyms like “terrible” and “horrible”? [Semantic Analysis]
By using functions like “Word Sketch Difference” to isolate and compare the unique collocational behaviors of the similar words.
What characterizes word frequencies in spoken English compared to written English? [Sociolinguistics]
Spoken English exhibits a higher frequency of discourse markers, personal pronouns, and politeness strategies.
What is Sketch Engine? [Corpus Tools]
A cloud-based application used for corpus analysis, featuring functions like word sketches, concordances, and n-grams.
What are the primary pedagogical applications of corpus research? [Language Teaching]
It informs curriculum design, vocabulary prioritization, dictionary creation, and the evaluation of both learner competence and language errors.
What is the focus of the 90% assignment in the AL7732 module? [Module Assessment]
Writing a 2,000-3,000 word case study report profiling a learner’s strengths and errors using corpus tools to determine if they are accurately placed at a C1 level.
What is required for the 10% forum assignment in the AL7732 module? [Module Assessment]
Reviewing a single lesson using five specific headings and replying to at least one classmate’s review with a minimum of 100 words.
Assessment 1 Requirements [Module Assessment]
Review a lesson from the resource book Teaching English with Corpora using five specific headings (title/page, level, summary, liked, improved) and reply to at least one classmate.
Assessment 1 Word Count Breakdown [Module Assessment]
A 50-word summary, 100 words on what was liked, 100 words on what could be improved, and a 100-word peer reply.
Assessment 1 Deadline [Module Assessment]
Friday, April 24, 2026.
Word Sketch [Sketch Engine]
A tool that generates categorized tables and visualizations of a word’s collocations, grouping them by syntactic relationships like modifiers, verbs, and prepositional phrases.
Semantic Nuance via Modifiers [Corpus Linguistics]
Analyzing the specific adverbs that modify a word (e.g., “surprisingly warm”) to reveal idiomatic, metaphoric, or interpersonal meanings rather than literal definitions.
Word Sketch Difference [Sketch Engine]
A comparative tool used to distinguish near-synonyms (like “hot” vs. “warm”) by contrasting their aggregate frequency counts and visually mapping their shared and exclusive collocations.