Programming for Corpus Linguistics with Python and Dataframes

Daniel Keller

Programming for Corpus Linguistics with Python and Dataframes
Format
Hardback
Publisher
Cambridge University Press
Country
United Kingdom
Published
30 June 2024
Pages
75
ISBN
9781009486781

Programming for Corpus Linguistics with Python and Dataframes

Daniel Keller

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

Order online and we’ll ship when available (30 June 2024)

Our stock data is updated periodically, and availability may change throughout the day for in-demand items. Please call the relevant shop for the most current stock information. Prices are subject to change without notice.

Sign in or become a Readings Member to add this title to a wishlist.