Key sites

Contact details

Faculty of Linguistics, Philology and Phonetics,
41 Wellington Square,
UK - OX1 2JF


Home page at Faculty of Linguistics, Phonetics and Phonetics

Home page at Oxford E-Research Centre (former)

Martin Wynne

Picture of Martin Wynne at Cumnor Hirst

Martin Wynne is a Senior Researcher in Corpus Linguistics at the University of Oxford University of Oxford logo, based in the Faculty of Linguistics, Philology and Phonetics, where he runs an AHRC-funded project to build a repository for literary and linguistic resources as part of a new national data curation service. Martin is the National Coordinator in the UK for CLARIN, which operates a pan-European research infrastructure for research with language resources in the Humanities. The repository which Martin manages is also responsible for the distribution of the British National Corpus (BNC).

Martin has previously been based in the Bodleian Libraries, where he was responsible for the Oxford Text Archive and worked as part of the Electronic Enlightenment team. The ongoing curation and collections development of the OTA has now moved to the new repository. Martin and the OTA were, until 2016, based in IT Services (formerly OUCS) at the University of Oxford, and he also had a role in the Oxford e-Research Centre, where he founded the Digital Humanities at Oxford initiative, and has previously worked with TORCH.

Martin's current research and teaching focus on corpus linguistics, and developing infrastructure to support the use of language corpora and large data collections. Current projects include exploring recurrent multi-word units in Early English books, and exploring oral testimonies of the Holocaust as research data, in a collaboration with the European Holocaust Research Infrastructure (EHRI). He teaches Text Analysis on the MSc Digital Scholarship course, Research Methods for Linguistics masters students, and contributes to the Digital Humanities at Oxford Summer School. He has previously taught courses on research methods with large text collections, including a module on 'Exploring English Usage in Language Corpora' for a Masters course in English Language, as well as sessions for Education and Linguistics, Philology and Phonetics. He has also worked on various projects including the visualization of the sounds and syntax of poetry, hosting language resources on the Grid and in the Cloud, federated access and identity management, and the application of linguistic techniques to tracing the history of ideas.

Martin has worked on a number of infrastructure initiatives, including the Arts and Humanities Data Service, Project Bamboo, the Coalition of Humanities and Arts Infrastructures and Networks (CHAIN), CLARIN, DARIAH, Digital Humanities at Oxford, and the Humanities Computing Unit. Martin was for many years webmaster for the Poetics and Linguistics Association, and has worked on developing the web presence for TORCH, CLARIN, CLARIN-UK, and a number of research projects at the University of Oxford.

Martin has worked at the Universities of Birmingham and Łódź, the Institut für Deutsche Sprache, Goldsmiths College, London, and Lancaster Universit. He studied Linguistics at Leeds University and French at Queen Mary College, London. The numerous research projects that he has worked on include topics such as wordclass tagging in a number of languages; parallel text alignment; the BNC tag enhancement project; the Lancaster Speech, Thought and Writing Presentation project; the PELCRA Polish corpus building project; TELRI, building a resource infrastructure in central and eastern Europe, and the associated TRACTOR archive of language resources and tools; and building the BNC-Baby corpus. He has taught general linguistics, and a number of specialized topics, including corpus linguistics, second language acquisition theory, translation theory, literary stylistics, and semantics. Martin has also worked as a consultant for Collins Dictionaries and OUP Dictionaries.

In his spare time Martin used to play football, but has now retired, and sometimes dabbles in skiing and kayaking; he appreciates medieval and renaissance art and architecture, likes to travel, takes an interest in European history and culture, tries to speak foreign languages, and sometimes just sits around reading books, drinking strong Belgian beer, and eating raw fish. This year he is mostly going out again.

Please note change of email address in 2020 - the old ones and have expired.

a cada cerdo le llega su San Martín


Presentations and Events

List of research presentations

List of events organized


Blogs, etc.

Blog: Oxford Text Archive (Digital texts and research in the Humanities), blog at OpenEdition

CLARIN website (blog posts and more)


Digital Research in the Humanities Blog [deleted without notice - the link has been left here a testament to the impermanence of online resources]

Blog at [deleted without notice - the link has been left here a testament to the impermanence of online resources].

Some online publications

Developing Linguistic Corpora: a guide to good practice

Searching and Concordancing

An Investigation into Free e-Books

A Course in the Unix Operating System

Last Updated: January 2024.