Project News
Publications
Events
Other News
Interviews
Multimedia
Resources
Genealogies of Knowledge
  • Home
  • About
  • GoK Corpus
  • Credits
  • Software
  • People
  • Events
  • TEC
  • Links
  • Research Network

Corpus text preparation

Overview Corpus design Corpus contents Corpus text preparation Research avenues User manual

 

Documentation on the process of preparing texts for uploading to the corpus can be downloaded by clicking on the links below:

  • Instructions on preparing texts for the ancient Greek, Latin and medieval Arabic subcorpora
  • Instructions on preparing texts for the Modern English subcorpus
  • Instructions on preparing texts for the Internet English subcorpus
  • Instructions on preparing texts for the Modern Arabic subcorpus
  • Instructions on using Regular Expressions 

To view the latest version of the .dtd files used to annotate the corpus texts, please click on the following links:

  • goktext.dtd (19 June 2017)
  • gokheader.dtd (29 Nov. 2018)
GoK Tool
User Manual
Explore the Contents of the Genealogies of Knowledge Modern English Corpus

Recent Posts

  • From text to data: Mediality in corpus-based translation studies

    From text to data: Mediality in corpus-based translation studies

    04/06/2021
  • Epistemologies of evidence-based medicine: A plea for corpus-based conceptual research in the medical humanities

    Epistemologies of evidence-based medicine: A plea for corpus-based conceptual research in the medical humanities

    04/06/2021
  • Phobia: A corpus study of political diagnostics

    Phobia: A corpus study of political diagnostics

    25/09/2020

Categories

  • Articles
  • Books
  • Events
  • Interviews
  • Multimedia
  • Other News
  • Project News
  • Publications
  • Resources

Archives

  • 2022
  • 2021
  • 2020
  • 2019
  • 2018
  • 2017
  • 2016

© 2023 Genealogies of Knowledge