Speech

  • A corpus of helpdesk interactions
    • digitised speech
    • transcribed speech

Writing

  • A corpus of Public Media English
    • A Corpus of British Media English
    • A Corpus of Chinese Media English
  • A term-annotated corpus of MEDLINE abstracts
  • A term-annotated corpus of different text categories