DATA COLLECTIONS

The data on this page contain links to corpora and to data collections that can be transformed into corpora. 

There are also links to other forensic linguistic and legal language repositories.

Sources for data collections are provided in the spreadsheet below (it is best to open the spreadsheet in a new browser tab). 

How to use the spreadsheet:

Data collections are organized by title, source, and "Data Type" (Public Corpora, Public Data Collections, Private Collections, or Data Repositories). Descriptions of data collections are also provided. Links to the data sources are in the right-hand columns. As with all external links, use caution when visiting new websites. 

You may search the entire spreadsheet by pressing Ctrl+F or Command+F and searching for a particular word/phrase. 

ForensicLing.com - Data Collections