
Sorry if this is a FAQ, or a variation on a FAQ. Are there any Project Gutenberg databases that show the *original* publication dates (e.g. 1875, 1916) for all or most of the texts? I've created a database (current as of a few months ago) that has info on each book -- author, title, LC classification, etc --- but nowhere in the metadata for the texts could I find the original publication date. Unless I can find such a database, I'm going to get a research assistant to find this info for all ([original] English language) texts in the collection, or else write a script to automate the process. FWIW, I'm planning on using the Gutenberg texts as part of a 100 million word corpus of texts from English (British and US) from the 1800s-1900s, similar to what I've done for the 100 million word British National Corpus (http://view.byu.edu) and the 100 million word Corpus del Espanol (www.corpusdelespanol.org). Thanks in advance for any info you might have. Mark Davies ================================================= Mark Davies Assoc. Prof., Linguistics Brigham Young University (phone) 801-422-9168 / (fax) 801-422-0906 http://davies-linguistics.byu.edu ** Corpus design and use // Linguistic databases ** ** Historical linguistics // Language variation ** ** English, Spanish, and Portuguese ** =================================================