
greg said:
one value of this is that it does a good job of identifing children's eBooks (they tend to be "easy").
checklist said:
bigword density short word density (-) wordsPerSentences syllablesPerWords profainwordsPerWords numbersPerWords mostCommon1000WordsPerWord (-) commascharsPerWords wordsPerParagraphs letterFrequencyDistributionError adjacentLetterPairsFrequencyDistributionError uniqueStemmedWordsPerWord;
aren't scientists silly? :+) look, greg, if you want a list of children's e-books, or a list of "easy" e-books, or any kind of list of books, just ask the distributed proofreaders people for the list... -- If I ask them to classify 20000 books for me, will I get a reply any time this century? :-) they'll give you a long list of books, any kind of list you want, and you won't have to do one little bit of fancy-ass statistics... i'm serious, they can give a list with p.g. e-text numbers and meaningful notes, and funny little stories, and _everything_... much more vivid than your boring-ass statistics... :+) -- and anecdotes about 'hey I remember that really weird typo on page 263' and 'I picked that up at Fred's garage sale on Smith street' are very exciting? ;-) ------------------------------------------------------------ This email was sent from Netspace Webmail: http://www.netspace.net.au