
On Mon, 7 Aug 2006 16:50:31 EDT, Bowerbird@aol.com wrote: |google is releasing some very cool data into the wild, |based on a corpus of a trillion words from web-pages. | |one trillion. | |if you know anything about previous projects in this vein, |you'll know that a corpus this big is totally unprecedented. | |as google points out -- |> |http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.... ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ <<< All Our N-gram are Belong to You >>> ROTFLMAO If this is the quality their data, I doubt it will be much use. :-))) -- Dave Fawthrop <dave hyphenologist co uk> "Intelligent Design?" my knees say *not*. "Intelligent Design?" my back says *not*. More like "Incompetent design". Sig (C) Copyright Public Domain