I recall hearing a statistic that, in a typical block of English text (e.g. a novel) a really suprisingly large proportion (a third? half?) of the distinct words that appear, appear only once. That is, if you counted each occurrence of each distinct word in the text, then you'd find a huge proportion of the distinct words appear only once each.
I think I heard this on a radio show but I can't find the source. Can anyone confirm where I might have heard this, and/or indicate whether there's any truth to it?
Thanks.