The other day I was working on a new application which needed to process large batches of words – as comprehensively as possible. After some quick searches I found that there are (unsurprisingly) a number of freely available dictionary/wordlist files available on the Internet.
The first repository that I tried was that of one hosted on Sourceforge, simply called ‘Wordlist‘. Many of the lists hosted on that page are spell-checker centric, but the 12 Dicts package, in particular, was rather comprehensive. It originally contained 12 dictionaries, which has since been pruned down. Within the package there are a number of different dictionaries, some contain old English words, some have hyphenated words, some have acronyms, etc. You need to use the grid, that they provide, to determine which package is best suited for you. After doing some work with this list, however, I determined that it simply wasn’t comprehensive enough for me (at 74,000 words).
After some more digging I came across the public domain list called ENABLE, which is overwelmingly comprehensive. This particular list is used in just about every word game on the planet – containing approximately 173,000 words! This particular list is very clear-cut and has no limitations imposed as to the words contained within it. If you need a word list for any of your upcoming projects, I highly recommend it!
Bob Aman (August 4, 2005 at 1:39 pm)
Nifty, thanks. Actually, I’ve also been looking for a good word list, so this is really quite useful!
Metrophanes (September 5, 2007 at 3:58 am)
interesting
Spyros (September 7, 2007 at 12:58 pm)
Nice!