The world at large would greatly benefit from Wiktionary publishing lists of words and definitions on a regular basis, much as Wikimedia publishes raw dump files of all its wikis.
I envisage several levels. The rougher ones will be trivial to implement; the better ones will take a little more work. For each level, English is obviously wanted, with all other languages also desired.
1. Raw list of words (page titles).
2. List of words with all "common misspellings" removed.
3. As per 2, but with all inflected forms removed (alternative spellings should stay).
4. List of words per 3 (or possibly 2) with all definitions, but lacking information on homonyms, example sentences, quotations, etc.
5. As per 3, but with senses clearly separated from homonyms.
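To illustrate how the first three levels relate to each other, here is a minimal Python sketch. It assumes each entry has already been tagged as a misspelling or an inflected form; the `Entry` class, its two flags, and the sample words are all hypothetical, and how those flags would actually be derived from Wiktionary pages is left open.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """Hypothetical record for one page title; the flags are assumptions."""
    title: str
    is_misspelling: bool = False
    is_inflected_form: bool = False


def word_list(entries, level):
    """Return the titles that belong in the list for level 1, 2 or 3."""
    if level not in (1, 2, 3):
        raise ValueError("this sketch only covers levels 1 to 3")
    titles = []
    for entry in entries:
        # Level 2 and above: drop common misspellings.
        if level >= 2 and entry.is_misspelling:
            continue
        # Level 3: also drop inflected forms (alternative spellings stay).
        if level >= 3 and entry.is_inflected_form:
            continue
        titles.append(entry.title)
    return titles


# Illustrative sample data only, not taken from any real dump.
SAMPLE_ENTRIES = [
    Entry("definitely"),
    Entry("definately", is_misspelling=True),
    Entry("colour"),
    Entry("color"),  # alternative spelling, kept at every level
    Entry("colours", is_inflected_form=True),
]

level_1 = word_list(SAMPLE_ENTRIES, 1)
level_2 = word_list(SAMPLE_ENTRIES, 2)
level_3 = word_list(SAMPLE_ENTRIES, 3)
```

Each level is a strict filter of the one before it, so the rougher lists could be produced first and the finer ones derived from them.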
Note that levels 4 and 5 will require some structure. A very basic XML format seems the obvious choice.
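As a rough idea of what such a basic XML format might look like, the following Python sketch builds one with the standard library's `xml.etree.ElementTree`. The element and attribute names (`wordlist`, `entry`, `homonym`, `sense`, `lang`, `word`) and the sample definitions are assumptions made up for illustration, not an existing Wiktionary format.

```python
import xml.etree.ElementTree as ET


def build_wordlist_xml(entries, language="en"):
    """Serialise {word: [[definition, ...], ...]} to a simple XML string.

    Each inner list holds the senses of one homonym. All element names
    here are hypothetical.
    """
    root = ET.Element("wordlist", {"lang": language})
    for word, homonyms in entries.items():
        entry_el = ET.SubElement(root, "entry", {"word": word})
        for senses in homonyms:
            # One <homonym> per distinct word sharing this spelling.
            homonym_el = ET.SubElement(entry_el, "homonym")
            for definition in senses:
                ET.SubElement(homonym_el, "sense").text = definition
    return ET.tostring(root, encoding="unicode")


# Illustrative sample data only.
SAMPLE_WORDS = {
    "bank": [
        ["a financial institution"],
        ["the sloping land beside a river"],
    ],
}

xml_text = build_wordlist_xml(SAMPLE_WORDS)
```

For the simpler level 4 list, the same structure could be flattened by putting every `sense` directly under `entry` and omitting the `homonym` wrapper.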
Version: unspecified
Severity: enhancement