Word Lists and Dictionaries

To browse these files you need the CPT Dictionary program.
Files marked 'ready for CPT Crosswords' contain words in crossword form (lower case, with all non-letter characters ignored). To generate crosswords from such a word list, extract the file into the 'words' directory and select it as Base Words.
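
As a rough illustration of what 'crossword form' means, the following Python sketch lower-cases each entry and drops every character that is not a letter. It is only an assumption about the normalization, not the code used to build these files, and the function names are made up for this example. Note that Python's isalpha() also rejects combining marks, so scripts that rely on them (Hindi, for instance) would need different handling.

```python
def to_crossword_form(entry: str) -> str:
    """Lower-case the entry and keep only its letters (hypothetical helper)."""
    return "".join(ch for ch in entry.lower() if ch.isalpha())


def build_word_list(entries):
    """Normalize every entry, drop empty results and duplicates, and sort."""
    return sorted({w for w in map(to_crossword_form, entries) if w})


if __name__ == "__main__":
    # Titles such as "New York" and "new-york" collapse into one word.
    print(build_word_list(["New York", "new-york", "1984", "Zebra"]))
```

Entries that consist only of digits or punctuation become empty strings and are discarded, which is one reason a list in crossword form can be shorter than its source.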

Arabic:
ar_wiki.zip - 648 KB, 198054 words, Cp1256, extracted from Wikipedia archive arwiki-20100401-all-titles-in-ns0, ready for CPT Crosswords.

Armenian:
hy_70K.zip - 153 KB, 70268 words, Unicode, extracted from hy_AM-0.20.0.oxt and from Wikipedia archive hywiki-20100405-all-titles-in-ns0, ready for CPT Crosswords.

Artificial:
art.zip - 3 KB. Three files with 5, 10, and 20 million words, generated using the English alphabet.
To open the 20M.wlz file in 'Browse' style, you will need 512 MB of virtual memory and CPT Dictionary version 1.1.5 or later.

Bosnian:
bosn.zip - 46 KB, 19371 words, all lower case Latin.

Bulgarian:
bg2.zip - 247 KB, 156235 words, upper and lower case Cyrillic. Set 'Sort' for natural sorting.
bg_wiki.zip - 380 KB, 99904 words, extracted from Wikipedia archive bgwiki-20100406-all-titles-in-ns0, ready for CPT Crosswords.

Chinese:
zh.zip - 142 KB, 44364 words, Unicode.
zh_357K.zip - 841 KB, 357024 words, Unicode, ready for CPT Crosswords.

Croatian:
hr_wiki.zip - 309 KB, 74636 words, extracted from Wikipedia archive hrwiki-20100402-all-titles-in-ns0, ready for CPT Crosswords.

English:
en_Moby_xw.zip - 155 KB, 117969 words, all lower case Latin. This is the Moby collection of crossword words.
smile11.zip - 50 KB, dictionary of 2164 emoticons.
en_web.zip - 664 KB, 309345 words, ready for CPT Crosswords.
en_701K.zip - 1.4 MB, 701379 words, ready for CPT Crosswords.

French:
fr_wiki.zip - 4531 KB, 1182337 words, extracted from Wikipedia archive frwiki-20100401-all-titles-in-ns0, ready for CPT Crosswords.
fr_wiki_noacc.zip - 4 MB, 1060150 words, the same word list but all accent marks are removed, ready for CPT Crosswords.
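
A list without accent marks could be produced along the following lines. This Python sketch is an assumption, not the procedure actually used for the 'noacc' files: it decomposes each word with Unicode NFD normalization and drops the combining marks. The function names are hypothetical. Ligatures such as 'œ' have no decomposition and pass through unchanged.

```python
import unicodedata


def strip_accents(word: str) -> str:
    """Remove accent marks by decomposing the word and dropping combining marks."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def strip_accents_from_list(words):
    """Strip accents from every word, then drop duplicates and sort."""
    return sorted({strip_accents(w) for w in words})


if __name__ == "__main__":
    # "cote", "côte", and "côté" all become the single word "cote".
    print(strip_accents_from_list(["cote", "côte", "côté"]))
```

Because distinct accented words can collapse into the same unaccented word, a list processed this way would contain fewer words than its source, which may be why the 'noacc' lists are smaller.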

German:
de_wiki.zip - 5193 KB, 1388433 words, extracted from Wikipedia archive dewiki-20100326-all-titles-in-ns0, ready for CPT Crosswords.
de_wiki_noacc.zip - 5 MB, 1381473 words, the same word list but with the umlauts replaced by 'ae', 'oe', and 'ue', and the eszett (ß) by 'ss', ready for CPT Crosswords.
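
The German replacement rule can be sketched in Python with a translation table. This is a minimal example assuming the input is first lower-cased, as in crossword form. The table and function names are hypothetical, and the sketch is not the tool used to build de_wiki_noacc.zip.

```python
# Map each German special letter to its ASCII replacement.
GERMAN_MAP = str.maketrans({"ä": "ae", "ö": "oe", "ü": "ue", "ß": "ss"})


def transliterate_german(word: str) -> str:
    """Lower-case the word, then replace umlauts and the eszett."""
    return word.lower().translate(GERMAN_MAP)


if __name__ == "__main__":
    for sample in ["Müller", "Straße", "Österreich"]:
        print(sample, "->", transliterate_german(sample))
```

Unlike plain accent removal, this rule changes word length ('ü' becomes two letters), which matters when the words have to fit a crossword grid.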

Greek:
el_spell_wiki.zip - 595 KB, 618292 words, extracted from el_gr_v110.oxt and from Wikipedia archive elwiki-20100414-all-titles-in-ns0, ready for CPT Crosswords.

Hebrew:
he_wiki.zip - 508 KB, 159331 words, Cp1255, extracted from Wikipedia archive hewiki-20100330-all-titles-in-ns0, ready for CPT Crosswords.

Hindi:
hi_190K_NS.zip - 609 KB, 192790 words, Custom Hindi Unicode Normalization - Single Cells, ready for CPT Crosswords (v 1.2 and above).
hi_190K_NF.zip - 632 KB, 192753 words, Custom Hindi Unicode Normalization - Full Syllables, ready for CPT Crosswords (v 1.2 and above).

Hungarian:
hu_434K.zip - 1 MB, 434440 words, ready for CPT Crosswords.

Italian:
it_wiki.zip - 2771 KB, 711543 words, extracted from Wikipedia archive itwiki-20100408-all-titles-in-ns0, ready for CPT Crosswords.
it_wiki_noacc.zip - 2.66 MB, 705048 words, the same word list but all accent marks are removed, ready for CPT Crosswords.

Macedonian:
mk.zip - 13 KB, 5620 words, all lower case Cyrillic.

Persian:
fa.zip - 16 KB, 7915 words, Unicode, not proofread. Set 'Right Alignment' and optionally 'Shaping'.

Portuguese:
pt_wiki.zip - 1938 KB, 498325 words, extracted from Wikipedia archive ptwiki-20100404-all-titles-in-ns0, ready for CPT Crosswords.

Romanian:
ro.zip - 92 KB, 49880 words, upper and lower case Latin 2. Set 'Sort' for natural sorting.
ro_wiki_uni.zip - 509 KB, 142402 words, Unicode, extracted from Wikipedia archive rowiki-20100405-all-titles-in-ns0, ready for CPT Crosswords.
ro_wiki_noacc.zip - 482 KB, 138613 words, the same word list but all diacritical marks are removed, ready for CPT Crosswords.

Russian:
ru.zip - 58 KB, 31800 words, all lower case Cyrillic.
ru_101K.zip - 306 KB, 101035 words, mainly nouns and names, ready for CPT Crosswords.
ru_wiki.zip - 2646 KB, 639145 words, extracted from Wikipedia archive ruwiki-20100331-all-titles-in-ns0, ready for CPT Crosswords.

Spanish:
es_wiki.zip - 3573 KB, 984257 words, extracted from Wikipedia archive eswiki-20100331-all-titles-in-ns0, ready for CPT Crosswords.
es_wiki_noacc.zip - 3.05 MB, 800258 words, the same word list but all accent marks are removed, ready for CPT Crosswords.

Thai:
thai.zip - 72 KB, 24331 words, Cp874, sorted binary.

Turkish:
tr.zip - 104 KB, 50290 words, all lower case letters.
tr_wiki.zip - 693 KB, 171545 words, Cp1254, extracted from Wikipedia archive trwiki-20100406-all-titles-in-ns0, ready for CPT Crosswords.

Ukrainian:
uk2.zip - 328 KB, 134524 words, upper and lower case Cyrillic. Set 'Sort' for natural sorting.

Vietnamese:
vi_wiki.zip - 685 KB, 142981 words, custom encoding VN1 (User 8-bit converter), extracted from Wikipedia archive viwiki-20100403-all-titles-in-ns0, ready for CPT Crosswords.

Yiddish:
yi.zip - 27 KB, 11723 words, Unicode. Set 'Right Alignment'.

