PlainSpell Rankings

Languages by Dictionary Size

Languages in the PlainSpell database ranked by total number of word entries.

5 ranked entries · #1: 4,485,239 words

What This Ranking Tells Us

PlainSpell covers five major world languages, each with a different volume of dictionary entries from Wiktionary. Dictionary size reflects both the natural vocabulary size of the language and the completeness of Wiktionary volunteer coverage. French and German have the most entries partly because Wiktionary editors for those languages have been exceptionally active. Word counts include all word forms, so languages with more inflection (like French and German) naturally have more entries.

The ranking shown on this page is computed once per ETL refresh from PlainSpell's underlying dictionary tables, then cached in the rankings table for fast retrieval. Each row is a real dictionary record from open-source linguistic sources: Wiktionary lemma entries via kaikki.org, Hunspell affix and dictionary packs, and published word-frequency corpora. There is no scraping, no synthesised data, and no editorial reordering: every ranked entry exists in the source dictionary and the value column is a measurable property of that entry, not an opinion about it. The same data powers PlainSpell's per-word pages, so any item in the table can be inspected in detail by following its link to see the IPA pronunciation, etymology, part-of-speech tags, and recorded variants. Positions are stable between data refreshes so that returning visitors can confirm that a previously-cited rank has not silently shifted because of a UI change.

Reading this list is most useful with two things in mind. First, the value column is measured in concrete units (letters for length rankings, variants for misspelling rankings, group size for homophone rankings, raw entry count for language-size rankings), not in arbitrary scores. When two rows tie, the tie is real: the underlying dictionary assigns them identical measurements. Second, the ranking is a discovery surface, not a scoreboard. A high rank on the most-misspelled list does not mean a word is harder than a word at a lower rank by some absolute measure of difficulty; it means the word has accumulated more observed misspelling variants in available corpora, which can reflect exposure (the word appears often enough for variants to be recorded) as much as intrinsic complexity. The accompanying narrative above frames each ranking with the specific interpretation suited to its underlying field.

Methodology for every ranking on PlainSpell is documented on the methodology page. In short: PlainSpell ingests the latest open Wiktionary dumps, runs Hunspell and IPA-based pre-processing, joins against published frequency lists, and writes the result into rankings rows. No row is created without a backing dictionary record, and no value is rounded, capped, or re-weighted. When upstream Wiktionary revisions ship, the ETL recomputes from scratch, which means an entry can move up or down between quarterly refreshes if its underlying record was edited by Wiktionary contributors. Audit notes for each refresh are stored alongside the data so any change in position has a traceable cause.
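The recompute-from-scratch step described above can be sketched roughly as follows. This is a minimal illustration, not PlainSpell's actual ETL: the function name `build_ranking_rows`, the `(language, word_form)` record shape, and the row fields are assumptions made for the example, and the sample records are invented. Ordering tied counts by name is also an assumption, used here only to keep the output deterministic.

```python
from collections import Counter


def build_ranking_rows(records, ranking_type="languages_by_size"):
    """Count dictionary records per language and emit ranked rows.

    `records` is an iterable of (language, word_form) pairs. The row
    layout below is hypothetical, chosen only for illustration.
    """
    counts = Counter(language for language, _ in records)
    # Largest count first; ties fall back to name order (an assumption).
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    rows = []
    for position, (language, total) in enumerate(ordered, start=1):
        rows.append(
            {
                "type": ranking_type,
                "rank": position,
                "name": language,
                "value": total,  # raw entry count, not rounded or re-weighted
            }
        )
    return rows


# Invented sample records standing in for the dictionary tables.
sample_records = [
    ("French", "parler"),
    ("French", "parle"),
    ("French", "parlons"),
    ("German", "sprechen"),
    ("German", "spricht"),
    ("English", "speak"),
]

rows = build_ranking_rows(sample_records)
for row in rows:
    print(row["rank"], row["name"], row["value"])
```

Because the rows are rebuilt from the counts on every run, an upstream edit that adds or removes records changes a language's value, and possibly its position, at the next refresh.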

Languages by Dictionary Size, top 5

Source: PlainSpell · Wiktionary corpus. As of May 2026.

Rank  Language    Words
1     French      4,485,239
2     German      1,077,739
3     Spanish       770,428
4     English       545,755
5     Portuguese     39,583

Source: Wiktionary extracts via kaikki.org.

Spelling & Dictionary Insight

The Languages by Dictionary Size ranking is generated from PlainSpell's pre-computed rankings table where type = 'languages_by_size'. The current query returned 5 ranked rows, each carrying a rank position, a display name, a scoreable value measured in words, and, where applicable, a slug that links back to the detail page. Rankings are rebuilt at ETL time so positions are stable between data refreshes rather than recomputed on every request.
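A read against such a pre-computed table might look like the sketch below. Only the filter `type = 'languages_by_size'` comes from the description above; the table layout, the column names (`rank_position`, `display_name`, `metric_value`, `slug`), the helper `fetch_ranking`, and the use of SQLite are assumptions for illustration. The row values are the five counts shown in the table on this page, and the slugs are left empty because none are given here.

```python
import sqlite3

# In-memory database standing in for the real store (an assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE rankings (
        type          TEXT NOT NULL,
        rank_position INTEGER NOT NULL,
        display_name  TEXT NOT NULL,
        metric_value  INTEGER NOT NULL,
        slug          TEXT
    )
    """
)
conn.executemany(
    "INSERT INTO rankings VALUES (?, ?, ?, ?, ?)",
    [
        ("languages_by_size", 1, "French", 4485239, None),
        ("languages_by_size", 2, "German", 1077739, None),
        ("languages_by_size", 3, "Spanish", 770428, None),
        ("languages_by_size", 4, "English", 545755, None),
        ("languages_by_size", 5, "Portuguese", 39583, None),
    ],
)


def fetch_ranking(connection, ranking_type):
    """Return the cached rows for one ranking type, in rank order."""
    cursor = connection.execute(
        "SELECT rank_position, display_name, metric_value, slug "
        "FROM rankings WHERE type = ? ORDER BY rank_position",
        (ranking_type,),
    )
    return cursor.fetchall()


ranking = fetch_ranking(conn, "languages_by_size")
for position, name, value, slug in ranking:
    print(f"{position} {name} {value:,}")
```

Reading cached rows rather than aggregating at request time is what keeps positions fixed between refreshes: the query only sorts what the ETL already wrote.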

The top of this list is anchored by French with a value of 4,485,239, followed by German at 1,077,739 and Spanish at 770,428. The current slice ends at rank #5 with Portuguese at 39,583, so the largest dictionary is roughly 113 times the size of the smallest.
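The spread between the top and bottom of the list can be checked with a few lines of arithmetic. The sketch below uses the five entry counts from the table on this page; the variable names are illustrative only.

```python
# Entry counts copied from the ranking table on this page.
entry_counts = {
    "French": 4485239,
    "German": 1077739,
    "Spanish": 770428,
    "English": 545755,
    "Portuguese": 39583,
}

largest = max(entry_counts.values())
smallest = min(entry_counts.values())
total_entries = sum(entry_counts.values())

# Ratio of the largest dictionary to the smallest.
spread_ratio = largest / smallest

# Fraction of all ranked entries that belong to the top language.
top_share = largest / total_entries

print(f"largest / smallest: {spread_ratio:.1f}")
print(f"top language share of all entries: {top_share:.1%}")
```

A ratio this large is a reminder that the counts reflect source coverage as well as language structure, so the gap should not be read as a difference in vocabulary size alone.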

Every entry above is backed by the same dictionary data that powers PlainSpell's word and confusable pages, so a ranked entry with a slug can be clicked through to see the full definition, IPA pronunciation, etymology, and any misspelling or confusable relationships that apply. The underlying fields come from Wiktionary and corpus frequency lists, with no scraping and no extrapolation.

Frequently Asked Questions

Why does French have more entries than English?

Two factors: French has rich verb conjugation (each verb can have 50+ forms), and the French Wiktionary community is exceptionally active. Each conjugated form counts as a separate entry. English, with minimal inflection, has fewer total forms per word.

Does dictionary size equal vocabulary size?

Not directly. Dictionary entries include all word forms (conjugations, plurals, cases), so a language with heavy inflection will have more entries per root word. English may have fewer total entries than a heavily inflected language like German while still having a similar number of unique root words.
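The difference between counting entries and counting root words can be illustrated with a small sketch. The `(language, lemma, form)` record shape and the function `summarize` are assumptions for this example, and the sample covers only a handful of forms of one verb per language, so the numbers are illustrative rather than real dictionary counts.

```python
from collections import defaultdict

# Hypothetical records: (language, lemma, inflected form).
sample_entries = [
    ("English", "speak", "speak"),
    ("English", "speak", "speaks"),
    ("English", "speak", "spoke"),
    ("English", "speak", "spoken"),
    ("English", "speak", "speaking"),
    ("German", "sprechen", "sprechen"),
    ("German", "sprechen", "spreche"),
    ("German", "sprechen", "sprichst"),
    ("German", "sprechen", "spricht"),
    ("German", "sprechen", "sprecht"),
    ("German", "sprechen", "sprach"),
    ("German", "sprechen", "gesprochen"),
]


def summarize(entries):
    """Count distinct forms and distinct lemmas per language."""
    forms = defaultdict(set)
    lemmas = defaultdict(set)
    for language, lemma, form in entries:
        forms[language].add(form)
        lemmas[language].add(lemma)
    return {
        language: {"entries": len(forms[language]), "lemmas": len(lemmas[language])}
        for language in forms
    }


summary = summarize(sample_entries)
for language, stats in summary.items():
    print(language, stats["entries"], "entries,", stats["lemmas"], "lemma(s)")
```

In this sample both languages have one root word, yet the German side contributes more entries, which is the effect the ranking's word counts capture.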

Data sourced from official open-source linguistic references (Wiktionary, Kaikki). See our methodology for details. Retrieved and formatted by PlainSpell Editorial.