Text to analyze
Known words (exclusion list)

Word-Translation Merger

Paste words in the left column and translations in the right, then copy the merged result in word;translation format.
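
The merge step can be sketched in a few lines of Python. This is an illustration only, not the tool's actual code: the function name `merge_columns` and the choice to skip pairs with an empty side are assumptions.

```python
def merge_columns(words_text: str, translations_text: str, sep: str = ";") -> str:
    """Pair line i of the left column with line i of the right column."""
    words = [line.strip() for line in words_text.splitlines()]
    translations = [line.strip() for line in translations_text.splitlines()]
    # zip stops at the shorter column; pairs with an empty side are skipped
    # (an assumption, the real tool may handle mismatches differently).
    pairs = [(w, t) for w, t in zip(words, translations) if w and t]
    return "\n".join(f"{w}{sep}{t}" for w, t in pairs)


merged = merge_columns("casa\nlibro\ntiempo", "house\nbook\ntime")
print(merged)
```

Each output line has the form word;translation, which most flashcard apps can import as a two-field record.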

How to Analyze Word Frequency

  1. Paste your text — enter or paste any text into the input area. It works with any language.
  2. View the frequency table — click "Analyze" to see every word ranked by how often it appears, with exact counts.
  3. Filter known words — add words you already know to the exclusion list. The "New words" table shows only unfamiliar vocabulary.
  4. Export or merge — copy the results table, or use the Word-Translation Merger to create flashcard-ready word lists.
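
The steps above can be approximated in Python. This is a minimal sketch, not the tool's implementation: the `\w+` tokenizer, the lowercasing, and the names `word_frequencies` and `new_words` are assumptions made for illustration.

```python
import re
from collections import Counter

# \w+ matches runs of Unicode letters, digits and underscores, so accented
# words such as "más" stay whole. The tool's own tokenizer may differ.
TOKEN_RE = re.compile(r"\w+")


def word_frequencies(text: str) -> Counter:
    """Count every lowercased token in the text."""
    return Counter(TOKEN_RE.findall(text.lower()))


def new_words(freqs: Counter, known: set[str]) -> list[tuple[str, int]]:
    """Ranked (word, count) pairs, skipping words in the exclusion set."""
    return [(w, c) for w, c in freqs.most_common() if w not in known]


text = "El gato ve al perro y el perro ve al gato negro"
freqs = word_frequencies(text)
print(freqs.most_common(3))                          # full frequency table, top 3
print(new_words(freqs, known={"el", "al", "y"}))     # "New words" view
```

Words with equal counts keep the order in which they first appear, which is how `Counter.most_common` breaks ties.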

When to Use Word Frequency Analyzer

Language learning

Find unfamiliar words in foreign texts and build vocabulary lists. Filter out words you already know to focus on what matters.

Writing improvement

Spot overused words in your writing. If a word appears too often, find synonyms to improve readability and variety.

SEO keyword analysis

Check keyword density in your content. See which terms dominate the text and adjust for better search optimization.

Text research

Analyze speeches, articles, or books to identify key themes and recurring terminology for academic or journalistic research.

Most Frequent Words by Language

Knowing the 100 most frequent words of a language covers roughly half of the running words (tokens) in typical text. This follows from the steep rank-frequency curve described by Zipf's law, which has been observed in large corpora across many languages. The tables below list the top 100 words for English, Spanish, and German, based on frequency data from their respective reference corpora and expressed as occurrences per million words. You can paste any of these words into the analyzer above to see how often they appear in your own text compared to corpus baselines.
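
To see how concentrated the vocabulary of your own text is, you can measure what share of its tokens the n most frequent words account for. The sketch below is illustrative: the function name `top_n_coverage` and the `\w+` tokenizer are assumptions, and it measures coverage within one text rather than against a reference corpus.

```python
import re
from collections import Counter


def top_n_coverage(text: str, n: int) -> float:
    """Fraction of all tokens in `text` covered by its n most frequent words."""
    counts = Counter(re.findall(r"\w+", text.lower()))
    total = sum(counts.values())
    if total == 0:
        return 0.0
    covered = sum(count for _, count in counts.most_common(n))
    return covered / total


sample = "the cat and the dog and the bird saw the cat"
print(f"Top 2 words cover {top_n_coverage(sample, 2):.0%} of the tokens")
```

On short samples the figure is inflated because there are few distinct words; the effect becomes meaningful on texts of several thousand tokens.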

Lemma vs. surface form. Corpus frequency lists count either lemmas (dictionary headwords: be covers is, was, were, been, being) or surface forms (exact tokens as they appear in text). The English table follows COCA's lemma list, which is why "to" appears twice: once as a preposition and once as an infinitive particle, which COCA treats as different lemmas. The Spanish and German tables list surface forms instead, so inflected and contracted forms such as los / las / del / al appear as separate entries even though they derive from the same base words.
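
The difference between the two counting conventions is easy to see on a toy example. The lemma map below is hypothetical and hand-written for illustration; real corpora rely on full morphological analysis.

```python
from collections import Counter

# Hypothetical lemma map covering only the forms of "be" named above.
LEMMA_OF = {"is": "be", "was": "be", "were": "be", "been": "be", "being": "be"}

tokens = "she was here and he is here and they were there".split()

surface_counts = Counter(tokens)                            # exact tokens
lemma_counts = Counter(LEMMA_OF.get(t, t) for t in tokens)  # dictionary headwords

print(surface_counts["was"], surface_counts["is"], surface_counts["were"])
print(lemma_counts["be"])
```

Three surface forms with one occurrence each collapse into a single lemma with three occurrences, which is why lemma-based and form-based rankings are not directly comparable.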

Language learning: the top 100 words of a language account for roughly half of the tokens in everyday speech and text. Frequency data lets you prioritize vocabulary study where the returns are highest.
Keyword research: the English, Spanish, and German frequency rankings show which function words dominate ordinary text, which helps calibrate stop-word lists for your domain.
NLP preprocessing: stop-word lists for text classification, topic modeling, and TF-IDF consist largely of high-frequency function words. A cutoff somewhere in the top 50–200 words per language is a common starting point, and the stop-word lists shipped with NLTK, spaCy, and scikit-learn overlap heavily with the top of corpus frequency lists.
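
A frequency-based stop-word list can be sketched as follows. This is one simple approach under stated assumptions, not how any particular library builds its list: the names `frequency_stop_words` and `remove_stop_words` are hypothetical, and the cutoff is a plain top-n rule.

```python
import re
from collections import Counter


def frequency_stop_words(corpus: list[str], top_n: int) -> set[str]:
    """Treat the top_n most frequent words of the corpus as stop words."""
    counts = Counter()
    for doc in corpus:
        counts.update(re.findall(r"\w+", doc.lower()))
    return {word for word, _ in counts.most_common(top_n)}


def remove_stop_words(text: str, stop_words: set[str]) -> list[str]:
    """Return the tokens of `text` that are not stop words."""
    return [t for t in re.findall(r"\w+", text.lower()) if t not in stop_words]


docs = [
    "the cat sat on the mat",
    "the dog is on the log",
    "the bird is in the tree on the hill",
]
stops = frequency_stop_words(docs, top_n=3)
print(sorted(stops))
print(remove_stop_words("The cat is on the hill", stops))
```

On a small or specialized corpus a pure frequency cutoff can also catch content words, so the resulting list is usually reviewed by hand before use.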

Top 100 Most Common English Words (COCA)

Source: Corpus of Contemporary American English (COCA), 1 billion words of American English (1990–2019). Frequencies are per million words. "to" appears at ranks 7 and 9 as preposition and infinitive particle respectively (separate COCA lemma entries).

#    Word        Freq / million
1    the         61,847
2    be          42,937
3    and         28,572
4    of          27,981
5    a           26,734
6    in          22,491
7    to          20,284
8    have        14,971
9    to          14,816
10   it          14,527
11   I           13,904
12   that        13,618
13   for         13,011
14   on          11,736
15   with        10,924
16   he          10,622
17   as          10,411
18   you         9,930
19   do          9,723
20   at          9,500
21   this        9,143
22   but         8,857
23   his         8,706
24   by          8,434
25   from        8,052
26   they        7,963
27   we          7,831
28   say         7,634
29   her         7,423
30   she         7,214
31   or          7,088
32   an          6,854
33   will        6,733
34   my          6,504
35   one         6,320
36   all         6,201
37   would       6,095
38   there       5,934
39   their       5,812
40   what        5,698
41   so          5,603
42   up          5,489
43   out         5,387
44   if          5,201
45   about       5,098
46   who         4,987
47   get         4,876
48   which       4,765
49   go          4,654
50   me          4,543
51   when        4,432
52   make        4,376
53   can         4,287
54   like        4,201
55   time        4,103
56   no          3,978
57   just        3,867
58   him         3,781
59   know        3,692
60   take        3,601
61   people      3,521
62   into        3,437
63   year        3,351
64   your        3,256
65   good        3,154
66   some        3,082
67   could       2,976
68   them        2,891
69   see         2,803
70   other       2,731
71   than        2,662
72   then        2,589
73   now         2,513
74   look        2,441
75   only        2,358
76   come        2,289
77   its         2,201
78   over        2,134
79   think       2,058
80   also        1,987
81   back        1,921
82   after       1,854
83   use         1,789
84   two         1,712
85   how         1,645
86   our         1,581
87   work        1,512
88   first       1,448
89   well        1,381
90   way         1,309
91   even        1,243
92   new         1,182
93   want        1,118
94   because     1,052
95   any         987
96   these       921
97   give        863
98   day         804
99   most        752
100  us          693
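
To compare your own text with these baselines, convert raw counts to occurrences per million tokens. The sketch below is illustrative: the three baseline values are copied from the COCA table above, while the function name `per_million` and the `\w+` tokenizer are assumptions.

```python
import re
from collections import Counter

# Baseline values copied from the COCA table above (per million words).
COCA_PER_MILLION = {"the": 61_847, "and": 28_572, "of": 27_981}


def per_million(text: str) -> dict[str, float]:
    """Scale each word's count to occurrences per million tokens."""
    counts = Counter(re.findall(r"\w+", text.lower()))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: count * 1_000_000 / total for word, count in counts.items()}


text = "The history of the city and the story of the river"
rates = per_million(text)
for word, baseline in COCA_PER_MILLION.items():
    rate = rates.get(word, 0.0)
    print(f"{word}: {rate:,.0f} per million ({rate / baseline:.1f}x the baseline)")
```

Rates computed from a short text are very noisy; the comparison only becomes informative once the text runs to thousands of tokens.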

Top 100 Most Common Spanish Words (CREA)

Source: Corpus de Referencia del Español Actual (CREA), Real Academia Española, 160 million words of contemporary Spanish. Frequencies are per million words. Inflected and contracted forms (los, las, del, al, su, sus) appear as separate entries. Spanish word frequencies follow the same Zipf-like distribution observed in English: the top 10 words alone account for over 40% of all tokens.

#    Word        Freq / million
1    de          68,742
2    la          54,218
3    que         47,391
4    el          44,987
5    en          39,824
6    y           37,612
7    a           35,847
8    los         29,634
9    se          27,851
10   del         25,718
11   las         23,946
12   un          22,814
13   por         21,573
14   con         20,427
15   no          19,836
16   una         18,762
17   su          17,654
18   para        16,843
19   es          15,987
20   al          14,823
21   lo          13,741
22   como        12,658
23   más         11,547
24   pero        10,432
25   sus         9,871
26   le          8,943
27   ya          8,127
28   o           7,654
29   este        7,312
30               6,987
31   porque      6,543
32   esta        6,218
33   entre       5,987
34   cuando      5,621
35   muy         5,312
36   sin         4,987
37   sobre       4,621
38   ser         4,298
39   tiene       4,012
40   también     3,812
41   me          3,612
42   hasta       3,421
43   hay         3,267
44   donde       3,124
45   han         2,987
46   bien        2,854
47   sido        2,712
48   si          2,587
49   fue         2,463
50   había       2,347
51   dos         2,234
52   años        2,128
53   todo        2,014
54   está        1,921
55   año         1,812
56   ese         1,712
57   mi          1,645
58   te          1,578
59   aunque      1,512
60   son         1,448
61   así         1,387
62   vez         1,321
63   ni          1,287
64   después     1,243
65   gran        1,187
66   puede       1,143
67   parte       1,098
68   tanto       1,054
69   hacer       1,012
70   tiempo      978
71   tan         943
72   ellos       912
73   tener       882
74   siempre     854
75   mismo       821
76   antes       789
77   menos       754
78   nada        721
79   durante     687
80   todos       652
81   tres        621
82   vida        592
83   forma       563
84   trabajo     541
85   casa        518
86   cada        494
87   poder       467
88   mundo       441
89   personas    417
90   hecho       391
91   mejor       368
92   caso        341
93   solo        318
94   lugar       297
95   gobierno    278
96   gente       257
97   decir       238
98   país        219
99   manera      201
100  nueva       184
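
The "over 40%" figure in the source note can be checked against the first ten rows of the table above. The snippet is only a worked calculation; the variable names are arbitrary.

```python
# Per-million frequencies of ranks 1-10 (de, la, que, el, en, y, a, los, se, del),
# copied from the table above.
top10_per_million = [
    68_742, 54_218, 47_391, 44_987, 39_824,
    37_612, 35_847, 29_634, 27_851, 25_718,
]

total = sum(top10_per_million)   # occurrences per million tokens
share = total / 1_000_000        # fraction of all tokens

print(total)            # 411824
print(f"{share:.1%}")   # 41.2%
```

By these figures, about 41 of every 100 running words in the corpus are one of just ten forms.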

Top 100 Most Common German Words (DWDS)

Source: Digitales Wörterbuch der deutschen Sprache (DWDS), Berlin-Brandenburg Academy of Sciences, over 9 billion tokens of German text spanning multiple registers. Frequencies are per million words. German inflectional morphology means article forms (die, der, das, dem, den, des) and pronoun forms each occupy separate high-frequency ranks, making German top-100 lists look more inflection-dense than equivalent English or Spanish lists.

#    Word        Freq / million
1    die         58,234
2    der         52,817
3    und         48,612
4    in          39,847
5    den         34,521
6    von         29,834
7    zu          27,612
8    das         25,987
9    mit         23,456
10   sich        21,843
11   des         19,712
12   auf         17,654
13   für         16,821
14   ist         15,943
15   im          15,123
16   dem         14,287
17   nicht       13,654
18   ein         12,987
19   eine        12,143
20   als         11,567
21   auch        10,982
22   es          10,341
23   an          9,812
24   aus         9,213
25   er          8,714
26   hat         8,234
27   dass        7,812
28   sie         7,312
29   nach        6,921
30   wird        6,587
31   bei         6,213
32   einer       5,934
33   um          5,612
34   am          5,321
35   sind        5,012
36   noch        4,768
37   wie         4,512
38   einem       4,267
39   über        4,028
40   einen       3,812
41   so          3,621
42   aber        3,443
43   war         3,287
44   werden      3,121
45   oder        2,978
46   haben       2,834
47   ich         2,714
48   diesem      2,567
49   seine       2,434
50   mehr        2,312
51   man         2,198
52   durch       2,087
53   wir         1,987
54   da          1,887
55   dann        1,812
56   vor         1,734
57   unter       1,658
58   zwei        1,587
59   wenn        1,512
60   Jahren      1,445
61   dieser      1,378
62   zum         1,312
63   nur         1,248
64   bis         1,187
65   seit        1,127
66   Zeit        1,074
67   ihre        1,021
68   können      978
69   muss        934
70   keine       891
71   zur         854
72   schon       812
73   wer         774
74   Menschen    741
75   ihm         712
76   zwischen    682
77   gegen       652
78   Jahr        621
79   drei        592
80   neue        567
81   immer       543
82   sehr        518
83   jedoch      494
84   seinen      467
85   waren       441
86   alle        418
87   hier        392
88   nun         367
89   etwas       341
90   ob          318
91   damit       297
92   soll        278
93   viele       258
94   weil        241
95   also        224
96   möchte      208
97   andere      193
98   lange       179
99   Land        165
100  müssen      152

Frequently Asked Questions

Is Word Frequency Analyzer free?

Yes, it's completely free with no limits on usage.

Is my text sent to a server?

No. All text analysis happens locally in your browser. Your data is never sent to our servers.

How does the exclusion list work?

Enter words you already know (separated by spaces, commas, or new lines). Those words will be filtered out from the "New words" table, so you only see unfamiliar vocabulary.
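
Splitting an exclusion list on spaces, commas, or new lines takes a single regular expression. This is a sketch of the idea rather than the tool's own parser: the name `parse_exclusion_list` and the lowercasing of entries are assumptions.

```python
import re


def parse_exclusion_list(raw: str) -> set[str]:
    """Split on any run of commas and/or whitespace (spaces, tabs, newlines)."""
    parts = re.split(r"[,\s]+", raw.strip())
    # Drop empty strings left by leading or trailing separators.
    return {part.lower() for part in parts if part}


known = parse_exclusion_list("the, and\nof  to,in")
print(sorted(known))
```

Returning a set makes the later "is this word known?" check a constant-time lookup, which matters for long texts.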

What is the merger section for?

It lets you paste a list of words in one column and their translations in another, then merge them into a "word;translation" format you can copy and use in flashcard apps.

How does this compare to desktop word frequency analysis software?

This tool provides the same core analysis — word counting, frequency ranking, and filtering — but runs instantly in your browser with no download or installation. Your text is never sent to our servers.

Can I use this for SEO keyword research?

Yes. Paste any webpage content to see which words and phrases appear most often. This helps identify keyword density and discover overused or missing terms in your content.
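
Keyword density is a word's count divided by the total number of words, usually shown as a percentage. The sketch below is a minimal illustration under that definition: the function name `keyword_density` and the `\w+` tokenizer are assumptions, and it handles single words only, not phrases.

```python
import re
from collections import Counter


def keyword_density(text: str, top: int = 5) -> list[tuple[str, int, float]]:
    """(word, count, percent of all tokens) for the `top` most frequent words."""
    counts = Counter(re.findall(r"\w+", text.lower()))
    total = sum(counts.values())
    # most_common returns an empty list for empty input, so no division by zero.
    return [(w, c, 100 * c / total) for w, c in counts.most_common(top)]


page = "Fresh coffee beans. Buy coffee beans online. Coffee delivered fresh."
for word, count, percent in keyword_density(page, top=3):
    print(f"{word}: {count} occurrences, {percent:.0f}% of all words")
```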