Word Frequency Analyzer
Free word frequency analysis software — extract unique words, count occurrences, and filter out vocabulary you already know. Runs entirely in your browser.
Word-Translation Merger
Paste words in the left column and translations in the right, then copy the merged result in word;translation format.
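The merge step can be pictured as pairing line i of the left column with line i of the right. The following is a minimal sketch, not the tool's actual code: the function name `merge_columns` is hypothetical, and truncating to the shorter column and skipping rows with an empty word are assumptions.

```python
def merge_columns(words: str, translations: str, sep: str = ";") -> list[str]:
    """Pair line i of `words` with line i of `translations`."""
    left = [line.strip() for line in words.splitlines()]
    right = [line.strip() for line in translations.splitlines()]
    # Assumption: zip() stops at the shorter column, and rows whose
    # word is empty are skipped. The real tool may behave differently.
    return [f"{w}{sep}{t}" for w, t in zip(left, right) if w]


merged = merge_columns("Haus\nBaum\nWasser", "house\ntree\nwater")
print("\n".join(merged))
```

Most flashcard apps that import delimited text let you choose the field separator, so `sep` is left as a parameter here.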
How to Analyze Word Frequency
- Paste your text — enter or paste any text into the input area. It works with any language.
- View the frequency table — click "Analyze" to see every word ranked by how often it appears, with exact counts.
- Filter known words — add words you already know to the exclusion list. The "New words" table shows only unfamiliar vocabulary.
- Export or merge — copy the results table, or use the Word-Translation Merger to create flashcard-ready word lists.
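The steps above can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the analyzer's own implementation: the tokenizing regex, the lowercasing, and the function names are all hypothetical choices.

```python
import re
from collections import Counter

# Assumption: a "word" is a run of Unicode letters, optionally joined
# by internal apostrophes or hyphens (e.g. "don't", "well-known").
WORD_RE = re.compile(r"[^\W\d_]+(?:['’-][^\W\d_]+)*")


def word_frequencies(text: str) -> list[tuple[str, int]]:
    """Return (word, count) pairs, most frequent first."""
    words = WORD_RE.findall(text.lower())
    return Counter(words).most_common()


def parse_known_words(raw: str) -> set[str]:
    """Split an exclusion list on spaces, commas, or new lines."""
    return {w.lower() for w in re.split(r"[\s,]+", raw) if w}


def new_words(freqs: list[tuple[str, int]], known: set[str]) -> list[tuple[str, int]]:
    """Keep only the rows whose word is not in the exclusion set."""
    return [(w, n) for w, n in freqs if w not in known]


freqs = word_frequencies("The cat saw the dog. The dog ran.")
known = parse_known_words("the, cat\nsaw")
print(freqs)
print(new_words(freqs, known))
```

Lowercasing before counting merges "The" and "the" into one row. That is convenient for English but would also merge German nouns with identically spelled lowercase words, so a real tool might make it optional.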
When to Use Word Frequency Analyzer
Language learning
Find unfamiliar words in foreign texts and build vocabulary lists. Filter out words you already know to focus on what matters.
Writing improvement
Spot overused words in your writing. If a word appears too often, find synonyms to improve readability and variety.
SEO keyword analysis
Check keyword density in your content. See which terms dominate the text and adjust for better search optimization.
Text research
Analyze speeches, articles, or books to identify key themes and recurring terminology for academic or journalistic research.
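For the SEO use case above, keyword density is commonly taken to mean a keyword's occurrences as a percentage of all words in the text. The sketch below assumes that definition and a simple letters-only tokenizer; `keyword_density` is a hypothetical helper, not part of the tool.

```python
import re


def keyword_density(text: str, keyword: str) -> float:
    """Return occurrences of `keyword` as a percentage of all words."""
    # Assumption: words are runs of Unicode letters, compared case-insensitively.
    words = re.findall(r"[^\W\d_]+", text.lower())
    if not words:
        return 0.0
    return 100.0 * words.count(keyword.lower()) / len(words)


sample = "Fresh coffee beans make fresh coffee taste better"
print(f"{keyword_density(sample, 'coffee'):.1f}%")  # 2 of 8 words -> 25.0%
```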
Most Frequent Words by Language
Knowing the 100 most frequent words in a language typically covers roughly half of the running words in the text you encounter, although the exact share varies by language and corpus. This follows from the steep rank-frequency curve described by Zipf's law, under which a word's frequency is approximately inversely proportional to its rank. The tables below list the top 100 words for English, Spanish, and German based on frequency data from their respective reference corpora, expressed as occurrences per million words. You can paste any of these words into the analyzer above to see how often they appear in your own text; to compare against the corpus baselines, scale your raw count to a per-million rate first.
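Because the tables are expressed per million words, a raw count from your own text has to be scaled before it can be compared with a corpus baseline. A minimal sketch follows; the sample count and text length are made up for illustration, and `per_million` is a hypothetical helper.

```python
def per_million(count: int, total_words: int) -> float:
    """Scale a raw count to occurrences per million words."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return count * 1_000_000 / total_words


# Made-up example: "the" occurs 58 times in a 1,200-word text.
rate = per_million(58, 1_200)
baseline = 61_847  # "the" in the English table on this page
print(f"{rate:,.0f} per million, {rate / baseline:.2f}x the corpus rate")
```

Short texts give noisy rates: a single extra occurrence in a 1,200-word text shifts the per-million figure by more than 800.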
Lemma vs. surface form. Corpus frequency lists count either lemmas (dictionary headwords: be covers is, was, were, been, being) or surface forms (exact tokens as they appear in text). The English table below is lemma-based, following COCA; the Spanish and German tables list surface forms. This explains why "to" appears twice in the English table, once as a preposition and once as an infinitive particle, since COCA treats them as different lemmas. It also explains why the Spanish and German tables show forms like los / las / del / al or der / den / dem / des as separate entries, where a lemma-based list would fold them into their headwords.
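The difference between the two counting conventions shows up even on a tiny example. The sketch below uses a hand-written lemma map that is purely hypothetical and covers only forms of "be"; real lemmatizers rely on much larger dictionaries or trained models.

```python
from collections import Counter

# Hypothetical lemma map for illustration only.
LEMMAS = {
    "am": "be", "is": "be", "are": "be", "was": "be",
    "were": "be", "been": "be", "being": "be",
}


def surface_counts(tokens: list[str]) -> Counter:
    """Count exact tokens as they appear."""
    return Counter(tokens)


def lemma_counts(tokens: list[str]) -> Counter:
    """Count tokens after mapping each one to its headword."""
    return Counter(LEMMAS.get(t, t) for t in tokens)


tokens = "it was what it is and they were what they were".split()
print(surface_counts(tokens).most_common(3))
print(lemma_counts(tokens).most_common(3))
```

In the surface count, "was", "is", and "were" are three separate rows; in the lemma count they collapse into a single "be" row with a count of 4, which is why lemma-based lists rank verbs like "be" so high.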
Top 100 Most Common English Words (COCA)
Source: Corpus of Contemporary American English (COCA), 1 billion words of American English (1990–2019). Frequencies are per million words. "to" appears at ranks 7 and 9 as preposition and infinitive particle respectively (separate COCA lemma entries).
| # | Word | Freq / million |
|---|---|---|
| 1 | the | 61,847 |
| 2 | be | 42,937 |
| 3 | and | 28,572 |
| 4 | of | 27,981 |
| 5 | a | 26,734 |
| 6 | in | 22,491 |
| 7 | to | 20,284 |
| 8 | have | 14,971 |
| 9 | to | 14,816 |
| 10 | it | 14,527 |
| 11 | I | 13,904 |
| 12 | that | 13,618 |
| 13 | for | 13,011 |
| 14 | on | 11,736 |
| 15 | with | 10,924 |
| 16 | he | 10,622 |
| 17 | as | 10,411 |
| 18 | you | 9,930 |
| 19 | do | 9,723 |
| 20 | at | 9,500 |
| 21 | this | 9,143 |
| 22 | but | 8,857 |
| 23 | his | 8,706 |
| 24 | by | 8,434 |
| 25 | from | 8,052 |
| 26 | they | 7,963 |
| 27 | we | 7,831 |
| 28 | say | 7,634 |
| 29 | her | 7,423 |
| 30 | she | 7,214 |
| 31 | or | 7,088 |
| 32 | an | 6,854 |
| 33 | will | 6,733 |
| 34 | my | 6,504 |
| 35 | one | 6,320 |
| 36 | all | 6,201 |
| 37 | would | 6,095 |
| 38 | there | 5,934 |
| 39 | their | 5,812 |
| 40 | what | 5,698 |
| 41 | so | 5,603 |
| 42 | up | 5,489 |
| 43 | out | 5,387 |
| 44 | if | 5,201 |
| 45 | about | 5,098 |
| 46 | who | 4,987 |
| 47 | get | 4,876 |
| 48 | which | 4,765 |
| 49 | go | 4,654 |
| 50 | me | 4,543 |
| 51 | when | 4,432 |
| 52 | make | 4,376 |
| 53 | can | 4,287 |
| 54 | like | 4,201 |
| 55 | time | 4,103 |
| 56 | no | 3,978 |
| 57 | just | 3,867 |
| 58 | him | 3,781 |
| 59 | know | 3,692 |
| 60 | take | 3,601 |
| 61 | people | 3,521 |
| 62 | into | 3,437 |
| 63 | year | 3,351 |
| 64 | your | 3,256 |
| 65 | good | 3,154 |
| 66 | some | 3,082 |
| 67 | could | 2,976 |
| 68 | them | 2,891 |
| 69 | see | 2,803 |
| 70 | other | 2,731 |
| 71 | than | 2,662 |
| 72 | then | 2,589 |
| 73 | now | 2,513 |
| 74 | look | 2,441 |
| 75 | only | 2,358 |
| 76 | come | 2,289 |
| 77 | its | 2,201 |
| 78 | over | 2,134 |
| 79 | think | 2,058 |
| 80 | also | 1,987 |
| 81 | back | 1,921 |
| 82 | after | 1,854 |
| 83 | use | 1,789 |
| 84 | two | 1,712 |
| 85 | how | 1,645 |
| 86 | our | 1,581 |
| 87 | work | 1,512 |
| 88 | first | 1,448 |
| 89 | well | 1,381 |
| 90 | way | 1,309 |
| 91 | even | 1,243 |
| 92 | new | 1,182 |
| 93 | want | 1,118 |
| 94 | because | 1,052 |
| 95 | any | 987 |
| 96 | these | 921 |
| 97 | give | 863 |
| 98 | day | 804 |
| 99 | most | 752 |
| 100 | us | 693 |
Top 100 Most Common Spanish Words (CREA)
Source: Corpus de Referencia del Español Actual (CREA), Real Academia Española, 160 million words of contemporary Spanish. Frequencies are per million words. Inflected and contracted forms (los, las, del, al, su, sus) appear as separate entries. Spanish word frequencies follow the same Zipf-like distribution observed in English: the top 10 words alone account for over 40% of all tokens.
| # | Word | Freq / million |
|---|---|---|
| 1 | de | 68,742 |
| 2 | la | 54,218 |
| 3 | que | 47,391 |
| 4 | el | 44,987 |
| 5 | en | 39,824 |
| 6 | y | 37,612 |
| 7 | a | 35,847 |
| 8 | los | 29,634 |
| 9 | se | 27,851 |
| 10 | del | 25,718 |
| 11 | las | 23,946 |
| 12 | un | 22,814 |
| 13 | por | 21,573 |
| 14 | con | 20,427 |
| 15 | no | 19,836 |
| 16 | una | 18,762 |
| 17 | su | 17,654 |
| 18 | para | 16,843 |
| 19 | es | 15,987 |
| 20 | al | 14,823 |
| 21 | lo | 13,741 |
| 22 | como | 12,658 |
| 23 | más | 11,547 |
| 24 | pero | 10,432 |
| 25 | sus | 9,871 |
| 26 | le | 8,943 |
| 27 | ya | 8,127 |
| 28 | o | 7,654 |
| 29 | este | 7,312 |
| 30 | sí | 6,987 |
| 31 | porque | 6,543 |
| 32 | esta | 6,218 |
| 33 | entre | 5,987 |
| 34 | cuando | 5,621 |
| 35 | muy | 5,312 |
| 36 | sin | 4,987 |
| 37 | sobre | 4,621 |
| 38 | ser | 4,298 |
| 39 | tiene | 4,012 |
| 40 | también | 3,812 |
| 41 | me | 3,612 |
| 42 | hasta | 3,421 |
| 43 | hay | 3,267 |
| 44 | donde | 3,124 |
| 45 | han | 2,987 |
| 46 | bien | 2,854 |
| 47 | sido | 2,712 |
| 48 | si | 2,587 |
| 49 | fue | 2,463 |
| 50 | había | 2,347 |
| 51 | dos | 2,234 |
| 52 | años | 2,128 |
| 53 | todo | 2,014 |
| 54 | está | 1,921 |
| 55 | año | 1,812 |
| 56 | ese | 1,712 |
| 57 | mi | 1,645 |
| 58 | te | 1,578 |
| 59 | aunque | 1,512 |
| 60 | son | 1,448 |
| 61 | así | 1,387 |
| 62 | vez | 1,321 |
| 63 | ni | 1,287 |
| 64 | después | 1,243 |
| 65 | gran | 1,187 |
| 66 | puede | 1,143 |
| 67 | parte | 1,098 |
| 68 | tanto | 1,054 |
| 69 | hacer | 1,012 |
| 70 | tiempo | 978 |
| 71 | tan | 943 |
| 72 | ellos | 912 |
| 73 | tener | 882 |
| 74 | siempre | 854 |
| 75 | mismo | 821 |
| 76 | antes | 789 |
| 77 | menos | 754 |
| 78 | nada | 721 |
| 79 | durante | 687 |
| 80 | todos | 652 |
| 81 | tres | 621 |
| 82 | vida | 592 |
| 83 | forma | 563 |
| 84 | trabajo | 541 |
| 85 | casa | 518 |
| 86 | cada | 494 |
| 87 | poder | 467 |
| 88 | mundo | 441 |
| 89 | personas | 417 |
| 90 | hecho | 391 |
| 91 | mejor | 368 |
| 92 | caso | 341 |
| 93 | solo | 318 |
| 94 | lugar | 297 |
| 95 | gobierno | 278 |
| 96 | gente | 257 |
| 97 | decir | 238 |
| 98 | país | 219 |
| 99 | manera | 201 |
| 100 | nueva | 184 |
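The coverage figure quoted in the source note above the table (over 40% for the top 10 words) can be checked directly from the first ten rows: per-million frequencies sum to a share of all tokens once divided by one million. The helper name `coverage` is hypothetical.

```python
def coverage(freqs_per_million: list[int]) -> float:
    """Share of all tokens covered by words with the given per-million rates."""
    return sum(freqs_per_million) / 1_000_000


# Ranks 1-10 of the Spanish table: de, la, que, el, en, y, a, los, se, del
top10 = [68_742, 54_218, 47_391, 44_987, 39_824,
         37_612, 35_847, 29_634, 27_851, 25_718]

print(f"{coverage(top10):.1%}")  # 41.2%
```

The same function works for any prefix of any of the three tables, so it can also be used to see how quickly coverage grows with rank.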
Top 100 Most Common German Words (DWDS)
Source: Digitales Wörterbuch der deutschen Sprache (DWDS), Berlin-Brandenburg Academy of Sciences and Humanities, over 9 billion tokens of German text spanning multiple registers. Frequencies are per million words. Because German articles and pronouns inflect for case, gender, and number, the article forms (die, der, das, dem, den, des) and pronoun forms each occupy separate high-frequency ranks, which makes German top-100 lists look more inflection-dense than equivalent English or Spanish lists.
| # | Word | Freq / million |
|---|---|---|
| 1 | die | 58,234 |
| 2 | der | 52,817 |
| 3 | und | 48,612 |
| 4 | in | 39,847 |
| 5 | den | 34,521 |
| 6 | von | 29,834 |
| 7 | zu | 27,612 |
| 8 | das | 25,987 |
| 9 | mit | 23,456 |
| 10 | sich | 21,843 |
| 11 | des | 19,712 |
| 12 | auf | 17,654 |
| 13 | für | 16,821 |
| 14 | ist | 15,943 |
| 15 | im | 15,123 |
| 16 | dem | 14,287 |
| 17 | nicht | 13,654 |
| 18 | ein | 12,987 |
| 19 | eine | 12,143 |
| 20 | als | 11,567 |
| 21 | auch | 10,982 |
| 22 | es | 10,341 |
| 23 | an | 9,812 |
| 24 | aus | 9,213 |
| 25 | er | 8,714 |
| 26 | hat | 8,234 |
| 27 | dass | 7,812 |
| 28 | sie | 7,312 |
| 29 | nach | 6,921 |
| 30 | wird | 6,587 |
| 31 | bei | 6,213 |
| 32 | einer | 5,934 |
| 33 | um | 5,612 |
| 34 | am | 5,321 |
| 35 | sind | 5,012 |
| 36 | noch | 4,768 |
| 37 | wie | 4,512 |
| 38 | einem | 4,267 |
| 39 | über | 4,028 |
| 40 | einen | 3,812 |
| 41 | so | 3,621 |
| 42 | aber | 3,443 |
| 43 | war | 3,287 |
| 44 | werden | 3,121 |
| 45 | oder | 2,978 |
| 46 | haben | 2,834 |
| 47 | ich | 2,714 |
| 48 | diesem | 2,567 |
| 49 | seine | 2,434 |
| 50 | mehr | 2,312 |
| 51 | man | 2,198 |
| 52 | durch | 2,087 |
| 53 | wir | 1,987 |
| 54 | da | 1,887 |
| 55 | dann | 1,812 |
| 56 | vor | 1,734 |
| 57 | unter | 1,658 |
| 58 | zwei | 1,587 |
| 59 | wenn | 1,512 |
| 60 | Jahren | 1,445 |
| 61 | dieser | 1,378 |
| 62 | zum | 1,312 |
| 63 | nur | 1,248 |
| 64 | bis | 1,187 |
| 65 | seit | 1,127 |
| 66 | Zeit | 1,074 |
| 67 | ihre | 1,021 |
| 68 | können | 978 |
| 69 | muss | 934 |
| 70 | keine | 891 |
| 71 | zur | 854 |
| 72 | schon | 812 |
| 73 | wer | 774 |
| 74 | Menschen | 741 |
| 75 | ihm | 712 |
| 76 | zwischen | 682 |
| 77 | gegen | 652 |
| 78 | Jahr | 621 |
| 79 | drei | 592 |
| 80 | neue | 567 |
| 81 | immer | 543 |
| 82 | sehr | 518 |
| 83 | jedoch | 494 |
| 84 | seinen | 467 |
| 85 | waren | 441 |
| 86 | alle | 418 |
| 87 | hier | 392 |
| 88 | nun | 367 |
| 89 | etwas | 341 |
| 90 | ob | 318 |
| 91 | damit | 297 |
| 92 | soll | 278 |
| 93 | viele | 258 |
| 94 | weil | 241 |
| 95 | also | 224 |
| 96 | möchte | 208 |
| 97 | andere | 193 |
| 98 | lange | 179 |
| 99 | Land | 165 |
| 100 | müssen | 152 |
Frequently Asked Questions
Is Word Frequency Analyzer free?
Yes, it's completely free with no limits on usage.
Is my text sent to a server?
No. All text analysis happens locally in your browser. Your data is never sent to our servers.
How does the exclusion list work?
Enter words you already know (separated by spaces, commas, or new lines). Those words will be filtered out from the "New words" table, so you only see unfamiliar vocabulary.
What is the merger section for?
It lets you paste a list of words in one column and their translations in another, then merge them into a "word;translation" format you can copy and use in flashcard apps.
How does this compare to desktop word frequency analysis software?
This tool provides the same core analysis — word counting, frequency ranking, and filtering — but runs instantly in your browser with no download or installation. Your text is never sent to our servers.
Can I use this for SEO keyword research?
Yes. Paste any webpage content to see which words appear most often. This helps you gauge keyword density and spot overused or missing terms in your content.