stopwords_nl.txt 4.4 KB

| From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt
| This file is distributed under the BSD License.
| See http://snowball.tartarus.org/license.php
| Also see http://www.opensource.org/licenses/bsd-license.html
| - Encoding was converted to UTF-8.
| - This notice was added.
| A Dutch stop word list. Comments begin with a vertical bar. Each stop
| word is at the start of a line.
| This is a ranked list (commonest to rarest) of stopwords derived from
| a large sample of Dutch text.
| Dutch stop words frequently exhibit homonym clashes. These are indicated
| clearly below.
de | the
en | and
van | of, from
ik | I, the ego
te | (1) chez, at etc., (2) to, (3) too
dat | that, which
die | that, those, who, which
in | in, inside
een | a, an, one
hij | he
het | the, it
niet | not, nothing, naught
zijn | (1) to be, being, (2) his, one's, its
is | is
was | (1) was, past tense of all persons sing. of 'zijn' (to be), (2) wax, (3) the washing, (4) rise of river
op | on, upon, at, in, up, used up
aan | on, upon, to (as dative)
met | with, by
als | like, such as, when
voor | (1) before, in front of, (2) furrow
had | had, past tense of all persons sing. of 'hebben' (have)
er | there
maar | but, only
om | round, about, for etc.
hem | him
dan | then
zou | should/would, past tense of all persons sing. of 'zullen'
of | or, whether, if
wat | what, something, anything
mijn | possessive 'my' and noun 'mine'
men | people, 'one'
dit | this
zo | so, thus, in this way
door | through, by
over | over, across
ze | she, her, they, them
zich | oneself
bij | (1) a bee, (2) by, near, at
ook | also, too
tot | till, until
je | you
mij | me
uit | out of, from
der | Old Dutch form of 'van der' still found in surnames
daar | (1) there, (2) because
haar | (1) her, their, them, (2) hair
naar | (1) unpleasant, unwell etc., (2) towards, (3) as
heb | present first person sing. of 'to have'
hoe | how, why
heeft | present third person sing. of 'to have'
hebben | 'to have' and various parts thereof
deze | this
u | you
want | (1) for, (2) mitten, (3) rigging
nog | yet, still
zal | 'shall', first and third person sing. of verb 'zullen' (will)
me | me
zij | she, they
nu | now
ge | 'thou', still used in Belgium and south Netherlands
geen | none
omdat | because
iets | something, somewhat
worden | to become, grow, get
toch | yet, still
al | all, every, each
waren | (1) 'were', (2) to wander, (3) wares
veel | much, many
meer | (1) more, (2) lake
doen | to do, to make
toen | then, when
moet | noun 'spot/mote' and present form of 'must'
ben | (1) am, (2) 'are' in interrogative second person singular of 'to be'
zonder | without
kan | noun 'can' and present form of 'to be able'
hun | their, them
dus | so, consequently
alles | all, everything, anything
onder | under, beneath
ja | yes, of course
eens | once, one day
hier | here
wie | who
werd | imperfect third person sing. of 'become'
altijd | always
doch | yet, but etc.
wordt | present third person sing. of 'become'
wezen | (1) to be, (2) 'been' as in 'been fishing', (3) orphans
kunnen | to be able
ons | us/our
zelf | self
tegen | against, towards, at
na | after, near
reeds | already
wil | (1) present tense of 'want', (2) 'will', noun, (3) fender
kon | could; past tense of 'to be able'
niets | nothing
uw | your
iemand | somebody
geweest | been; past participle of 'be'
andere | other
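
The format described in the header (comments begin with a vertical bar, each stop word sits at the start of a line) can be read with a few lines of code. The following is a minimal sketch, not part of the original file: the function name `parse_stopwords` and the inline `sample` text are hypothetical, and it assumes everything from the first vertical bar to the end of a line is a comment.

```python
def parse_stopwords(text):
    """Return the stop words in file order, skipping comments and blank lines.

    Assumption: everything from the first '|' to the end of the line is a
    comment, as the file header describes.
    """
    words = []
    for line in text.splitlines():
        # Keep only the part before the first vertical bar, if any.
        entry = line.split("|", 1)[0].strip()
        if entry:
            words.append(entry)
    return words


# Hypothetical excerpt in the same format as the list above.
sample = """\
| A Dutch stop word list.
de | the
en | and
van | of, from
"""

stopwords = parse_stopwords(sample)
print(stopwords)  # ['de', 'en', 'van']

# Filtering a tokenized sentence against the parsed list.
stopset = set(stopwords)
tokens = "de hond en de kat".split()
content = [t for t in tokens if t not in stopset]
print(content)  # ['hond', 'kat']
```

Because the list is ranked from commonest to rarest, keeping file order lets a caller use only the first N entries. Note that filtering on surface form also removes the homonyms flagged above, for example 'haar' in the sense of 'hair'.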