stopwords_nl.txt 4.6 KB

| From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt
| This file is distributed under the BSD License.
| See http://snowball.tartarus.org/license.php
| Also see http://www.opensource.org/licenses/bsd-license.html
| - Encoding was converted to UTF-8.
| - This notice was added.
|
| NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
| A Dutch stop word list. Comments begin with vertical bar. Each stop
| word is at the start of a line.
| This is a ranked list (commonest to rarest) of stopwords derived from
| a large sample of Dutch text.
| Dutch stop words frequently exhibit homonym clashes. These are indicated
| clearly below.
de | the
en | and
van | of, from
ik | I, the ego
te | (1) chez, at etc, (2) to, (3) too
dat | that, which
die | that, those, who, which
in | in, inside
een | a, an, one
hij | he
het | the, it
niet | not, nothing, naught
zijn | (1) to be, being, (2) his, one's, its
is | is
was | (1) was, past tense of all persons sing. of 'zijn' (to be) (2) wax, (3) the washing, (4) rise of river
op | on, upon, at, in, up, used up
aan | on, upon, to (as dative)
met | with, by
als | like, such as, when
voor | (1) before, in front of, (2) furrow
had | had, past tense all persons sing. of 'hebben' (have)
er | there
maar | but, only
om | round, about, for etc
hem | him
dan | then
zou | should/would, past tense all persons sing. of 'zullen'
of | or, whether, if
wat | what, something, anything
mijn | possessive and noun 'mine'
men | people, 'one'
dit | this
zo | so, thus, in this way
door | through, by
over | over, across
ze | she, her, they, them
zich | oneself
bij | (1) a bee, (2) by, near, at
ook | also, too
tot | till, until
je | you
mij | me
uit | out of, from
der | Old Dutch form of 'van der' still found in surnames
daar | (1) there, (2) because
haar | (1) her, their, them, (2) hair
naar | (1) unpleasant, unwell etc, (2) towards, (3) as
heb | present first person sing. of 'to have'
hoe | how, why
heeft | present third person sing. of 'to have'
hebben | 'to have' and various parts thereof
deze | this
u | you
want | (1) for, (2) mitten, (3) rigging
nog | yet, still
zal | 'shall', first and third person sing. of verb 'zullen' (will)
me | me
zij | she, they
nu | now
ge | 'thou', still used in Belgium and south Netherlands
geen | none
omdat | because
iets | something, somewhat
worden | to become, grow, get
toch | yet, still
al | all, every, each
waren | (1) 'were', (2) to wander, (3) wares
veel | much, many
meer | (1) more, (2) lake
doen | to do, to make
toen | then, when
moet | noun 'spot/mote' and present form of 'to must'
ben | (1) am, (2) 'are' in interrogative second person singular of 'to be'
zonder | without
kan | noun 'can' and present form of 'to be able'
hun | their, them
dus | so, consequently
alles | all, everything, anything
onder | under, beneath
ja | yes, of course
eens | once, one day
hier | here
wie | who
werd | imperfect third person sing. of 'become'
altijd | always
doch | yet, but etc
wordt | present third person sing. of 'become'
wezen | (1) to be, (2) 'been' as in 'been fishing', (3) orphans
kunnen | to be able
ons | us/our
zelf | self
tegen | against, towards, at
na | after, near
reeds | already
wil | (1) present tense of 'want', (2) 'will', noun, (3) fender
kon | could; past tense of 'to be able'
niets | nothing
uw | your
iemand | somebody
geweest | been; past participle of 'be'
andere | other
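
The header of this file describes the format: a vertical bar starts a comment, and each stop word sits at the start of its line. A minimal Python sketch of a reader for that format might look like the following. The function name `parse_snowball_stopwords` and the inline sample are hypothetical and not part of this file. The sketch also accepts several whitespace-separated words on one line, which is an assumption that goes beyond what the header states.

```python
def parse_snowball_stopwords(text):
    """Return the stop words in file order, ignoring '|' comments."""
    words = []
    for line in text.splitlines():
        # Everything from the first vertical bar onward is a comment.
        entry = line.split("|", 1)[0]
        # Assumption: a line may hold zero or more whitespace-separated words.
        words.extend(entry.split())
    return words


# Hypothetical sample in the same layout as the list above.
sample = """\
| A Dutch stop word list. Comments begin with vertical bar.
de | the
en | and
was | (1) was, (2) wax, (3) the washing
|
"""

stopwords = parse_snowball_stopwords(sample)
print(stopwords)
```

Because the list is ranked from commonest to rarest, the sketch returns a list rather than a set so that the order survives. Wrap the result in `set(...)` when only membership tests are needed.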