stopwords_sv.txt 3.5 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133
  1. | From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt
  2. | This file is distributed under the BSD License.
  3. | See http://snowball.tartarus.org/license.php
  4. | Also see http://www.opensource.org/licenses/bsd-license.html
  5. | - Encoding was converted to UTF-8.
  6. | - This notice was added.
  7. |
  8. | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
  9. | A Swedish stop word list. Comments begin with vertical bar. Each stop
  10. | word is at the start of a line.
  11. | This is a ranked list (commonest to rarest) of stopwords derived from
  12. | a large text sample.
  13. | Swedish stop words occasionally exhibit homonym clashes. For example
  14. | så = so, but also seed. These are indicated clearly below.
  15. och | and
  16. det | it, this/that
  17. att | to (with infinitive)
  18. i | in, at
  19. en | a
  20. jag | I
  21. hon | she
  22. som | who, that
  23. han | he
  24. på | on
  25. den | it, this/that
  26. med | with
  27. var | where, each
  28. sig | him(self) etc
  29. för | for
  30. så | so (also: seed)
  31. till | to
  32. är | is
  33. men | but
  34. ett | a
  35. om | if; around, about
  36. hade | had
  37. de | they, these/those
  38. av | of
  39. icke | not, no
  40. mig | me
  41. du | you
  42. henne | her
  43. då | then, when
  44. sin | his
  45. nu | now
  46. har | have
  47. inte | inte någon = no one
  48. hans | his
  49. honom | him
  50. skulle | 'sake'
  51. hennes | her
  52. där | there
  53. min | my
  54. man | one (pronoun)
  55. ej | nor
  56. vid | at, by, on (also: vast)
  57. kunde | could
  58. något | some etc
  59. från | from, off
  60. ut | out
  61. när | when
  62. efter | after, behind
  63. upp | up
  64. vi | we
  65. dem | them
  66. vara | be
  67. vad | what
  68. över | over
  69. än | than
  70. dig | you
  71. kan | can
  72. sina | his
  73. här | here
  74. ha | have
  75. mot | towards
  76. alla | all
  77. under | under (also: wonder)
  78. någon | some etc
  79. eller | or (else)
  80. allt | all
  81. mycket | much
  82. sedan | since
  83. ju | why
  84. denna | this/that
  85. själv | myself, yourself etc
  86. detta | this/that
  87. åt | to
  88. utan | without
  89. varit | was
  90. hur | how
  91. ingen | no
  92. mitt | my
  93. ni | you
  94. bli | to be, become
  95. blev | from bli
  96. oss | us
  97. din | thy
  98. dessa | these/those
  99. några | some etc
  100. deras | their
  101. blir | from bli
  102. mina | my
  103. samma | (the) same
  104. vilken | who, that
  105. er | you, your
  106. sådan | such a
  107. vår | our
  108. blivit | from bli
  109. dess | its
  110. inom | within
  111. mellan | between
  112. sådant | such a
  113. varför | why
  114. varje | each
  115. vilka | who, that
  116. ditt | thy
  117. vem | who
  118. vilket | who, that
  119. sitta | his
  120. sådana | such a
  121. vart | each
  122. dina | thy
  123. vars | whose
  124. vårt | our
  125. våra | our
  126. ert | your
  127. era | your
  128. vilkas | whose