Stop Words List in English for NLP

Stop words are a set of commonly used words in a language. Examples of stop words in English are “a”, “the”, “is”, “are”, etc. These words do not add much meaning to a sentence. 

They can be safely ignored without sacrificing the meaning of the sentence. For some search engines, these are some of the most common short function words like is, is, because, which, and on.

List of Stop Words

Following is a list of stop words used for natural language processing (NLP):

aourselves
aboutout
aboveover
afterown
againsame
againstshan’t
allshe
amshe’d
anshe’ll
andshe’s
anyshould
areshouldn’t
aren’tso
assome
atsuch
bethan
becausethat
beenthat’s
beforethe
beingtheir
belowtheirs
betweenthem
boththemselves
butthen
bythere
can’tthere’s
cannotthese
couldthey

 

couldn’tthey’d
didthey’ll
didn’tthey’re
dothey’ve
doesthis
doesn’tthose
doingthrough
don’tto
downtoo
duringunder
eachuntil
fewup
forvery
fromwas
furtherwasn’t
hadwe
hadn’twe’d
haswe’ll
hasn’twe’re
havewe’ve
haven’twere
havingweren’t
hewhat
he’dwhat’s
he’llwhen
he’swhen’s

 

herwhere
herewhere’s
here’swhich
herswhile
herselfwho
himwho’s
himselfwhom
hiswhy
howwhy’s
how’swith
iwon’t
i’dwould
i’llwouldn’t
i’myou
i’veyou’d
ifyou’ll
inyou’re
intoyou’ve
isyour
isn’tyours
ityourself
it’syourselves
itsnor
itselfnot
let’sof
meoff
moreon
mostonce
mustn’tonly
myor
myselfother
noought
oursour

You should only remove these tokens if they do not add any new information about your problem. Classification problems usually do not need stop words because it is possible to talk about the general idea of ​​the text even if you remove the stop words from it.

Removing stop words helps reduce both index size and query size. Fewer deadlines is always a win when it comes to performance. And since stop words are semantically empty, the relevance score is not affected.

Quick Links

  1. Computer Vocabulary Words List
  2. Words to Describe Technology