While indexing, Lucene by default ignores some common English words (stop words), because they rarely add any value to a search. These words are considered noise, and ignoring them actually improves search quality. If you have ever wondered what these words are, here is the complete list.
a, an, and, are, as, at, be, but, by, for, if, in, into, is, it, no, not, of, on, or, such, that, the, their, then, there, these, they, this, to, was, will, with
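To make the effect concrete, here is a minimal sketch in plain Python of what dropping these words at indexing time looks like. It does not use Lucene's API: the names `STOP_WORDS` and `index_terms` are made up for this illustration, and the tokenizer (lowercase, split on anything that is not a letter or digit) is a simplifying assumption, not Lucene's actual analysis chain.

```python
import re

# The 33 words from the list above (names here are illustrative, not Lucene's).
STOP_WORDS = frozenset(
    """a an and are as at be but by for if in into is it no not of on or
    such that the their then there these they this to was will with""".split()
)


def index_terms(text):
    """Return the terms that would be kept for indexing.

    Assumption: tokens are lowercased runs of ASCII letters and digits.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    # Drop every token that appears in the stop word list.
    return [token for token in tokens if token not in STOP_WORDS]


print(index_terms("The quick brown fox is in the garden"))
# prints ['quick', 'brown', 'fox', 'garden']
```

One consequence worth noticing in this sketch: a phrase made up entirely of stop words, such as "to be or not to be", leaves no terms at all.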
Friday, February 29, 2008