Sameer Borate's Blog:
Porter Stemming algorithm for search
April 29, 2009 @ 07:57:06

In a recent post to his blog, Sameer looks at implementing a stemming algorithm to search an array of words. It uses a library written by Richard Heyes.

A stemming algorithm lets you reduce each English input word to its basic root or stem (e.g. 'walking' to 'walk') so that variations on a word ('walks', 'walked', 'walking') are considered equivalent when searching. These stems can then be used in a search query rather than the original words, which generally (but not always) results in more relevant search results.
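To make the idea concrete, here is a minimal sketch in Python. It is not the Porter algorithm and not the library from the post: `toy_stem` and its suffix list are hypothetical stand-ins that strip a single common suffix, which is just enough to show how 'walks', 'walked' and 'walking' collapse to one stem.

```python
# Toy suffix stripper for illustration only; this is NOT the Porter algorithm.
# The function name and the suffix list are assumptions, not the library's API.
SUFFIXES = ("ing", "ed", "s")  # checked in this order


def toy_stem(word: str) -> str:
    """Lowercase the word and strip at most one common English suffix."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so short words like "is" survive.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


variants = ["walk", "walks", "walked", "walking"]
stems = {toy_stem(v) for v in variants}
print(stems)  # {'walk'}
```

A real Porter stemmer applies several ordered rule groups with conditions on the remaining stem, so it copes with many cases this sketch gets wrong (for example 'running' or 'ponies').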

His code example uses the library to search for two different types of strings: a single word and a phrase (with stop words removed). The Stem() method is called on each word, and the results are looped through to remove any that match values in the stop words array.
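The two kinds of search described above might look roughly like the following Python sketch. Everything in it is an assumption made for illustration: the tiny `STOP_WORDS` set, the `toy_stem` helper (a crude suffix stripper, not the Porter algorithm) and the `search` function are hypothetical, and the stop words are filtered before stemming here, which may differ from the order used in the original post.

```python
# Illustrative only: a toy stemmer and a tiny stop word list, both assumed.
STOP_WORDS = {"a", "an", "and", "in", "of", "the", "to"}
SUFFIXES = ("ing", "ed", "s")


def toy_stem(word):
    """Lowercase the word and strip at most one common suffix."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def stem_query(query):
    """Split a query into words, drop stop words, and stem what remains."""
    return {toy_stem(w) for w in query.lower().split() if w not in STOP_WORDS}


def search(words, query):
    """Return entries of `words` whose stem matches any stemmed query term."""
    wanted = stem_query(query)
    return [w for w in words if toy_stem(w) in wanted]


words = ["walks", "talked", "jumping", "walking", "runner"]

# Single-word search: every variant of 'walk' matches.
print(search(words, "walked"))  # ['walks', 'walking']

# Phrase search: 'and' and 'the' are dropped before stemming.
print(search(words, "jumps and the talking"))  # ['talked', 'jumping']
```

Matching on stems rather than on the raw words is what lets the query 'walked' find 'walks' and 'walking' even though neither string contains the other.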


All content copyright, 2013 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework