Sameer Borate's Blog:
Porter Stemming algorithm for search
April 29, 2009 @ 07:57:06

In a recent post to his blog, Sameer looks at implementing a stemming algorithm to search an array of words. It uses a Porter stemming library written by Richard Heyes.

A stemming algorithm lets you reduce each English input word to its basic root or stem (e.g. 'walking' to 'walk') so that variations on a word ('walks', 'walked', 'walking') are considered equivalent when searching. These stems can then be used in a search query in place of the original words, which generally (but not always) gives more relevant search results.
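To make the idea concrete, here is a minimal Python sketch of suffix stripping. It is not the Porter algorithm and not the PHP library from the post: the `simple_stem` function and its `SUFFIXES` list are hypothetical, chosen only to show how several variants of a word collapse to one stem.

```python
# Simplified suffix stripping for illustration only.
# The real Porter algorithm applies several ordered rule phases;
# this hypothetical helper handles just three common suffixes.
SUFFIXES = ("ing", "ed", "s")


def simple_stem(word: str) -> str:
    """Strip one common English suffix, keeping at least three letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


variants = ["walk", "walks", "walked", "walking"]
stems = {simple_stem(w) for w in variants}
print(stems)  # all four variants share the single stem 'walk'
```

A rule set this small mis-stems many words (for example, it does not handle doubled consonants as in 'running'), which is why a full Porter implementation is normally used instead.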

His code example uses the library to search for two different types of strings: a single word and a phrase (with stop words removed). The Stem() method is called on each word, and the results are then looped through to remove any that match a value in the stop words array.
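The flow described above might look roughly like the following Python sketch. It is an assumption-laden illustration, not Sameer's PHP code: `STOP_WORDS`, `stem`, `query_stems`, and `search` are hypothetical names, and `stem` is a crude stand-in for the library's Stem() method.

```python
# Hypothetical stop word list; the original post defines its own array.
STOP_WORDS = {"a", "an", "the", "in", "of", "for", "and", "to"}


def stem(word):
    """Crude stand-in for a real Porter stemmer (assumption, not the library)."""
    word = word.lower()
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def query_stems(query):
    """Stem every word in the query, then drop stems found in the stop list."""
    stems = [stem(w) for w in query.split()]
    return {s for s in stems if s not in STOP_WORDS}


def search(query, words):
    """Return the words whose stem matches any stem from the query."""
    wanted = query_stems(query)
    return [w for w in words if stem(w) in wanted]


words = ["walks", "talked", "walking", "search", "searching", "blog"]

single = search("walked", words)                   # single-word query
phrase = search("searching for the walks", words)  # phrase with stop words

print(single)
print(phrase)
```

The same `search` function serves both cases because a single word is just a one-word phrase; the stop word filter simply has nothing to remove.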





All content copyright, 2013 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework