<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>PHPDeveloper.org</title>
    <link>http://www.phpdeveloper.org</link>
    <description>Up-to-the Minute PHP News, views and community</description>
    <language>en-us</language>
    <pubDate>Wed, 19 Jun 2013 08:44:44 -0500</pubDate>
    <ttl>30</ttl>
    <item>
      <title><![CDATA[Sameer Borate's Blog: Porter Stemming algorithm for search]]></title>
      <guid>http://www.phpdeveloper.org/news/12423</guid>
      <link>http://www.phpdeveloper.org/news/12423</link>
      <description><![CDATA[<p>
In a recent post to his blog <i>Sameer</i> looks at <a href="http://www.codediesel.com/php/porter-stemming-in-search/">implementing a Stemming algorithm</a> to search an array of words. It uses <a href="http://tartarus.org/~martin/PorterStemmer/php.txt">this library</a> (as written by <i>Richard Heyes</i>).
</p>
<blockquote>
A stemming algorithm lets you reduce each English input word to its basic root or stem (e.g. 'walking' to 'walk') so that variations on a word ('walks', 'walked', 'walking') are considered equivalent when searching. This stems can than be used in a search query rather than the original words, which generally (but not always) results in more relevant search results.
</blockquote>
<p>
His code example uses the library to search for two different types of strings - a single word and a phrase (with stop words removed). The Stem() method is called on the word and the results are looped through to remove all matching the values in the stop words array.
</p>]]></description>
      <pubDate>Wed, 29 Apr 2009 07:57:06 -0500</pubDate>
    </item>
  </channel>
</rss>
