25 Jan 2006 23:48:21 UTC
- Distribution: KinoSearch
- Source (raw)
- Browse (raw)
- How to Contribute
- Issues (5)
- Testers (3 / 3 / 0)
- KwaliteeBus factor: 0
- License: perl_5
- Activity24 month
- Download (190.38KB)
- MetaCPAN Explorer
- Subscribe to distribution
- This version
- Latest versionCREAMYG Marvin Humphreyand 1 contributors
- Marvin Humphrey <marvin at rectangular dot com>
KinoSearch::Analysis::Stemmer - reduce related words to a shared root
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' ); my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new( analyzers => [ $lc_normalizer, $tokenizer, $stemmer ], );
Stemming reduces words to a root form. For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'. For more information, see the documentation for Lingua::Stem.
This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.
Create a new stemmer. Takes a single named parameter,
language, which must be an ISO two-letter code that Lingua::Stem::Snowball understands.
Submit patches for Lingua::Stem::Snowball which enhance speed and address apostrophe-handling issues.
Copyright 2005-2006 Marvin Humphrey
See KinoSearch version 0.05.