15 Apr 2012 22:44:37 UTC
- Distribution: KinoSearch
- License: perl_5
- CREAMYG (Marvin Humphrey) and 1 contributor
- Marvin Humphrey <marvin at rectangular dot com>
KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.
The KinoSearch code base has been assimilated by the Apache Lucy project. The "KinoSearch" namespace has been deprecated, but development continues under our new name at our new home: http://lucy.apache.org/
    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );

    my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new(
        analyzers => [ $case_folder, $tokenizer, $stemmer ],
    );
This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.
Stemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors", so a search for 'horse' will also match documents containing 'horses' and 'horsing'.
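The reduction described above can be observed by calling Lingua::Stem::Snowball directly, without building an index. This is a minimal sketch, assuming Lingua::Stem::Snowball is installed. It uses that module's own new() and stem_in_place() methods rather than anything from KinoSearch, and the variable names are illustrative only.

```perl
use strict;
use warnings;
use Lingua::Stem::Snowball;

# English stemmer, selected by its two-letter ISO code.
my $snowball = Lingua::Stem::Snowball->new( lang => 'en' );

# stem_in_place() modifies the array it is given, so work on a copy.
my @words = qw( horse horses horsing );
my @stems = @words;
$snowball->stem_in_place( \@stems );

# Print each word next to the root form it was reduced to.
for my $i ( 0 .. $#words ) {
    print "$words[$i] => $stems[$i]\n";
}
```

All three words should print the same stem, which is why a query for one of them can match documents containing the others once both the query and the documents have passed through the same analyzer.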
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
language - A two-letter ISO code identifying a language supported by Snowball.
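Because the accepted codes are whatever Snowball supports, one way to check a code before constructing a Stemmer is to ask Lingua::Stem::Snowball for its list. This is a sketch under assumptions: is_supported_language() is a hypothetical helper, not part of KinoSearch, and how KinoSearch itself reacts to an unknown code is not covered here.

```perl
use strict;
use warnings;
use Lingua::Stem::Snowball;

# Hypothetical helper: true if Snowball reports a stemmer for this ISO code.
sub is_supported_language {
    my ($code) = @_;
    # stemmers() returns the list of valid language codes.
    my %supported = map { $_ => 1 } Lingua::Stem::Snowball::stemmers();
    return exists $supported{ lc $code } ? 1 : 0;
}

for my $code (qw( es en xx )) {
    printf "%s: %s\n", $code,
        is_supported_language($code) ? 'supported' : 'not supported';
}
```

Checking up front keeps a typo in the language code from surfacing later as a harder-to-diagnose failure when the analyzer chain is assembled.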
Copyright 2005-2011 Marvin Humphrey
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.