24 Dec 2009 00:10:44 UTC
- Development release
- Distribution: KinoSearch
- Source (raw)
- Browse (raw)
- How to Contribute
- Issues (5)
- Testers (68 / 0 / 15)
- KwaliteeBus factor: 0
- License: perl_5
- Activity24 month
- Download (809.92KB)
- MetaCPAN Explorer
- Subscribe to distribution
- This version
- Latest versionCREAMYG Marvin Humphreyand 1 contributors
- Marvin Humphrey <marvin at rectangular dot com>
KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' ); my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new( analyzers => [ $case_folder, $tokenizer, $stemmer ], );
This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.
Reduce related words to a shared root.
Stemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'.
my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
language - A two-letter ISO code identifying a language supported by Snowball.
Copyright 2005-2009 Marvin Humphrey
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.