30 Sep 2014 01:32:42 UTC
- Distribution: Lucy
- Source (raw)
- Browse (raw)
- How to Contribute
- Clone repository
- Testers (28 / 33 / 3)
- KwaliteeBus factor: 1
- License: apache_2_0
- Perl: v5.8.3
- Activity24 month
- Download (1.06MB)
- MetaCPAN Explorer
- Subscribe to distribution
- This version
- Latest version++ed by:6 non-PAUSE usersCREAMYG Marvin Humphreyand 1 contributors
- The Apache Lucy Project <dev at lucy dot apache dot org>
Lucy::Analysis::SnowballStemmer - Reduce related words to a shared root.
my $stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'es' ); my $polyanalyzer = Lucy::Analysis::PolyAnalyzer->new( analyzers => [ $tokenizer, $normalizer, $stemmer ], );
This class is a wrapper around the Snowball stemming library, so it supports the same languages.
SnowballStemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'.
my $stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'es' );
language - A two-letter ISO code identifying a language supported by Snowball.
Lucy::Analysis::SnowballStemmer isa Lucy::Analysis::Analyzer isa Clownfish::Obj.