27 Feb 2018 08:40:23 UTC
- Distribution: Lucy
- License: apache_2_0
- Perl: v5.8.3
- NWELLNHOF Nick Wellnhofer and 1 contributor
- The Apache Lucy Project <dev at lucy dot apache dot org>
Lucy::Analysis::SnowballStemmer - Reduce related words to a shared root.
    my $stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'es' );

    my $polyanalyzer = Lucy::Analysis::PolyAnalyzer->new(
        analyzers => [ $tokenizer, $normalizer, $stemmer ],
    );
This class is a wrapper around the Snowball stemming library, so it supports the same languages.
SnowballStemmer is an Analyzer which reduces related words to a root form (using the “Snowball” stemming library). For instance, “horse”, “horses”, and “horsing” all become “hors”, so that a search for “horse” will also match documents containing “horses” and “horsing”.
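The behaviour described above can be sketched with a small analysis chain. This is a minimal illustration, not part of the original documentation: it assumes an English stemmer (`language => 'en'`), uses Lucy::Analysis::StandardTokenizer and Lucy::Analysis::Normalizer as the tokenizer and normalizer, and relies on the `split` method that analyzers inherit from Lucy::Analysis::Analyzer to get the token texts back as an array reference.

```perl
use strict;
use warnings;

use Lucy::Analysis::StandardTokenizer;
use Lucy::Analysis::Normalizer;
use Lucy::Analysis::SnowballStemmer;
use Lucy::Analysis::PolyAnalyzer;

# Assumed setup: an English-language chain (the synopsis uses 'es').
my $tokenizer  = Lucy::Analysis::StandardTokenizer->new;
my $normalizer = Lucy::Analysis::Normalizer->new;
my $stemmer    = Lucy::Analysis::SnowballStemmer->new( language => 'en' );

my $polyanalyzer = Lucy::Analysis::PolyAnalyzer->new(
    analyzers => [ $tokenizer, $normalizer, $stemmer ],
);

# split() runs the text through every analyzer in order and
# returns an array reference of the resulting token texts.
my $tokens = $polyanalyzer->split('Horse horses horsing');

# Print one stemmed token per line.
print "$_\n" for @$tokens;
```

According to the description above, the three words should come back as the same root, which is why a query analyzed with the same chain matches all three forms.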
my $stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'es' );
Create a new SnowballStemmer.
language - A two-letter ISO code identifying a language supported by Snowball.
my $inversion = $snowball_stemmer->transform($inversion);
Takes a single Inversion as input and returns an Inversion, either the same one (presumably transformed in some way) or a new one.
inversion - An inversion.
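As a rough sketch of how `transform` fits into a hand-built pipeline: the snippet below assumes that `transform_text` (from Lucy::Analysis::Analyzer) is used to turn raw text into an Inversion, and that the tokens are read back with `next` on the Inversion and `get_text` on each Token. Those method names come from the wider Lucy API, not from this page, so treat them as assumptions to check against your installed version.

```perl
use strict;
use warnings;

use Lucy::Analysis::StandardTokenizer;
use Lucy::Analysis::SnowballStemmer;

my $tokenizer        = Lucy::Analysis::StandardTokenizer->new;
my $snowball_stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'en' );

# Assumption: transform_text() tokenizes a plain string into an Inversion.
my $inversion = $tokenizer->transform_text('horses horsing');

# Feed the Inversion through the stemmer; the return value may be
# the same object or a new one, so always use what comes back.
$inversion = $snowball_stemmer->transform($inversion);

# Assumption: next() yields Token objects until the Inversion is exhausted.
while ( my $token = $inversion->next ) {
    print $token->get_text, "\n";
}
```

In most applications the stemmer is not called directly like this; it is placed in a PolyAnalyzer, which calls `transform` on each analyzer in turn.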
Lucy::Analysis::SnowballStemmer isa Lucy::Analysis::Analyzer isa Clownfish::Obj.