-
-
13 Feb 2015 23:47:57 UTC
- Distribution: Lingua-Stem-Patch
- Module version: 0.06
- Source (raw)
- Browse (raw)
- Changes
- Homepage
- How to Contribute
- Repository
- Issues
- Testers (612 / 2 / 0)
- Kwalitee
Bus factor: 0- 95.69% Coverage
- License: perl_5
- Perl: v5.8.1
- Activity
24 month- Tools
- Download (21.34KB)
- MetaCPAN Explorer
- Permissions
- Subscribe to distribution
- Permalinks
- This version
- Latest version
- Dependencies
- Carp
- Exporter
- Moo
- namespace::clean
- parent
- strict
- utf8
- warnings
- and possibly others
- Reverse dependencies
- CPAN Testers List
- Dependency graph
NAME
Lingua::Stem::Patch - Patch stemmers for Esperanto and Ido
VERSION
This document describes Lingua::Stem::Patch v0.06.
SYNOPSIS
use Lingua::Stem::Patch; # create Esperanto stemmer $stemmer = Lingua::Stem::Patch->new(language => 'eo'); # get stem for word $stem = $stemmer->stem($word); # get list of stems for list of words @stems = $stemmer->stem(@words);
DESCRIPTION
This module contains a collection of stemmers for multiple languages using the Patch stemming algorithms. The languages currently implemented are Esperanto, Ido, and Polish.
This is a new project under active development and the current stemming algorithms are likely to change.
Attributes
- language
-
The following language codes are currently supported.
┌───────────┬────┐ │ Esperanto │ eo │ │ Ido │ io │ │ Polish │ pl │ └───────────┴────┘
They are in the two-letter ISO 639-1 format and are case-insensitive but are always returned in lowercase when requested.
# instantiate a stemmer object $stemmer = Lingua::Stem::Patch->new(language => $language); # get current language $language = $stemmer->language; # change language $stemmer->language($language);
- aggressive
-
By default a light stemmer will be used, but when
aggressive
is set to true, an aggressive stemmer will be used instead.$stemmer->aggressive(1);
Methods
- stem
-
Accepts a list of words, stems each word, and returns a list of stems. The list returned will always have the same number of elements in the same order as the list provided. When no stemming rules apply to a word, the original word is returned.
@stems = $stemmer->stem(@words); # get the stem for a single word $stem = $stemmer->stem($word);
The words should be provided as character strings and the stems are returned as character strings. Byte strings in arbitrary character encodings are intentionally not supported.
- languages
-
Returns a list of supported two-letter language codes using lowercase letters.
# object method @languages = $stemmer->languages; # class method @languages = Lingua::Stem::Patch->languages;
SEE ALSO
Lingua::Stem::Any provides a unified interface to any stemmer on CPAN, including this module, as well as additional features like normalization, casefolding, and in-place stemming.
AUTHOR
Nick Patch <patch@cpan.org>
COPYRIGHT AND LICENSE
© 2014–2015 Nick Patch
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
Module Install Instructions
To install Lingua::Stem::Patch, copy and paste the appropriate command in to your terminal.
cpanm Lingua::Stem::Patch
perl -MCPAN -e shell install Lingua::Stem::Patch
For more information on module installation, please visit the detailed CPAN module installation guide.