21 Jun 2007 05:11:26 UTC
Marvin Humphrey <marvin at rectangular dot com>
NAME
KinoSearch::Analysis::Analyzer - Base class for analyzers.
SYNOPSIS
    # abstract base class -- must be subclassed
    package MyAnalyzer;
    use base qw( KinoSearch::Analysis::Analyzer );

    sub analyze_batch {
        my ( $self, $token_batch ) = @_;

        # rewrite the text of each token in the batch
        while ( my $token = $token_batch->next ) {
            my $new_text = transform( $token->get_text );
            $token->set_text($new_text);
        }

        return $token_batch;
    }

    sub transform {
        # ...
    }
DESCRIPTION
In KinoSearch, an Analyzer is a filter which processes text, transforming it from one form into another. For instance, an analyzer might break up a long text into smaller pieces (Tokenizer), or it might convert text to lowercase (LCNormalizer).
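The filter idea can be sketched in plain Perl without loading KinoSearch at all. Everything in the Sketch:: namespace below is a hypothetical stand-in invented for illustration. Sketch::Batch is not the real TokenBatch class, and Sketch::Splitter and Sketch::Lowercaser only imitate the roles that Tokenizer and LCNormalizer play. The sketch assumes nothing about KinoSearch beyond the analyze_batch method name.

```perl
use strict;
use warnings;

# Stand-in for a token batch: NOT the KinoSearch TokenBatch API,
# just a list of strings wrapped in an object.
package Sketch::Batch;
sub new   { my ( $class, @texts ) = @_; return bless { texts => [@texts] }, $class }
sub texts { return @{ $_[0]{texts} } }

# Hypothetical analyzer that breaks text into smaller pieces.
package Sketch::Splitter;
sub new { return bless {}, shift }

sub analyze_batch {
    my ( $self, $batch ) = @_;
    # split on whitespace and return a new batch of the pieces
    return Sketch::Batch->new( map { split( ' ', $_ ) } $batch->texts );
}

# Hypothetical analyzer that converts each piece to lowercase.
package Sketch::Lowercaser;
sub new { return bless {}, shift }

sub analyze_batch {
    my ( $self, $batch ) = @_;
    return Sketch::Batch->new( map { lc($_) } $batch->texts );
}

package main;

# Pass a batch through each analyzer in order, feeding the output
# of one filter to the next.
sub run_chain {
    my ( $batch, @analyzers ) = @_;
    $batch = $_->analyze_batch($batch) for @analyzers;
    return $batch;
}

my $out = run_chain(
    Sketch::Batch->new('The Quick Brown Fox'),
    Sketch::Splitter->new,
    Sketch::Lowercaser->new,
);
print join( '|', $out->texts ), "\n";    # prints "the|quick|brown|fox"
```

Because every filter accepts a batch and hands back a batch, the filters compose in any order, which is the property that makes an analyzer chain possible.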
SUBCLASSING
All Analyzer subclasses must provide an analyze_batch method.
analyze_batch
    $token_batch = $analyzer->analyze_batch($token_batch);
Abstract method. analyze_batch() takes a single TokenBatch as input, and it returns a TokenBatch, either the same one (presumably transformed in some way), or a new one.
COPYRIGHT
Copyright 2005-2007 Marvin Humphrey
LICENSE, DISCLAIMER, BUGS, etc.
See KinoSearch version 0.20.