Changes for version 0.7.12

  • Add metadata about VCS

Documentation

Corpus Search Sentence utility.
initialize a sparse matrix with words co-occurrence.
one of the three possible EM-Algorithm implementations of NATools
A translator from co-occurrence matrices to a dictionary file.
A translator from dictionary file to the Perl readable format.
A pre-processor for parallel texts, counting words, checking sentence numbers, and creating auxiliary files.
one of the three possible EM-Algorithm implementations of NATools
one of the three possible EM-Algorithm implementations of NATools
C sentence aligner.
Creates a StarDict from a NATools corpus.
Command line tool to codify corpora
used to compare two PTDs in Perl dumper format.
Command line tool to create NATools Corpora Objects
interface for binary PTDs operations.
Command line tool to dump NATools PTDs
dumps a lexicon file as Perl hash.
Dumps a NATools corpus in a format suitable to be imported in CWB
generates a pmakefile to be used by Makefile::Parallel
used to create a dictionary similar to a PTD based on a word aligned corpus.
Indexes a ngrams SQLite file
join two files in NATools input format into a TMX file.
classifies each parallel corpus aligned sentence
simple interface for Vanilla aligner.
A shell interface to NATools corpora alignment
splits a TMX file into several files, one for each language

Modules

A framework for Parallel Corpora processing
Utility functions for NATools CGI tools
Simple API to query NAT Objects
Simple configuration file API
To inter-operate with NATools Corpus files
Perl extension to encapsulate Dict interface
Encapsulates NATools Lexicon files
Module to align a sentence by blocks
Perl extension to encapsulate a NATools Dictionary
Perl extension to inter-operate with a NATool Parallel Corpus

Provides

in lib/Lingua/NATools/Corpus.pm