popuretext

Extracts all the source text from a directory of POT files or the target text from a directory of PO files, removing PO headers and optionally the accelerator keys.

If you want to use other tools to analyse the text within a translation project, then this is the tool for you. For example, you can use it to calculate word frequencies to create an initial glossary based on the pure source text.

Prerequisites

  • GNU Gettext

  • sed

Usage

popuretext <-P pot-dir|po-dir> <file.txt> [accelerator]

Where:

pot-dir

a directory containing POT files

po-dir

a directory containing PO files

file.txt

file that contains the output text

accelerator

optional: accelerator marker to be removed from the text

Examples

popuretext -P pot pot.txt '&'

Extract all the source text from the pot directory and place it in the pot.txt file removing all occurrences of the & accelerator.

popuretext af af.txt

Extract all target text from the Afrikaans files in the af directory, placing the extracted text in af.txt. In this case we are not filtering any accelerator characters.