This is a rudimentary tool to produce a clean file from an unclean file (Trados/Wordfast) by stripping out the tw4win indicators.


poclean <input> <output>


<input> is the text versions of the unclean RTF files
<output> is the intended output file / directory


--version show program’s version number and exit
-h, --help show this help message and exit
--manpage output a manpage based on the help
 show progress as: dots, none, bar, names, verbose
 show errorlevel as: none, message, exception, traceback
-iINPUT, --input=INPUT
 read from INPUT in pot format
-xEXCLUDE, --exclude=EXCLUDE
 exclude names matching EXCLUDE from input paths
-oOUTPUT, --output=OUTPUT
 write to OUTPUT in po, pot formats
 read from TEMPLATE in po, pot formats


To create a text version of the unclean RTF file, you need UnRTF, available here: project site or here (windows).

unrtf translation.rtf  --text > translation.po

You might need to convert the encoding of the file, with iconv, for example:

iconv -f latin1 -t utf-8 translation.po > new_translation.po

Now you can clean the file with poclean

poclean new_translation.po clean_translation.po