This is a Python script that converts office documents from one format to another, including the text and images they contain.
Python, Amazon EC2, Ubuntu
Supported formats:
- documents: DOC, DOCX, ODT, RTF, HTML, EPUB, FB2, TXT, PDF;
- presentations: PPT, PPTX, ODP, PDF, ZIP, SWF;
- tables: XLS, XLSX, CSV, ODS, HTML, PDF.
Runs on Ubuntu on an Amazon EC2 Large instance.
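The format list above could be represented as a simple lookup table. The following is only a minimal sketch, not the script's actual code: the names `SUPPORTED_FORMATS`, `categories_for`, and `can_convert` are hypothetical, the spreadsheet entries are assumed to mean the Excel formats XLS and XLSX, and the rule that a conversion is valid only within one category is an assumption made for illustration.

```python
from pathlib import Path

# Format table taken from the list above (extensions lowercased).
# Assumption: the spreadsheet entries refer to the Excel formats xls/xlsx.
SUPPORTED_FORMATS = {
    "documents": {"doc", "docx", "odt", "rtf", "html", "epub", "fb2", "txt", "pdf"},
    "presentations": {"ppt", "pptx", "odp", "pdf", "zip", "swf"},
    "tables": {"xls", "xlsx", "csv", "ods", "html", "pdf"},
}


def categories_for(extension):
    """Return the sorted category names that list the given extension."""
    ext = extension.lower().lstrip(".")
    return sorted(
        name for name, formats in SUPPORTED_FORMATS.items() if ext in formats
    )


def can_convert(source_path, target_format):
    """Check whether source and target formats share at least one category.

    Assumption (hypothetical rule): conversions happen only within a category.
    """
    source_ext = Path(source_path).suffix
    target = target_format.lower().lstrip(".")
    return any(
        target in SUPPORTED_FORMATS[category]
        for category in categories_for(source_ext)
    )
```

For example, `can_convert("report.docx", "pdf")` is true under this rule, while `can_convert("slides.pptx", "csv")` is not, because no category lists both PPTX and CSV.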