
docx2txt

Docx2txt is a Perl-based command-line utility that converts Microsoft docx documents, even corrupted ones, to reasonably formatted text files with appropriate character conversions. Besides Perl, it requires a command-line unzip program such as unzip, 7z, pkzipc, or wzunzip.
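The need for an unzip program follows from the file format: a docx document is a zip archive whose main text lives in the member word/document.xml. The sketch below illustrates that idea in Python; it is not docx2txt's Perl code. The function name docx_to_text is hypothetical, and the sketch handles only plain runs, tabs, and line breaks, assuming an undamaged archive.

```python
import zipfile
from xml.etree import ElementTree as ET

# WordprocessingML namespace used by the elements in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def _tag(name):
    """Return the namespace-qualified tag name ElementTree expects."""
    return f"{{{W_NS}}}{name}"


def docx_to_text(source):
    """Extract paragraph text from a .docx given a path or a file-like object.

    Hypothetical helper for illustration only: no list numbering,
    no hyperlink display, and no recovery from damaged archives.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(_tag("p")):
        parts = []
        # Walk only runs (w:r) so tab-stop definitions in paragraph
        # properties are not mistaken for tab characters.
        for run in para.iter(_tag("r")):
            for node in run:
                if node.tag == _tag("t"):
                    parts.append(node.text or "")
                elif node.tag == _tag("tab"):
                    parts.append("\t")
                elif node.tag == _tag("br"):
                    parts.append("\n")
        paragraphs.append("".join(parts))
    return "\n".join(paragraphs)
```

Calling docx_to_text("report.docx") (a hypothetical file name) would return the document text with one line per paragraph.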


Website http://docx2txt.sourceforge.net
Features
  • Consists of a core Perl script, wrapper Unix/Windows shell scripts, and a configuration file, with provision for maintaining a separate system-wide configuration file and individual user-level configuration files.
  • The Perl script also works with input/output redirection, and is useful for viewing docx file content directly in editors like vim and emacs, and in file browsers like mc (Midnight Commander).
  • Can recover text from damaged docx documents in many cases.
  • Justifies short lines, shows hyperlinks, and performs many character conversions (missing in MS text conversion).
  • Handles bullet, decimal, letter, and roman lists, along with indentation.
  • Installation via Makefiles and a Windows batch file. On non-Windows systems, the scripts and the configuration file can be installed in separate directories.
  • Can conveniently be used to build a web-based docx document conversion service.
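
To make the list-handling feature concrete, here is a minimal Python sketch of how bullet, decimal, letter, and roman list items might be rendered as indented plain text. It is not taken from docx2txt: the function names, the two-space indent, the "*" bullet character, and the spreadsheet-style letter sequence (a ... z, aa, ab) are all assumptions made for illustration.

```python
def to_roman(number):
    """Convert a positive integer to a lowercase roman numeral."""
    pairs = [
        (1000, "m"), (900, "cm"), (500, "d"), (400, "cd"),
        (100, "c"), (90, "xc"), (50, "l"), (40, "xl"),
        (10, "x"), (9, "ix"), (5, "v"), (4, "iv"), (1, "i"),
    ]
    parts = []
    for value, symbol in pairs:
        count, number = divmod(number, value)
        parts.append(symbol * count)
    return "".join(parts)


def to_letter(number):
    """Convert 1 -> a, 26 -> z, 27 -> aa (an assumed lettering scheme)."""
    letters = ""
    while number > 0:
        number, remainder = divmod(number - 1, 26)
        letters = chr(ord("a") + remainder) + letters
    return letters


def format_item(style, index, level, text, indent="  "):
    """Render one list item with a label for its style, indented by level."""
    if style == "bullet":
        label = "*"
    elif style == "decimal":
        label = f"{index}."
    elif style == "letter":
        label = f"{to_letter(index)}."
    elif style == "roman":
        label = f"{to_roman(index)}."
    else:
        raise ValueError(f"unknown list style: {style}")
    return f"{indent * level}{label} {text}"
```

For example, format_item("decimal", 2, 0, "Install") gives an unindented "2. Install", while the same call with style "roman" and level 1 gives the label "ii." behind one indent step.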