“XML-Print” is a joint project of the FH Worms (Prof. Marc W. Küster) and the University of Trier (Prof. Claudine Moulin) with support from TU Darmstadt (Prof. Andrea Rapp). Its goal is the creation of a XML formatter designated especially for the needs of the “Digital Humanties”. The project is funded by the DFG. Please […]

Read More


Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage’s modular and extensible architecture allows its language model […]

Read More


Arramooz Alwaseet Open Arabic Dictionary for morphological analyze. To be useful for Arabic language processing. This dictionary is derived from the Ayaspell Arabic spell checker. You’re probably paying too much for cell phone service. Wirefly compares hundreds of plans to help you save. Enter what you need (minutes, data, texts) into Wirefly’s innovative plan comparison […]

Read More


This substitution cipher toolkit enables you to en- and decrypt texts with substitution cipher, to gather language statistics of a specific language and to crack encrypted texts both manually and automatically. All functions can be accessed via an easy-to-use graphical user interface. Today’s small-to-medium-sized (SMB) businesses and large enterprises are saving on their monthly communications […]

Read More


Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations. If you are like the rest of our user community, your IT team is busy. With pressure to deliver on-time […]

Read More


Editor for formal grammars. Attempts to be universal – customizable for any grammatical formalism and any syntax. Provides features such as syntax checking and highlighting, transformations (refactoring) and advanced rule editor. If you are like the rest of our user community, your IT team is busy. With pressure to deliver on-time projects, you don’t have […]

Read More


Donatus is an on-going project consisting of Python, NLTK-based tools and grammars for deep parsing and syntactical annotation of Brazilian Portuguese corpora. It includes a user-friendly graphical user interface for building syntactic parsers with the NLTK, providing some additional functionalities. If you are like the rest of our user community, your IT team is busy. […]

Read More


MyVocabtionary (formerly phpVocabtionary) is a free PHP/MySQL-based web software that allows you to create a free dictionary. With our vast number of modifications, you can also make your dictionary even better! Download is completely free. Usage is a piece of cake and you can customise almost everything through a user-friendly GUI. Creating an online dictionary […]

Read More


Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app). Cloudbased voice solutions are common in enterprise networks and frustrating for operations teams to manage. […]

Read More


A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: – simple code to hold/read/write data and perform sample processing. – BioC-formatted corpora – BioC tools that work with BioC corpora BioC goals – simplicity – interoperability – broad use – reuse […]

Read More