Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of […]

Read More


This is an ooRexx line-by-line port from Ansi-C to ooRexx of the stemming routine published by Martin Porter 1980. The original source code from Porter has been commented out and emulated by the corresponding (oo)Rexx code as far as possible. This is not an example of good or fast (oo)Rexx programming, it is merely a […]

Read More


Weka wrapper for the SGM toolkit for text classification and modeling. Provides Sparse Generative Models for scalable and accurate text classification and modeling for use in high-speed and large-scale text mining. Has lower time complexity of classification than comparable software due to inference based on sparse model representation and use of an inverted index. The […]

Read More


Mansour is a simple application for analyzing digital text written in Arabic.منصور هو تطبيق مكتبي بسيط لتحليل النصوص الرقمية المكتوبة باللغة العربية. You’re probably paying too much for cell phone service. Wirefly compares hundreds of plans to help you save. Enter what you need (minutes, data, texts) into Wirefly’s innovative plan comparison tools and see […]

Read More


Entity recognition and normalization software for biomedical text The SourceForge Speed Test measures Latency/Ping, Jitter, Download Speed, Upload Speed, Buffer Bloat, and Packet Loss. Upon completion, you can view detailed reports about your connection. This HTML5 speed test does not require Flash or Java, and works on all devices including tablets and smartphones. Host on […]

Read More


Cross-platform application aimed at helping users to learn vocabulary from any foreign language(s). Add/Edit/Delete vocab words (w/ translation, category, sentence, notes, picture). Review (Quiz) vocabulary words. Today’s small-to-medium-sized (SMB) businesses and large enterprises are saving on their monthly communications costs by making one simple decision: to switch to a VoIP service solution from their old, […]

Read More


EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs. Today’s small-to-medium-sized (SMB) businesses and large enterprises are saving on their monthly communications costs by making one simple […]

Read More


With LexHoard you can build vocabularies, glossaries, dictionaries or other types of wordlists. LexHoard features: -Unicode support (with a built-in Unicode editor) -export for MediaWiki and HTML format -attribution of more than one meaning/translation/relation to each word The program is written completely in native Lazarus/FreePascal. The only non-standard library needed is ZeosLib. It uses SQlite […]

Read More


This project has been developed as part of a master thesis named “Edit Distance Adapted to Natural Language Words”. The available project consists three parts. First, the corpus gathers more than one million distinct Arab words. Second, the text files of Arabic resources. Third, the index file presents some information about these resources. Additional details […]

Read More


TF-IDF.jar is a Java Archive file to measure TF-IDF of each document in a document collection (corpus). The jar can be used to (a) get all the terms in the corpus (b) get the document frequency (DF) and inverse document frequency (IDF) of all the terms in the corpus (c) get the TF-IDF of each […]

Read More