Map your path to clean data with Open Studio for Data Quality, the leading open source data profiling tool. Open Studio for Data Quality easily connects to hundreds of data sources and generates analysis to help define the next steps to clean data. Evaluate data quality against custom-defined thresholds, and measure conformance to internal standards […]



GPerftools (formerly Google Performance Tools) combines a high-performance multi-threaded malloc() implementation with some pretty nifty performance analysis tools useful for creating more robust applications. These tools can be especially useful when developing multi-threaded applications in C++ with templates. Among these tools are TCMalloc, a thread-friendly heap-checker, heap-profiler and cpu-profiler. […]



IMDbPY is a Python package for retrieving and managing data from the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both IMDb's web server and a local copy of the whole database. […]



Open|SpeedShop is an open source, multi-platform Linux performance tool that supports performance analysis of applications running on both single-node and large-scale IA64, IA32, EM64T, AMD64, PPC, Blue Gene, ARM and Cray platforms. […]



A Java agent that recognizes “flows” as unique sequences of instrumented method calls. Runtime statistics are collected for the dynamically detected flows during each snapshot interval. The instrumented methods may be set either manually via the UI or automatically in drill-down mode. The automatic drill-down mode effectively instruments classes generated by frameworks like Hibernate, Rhino, etc. […]



The DaCapo benchmark suite is designed to facilitate performance analysis of Java Virtual Machines, compilers and memory management. It comprises a harness and a suite of open source real-world programs. […]



This is an offshoot of the open source data quality (osDQ) project: https://sourceforge.net/projects/dataquality/. This subproject will create Apache Spark-based data quality and data preparation features for big data. It uses the Java API of Apache Spark. […]



SLOCCount is an easy-to-use tool that counts Source Lines of Code (SLOC). It auto-determines the language(s) (including C, C++, Ada, Assembly, shell, COBOL, C#, Fortran, Haskell, Java, LISP/Scheme, Perl, PHP, Python, Ruby, SQL). It also estimates cost and time. […]



Warning: Older projects may have to be compiled with -DDEBUG_NEW_CONFIG_HAVE_DEPRECATED. Debug_New makes it possible to debug memory leaks (note that this is just one of its functions), and it is implemented using memory states.
- Cross-platform and cross-compiler support (only utilizes the preprocessor and template-voodoo-magic for C++)
- Supports […]



Windows plotting (wplot) software. wplot is a quick, easy-to-use Windows GUI program for creating and analyzing data plots. Create plots with millions of data points in seconds. Crazy fast. Once the data is plotted you can easily use the many tools to explore your plot in great detail by zooming […]
