Projects that are tagged with data cleaning.


DCABags 0.63

by wbuntine - November 8, 2013, 13:31:04 CET - 1071 views, 206 downloads, 2 subscriptions

About: Document/text preprocessing for topic models. A suite of Perl scripts for preprocessing text collections to create dictionaries and bag/list files for use by topic modelling software.

Changes:

Cleaned up man pages and created user guide.


GritBot 2.01

by zenog - September 2, 2011, 14:56:26 CET - 1942 views, 484 downloads, 1 subscription

About: GritBot is a data cleaning and outlier/anomaly detection program.

Changes:

Initial Announcement on mloss.org.