Project details for mldata-utils

mldata-utils 0.1.0

by sonne - July 12, 2010, 13:33:04 CET

Description:

A tool to convert data files from and to HDF5, as used on mldata.org.
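
As a rough illustration of what converting between two of the supported formats involves, here is a minimal sketch that turns svmlight-style lines into dense CSV rows using only the Python standard library. It does not use the mldata-utils API; the function names `svmlight_to_rows` and `rows_to_csv` are hypothetical, the number of features is assumed to be known in advance, and `qid:` fields are not handled.

```python
import csv
import io


def svmlight_to_rows(lines, n_features):
    """Parse svmlight lines ("label idx:val ...") into dense rows.

    Hypothetical helper for illustration only, not part of mldata-utils.
    """
    rows = []
    for line in lines:
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        label, *pairs = line.split()
        row = [0.0] * n_features
        for pair in pairs:
            idx, val = pair.split(":")
            row[int(idx) - 1] = float(val)  # svmlight indices are 1-based
        rows.append([float(label)] + row)
    return rows


def rows_to_csv(rows):
    """Serialize rows as CSV text, label first, then the feature values."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


sample = ["1 1:0.5 3:2.0", "-1 2:1.5 # comment"]
dense = svmlight_to_rows(sample, n_features=3)
print(rows_to_csv(dense))
```

Features missing from a sparse line are written as 0.0, which is the usual reading of the svmlight format.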

Changes to previous version:

Initial Announcement on mloss.org.

Supported Operating Systems: Posix
Data Formats: Svmlight, Matlab, Arff, Octave, Hdf, Csv
Tags: Python, Data Formats, Weka, Libsvm

Other available revisions

Version Changelog Date
0.5.0
  • Changed the task file format so that data splits can have a variable number of items and be put into up to 256 categories of training/validation/test/not used/...
  • Various bugfixes.
April 8, 2011, 10:02:44
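
One possible reading of "up to 256 categories" is that each item's category is stored in a single unsigned byte. The sketch below illustrates that idea only; it is an assumption, not the documented task file format, and the category codes and the helpers `make_split` and `indices_of` are hypothetical.

```python
# Hypothetical category codes; the real task file format may use other values.
TRAIN, VALIDATION, TEST, NOT_USED = 0, 1, 2, 3


def make_split(n_items, assignments, default=NOT_USED):
    """Return one byte per item; each byte is that item's category code."""
    split = bytearray([default] * n_items)
    for index, category in assignments.items():
        if not 0 <= category <= 255:
            raise ValueError("category must fit in one byte (0..255)")
        split[index] = category
    return bytes(split)


def indices_of(split, category):
    """List the item indices assigned to the given category."""
    return [i for i, c in enumerate(split) if c == category]


split = make_split(6, {0: TRAIN, 1: TRAIN, 2: VALIDATION, 4: TEST})
print(indices_of(split, TRAIN))
print(indices_of(split, NOT_USED))
```

With this layout each category can hold any number of items, and items without an explicit assignment fall into the default category.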
0.4.1
  • Various bugfixes (sparse matrix, data extraction).
  • Client API for interacting with mldata.org now works with the live website.
December 7, 2010, 03:06:42
0.4.0
  • Conversion of sparse and dense matrices of floating point or integer types, and of string lists, from/to .hdf5, octave, matlab, csv, arff is now reliable.
  • Added examples and a small test-suite.
November 7, 2010, 14:39:56
0.3.6
  • Added a fix for when data.get_correct internally receives an array of arrays with values instead of an array with values.
  • Added support for sparse matrices in data.get_correct.
August 27, 2010, 15:31:58
0.3.5
  • Introduced task.get_test_output to get test_idx and output_variables from Task file.
  • Introduced data.get_correct() to get the 'correct' results from Data file.
  • Fixed minor issues when converting to octave/matlab.
August 25, 2010, 18:56:53
0.3.4
  • Fixed an issue with data extracts.
  • Fixed an issue when updating Task files.
  • Fixed a few issues when converting to arff/octave/matlab.
August 24, 2010, 16:01:27
0.3.2
  • task.create now includes handling of input/output_variables and train/test_idx.
  • Fixed a small error in the handling of octave files.
August 21, 2010, 13:02:07
0.3.1
  • Restored code in data.get_extract that had been removed by mistake.
  • Added safeguard for illegal task files with no output_variables.
August 20, 2010, 10:35:46
0.3.0
  • Restructured the package into a larger number of modules.
  • Revamped conversion structure.
  • Bugfix re Task vs output variables.
August 19, 2010, 12:42:57
0.2.4
  • Caught a few more error conditions when handling Task files.
  • Temporarily removed author from package information because it threw an ugly error message on older Python installations.
  • Removed label_dims and improved support for input/output variables for Tasks.
  • Created new module 'data' for better encapsulation.
August 17, 2010, 12:00:51
0.2.2
  • Added extract function (and script) for Task datasets.
  • Moved extract function for Data from website to this tool.
  • Improved handling of Task files.
August 16, 2010, 11:52:18
0.2.0

Initial Announcement on mloss.org.

July 21, 2010, 15:03:24
0.1.0

Initial Announcement on mloss.org.

July 12, 2010, 13:33:04

Comments

Yaroslav Halchenko (on December 16, 2010, 05:47:51)

any plans for furnishing Debian package, Soeren? I see no ITP ;)
