Projects that are platform independent.
Showing items 1-20 of 56 (page 1 of 3).

JMLR MLPACK 1.0.6

by rcurtin - June 19, 2013, 19:21:07 CET. 21061 views, 3821 downloads, 4 subscriptions

About: A scalable, fast C++ machine learning library, with emphasis on usability.

Changes:

Minor bugfix so that FastMKS gets built.


JMLR GPstuff 4.2

by avehtari - June 17, 2013, 13:22:52 CET. 2476 views, 408 downloads, 1 subscription

About: The GPstuff toolbox is a versatile collection of Gaussian process models and computational tools required for inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods.

Changes:

2013-06-14 Version 4.2

Improvements

  • Cross-validation much faster if no bias-corrections are needed (computes only the necessary predictions)
  • Marginal posterior corrections with loopred (Laplace) and cross-validation
  • More robust computation of marginal posterior corrections
  • More robust density estimation in lgpdens (default parameters changed)

Bug fixes

  • Mex files now in correct folders if compiled with SuiteSparse (covariance matrix computation now much faster)
  • Fixed bug with default marginal posterior correction when using gp_predcm
  • Fixed conditions in likelihood functions for grid approximation of predictions with marginal posterior corrections
  • Fixed outputs of gpmc_preds with multilatent models (thanks to Mahdi Biparva for pointing this out)
  • and some minor bug fixes

APCluster 1.3.2

by UBod - June 12, 2013, 11:38:01 CET. 9311 views, 1728 downloads, 1 subscription

Rating: 4.5 of 5 stars (based on 2 votes)

About: The apcluster package implements Frey and Dueck's Affinity Propagation clustering in R. The package further provides leveraged affinity propagation, exemplar-based agglomerative clustering, and various tools for visual analysis of clustering results.

Changes:
  • plotting of clustering results extended to data sets with more than two dimensions (the clustering result is superimposed on a scatterplot matrix); plot() can no longer be used to draw a heatmap, so heatmap() must now always be used.
  • improved NA handling
  • correction of input check in apcluster() and apclusterL() (previously, both functions issued a warning whenever argument p had length > 1)
  • corresponding updates and further improvements of help pages and vignette

JMLR Jstacs 2.1

by keili - June 3, 2013, 07:32:55 CET. 9944 views, 2177 downloads, 2 subscriptions

About: A Java framework for statistical analysis and classification of biological sequences

Changes:

New classes:

  • MultipleIterationsCondition: Requires another TerminationCondition to fail a contiguous, specified number of times
  • ClassifierFactory: Allows for creating standard classifiers
  • SeqLogoPlotter: Plot PNG sequence logos from within Jstacs
  • MultivariateGaussianEmission: Multivariate Gaussian emission density for a Hidden Markov Model
  • MEManager: Maximum entropy model
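
The one-line description of MultipleIterationsCondition suggests a wrapper that stops an iterative procedure only after an inner condition has failed several times in a row. The sketch below illustrates that reading in Python; it is not Jstacs code, and the class name `ConsecutiveFailureCondition`, the method `do_next_iteration`, and the idea of passing an improvement value are all assumptions made for illustration.

```python
from typing import Callable


class ConsecutiveFailureCondition:
    """Hypothetical wrapper: continue until `inner` fails `threshold` times in a row."""

    def __init__(self, inner: Callable[[float], bool], threshold: int) -> None:
        if threshold < 1:
            raise ValueError("threshold must be at least 1")
        self.inner = inner          # returns True while iteration should go on
        self.threshold = threshold  # required number of contiguous failures
        self.failures = 0           # current run of contiguous failures

    def do_next_iteration(self, value: float) -> bool:
        """Return True if another iteration should be performed."""
        if self.inner(value):
            self.failures = 0       # a success resets the run
        else:
            self.failures += 1
        return self.failures < self.threshold


# Usage: stop once the improvement was too small in two consecutive iterations.
condition = ConsecutiveFailureCondition(lambda improvement: improvement > 1e-3, 2)
decisions = [condition.do_next_iteration(v) for v in [0.5, 0.0, 0.2, 0.0, 0.0]]
```

A single small improvement does not stop the loop here, because the following large improvement resets the failure counter.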

New features and improvements:

  • Alignment: Added free shift alignment
  • PerformanceMeasure and sub-classes: Extension to weighted test data
  • AbstractClassifier, ClassifierAssessment and sub-classes: Adaptation to weighted PerformanceMeasures
  • DNAAlphabet: Parser speed-up
  • PFMComparator: Extension to PFM from other sources/databases
  • ToolBox: New convenience methods for computing several statistics (e.g., median, correlation)
  • SignificantMotifOccurrencesFinder: New methods for computing PWMs and statistics from predictions
  • SequenceScore and sub-classes: New method toString(NumberFormat)
  • DataSet: Adaptation to weighted data, e.g., partitioning
  • REnvironment: Changed several methods from String to CharSequence

Restructuring:

  • changed MultiDimensionalSequenceWrapperDiffSM to MultiDimensionalSequenceWrapperDiffSS

Several minor new features, bug fixes, and code cleanups


Cognitive Foundry 3.3.3

by Baz - May 21, 2013, 05:59:37 CET. 9013 views, 1796 downloads, 2 subscriptions

About: The Cognitive Foundry is a modular Java software library of machine learning components and algorithms designed for research and applications.

Changes:
  • General:
    • Made code able to compile under both Java 1.6 and 1.7. This required removing some potentially unsafe methods that used varargs with generics.
    • Upgraded XStream dependency to 1.4.4.
    • Improved support for regression algorithms in learning.
    • Added general-purpose adapters to make it easier to compose learning algorithms and adapt their input or output.
  • Common Core:
    • Added isSparse, toArray, dotDivide, and dotDivideEquals methods for Vector and Matrix.
    • Added scaledPlus, scaledPlusEquals, scaledMinus, and scaledMinusEquals to Ring (and thus Vector and Matrix) for potentially faster versions of these operations.
    • Fixed issue where matrix and dense vector equals was not checking for equal dimensionality.
    • Added transform, transformEquals, transformNonZeros, and transformNonZerosEquals to Vector.
    • Made LogNumber into a signed version of a log number and moved the prior unsigned implementation into UnsignedLogNumber.
    • Added EuclideanRing interface that provides methods for times, timesEquals, divide, and divideEquals. Also added Field interface that provides methods for inverse and inverseEquals. These interfaces are now implemented by the appropriate number classes such as ComplexNumber, MutableInteger, MutableLong, MutableDouble, LogNumber, and UnsignedLogNumber.
    • Added interface for Indexer and DefaultIndexer implementation for creating a zero-based indexing of values.
    • Added interfaces for MatrixFactoryContainer and DivergenceFunctionContainer.
    • Added ReversibleEvaluator, which various identity functions implement, as well as a new utility class ForwardReverseEvaluatorPair to create a reversible evaluator from a pair of other evaluators.
    • Added method to create an ArrayList from a pair of values in CollectionUtil.
    • ArgumentChecker now properly throws assertion errors for NaN values. Also added checks for long types.
    • Fixed handling of Infinity in subtraction for LogMath.
    • Fixed issue with angle method that would cause a NaN if cosine had a rounding error.
    • Added new createMatrix methods to MatrixFactory that initialize the Matrix with the given value.
    • Added copy, reverse, and isEmpty methods for several array types to ArrayUtil.
    • Added utility methods for creating a HashMap, LinkedHashMap, HashSet, or LinkedHashSet with an expected size to CollectionUtil.
    • Added getFirst and getLast methods for List types to CollectionUtil.
    • Removed some calls to System.out and Exception.printStackTrace.
  • Common Data:
    • Added create method for IdentityDataConverter.
    • ReversibleDataConverter now is an extension of ReversibleEvaluator.
  • Learning Core:
    • Added general learner transformation capability to make it easier to adapt and compose algorithms. InputOutputTransformedBatchLearner provides this capability for supervised learning algorithms by composing together a triplet. CompositeBatchLearnerPair does it for a pair of algorithms.
    • Added constant and identity learners.
    • Added Chebyshev, Identity, and Minkowski distance metrics.
    • Added methods to DatasetUtil to get the output values for a dataset and to compute the sum of weights.
    • Made generics more permissive for supervised cost functions.
    • Added ClusterDistanceEvaluator, which takes a clustering and returns the distances from an input value to all clusters as a vector.
    • Fixed potential round-off issue in decision tree splitter.
    • Added random subspace technique, implemented in RandomSubspace.
    • Separated functionality from LinearFunction into IdentityScalarFunction. LinearFunction by default is the same, but has parameters that can change the slope and offset of the function.
    • Default squashing function for GeneralizedLinearModel and DifferentiableGeneralizedLinearModel is now a linear function instead of an atan function.
    • Added a weighted estimator for the Poisson distribution.
    • Added Regressor interface for evaluators that are the output of (single-output) regression learning algorithms. Existing such evaluators have been updated to implement this interface.
    • Added support for regression ensembles including additive and averaging ensembles with and without weights. Added a learner for regression bagging in BaggingRegressionLearner.
    • Added a simple univariate regression class in UnivariateLinearRegression.
    • MultivariateDecorrelator now is a VectorInputEvaluator and VectorOutputEvaluator.
    • Added bias term to PrimalEstimatedSubGradient.
  • Text Core:
    • Fixed issue with the start position for tokens from LetterNumberTokenizer being off by one except for the first one.
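
For reference, the Chebyshev and Minkowski metrics mentioned under Learning Core have simple closed forms. The sketch below writes them out in plain Python; it is not Cognitive Foundry's Java API, and the function names are hypothetical.

```python
from typing import Sequence


def minkowski_distance(x: Sequence[float], y: Sequence[float], p: float) -> float:
    """Minkowski distance of order p >= 1: (sum_i |x_i - y_i|^p)^(1/p)."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same dimensionality")
    if p < 1:
        raise ValueError("p must be at least 1")
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def chebyshev_distance(x: Sequence[float], y: Sequence[float]) -> float:
    """Chebyshev distance: the largest coordinate-wise absolute difference."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same dimensionality")
    return max((abs(a - b) for a, b in zip(x, y)), default=0.0)
```

With p = 1 the Minkowski distance is the Manhattan distance, with p = 2 it is the Euclidean distance, and as p grows it approaches the Chebyshev distance.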

MICP 1.04

by kay_brodersen - March 26, 2013, 12:42:04 CET. 2825 views, 537 downloads, 2 subscriptions

About: This toolbox implements models for Bayesian mixed-effects inference on classification performance in hierarchical classification analyses.

Changes:

In addition to the existing MATLAB implementation, the toolbox now also contains an R package of the variational Bayesian algorithm for mixed-effects inference.


cbMDS Correlation Based Multi Dimensional Scaling 1.1

by emstrick - March 11, 2013, 11:47:39 CET. 994 views, 269 downloads, 1 subscription

About: The aim is to embed a given data relationship matrix into a low-dimensional Euclidean space such that the point distances / distance ranks correlate best with the original input relationships. Input relationships may be given as (asymmetric) distances, dissimilarities, or (negative) scores. Input-output relations are modelled as row-conditioned. (Weighted) Pearson and soft Spearman rank correlation, and unweighted soft Kendall correlation are supported correlation measures for input/output object neighborhood relationships.
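
As an illustration of one of the correlation measures named above, the following sketch computes a weighted Pearson correlation between two sequences, for example one row of input relationships and the corresponding embedded point distances. It uses the standard textbook formula and is not taken from the cbMDS code; the function name and argument layout are assumptions.

```python
import math
from typing import Sequence


def weighted_pearson(x: Sequence[float], y: Sequence[float], w: Sequence[float]) -> float:
    """Weighted Pearson correlation of x and y with non-negative weights w."""
    if not (len(x) == len(y) == len(w)):
        raise ValueError("x, y and w must have the same length")
    total = sum(w)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    mean_x = sum(wi * xi for wi, xi in zip(w, x)) / total
    mean_y = sum(wi * yi for wi, yi in zip(w, y)) / total
    cov = sum(wi * (xi - mean_x) * (yi - mean_y) for wi, xi, yi in zip(w, x, y)) / total
    var_x = sum(wi * (xi - mean_x) ** 2 for wi, xi in zip(w, x)) / total
    var_y = sum(wi * (yi - mean_y) ** 2 for wi, yi in zip(w, y)) / total
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

Setting all weights to the same value recovers the ordinary Pearson correlation.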

Changes:
  • Initial release (Ver 1.0): Weighted Pearson correlation and soft Spearman rank correlation, Tue Dec 4 16:14:51 CET 2012

  • Ver 1.1 Added soft Kendall correlation, Fri Mar 8 08:41:09 CET 2013


JMLR GPML Gaussian Processes for Machine Learning Toolbox 3.2

by hn - January 21, 2013, 15:34:50 CET. 10017 views, 2773 downloads, 3 subscriptions

Rating: 5 of 5 stars (based on 2 votes)

About: The GPML toolbox is a flexible and generic Octave 3.2.x and Matlab 7.x implementation of inference and prediction in Gaussian Process (GP) models.

Changes:

We now support inference on large datasets using the FITC approximation for non-Gaussian likelihoods for EP and Laplace's approximation. New likelihood functions: mixture likelihood, Poisson likelihood, label noise. We added two MCMC samplers.


LIBOL 0.1.0

by stevenhoi - December 27, 2012, 18:09:54 CET. 2265 views, 148 downloads, 1 subscription

About: LIBOL is an open-source library that consists of a family of state-of-the-art online learning algorithms for machine learning and data mining research.

Changes:

Initial Announcement on mloss.org.


FABIA 2.4.0

by hochreit - December 20, 2012, 14:20:58 CET. 5511 views, 1095 downloads, 1 subscription

Rating: 4.5 of 5 stars (based on 1 vote)

About: FABIA is a biclustering algorithm that clusters rows and columns of a matrix simultaneously. Consequently, members of a row cluster are similar to each other on a subset of columns and, analogously, members of a column cluster are similar to each other on a subset of rows. Biclusters are found by factor analysis where both the factors and the loading matrix are sparse. FABIA is a multiplicative model that extracts linear dependencies between samples and feature patterns. Applications include detection of transcriptional modules in gene expression data and identification of haplotypes/"identity by descent" consisting of rare variants obtained by next generation sequencing.

Changes:

CHANGES IN VERSION 2.4.0

  • spfabia bug fixes

CHANGES IN VERSION 2.3.1

NEW FEATURES

  • Getters and setters for class Factorization

2.0.0:

  • spfabia: fabia for a sparse data matrix (in sparse matrix format) and sparse vector/matrix computations in the code to speed up computations. spfabia applications: (a) detecting "identity by descent" in next generation sequencing data with rare variants, (b) detecting "shared haplotypes" in disease studies based on next generation sequencing data with rare variants;
  • fabia for non-negative factorization (parameter: non_negative);
  • changed to C and removed the dependency on Rcpp;
  • improved update for lambda (alpha should be smaller, e.g. 0.03);
  • introduced maximal number of row elements (lL);
  • introduced cycle bL when upper bounds nL or lL are effective;
  • reduced computational complexity;
  • bug fixes: (a) update formula for lambda: tighter approximation, (b) corrected inverse of the conditional covariance matrix of z;

1.4.0:

  • New option nL: maximal number of biclusters per row element;
  • Sort biclusters according to information content;
  • Improved and extended preprocessing;
  • Update to R 2.13

ELKI 0.5.5

by erich - December 14, 2012, 18:49:58 CET. 4408 views, 792 downloads, 2 subscriptions

About: ELKI is a framework for implementing data-mining algorithms with support for index structures. It includes a wide variety of clustering and outlier detection methods.

Changes:

This is mostly a bug fix release. Many small issues have been fixed; among other things, the fixes improve performance, make error reporting much better, and ease the use of sparse vectors and external precomputed distances.

This will be the last ELKI release to support Java 6. The next ELKI release will require Java 7.

Algorithms

  • Some new LOF variants (LDF, SimpleLOF, SimpleKernelDensityLOF)
  • Correlation Outlier Probabilities (ICDM 2012)
  • A naive mean-shift clustering
  • Single-link clustering (SLINK algorithm) should be significantly faster due to optimized data structures
  • "Benchmarking" algorithms for measuring the performance of index structures

Index layer

  • Bulk loading R-Trees should be faster - in particular Sort Tile Recursive can work very well.
  • M-Trees have been refactored and optimized for double distances

Database layer

  • Bundle format (work in progress): low-level binary format for fast data exchange
  • DBID and DataStore layer received some additional classes for further performance improvements
  • KNN heap structures were revisited. The code is less clean now, but performs better in benchmarks.

Visualizations

  • General clean up and API simplifications
  • Some additional modules and improvements

Various

  • There is a new parameter class, RandomParameter
  • Some new distributions were added, also to the data set generator.

Tutorials

  • The website has new tutorials, including one on a k-means variation that produces equal sized clusters.

gensim 0.8.6

by Radim - December 9, 2012, 13:15:16 CET. 10822 views, 2159 downloads, 1 subscription

About: Python Framework for Vector Space Modelling that can handle unlimited datasets (streamed input, online algorithms work incrementally in constant memory).

Changes:
  • added the "hashing trick" (by Homer Strong)
  • support for adding target classes in SVMlight format (by Corrado Monti)
  • fixed problems with global lemmatizer object when running in parallel on Windows
  • parallelization of Wikipedia processing + added script version that lemmatizes the input documents
  • added class method to initialize Dictionary from an existing corpus (by Marko Burjek)
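
The "hashing trick" in the list above maps tokens to feature ids with a hash function instead of a stored vocabulary, which keeps memory use constant on streamed input. The sketch below shows the general idea in plain Python; it is not gensim's API, and the function name `hashed_bow`, the use of CRC32, and the default bucket count are assumptions made for illustration.

```python
import zlib
from collections import Counter
from typing import Iterable, List, Tuple


def hashed_bow(tokens: Iterable[str], num_buckets: int = 1024) -> List[Tuple[int, int]]:
    """Turn tokens into a sparse bag-of-words of (bucket id, count) pairs."""
    if num_buckets < 1:
        raise ValueError("num_buckets must be positive")
    counts: Counter = Counter()
    for token in tokens:
        # A fixed hash (CRC32 here) gives the same id in every process,
        # unlike Python's built-in hash(), which is salted for strings.
        bucket = zlib.crc32(token.encode("utf-8")) % num_buckets
        counts[bucket] += 1
    return sorted(counts.items())


# Usage: no dictionary has to be built or kept in memory beforehand.
vector = hashed_bow("human computer interaction human".split(), num_buckets=64)
```

The price of dropping the vocabulary is that distinct tokens can collide in the same bucket; a larger `num_buckets` makes collisions rarer.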

About: Stochastic neighbor embedding aims at the reconstruction of given distance, dissimilarity, or score neighborhood relations in a low-dimensional Euclidean space. This can be regarded as a general approach to multi-dimensional scaling, but the reconstruction is based on the definition of input (and output) neighborhood probability alone. Probability of score exceedance is used for neighborhood probability estimation, which is connected to soft-rank optimization. The present implementation makes use of quasi 2nd order gradient-based (l-)BFGS optimization.

Changes:
  • scoretoprob.m replaced by d2p.m

  • protein score data set added

  • trank.m computes (mid/max -tied) ranks along columns of matrix

  • local P- neighborhood probability estimation added

  • experimental soft_rank_SNE added for minimizing KL between probabilities of exceedance in source and embedding space

  • symmetry option removed, because it behaved strangely in the previous version


PLEASD 0.1

by heroesneverdie - September 10, 2012, 03:53:26 CET. 1038 views, 166 downloads, 1 subscription

About: PLEASD: A Matlab Toolbox for Structured Learning

Changes:

Initial Announcement on mloss.org.


libmind alpha 1

by neuromancer - September 4, 2012, 04:30:57 CET. 659 views, 146 downloads, 1 subscription

About: A general purpose library to process and predict sequences of elements using echo state networks.

Changes:

Initial Announcement on mloss.org.


JMLR Mulan 1.4.0

by lefman - August 1, 2012, 09:49:21 CET. 9955 views, 4744 downloads, 1 subscription

About: Mulan is an open-source Java library for learning from multi-label datasets. Multi-label datasets consist of training examples of a target function that has multiple binary target variables. This means that each item of a multi-label dataset can be a member of multiple categories or annotated by many labels (classes). This is actually the nature of many real world problems such as semantic annotation of images and video, web page categorization, direct marketing, functional genomics and music categorization into genres and emotions.

Changes:

Learners

  • BinaryRelevance.java: improved data handling that avoids copying the entire input space, leading to significant speedups for large datasets and very large numbers of labels.
  • RAkEL.java: updated technical information, added a check for the case where the number of labels is less than or equal to the size of the subset.
  • MultiLabelKNN.java: now checks whether the number of instances is less than the number of requested nearest neighbors.
  • Addition of AdaBoostMH.java, an explicit implementation of AdaBoost.MH as a combination of AdaBoostM1 and IncludeLabelsClassifier.
  • Addition of MLPTO.java, the Multi Label Probabilistic Threshold Optimizer (MLPTO) thresholding technique.
  • Addition of ApproximateExampleBasedFMeasureOptimizer.java, an approximate method for the maximization of example-based F-measure.

Measures/Evaluation

  • Addition of Specificity measure (example-based, micro/macro label-based)
  • Addition of Mean Average Interpolated Precision (MAiP), Geometric Mean Average Precision (GMAP), Geometric Mean Average Interpolated Precision (GMAiP).
  • New methods for stratified multi-label evaluation.
  • Added support for outputting per label results for all measures that implement the MacroAverageMeasure interface.
  • Simplifying the "strictness" issue of information retrieval measures, by adopting specific assumptions (outlined in the new class InformationRetrievalMeasures.java) to handle special cases, instead of the less clear and useful solution of outputting NaN and the less realistic solution of ignoring special cases.
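
To make the new Specificity measure concrete, the sketch below computes its micro- and macro-averaged label-based variants for binary label matrices (rows are examples, columns are labels). It is a plain Python illustration, not Mulan's Java implementation. Returning 1.0 when a label has no negative examples is an assumption of this sketch only; the conventions Mulan actually adopts for such special cases are those in InformationRetrievalMeasures.java and are not reproduced here.

```python
from typing import List, Sequence


def _specificity(tn: int, fp: int) -> float:
    # Specificity = TN / (TN + FP).
    # Assumption for this sketch: with no negatives at all, report 1.0.
    return tn / (tn + fp) if (tn + fp) else 1.0


def macro_specificity(y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]) -> float:
    """Average of the per-label specificities."""
    num_labels = len(y_true[0])
    per_label: List[float] = []
    for j in range(num_labels):
        tn = sum(1 for t, p in zip(y_true, y_pred) if not t[j] and not p[j])
        fp = sum(1 for t, p in zip(y_true, y_pred) if not t[j] and p[j])
        per_label.append(_specificity(tn, fp))
    return sum(per_label) / num_labels


def micro_specificity(y_true: Sequence[Sequence[int]], y_pred: Sequence[Sequence[int]]) -> float:
    """Specificity computed from counts pooled over all labels."""
    tn = fp = 0
    for t_row, p_row in zip(y_true, y_pred):
        for t, p in zip(t_row, p_row):
            if not t:
                if p:
                    fp += 1
                else:
                    tn += 1
    return _specificity(tn, fp)


truth = [[1, 0], [0, 0], [0, 1], [1, 0]]
predicted = [[1, 0], [1, 0], [0, 0], [1, 1]]
macro = macro_specificity(truth, predicted)
micro = micro_specificity(truth, predicted)
```

The two averages differ whenever labels have different numbers of negative examples, because micro-averaging weights each label by its share of the pooled counts.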

Bug fixes

  • Bug fix in LabelsBuilder.java.
  • Bug fix in Ranker.java.
  • Bug fix in ThresholdPrediction.java.
  • Fix for bug occurring when loading the XSD for mulan data outside the command-line environment (e.g. web applications).
  • Javadoc comment updates.

API changes

  • Upgrade to Java 1.6
  • Upgrade to JUnit 4.10
  • Upgrade to Weka 3.7.6.

Miscellaneous

  • Meaningful messages are now shown when a DataLoadException is thrown.
  • PT6(PT6Transformation.java): renamed to IncludeLabelsTransformation.java.
  • MultiLabelInstances now support serialization, as needed by the improved binary relevance transformation.
  • BinaryRelevanceAttributeEvaluator.java: updated according to latest BR improvements.

MLWizard 5.2

by remat - July 26, 2012, 15:04:14 CET. 1709 views, 377 downloads, 1 subscription

About: MLwizard recommends and optimizes classification algorithms based on meta-learning. It is a software wizard fully integrated into RapidMiner, but it can also be used as a library.

Changes:

Faster parameter optimization using genetic algorithm with predefined start population.


SVM with uncertain labels 0.2

by rflamary - July 17, 2012, 11:06:23 CET. 3049 views, 596 downloads, 2 subscriptions

About: Matlab code for learning probabilistic SVM in the presence of uncertain labels.

Changes:

Added missing dataset function (thanks to Hao Wu)


pymaBandits 1.0

by garivier - July 6, 2012, 18:32:41 CET. 2545 views, 411 downloads, 1 subscription

About: This package contains a Python and a MATLAB implementation of the most widely used algorithms for multi-armed bandit problems. The purpose of this package is to provide simple environments for comparison and numerical evaluation of policies.
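
To give a flavour of the problem setting, the sketch below simulates UCB1, one widely used index policy, on Bernoulli arms. The listing does not say which policies the package ships, so this is a generic illustration and not the package's API; the function name `ucb1` and its parameters are hypothetical.

```python
import math
import random
from typing import List, Sequence


def ucb1(arm_means: Sequence[float], horizon: int, seed: int = 0) -> List[int]:
    """Simulate UCB1 on Bernoulli arms; return how often each arm was pulled."""
    num_arms = len(arm_means)
    if horizon < num_arms:
        raise ValueError("horizon must allow every arm to be pulled once")
    rng = random.Random(seed)
    counts = [0] * num_arms   # number of pulls per arm
    sums = [0.0] * num_arms   # total reward per arm
    for t in range(1, horizon + 1):
        if t <= num_arms:
            arm = t - 1       # initialisation: pull each arm once
        else:
            # Index = empirical mean + exploration bonus sqrt(2 ln t / n_a).
            arm = max(
                range(num_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    return counts


pulls = ucb1([0.2, 0.8], horizon=2000)
```

Comparing the pull counts (or the resulting regret) across policies on the same simulated arms is the kind of numerical evaluation the description refers to.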

Changes:

Initial Announcement on mloss.org.


About: The package provides a Lagrangian approach to the posterior regularization of given linear mappings. This is important in two cases, (a) when systems are under-determined and (b) when the external model for calculating the mapping is invariant to properties such as scaling. The software may be applied in cases when the external model does not provide its own regularization strategy. In addition, the package allows attributes to be ranked according to their distortion potential to a given linear mapping.

Changes:

Version 1.1 (May 23, 2012)

  • memory and time optimizations
  • distderivrel.m now supports assessing the relevance of attribute pairs

Version 1.0 (Nov 9, 2011)

  • Initial Announcement on mloss.org.

