mloss.org epachttp://mloss.orgUpdates and additions to epacenWed, 09 Oct 2013 14:00:15 -0000epac 0.10http://mloss.org/software/view/515/<html><p>Embarrassingly Parallel Array Computing (EPAC) is a machine learning workflow builder. Building machine learning workflow is like playing “lego” game: user can combine machine learning building blocks into sequential pipelines (Pipe primitive) and bind them into parallel branches. Parallel branches may stem from different pipelines (Methods primitive) processing the same array or the same pipeline applied on different arrays. As it is the case with resampled data, using cross-validation (CV primitive) or permutation. </p> <p>EPAC allows a “write it once, run it everywhere” workflow construction: the same workflow can be executed on a local multi-core computer or on a remote cluster (soma-workflow [1]). EPAC parallel execution is based on mapreduce paradigm providing an efficient adequation between computing resources and workflows. Large arrays (10 GB) can be processed thanks to EPAC memory management features (joblib [2] or memory mapping) avoiding unnecessary array duplication. </p> <p>EPAC enable user to define their own bricks which can be plugged into the workflow. User defined bricks (classes) are automatically packed to be remotely executed without any additional configuration. Such feature, based on dill [3], makes the workflow easy to extend and reinforce the “write it once, run it everywhere” capability. </p> <p>In summary, EPAC provides users with a way to easily build machine learning workflow, to run workflow faster with minimum memory footprint, to easily extend your own workflow. Feel free to try EPAC on mloss.org: https://mloss.org/software/view/515/ </p> <p>[1] Soizic Laguitton et al., Soma-workflow: A unified and simple interface to parallel computing resources, HBM Annual Meeting 2011, Quebec </p> <p>[2] Ogrisel et al., joblib: running Python function as pipeline jobs, http://pythonhosted.org/joblib/ </p> <p>[3] M.M. McKerns, L. Strand, T. Sullivan, A. Fang, M.A.G. Aivazis, "Building a framework for predictive science", Proceedings of the 10th Python in Science Conference, 2011 </p></html>Jinpeng LI, Edouard Duchesnay, Mathieu Duboism, Laure Hugo, Benoit Da MotaWed, 09 Oct 2013 14:00:15 -0000http://mloss.org/software/rss/comments/515http://mloss.org/software/view/515/machine learningarray computingworkflow builder