Projects that are tagged with cuda.


Logo Theano 1.0.0

by jaberg - November 16, 2017, 17:42:27 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 37027 views, 6214 downloads, 3 subscriptions

About: A Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Dynamically generates CPU and GPU modules for good performance. Deep Learning Tutorials illustrate deep learning with Theano.

Changes:

Theano 1.0.0 (15th of November, 2017)

Highlights (since 0.9.0):

  • Announcing that MILA will stop developing Theano <https://groups.google.com/d/msg/theano-users/7Poq8BZutbY/rNCIfvAEAwAJ>_

  • conda packages now available and updated in our own conda channel mila-udem To install it: conda install -c mila-udem theano pygpu

  • Support NumPy 1.13

  • Support pygpu 0.7

  • Moved Python 3.* minimum supported version from 3.3 to 3.4

  • Added conda recipe

  • Replaced deprecated package nose-parameterized with up-to-date package parameterized for Theano requirements

  • Theano now internally uses sha256 instead of md5 to work on systems that forbid md5 for security reason

  • Removed old GPU backend theano.sandbox.cuda. New backend theano.gpuarray is now the official GPU backend

  • Make sure MKL uses GNU OpenMP

  • NB: Matrix dot product (gemm) with mkl from conda could return wrong results in some cases. We have reported the problem upstream and we have a work around that raises an error with information about how to fix it.

  • Improved elemwise operations

  • Speed-up elemwise ops based on SciPy

  • Fixed memory leaks related to elemwise ops on GPU

  • Scan improvements

  • Speed up Theano scan compilation and gradient computation

  • Added meaningful message when missing inputs to scan

  • Speed up graph toposort algorithm

  • Faster C compilation by massively using a new interface for op params

  • Faster optimization step, with new optional destroy handler

  • Documentation updated and more complete

  • Added documentation for RNNBlock

  • Updated conv documentation

  • Support more debuggers for PdbBreakpoint

  • Many bug fixes, crash fixes and warning improvements


Logo Somoclu 1.7.4

by peterwittek - June 6, 2017, 15:48:11 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 32855 views, 5905 downloads, 3 subscriptions

About: Somoclu is a massively parallel implementation of self-organizing maps. It relies on OpenMP for multicore execution, MPI for distributing the workload, and it can be accelerated by CUDA on a GPU cluster. A sparse kernel is also included, which is useful for training maps on vector spaces generated in text mining processes. Apart from a command line interface, Python, Julia, R, and MATLAB are supported.

Changes:
  • New: Verbosity parameter in the command-line, Python, MATLAB, and Julia interfaces.
  • Changed: Calculation of U-matrix parallelized.
  • Changed: Moved feeding data to train method in the Python interface.
  • Fixed: The random seed was set to 0 for testing purposes. This is now changed to a wall-time based initialization.
  • Fixed: Sparse matrix reader made more robust.
  • Fixed: Compatibility with kohonen 3 resolved.
  • Fixed: Compatibility with Matplotlib 2 resolved.

Logo deepdetect 0.1

by beniz - June 2, 2015, 09:25:28 CET [ Project Homepage BibTeX Download ] 2597 views, 667 downloads, 3 subscriptions

About: A Deep Learning API and server

Changes:

Initial Announcement on mloss.org.


Logo MShadow 1.0

by antinucleon - April 10, 2014, 02:57:54 CET [ Project Homepage BibTeX Download ] 3774 views, 955 downloads, 1 subscription

About: Lightweight CPU/GPU Matrix/Tensor Template Library in C++/CUDA. Support element-wise expression expand in high performance. Code once, run smoothly on both GPU and CPU

Changes:

Initial Announcement on mloss.org.


Logo CXXNET 0.1

by antinucleon - April 10, 2014, 02:47:08 CET [ Project Homepage BibTeX Download ] 4065 views, 977 downloads, 1 subscription

About: CXXNET (spelled as: C plus plus net) is a neural network toolkit build on mshadow(https://github.com/tqchen/mshadow). It is yet another implementation of (convolutional) neural network. It is in C++, with about 1000 lines of network layer implementations, easily configuration via config file, and can get the state of art performance.

Changes:

Initial Announcement on mloss.org.