github ml code

38226
vinta
19267
keras

Modular neural network library based on Theano.

14207
Spark

Spark is a fast and general engine for large-scale data processing.

12963
Bayesian Methods for Hackers

Book/iPython notebooks on Probabilistic Programming in Python

12268
CNTK

The Computational Network Toolkit (CNTK) by Microsoft Research is a unified deep-learning toolkit that describes neural networks as a series of computational steps via a directed graph.

10390
PredictionIO

PredictionIO, a machine learning server for software developers and data engineers.

10042
data-science-ipython-notebooks

Continually updated Data Science Python Notebooks: Spark, Hadoop MapReduce, HDFS, AWS, Kaggle, scikit-learn, matplotlib, pandas, NumPy, SciPy, and various command lines.

8632
XGBoost

A parallelized optimized general purpose gradient boosting library.

8632
XGBoost

Python bindings for eXtreme Gradient Boosting (Tree) Library

8570
Awesome TensorFlow

A list of all things related to TensorFlow

7823
Brain

Neural networks in JavaScript [Deprecated]

7120
java-deeplearning

Distributed Deep Learning Platform for Java, Clojure, Scala

7120
Deeplearning4j

Scalable deep learning for industry with parallel GPUs

7104
Dive into Machine Learning with Python Jupyter notebook and scikit-learn

"I learned Python by hacking first, and getting serious later. I wanted to do this with Machine Learning. If this is your style, join me in getting a bit ahead of yourself."

7092
PyTorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

6926
Gym

OpenAI Gym is a toolkit for developing and comparing reinforcement learning algorithms.

6862
NLP Compromise

Natural Language processing in the browser

6812
natural

General natural language facilities for node

6598
TFLearn

Deep learning library featuring a higher-level API for TensorFlow.

6402
bokeh

Interactive Web Plotting for Python.

6286
ML-From-Scratch

Implementations of Machine Learning models from scratch in Python with a focus on transparency. Aims to showcase the nuts and bolts of ML in an accessible way.

5926
CCV

C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library

5892
face_recognition

Face recognition library that recognizes and manipulates faces from Python or from the command line

5867
Universe

Universe is a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

5649
pattern

Web mining module for Python.

5453
zipline

A Pythonic algorithmic trading library.

5182
NuPIC

Numenta Platform for Intelligent Computing.

5182
numpic
4980
gensim

Topic Modelling for Humans.

4928
Synaptic

Architecture-free neural network library for Node.js and the browser

4703
Neural Networks and Deep Learning

Code samples for my book "Neural Networks and Deep Learning" [DEEP LEARNING]

4454
Go Learn

Machine Learning for Go

4362
NeuralTalk

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

4212
PHP-ML

Machine Learning library for PHP. Algorithms, Cross Validation, Neural Network, Preprocessing, Feature Extraction and much more in one library.

4132
DeepMind Lab

DeepMind Lab is a 3D learning environment based on id Software's Quake III Arena via ioquake3 and other open source software. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning.

4129
SymPy

A Python library for symbolic mathematics.

4083
aerosolve

A machine learning library by Airbnb designed from the ground up to be human friendly.

3995
Darknet

Darknet is an open source neural network framework written in C and CUDA. It is fast, easy to install, and supports CPU and GPU computation.

3681
LightGBM

Microsoft's fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

3557
Fuzzy Wuzzy

Fuzzy String Matching in Python

3344
SmileMiner

Statistical Machine Intelligence & Learning Engine

3217
Lasagne

Lightweight library to build and train neural networks in Theano.

3200
neon

Nervana's high-performance Python-based Deep Learning framework [DEEP LEARNING]

3116
Machine Learning For Hackers
3099
ggplot

Same API as ggplot2 for R.

3093
deeplearnjs

A hardware-accelerated machine intelligence library for the web

3044
skflow

Simplified interface for TensorFlow, mimicking Scikit Learn.

2925
Warp-CTC

A fast parallel implementation of Connectionist Temporal Classification (CTC), on both CPU and GPU.

2867
Scalding

A Scala API for Cascading

2805
TPOT

Tool that automatically creates and optimizes machine learning pipelines using genetic programming. Consider it your personal data science assistant, automating a tedious part of machine learning.

2798
pattern_classification
2472
Annoy

Approximate nearest neighbours implementation

2454
Pylearn2

A Machine Learning library based on Theano.

2453
DIGITS

The Deep Learning GPU Training System (DIGITS) is a web application for training deep learning models.

2399
Kaldi

Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.

2384
H2O

ML engine that supports distributed learning on Hadoop, Spark or your laptop via APIs in R, Python, Scala, REST/JSON.

2363
PyBrain

Another Python Machine Learning Library.

2348
pandas cookbook

Recipes for using Python's pandas library

2275
statsmodels

Statistical modeling and econometrics in Python.

2239
SnowNLP

A library for processing Chinese text.

2201
Breeze

Breeze is a numerical processing library for Scala.

2104
Blaze

NumPy and Pandas interface to Big Data.

2012
vincent

A Python to Vega translator.

1884
Summing Bird

Streaming MapReduce with Scalding and Storm

1859
Impala

Real-time Query for Hadoop

1757
Scikit-Image

A collection of algorithms for image processing in Python.

1684
Twitter-text

A JavaScript implementation of Twitter's text processing library

1679
DyNet

A dynamic neural network library working well with networks that have dynamic structures that change for every training instance. Written in C++ with bindings in Python.

1573
Pattern Recognition and Machine Learning

This package contains the matlab implementation of the algorithms described in the book Pattern Recognition and Machine Learning by C. Bishop.

1566
Onyx

Distributed, masterless, high performance, fault tolerant data processing. Written entirely in Clojure.

1558
Algebird

Abstract Algebra for Scala

1545
simple-statistics

A JavaScript implementation of descriptive, regression, and inference statistics. Implemented in literate JavaScript with no dependencies, designed to work in all modern browsers (including IE) as well as in Node.js.

1521
Shogun

The Shogun Machine Learning Toolbox

1511
DrQA

Reading Wikipedia to answer open-domain questions

1496
bqplot

An API for plotting in Jupyter (IPython)

1464
MIT Information Extraction Toolkit

C, C++, and Python tools for named entity recognition and relation extraction

1386
vispy

GPU-based high-performance interactive OpenGL 2D/3D data visualization library

1317
deap

Evolutionary algorithm framework.

1313
Retext

Extensible system for analyzing and manipulating natural language

1287
HLearn

a suite of libraries for interpreting machine learning models according to their algebraic structure.

1286
DeepDetect

A machine learning API and server written in C++11. It makes state of the art machine learning easy to work with and integrate into existing applications.

1280
Treat

Text REtrieval and Annotation Toolkit, definitely the most comprehensive toolkit I’ve encountered so far for Ruby

1266
Brainstorm

Fast, flexible and fun neural networks. This is the successor of PyBrain.

1239
Introduction to machine learning with scikit-learn

IPython notebooks from Data School's video tutorials on scikit-learn.

1235
Mahout

Distributed machine learning

1232
Bender

Fast Neural Networks framework built on top of Metal. Supports TensorFlow models.

1228
d3py

A plotting library for Python, based on D3.js.

1227
TDB

TensorDebugger (TDB) is a visual debugger for deep learning. It features interactive, node-by-node debugging and visualization for TensorFlow.

1223
ORYX

Lambda Architecture Framework using Apache Spark and Apache Kafka with a specialization for real-time large-scale machine learning.

1159
hebel

GPU-Accelerated Deep Learning Library in Python.

1154
PCV

Open source Python module for computer vision

1153
pydeep

Deep Learning In Python

1071
Gadfly

Crafty statistical graphics for Julia.

992
mlxtend

A library consisting of useful tools for data science and machine learning tasks.

949
nn

Neural Network package for Torch

946
sylvester

Vector and Matrix math for JavaScript.

942
Mocha

Deep Learning framework for Julia inspired by Caffe

920
Spearmint

Spearmint is a package to perform Bayesian optimization according to the algorithms outlined in the paper: Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Hugo Larochelle and Ryan P. Adams. Advances in Neural Information Processing Systems, 2012.

915
Kartograph.py

Rendering beautiful SVG maps in Python.

912
MLPNeuralNet

Fast multilayer perceptron neural network library for iOS and Mac OS X. MLPNeuralNet predicts new examples using a trained neural network. It is built on top of Apple's Accelerate Framework, using vectorized operations and hardware acceleration if available.

910
python-recsys

A Python library for implementing a Recommender System.

908
torchnet

framework for torch which provides a set of abstractions aiming at encouraging code re-use as well as encouraging modular programming

903
Datumbox

Machine Learning framework for rapid development of Machine Learning and Statistical applications

895
scikit-plot

A visualization library for quick and easy generation of common plots in data analysis and machine learning.

884
Xcessiv

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling

856
Crab

A flexible, fast recommender engine.

852
rnn

A Recurrent Neural Network library that extends Torch's nn. RNNs, LSTMs, GRUs, BRNNs, BLSTMs, etc.

823
Quepy

A python framework to transform natural language questions to queries in a database query language

798
BIDMach

CPU and GPU-accelerated Machine Learning Library.

790
PyQtGraph

A pure-python graphics and GUI library built on PyQt4 / PySide and NumPy.

786
Open Mining

Business Intelligence (BI) in Python (Pandas web interface)

783
cortex

Neural networks, regression and feature learning in Clojure.

742
SimpleAI

Python implementation of many of the artificial intelligence algorithms described in the book "Artificial Intelligence, a Modern Approach". It focuses on providing an easy to use, well documented and tested library.

727
jellyfish

a python library for doing approximate and phonetic matching of strings.

705
emcee

The Python ensemble sampling toolkit for affine-invariant MCMC.

703
thinking bayes

Book on Bayesian Analysis

703
Allen Downey’s Think Bayes Code

Code repository for Think Bayes.

654
adam

A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.

630
Awesome NLP with Ruby

Curated link list for practical natural language processing in Ruby.

627
Clojure-openNLP

Natural Language Processing in Clojure (opennlp)

627
Twitter-text-rb

A library that does auto linking and extraction of usernames, lists and hashtags in tweets

625
CardMagic-Classifier

A general classifier module to allow Bayesian and other types of classifications.

621
Encog

An advanced neural network and machine learning framework. Encog contains classes to create a wide variety of networks, as well as support classes to normalize and process data for these neural networks. Encog trains using multithreaded resilient propagation. Encog can also make use of a GPU to further speed processing time. A GUI based workbench is also provided to help model and train neural networks.

614
caffegraph

A DSL for deep neural networks

613
AIToolbox

A toolbox framework of AI modules written in Swift: Graphs/Trees, Linear Regression, Support Vector Machines, Neural Networks, PCA, KMeans, Genetic Algorithms, MDP, Mixture of Gaussians.

596
Awesome Machine Learning with Ruby

Curated list of ML related resources for Ruby

579
Polyglot

Multilingual text (NLP) processing toolkit

577
Restricted Boltzmann Machines

Restricted Boltzmann Machines in Python. [DEEP LEARNING]

560
PyMC

Markov Chain Monte Carlo sampling toolkit.

556
milk

Machine learning toolkit focused on supervised classification.

546
pgmpy

A python library for working with Probabilistic Graphical Models.

545
Cloudforest

Ensembles of decision trees in go/Golang.

532
H2O Sparkling Water

H2O and Spark interoperability.

520
Figaro

a Scala library for constructing probabilistic models.

514
autograd

Autograd automatically differentiates native Torch code. Inspired by the original Python version.

511
swix

A bare bones library that

506
OverFeat

A state-of-the-art generic dense feature extractor

504
FACTORIE

FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.

492
stanford-corenlp-python

Python wrapper for Stanford CoreNLP

490
ml.js

Machine learning and numerical analysis tools for Node.js and the Browser!

485
rusty-machine

a pure-rust machine learning library.

481
textacy

higher-level NLP built on Spacy

479
bayesian

Naive Bayesian Classification for Golang.

475
ViZDoom

ViZDoom allows developing AI bots that play Doom using only the visual information (the screen buffer). It is primarily intended for research in machine visual learning, and deep reinforcement learning, in particular.

474
Pavlov.js

Reinforcement learning using Markov Decision Processes

470
Bowtie

A dashboard library for interactive visualizations using flask socketio and react.

468
PigPen

Map-Reduce for Clojure.

459
REP

an IPython-based environment for conducting data-driven research in a consistent and reproducible way. REP is not trying to substitute scikit-learn, but extends it and provides better user experience.

447
DN2A

Digital Neural Networks Architecture

424
grt

The Gesture Recognition Toolkit (GRT) is a cross-platform, open-source, C++ machine learning library designed for real-time gesture recognition.

423
ipython-notebooks
409
hyperopt
402
jieba-php

Chinese Words Segmentation Utilities.

399
Kaggle Galaxy Challenge

Winning solution for the Galaxy Challenge on Kaggle

397
DataFrames

library for working with tabular data in Julia

394
auto_ml

Automated machine learning for production and analytics. Lets you focus on the fun parts of ML, while outputting production-ready code, and detailed analytics of your dataset and results. Includes support for NLP, XGBoost, LightGBM, and soon, deep learning.

389
SKLL

A wrapper around scikit-learn that makes it simpler to conduct experiments.

374
Twitter Text Java

A Java implementation of Twitter's text processing library

371
MeTA

MeTA : ModErn Text Analysis is a C++ Data Sciences Toolkit that facilitates mining big text data.

368
regression-js

A javascript library containing a collection of least squares fitting methods for finding a trend in a set of data.

368
Dora

Tools for exploratory data analysis in Python.

355
Machine Learning Module

Class on machine learning with PDF, lectures, code

355
pastalog

Simple, realtime visualization of neural network training performance.

352
dp

A deep learning library designed for streamlining research and development using the Torch7 distribution. It emphasizes flexibility through the elegant use of object-oriented design patterns.

350
brushfire

Distributed decision tree ensemble learning in Scala

349
gago

Multi-population, flexible, parallel genetic algorithm.

349
Knet

Koc University Deep Learning Framework

340
machineJS

Automated machine learning, data formatting, ensembling, and hyperparameter optimization for competitions and exploration; just give it a .csv file!

337
LambdaNet

Configurable Neural Networks in Haskell

337
Conjecture

Scalable Machine Learning in Scalding

333
BrainCore

The iOS and OS X neural network framework

329
Fido

A highly-modular C++ machine learning library for embedded electronics and robotics.

315
rsruby

Ruby - R bridge

309
metric-learn

A Python module for metric learning.

304
pyhsmm

library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.

304
Bioruby
300
SparklingPandas

Pandas on PySpark (POPS)

296
WalnutiQ

object oriented model of the human brain

291
nilearn

Machine learning for NeuroImaging in Python

291
Swift Brain

The first neural network / machine learning library written in Swift. This is a project for AI algorithms in Swift for iOS and OS X development. This project includes algorithms focused on Bayes' theorem, neural networks, SVMs, matrices, etc.

289
datakit

A lightweight framework for data analysis in JavaScript

286
ggfortify

Unified interface to ggplot2 popular R packages.

282
Simple Bayes

A Simple Bayes / Naive Bayes implementation in Elixir.

277
PyNLPl

Python Natural Language Processing Library. General purpose NLP library for Python. Also contains some specific modules for parsing common NLP formats, most notably for FoLiA, but also ARPA language models, Moses phrasetables, GIZA++ alignments.

274
RustNN

RustNN is a feedforward neural network library.

272
NeuralN

C++ Neural Network library for Node.js. It has advantage on large dataset and multi-threaded training.

266
Allen Downey’s Think OS Code

Text and supporting code for Think OS: A Brief Introduction to Operating Systems.

265
Featureforge

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

259
Pyevolve

Genetic algorithm framework.

255
Distributions

A Julia package for probability distributions and associated functions.

250
GoNN

GoNN is an implementation of Neural Network in Go Language, which includes BPNN, RBF, PCN

250
CLTK

The Classical Language Toolkit

250
Petrel

Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python.

250
rb-libsvm

Ruby language bindings for LIBSVM which is a Library for Support Vector Machines

249
nngraph

This package provides graphical computation for nn library in Torch7.

246
cutorch

Torch CUDA Implementation

245
Arel
244
Clever Algorithms For Machine Learning
243
Chalk

Chalk is a natural language processing library.

242
VIGRA

VIGRA is a generic cross-platform C++ computer vision and machine learning library for volumes of arbitrary dimensionality with Python bindings.

242
Vigranumpy

Python bindings for the VIGRA C++ computer vision library.

239
rustlearn

a machine learning framework featuring logistic regression, support vector machines, decision trees and random forests.

238
BIDMat

CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.

237
Node-SVM

Support Vector Machine for Node.js

232
Stemmer

Expose libstemmer_c to Ruby

219
pycascading
215
htm.java

General Machine Learning library using Numenta’s Cortical Learning Algorithm

210
skbayes

Python package for Bayesian Machine Learning with scikit-learn API

205
Clojush

The Push programming language and the PushGP genetic programming system implemented in Clojure

204
gobrain

Neural Networks written in go

187
dpnn

Many useful features that aren't part of the main nn package.

176
optim

An optimization library for Torch. SGD, Adagrad, Conjugate-Gradient, LBFGS, RProp and more.

174
Node-fann

FANN (Fast Artificial Neural Network Library) bindings for Node.js

167
jRuby Mahout

JRuby Mahout is a gem that unleashes the power of Apache Mahout in the world of JRuby.

167
Big Data For Chimps
165
Rosetta

Text processing tools and wrappers (e.g. Vowpal Wabbit)

163
LDA.js

LDA topic modeling for Node.js

162
Infer

Inference and machine learning in Clojure

162
clortex

General Machine Learning library using Numenta’s Cortical Learning Algorithm

159
Images

An image library for Julia

159
cunn

Torch CUDA Neural Network Implementation

155
fuku-ml

Simple machine learning library, including Perceptron, Regression, Support Vector Machine, Decision Tree and more. It is easy to use and easy to learn for beginners.

152
visualize_ML

A python package for data exploration and data analysis.

150
Intel(R) DAAL

A high performance software library developed by Intel and optimized for Intel's architectures. The library provides algorithmic building blocks for all stages of data analytics and can process data in batch, online and distributed modes.

150
Inflections-clj

Rails-like inflection library for Clojure and ClojureScript

150
go-galib

Genetic Algorithms library written in Go / Golang

149
SOMPY

Self Organizing Map written in Python (Uses neural networks for data analysis).

147
TDSP-Utilities

Two data science utilities in R from Microsoft: 1) Interactive Data Exploration, Analysis, and Reporting (IDEAR) ; 2) Automated Modeling and Reporting (AMR).

145
genius

A Chinese word segmenter based on Conditional Random Fields.

143
GLM

Generalized linear models in Julia

143
deeplearn-rs

deeplearn-rs provides simple networks that use matrix multiplication, addition, and ReLU under the MIT license.

142
BLLIP Parser

BLLIP Natural Language Parser (also known as the Charniak-Johnson parser)

141
Decision Trees

NodeJS Implementation of Decision Tree using ID3 Algorithm

138
Clj-ML

A machine learning library for Clojure built on top of Weka and friends

138
go-ml

Linear / Logistic regression, Neural Networks, Collaborative Filtering and Gaussian Multivariate Distribution

136
Encog

Clojure wrapper for Encog (v3) (Machine-Learning framework that specializes in neural-nets)

136
LightGraphs

Graph modeling and analysis

135
Recommender

A C library for product recommendations/suggestions using collaborative filtering (CF).

133
climin

Optimization library focused on machine learning, pythonic implementations of gradient descent, LBFGS, rmsprop, adadelta and others

129
go-porterstemmer

A native Go clean room implementation of the Porter Stemming algorithm.

128
Text Analysis

Julia package for text analysis

127
Mamba

Markov chain Monte Carlo (MCMC) for Bayesian analysis in Julia

124
comportex

Functionally composable Machine Learning library using Numenta’s Cortical Learning Algorithm

124
ScikitLearn

Julia implementation of the scikit-learn API

124
kaggle insults

Kaggle Submission for "Detecting Insults in Social Commentary"

117
Stemmer

An English (Porter2) stemming implementation in Elixir.

117
kaggle-blackbox

Deep learning made easy

116
neurolab

https://github.com/zueve/neurolab

116
Glean

A data management tool for humans

114
DynaML

Scala Library/REPL for Machine Learning Research

113
TensorFlow

OCaml bindings for TensorFlow.

113
nut

Natural language Understanding Toolkit

112
Touchstone

Clojure A/B testing library

112
ganitha

scalding powered machine learning

108
somoclu

Massively parallel self-organizing maps: accelerate training on multicore CPUs, GPUs, and clusters, has python API.

106
sentiment_classifier

Sentiment classifier using word sense disambiguation.

105
YCML

A Machine Learning framework for Objective-C and Swift (OS X / iOS).

104
ClearNLP

The ClearNLP project provides software and resources for natural language processing. The project started at the Center for Computational Language and EducAtion Research, and is currently developed by the Center for Language and Information Research at Emory University. This project is under the Apache 2 license.

102
Hydrosphere Mist

a service for deploying Apache Spark MLlib machine learning models as realtime, batch or reactive web services.

99
manifold

A package to manipulate manifolds

98
MLBase

A set of functions to support the development of machine learning algorithms

94
SuperLearner

and subsemble - Multi-algorithm ensemble learning packages.

93
Mixed Models

A Julia package for fitting (statistical) mixed-effects models

92
nnx

A completely unstable and experimental package that extends Torch's builtin nn library

92
bioscala

Bioinformatics for the Scala programming language

91
Merlin

Flexible Deep Learning Framework in Julia

90
Fungp

A genetic programming library for Clojure

90
Data Frames Meta

Metaprogramming tools for DataFrames

90
breze

Theano based library for deep and recurrent neural networks

89
Pattern Recognition Toolbox

A complete object-oriented environment for machine learning in Matlab.

87
unsup

A package for unsupervised learning in Torch. Provides modules that are compatible with nn (LinearPsd, ConvPsd, AutoEncoder, ...), and self-contained algorithms (k-means, PCA).

85
Decision Tree

Decision Tree Classifier and Regressor

83
RF

Random forests implementation in Go

82
kalman

Kalman filter for Javascript.

82
topik

Topic modelling toolkit

81
Clustering

Basic functions for clustering data: k-means, dp-means, etc.

81
MLKit

A simple Machine Learning Framework written in Swift. Currently features Simple Linear Regression, Polynomial Regression, and Ridge Regression.

80
shaman

Node.js library with support for both simple and multiple linear regression.

80
Time Series

Time series toolkit for Julia

80
Crab

A recommendation engine library for Python

80
scikit-learn tutorials

Series of notebooks for learning scikit-learn

79
XGBoost

eXtreme Gradient Boosting Package in Julia

78
BanditLib

A simple Multi-armed Bandit library.

78
go-graph

Graph library for Go/Golang language.

78
YAlign

A sentence aligner, a friendly tool for extracting parallel sentences from comparable corpora.

78
Perfect TensorFlow

Swift Language Bindings of TensorFlow. Using native TensorFlow models on both macOS / Linux.

77
DSP

Digital Signal Processing (filtering, periodograms, spectrograms, window functions).

76
MachineLearning

Julia Machine Learning library

75
go-ngram

In-memory n-gram index with compression.

75
Bolt

Bolt Online Learning Toolbox

72
enlighten-apply

Example code and materials that illustrate applications of SAS machine learning techniques.

71
word-embedding

Word Embeddings: the full implementation of word2vec, GloVe in Go.

71
PredictionBuilder

A library for machine learning that builds predictions using a linear regression.

71
NuPIC Studio

An all-in-one NuPIC Hierarchical Temporal Memory visualization and debugging super-tool!

69
rgf_python

Python bindings for Regularized Greedy Forest (Tree) Library.

69
jProcessing

Kanji / Hiragana / Katakana to Romaji Converter. Edict Dictionary & parallel sentences Search. Sentence Similarity between two JP Sentences. Sentiment Analysis of Japanese Text. Run Cabocha (ISO-8859-1 configured) in Python.

69
Allen Downey’s Think Complexity Code

Code for Allen Downey's book Think Complexity.

68
data-visualization-ruby

Source code and supporting content for my Ruby Manor presentation on Data Visualisation with Ruby

67
colibri-core

C++ library, command line tools, and Python binding for extracting and working with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.

67
colibri-core

Python binding to C++ library for extracting and working with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.

66
JuliaCon Presentations

Presentations for JuliaCon

65
Neon Course

IPython notebooks for a complete course around understanding Nervana's Neon

64
Envision

Clojure Data Visualisation library, based on Statistiker and D3

62
RDataSets

Julia package for loading many of the data sets available in R

56
Kaggle Dogs vs. Cats

Code for Kaggle Dogs vs. Cats competition

53
ANN

Julia artificial neural networks

53
kaggle_acquire-valued-shoppers-challenge

Code for the Kaggle acquire valued shoppers challenge

52
Statistiker

Basic Machine Learning algorithms in Clojure.

52
kaggle-advertised-salaries

Predicting job salaries from ads - a Kaggle competition

51
GreatCircle

Library for calculating great circle distance.

48
go-pr

Pattern recognition package in Go lang.

47
Hypothesis Tests

Hypothesis tests for Julia

47
BayesPy

Bayesian Inference Tools in Python

45
Learning.js

Javascript implementation of logistic regression/c4.5 decision tree

45
Data Arrays

Data structures that allow missing values

44
haskell-ml

Haskell implementations of various ML algorithms.

44
Data Read

Read files from Stata, SAS, and SPSS

43
Core torch7 demos repository

43
Distance

Levenshtein and Hamming distance computation

43
UEA Stemmer

Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing

42
statkit

Statistics kit for JavaScript

42
Gaussian Processes

Julia package for Gaussian processes

42
Graph Layout

Graph layout algorithms in pure Julia

41
Lyric

Linear Regression library.

40
Hadoop

Hadoop/HDFS

40
Kmeans.js

Simple Javascript implementation of the k-means algorithm, for node.js and the browser

40
Regression

Algorithms for regression analysis (e.g. linear regression and logistic regression)

39
PGM

A Julia framework for probabilistic graphical models.

39
kaggle-cifar

Code for the CIFAR-10 competition at Kaggle, uses cuda-convnet

39
Kaggle Stackoverflow

Predicting closed questions on Stack Overflow

37
svm

Torch-SVM library

37
plot-rb

A plotting library in Ruby built on top of Vega and D3.

37
enlighten-integration

Example code and materials that illustrate techniques for integrating SAS with other analytics technologies in Java, PMML, Python and R.

36
Neural

A neural network in Julia

36
Training a Convnet for the Galaxy-Zoo Kaggle challenge(CUDA demo)
36
sentiment-analyzer

Tweets Sentiment Analyzer

35
Bayesian-Bandit

Bayesian bandit implementation for Node and the browser.

35
kNear

JavaScript implementation of the k nearest neighbors algorithm for supervised learning

35
NMF

A Julia package for non-negative matrix factorization

35
PyStanfordDependencies

Python interface for converting Penn Treebank trees to Stanford Dependencies.

34
torch-datasets

Scripts to load several popular datasets including:

33
rapaio

statistics, data mining and machine learning toolbox in Java

33
python-zpar

Python bindings for ZPar, a statistical part-of-speech-tagger, constituency parser, and dependency parser for English.

32
MAChineLearning

An Objective-C multilayer perceptron library, with full support for training through backpropagation. Implemented using vDSP and vecLib, it's 20 times faster than its Java equivalent. Includes sample code for use from Swift.

31
PyDexter

Simple plotting for Python. Wrapper for D3xterjs; easily render charts in-browser.

31
Ruby Machine Learning

Some Machine Learning algorithms, implemented in Ruby

30
ucto

Unicode-aware regular-expression based tokenizer for various languages. Tool and C++ library. Supports FoLiA format.

30
Clustering.js

Clustering algorithms implemented in Javascript for Node.js and the browser

30
graph

Graph package for Torch

30
BPN-NeuralNetwork

Implements a three-layer neural network (input layer, hidden layer and output layer), known as a Back Propagation Neural Network (BPN). The network can be used for product recommendation, user behavior analysis, data mining and data analysis.

29
frog

Memory-based NLP suite developed for Dutch: PoS tagger, lemmatiser, dependency parser, NER, shallow parser, morphological analyzer.

29
SVM

SVMs for Julia

29
Kernel Density

Kernel density estimators for Julia

29
Allen Downey’s Data Science Course

Code for Data Science at Olin College, Spring 2014.

29
dm-flow

Library of SAS Enterprise Miner process flow diagrams to help you learn by example about specific data mining topics.

28
go-mxnet-predictor

Go binding for MXNet c_predict_api to do inference with pre-trained model

28
randomkit

Numpy's randomkit, wrapped for Torch

27
neat

Plug-and-play, parallel Go framework for NeuroEvolution of Augmenting Topologies (NEAT)

27
cephes

Cephes mathematical functions library, wrapped for Torch. Provides and wraps the 180+ special mathematical functions from the Cephes mathematical library, developed by Stephen L. Moshier. It is used, among many other places, at the heart of SciPy.

26
Distance

Julia module for Distance evaluation

26
Topic Models

TopicModels for Julia

25
Montague

Montague is a semantic parsing library for Scala with an easy-to-use DSL.

24
HDBScan

Implementation of the HDBSCAN algorithm in Python, used for clustering

23
ToPS

This is an object-oriented framework that facilitates the integration of probabilistic models for sequences over a user-defined alphabet.

23
lambda-ml

Simple, concise implementations of machine learning techniques and utilities in Clojure.

22
kaggle amazon

Amazon access control challenge

21
Dimensionality Reduction

Methods for dimensionality reduction

21
Lunum
21
python-ucto

Python binding to ucto (a Unicode-aware rule-based tokenizer for various languages)

20
MCMC

MCMC tools for Julia

20
ManifoldLearning

A Julia package for manifold learning and nonlinear dimensionality reduction

20
Multi-Perceptron-NeuralNetwork

Implements a multi-perceptron neural network based on the Back Propagation Neural Network (BPN), designed to support an unlimited number of hidden layers.

20
CoverTree

Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree

20
decision-weights
19
machine

Simple machine learning algorithm.

19
crystal-fann

FANN (Fast Artificial Neural Network) binding.

18
Listof

Community-based data collection, packed in a gem. Get a list of pretty much anything (stop words, countries, non-words) in txt, JSON or hash.

17
paicehusk

Golang implementation of the Paice/Husk Stemming Algorithm.

17
GaussianMixtures

Large scale Gaussian Mixture Models

17
Atari2600

Scripts to generate a dataset with static frames from the Arcade Learning Environment

17
xerial

Data management utilities for Scala

16
imgraph

An image/graph library for Torch. This package provides routines to construct graphs on images, segment them, build trees out of them, and convert them back to images.

16
group-lasso

Some experiments with the coordinate descent algorithm used in the (Sparse) Group Lasso model

15
cerebro2

A web-based visualization and debugging platform for NuPIC.

14
KRKmeans-Algorithm

Implements K-Means, a clustering and classification algorithm. It can be used in data mining and image compression.

14
python-frog

Python binding to Frog, an NLP suite for Dutch (PoS tagging, lemmatisation, dependency parsing, NER).

14
Machine Learning Ruby
13
hopfield-networks

Hopfield Networks for unsupervised learning in Haskell.

13
used for machine learning

.

13
dockerface

Easy to install and use deep learning Faster R-CNN face detection for images and video in a docker container.

13
Image-to-Image Translation with Conditional Adversarial Networks

Implementation of image-to-image (pix2pix) translation from the paper by Isola et al. [DEEP LEARNING]

13
Neuron

Neuron is a simple class for time series prediction. It uses LNU (Linear Neural Unit), QNU (Quadratic Neural Unit), RBF (Radial Basis Function), MLP (Multi Layer Perceptron) and MLP-ELM (Multi Layer Perceptron - Extreme Learning Machine) neural networks, trained with gradient descent or the Levenberg–Marquardt algorithm.

12
naive-apl

Naive Bayesian Classifier implementation in APL

12
Local Regression

Local regression, so smooooth!

12
Online Learning
12
Music Tagging

Music Tagging scripts for torch7

12
BioPy

Biologically-Inspired and Machine Learning Algorithms in Python.

11
wine-quality

Predicting wine quality

11
enlighten-deep

Example code and materials that illustrate using neural networks with several hidden layers in SAS.

10
Simple MCMC

Basic MCMC sampler implemented in Julia

10
ROCAnalysis

Receiver Operating Characteristics and functions for evaluating probabilistic binary classifiers

10
KRHebbian-Algorithm

An unsupervised, self-learning algorithm (it adjusts the weights) for neural networks in machine learning.

10
kaggle-accelerometer

Code for Accelerometer Biometric Competition at Kaggle

9
Gaussian Mixture Model

Unsupervised machine learning with multivariate Gaussian mixture model

9
KRFuzzyCMeans-Algorithm

Implements Fuzzy C-Means (FCM), a fuzzy clustering / classification algorithm in machine learning. It can be used in data mining and image compression.

9
python-timbl

A Python extension module wrapping the full TiMBL C++ programming interface. Timbl is an elaborate k-Nearest Neighbours machine learning toolkit.

9
Kaggle Gender

A Kaggle competition: discriminate gender based on handwriting

8
libfolia

C++ library for the FoLiA format

8
videograph

A video/graph library for Torch. This package provides routines to construct graphs on videos, segment them, build trees out of them, and convert them back to videos.

8
fex

A package for feature extraction in Torch. Provides SIFT and dSIFT modules.

8
Sarah Palin LDA

Topic Modeling the Sarah Palin emails.

8
Data Driven Code

Very simple implementation of neural networks for dummies in python without using any libraries, with detailed comments.

7
snowball

Snowball Stemmer for Go.

7
wiki challenge

An implementation of Dell Zhang's solution to Wikipedia's Participation Challenge on Kaggle

6
LKYDeepNN

A header-only C++11 neural network library. Low dependency, with native Traditional Chinese documentation.

6
OpenGM

OpenGM is a C++ library for graphical modeling, and inference. The Lua bindings provide a simple way of describing graphs, from Lua, and then optimizing them with OpenGM.

6
2012-paper-diginorm
6
mne-python-notebooks

IPython notebooks for EEG/MEG data processing using mne-python

5
Hybrid Recommender System

A hybrid recommender system based upon scikit-learn algorithms.

5
DA

Julia package for Regularized Discriminant Analysis

5
Julia Data

library for working with tabular data in Julia

5
Stats

Statistical tests for Julia

5
yase

Transcode a sentence (or other sequence) to a list of word vectors

5
thinking stats 2
5
Kaggle Merck

Merck challenge at Kaggle

4
kaggle-bestbuy_big

Code for the Best Buy competition at Kaggle

3
Naive Bayes

Simple Naive Bayes implementation in Julia

3
saliency

Code and tools around integral images. A library for finding interest points based on fast integral histograms.

3
stitch

Allows us to use Hugin to stitch images and apply the same stitching to a video sequence

3
sfm

A bundle adjustment/structure from motion package

3
kaggle-bestbuy_small
2
SignalProcessing

Signal Processing tools for Julia

2
lbfgs

FFI Wrapper for liblbfgs

2
LuaSHKit

A Lua wrapper around the locality-sensitive hashing library SHKit

2
kernel smoothing

KNN, kernel-weighted average, local linear regression smoothers

2
Support Vector Machines
2
Diffusion Segmentation

A collection of image segmentation algorithms based on diffusion methods

1
vowpalwabbit

An old vowpalwabbit interface to torch.

1
sphagetti

Spaghetti (sparse linear) module for torch7 by @MichaelMathieu

1
Naive Bayes
1
Scipy Tutorials

SciPy tutorials. This is outdated; check out scipy-lecture-notes.
