Machine learning approaches and algorithms

Data mining

Clustering

Neural networks

  • Theory of neural networks

    The best way to understand neural networks is to stop thinking of them as intelligent and instead treat them as function approximators. When you give a neural network n inputs and expect m outputs, you are really just trying to approximate some unknown function f: R^n → R^m. The nice thing about real-world functions is that they tend not to be too pathological: they are at least mostly continuous, they vary more or less smoothly from input to input, and so on. When this holds, the function tends to have a 'nice' approximation. The underlying assumption of neural networks is that if I take a 'random' function and push it toward my ideal function enough times in enough places, it will eventually converge to that ideal function. Because real-world functions are mostly smooth (plus some noise term), this process is very likely to work in real-world situations. What you refer to as 'yielding intelligence' is really just the slow process of the original randomness being replaced by something more closely resembling the ideal function.

    I will add that natural intelligences do appear to store data in very logical ways; the human brain, for instance, has many highly specialized areas. So, as with human brains, highly trained neural networks are not likely to store data chaotically, but rather in a highly ordered way whose ordering happened to be defined by more or less random events. Taylor White
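
    To make the 'push a random function toward the ideal one' idea concrete, here is a minimal sketch in Python/NumPy: a one-hidden-layer network, starting from random weights, is nudged toward sin(x) by plain gradient descent on squared error. The target function, layer size, learning rate, and step count are all illustrative choices, not a recipe.

      import numpy as np

      rng = np.random.default_rng(0)

      def target_fn(x):
          return np.sin(x)  # a smooth "real world" function to approximate

      # random initial function: y = W2 @ tanh(W1 @ x + b1) + b2
      H = 16
      W1 = rng.normal(size=(H, 1)); b1 = np.zeros((H, 1))
      W2 = rng.normal(size=(1, H)); b2 = np.zeros((1, 1))

      x = np.linspace(-np.pi, np.pi, 200).reshape(1, -1)  # inputs, shape (1, N)
      y = target_fn(x)                                    # desired outputs

      lr = 0.01
      for step in range(5000):
          h = np.tanh(W1 @ x + b1)      # hidden activations
          y_hat = W2 @ h + b2           # the network's current approximation
          err = y_hat - y               # how far we are from the ideal function

          # gradient of the mean squared error (the factor of 2 is folded
          # into the learning rate)
          gW2 = err @ h.T / x.shape[1]
          gb2 = err.mean(axis=1, keepdims=True)
          gh = (W2.T @ err) * (1 - h**2)
          gW1 = gh @ x.T / x.shape[1]
          gb1 = gh.mean(axis=1, keepdims=True)

          # 'push' the random function a little toward the target
          W2 -= lr * gW2; b2 -= lr * gb2
          W1 -= lr * gW1; b1 -= lr * gb1

      print("final mean squared error:", float((err**2).mean()))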

    I think you are asking how an ANN performs the approximation and what is happening within the network to yield the result. If so, I might have an initial response for you. In one of my courses on machine learning, our teacher explained NNs with an analogy to SVM kernels: the idea is to transform the data points into a space with better separation properties. Each layer of connections is a new transformation that further improves the separation properties of the space. That is how you can approximate the separation function that Taylor spoke about. The next sentences are my own interpretation, and I'm far from being an expert in NNs, so feel free to refute them :) : the more layers you have, the better the transformation MAY be; however, you also have a greater risk of overfitting! Cédric Penet
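
    A small illustration of the 'each layer transforms the data into a better-separated space' view: XOR is the classic problem that is not linearly separable in the input space, yet after a single hidden layer it becomes linearly separable. The hidden-layer weights here are hand-picked for clarity, not learned.

      import numpy as np

      X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
      y = np.array([0, 1, 1, 0])  # XOR labels: no line separates them in X

      # hand-picked hidden layer: h1 = relu(x1 + x2), h2 = relu(x1 + x2 - 1)
      W1 = np.array([[1.0, 1.0],
                     [1.0, 1.0]])
      b1 = np.array([0.0, -1.0])
      H = np.maximum(0.0, X @ W1 + b1)  # transformed points, shape (4, 2)

      # In the hidden space, the linear function h1 - 2*h2 gives exactly
      # 0, 1, 1, 0 — so a simple threshold at 0.5 separates the classes.
      scores = H @ np.array([1.0, -2.0])
      print(H)       # the transformed, now linearly separable points
      print(scores)  # [0. 1. 1. 0.]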

    Basically, it is multiple logistic regressions, interconnected and running in parallel. The result is a non-linear equation with a large number of variables that, as was said above, is a universal function approximator: it maps inputs to outputs based on the examples learned during supervised training. The biological neural network is a very different thing. It is far more complex, with over 35 neurotransmitter chemicals, and much that “modern science” just does not understand. Brad Morantz PhD
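
    The 'interconnected logistic regressions' view in a few lines: a single artificial neuron with a sigmoid activation is exactly a logistic regression unit, a layer is several of them evaluated in parallel, and a network stacks such layers. The weights below are placeholder values, not trained ones.

      import numpy as np

      def sigmoid(z):
          return 1.0 / (1.0 + np.exp(-z))

      x = np.array([0.5, -1.2, 3.0])  # one input vector
      w = np.array([0.8, 0.1, -0.4])  # weights (placeholder values)
      b = 0.2                         # bias (placeholder value)

      # one neuron = one logistic regression: sigmoid(w . x + b)
      print(sigmoid(w @ x + b))

      # a layer of 4 such neurons evaluated in parallel
      W = np.full((4, 3), 0.1)        # placeholder layer weights
      bias = np.zeros(4)
      print(sigmoid(W @ x + bias))    # 4 logistic regressions at once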

Conferences & courses

Tools & Libraries

  • Shark Machine Learning Library – a C++ library with support for regression and classification tasks (including linear methods, neural networks, kernel methods, support vector machines, …), solving discrete and continuous optimization problems, and evolutionary algorithms for multi-objective optimization.
  • WEKA – a collection of visualization tools and algorithms for data analysis and predictive modelling written in Java.
  • Carrot2 – Open Source Search Results Clustering Engine.
  • Apache Mahout – Apache project to produce free implementations of distributed machine learning algorithms on the Hadoop platform.
  • MALLET – a Java-based package for statistical natural language processing, document classification, clustering, topic modelling, information extraction, and other…
  • Java-ML – a collection of machine learning algorithms (clustering, classification, cross validation, …)
  • OpenCV – includes image processing, video analysis, feature detection, statistical models, Bayes classifiers, SVMs, decision trees, neural networks, clustering, …
  • SMILE – a fully portable library of C++ classes implementing graphical decision-theoretic methods, such as Bayesian networks and influence diagrams.
  • Top Machine Learning Projects for Julia – machine learning projects for Julia, a high-level, high-performance dynamic programming language for technical computing, with syntax that is familiar to users of other technical computing environments.