SAMOA: Scalable advanced massive online analysis

Gianmarco Morales, Albert Bifet

Research output: Contribution to journalArticle

92 Citations (Scopus)

Abstract

samoa (Scalable Advanced Massive Online Analysis) is a platform for mining big data streams. It provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Storm, S4, and Samza. samoa is written in Java, is open source, and is available at http://samoa-project.net under the Apache Software License version 2.0.

Original languageEnglish
Pages (from-to)149-153
Number of pages5
JournalJournal of Machine Learning Research
Volume16
Publication statusPublished - 2015
Externally publishedYes

    Fingerprint

Keywords

  • Classification
  • Clustering
  • Data streams
  • Distributed systems
  • Machine learning
  • Regression
  • Toolbox

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Software
  • Statistics and Probability
  • Artificial Intelligence

Cite this