SAMOA: Scalable advanced massive online analysis

Gianmarco Morales, Albert Bifet

Research output: Contribution to journalArticle

92 Citations (Scopus)


samoa (Scalable Advanced Massive Online Analysis) is a platform for mining big data streams. It provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms. It features a pluggable architecture that allows it to run on several distributed stream processing engines such as Storm, S4, and Samza. samoa is written in Java, is open source, and is available at under the Apache Software License version 2.0.

Original languageEnglish
Pages (from-to)149-153
Number of pages5
JournalJournal of Machine Learning Research
Publication statusPublished - 2015
Externally publishedYes



  • Classification
  • Clustering
  • Data streams
  • Distributed systems
  • Machine learning
  • Regression
  • Toolbox

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Software
  • Statistics and Probability
  • Artificial Intelligence

Cite this