Solution for a system at Abivin JSC
My personal selection: BOINC
Suitable selection for my company: Spark
(BOINC: pure C++, interesting; Hadoop: old generation; Spark: big improvement on Hadoop; Apex: not so developed; Storm: small processing only; HPCC: no community)

Honourable mention:

  • BOINC: Originally created for SETI@home project but now used by a lot of volunteered network computing projects.

Famous mention:

  • Hadoop: Most people know about this framework for distributed storage and distributed computing specially with Java. Some unofficial containers available on Docker Hub.

Modern frameworks:

Distributed storage:

Data query only:

Data streaming (no processing):

Message distribution:

Small projects: