Solution for a system at Abivin JSC
My personal selection: BOINC
Suitable selection for my company: Spark
(BOINC: pure C++, interesting; Hadoop: old generation; Spark: big improvement on Hadoop; Apex: not so developed; Storm: small processing only; HPCC: no community)
- BOINC: Originally created for SETI@home project but now used by a lot of volunteered network computing projects.
- Hadoop: Most people know about this framework for distributed storage and distributed computing specially with Java. Some unofficial containers available on Docker Hub.
- Next-gen Hadoop. Website.
- Officially available on Docker Hub
- High-Performance Computing Cluster. Website
Data query only:
Data streaming (no processing):
- Samza: Website