PROTEUS mission is to investigate and develop ready-to-use scalable online machine learning algorithms and real-time interactive visual analytics to deal with extremely large data sets and data streams.
The foundation for the PROTEUS advances is the use of an optimized implementation of combined batch and streaming processing and building around this later scalable real time processes. The developed algorithms and techniques will form a library to be integrated into an enhanced version of Apache Flink, the EU Big Data platform. PROTEUS will contribute to the Big Data area by addressing fundamental challenges related to the scalability and responsiveness of analytics capabilities. The requirements are defined by a steelmaking industrial use case. The techniques developed in PROTEUS are however, general, flexible and portable to all data stream-based domains.
In particular, the project will go beyond the current state-of-art technology by making the following specific original contributions:
The PROTEUS impact is manifold: