The EU-funded MAESTRA project invented a solution to big data complexity in machine learning (ML) systems, which was up to now an untameable challenge. Structured big data-based predictions can generate wonderful answers to questions in medicine, sociology, natural resources and energy, biology and many more disciplines. However, large raw data fed to the machines often comes from different sources, are unstructured, streaming at rates too high for the machine to deal with, or all of the above. Newly published results of Sašo Džeroski's MAESTRA project team offer a solution.

Maestra project logo

The MAESTRA software is based on a tree-based and rule-based machine learning methods and is able to process extensive sets of data or streams of data, including even the incompletely labelled or network data, and give them a logical structure.

The software was already tested in a variety of fields. For example, their method was successfully applied to predict the phenotypes of micro-organisms from their genotypes and gene functions identified compounds to help treat tuberculosis and salmonella. Further, in the solar energy discipline, the MAESTRA methods were applied to predict both the production and the consumption of energy from different kinds of sensor data in different contexts. Similarly, Džeroski's team predicted equipment failures in trains and taxi demand from transport data, improved the accuracy of sentiment analysis and image annotation in social media, and worked on ML processes in the contexts of drug repurposing, tumor mutation, personalised medicine, brain informatics, sustainable food production and biodiversity.

For the future the MAESTRA team hopes for other users to further customise the tools (developed in open source)  for commercial Artificial Intelligence's applications and add their own user interfaces. ‘This will allow MAESTRA partners to develop secondary products in the form of tools and services that are easier to use for potential customers,’ Džeroski says.


MAESTRA  is part of the Future and Emerging Technologies (FET) programme and falls within the scope of the EU Big Data policy

More information

More information on this project is available on CORDIS :