The EU-funded ExCAPE project is combining high performance computing (HPC – also known as supercomputing) and machine learning to speed up the discovery of new drugs. Scientists developing medicines need to find out how different pharmaceutical compounds behave – their bioactivity. Testing each compound individually is time-consuming and expensive, so it is necessary to find more efficient ways of predicting which compounds are likely to be helpful.
Machine learning – ‘teaching’ computers to assess and improve their performance of a task – is already used for this, but the algorithms currently used do not work well with large amounts of very disparate data. The ExCAPE project is therefore developing more advanced algorithms and ways of implementing them to run on supercomputers, accelerating drug discovery and reducing costs.
While HPC systems were originally designed and used mainly for carrying out physics simulations, in the last ten years a new type of use has emerged: handling and analysing big datasets. When that data is expensive to generate, it is important to make the most of it, not just by analysing it but also by carrying out calculations to make new predictions. This is the case with chemogenomics (predicting the activity of compounds in the drug discovery phase of the pharmaceutical industry), and so the ExCAPE project has brought together researchers from nine institutions to create sophisticated computational models to be used with this information.
The project is focused on developing algorithms, writing software that can be used on large HPC machines, and preparing datasets that can be used to test them. Some of the software and datasets have been released publicly as Open Source and Open Data. The project’s work should result in cheaper drug development, and it also has possible applications in e-commerce, fraud detection and manufacturing.
ExCAPE in brief
- Total Budget: EUR 3 910 140 (EU contribution: EUR 3 910 140)
- Duration: 09/2015 – 08/2018
- Countries involved: Belgium (coordinator), Austria, Bulgaria, Czech Republic, Finland, Sweden, Spain, United Kingdom
Key figures in the European Union
- The average cost of developing a drug is EUR 930 million and it takes 9 to 13 years for a compound to reach the patient.
- The failure rates of pharmaceutical compounds in clinical trials are c. 54% due to a lack of efficacy and 9% due to toxicity.
- The EU pharmaceutical market is worth around EUR 260 billion annually.
High performance computing (HPC)
HPC, also known as supercomputing, refers to computing systems able to process huge amounts of data and perform complex calculations at very high speed. Its potential range of uses is vast, including better treatments and personalised healthcare, the prevention and management of large-scale natural disasters, the development of complex encryption technologies, and the production of more innovative goods and services.
In January 2018 the European Commission launched a major initiative on supercomputing – the EuroHPC Joint Undertaking – to create with Member States an integrated world-class supercomputing and data infrastructure and encourage European contribution to this field. It is expected to be operational before the end of 2018. The Commission’s proposed new Digital Europe Programme, with an overall budget of EUR 9.2 billion, also includes EUR 2.7 billion of funding for HPC.