Engineering enzymes with AI
Aalto University and VTT scientists have launched a project to create new enzymes with the help of machine learning. The goal is to accelerate the transition towards a circular bioeconomy.
Enzymes are the biological catalysts of all living things. They are already widely used in industrial biotechnological processes, like in the production of bioethanol, drugs like antibiotics or in making bioplastics. However, biological systems have evolved to maximise their survival fitness, instead of industrial value. The potential of biological diversity is still far from being fully utilised.
‘Synthetic biology enables intelligent bioengineering, which is expected to increase in importance and replace many current processes based on fossil raw materials in the future,’ says Merja Penttilä, research professor at VTT and adjunct professor at Aalto University. ‘We can design and engineer novel cells to produce basically any useful product, and enzymes are key players to make this happen.’
Penttilä leads the BioDesign research project with Professor Samuel Kaski of Aalto University, who is also director of the Finnish Center for Artificial Intelligence FCAI. The project recently received almost two million euros in funding from the Jane and Aatos Erkko Foundation to provide solutions to the challenge of customising enzymes, including designing novel proteins.
Proteins are the main drivers of cellular activity, and hence an important target of research. Google’s AlphaFold-2 made headlines in 2022 when it demonstrated that the 3D structure of almost any protein can be predicted with high accuracy, thus solving one of the longest-standing open problems in bioinformatics, the protein folding problem.
However, much work remains to discover and engineer proteins that can serve as enzymes with desired functions or properties.
‘Our goal is to create totally new enzymes that are capable of transcending natural evolutionary principles and maximising their industrial utility,’ states Penttilä. ‘If we succeed, we will revolutionise possibilities to transition from a fossil-based economy to a circular bioeconomy.’
If we succeed, we will revolutionise possibilities to transition from a fossil-based economy to a circular bioeconomy
Adjunct Professor Merja Penttilä, Aalto University
Unlocking the full potential of custom-built enzymes
The BioDesign project is gathering scientists from molecular biology, synthetic biology, machine learning and computer science to work on an actively learning next-generation probabilistic model that is steered by insights from experts.
‘Generative models, such as ChatGPT and Stable Diffusion, have opened unprecedented opportunities for text and visual applications. Even though our project may not have quite as wide an audience, we believe that its potential impact could be similarly transformative,’ says Vikas Garg, assistant professor at Aalto University’s Department of Computer Science.
Garg’s research has focused on devising machine learning models that could help discover new proteins and molecules with desired properties, or what is referred to as the inverse protein folding problem. In 2018, Garg and his colleagues at MIT’s Computer Science and Artificial Intelligence Lab (CSAIL) were the first to introduce a graph-based deep learning model for generating new protein sequences that could fold into a given 3D protein structure.
More recently at Aalto, Garg has been working closely with Kaski, developing a generative model to create novel candidate molecules that resemble their real-world counterparts. Garg has also recently made inroads in developing proteins and molecules with designer functionalities.
‘We might want to design batteries with better efficiency, but lower power requirements and smaller carbon footprints,’ says Garg. ‘However, it is not always possible to perfectly segregate these factors. We’ve taken an important step toward tackling this issue in one of our recent papers by learning to unravel, and represent, complex interdependencies while disentangling the factors.’
Virtual laboratories allow human-AI collaboration to flourish
The BioDesign research team’s key idea is to construct a fully virtual cycle, or laboratory, for enzyme engineering, which allows human-AI collaboration to flourish. Virtual laboratories are AI-driven environments for simulated measurements with human designers and researchers. The outcomes of the simulations are tested in physical molecular biology laboratories, which generate experimental data. This data is fed back to the virtual simulations to increase learning and prediction power.
‘The virtual laboratory will help to not only incorporate new data as it becomes available from automated experiments, but also leverage human expertise in steering the AI model toward better solutions,’ says Kaski, who is developing AI methods for human-AI collaboration with his FCAI colleagues.
‘New enzymes discovered through this cycle could not only speed up current chemical reactions, but also enable alternative biological pathways toward synthesizing better materials, batteries, biofertilizers, and drugs,’ says Garg.
It is perhaps no surprise that synthetic biology is considered a key enabling technology for a sustainable future in the bioeconomy strategies of the EU and the US. Finland is well positioned to punch above its weight in the field, with investment and interest in the bioeconomy acceleration.
‘Finland has a world-class combination of competences both in AI methods development and molecular biology, which underpin research in synthetic biology,’ says Juho Rousu, professor of computer science at Aalto University, who leads a research group that develops machine learning methods, models and tools for small molecules. ‘The broader aim of this project is to keep Finland in the global forefront of synthetic biology and thereby enable advanced biomanufacturing to serve a sustainable circular bioeconomy.’
Originally published on the Aalto University website