Abstract
Proteins play a critical role in regulating diverse biological processes related to human life. Demand for functional proteins is increasing, yet such proteins are sparse in the immense space of possible sequences. Protein design has therefore become an important task in many fields, including medicine, food, energy, and materials. Directed evolution, which modifies proteins at the molecular level, has significantly advanced enzyme engineering, metabolic engineering, medicine, and other areas. However, screening large numbers of synthetic sequences is not sufficient on its own to identify desirable sequences. As a result, computational methods, including data-driven machine learning and physics-based molecular modeling, have been introduced to protein engineering to produce more functional proteins. This review focuses on recent advances in computational protein design, highlighting the applicability of different approaches as well as their limitations.
Keywords: Protein design, machine learning, neural networks, deep learning, molecular modeling, computational protein design.