Abstract
Targeted gene delivery is crucial to gene therapy. Adeno-associated virus (AAV) has emerged as a primary gene therapy vector because of its broad host range, long-term expression, and low pathogenicity. However, AAV vectors have limitations, including immunogenicity and insufficient targeting. Designing or modifying capsids is a potential way to improve the efficacy of gene delivery, but it is hindered by the incomplete understanding of AAV biology, the complexity of the capsids, and the limitations of current screening methods. Artificial intelligence (AI), especially machine learning (ML), has great potential to accelerate and improve the optimization of capsid properties and to reduce development time and manufacturing costs. This review introduces the traditional methods of designing AAV capsids and the general steps of building a sequence-function ML model, highlights the applications of ML in the development workflow, and summarizes its advantages and challenges.
Keywords: Gene therapy, vector, adeno-associated virus, capsid, directed evolution, machine learning.