Abstract
Background: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is highly mutable, so mutations arise readily during transmission. As the epidemic has continued to develop, several mutated strains have emerged. Researchers worldwide are working on the effective identification of SARS-CoV-2.
Objective: In this paper, we propose SCVfilter, a new deep learning method that can effectively identify SARS-CoV-2 variant sequences. SCVfilter is a deep hybrid model with embedding, an attention residual network, and long short-term memory as components.
Methods: Deep learning is effective at extracting rich features from sequence data, and therefore has significant implications for the study of Coronavirus Disease 2019 (COVID-19), which has become prevalent in recent years. SCVfilter applies this approach to SARS-CoV-2 variant sequences by combining embedding, an attention residual network, and long short-term memory in a single deep hybrid model.
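As an illustration of the kind of input an embedding component typically consumes, the following minimal sketch maps a nucleotide string to a fixed-length list of integer tokens. The abstract does not describe SCVfilter's actual tokenization, so the vocabulary, the padding and unknown-base indices, and the function name `encode_sequence` are all assumptions made for this example.

```python
# Hypothetical vocabulary (an assumption, not taken from SCVfilter):
# 0 is reserved for padding, 5 for ambiguous or unknown bases such as "N".
VOCAB = {"A": 1, "C": 2, "G": 3, "T": 4}
PAD, UNK = 0, 5


def encode_sequence(seq, max_len):
    """Map a nucleotide string to a list of exactly max_len integer tokens."""
    # Truncate to max_len, then look up each base (case-insensitive).
    tokens = [VOCAB.get(base, UNK) for base in seq.upper()[:max_len]]
    # Right-pad short sequences so every sample has the same length.
    tokens += [PAD] * (max_len - len(tokens))
    return tokens


if __name__ == "__main__":
    print(encode_sequence("acgtn", 8))
```

In a scheme like this, short or partially sequenced genomes are simply padded to a common length, and ambiguous bases share a single token, so sequences of varying completeness can be batched together before they reach the embedding component.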
Results: SCVfilter achieves an accuracy of 93.833% on Dataset-I, which consists of different variant strains; 90.367% on Dataset-II, which consists of data collected from China, Taiwan, and Hong Kong; and 79.701% on Dataset-III, which consists of data collected from six continents (Africa, Asia, Europe, North America, Oceania, and South America).
Conclusion: When processing lengthy, high-homology SARS-CoV-2 data, SCVfilter can automatically select features and accurately detect different variant strains of SARS-CoV-2. In addition, SCVfilter is sufficiently robust to handle the problems caused by sample imbalance and sequence incompleteness.
Other: SCVfilter is an open-source method available at https://github.com/deconvolutionw/SCVfilter.
Keywords: COVID-19, SARS-CoV-2, variant strains, deep learning, SCVfilter, residual network.