Time series prediction of COVID-19 by mutation rate analysis using recurrent neural network-based LSTM model

dc.creatorKhan Pathan, Refat
dc.creatorBiswas, Munmun
dc.creatorUddin Khandaker, Mayeen
dc.date.accessioned2020-07-24T19:17:39Z
dc.date.available2020-07-24T19:17:39Z
dc.date.created2020
dc.description.abstractSARS-CoV-2, a novel coronavirus mostly known as COVID-19 has created a global pandemic. The world is now immobilized by this infectious RNA virus. As of June 15, already more than 7.9 million people have been infected and 432k people died. This RNA virus has the ability to do the mutation in the human body. Accurate determination of mutation rates is essential to comprehend the evolution of this virus and to determine the risk of emergent infectious disease. This study explores the mutation rate of the whole genomic sequence gathered from the patient’s dataset of different countries. The collected dataset is processed to determine the nucleotide mutation and codon mutation separately. Furthermore, based on the size of the dataset, the determined mutation rate is categorized for four different regions: China, Australia, the United States, and the rest of the World. It has been found that a huge amount of Thymine (T) and Adenine (A) are mutated to other nucleotides for all regions, but codons are not frequently mutating like nucleotides. A recurrent neural network-based Long Short Term Memory (LSTM) model has been applied to predict the future mutation rate of this virus. The LSTM model gives Root Mean Square Error (RMSE) of 0.06 in testing and 0.04 in training, which is an optimized value. Using this train and testing process, the nucleotide mutation rate of 400th patient in future time has been predicted. About 0.1% increment in mutation rate is found for mutating of nucleotides from T to C and G, C to G and G to T. While a decrement of 0.1% is seen for mutating of T to A, and A to C. It is found that this model can be used to predict day basis mutation rates if more patient data is available in updated time.spa
dc.format.extent7 páginasspa
dc.format.mimetypeimage/jepgspa
dc.identifier.doihttps://doi.org/10.1016/j.chaos.2020.110018spa
dc.identifier.issn0960-0779spa
dc.identifier.otherhttps://doi.org/10.1016/j.chaos.2020.110018spa
dc.identifier.urihttps://hdl.handle.net/20.500.12010/11113
dc.publisherChaos, Solitons & Fractalseng
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.sourcereponame:Expeditio Repositorio Institucional UJTLspa
dc.sourceinstname:Universidad de Bogotá Jorge Tadeo Lozanospa
dc.subjectSARS-Cov-2spa
dc.subjectGene sequencespa
dc.subjectMutation ratespa
dc.subjectNeural Networkspa
dc.subjectLSTM modelspa
dc.subject.lembSíndrome respiratorio agudo gravespa
dc.subject.lembCOVID-19spa
dc.subject.lembSARS-CoV-2spa
dc.subject.lembCoronavirusspa
dc.titleTime series prediction of COVID-19 by mutation rate analysis using recurrent neural network-based LSTM modelspa
dc.type.hasversioninfo:eu-repo/semantics/acceptedVersionspa
dc.type.localArtículospa

Archivos

Bloque original

Mostrando 1 - 2 de 2
Cargando...
Miniatura
Nombre:
Captura.PNG
Tamaño:
196.26 KB
Formato:
Portable Network Graphics
Descripción:
Ver portada
Cargando...
Miniatura
Nombre:
Time-series-prediction-of-COVID-19-by-mutation-rate-analy_2020_Chaos--Solito.pdf
Tamaño:
2.65 MB
Formato:
Adobe Portable Document Format
Descripción:
Artículo reservado

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
2.87 KB
Formato:
Item-specific license agreed upon to submission
Descripción: