Machine Learning Prediction of Adenovirus D8 Conjunctivitis Complications from Viral Whole-Genome Sequence

Citation:

Nakamichi K, Akileswaran L, Meirick T, Lee MD, Chodosh J, Rajaiya J, Stroman D, Wolf-Yadlin A, Jackson Q, Holtz BW, Lee AY, Lee CS, Van Gelder RN. Machine Learning Prediction of Adenovirus D8 Conjunctivitis Complications from Viral Whole-Genome Sequence. Ophthalmol Sci 2022;2(4):100166.

Date Published:

2022 Dec

Abstract:

OBJECTIVE: To obtain complete DNA sequences of adenoviral (AdV) D8 genome from patients with conjunctivitis and determine the relation of sequence variation to clinical outcomes. DESIGN: This study is a post hoc analysis of banked conjunctival swab samples from the BAYnovation Study, a previously conducted, randomized controlled clinical trial for AdV conjunctivitis. PARTICIPANTS: Ninety-six patients with AdV D8-positive conjunctivitis who received placebo treatment in the BAYnovation Study were included in the study. METHODS: DNA from conjunctival swabs was purified and subjected to whole-genome viral DNA sequencing. Adenovirus D8 variants were identified and correlated with clinical outcomes, including 2 machine learning methods. MAIN OUTCOME MEASURES: Viral DNA sequence and development of subepithelial infiltrates (SEIs) were the main outcome measures. RESULTS: From initial sequencing of 80 AdV D8-positive samples, full adenoviral genome reconstructions were obtained for 71. A total of 630 single-nucleotide variants were identified, including 156 missense mutations. Sequence clustering revealed 3 previously unappreciated viral clades within the AdV D8 type. The likelihood of SEI development differed significantly between clades, ranging from 83% for Clade 1 to 46% for Clade 3. Genome-wide analysis of viral single-nucleotide polymorphisms failed to identify single-gene determinants of outcome. Two machine learning models were independently trained to predict clinical outcome using polymorphic sequences. Both machine learning models correctly predicted development of SEI outcomes in a newly sequenced validation set of 16 cases (P = 1.5 × 10⁻⁵). Prediction was dependent on ensemble groups of polymorphisms across multiple genes. CONCLUSIONS: Adenovirus D8 has ≥ 3 prevalent molecular substrains, which differ in propensity to result in SEIs. Development of SEIs can be accurately predicted from knowledge of full viral sequence.
These results suggest that development of SEIs in AdV D8 conjunctivitis is largely attributable to pathologic viral sequence variants within the D8 type and establish machine learning paradigms as a powerful technique for understanding viral pathogenicity.
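
The reported validation P value can be sanity-checked with a short calculation. The abstract does not name the statistical test, so the sketch below rests on an assumption: that all 16 validation cases were predicted correctly and that the P value is a one-sided binomial tail against a 50% chance rate. The function name `binomial_tail` and the 0.5 chance rate are illustrative choices, not details from the paper.

```python
from math import comb


def binomial_tail(successes: int, trials: int, p_chance: float = 0.5) -> float:
    """Return P(X >= successes) for X ~ Binomial(trials, p_chance)."""
    return sum(
        comb(trials, k) * p_chance**k * (1 - p_chance) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Assumption: 16 of 16 validation cases predicted correctly, chance rate 0.5.
p_value = binomial_tail(16, 16)
print(f"{p_value:.2e}")  # 1.53e-05
```

Under these assumptions the result, 0.5¹⁶ ≈ 1.5 × 10⁻⁵, agrees with the value quoted in the abstract, though the authors may have used a different test.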

Last updated on 01/03/2023