Machine learning-coupled combinatorial mutagenesis enables resource-efficient engineering of CRISPR-Cas9 genome editor activities

Research output: Journal Publications and Reviews; RGC 21 - Publication in refereed journal; peer-review

24 Scopus Citations

Author(s)

  • Dawn G. L. Thean
  • Hoi Yee Chu
  • John H. C. Fong
  • Becky K. C. Chan
  • Peng Zhou
  • Cynthia C. S. Kwok
  • Yee Man Chan
  • Silvia Y. L. Mak
  • Gigi C. G. Choi
  • Joshua W. K. Ho
  • Alan S. L. Wong

Detail(s)

Original language: English
Article number: 2219
Journal / Publication: Nature Communications
Volume: 13
Online published: 25 Apr 2022
Publication status: Published - 2022

Abstract

The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Even when only the residues in proximity to the target DNA are considered as potential sites for optimising Cas9’s activity, the number of combinatorial variants to screen is too large for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as much as 95% while enriching top-performing variants by ∼7.5-fold in comparison to the null model. Using this approach, followed by structure-guided engineering, we identify the N888R/A889Q variant, which confers increased editing activity on the protospacer adjacent motif-relaxed KKH variant of Cas9 nuclease from Staphylococcus aureus (KKH-SaCas9) and its derived base editor in human cells. Our work validates a readily applicable workflow to enable resource-efficient, high-throughput engineering of genome editor activity.

Citation Format(s)

Machine learning-coupled combinatorial mutagenesis enables resource-efficient engineering of CRISPR-Cas9 genome editor activities. / Thean, Dawn G. L.; Chu, Hoi Yee; Fong, John H. C. et al.
In: Nature Communications, Vol. 13, 2219, 2022.
