Domain-specific introduction to machine learning terminology, pitfalls and opportunities in CRISPR-based gene editing

Authors

O'Brien, Aidan
Burgio, Gaetan
Bauer, Denis C

Publisher

British Academy and Oxford University Press

Abstract

The use of machine learning (ML) has become prevalent in the genome engineering space, with applications ranging from predicting target site efficiency to forecasting the outcome of repair events. However, jargon and ML-specific accuracy measures have made it hard to assess the validity of individual approaches, potentially leading to misinterpretation of ML results. This review aims to close the gap by discussing ML approaches and pitfalls in the context of CRISPR gene-editing applications. Specifically, we address common considerations, such as algorithm choice, as well as problems, such as overestimating accuracy and data interoperability, by providing tangible examples from the genome-engineering domain. Equipping researchers with the knowledge to effectively use ML to better design gene-editing experiments and predict experimental outcomes will help advance the field more rapidly.

Source

Briefings in Bioinformatics

Access Statement

Open Access

License Rights

Creative Commons Attribution licence
