Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Examining selection dynamics and limitations in multi-round protein selection of high diversity libraries

dc.contributor.authorChen, John Z.en
dc.contributor.authorGall, Barnabasen
dc.contributor.authorLu, Tommy Y.en
dc.contributor.authorHeslop, Isabellaen
dc.contributor.authorHesselson, Danielen
dc.contributor.authorNitsche, Christophen
dc.contributor.authorTham, Wai Hongen
dc.contributor.authorPayne, Richard J.en
dc.contributor.authorJackson, Colin J.en
dc.date.accessioned2026-07-03T23:41:12Z
dc.date.available2026-07-03T23:41:12Z
dc.date.issued2026en
dc.description.abstractProteins and peptides underpin essential biological functions and technological applications, from targeting disease-relevant interactions to providing broad enzymatic activities. However, engineering molecules with desired properties remains difficult, owing to complex sequence-structure-function relationships and the lack of data on specific systems. Experimental selection strategies, including directed evolution, phage display, and mRNA display, address this challenge by leveraging high diversity libraries and iterative enrichment under defined selection pressures. This allows for the identification of candidates without requiring extensive prior knowledge, and can generate extensive datasets for use in machine learning. While many selection systems exist, comparisons across different selection approaches are hindered by the lack of a unifying analytical framework. Here, we developed a toolset of broadly applicable analyses for assessing selection dynamics in multi-round or multi-condition experiments, ranging from position level analysis of sequence properties to full sequence space mappings through protein language model embeddings. Performing analyses across different systems, we identify desirable traits in selection experiments including enrichment of distinct sequence patterns and correlation between enrichment and final desired functions. Notably, even under weak selection regimes with all sequences <1% frequency, functional sequences (e.g., 70nM IC50 binder to SARS-CoV-2 main protease) are still consistently enriched. We also find repeated selections of the same starting library can help differentiate selection effects of varying conditions (e.g., different delivery of metal ligand) from system noise. These findings, along with the toolset, can be used to guide experimental design, interpretation, and troubleshooting across protein and peptide discovery platforms.en
dc.description.statusPeer-revieweden
dc.identifier.issn1741-0126en
dc.identifier.otherPubMed:42223509en
dc.identifier.scopus105041274454en
dc.identifier.urihttps://hdl.handle.net/1885/733812875
dc.language.isoenen
dc.rightsPublisher Copyright: © The Author(s) 2026. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.comen
dc.sourceProtein Engineering, Design and Selectionen
dc.subjectComputational pipelineen
dc.subjectDeep sequencingen
dc.subjectProtein engineeringen
dc.subjectProtein librariesen
dc.subjectSelectionen
dc.titleExamining selection dynamics and limitations in multi-round protein selection of high diversity librariesen
dc.typeJournal articleen
dspace.entity.typePublicationen
local.contributor.affiliationChen, John Z.; The Australian National Universityen
local.contributor.affiliationGall, Barnabas; Australian National Universityen
local.contributor.affiliationLu, Tommy Y.; ARC Centre for Innovations in Peptide & Protein Scienceen
local.contributor.affiliationHeslop, Isabella; Australian National Universityen
local.contributor.affiliationHesselson, Daniel; The University of Sydneyen
local.contributor.affiliationNitsche, Christoph; The Australian National Universityen
local.contributor.affiliationTham, Wai Hong; Walter and Eliza Hall Institute of Medical Researchen
local.contributor.affiliationPayne, Richard J.; The University of Sydneyen
local.contributor.affiliationJackson, Colin J.; The Australian National Universityen
local.identifier.doi10.1093/protein/gzag012en
local.identifier.purecc827dd4-b922-42c5-91dc-fa4c55b62ed1en
local.identifier.urlhttps://www.scopus.com/pages/publications/105041274454en
local.type.statusAccepted/In pressen

Downloads

abcd