This submission belongs to the session S5. Computational Antibody Engineering of the event The 1st International Online Conference by Antibodies

Published date

07 Oct, 2025

Academic Editor

Cecile King

Citation

Klara Kropivsek, Katharina Dost, Christian Leonardo Camacho-Villalón, Saso Dzeroski, Ario de Marco, Multi-Objective Active Learning for Nanobody Development, in Proceedings of The 1st International Online Conference by Antibodies, 13 October–14 October 2025, MDPI: Basel, Switzerland


Multi-Objective Active Learning for Nanobody Development

Klara Kropivsek ¹

Katharina Dost ²

Christian Leonardo Camacho-Villalón ²

Saso Dzeroski ²

Ario de Marco ¹

1. Laboratory for Environmental and Life Sciences, University of Nova Gorica, Nova Gorica, Slovenia

2. Department of Knowledge Technologies, Jozef Stefan Institute, Ljubljana, Slovenia

Abstract

Introduction
Nanobodies—compact, single-domain antibody fragments—are seeing increasing use in therapeutics and diagnostics due to their high specificity and stability. However, optimizing multiple properties such as expression yield and binding affinity remains experimentally costly. While machine learning can accelerate candidate selection, its effectiveness depends on the quality and diversity of labeled data. Standard active learning (AL) approaches address this by prioritizing informative samples, but typically ignore the practical constraints critical to nanobody development.

Methods
We present a multi-objective active learning (MOAL) framework tailored to nanobody discovery. This framework integrates predictive models for binding affinity and expression yield with uncertainty estimation from ensemble learning. Candidate selection is guided by three objectives: informativeness (model improvement), feasibility (predicted expression), and performance (binding affinity). To balance trade-offs among these objectives, we apply evolutionary multi-objective optimization algorithms, specifically NSGA-II and IBEA. This enables exploration of diverse, high-potential regions of nanobody sequence space.
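The selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the data layout, the function names, and the use of summed ensemble standard deviation as the informativeness score are all assumptions made for illustration. The sketch covers only the non-dominated (Pareto) filtering that NSGA-II builds on, not the full evolutionary search, and it does not cover IBEA.

```python
from statistics import mean, pstdev

def objective_scores(affinity_preds, yield_preds):
    """Turn ensemble predictions into three objectives, all to be maximized.

    Assumed (hypothetical) layout: one inner list per candidate, holding one
    prediction per ensemble member.
    """
    scores = []
    for aff, yld in zip(affinity_preds, yield_preds):
        informativeness = pstdev(aff) + pstdev(yld)  # ensemble disagreement (assumed proxy)
        feasibility = mean(yld)                      # predicted expression yield
        performance = mean(aff)                      # predicted binding affinity
        scores.append((informativeness, feasibility, performance))
    return scores

def dominates(a, b):
    """True if a is no worse than b on every objective and better on at least one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_front(scores):
    """Indices of candidates that no other candidate dominates."""
    return [
        i for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]

# Toy predictions from a two-member ensemble for four hypothetical candidates.
affinity = [[0.9, 0.7], [0.5, 0.5], [0.4, 0.4], [0.2, 0.8]]
yields = [[0.3, 0.3], [0.8, 0.8], [0.6, 0.6], [0.5, 0.5]]

scores = objective_scores(affinity, yields)
front = pareto_front(scores)
print(front)
```

In this toy data, the third candidate is dropped because the second matches or beats it on all three objectives. The remaining candidates each represent a different trade-off, which is the kind of set a multi-objective optimizer would return for experimental follow-up.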

Results
We evaluate our framework on a curated dataset of characterized nanobody sequences and a large-scale nanobody repertoire comprising over 10 million candidates. The curated data enable supervised learning, while the repertoire supports broad exploration. Our approach identifies nanobody candidates that are both experimentally viable and model-informative, improving generalization while reducing experimental costs. By avoiding redundant queries and favoring biologically diverse selections, this method supports efficient discovery.

Conclusions
Our domain-aware MOAL approach provides an effective strategy for guiding nanobody selection under multiple constraints. It enables iterative refinement of predictive models while maintaining experimental feasibility. Though it was developed for nanobody engineering, the framework generalizes to other biological domains requiring data-efficient, multi-objective decision-making.

Keywords

nanobodies

nanobody engineering

active learning

yield

developability

machine learning

multi-objective active learning

Poster

Kropivsek_Active_Learning_Nanobodies.pdf
