APSPM 2026

SMBE Australasian Protein Structural Phylogenetics Meeting

An SMBE Regional Meeting in Australasia

-16Days
08Hours
03Minutes
46Seconds

Did you know? For 50 species, there are approximately $2.84 \times 10^{76}$ possible rooted trees.

APSPM 2026 Workshops

Protein structures are known to be 3x to 10x more conserved compared to protein sequences - allowing for inference of deep homology, which often is not accessible with conventional sequence analysis.

Three beginner/intermediate-level workshops will run in succession on the second day of the meeting (Tuesday 17 Feb; 1PM - 5PM) to train the audience on extracting this conserved signal from protein structures. The workshops will use cutting-edge methods allowing attendees to gain relevant skills and productively contribute to the field.

(Note: You will need a modern laptop, with a modern OS and browser to access the exercise material provided. A detailed list of softwares requirementsd will be communicated closer to the workshop date)


Workshop 1: Structural Homology Detection

Instructors:

Cameron Gilchrist, Sukhwan Park

Duration:

75 minutes

Description:

Because of tools like AlphaFold, there are now hundreds of millions of high quality protein structural predictions available online. This workshop will show how our suite of tools, including Foldseek, Foldseek-cluster, Foldseek-multimer, Folddisco and FoldMason, can be used to rapidly search and compare homologous structures from these massive structure databases.


Workshop 2: Phylogenetic Inference from Structure

Instructors:

Caroline Puente-Lelievre & Jordan Douglas

Duration:

75 minutes

Description:

Protein structures are well-conserved over long evolutionary timescales, making them particularly useful for phylogenetics when sequence divergence is high. Using IQ-tree, this workshop will demonstrate how phylogenetic trees can be estimated under the 3Di structural alphabet employed by Foldseek.

Software and Data Requirements

Please install the latest versions of these programs:

Please download the dataset and supporting files from the Zenodo repository (Douglas, Matzke, and Puente-Lelievre, 2025):

Materials:

Download Workshop Slides (PDF)


Workshop 3: Protein Language Models and Phylogenetics

Instructors:

Ashar Malik

Duration:

90 minutes

Description:

Protein Language Models (pLMs) allow latent space representation of proteins, both sequence and structure. This workshop will explore distance-space and see how this latent space can be used to extract evolutionary signals and put it towards phylogenetic inference. The workshop will introduce some toy examples and pLMs, look at a state of the art model like Prot-T5, ESM-Cambrian and SA-Prot (time permitting), show how to extract latent space representations and their respective computational transformation for phylogenetic inference.

Materials:

Distance Notebook

pLM Distance Notebook