A structural alphabet is a way to represent protein structures using a discrete set of symbols, similar to how amino acids represent protein sequences. These symbols capture local backbone conformations, allowing complex 3D structures to be encoded as linear sequences. This alphabet is generated by Foldseek.
By converting 3D structures into sequences of structural characters, traditional sequence alignment methods can be applied to compare protein structures. This enables multiple structure alignments written out as MSAs and subsequent downstream analysis.
Alignments determine how structures are compared and analyzed. Poor alignments can lead to incorrect evolutionary interpretations, misidentification of structural similarities, and errors in downstream applications like phylogenetic analysis.
Structome-AlignViewer allows users to visualize 3Di character alignments alongside molecular structures. It provides tools to inspect alignment quality using structure-based visualization to ensure accuracy in structure-based evolutionary analyses.
The application now supports four different input methods, available via tabs on the input page:
1hv4_A;1hv4_B
). The server will download the corresponding structures from the RCSB PDB..zip
archive containing multiple single-chain structure files in mmCIF (.cif
) format. This allows for the analysis of custom structures.For methods that generate a new alignment, users can choose between the ClustalW and MAFFT alignment algorithms. As of 1st Sept, 2025 Z-scores profiles are being constructed for SCOP and CATH alignmnets. Once they are ready, MAFFT will be able to assess alignment scores based on these backgrounds.
The results page displays:
When analyzing alignments, it is crucial to assess the reliability of each column. The Confidence score is a proxy for identifying regions where structures are well-aligned versus areas of uncertainty. This score is derived from the substitution matrix, which assigns values to character replacements based on their likelihood in structural alignments.Higher scores indicate greater agreement between structures, while lower scores suggest high variability or misalignment. The Average (Avg) provides an overall measure of alignment quality by averaging the per-column confidence scores across only the well-aligned columns (after filtering out low-information regions). This ensures that the global score reflects the alignment's reliability rather than being biased by poor regions. A higher global confidence score suggests a well-conserved alignment, while a lower score may indicate inconsistent structural relationships. This information can be used to refine alignments or highlight regions needing further inspection.
Update 26th March, 2025 : The confidence per column is now encoded as b-factor and visualised on each structure. These structures can now be downloaded. Low confidence regions appear as blue (score approaching 0), whereas high confidence regions appear red (score approaching 1).
The statistics panel summarizes the quality of the alignment. This panel provides:
These statistics help users quickly assess the reliability and structural consistency of an alignment.
Users can download:
One modifications is planned.
This enhancements will be rolled out soon-ish. 🤞
If you use Structome-AlignViewer in your work, please cite:
@article{malik2025structome,
title = {Structome-AlignViewer: On Confidence Assessment in Structure-Aware Alignments},
author = {Malik, Ashar J and Mao, Siying and Hugenholtz, Philip J and Ascher, David B},
journal = {bioRxiv},
pages = {2025--05},
year = {2025},
publisher = {Cold Spring Harbor Laboratory}
}
(The peer-reviewed version will be linked here once available.)
Structome-AlignViewer would like to thank the anonymous reviewer for its most recent upgrade.
For any issues with the server or functionality, please email and include job IDs.
Ashar Malik
Email: ashar.malik@uq.edu.au