Bioinformatics tools for protein degradation

By Chao Hou

Degrons are short motifs recognized by E3 ubiquitin ligases, marking proteins for degradation via the ubiquitin-proteasome system. Mutations in degrons can disrupt protein degradation, leading to diseases. Despite their critical role, degron identification remains challenging. To address this, we developed Degpred, a deep learning model that predicts degrons directly from protein sequences. Degpred, based on the BERT architecture, accurately predicts degrons on a proteome-wide scale, capturing key degron-related sequence properties and surpassing traditional motif-based methods. Unlike previous approaches, which rely on known E3 motifs to identify degrons, Degpred can predict novel degrons beyond those defined by existing motif-based methods. Additionally, we identified E3 ligase motifs by analyzing degrons predicted in our collected E3-substrate interaction dataset. This allowed us to construct a regulatory network for protein degradation, assigning predicted degrons to specific E3s. Experimentally, we validated a predicted degron on CBX6, demonstrating its role in mediating CBX6 degradation via interaction with SPOP. Degpred expands the known degron landscape, offering new insights into protein degradation and the potential for drug development. Data and predictions are available at http://degron.phasep.pro/.

Chao Hou 侯超
Chao Hou 侯超
PhD of Bioinformatics

My research interests include computational biology and genetics.