I’m a postdoc at Columbia University in the Shen Lab, where I focus on applying AI for biology. My work centers on developing and interpreting deep learning models, with a particular emphasis on protein language models. I’m interested in exploring how protein structural dynamics can be incorporated into deep learning models and how to accurately predict protein fitness landscapes. I developed SeqDance and ESMDance, two protein language models trained on protein dynamics data. I explained the scaling behaviour of ESM2 and other models on fitness prediction.
I earned my Ph.D. in Bioinformatics from Peking University in the Li Lab, where I built computational tools for studying phase separation and protein degradation.
I was born in Zhucheng, China, and I enjoy basketball, photography, and skiing.
Updated: Aug 2025
Postdoc, 2023-
Columbia University
PhD of Bioinformatics, 2020-2023
Peking University
Bachelor of Medicine and Economics, 2015-2020
Peking University
SeqDance-A Protein Language Model for Representing Protein Dynamic Properties
PhaSepDB, PhaSePred and MloDisDB
Degpred-predict degron via deep learning