The analysis of the dynamic behavior of proteins has always been a core challenge in structural biology. Although traditional static structure analysis methods can provide high-resolution molecular snapshots, they are unable to capture the rapidly changing conformational changes in life activities. In recent years, hydrogen-deuterium exchange mass spectrometry (HDX-MS) technology has been undergoing revolutionary breakthroughs - from achieving near-atomic resolution dynamic tracking of single amino acids to artificial intelligence-driven full-atom dynamic modeling, this technology is redefining the boundaries of our understanding of protein functional mechanisms. With the integration of ultra-performance liquid chromatography, orbital well ultra-high resolution mass spectrometry and deep learning algorithms, HDX-MS is no longer merely an auxiliary tool in structural biology, but has become a core technology for revealing the spatiotemporal dynamics of protein conformation, providing an unprecedented dynamic molecular perspective for drug design and disease mechanism research. These breakthroughs have enabled scientists to for the first time reconstruct the "molecular movie" of protein machines with millisecond-level time resolution and amino acid-level spatial accuracy under conditions close to physiological ones.
Although the traditional HDX-MS technology can provide dynamic information of proteins, its resolution is often limited to the level of polypeptide fragments, which makes it a challenge to accurately identify the conformational changes of key amino acid residues.
With the improvement of the sensitivity of mass spectrometry instruments and the optimization of data processing algorithms, the new generation of high-resolution HDX-MS has been able to precisely map the hydrogen-deuterium exchange signal to a single amino acid site. The core of this progress lies in the combination of more efficient enzyme digestion strategies, ultra-high-performance mass spectrometry detection (such as Orbitrap and ion mobility techniques), and advanced machine learning-assisted data analysis methods.
Dynamic tracking with single-amino acid resolution enables researchers to observe subtle conformational changes that were previously difficult to capture. For instance, in the study of protein-ligand interactions, scientists can now precisely identify which amino acid residues in the binding pocket undergo local flexibility or rigidity changes after binding, and even track how allosteric effects are transmitted through the protein structure. This fine analytical ability is crucial for understanding the catalytic mechanism of enzymes, protein folding diseases (such as amyloid protein aggregation in Alzheimer's disease), and the optimal design of targeted drugs.
Furthermore, this technology also reveals the hierarchical characteristics of protein dynamics. Experimental data indicate that even within the same secondary structural unit (such as α-helix or β-fold), the hydrogen-deuterium exchange rates of different amino acid residues may vary significantly, reflecting the complex regulation of local solvent accessibility, hydrogen bond networks, and side chain interactions. For example, some residues buried in the protein core may be briefly exposed due to instantaneous structural fluctuations, and high-resolution HDX-MS is capable of capturing such brief and subtle dynamic events.
The study of membrane proteins and complex targets has always been a difficulty in structural biology, and traditional methods often have difficulty capturing their dynamic characteristics. The integration of HDX-MS with Nanodisc technology has brought a breakthrough solution to this field. Nanodisks can simulate the natural lipid bilayer environment, enabling membrane proteins to maintain their physiological conformations and dynamic behaviors, while HDX-MS provides high-resolution dynamic information. The combination of the two greatly enhances the feasibility of membrane protein research.
In traditional methods, the dissolution of membrane proteins usually relies on detergents, but this often disrupts their natural conformation or leads to aggregation, resulting in distortion of dynamic analysis. The nanodisk is composed of phospholipids and membrane scaffold proteins (MSPS) or synthetic polymers, forming a lipid bilayer of controllable size.
After embedding membrane proteins in it, not only can its correct folding be maintained, but also the key protein-lipid interactions can be retained. This system close to the natural environment enables HDX-MS to reflect the conformational changes of membrane proteins more truly, such as the activation status of G protein-coupled receptors (GPCRs) and the conformational transitions of transport proteins.
In terms of experimental strategies, the HDX-MS research integrating Nanodisc requires the optimization of sample preparation and mass spectrometry conditions. Since the nanodiscs themselves increase the complexity of the samples, mild separation methods (such as size exclusion chromatography) should be adopted after HDX quenching to remove free lipids and scaffold proteins to avoid interference from mass spectrometry signals.
In addition, the enzymatic digestion step also needs to be adjusted, as the hydrophobic region of the membrane protein may be difficult to be effectively cut by conventional proteases. Supplementary proteases (such as thermophilic protease) can be attempted to increase coverage.
The factors such as large molecular weight, complex structure and wide dynamic range make it difficult for the traditional HDX-MS method to obtain high sensitivity and sufficient sequence coverage. In recent years, by optimizing sample processing, mass spectrometry techniques and data analysis strategies, researchers have significantly enhanced the application capabilities of HDX-MS in complex systems.
One of the keys to improving sensitivity lies in improving the sample preparation process. For low-abundance or easily dissociated complexes, mild crosslinking strategies (such as short-distance chemical crosslinking agents) can stabilize the complex structure without significantly affecting the kinetics of hydrogen-deuterium exchange and reduce gas-phase dissociation. Meanwhile, optimizing the enzymatic digestion conditions (such as mixed protease combinations, adjusting pH and temperature) can increase the diversity of polypeptide fragments, especially helping to cover hydrophobic or highly structured regions that are traditionally difficult to detect.
These improvements have demonstrated value in the research of super-large molecular systems such as nuclear pore complexes and viral capsids. For example, the HDX-MS analysis of ribosomes can now cover more than 90% of the sequences and accurately capture the dynamic changes of subbase interfaces during the translation process.
Artificial intelligence (AI) is completely transforming the data analysis approach of HDX-MS, solving the bottlenecks of traditional methods when dealing with complex dynamic information. Traditional HDX-MS data analysis relies on manual intervention and fixed algorithms, making it difficult to handle massive mass spectrometry data efficiently. Especially when studying complex protein systems or samples with low signal-to-noise ratios, key dynamic information may be omitted. The introduction of AI, especially machine learning and deep learning technologies, is making the data parsing of HDX-MS faster and more accurate.
Deep learning methods such as three-dimensional convolutional neural networks (3D-CNN) and spatio-temporal graph neural networks (ST-GNN) are establishing brand-new dynamic modeling frameworks. These models, by analyzing the HDX-MS spectra at different time points, can not only predict the dominant conformation but also capture the transient intermediate states that account for less than 5%. For instance, in GPCR research, the AI model successfully reconstructed the complete activation path of the receptor after ligand binding by integrating microsecond-level HDX data, including metastable conformations that cannot be observed by traditional methods. This dynamic modeling ability has provided new research tools for long-standing unresolved issues such as allosteric signal transmission and protein folding pathways.
Yu, Jiali et al. Demonstrated the first deep learning model, namely AI-based HDX (AI-HDX), which predicts intrinsic protein dynamics based on protein sequences. It reveals the dynamics of protein structure by combining deep learning, experimental HDX, sequence alignment and protein structure prediction.
Figure 1. The Design of AI-HDX Prediction Model. (Yu, Jiali et al., 2023)
The architecture based on graph neural networks shows unique advantages in this field. These models transform protein structures into topological maps, where nodes represent amino acid residues and edges encode spatial distances and chemical interactions. By training the model on thousands of experimental HDX datasets, GNN can accurately predict the deuteration rate under different solvent conditions and temperatures, with an average accuracy rate of over 85% of the experimental data.
It is particularly worth noting that this type of model successfully captured the abnormally rapid exchange phenomenon of certain residues in the protein core region, which is highly consistent with the local unfolding events later verified through ultrafast hybrid HDX experiments.
AI-driven HDX analysis and molecular dynamics (MD) simulation are forming a strong complementary relationship. Although traditional MD simulation can provide atomic-level resolution time trajectories, there is often an order of magnitude gap between its microsecond-level time scale and experimental observations. While the HDX data parsed by AI can capture real dynamics, it lacks the physical details of atomic movement. The integration of the two is breaking through their respective technical bottlenecks and generating a synergy effect of "1+1>2".
The introduction of the machine learning force field has significantly improved the simulation efficiency. Traditional full-atom MDS are limited by computational costs and have difficulty handling large protein complexes. The graph neural network force field trained with HDX data can increase the simulation speed by 3 to 4 orders of magnitude while maintaining accuracy. For example, in the simulation of nuclear pore complexes, GNN-FF completed the sampling volume equivalent to that of traditional MD in 5 years in just 2 days, and the correlation between the hydrogen bond lifetime generated by the simulation and the HDX protection factor reached r=0.91.
Dynamic fingerprint mapping technology is another innovative direction. By conducting a joint dimensionality reduction analysis of HDX time series data and the key motion patterns of MD trajectories, researchers can extract the core dynamic modules that dominate protein functions. The DynaMine algorithm developed by the MIT team applied this method to identify the "dynamic conduction chain" connecting the ligand-binding pocket and the G protein-coupled interface in G protein-coupled receptors, providing precise targets for the design of heteromorphic drugs.
The cross-integration of HDX-MS with technologies such as AI, MD simulation, and structural biology is reshaping the process of modern drug development. Traditional drug discovery often relies on static structural information, while key mechanisms such as dynamic conformational changes and allosteric regulation are often overlooked. Nowadays, multidisciplinary cross-technology enables researchers to optimize drug design at the dynamic molecular level, significantly increasing the success rate.
Traditional high-throughput screening of HTS relies on activity detection, but it is unable to distinguish the mechanism of action of compounds. The Dynamical Fingerprinting technology of HDX-MS can simultaneously evaluate the binding modes, conformational regulation and allosteric effects of thousands of compounds at an early stage.
By comparing the HDX profiles before and after small molecule treatment, false positives (such as aggregates or non-specific conjugates) can be quickly excluded.
AI cluster analysis can automatically group compounds according to dynamic response patterns (such as competitive inhibitors, allosteric modulators or protein-protein interaction blockers).
Even weakly bound fragments (Kd >100 μM) can be detected through local HDX changes, guiding subsequent chemical optimization.
Quantify the stabilization/de-stabilization effects of compounds on active sites, allosteric channels or protein interfaces through changes in hydrogen-deuterium exchange rates.
Although some inhibitors bind to the same pocket, they induce different dynamic responses (such as "closed state" vs. "open state"). HDX can distinguish these subtle differences and associate them with in vivo activity.
Monitor the HDX changes of mutant proteins to predict in advance the conformational escape mechanisms that may affect drug binding.
By monitoring the HDX changes of proteins in animal models or cell lysates, it directly verifies whether the drug binds to the expected target in vivo.
Compare the HDX profiles (HDX-PP) of the entire proteome before and after drug treatment to identify unexpected interactions (such as the unexpected stabilization of a certain scaffold protein by a kinase inhibitor).
The characteristics of HDX in specific conformational states can serve as pharmacodynamic (PD) markers, supporting the design of clinical trials.
KRAS G12C, the most common RAS mutation found in non-small-cell lung cancer, has been the subject of multiple recent covalent small-molecule inhibitor campaigns including efforts directed at the guanine nucleotide pocket and separate work focused on an inducible pocket adjacent to the switch motifs.
Multiple conformations of switch II have been observed, suggesting that switch II pocket (SIIP) binders may be capable of engaging a range of KRAS conformations. Here we report the use of hydrogen/deuterium-exchange mass spectrometry (HDX MS) to discriminate between conformations of switch II induced by two chemical classes of SIIP binders. We investigated the structural basis for differences in HDX MS using X-ray crystallography and discovered a new SIIP configuration in response to binding of a quinazoline chemotype.
To examine the molecular details of ligand activation of G-protein coupled receptor, emphasis has been placed on structure determination of these receptors with stabilizing ligands.
Here Zhang, Xi et al. present the methodology for receptor dynamics characterization of the GPCR human β2 adrenergic receptor bound to the inverse agonist carazolol using the technique of amide hydrogen/deuterium exchange coupled with mass spectrometry. The HDX MS profile of receptor bound to carazolol is consistent with thermal parameter observations in the crystal structure and provides additional information in highly dynamic regions of the receptor and chemical modifications demonstrating the highly complementary nature of the techniques.
Following optimization of HDX experimental conditions for this membrane protein, better than 89% sequence coverage was obtained for the receptor. The methodology presented paves the way for future analysis of β2AR bound to pharmacologically distinct ligands as well as analysis of other GPCR family members.
HDX-MS faces two major challenges when analyzing membrane proteins and complex systems: the dissolution stability of membrane proteins and sample Carryover. The following are the key optimization strategies:
The complexity of HDX-MS data and the diversity of experimental conditions often lead to the difficulty in directly comparing cross-laboratory results. Establishing a unified interpretation standard is crucial for promoting the wide application of this technology. The following are the key standardization strategies:
The traditional manual operation mode of HDX-MS (such as step-based quenching and manual injection) severely limits its application in drug screening and large-scale dynamic research. In recent years, through the integration of modular robot systems and intelligent data processing pipelines, the throughput of HDX-MS has increased by 10 to 100 times, while significantly reducing human errors. The following are the key breakthrough directions:
Liquid handling robots: such as Tecan or Hamilton platforms integrate temperature-controlled incubation, precise quenching (pH 2.2, 0°C), and enzyme digestion steps, enabling parallel processing on a 96-well plate scale (with over 300 samples per day).
Microfluidic rapid mixing: By using a chip-type mixer, the resolution of the start time of deuteration reactions is enhanced to the millisecond level, making it suitable for the study of rapid processes such as GPCR activation.
Online reduction/alkylation: Integrate the disulfide bond reduction module in the LC flow path to avoid conformational perturbations caused by manual steps (especially for antibody samples containing multiple pairs of disulfide bonds).
Multiplexed LC chromatography: By using alternating loading of two columns (such as the Waters SELECT series), the utilization rate of mass spectrometry is increased by 80%, and 150 HDX samples can be analyzed in a single day.
Ion mobility pre-separation: Increase the peptide coverage of HDX-MS (especially for the hydrophobic regions of membrane proteins), while reducing cross-contamination between samples (Carryover<0.1%).
DIA scanning mode: Data-independent acquisition (such as SONAR technology) ensures the stable detection of low-abundance peptides and is suitable for complex biological matrices (such as cell lysates).
References