生化学および生理学ジャーナル

Automatic identification of site-specific glycosylation in proteomics using mass spectrometry and bioinformatics

Jong Shin Yoo

Protein glycosylation, one of the most prevalent posttranslational modifi cations in proteins, plays important roles in biological systems via various processes, such as adhesion, signaling through cellular recognition, and response to abnormal biological states. However, due to the complexity and heterogeneity of a glycoprotein, current analyses focus mainly on the identification of either glycosites or the released glycans only. In this study, we have developed MS-based high throughput method for intact N-glycopeptides analysis, named GlycoProteomeAnalyzer (GPA) for analysis of N-and O-glycosylation in proteomics, which combines tandem Mass Spectrometry (MS) with a database search and algorithmic suite. We created novel scoring algorithms for confi dent identifi cation of N- and O-glycosylation of proteins with calculation of False Discovery Rate (FDR). In our approach, all amino acid sequence as well as glycosylation site information were obtained from the Uniprot database. From the Swiss-Prot accession number of human protein, our GPA program automatically construct tryptic N- and O-glycopeptide database for the proteins in human plasma sample. It allows automatic identifi cation of site-specifi c N- and O-glycopeptides of protein mixtures using HCD, CID, and ETD MS/MS spectra with GPA-DB from Uniprot with estimated FDR ≤ 1%. GPA has been designed to easily handle high-throughput glycoproteomic data with a graphical user interface and demonstrated on website (https://www.igpa.kr/). It can also be integrated with cloud computing service that eliminates the need for local clusters and increases throughput of data analysis