TY - JOUR
T1 - Large-scale identification of n-linked intact glycopeptides in human serum using HILIC enrichment and spectral library search
AU - Shu, Qingbo
AU - Li, Mengjie
AU - Shu, Lian
AU - An, Zhiwu
AU - Wang, Jifeng
AU - Lv, Hao
AU - Yang, Ming
AU - Cai, Tanxi
AU - Hu, Tony
AU - Fu, Yan
AU - Yang, Fuquan
N1 - Funding Information:
* This work was supported by grants from the National Key R&D Program of China (2018YFA0507801 and 2018YFA0507103), the National Natural Science Foundation of China (Grant nos. 91640112, 21607170 and 31670185), the Strategic Priority Research Programs of the Chinese Academy of Sciences (XDA12030202), and the NCMIS CAS. The authors declare that they have no conflicts of interest with the contents of this article. □S This article contains supplemental Figures, Tables, Documents, and Discussion. §§ To whom correspondence may be addressed. E-mail: fqyang@ ibp.ac.cn. ¶¶ To whom correspondence may be addressed. E-mail: yfu@ amss.ac.cn. ‖‖ These authors contributed equally to this work.
Funding Information:
This work was supported by grants from the National Key R&D Program of China (2018YFA0507801 and 2018YFA0507103), the National Natural Science Foundation of China (Grant nos. 91640112, 21607170 and 31670185), the Strategic Priority Research Programs of the Chinese Academy of Sciences (XDA12030202), and the NCMIS CAS. The authors declare that they have no conflicts of interest with the contents of this article.*%blankline%*
Publisher Copyright:
© 2020 Shu et al.
PY - 2020
Y1 - 2020
N2 - Large-scale identification of N-linked intact glycopeptides by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) in human serum is challenging because of the wide dynamic range of serum protein abundances, the lack of a complete serum N-glycan database and the existence of proteoforms. In this regard, a spectral library search method was presented for the identification of N-linked intact glycopeptides from Nlinked glycoproteins in human serum with target-decoy and motif-specific false discovery rate (FDR) control. Serum proteins were firstly separated into low-abundance and high-abundance proteins by acetonitrile (ACN) precipitation. After digestion, the N-linked intact glycopeptides were enriched by hydrophilic interaction liquid chromatography (HILIC) and a portion of the enriched N-linked intact glycopeptides were processed by Peptide-N-Glycosidase F (PNGase F) to generate N-linked deglycopeptides. Both Nlinked intact glycopeptides and deglycopeptides were analyzed by LC-MS/MS. From N-linked deglycopeptides data sets, 764 N-linked glycoproteins, 1699 N-linked glycosites and 3328 unique N-linked deglycopeptides were identified. Four types of N-linked glycosylation motifs (NXS/T/C/V, X≠P) were used to recognize the N-linked deglycopeptides. The spectra of these N-linked deglycopeptides were utilized for N-linked deglycopeptides library construction and identification of N-linked intact glycopeptides. A database containing 739 N-glycan masses was constructed and utilized during spectral library search for the identification of N-linked intact glycopeptides. In total, 526 N-linked glycoproteins, 1036 N-linked glycosites, 22,677 N-linked intact glycopeptides and 738 N-glycan masses were identified under 1% FDR, representing the most in-depth serum Nglycoproteome identified by LC-MS/MS at N-linked intact glycopeptide level.
AB - Large-scale identification of N-linked intact glycopeptides by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) in human serum is challenging because of the wide dynamic range of serum protein abundances, the lack of a complete serum N-glycan database and the existence of proteoforms. In this regard, a spectral library search method was presented for the identification of N-linked intact glycopeptides from Nlinked glycoproteins in human serum with target-decoy and motif-specific false discovery rate (FDR) control. Serum proteins were firstly separated into low-abundance and high-abundance proteins by acetonitrile (ACN) precipitation. After digestion, the N-linked intact glycopeptides were enriched by hydrophilic interaction liquid chromatography (HILIC) and a portion of the enriched N-linked intact glycopeptides were processed by Peptide-N-Glycosidase F (PNGase F) to generate N-linked deglycopeptides. Both Nlinked intact glycopeptides and deglycopeptides were analyzed by LC-MS/MS. From N-linked deglycopeptides data sets, 764 N-linked glycoproteins, 1699 N-linked glycosites and 3328 unique N-linked deglycopeptides were identified. Four types of N-linked glycosylation motifs (NXS/T/C/V, X≠P) were used to recognize the N-linked deglycopeptides. The spectra of these N-linked deglycopeptides were utilized for N-linked deglycopeptides library construction and identification of N-linked intact glycopeptides. A database containing 739 N-glycan masses was constructed and utilized during spectral library search for the identification of N-linked intact glycopeptides. In total, 526 N-linked glycoproteins, 1036 N-linked glycosites, 22,677 N-linked intact glycopeptides and 738 N-glycan masses were identified under 1% FDR, representing the most in-depth serum Nglycoproteome identified by LC-MS/MS at N-linked intact glycopeptide level.
UR - http://www.scopus.com/inward/record.url?scp=85082756594&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082756594&partnerID=8YFLogxK
U2 - 10.1074/mcp.RA119.001791
DO - 10.1074/mcp.RA119.001791
M3 - Article
C2 - 32102970
AN - SCOPUS:85082756594
SN - 1535-9476
VL - 19
SP - 672
EP - 689
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 4
ER -