COSMIC-based mutation database enhances identification efficiency of HLA-I immunopeptidome

doi:10.21203/rs.3.rs-3346799/v1

Download PDF

Research Article

COSMIC-based mutation database enhances identification efficiency of HLA-I immunopeptidome

https://doi.org/10.21203/rs.3.rs-3346799/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 10 Feb, 2024

Read the published version in Journal of Translational Medicine →

You are reading this latest preprint version

Background: Neoantigens have emerged as a promising area of focus in tumor immunotherapy, with several established strategies aiming to enhance their identification. Human leukocyte antigen class I molecules (HLA-I), which present intracellular immunopeptides to T cells, provide an ideal source for identifying neoantigens. However, solely relying on a mutation database generated through commonly used whole exome sequencing (WES) for the identification of HLA-I immunopeptides, may result in potential neoantigens being missed due to limitations in sequencing depth and sample quality.

Method: In this study, we constructed and evaluated an extended database for neoantigen identification, based on COSMIC mutation database. This study utilized mass spectrometry-based proteogenomic profiling to identify the HLA-I immunopeptidome enriched from HepG2 cell. HepG2 WES-based and the COSMIC-based mutation database were generated and utilized to identify HepG2-specific mutant immunopeptides.

Result: The results demonstrated that COSMIC-based database identified 5 immunopeptides compared to only 1 mutant peptide identified by HepG2 WES-based database, indicating its effectiveness in identifying mutant immunopeptides. Furthermore, HLA-I affinity of the mutant immunopeptides was evaluated through NetMHCpan and peptide-docking modeling to validate their binding to HLA-I molecules, demonstrating the potential of mutant peptides identified by the COSMIC-based database as neoantigens.

Conclusion: Utilizing the COSMIC-based mutation database is a more efficient strategy for identifying mutant peptides from HLA-I immunopeptidome without significantly increasing the false positive rate. HepG2 specific WES-based database may exclude certain mutant peptides due to WES sequencing depth or sample heterogeneity. The COSMIC-based database can effectively uncover potential neoantigens within the HLA-I immunopeptidomes.

HLA immunopeptidome

neoantigen identification

HepG2 cell

COSMIC database

hepatocellular carcinoma

Primary liver cancer ranks as the fifth most common tumor and the second leading cause of cancer-related deaths in China. The majority of primary liver cancer cases are hepatocellular carcinoma (HCC), constitutes the majority of primary liver cancer cases, representing approximately 75–85% of all liver cancer cases[1]. Over 50% of HCC patients are diagnosed with advanced HCC, characterized by a poor prognosis and a 1-year survival rate ranging from 12–38% 1-year[2, 3]. Identifying effective treatment options for advanced HCC is crucial. Immune checkpoint inhibitors and antiangiogenic targeted drugs have recently emerged as first-line treatment for advanced HCC, demonstrating promising objective response rate. However, due to drug resistance, nearly half of the patients have an unsatisfactory prognosis[3, 4]. Thus, the development of efficient second-line therapies is essential. Recent reports suggest that adoptive T-cell therapy achieves promising results against various cancers, including HCC, by improving immunosuppression in the tumor microenvironment[5–7]. To prepare adoptive T-cell with high specificity and cytotoxicity against tumor cells, antigens that specifically stimulate T-cell are of high priority[8]. Increasing evidence suggests that neoantigens, characterized by tumor-specific mutant proteins or peptides with immunogenicity, are ideal antigens for T-cell activation, facilitating effective immune responses against tumors while minimizing the incidence of autoimmune reaction[9].

Conventional approaches for identifying neoantigen typically involve Next-generation sequencing or Mass spectrometry (MS) to detect somatic mutations in whole cell proteome[10–12]. These approaches, combined with in silico HLA-I binding affinity prediction, can identify neoantigen[13]. However, only a fraction of these mutant peptides is considered as neoantigens, limiting their clinical application. In comparison to previous strategies such as genomic or proteogenomic approaches, HLA-I immunopeptidome approach shows great potential in neoantigen identification directly detecting HLA-I presented peptides. This approach has been successfully used in identifying neoantigen in melanoma, non-small lung cancer and other cancers[12, 14–16], resulting in improved accuracy. Nonetheless, the identification of HLA-I immunopeptide is based on database generated from the genome, which may result in the omission of mutant peptides due to undetected low-frequency mutations and differences sample collection[17]. For instance, a study involving 8 HCC patients identified 11,266 non-synonymous single-nucleotide DNA variants, but only 1,875 amino acid mutations at the proteomic level[18]. Similarly, in another study involving whole-exome and transcriptome sequencing of 16 HCC tumor tissues and normal tissue samples, 1,039 mutations and 159 potential tumor neoantigen peptides were identified and verified by proteomics, but no corresponding HLA peptides were found from tumor tissue[19]. HCC is considered a low tumor mutational burden (TMB) cancer compared to other types of cancer[20], resulting in fewer mutations at the genomic level. Moreover, previous studies have identified a long-tail phenomenon in tumor mutation genes, leading to a high prevalence of low-frequency mutations[21]. Inadequate sequencing depth of WES poses challenges in the detection of low-frequency mutations. Due to the limitations of exome sequencing technology and cost, uneven sequencing depth is observed, resulting in insufficient coverage in SNP-intensive regions, which hinders the detection of existing variations. Consequently, WES may fail to detect mutations, especially in tumors with low frequencies[22]. These findings indicate that the number of mutations at the protein level is significantly reduced, posing challenges in identifying mutant protein or peptides recognized by the immune system[23]. This presents an increased difficulty in identifying tumor neoantigens in HCC. The Catalogue of Somatic Mutations in Cancer (COSMIC) is a comprehensive database that collects somatic mutations identified in various types of cancers, including HCC. Additionally, COSMIC provides a wealth of information on other less common genetic alterations in HCC, extending mutation database for identification of neoantigens, apart from the well-established HCC-associated genes.

This study proposes a database generation strategy to enhance the coverage of somatic mutations in HCC. The approach is based on HCC mutation data from COSMIC somatic mutation database. To evaluate the effectiveness of the strategy, we enriched and analyzed HLA-I presented peptides from HepG2 cell line using high-resolution mass spectrometry. Both COSMIC-based and HepG2 WES-based database were employed to identify potential neoantigens. Furthermore, the identified neoantigens underwent validated through HLA-I binding affinity prediction and peptide-protein docking models.

HepG2 Cell line

The HepG2 hepatocellular carcinoma cell line was obtained from the American Type Culture Collection and cultured in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum in 37℃ with 5% CO₂.

Western blot analysis to assess HLA-I

Samples containing HLA- I peptide complexes, including total cell lysate (TCL), flowthrough (FT), and elution fractions, were collected and separated on a 10% SDS-PAGE gel subsequently, the proteins were transferred onto a nitrocellulose membrane. The membrane was blocked with 5% defatted milk in TBST for 2 hours to prevent nonspecific binding. For HLA-A/B detection, the membrane was incubated with a primary antibody against HLA-A/B (ABclonal, Hubei, China) for 2 hours at room temperature. This was followed by incubation with a secondary antibody for 1 hour at room temperature. Finally, signal detection was performed using a chemiluminescent substrate (Scientific, California, United States).

HLA-I immunopeptidome enrichment and purification

To prepare the cell lysate, three biological replicates of HepG2 cells were collected and washed three times with cold phosphate-buffered saline (PBS). The cells were then lysed using a cold solution of lysis buffer consisting of 0.25% sodium deoxycholate, 0.2mM iodoacetamide, 1mM EDTA, and 1:200 Protease Inhibitors Cocktail (Sigma-Aldrich, Missouri, United States) in PBS. After 30 minutes on ice, the lysate was centrifuged at 20,000 g for 30 minutes at 4 ℃ to remove sediment. To enrich HLA-I peptide complexes, a house-made pan-HLA class I antibody was coupled with Protein G Sehparose beads (Cytiva, Massachusetts, United States). The beads with antibody and HepG2 cell lysate were co-incubated overnight at 4 ℃ to specifically bind the HLA-I peptide complexes. Subsequently, the captured HLA-I peptide complexes were subjected to three cold PBS washes to remove nonspecifically bound proteins or contaminants. Finally, the HLA-I peptide complexes were eluted from the beads using 0.15% Trifluoroacetic acid (TFA) in water. To purify the immunopeptidome, house-made C₁₈ stage tips were prepared. The stage tips were activated by 100% acetonitrile (ACN) and 80% ACN in 0.1% TFA. followed by equilibration with 0.1% TFA in water. The HLA-I peptide complexes were loaded onto the C₁₈ stage tips twice, followed by three washes 0.1% TFA in water. The purified HLA-presented peptides were eluted from the C₁₈ stage tips using a solution of 30% ACN in 0.1% TFA, and subsequently dried using freeze vacuum drying equipment. To prepare for LC-MS/MS analysis, the peptide samples were reconstituted in a loading buffer containing 1% ACN in 0.1% TFA.

LC-MS/MS analysis of HLA-I immunopeptidome

The HLA-I immunopeptidome was analyzed using an EASY-nLC 1200 instrument (Thermo Fisher Scientific, California, United States) equipped with a self-packed capillary column (75 µm i.d. × 20 cm, 1.9 µm C₁₈ reversed-phase fused silica) coupled to an Orbitrap Exploris 480 (Thermo Fisher Scientific, California, United States). The gradient was comprised of an increasing Buffer B (Buffer A: 0.1% FA in water; Buffer B: 0.1% FA in 100% ACN) from 5% to10% for 4 min, 10–30% for 56 min, 30–95% for 6 min and holding for 2 min. Full MS scans ranged from 300 to 1,600 m/z with a resolution of 60,000. The maximum injection time was 50ms, and the normalized automatic gain control (AGC) target was set at 300% for the remaining settings. The MS₂ scan was configured to collect fragmentation for charge state of 2 to 6, using high-energy collision dissociation (HCD) with normalized collision energy of 27%, the resolution for MS₂ was set at 15,000, AGC target was set at 100%, and the maximum injection time was 40 ms. A dynamic exclusion time of 15 seconds was applied. Theoretical retention time were generated using DeepLC (https://iomics.ugent.be/deeplc/)[24].

HLA-I typing and HLA binding prediction

HLA typing of HepG2 was obtained from a previous study[30]. Additionally, HLA typing was performed using arcasHLA with the HepG2 transcriptome sequence from our previous work[31, 32]. The confirmed HLA typing results for HepG2 cell line were used in the subsequent analysis. Only peptides consisting of 8–14 a.a were selected for HLA binding prediction. The peptides were clustered into groups based on sequence similarities using Gibbscluster-2.0[33]. HLA-I binding prediction was performed using NetMHCpan-4.1 (https://services.healthtech.dtu.dk/services/NetMHCpan-4.1/) with HLA subtype HLA-A0201, HLA-A2402, HLA-B3514, HLA-B5101, HLA-C0401, HLA-C1602. Binding prediction results were visualized using R studio (build 524), and the artwork is created with BioRender.com.

Peptides-protein docking modeling

Mutant peptides identified from the HepG2 WES-based and COSMIC-based database searches underwent several filtering steps based on spectrum, length, and HLA binding results. Only mutant peptides with a length of 8–14 a.a and strong HLA-I binding were selected for peptides-protein docking modeling. The structure model of HLA-A0201 (1DUZ) and HLA-A2402 (5HGA) were downloaded from RCSB Protein Data Bank[34]. HPepDock 2.0 (http://huanglab.phys.hust.edu.cn/hpepdock/) was used for docking modeling[24]. The peptide-protein docking model and molecular surface hydrophobicity were subsequently analyzed and visualized using ChimeraX (v 1.6.1)[35].

Identification and characteristics of HLA-I immunopeptidome in HepG2 cell line

The process of identifying neoantigens is illustrated (Fig. 1). Immunoprecipitation successfully enriched HLA-I peptide complexes from HepG2 cell lysate. To evaluate the enrichment efficiency, we compared the HLA A/B signals in TCL, FT and elution by western blotting, quantifying them using grayscale (Fig. 2a, supplementary Fig. S1a). Western blot analysis revealed that the HLA A/B signal in the FT was significantly lower compared to the TCL, while a strong signal was observed in the elution. Greyscale quantification demonstrated successfully enrichment and elution of nearly 50% of HLA A/B and eluted from the TCL (Fig. 2b). Subsequently, a total of 8,549 peptides were identified through LC-MS/MS analysis. Of these, approximately 74.6% of them (n = 6,376) being 8–14 a.a in length, which corresponds to the length characteristics of HLA-I immunopeptidome (Supplemental Table S1). In three biological replicates, we identified 4,537, 3,900 and 3,481 8–14 a.a peptide sequences, respectively (Fig. 2c). Venn diagrams demonstrated commonly identified peptides comprised the highest portion, demonstrating consistency in the enrichment and identification of HLA-I immunopeptidome (Fig. 2d). Furthermore, Hyperscores, used for evaluating the quality of spectra by comparing observed spectra to theoretical ones generated by MSFragger,[29] were compared between HLA-I immunopeptidome from three replicates, indicating consistency in the mass spectrum quality (Fig. 2e). The theoretical reaction time (RT) exhibits a strong correlation with the observed RT (supplementary Fig. S1b). Peptides with lengths ranging from 8–14 a.a exhibited a distribution pattern, with the majority consisting of 9 a.a peptides (Fig. 2f, g).

Discover HepG2 specific mutant peptide using HepG2 WES-based database

Based on the satisfied data quality of the HepG2 HLA-I immunopeptidome, we utilized HepG2 WES-based database to identify potential neoantigen from HLA-I immunopeptidome. The Gibbs clustering approach was used to analyze the anchor residues of the eluted peptides. The results revealed primary binding motifs that primarily clustered in two groups (Fig. 3a, b). Subsequent analysis was performed using NetMHCpan 4.1 with default settings. Peptides were categorized as having strong binding when the percentage rank was less than 0.5%, and weak binding was assigned to those within the percentage rank range of 0.5 to 2.0%. The results indicated that a significant proportion of 8–14 a.a peptides were predicted to bind to HLA-I molecules. A preference for binding was observed across different HLA alleles, with HLA-A0201 (n = 2,014) and HLA-A2402 (n = 1,918) exhibiting particularly strong preference (Fig. 3c). The distribution of amino acid species at the P2 (Second amino acid) and PΩ (Last amino acid) positions in the Gibbs clustered peptides corresponds with the distribution observed in the NetMHCpan reference database (supplementary Fig. S2d, e).

These findings are consistent with the results obtained from Gibbs clustering, and indicating that the binding motif clusters within HLA-A0201 and HLA-A2402. Additionally, there was a length distribution among strong binders, and it was found that 9 a.a peptides were the most frequently observed (Fig. 3d), which is consistent with the length distribution of HLA-I immunopeptides reported in previous studies[12, 14, 36]. Furthermore, 9 a.a peptides were predicted to have a lower elution percentage rank, indicating a stronger binding ability to the HLA-I molecules (Fig. 3e). However, there was no significant difference in peptide intensity observed among peptides of different lengths (supplementary Fig. S2a). Our study identified only 1 mutant peptide using the HepG2 WES-based database (Table 1, Fig. 3f), indicating the limited effectiveness of personalized-specific databases in identifying mutant immunopeptidomes. Similar findings were reported in a previous study[16]. Notably, the HLA-I immunopeptidomes of HepG2 contained both the wild type peptide "SLFDVSHML" and the mutant peptide “SLFDASHML”. Although the mutant peptide and wild-type peptide had low intensities, their predicted HLA-I affinity was higher compared to other peptides (Fig. 3g, h).

Table 1

List of mutant peptides identified by HepG2 WES-based database
No.	Peptide	Hyperscore	Length	Nucleic variant	Amino variant	Protein	Binding level	HLA allele	Elution rank %
1	SLFDASHML	11.98	9	c.242T > C	p.V81A	AMT	Strong	*HLA-A02:01**	0.003
2	SLFDVSHML	15.81	9	WT	WT	AMT	Strong	*HLA-A02:01**	0.005

Generation of COSMIC-based Database

The construction workflow for the mutation database was summarized into four parts (Fig. 4a). The genes sequences of mutations were extracted from COMSIC genomic database, which consisted of 1,233,831 mutations, and HepG2 WES database, which contained 388 mutations. Non-synonymous mutations lead to changes in the amino acid sequence. Therefore, further filtering was conducted to remove synonymous and redundant mutations. As a result, 279 mutations were identified in the HepG2 WES-based database, and 81,137 mutations were identified in the COSMIC-based database. Among the 957 samples in the COSMIC-based database, TP53 was the most frequently mutated gene, followed by TTN and CTNNB1 (Fig. 4b). Notably, most of the samples exhibited a low TMB, although a few individuals displayed exceptionally high TMB. These results are consistent with previous findings that HCC has a relatively lower TMB compared to other types of tumors[19, 20]. The distribution of mutant classes in the HepG2 WES-based and COSMIC-based database was similar, with missense mutations being the most common, followed by nonsense mutations (Fig. 4c and Table 2). The proportions of other mutation types, including nonstop mutations, insertions, and deletions, were similar in the COSMIC-based database. Only 1 mutation was found in both databases, while 222 mutated genes were commonly identified (Fig. 4d, e). To conduct subsequent MS data searches, a COSMIC-based database was constructed by incorporating these filtered somatic mutations from both COSMIC and HepG2 WES into the UniProt Human database.

Table 2

Statistic of mutant classification in HepG2-based and COSMIC-based database
Mutant classification	HepG2-based database	COSMIC-based database
Translation start site	0	21
Nonstop mutation	0	415
Nonsense mutation	12	6640
Missense mutation	240	75883
In frame insertion	3	100
In frame deletion	0	497
Frameshift insertion	3	1052
Frameshift deletion	4	1278

Evaluation of HepG2 WES-based and COSMIC-based database

To evaluate the impact of HepG2 WES-based and COSMIC-based database on database search result, a comparison was made between the outcomes of the two mutation databases. Venn diagrams showed that a majority of immunopeptides (n = 6,245) were commonly identified by both databases, resulting in a total of 2,565 shared proteins (Fig. 5a). The quality of unique identified peptides by each mutation database was initially evaluated (Fig. 5b). Comparative analysis revealed that, in comparison to the commonly identified peptides, the majority of unique identified peptides from either mutation database exhibited lower hyperscores, indicating lower spectrum quality. Nonetheless, a subset of spectra exhibited hyperscores higher than the average hyperscore of the commonly identified counterparts, suggesting that the use of mutation databases enables the discovery of unique peptides with high quality. Further investigation was conducted on the unique peptides identified by both databases. Incomplete product ion coverage was observed as a prevalent scenario, resulting in spectra matching to different peptides. manual inspection of the MS₂ spectra revealed that 41 peptides, comprising 11.71% of all unique peptides, had equal-weight amino acids or combinations (supplementary Fig. S3a). Furthermore, a quantitative evaluation demonstrated a strong correlation between the peptide intensities obtained from the two search results (Fig. 5c).

We observed that unique identified peptides from both mutations databases had statistically lower intensities compared to their commonly identified counterparts. However, there was no significant difference between in the intensities of unique identified peptides (Fig. 5d). Furthermore, a comparison of the length distribution patterns for commonly and uniquely identified peptides revealed that both groups exhibited a similar distribution, with the majority of peptides having a length of 9 a.a (Fig. 5e). These results imply that the unique identified peptides are highly likely to be immunopeptides. To evaluate the their immunoaffinity, we utilized NetMHCpan to predict their binding affinity. The analysis revealed that the proportion of strong binding peptides was lower in the unique identified peptides compared to their counterparts, although strong binders still accounted for nearly 50% (Fig. 5f). Additionally, the elution percentage rank distribution based on peptide length supported this conclusion, as both commonly and uniquely identified peptides demonstrated similar trends. Specifically, 9 a.a peptides showed the lowest percentage rank among strong binders (Fig. 5g, supplementary Fig. S3b). This pattern was also evident in the immunopeptides identified using UniProt database, providing further evidence that the majority of the unique identified peptides were immunopeptides.

Discover HepG2 specific mutant peptide using COSMIC-based database

During the evaluation of the HepG2 WES-based and COSMIC-based database for the identification of HLA-I immunopeptidome, we excluded peptides with "isoleucine" to "leucine" mutations, as these cannot be distinguished by mass spectrometry. As a result, we identified 16 mutant peptides (Table 3). Although both mutation databases include other mutant classes of mutations, such as nonsense mutations, insertions, and deletions, all of the mutant peptides identified in our study harbored SNVs instead of other mutation class. In contrast, the COSMIC-based identified 16 mutant peptides, including the one discovered using the HepG2 WES-based database. These results indicate that the COSMIC-based database is more efficient for the identification of the HLA-I immunopeptidome.

Table 3

List of mutant peptides identified by COSMIC-based database
No.	Peptide	Origin	Hyperscore	Length	Nucleic variant	Amino variant	Protein	Binding level	HLA allele	Elution rank %
1	RYSEYTEEF	COSMIC	22.11	9	c.1795G > A	p.A599T	MTMR6	Strong	HLA-A*24:02	0.003
2	RMPEAAPRV	COSMIC	11.81	9	c.215C > G	p.P72R	TP53	Strong	HLA-A*02:01	0.084
3	SLFDASHML	HepG2 COSMIC	11.98	9	c.242T > C	p.V81A	AMT	Strong	*HLA-A02:01**	0.007
4	TVLSSRPVV	COSMIC	13.06	9	c.1574T > C	p.I525T	ITGAL	Weak	*HLA-B51:08**	1.487
5	LSWHLPLLI	COSMIC	18.14	9	c.26G > A	p.R9H	IL2RB	Weak	*HLA-C16:02**	1.029
6	LNDLIVALS	COSMIC	12.87	9	c.1668C > A	p.F556L	NVL	None
7	KAYGSYEELAKDPN	COSMIC	11.19	14	c.197G > A	p.S66N	DHDH	None
8	DEAQNLTRD	COSMIC	12.61	9	c.2177G > A	p.G726D	DDX54	None
9	HGELLEVNL	COSMIC	14.11	9	c.219C > A	p.D73E	PCDHA13	None
10	LFLDAIHLT	COSMIC	14.69	9	c.2255C > T	p.P752L	BBX	None
11	DLLLVPTAGL	COSMIC	18.93	10	c.346T > G	p.Y116D	PROM2	None
12	GTLLSGAVGSLLL	COSMIC	19.86	13	c.508A > T	p.T170S	SLC17A9	None
13	HMLIDLHFR	COSMIC	12.86	9	c.559A > T	p.M187L	FMR1	None
14	QVQLLQQQ	COSMIC	12.12	8	c.593A > T	p.Q198L	TFAP4	None
15	DSNRNLDLDSIIA	COSMIC	23.42	13	c.914A > G	p.N305S	KRT79	None
16	QVQIGTHSPP	COSMIC	12.92	10	c.2125G > A	p.A709T	PHEX	None

NetMHCpan predicted binding of at least one HLA allele for 5 of the mutant peptides (Table 4). Further investigation of the mutation peptides exclusively found in HCC revealed the presence of aminomethyltransferase (AMT) _p.V81A, integrin alpha-L (ITGAL) _p.I525T, and interleukin-2 receptor subunit beta (IL2RB) _p.R9H. Interestingly, myotubularin-related protein 6 (MTMR6) _p.A599T was also identified in large intestine cancer, while cellular tumor antigen p53 (TP53) _p.P72R was confirmed in multiple cancers affecting the bone, skin, meninges, and large intestine (supplementary Fig. S4a). These findings suggests that peptides derived from mutations occurring in multiple cancers could potentially serve as neoantigens, stimulating tumor cytotoxic T-cells against a variety of cancers. Next, we compared the spectral quality and intensities of mutant peptides to those of wild-type peptides in immunopeptidomes. The intensities of mutant peptides showed no significant different compared to those of normal peptides (Fig. 6a). Specifically, the intensities of mutant peptides were evenly distributed across the overall peptide (Fig. 6c), with the majority of mutant peptides falling within a linear range. However, the hyperscores of mutant peptides were significantly lower than those of normal peptides, indicating poorer spectrum quality for the mutant peptides (Fig. 6b). The distribution of hyperscores revealed that mutant peptides were mainly concentrated in the sub-average region (Fig. 6d). A similar distribution pattern was observed in HLA-I affinity of mutant peptides (Fig. 6e).To assess the spectrum quality of mutant peptides, a manual inspection was performed, which revealed a high product ion coverage, particularly at the mutant amino acid, in spectra with high hyperscores. This finding increased our confidence in the accuracy of the mutant peptides (Fig. 6f). Conversely, it was also observed that some spectra with high quality were ranked with low hyperscores (supplementary Fig. S4a, b). However, the majority of mutant peptides had lower spectrum quality than wild-type peptides. Common observations in spectra of low-quality peptides included incomplete product ion coverage and low relative intensity of product ions, which can result in low hyperscores. Furthermore, incomplete product ion coverage may lead to single or multiple amino acid mismatches, thus resulting in wild-type peptides being mistaken as mutant peptides. This discrepancy also explains the lower proportion of binders among mutant peptides compared to wild-type peptides. For further analysis, 3 mutant peptides were selected based on their affinity and satisfactory spectrum quality.

Table 4

Summaries of HLA affinity, molecular docking energy score
No.	Peptide	Type	HLA affinity (nM)	HLA Allele	HLA template	Docking energy score
1	SLFDASHML	Mutant	4.93	HLA-A0201	1DUZ	-232.927
2	SLFDVSHML	Wild type	4.63	HLA-A0201	1DUZ	-256.297
3	RYSEYTEEF	Mutant	9.98	HLA-A2402	5HGA	-260.960
4	RYSEYAEEF	Wild type	13.32	HLA-A2402	5HGA	-213.461

We performed molecular structure prediction and peptide-protein docking modeling to confirm the binding affinity of mutant peptides. The results indicated no significant difference in the structure and hydrophobicity of the molecular surface between the mutant peptide "SLFDASHML" and the wild-type peptide "SLFDVSHML" (Fig. 6g). Similarly, the amino acid substitution resulting from the MTMR6 _p.A599T mutation did not significantly modify the structure or surface hydrophobicity in "RYSEYTEEF" (Fig. 6h). The predictions from NetMHCpan indicated that these mutant peptides, which exhibited strong binding, had minimal difference in binding affinity compared to the wild-type peptides (Table 4). This could be explained by the fact that the mutation site does not align with the canonical anchor motif, which is typically involves amino acid at the P2 or PΩ position in peptides. To further examine the binding ability of mutant peptides to HLA-I molecules, we employed peptide-protein docking using HPepDock. The mutant peptide "SLFDASHML" and wild-type peptide "SLFDVSHML" were examined in complex with HLA-A0201, while the mutant peptide "RYSEYTEEF" and wild-type peptide "RYSEYAEEF" were investigated in complex with HLA-A2402(Fig. 6i, j). The results revealed that the binding energy of HLA-I molecule for the mutant peptide was comparable to that of the wild-type peptide, supporting the predicted HLA-I affinity. The mutant peptide and wild-type peptide exhibit slight differences in their position and conformation within the protein. Furthermore, the hydrogen bond between the peptide and protein has undergone alterations. This finding suggests that amino acid alterations resulting from mutations can impact the peptide's ability to bind to HLA-I molecules, indicating that mutations, even if not situated at the P2 or PΩ position, can still influence the affinity of peptides for the HLA-I molecule.

Neoantigen-based immunotherapy, comprised of adoptive cell therapy, tumor vaccines, and bi-specific antibody therapy, holds significant potential for the treatment of cancers[5–7]. The successful identification of tumor-specific mutant peptides is essential for uncovering potential neoantigens. Immune cells have the ability to recognize tumor cells that present immunogenic mutant peptides on cell surface, resulting in engagement with cytotoxic T cells[37, 38]. However, the identification of neoantigens remains a challenge. Regardless of whether it is based on genomics or whole cell genomic-proteomics, there are several limitations and shortcomings[12, 17]. The low identification efficiency and uncertain immunogenicity significantly restrict the clinical application of neoantigen-based therapy. The discovery of mutant peptides from the HLA immunopeptidome holds significant importance. Recent studies have begun to investigate the HLA immunopeptidome, and emerging evidence has shown successful identification of neoantigens in melanoma, lung cancer, glioblastoma and breast cancer[12, 39–41]. However, this method is still relying on genomic-proteomic identification and the utilization of genomic databases for the discovery of mutant peptides.

Previous studies have suggested that neoantigens are more likely to be discovered in tumors with high TMB[19, 42, 43]. Moreover, low TMB cancers typically exhibit decreased responsiveness to current first-line therapies, such as immune checkpoint therapy and targeted therapy[20]. This further emphasizes the importance of neoantigens. HCC is characterized by a low-to-medium TMB, indicating a relatively low mutation frequency (less than 5 mutations per Mb gene). However, the combination of low mutation frequency, inadequate sequencing depth in WES, and high level of heterogeneity poses challenges in identifying neoantigens in HCC. In a study involving 16 HCC patients, only 11 neoantigens were identified from a total of 1,039 non-synonymous mutations. Notably, no neoantigens were detected in HLA immunopeptides[19], highlighting the infrequent presentation of mutant peptides by HLA complexes in HCC. This finding underscores the difficulties associated with neoantigens identification in HCC. Due to the challenges in obtaining clinical samples from patients with advanced HCC, we initially described the HLA-I immunopeptidome of the HepG2 cell line. This dataset serves as a valuable reference for future studies involving other cell lines and tissue samples from HCC.

In this study, we initially characterized the HLA-I immunopeptidomes of HepG2 cell line using immunoprecipitation. Although the western blot analysis revealed an enrichment efficiency of approximately 50% compared to TCL. Previous research has demonstrated that HLA-I immunopeptides can be completely eluted from HLA-I molecules at a pH of 3.3[44, 45]. In this manuscript, the elution buffer, primarily consisting of 0.15% TFA, was adjusted to a pH of approximately 2.0 to ensure the elution of the majority of peptides for subsequent MS analysis. Surprisingly, one mutant peptide was identified, indicating a relatively low efficiency in discovering neoantigens using the HepG2 WES-based database. Comparable outcomes have been observed in previous studies as well. For instance, lung adenocarcinoma cell lines H1975 and PC9 detected only 3 and 4 mutant peptides, respectively; Moreover, 5 patient-derived organoids from colorectal cancers revealed the presence of 3 potential neoantigens; and even in high TMB cancers like melanoma, 5 potential neoantigens were identified using a criterion of FDR = 1%[16, 46]. These studies imply that personal WES or genomic-based database can be used for identifying neoantigens, despite their relatively low efficiency. Efforts have been made to address this issue. For example, immune peptide databases that compile immunopeptidome information from global research have been established, such as the Immune Epitope Database (IEDB), TSNAdb, and databasePepNeo[47–49]. These databases provide sequences of mutant peptides, information about HLA-I affinity, and experimental evidence of immunogenicity. However, neoantigen databases primarily serve to verify identified peptides rather than discover mutant peptides from MS data files.

We generated an expanded database of HCC somatic mutation, which includes the most comprehensive collection. This database was constructed using mutation genes obtained from COSMIC, widely acknowledged as a detailed resource for cancer somatic mutations and extensively utilized in various cancer research studies. A similar strategy was employed to enhance the identification of non-canonical peptides. This was achieved by generating an ENSEMBL-based proteogenomics database, which unveiled that non-canonical peptide constituted over 5% of the total number of identified peptides in 65 cell line datasets[50]. However, databases generated from filtered non-synonymous mutations tend to be large, significantly impacting search efficiency and compatibility with certain software. To address this, we compressed the database by retaining only the upstream and downstream amino acid sequences surrounding the mutation site. By employing this strategy, we successfully generated a COSMIC-based database that is less than 50 Mb in size. This compressed database can be utilized with the majority of proteomics analysis software. The main advantage of the COSMIC-based database is its ability to encompass a wide range of mutations, including low frequency mutations, which potentially uncovers a greater number of mutant peptides compared to HepG2 WES-based database. This study presents high-quality HepG2 HLA-I immunopeptidomes for the evaluation of COSMIC-based database. Our results demonstrate that the COSMIC-based database successfully identifies a higher number of mutant peptides without the reliance on prior WES sequence data. Further exploration revealed its ability to discover immunopeptides with high spectral quality using COSMIC-based database. However, it is important to note that some peptides identified in the COSMIC-based database exhibited relatively low quality, potentially leading to the misidentification of wild-type peptides as mutant peptides. To mitigate this issue, manual inspection is necessary to remove peptides with poor spectral quality. Overall, adopting this strategy improves the identification of mutant peptides from HLA-I immunopeptidomes without a significant increase in the false positive rate.

To confirm the potential of the COSMIC-based database as a resource for mutant peptides, we evaluated the immunoaffinity of these peptides to assess the feasibility of employing the COSMIC-based database for neoantigen identification. These mutant peptides, characterized by high spectral quality, also demonstrated strong HLA-I affinity according to in silico prediction. Within the COSMIC-based database, MTMR6 _p.A599T and TP53 _p.P72R were also found in other cancers, indicating the occurrence tumorigenic signaling pathways shared among various cancers. TP53 _p.P72R was observed at a frequency ranging from 0.4 to 0.7 in all ethnic groups, and its association with an increased risk of cancer remains controversial[51]. Additionally, two mutant peptides were validated in the IEDB, confirming their presentation by tumors and their affinity for HLA-I molecules. Through peptide-protein docking modeling, it was revealed that these peptides exhibited lower binding energy to HLA-I molecules, indicating a higher likelihood of binding. HLA-I molecules bind antigen peptide through a groove structure, with a typical binding to the second and last amino acid of antigen peptides[52, 53]. The anchor residues Leu/Met and Leu/Val are commonly observed in immunopeptides binding to HLA-A0201. Both the mutant peptide "SLFDASHML" and the corresponding wild-type peptide "SLFDVSHML" fulfill this requirement. Moreover, the mutant peptides "RYSEYTEEF" meet the binding criteria for HLA-A2402. It is important to noted that substitutions of amino acids resulting from mutations at positions other than P2 or PΩ might have a limited impact on the binding process. Indeed, previous studies have confirmed that neoantigens can arise not only from coding genes, but also from non-coding region or even microorganism[54, 55]. Mutant peptides originating from these sources identified from HLA immunopeptidome are still undergoing investigation.

In summary, we present an analysis of the characteristics of HepG2 HLA-I immunopeptidome. To enhance the efficiency of neoantigen identification from the HLA-I immunopeptidome, we developed an HCC COSMIC-based mutation database. Our results suggest that the COSMIC-based database demonstrates superior effectiveness in identifying tumor-specific mutant peptides and neoantigens compared to the HepG2 WES-based database. This strategy allows us with a broader range of potential neoantigen targets for precision immunotherapy.

Ethics approval and consent to participate

Not applicable.

Consent for publication

The corresponding author has received consent for publication.

Competing interests

The authors declare no potential conflicts of interest.

Funding

This work was funded by the MOST (2022YFA1304600 and 2020YFE0202200), the National Natural Science Foundation of China (NSFC) (32141003, 32101190, 32070668 and 32071431); the CAMS Innovation Fund for Medical Sciences (2019-I2M-5-017 and 2022-I2M-C&T-B-082), the Foundation of State Key Lab of Proteomics (SKLP-K201704, SKLP-C202002 and SKLP-K201901); and the Mass Spectrometry Platform Open Project of National Center for Protein Sciences Beijing (2021-NCPSB-001).

Author contributions

Ping Xu and Shichun Lu initiated the project and oversee all aspects of the project. Fangzhou Wang, Zhenpeng Zhang and Yudai Yang performed the immunopeptide enrichment and LC-MS analysis. Fangzhou Wang, Zhenpeng Zhang, Mingsong Mao and Ping Xu performed all data analyses. Zhenpeng Zhang, Ping Xu and Fangzhou Wang prepared all of the figures. Fangzhou Wang wrote the manuscript with input from all the authors. All authors have given approval to the final version of the manuscript.

Acknowledgments

The authors thank Drs Yanchang Li, Tao Zuo, Yuan Li in Xu Lab at the National Center for Protein Sciences (Beijing) for experiments and discussion.

Availability of data and materials

The raw MS-based sequencing files of HepG2 HLA-I immunopeptidome have been deposited to iProX Integrated Proteome Resources with identifier IPX0007010000, ProteomeXchange identifier PXD045203. The transcriptomic data have been published in our previous work³².

Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209-49.
Mokdad AA, Singal AG, Marrero JA, Zhu H, Yopp AC. Vascular Invasion and Metastasis is Predictive of Outcome in Barcelona Clinic Liver Cancer Stage C Hepatocellular Carcinoma. J Natl Compr Canc Netw. 2017;15(2):197-204.
Tang H, Cao Y, Jian Y, Li X, Li J, Zhang W, et al. Conversion therapy with an immune checkpoint inhibitor and an antiangiogenic drug for advanced hepatocellular carcinoma: A review. Biosci Trends. 2022;16(2):130-41.
Zhang W, Hu B, Han J, Wang Z, Ma G, Ye H, et al. Surgery After Conversion Therapy With PD-1 Inhibitors Plus Tyrosine Kinase Inhibitors Are Effective and Safe for Advanced Hepatocellular Carcinoma: A Pilot Study of Ten Patients. Front Oncol. 2021;11:747950.
Rosenberg SA, Restifo NP. Adoptive cell transfer as personalized immunotherapy for human cancer. Science. 2015;348(6230):62-8.
Tran E, Turcotte S, Gros A, Robbins PF, Lu YC, Dudley ME, et al. Cancer immunotherapy based on mutation-specific CD4+ T cells in a patient with epithelial cancer. Science. 2014;344(6184):641-5.
Chen F, Zou Z, Du J, Su S, Shao J, Meng F, et al. Neoantigen identification strategies enable personalized immunotherapy in refractory solid tumors. J Clin Invest. 2019;129(5):2056-70.
Yamamoto TN, Kishton RJ, Restifo NP. Developing neoantigen-targeted T cell-based treatments for solid tumors. Nat Med. 2019;25(10):1488-99.
Jiang T, Shi T, Zhang H, Hu J, Song Y, Wei J, et al. Tumor neoantigens: from basic research to clinical applications. J Hematol Oncol. 2019;12(1):93.
Hu Z, Ott PA, Wu CJ. Towards personalized, tumour-specific, therapeutic vaccines for cancer. Nat Rev Immunol. 2018;18(3):168-82.
Lang F, Schrors B, Lower M, Tureci O, Sahin U. Identification of neoantigens for individualized therapeutic cancer vaccines. Nat Rev Drug Discov. 2022;21(4):261-82.
Bassani-Sternberg M, Braunlein E, Klar R, Engleitner T, Sinitcyn P, Audehm S, et al. Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry. Nat Commun. 2016;7:13404.
Jurtz V, Paul S, Andreatta M, Marcatili P, Peters B, Nielsen M. NetMHCpan-4.0: Improved Peptide-MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data. J Immunol. 2017;199(9):3360-8.
Bulik-Sullivan B, Busby J, Palmer CD, Davis MJ, Murphy T, Clark A, et al. Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification. Nat Biotechnol. 2018.
Wang W, Yuan T, Ma L, Zhu Y, Bao J, Zhao X, et al. Hepatobiliary Tumor Organoids Reveal HLA Class I Neoantigen Landscape and Antitumoral Activity of Neoantigen Peptide Enhanced with Immune Checkpoint Inhibitors. Adv Sci (Weinh). 2022;9(22):e2105810.
Qi YA, Maity TK, Cultraro CM, Misra V, Zhang X, Ade C, et al. Proteogenomic Analysis Unveils the HLA Class I-Presented Immunopeptidome in Melanoma and EGFR-Mutant Lung Adenocarcinoma. Mol Cell Proteomics. 2021;20:100136.
Kumar D, Bansal G, Narang A, Basak T, Abbas T, Dash D. Integrating transcriptome and proteome profiling: Strategies and applications. Proteomics. 2016;16(19):2533-44.
Zhang Q, Lou Y, Yang J, Wang J, Feng J, Zhao Y, et al. Integrated multiomic analysis reveals comprehensive tumour heterogeneity and novel immunophenotypic classification in hepatocellular carcinomas. Gut. 2019;68(11):2019-31.
Loffler MW, Mohr C, Bichmann L, Freudenmann LK, Walzer M, Schroeder CM, et al. Multi-omics discovery of exome-derived neoantigens in hepatocellular carcinoma. Genome Med. 2019;11(1):28.
Yarchoan M, Hopkins A, Jaffee EM. Tumor Mutational Burden and Response Rate to PD-1 Inhibition. N Engl J Med. 2017;377(25):2500-1.
Garraway LA, Lander ES. Lessons from the cancer genome. Cell. 2013;153(1):17-37.
Meienberg J, Bruggmann R, Oexle K, Matyas G. Clinical sequencing: is WGS the better WES? Hum Genet. 2016;135(3):359-62.
Lu L, Jiang J, Zhan M, Zhang H, Wang QT, Sun SN, et al. Targeting Neoantigens in Hepatocellular Carcinoma for Immunotherapy: A Futile Strategy? Hepatology. 2021;73(1):414-21.
Bouwmeester R, Gabriels R, Hulstaert N, Martens L, Degroeve S. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods. 2021;18(11):1363-9.
Ghandi M, Huang FW, Jane-Valbuena J, Kryukov GV, Lo CC, McDonald ER, 3rd, et al. Next-generation characterization of the Cancer Cell Line Encyclopedia. Nature. 2019;569(7757):503-8.
Cock PJ, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25(11):1422-3.
Tate JG, Bamford S, Jubb HC, Sondka Z, Beare DM, Bindal N, et al. COSMIC: the Catalogue Of Somatic Mutations In Cancer. Nucleic Acids Res. 2019;47(D1):D941-D7.
UniProt C. UniProt: the Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023;51(D1):D523-D31.
Kong AT, Leprevost FV, Avtonomov DM, Mellacheruvu D, Nesvizhskii AI. MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat Methods. 2017;14(5):513-20.
Boegel S, Lower M, Bukur T, Sahin U, Castle JC. A catalog of HLA type, HLA expression, and neo-epitope candidates in human cancer cell lines. Oncoimmunology. 2014;3(8):e954893.
Orenbuch R, Filip I, Comito D, Shaman J, Pe'er I, Rabadan R. arcasHLA: high-resolution HLA typing from RNAseq. Bioinformatics. 2020;36(1):33-40.
Li Y, Zhang Z, Jiang S, Xu F, Tulum L, Li K, et al. Using transcriptomics, proteomics and phosphoproteomics as new approach methodology (NAM) to define biological responses for chemical safety assessment. Chemosphere. 2023;313:137359.
Andreatta M, Alvarez B, Nielsen M. GibbsCluster: unsupervised clustering and alignment of peptide sequences. Nucleic Acids Res. 2017;45(W1):W458-W63.
Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, et al. RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res. 2023;51(D1):D488-D508.
Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, et al. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 2021;30(1):70-82.
Sarkizova S, Klaeger S, Le PM, Li LW, Oliveira G, Keshishian H, et al. A large peptidome dataset improves HLA class I epitope prediction across most of the human population. Nat Biotechnol. 2020;38(2):199-209.
Olsson N, Heberling ML, Zhang L, Jhunjhunwala S, Phung QT, Lin S, et al. An Integrated Genomic, Proteomic, and Immunopeptidomic Approach to Discover Treatment-Induced Neoantigens. Front Immunol. 2021;12:662443.
Zhu Y, Liu J. The Role of Neoantigens in Cancer Immunotherapy. Front Oncol. 2021;11:682325.
Bonte PE, Arribas YA, Merlotti A, Carrascal M, Zhang JV, Zueva E, et al. Single-cell RNA-seq-based proteogenomics identifies glioblastoma-specific transposable elements encoding HLA-I-presented peptides. Cell Rep. 2022;39(10):110916.
Aparicio B, Reparaz D, Ruiz M, Llopiz D, Silva L, Vercher E, et al. Identification of HLA class I-restricted immunogenic neoantigens in triple negative breast cancer. Front Immunol. 2022;13:985886.
Dimou A, Grewe P, Sidney J, Sette A, Norman PJ, Doebele RC. HLA Class I Binding of Mutant EGFR Peptides in NSCLC Is Associated With Improved Survival. J Thorac Oncol. 2021;16(1):104-12.
Chalmers ZR, Connelly CF, Fabrizio D, Gay L, Ali SM, Ennis R, et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 2017;9(1):34.
Marty R, Kaabinejadian S, Rossell D, Slifker MJ, van de Haar J, Engin HB, et al. MHC-I Genotype Restricts the Oncogenic Mutational Landscape. Cell. 2017;171(6):1272-83 e15.
Caron E, Kowalewski DJ, Chiek Koh C, Sturm T, Schuster H, Aebersold R. Analysis of Major Histocompatibility Complex (MHC) Immunopeptidomes Using Mass Spectrometry. Mol Cell Proteomics. 2015;14(12):3105-17.
Sturm T, Sautter B, Worner TP, Stevanovic S, Rammensee HG, Planz O, et al. Mild Acid Elution and MHC Immunoaffinity Chromatography Reveal Similar Albeit Not Identical Profiles of the HLA Class I Immunopeptidome. J Proteome Res. 2021;20(1):289-304.
Newey A, Griffiths B, Michaux J, Pak HS, Stevenson BJ, Woolston A, et al. Immunopeptidomics of colorectal cancer organoids reveals a sparse HLA class I neoantigen landscape and no increase in neoantigens with interferon or MEK-inhibitor treatment. J Immunother Cancer. 2019;7(1):309.
Wu J, Zhao W, Zhou B, Su Z, Gu X, Zhou Z, et al. TSNAdb: A Database for Tumor-specific Neoantigens from Immunogenomics Data Analysis. Genomics Proteomics Bioinformatics. 2018;16(4):276-82.
Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 2019;47(D1):D339-D43.
Tan X, Li D, Huang P, Jian X, Wan H, Wang G, et al. dbPepNeo: a manually curated database for human tumor neoantigen peptides. Database (Oxford). 2020;2020.
Umer HM, Audain E, Zhu Y, Pfeuffer J, Sachsenberg T, Lehtio J, et al. Generation of ENSEMBL-based proteogenomics databases boosts the identification of non-canonical peptides. Bioinformatics. 2022;38(5):1470-2.
Doffe F, Carbonnier V, Tissier M, Leroy B, Martins I, Mattsson JSM, et al. Identification and functional characterization of new missense SNPs in the coding region of the TP53 gene. Cell Death Differ. 2021;28(5):1477-92.
Rossjohn J, Gras S, Miles JJ, Turner SJ, Godfrey DI, McCluskey J. T cell antigen receptor recognition of antigen-presenting molecules. Annu Rev Immunol. 2015;33:169-200.
Rudolph MG, Stanfield RL, Wilson IA. How TCRs bind MHCs, peptides, and coreceptors. Annu Rev Immunol. 2006;24:419-66.
Laumont CM, Vincent K, Hesnard L, Audemard E, Bonneil E, Laverdure JP, et al. Noncoding regions are the main source of targetable tumor-specific antigens. Sci Transl Med. 2018;10(470).
Kalaora S, Nagler A, Nejman D, Alon M, Barbolin C, Barnea E, et al. Identification of bacteria-derived HLA-bound peptides in melanoma. Nature. 2021;592(7852):138-43.

Download PDF

Journal Publication

published 10 Feb, 2024

Read the published version in Journal of Translational Medicine →

Reviewers agreed at journal
06 Oct, 2023
Reviewers invited by journal
23 Sep, 2023
Editor assigned by journal
14 Sep, 2023
First submitted to journal
12 Sep, 2023

You are reading this latest preprint version

COSMIC-based mutation database enhances identification efficiency of HLA-I immunopeptidome

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Materials and method

Western blot analysis to assess HLA-I

HLA-I immunopeptidome enrichment and purification

LC-MS/MS analysis of HLA-I immunopeptidome

HLA-I typing and HLA binding prediction

Peptides-protein docking modeling

Results

Discover HepG2 specific mutant peptide using HepG2 WES-based database

Generation of COSMIC-based Database

Evaluation of HepG2 WES-based and COSMIC-based database

Discover HepG2 specific mutant peptide using COSMIC-based database

Discussion

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 1