Skip to main content

A new prediction model of hepatocellular carcinoma based on N7-methylguanosine modification



Hepatocellular carcinoma (HCC) is a kind of primary liver cancer. It is a common malignant tumor of digestive system that is difficult to predict the prognosis of patients. As an important epigenetic modification, N7 methyl guanosine (m7G) is indispensable in gene regulation. This regulation may affect the development and occurrence of cancer. However, the prognosis of long non coding RNAs (lncRNAs) in HCC is limited, especially how m7G-related lncRNAs regulate the development of HCC has not been reported.


The Cancer Genome Atlas (TCGA) provides us with the expression data and corresponding clinical information of HCC patients we need. We used a series of statistical methods to screen four kinds of m7G-related lncRNAs related to HCC prognosis and through a series of verifications, the results were in line with our expectations. Finally, we also explored the IC50 difference and correlation analysis of various common chemotherapy drugs.


Our study identified four differentially expressed m7g-related lncRNAs associated with HCC prognosis. Survival curve analysis showed that high risk lncRNAs would lead to poor prognosis of HCC patients. M7G signature's AUC was 0.789, which shows that the prognosis model we studied has certain significance in predicting the prognosis of HCC patients. Moreover, our study found that different risk groups have different immune and tumor related pathways through gene set enrichment analysis. In addition, many immune cell functions are significantly different among different risk groups, such as T cell functions, including coordination of type I INF response and coordination of type II INF response. The expression of PDCD1, HHLA2, CTLA-4 and many other immune checkpoints in different risk groups is also different. Additionally, we analyzed the differences of IC50 and risk correlation of 15 chemotherapeutic drugs among different risk groups.


A novel lncRNAs associated with m7G predicts the prognosis of HCC.

Peer Review reports


HCC is one of the most common cancers worldwide, with the highest mortality rate among primary liver cancer [1, 2]. Although curable therapies are available for early-stage HCC, the majority of patients have progressed to advanced HCC at the time of diagnosis [3]. Surgical resection is often the only option for patients with advanced HCC.One of the recommended treatments for unresectable intermediate HCC is transarterial chemoembolization (TACE) [4, 5]. However, TACE cannot completely cure HCC. Drug‑eluting bead (DEB)-TACE, which is considered to be more effective and safer, was also found to have no significant difference with conventional TACE [6]. With the progress of medicine, the overall survival rate (OS) of HCC patients is increasing, but the prognosis prediction and detailed molecular mechanism of HCC are still unclear, which needs to be further clarified [7]. So, it is imperative to find new and effective prognostic markers for HCC. Presently, different modifications of RNA have been found to have more than 160 types [8]. RNA modifications play an important role in gene expression regulation [9]. It has been reported that RNA modification can affect RNA processing, transport, stability and mRNA translation [10]. Therefore, RNA modification plays a key role in developing different diseases, including HCC [11].

N7-methylguanosine (m7G) is an essential epigenetic modification, playing a vital role in gene expression regulation [12]. The methylation of guanosine (G) at N7 position is called m7G, which exists in mRNA 5′cap [13], rRNA [14], tRNA [13], and internal mRNA regions [15]. It is not difficult to see that this modification plays a crucial role in regulating RNA processing, function and metabolism. Research shows that many diseases are related to this modification [16]. For example, WDR4 (WD Repeat Domain 4) is a human methyltransferase complex. Its mutation can lead to facial malformation and even seizures [16]. Previous studies have reported that the degree of neural differentiation is affected by the expression of WDR4 [17]. At the same time, the researchers found that the m7G tRNA methyltransferase METTL1 (methyltransferase like 1) is related to the growth of cancer cells [18]. It is reported that LncRNA is a subset of about 200 nt RNA molecules that regulate gene expression [19]. The researchers found that autophagy-related lncRNAs significantly affect the prognoses of patients with colon adenocarcinoma [20]. An another study found that the fatty aldehyde dehydrogenase (FALDH) gene,ALDH3A2 could effectively predict the prognosis of gastric carcinoma (GC) patients and may be an independent prognostic biomarker [21]. To sum up, we found that lncRNAs are also involved in gene regulation, tumorigenesis, disease development and metastasis [22]. This provides a new way for us to predict tumor prognosis and find new therapeutic targets through bioinformatics analysis. But, the study of m7G regulator modifying lncRNAs to affect tumor development is limited [23], especially in HCC. Therefore, determining that m7G related lncRNAs are as part of the process of HCC will help to determine effective and targeted treatment. This also makes our research very meaningful.

Materials and methods

Data sources

Obtain transcriptome information and clinical information of HCC from TCGA database( In the subsequent analysis steps, patients lacking survival information were excluded. Detailed transcriptome data were downloaded from TCGA-LICH queue, including 374 HCC tissues and 50 precancerous tissues. First, hundreds of lnRNAs relating to 29 acknowledged m7G-related genes were selected. In this study m7G-related genes included are shown in Supplementary Table S1 [24]. In our study, we used average probe strength to show the expression level of lncRNA/genes with different probes.

Screening of differential genes

We use "limma" (R package) to identify m7G-related genes (m7GG) differentially expressed between cancer and normal tissues [25]. The definition of DE-m7GGs (differentially expressed m7GGs) is that the adjusted | log2FC |> 0 and p value < 0.01 are considered as differentially expressed m7GGs. (Supplementary Table S2). In order to study the accuracy, we selected differentially expressed genes according to ap-values < 0.01 and |log2 FC|> 1, they include NUDT10, NCBP2, EIF3D, LARP1, AGO2, NUDT11.Presently, two databases, including WikiPathways and BioPlanet, are considered origins of pathway classification to specify the shared pathways among DE-m7GGs.

Identification of m7G-related lnRNAs

Our study evaluated the relationship between m7G related LncRNA and HCC using Pearson correlation analysis. The condition of significant correlation is that when p < 0.001, the correlation coefficient | R2 |> 0.4. AND four hundred and sixty-one m7G-related lncRNAs (Supplementary Table S3).

Construction of prognosis model of m7G signature

First, the TCGA-LICH dataset was divided into three parts, one as the joint set, one as the test set, and one as the training set. In our study, we first conducted a single factor analysis of m7G-related lncRNAs. After univariate Cox analysis, the lncRNA with p < 0.05 was screened, and then further analyzed by the least absolute contraction and selection operator (LASSO). Finally, we determined the risk lncRNAs in the model through multivariate Cox. Finally, the risk scores of the patients we studied were calculated as follows: (Coefficient TMCC1-AS1 × expression of TMCC1-AS1) + (Coefficient ZNF232-AS1 × expression of ZNF232-AS1) + (Coefficient AL031985.3 × expression of AL031985.3) + (Coefficient MKLN1-AS × expression of MKLN1-AS). If the risk score is greater than the median number, we call it high- risk group; otherwise, it is called low-risk group. We used R package "survival" to evaluate the survival rate between different risk groups [26]. In order to verify whether the risk score of the model is meaningful, we also used the R package "Survival ROC" to analyze receiver operating characteristics (ROC) [27].

Analysis combined with clinical related information

We obtained the clinicopathological information of each HCC patient from TCGA to analyze the correlation between the clinical information of HCC and the prognosis model we studied. Then, chi square test was used to evaluate the difference of clinicopathological characteristics between different risk groups. Wilcoxon test is also an important test method selected by us. We use it for analysis to evaluate the difference of risk scores in different groups according to clinical pathological characteristics.

Nomogram analysis

Our study used R package “rms” to analyze the 1-year, 3-year and 5-year survival rates of HCC patients, and displayed the results with Nomogram. The calibration curve is then used to assess the difference between the survival rate predicted by the nomogram and the actual survival rate. Calculation of area under ROC curve (AUC) shows that the prediction ability of nomogram is more accurate than other prediction factors.

Principal component analysis at molecular level

The significance of principal component analysis (PCA) is to show that our model analysis of m7G-related lncRNAs is superior to direct analysis of all genes set, all lncRNAs set and all m7G- elated lncRNAs.

The predictive ability of m7G signature is superior to other prognostic models

To verify whether our m7G feature is superior to the previously determined HCC model, we compared our HCC features with other features, including 9 lncRNAs features [28], 11 lncRNAs features [29] and 7 lncRNAs features [30]. We also compared the survival curve and AUC value of each prognostic model.

Analysis of immune correlation in prognosis model

We used six algorithms to study the immune response of people with different risks, and the results were visualized using bubble charts. In addition, to compare the difference of tumor infiltrating immune cell (TIIC) subsets among different risk groups, we analyzed them through ssGSEA and evaluated the immune function of the two groups. Immune checkpoints are available from reported literature. And we applied the currently developed deconvolution algorithm, including MCPcounter [31], TIMER [32], QUANTISEQ [33], XCELL [34], EPIC [35], CIBERSORT and CIBERSORT-ABS [36].

Analysis of chemosensitivity

We use “pRRophic” (R package) application algorithm to predict chemical drug IC50, which is significant in studying the difference of chemosensitivity between different risk groups [37]. We used box graph and correlation graph to visualize common chemotherapy drugs.

Validate the prognosis model at the cellular level

In order to further validate our previous analysis, we have carried out validation at the cellular level. We chose to validate ZNF232-AS1 in the prognostic model because it has never been reported, which may provide new research directions. We first used TRIzol to extract total RNA from cell samples. Next, We use QuantiTect reverse transcription kit to reverse transcribe RNA into cDNA. QRT-PCR was used to evaluate the expression of ZNF232-AS1 in different cell lines. We use GAPDH for internal reference and the primer related information is shown in Supplementary Table S4. Human HCC cells, including SK HEP-1, Huh7, HEPG2, HCCLM3,PLC/PRF/5.and the normal human liver cell line L02 were purchased from the cell bank of the Type Culture Collection of the Chinese Academy of Sciences (Shanghai, China).

Statistical analysis

R software is used for data analysis. The ap value of both sides is 0.05, which is considered to be statistically significant.


Differential expression of m7G-related genes

M7G-related genes were compared in normal and HCC tissues. DEGs are visualized by heat map. (| log2(fold change) |≥ 0 and p value < 0.05) (Fig. 1). See Fig. 1B for co-expression of differential genes. Secondly, we explored lncRNAs co-expressing differentially expressed genes (Fig. 1C). We have concluded that m7g-related genes are almost differentially expressed in HCC and normal tissues. In order to study the accuracy, we selected DEGs based on adjusted p-values < 0.01 and |log2 FC|> 1, they include NUDT10, NCBP2, EIF3D, LARP1, AGO2, NUDT11.

Fig. 1
figure 1

Determine m7G related lncRNAs. A Heat map of DE-m7GGs. B Correlation network of DE-m7GGs (red represents positive correlations). Sankey diagram shows correlations between m7G regulators and lncRNAS

Establishment of m7G signature

In conclusion, the entire TCGA cohort contains 375 HCC patients (n = 375). Our study randomly divided them into a training cohort containing 183 HCC patients (n = 183) and an internal validation cohort containing 182 HCC patients (n = 282). Their clinical information is shown in Table 1. Our study found that 40 m7G related lncRNA was contacted with the prognosis of HCC patients, which was obtained through univariate Cox regression analysis (p < 0.05) (Fig. 2A). They were further screened by LASSO regression analysis and multivariate Cox analysis and 4 lncRNAs were screened for inclusion in the prognosis model (Fig. 2B-D). Finally, we explored the co-expression of these four lnRNAs with m7G-related genes (Fig. 2E). The risk score was calculated by the expression level of these four m7G related lncRNAs and Cox regression coefficient. Among them, in order to study the difference of ZNF232-AS1 expression between normal liver cell line (LO-2) and liver cancer cell line (lm-3, skhep1, prf5, Huh7, HEPG2), we analyzed it by qRT-PCR (Fig. 2F). Because ZNF232-AS1 has never been reported in previous studies, we found that the results are significant. Therefore, we conducted the following research.

Table 1 Clinical information of HCC patients in each cohort
Fig. 2
figure 2

A univariate Cox regression analysis showed that 40 m7G-related lncRNAs were associated with the survival of HCC patients. B, C Lasso Cox regression models identified 4 m7G-related lncRNAs. D Four lncRNAs were analyzed by multivariate Cox regression. E The co-expression of these four lnRNAs with m7g-related genes. F The expression difference of ZNF232-AS1 in the model between liver cancer cell line (lm-3, skhep1, prf5, Huh7, HEPG2) and normal liver cell line (LO-2). *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001

Survival results and multivariate examination

Our study found that 0.8765 was the median risk score. Next, we will divide the liver cancer patients in the training queue into a high-risk group containing 91 HCC patients (n = 91) and a low-risk group containing 92 HCC patients (n = 92). Kaplan Meier analysis using log rank test showed that the survival rate of patients in low-risk group was significantly higher than that of patients in high-risk group (Fig. 3A). Similarly, as shown in the risk plot, we found that the mortality of HCC patients was positively correlated with the risk score (Fig. 3B). The time ROC curve showed that 0.789 was the AUC value of 1-year survival period, 0.759 was the AUC value of 2-year survival period and 0.733 was the AUC value of 3-year survival period (Fig. 3C). At the same time, we conducted the same analysis in the entire TCGA cohorts and the internal verification cohorts, and the results are still satisfactory. It is easy to see that through the analysis of the internal training queue, we get the same results as the training set (Fig. 3D-E). In the internal validation cohort, we conducted the same analysis on the internal training queue, and the time ROC curve results showed that 0.732 was the AUC value of 1-year survival, 0.696 was the AUC value of 2-year survival and 0.716 was the AUC value of 3-year survival (Fig. 3F). For the entire TCGA cohorts, our analysis results are the same as above (Fig. 3G-H). And 0.763 was the AUC value of 1-year survival, 0.732 was the AUC value of 2-year survival and 0.730 was the AUC value of 3-year survival (Fig. 3I). In conclusion, the m7G-related lncRNAs we studied can be used to predict the prognosis of HCC patients. More interestingly, we validated the clinical information by grouping. At the same time, we also studied whether the survival curves of different risk groups in different clinical information groups are meaningful, and the results are the same as those we obtained before (Fig. 4).

Fig. 3
figure 3

A In the training cohort, Kaplan–Meier survival curve of the m7G signature. B In the training cohort. the risk plot of the m7G signature (C) In the training cohort, the ROC curve of the m7G signature. D In the internal validation cohort, Kaplan–Meier survival curve of the m7G signature. E In the internal validation cohort, the risk plot of the m7G signature. F In the internal validation cohort, the ROC curve of the m7G signature. G In the entire TCGA cohort, Kaplan–Meier survival curve of the m7G signature. H In the entire TCGA cohort, the ROC curve of the m7G signature. I In the entire TCGA cohort, the ROC curve of the m7G signature

Fig. 4
figure 4

Predictability of prognostic model in different clinicopathological conditions

Correlation between prognosis model and clinical pathological parameters of HCC

Through univariate and multivariate Cox analysis in our study, we found that risk score was independent prognostic factors of OS in HCC patients (Fig. 5A, 5B). More importantly, through our analysis, we found that the risk score was closely related to the grading, T-classification and pathological staging of HCC. We show the more aggressive clinicopathological features by scatter plot (Fig. 5C). These results indicate that the risk score is significantly related to the clinicopathological characteristics of relatively advanced HCC.

Fig. 5
figure 5

A In the training cohort, analysis results of univariate cox regression analysis. B In the training cohort, analysis results of multivariate cox regression analysis. C Scatterplot shows that high risk score levels are significantly closely related to T classification, tumor grade and pathological stage

Nomograms for predicting long-term survival of HCC patients

To more accurately predict the 1-year, 3-year and 5-year survival rates of patients with HCC patients, we constructed a nomogram of the combined risk score of age, stage and so on (Fig. 6A). Calibration chart shows excellent performance of nomogram (Fig. 6B). The AUC value corresponding to the risk score is 0.759 according to the ROC curve, AUC values corresponding to nomogram was 0.772, AUC values corresponding to age was 0.550, AUC values corresponding to gender was 0.445, AUC values corresponding to tumor grade was 0.532, AUC values corresponding to stage was 0.705, AUC values corresponding to T classification was 0.694, AUC values corresponding to N classification was 0.517 and AUC values corresponding to M classification was 0.514 (Fig. 6C). To sum up, the nomograph we built is very meaningful for predicting the survival prognosis of HCC patients.

Fig. 6
figure 6

A Predictive effect of nomogram on HCC patients. B Nomogram calibration map constructed in our study. C The AUC value of nomogram and m7G signature was significantly higher than that of other clinical information

Principal component analysis of m7G signature

For different expression profiles, we used PCA analysis to study different distribution patterns among different risk groups. The low-risk group is represented by blue dots and high-risk group is represented by red dots. We divided four m7G-related lncRNAs set, all genes set, all lncRNAs set, and all m7G-related lncRNAs set into two parts. The results showed that four m7G-related lncRNAs set had the best results (Fig. 7A-D).

Fig. 7
figure 7

A In all genes set, the PCA analysis results. B In all m7G-related lncRNAs set, the PCA analysis results. C PCA analysis results of all lncRNAs set. D PCA analysis results of m7G-related lncRNAs set

Function analysis of M7G signature

In order to study the potential mechanism of these lnRNAs, we conducted enrichment analysis on two different risk populations through Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) and showed the most prominent five functions [38,39,40]. For m7G signature, KEGG path analysis such as 'cytokine cytokine receptor interaction', 'cell cycle', 'hematopoietic cell lineage', 'ECM receptor interaction', 'pathways in cancer' were mainly concentrated in high-risk groups (Fig. 8A). Relatively low-risk groups, mainly enriched in ' clycine serine and threonine metabolism ', ' metabolism cytochrome P450 ', ' fatty acid metabolism ', ' retinol metabolism ', ' complement and coagulation cascades ' (Fig. 8B). As shown in Fig. 8C, ‘B cell activation’ and ‘adaptive immune response’ were significantly enriched for BPs. More importantly, m7G signature may participate in fatty acids in the oxidation pathway in functional analysis β Oxidation, which further indicates that they may participate in the growth of HCC (Fig. 8D). The above analysis shows that risk score may be related to tumor immunity and tumor microenvironment, which is very important for further study of HCC.

Fig. 8
figure 8

GSEA analysis. A-B GO analysis results. C-D KEGG analysis results

M7G signature are obviously superior to other prediction models

To verify that the prediction ability of m7G features is the best, we compared m7G signature with some previous prediction models used to predict HCC. As shown in (Fig. 9A-D), the survival curve p value and AUC of our m7G signature are also better than other models. Yang’s signature showed that 0.789 was the AUC value of 1-year survival,0.618 was the AUC value of 3-year and 0.590 was the AUC value of 5-year survival (Fig. 9B). Wang's signature showed that 0.700 was the AUC value of 1-year survival,0.566 was the AUC value of 3-year and 0.534 was the AUC value of 5-year survival (Fig. 9C). Xu’s signature showed that 0.760 was the AUC value of 1-year survival,0.654 was the AUC value of 3-year and 0.643 was the AUC value of 5-year survival (Fig. 9D). The study found that our prognosis model is obviously superior to other prognosis models.

Fig. 9
figure 9

A M7G signature’s ROC curve and Kaplan–Meier survival curve. B-D Yang et al.’s signature, Wang et al.’s signature and Xu et al.’s signature ROC curve and ROC curve Kaplan–Meier survival curve

Correlation analysis for immunity

In order to compare the immune responses among different risk groups, we used a variety of algorithms, including TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, MCP, XCELL, and EPIC algorithms. The results are shown in Fig. 10A. Through the correlation analysis between ssGSEA related functions, we found that the coordination of T cell functions was significantly different among patients in different risk groups (Fig. 10B). As we all know, immune checkpoint is very important in cancer treatment, and targeted treatment is the key to immunotherapy. Therefore, we further studied the difference of immune checkpoint expression between different risk groups. Our study found that some meaningful immune checkpoints were highly expressed in high-risk groups, including PDCD1, HHLA2 and CTLA-4 (Fig. 10C). More importantly, the TIDE score, exclusion score and dysfunction score of different risk groups were also significantly different (Fig. 10D-F). Our research results show that the effect of immunotherapy on different risk groups is also significantly different, and high risk may benefit more from immunotherapy. Therefore, our risk model may provide a new way to treat HCC.

Fig. 10
figure 10

A Bubble diagram of immune response of different risk groups based on 7 different algorithms. B Analysis of correlation between related functions by ssGSEA and immune cell subsets. C Expression of immune checkpoint among different risk groups. D-F Sensitivity analysis of immunotherapy for different risk population based on TIDE algorithm

Analysis of differences between different risk groups for chemotherapy

We discussed the difference and correlation analysis of IC50 of various commonly used chemotherapy drugs among different risk groups. Using the prophetic algorithm, we showed 15 chemotherapeutic drugs with multiple anticancer mechanisms. In addition, the IC50 and correlation difference boxplots of 15 chemotherapy drugs between different risk groups were drawn (Fig. 11).

Fig. 11
figure 11

The box chart of IC50 and risk correlation difference of 15 chemotherapy drugs between different risk groups


The research of lncRNAs in tumorigenesis and cancer progression has become a research hotspot Because many studies have shown that lncRNAs are potential diagnostic and prognostic markers of cancer. Therefore, people are more and more enthusiastic about the research of the influence of lncRNA on HCC. This also shows that epigenetic modification is related to the growth of various cancers [41]. In addition, some studies have shown that RNA methylation is related to tumor immunity [42]. However, the pathogenesis of HCC is still unclear and deserves further elucidation. Therefore, we speculate that m7G-related lncRNA also plays an important role in the growth of HCC. It may be possible to explore new treatment schemes. So our research is very meaningful.

First, we obtained HCC data from TCGA, and screened 20 differentially expressed m7G-related genes by comparing the expression of transcriptome data in liver cancer tissues and normal tissues. Then, in order to study the accuracy, we screened six m7G-related genes with significant differential expression through statistical values. In this study, 461 lncRNAs were identified by comparing the expression level of lncRNAs in normal tissues and HCC tissues. Then, based on these 461 differentially expressed lncRNAs, we carried out univariate Cox regression, lasso Cox regression, and multivariate Cox regression analyses, and finally constructed the features containing 4 m7G related lncRNAs. It has previously been reported that three lncRNAs including MKLN1 ‑ AS, AL031985.3 and TMCC1-AS1 in this prognostic model are all related to the growth of HCC. High expression of TMCC1-AS1 in HCC patients may lead to shorter survival [43]. However, the specific mechanism of TMCC1-AS1 on HCC progression has not been studied, which may provide us with a new direction. It was previously reported that MKLN1 ‑ AS can increase the expression level of HDGF, thereby promoting the growth of HCC [44]. Research shows that AL031,985.3 may participate in the process of apoptosis and autophagy to affect the growth of HCC [45,46,47]. But ZNF232-AS1 in our prognostic model has never been reported. If the next research is aimed at ZNF232-AS1, by studying its mechanism of action, it may bring new hope to HCC patients and provide new choices in the treatment of HCC. Similarly, research on the role of m7G-related lncRNAs in the growth of HCC is still limited. Therefore, the specific impact mechanism of m7G-related lncRNAs on cancer remains unclear. Therefore, it is necessary to find new prognostic markers of lncRNAs, which is also the original intention of this study. These findings of this study may become a gospel for HCC patients in the future.

It is worth mentioning that we have developed four prognosis models of m7G-related lncRNAs (TMCC1-AS1, MKLN1-AS, ZNF232-AS1, and AL031985.3) and all the research results are satisfactory. For example, the response to immunotherapy varies greatly among different risk groups. In order to confirm the reliability and independence of the prognosis model, we have verified many methods and found that the prognosis model we studied is not only feasible but also practical.

During liver injury, the changes of liver microenvironment are mainly characterized by the imbalance of extracellular matrix (ECM), which will promote the growth of HCC [48]. And the analysis of lncRNAs in the prognosis model by KEGG, we found that these lncRNAs participated in ECM receiver interaction in high-risk groups It shows that these lncRNAs are indeed closely related to HCC. More interestingly, we analyzed the immune related functions of different risk groups through GO, and the results showed that patients in high-risk groups were closely related to immune regulation. Previous studies have shown that immunity also plays an important role in the prognosis of other digestive system tumors [49]. This indicates that the intervention aimed at immunity may affect the growth of tumors. Therefore, we boldly speculate that the low-risk group promotes tumor progression through immunosuppression or immune escape. We analyzed the difference of immune microenvironment between different risk groups to verify our conjecture. Importantly, in the past few years, immunotherapy has become the most important means of anti-cancer. In addition, In our study, we found that most TIIC cells were positively related to the risk score, such as macrophages. Then, we focused on the functional differences of T cells between different risk groups. Through further analysis of different risk groups, we also found that their coordination functions in T-cell type I INF response and type II INF response were different. As we all know, targeting immune checkpoints to treat cancer has become a new anti-cancer method. Studies have shown that the evaluation of the effect of immunotherapy can be reflected by the expression of immune checkpoints [50]. Therefore, we further studied the correlation between M7G signature and the expression of immune checkpoints biomarkers, and studied the expression level of immune checkpoints biomarkers in HCC patients of different risk groups. Nowadays, immunotherapy is very important for patients with advanced HCC. At the same time, studies have shown that immunotherapy with anti PDCD1 and anti CTLA-4 antibodies is effective in the treatment of advanced liver cancer [51]. Studies have shown that inducing M2 polarization and chemotaxis of macrophages in HCC can promote immune escape of HCC [52]. At the same time, NRP1 will be highly expressed in HCC tissues and cell lines, and inhibiting the expression of NRP1 will slow down the development of HCC [53]. However, our research further found that low-risk groups immunization had a greater chance of escaping. To sum up, the M7G signature we have studied can predict the immune characteristics of HCC. Our analysis results show that high-risk people are more sensitive to immunotherapy. Finally, we studied the difference and correlation analysis of IC50 of various commonly used chemotherapy drugs among different risk groups. Importantly, our study found that the effect of sorafenib, a first-line chemotherapy drug for HCC, was significantly different risk groups. Through our research and analysis, we found that high-risk groups are more likely to benefit from chemotherapy.

Inevitably, this study has some limitations. First, the data of this study are all from TCGA, the data used for the analysis may not be fully applicable to patients in a particular region because of differences in genetics between races. There may also be errors and bias caused by the inherent defects of the TCGA, such as insufficient clinical follow-up time, missing clinical data, and data processing by multiple institutions. In the future, we may integrate multiple databases to obtain more clinical samples for data mining. Secondly, there may be statistical differences caused by prognostic analysis software and algorithm, and the validation method of prognostic model is relatively simple, and its molecular mechanism still needs to be further verified by basic experiments. Third, there are still several gaps in the study of m7G-related lncRNAs. Our next work needs to explore the specific mechanism of m7G related lncRNA affecting HCC, in order to further prove its potential as a new therapeutic target for HCC, which we believe is a very meaningful study.


In short, we found a new risk model to predict the prognosis of HCC patients, which includes four m7G-related lncRNAs. Through our research, we found that this prognostic model has important value in predicting the survival rate of HCC patients. Targeting m7G-related lncRNAs to treat HCC may bring good news to HCC patients.

Availability of data and materials

All data and materials used in this study are available in TCGA (



Hepatocellular carcinoma


N7 methyl guanosine


Long non-coding RNA


The Cancer Genome Atlas


Receiver Operating Characteristic


Principal Component Analysis


Tumor Infiltrating Immune Cell


Gene Ontology


Kyoto Encyclopedia Of Genes And Genomes


Overall Survival


WD Repeat Domain 4


Methyltransferase Like 1


Differentially Expressed Gene


Area Under Curve


Recombinant Protein Of Programmed Cell Death 1


Cytotoxic T Lymphocyte-associated Antigen-4


American Joint Committee on Cancer


Extracellular Matrix


HOX Transcript Antisense Intergenic RNA


Urothelial Carcinoma Associated 1


Colon Cancer Associated Transcript


Hepatoma-derived growth factor




Gene Set Enrichment Analysis


Half Maximal Inhibitory Concentration


  1. Ruan Q, Wang H, Burke LJ, Bridle KR, Li X, Zhao CX, et al. Therapeutic modulators of hepatic stellate cells for hepatocellular carcinoma. Int J Cancer. 2020;147(6):1519–27.

    Article  CAS  PubMed  Google Scholar 

  2. Morse MA, Sun W, Kim R, He AR, Abada PB, Mynderse M, et al. The role of angiogenesis in hepatocellular carcinoma. Clin Cancer Res. 2019;25(3):912–20.

    Article  CAS  PubMed  Google Scholar 

  3. Forner A, Llovet JM, Bruix J. Hepatocellular carcinoma. Lancet. 2012;379(9822):1245–55.

    Article  PubMed  Google Scholar 

  4. Heimbach JK, Kulik LM, Finn RS, Sirlin CB, Abecassis MM, Roberts LR, et al. AASLD guidelines for the treatment of hepatocellular carcinoma. Hepatology. 2018;67(1):358–80.

    Article  PubMed  Google Scholar 

  5. Bruix J, Sherman M, American Association for the Study of Liver D. Management of hepatocellular carcinoma: an update. Hepatology. 2011;53(3):1020–2.

    Article  PubMed  Google Scholar 

  6. Wang H, Cao C, Wei X, Shen K, Shu Y, Wan X, et al. A comparison between drug-eluting bead-transarterial chemoembolization and conventional transarterial chemoembolization in patients with hepatocellular carcinoma: A meta-analysis of six randomized controlled trials. J Cancer Res Ther. 2020;16(2):243–9.

    Article  CAS  PubMed  Google Scholar 

  7. Marquardt JU, Galle PR, Teufel A. Molecular diagnosis and therapy of hepatocellular carcinoma (HCC): an emerging field for advanced technologies. J Hepatol. 2012;56(1):267–75.

    Article  PubMed  Google Scholar 

  8. Zhao LY, Song J, Liu Y, Song CX, Yi C. Mapping the epigenetic modifications of DNA and RNA. Protein Cell. 2020;11(11):792–808.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Shi H, Wei J, He C. Where, when, and how: context-dependent functions of RNA methylation writers, readers, and erasers. Mol Cell. 2019;74(4):640–50.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Roost C, Lynch SR, Batista PJ, Qu K, Chang HY, Kool ET. Structure and thermodynamics of N6-methyladenosine in RNA: a spring-loaded base modification. J Am Chem Soc. 2015;137(5):2107–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Chen Y, Peng C, Chen J, Chen D, Yang B, He B, et al. WTAP facilitates progression of hepatocellular carcinoma via m6A-HuR-dependent epigenetic silencing of ETS1. Mol Cancer. 2019;18(1):127.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Dai C, Feng P, Cui L, Su R, Chen W, Wei L. Iterative feature representation algorithm to improve the predictive performance of N7-methylguanosine sites. Brief Bioinform. 2021;22(4):bbaa278.

    Article  PubMed  Google Scholar 

  13. Cowling VH. Regulation of mRNA cap methylation. Biochem J. 2009;425(2):295–302.

    Article  PubMed  Google Scholar 

  14. Guy MP, Phizicky EM. Two-subunit enzymes involved in eukaryotic post-transcriptional tRNA modification. RNA Biol. 2014;11(12):1608–18.

    Article  PubMed  Google Scholar 

  15. Malbec L, Zhang T, Chen YS, Zhang Y, Sun BF, Shi BY, et al. Dynamic methylome of internal mRNA N(7)-methylguanosine and its regulatory role in translation. Cell Res. 2019;29(11):927–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Ma J, Zhang L, Chen J, Song B, Zang C, Liu H. m(7)GDisAI: N7-methylguanosine (m(7)G) sites and diseases associations inference based on heterogeneous network. BMC Bioinformatics. 2021;22(1):152.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Lin S, Liu Q, Lelyveld VS, Choe J, Szostak JW, Gregory RI. Mettl1/Wdr4-Mediated m(7)G tRNA Methylome Is Required for Normal mRNA Translation and Embryonic Stem Cell Self-Renewal and Differentiation. Mol Cell. 2018;71(2):244-255 e245.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Barbieri I, Tzelepis K, Pandolfini L, Shi J, Millan-Zambrano G, Robson SC, et al. Promoter-bound METTL3 maintains myeloid leukaemia by m(6)A-dependent translation control. Nature. 2017;552(7683):126–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Zhang T, Li K, Zhang ZL, Gao K, Lv CL. LncRNA Airsci increases the inflammatory response after spinal cord injury in rats through the nuclear factor kappa B signaling pathway. Neural Regen Res. 2021;16(4):772–7.

    Article  PubMed  Google Scholar 

  20. Wu D, Yin Z, Ji Y, Li L, Li Y, Meng F, et al. Identification of novel autophagy-related lncRNAs associated with a poor prognosis of colon adenocarcinoma through bioinformatics analysis. Sci Rep. 2021;11(1):8069.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Yin Z, Wu D, Shi J, Wei X, Jin N, Lu X, et al. Identification of ALDH3A2 as a novel prognostic biomarker in gastric adenocarcinoma using integrated bioinformatics analysis. BMC Cancer. 2020;20(1):1062.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Gupta RA, Shah N, Wang KC, Kim J, Horlings HM, Wong DJ, et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 2010;464(7291):1071–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Luo Y, Yao Y, Wu P, Zi X, Sun N, He J. The potential role of N(7)-methylguanosine (m7G) in cancer. J Hematol Oncol. 2022;15(1):63.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Tomikawa C. 7-Methylguanosine Modifications in Transfer RNA (tRNA). Int J Mol Sci. 2018;19(12):4080.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Rizvi AA, Karaesmen E, Morgan M, Preus L, Wang J, Sovic M, et al. gwasurvivr: an R package for genome-wide survival analysis. Bioinformatics. 2019;35(11):1968–70.

    Article  CAS  PubMed  Google Scholar 

  27. Mandrekar JN. Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol. 2010;5(9):1315–6.

    Article  PubMed  Google Scholar 

  28. Xu Z, Peng B, Liang Q, Chen X, Cai Y, Zeng S, et al. Construction of a Ferroptosis-Related Nine-lncRNA Signature for Predicting Prognosis and Immune Response in Hepatocellular Carcinoma. Front Immunol. 2021;12: 719175.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Wang A, Lei J. Identification of an 11-lncRNA signature with high performance for predicting the prognosis of hepatocellular carcinoma using bioinformatics analysis. Medicine (Baltimore). 2021;100(5):e23749.

    Article  CAS  PubMed  Google Scholar 

  30. Yang S, Zhou Y, Zhang X, Wang L, Fu J, Zhao X, et al. The prognostic value of an autophagy-related lncRNA signature in hepatocellular carcinoma. BMC Bioinformatics. 2021;22(1):217.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Dienstmann R, Villacampa G, Sveen A, Mason MJ, Niedzwiecki D, Nesbakken A, et al. Relative contribution of clinicopathological variables, genomic markers, transcriptomic subtyping and microenvironment features for outcome prediction in stage II/III colorectal cancer. Ann Oncol. 2019;30(10):1622–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, et al. TIMER: a web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Res. 2017;77(21):e108–10.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Plattner C, Finotello F, Rieder D. Deconvoluting tumor-infiltrating immune cells from RNA-seq data using quanTIseq. Methods Enzymol. 2020;636:261–85.

    Article  CAS  PubMed  Google Scholar 

  34. Aran D, Hu Z, Butte AJ. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 2017;18(1):220.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Racle J, de Jonge K, Baumgaertner P, Speiser DE, Gfeller D. Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data. Elife. 2017;6:e26476.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA. Profiling Tumor Infiltrating Immune Cells with CIBERSORT. Methods Mol Biol. 2018;1711:243–59.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Geeleher P, Cox N, Huang RS. pRRophetic: an R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS ONE. 2014;9(9):e107468.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2023;51(D1):D587–92.

    Article  PubMed  Google Scholar 

  39. Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28(11):1947–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Gu L, Frommel SC, Oakes CC, Simon R, Grupp K, Gerig CY, et al. BAZ2A (TIP5) is involved in epigenetic alterations in prostate cancer and its overexpression predicts disease recurrence. Nat Genet. 2015;47(1):22–30.

    Article  CAS  PubMed  Google Scholar 

  42. Zhang M, Song J, Yuan W, Zhang W, Sun Z. Roles of RNA Methylation on Tumor Immunity and Clinical Implications. Front Immunol. 2021;12:641507.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Chen C, Su N, Li G, Shen Y, Duan X. Long non-coding RNA TMCC1-AS1 predicts poor prognosis and accelerates epithelial-mesenchymal transition in liver cancer. Oncol Lett. 2021;22(5):773.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Gao W, Chen X, Chi W, Xue M. Long noncoding RNA MKLN1AS aggravates hepatocellular carcinoma progression by functioning as a molecular sponge for miR6543p, thereby promoting hepatomaderived growth factor expression. Int J Mol Med. 2020;46(5):1743–54.

    CAS  PubMed  PubMed Central  Google Scholar 

  45. Chen ZA, Tian H, Yao DM, Zhang Y, Feng ZJ, Yang CJ. Identification of a Ferroptosis-Related Signature Model Including mRNAs and lncRNAs for Predicting Prognosis and Immune Activity in Hepatocellular Carcinoma. Front Oncol. 2021;11: 738477.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Wu ZH, Li ZW, Yang DL, Liu J. Development and Validation of a Pyroptosis-Related Long Non-coding RNA Signature for Hepatocellular Carcinoma. Front Cell Dev Biol. 2021;9:713925.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Jia Y, Chen Y, Liu J. Prognosis-Predictive Signature and Nomogram Based on Autophagy-Related Long Non-coding RNAs for Hepatocellular Carcinoma. Front Genet. 2020;11:608668.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Mazza G, Telese A, Al-Akkad W, Frenguelli L, Levi A, Marrali M, et al. Cirrhotic Human Liver Extracellular Matrix 3D Scaffolds Promote Smad-Dependent TGF-beta1 Epithelial Mesenchymal Transition. Cells. 2019;9(1):83.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Li J, Han T, Wang X, Wang Y, Chen X, Chen W, et al. Construction of a Novel Immune-Related mRNA Signature to Predict the Prognosis and Immune Characteristics of Human Colorectal Cancer. Front Genet. 2022;13:851373.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Bethmann D, Feng Z, Fox BA. Immunoprofiling as a predictor of patient’s response to cancer therapy-promises and challenges. Curr Opin Immunol. 2017;45:60–72.

    Article  CAS  PubMed  Google Scholar 

  51. Zongyi Y, Xiaowu L. Immunotherapy for hepatocellular carcinoma. Cancer Lett. 2020;470:8–17.

    Article  PubMed  Google Scholar 

  52. Wang R, Guo H, Tang X, Zhang T, Liu Y, Zhang C, et al. Interferon Gamma-Induced Interferon Regulatory Factor 1 Activates Transcription of HHLA2 and Induces Immune Escape of Hepatocellular Carcinoma Cells. Inflammation. 2022;45(1):308–30.

    Article  CAS  PubMed  Google Scholar 

  53. Lin J, Zhang Y, Wu J, Li L, Chen N, Ni P, et al. Neuropilin 1 (NRP1) is a novel tumor marker in hepatocellular carcinoma. Clin Chim Acta. 2018;485:158–65.

    Article  CAS  PubMed  Google Scholar 

Download references


We acknowledge the TCGA database for free use.


NO Funding.

Author information

Authors and Affiliations



All authors made contributions to conception and design. JL, LY and YW designed the study. LY, ZM, PX and KD collected data from the TCGA and performed the bioinformatics. LY and ZM drafted the main manuscript. JW and JL revised the manuscript. All authors participated in the article and agreed to contribute.

Corresponding author

Correspondence to Jing Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

M7G related genes used in this study.

Additional file 2: Table S2.

The differentially expressed m7G-related genes between HCC and normal samples.

Additional file 3: Table S3.

461 m7G-related lncRNAs.

Additional file 4: Table S4.

Premier sequences for qRT‒PCR Analysis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, L., Wang, Yr., Mou, Zq. et al. A new prediction model of hepatocellular carcinoma based on N7-methylguanosine modification. BMC Gastroenterol 23, 131 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • 7-methylguanosine (m7G) modification
  • lncRNAs
  • Hepatocellular carcinoma
  • Immune infiltration
  • Data mining