*Corresponding Author:
A. S. Wu
Department of Oncology, The Affiliated Zhuzhou Hospital of Xiangya Medical College, Central South University, Zhuzhou, Hunan 412007, China
E-mail: wuanshan2022@163.com
This article was originally published in a special issue, “Advanced Targeted Therapies in Biomedical and Pharmaceutical
Sciences”
Indian J Pharm Sci 2023:85(1) Spl Issue “43-50”

This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms

Abstract

In order to obtain the target gene that affects the prognosis of osteosarcoma and to establish the risk ratio and prognosis of prediction model was the objective of the study. We obtained messenger ribonucleic acid transcriptome data from gene expression Omnibus database and clinical data from therapeutically applicable research to generate effective treatments database. Through intersection of differential genes, we obtained key genes in gene expression Omnibus database and then we used them to conduct survival analysis and establish risk ratio in therapeutically applicable research to generate effective treatments database. We obtained complement C1qA and complement C1qB as the differential key genes through GSE14359, GSE28424, GSE33382 and GSE36001 from gene expression Omnibus database. The expression of complement C1qA and complement C1qB were significantly different in 4 series data (p<0.01). Meanwhile, complement C1qA and complement C1qB have significant correlation in osteosarcoma clinical samples (p=8.61e-64, ρSpearman=0.97, 95 % confidence interval (0.96-0.98). The survival time of overexpression of complement C1qA and complement C1qB samples were significantly prolonged (p=0.0026 and p=0.006), area under the curve=0.689 (95 % confidence interval, 0.575-0.804), 0.702 (95 % confidence interval, 0.595-0.809), 0.641 (95 % confidence interval, 0.513-0.768) respectively. Complement C1qA and complement C1qB over-expression can significantly prolong survival time and can be used as predictors of survival in osteosarcoma.

Keywords

Complement C1qA, complement C1qB, osteosarcoma, prognosis, hazard ratio, gene expression Omnibus database, therapeutically applicable research to generate effective treatments database

Osteosarcoma (OSA) is the most common primary Bone Sarcomas (BS) (incidence: 0.3/100 000/y)[1]. The incidence is higher in adolescents (0.8-1.1/100 000/y at age 15-19 y) but there is a significant 2nd peak in the 7th and 8th decades of life. The male to female ratio is 1.4:1[2,3]. OSA most common lesions were the distal femur and the metaphysis of the proximal tibia with active growth[4]. Low grade intramedullary OSA accounts for 2 % of all OSA and the location of the disease is similar to that of classical OSA[5]. OSA mainly has three subtypes: Intramedullary, surface and extraosseous. Intramedullary high-grade OSA is a classic pathological type, accounting for 80 % of all OSA[6].

At present, OSA main therapy method includes chemotherapy, surgery and radiation therapy. Decades ago, amputation is recommended for patients with OSA because the mortality rate of limb preservation is unacceptable. However, the 5 y survival rate is still only 20 %[7]. With the use of chemotherapy drugs, the survival rate of OSA has been significantly improved. The addition of doxorubicin, cisplatin or cyclophosphamide to highdose methotrexate increased disease-free survival to up to 65 %[8,9]. The 5 y relative survival rate of people with localized appendicular tumours is 70 %, but in those who present with distant metastasis, survival is dramatically diminished to less than 20 %-30 %[10,11].

In the recent decades, researchers have summarized the clinically relevant information about prognostic indicators of poor outcome which include advanced age, secondary OSA (such as from Paget’s disease), elevated serum Lactate Dehydrogenase (LDH), large primary tumor size and axial/proximal tumor location[12,13]. However, it did not further reduce the mortality of OSA, especially metastasis and recurrence. With the rapid development of gene sequencing, more and more tumors are sequenced to improve their survival time by individual treatment. In order to improve the survival rate of OSA, we used Gene Expression Omnibus (GEO) and Therapeutically Applicable Research to Generate Effective Treatment (TARGET) databases to analyze gene sequencing data of OSA tissues and cell lines, and to identify key differential genes and provide ideas for improving treatment method of OSA.

Materials and Methods

Data collection and workflow illustration:

We obtained messenger Ribonucleic Acid (mRNA) transcriptome data from GEO database (https://www.ncbi.nlm.nih.gov/geo/) and clinical data from TARGET (https://ocg.cancer.gov/programs/target) database of osteosarcoma, respectively. In this study, through the intersection of differential genes, we got key genes in GEO database and then we used them to conduct survival analysis and established risk ratio in TARGET database. The workflow is illustrated in fig. 1.

IJPS-data

Fig. 1: Data collection and workflow illustrated in the study

Gene expression profile and intersection of differential key genes analysis:

We selected series data, GSE14359, GSE28424, GSE33382 and GSE36001 from GEO database. Then, we obtained expression profile genes through clusterProfiler package in R version 4.0.2 (http://www.r-project.org/). Next, we used the limma package (version 3.40.6) to analyze the differential genes. The threshold for identifying significant Differentially Expressed Genes (DEGs) was False Discovery Rate (FDR)<0.01 and |log2 (Fold Change (FC))|≥2. Venn diagram was used to show the relationship between differential genes and obtained intersecting differential key genes from four series data.

Clinical samples and mRNA expression:

mRNA expression data (88 samples, Workflow type: HTSeq-Fragments Per Kilobase Million (FPKM)) and clinical information were downloaded from TARGET database. The following samples were excluded, “0” gene expression value and insufficient survival information. A total of 87 patients with OSA with the corresponding clinical features were enrolled in this study. We analyzed the clinical characteristics, including age, sex, primary tumor location, metastasis or non-metastasis, total survival time and survival status (Table 1). Analysis of survival curve and establishment of prediction model for differential key genes in four series data was used in this study.

Clinical characteristics N (%)
Age (y)  
<14 39, (44.32 %)
≥14 49, (55.68 %)
Sex  
Male 51, (57.95 %)
Female 37, (42.05 %)
Primary tumor site  
Arm 6, (6.82 %)
Foot 80, (90.91 %)
Other 2, (2.27 %)
Metastasis  
No 66, (75 %)
Yes 22, (25 %)
Overall survival time (years)  
<2 29, (32.95 %)
2-5 26, (29.55 %)
>5 32, (36.36 %)
C1QA expression  
High 43, (49.43 %)
Low 44, (50.57 %)
C1QB expression  
High 43, (49.43 %)
Low 44, (50.57 %)

Table 1: Clinical Characteristics of 88 Samples of OSA from Target Database

Statistical analysis:

The Student’s t-test (R function t-test) was used to determine whether there were significant differences between the two groups and p-value<0.05 was considered to be statistically significant. Statistical Package for the Social Sciences (SPSS) 21.0 software was used in this study and Receiver Operating Characteristic (ROC) curve and survival curve was drawn.

Results and Discussion

Clinical characteristics of 88 samples of OSA from TARGET database was shown in Table 1. We collected 88 samples of osteosarcoma from TARGET database. They are confirmed with osteosarcoma through pathology diagnosis. 39 (44.32 %) samples of age<14 y and 49 (55.68 %) samples of age≥14 y were collected, respectively. 51 (57.95 %) samples are of male and 37 (42.05 %) samples are from female. The osteosarcoma primary tumor site is foot and it mainly occurs here. About 80 (90.91 %) samples and 6 (6.82 %) samples are from arm and 2 (2.27 %) samples are from other sites (pelvis). At the time of initial diagnosis of osteosarcoma, 66 (75 %) samples had no metastasis and 22 (25 %) samples had metastasis.

Complement C1qA (C1QA) and Complement C1qB (C1QB) are intersection differential key genes. In series data GSE14359 and GSE 28424 and GSE33382 and GSE36001, we analyzed the differential genes (fig. 2). Then, we took the top 500 genes of each series data according to the differential genes, p value and log FC was used to draw Venn diagram. There are 8 intersecting genes, Leptin Receptor (LEPR), Cluster of Differentiation 93 (CD93), TYRO Protein Tyrosine Kinase Binding Protein (TYROBP), C1QA, Major Histocompatibility Complex, Class II, DR Alpha (HLA-DRA), C1QB, Cluster of Differentiation 74 (CD74) and Allograft Inflammatory Factor 1 (AIF1) are shown in fig. 3. After analysis, we finally determined C1QA and C1QB as the differential key genes of our study.

IJPS-map

Fig. 2: Differential gene volcano map and heat map of series data GSE14359, GSE28424, GSE33382 and GSE36001, ( ) Up-regulated; ( ) Down-regulated and ( ) No change

IJPS-key

Fig. 3: Key genes of intersection difference of series data GSE14359 and GSE 28424 and GSE33382 and GSE36001

C1QA and C1QB show over-expression in osteosarcoma cell lines as shown in fig. 4 and fig. 5. We found the expression of C1QA and C1QB were significantly different in the series data GSE14359 (p=0.011, fig. 4A and fig. 5A), GSE28424 (p=0.00023, fig. 4B and fig. 5B), GSE33382 (p=0.0035, fig. 4C and fig. 5C), GSE36001 (p=0.0044, fig. 4D and fig. 5D). Meanwhile, C1QA and C1QB have significant correlation in osteosarcoma clinical samples from TARGET database (p=8.61e-64, ρSpearman=0.97, 95 % CI (0.96-0.98), fig. 6).

IJPS-series

Fig. 4: The expression of C1QA was significantly different in the series data, (A) GSE14359 (*p=0.011); (B) GSE28424 (***p=0.00023); (C) GSE33382 (**p=0.0035) and (D) GSE36001 (**p=0.0044)

IJPS-data

Fig. 5: The expression of C1QB was significantly different in the series data, (A) GSE14359 (*p=0.011); (B) GSE28424 (***p=0.00023); (C) GSE33382 (**p=0.0035) and (D) GSE36001 (**p=0.0044)

IJPS-clinical

Fig. 6: C1QA and C1QB have significant correlation in osteosarcoma clinical samples from TARGET database (p=8.61e-64, ρSpearman=0.97, (95 % CI (0.96-0.98))

C1QA and C1QB can be used as factors to predict the survival time of osteosarcoma. In TARGET database, compared with low expression, the survival time of over-expression C1QA and C1QB samples was significantly prolonged in osteosarcoma (p=0.0026 and p=0.006), and Hazard Ratio (HR)=0.2874 (95 % Confidence Interval (CI), 0.128-0.647) and 0.334 (95 % CI, 0.153-0.73), respectively (fig. 7). Interestingly, C1QA combined with C1QB analysis for the prognosis of osteosarcoma, the survival time of patients with both C1QA and C1QB over-expression was significantly shortened, compared with those with low expression (p=0.00409, HR=3.147 (95 % CI, 1.439-6.884)). At the same time, ROC curve was used to evaluate the accuracy of the prediction of the 1 y, 3 y and 5 y survival rates of C1QA combined with C1QB expression samples. Area Under the Curve (AUC)=0.689 (95 % CI, 0.575-0.804), 0.702 (95 % CI, 0.595-0.809), 0.641 (95 % CI, 0.513- 0.768), respectively (fig. 8A-fig. 8D).

IJPS-low

Fig. 7: Compared with low expression, the survival time of over-expression C1QA and C1QB samples were significantly prolonged in osteosarcoma (p=0.0026 and 0.006), and HR=0.2874 (95 % CI, 0.128-0.647) and 0.334 (95 % CI, 0.153-0.73), respectively, ( ) High expression, n=43 and ( ) Low expression, n=44

IJPS-time

Fig. 8: C1QA and C1QB analysis for the prognosis of osteosarcoma and the survival time of patients Note: C1QA and C1QB over-expression was significantly shortened, compared with those with low expression (p=0.00409, HR=3.147 (95 % CI, 1.439-6.884)). (A) Risk score, ( ) High risk and ( ) Low risk; (B) Overall survival probability of groups, ( ) High and ( ) Low; (C) C1QA and C1QB expression, ( ) Alive and ( ) Dead and (D) At the same time, ROC curve was used to evaluate the accuracy of the prediction of the 1 y, 3 y and 5 y survival rates of C1QA combined with C1QB expression samples. ( ) 1 y, AUC=0.689 (95 % CI, 0.575-0.804); ( ) 3 y, AUC=0.702 (95 % CI, 0.595-0.809) and ( ) 5 y, AUC=0.641 (95 % CI, 0.513-0.768) respectively

The prognosis of osteosarcoma is related to many factors, although it has been clearly related to clinical characteristics such as age, sex, metastasis and primary site[12,13]. With the update of chemotherapy, the prognosis of osteosarcoma has been improved. However, the prognosis of patients with chemotherapy resistance and metastasis is still only 30 % [11,14].

In our study, we used mRNA data obtained from differential key genes of C1QA and C1QB in GEO database. C1Q can perform various immune and non-immune tasks in a complement-dependent or complement independent manner[15]. C1Q is the starting molecule of the classical complement pathway, which can recognize immune complexes and start the classical complement pathway. C1Q in many types of cancer microenvironments show overexpression[ 16] and it has anti-tumor effect. Kaur et al. found that C1Q can promote apoptosis in ovarian cancer cells, it is through the upregulation of Tumor Necrosis Factor alpha (TNF-α)-induced apoptosis path by regulating Bcl-2-Associated X protein (BAX) and Fas Cell Surface Death Receptor (FAS)[17]. In the study of breast cancer, C1Q can induce apoptosis and has anti-angiogenic effect. C1Q includes C1QA, C1QB and C1QC[18]. Some studies reported, C1QB is a reliable biomarker in cutaneous melanoma. It can be identified and observed from the treatment outcomes in cutaneous melanoma[19,20].

In this study, we found that the expressions are significantly different in C1QA and C1QB in normal tissues vs. tumor tissues and in non-tumor bone cells vs. cell lines. Meanwhile, the C1QA and C1QB expressions are closely related in clinical samples obtained from TARGET database and then, we analyzed the prognostic correlation between C1QA and C1QB, including the risk factors in OAS sample. The over-expression of C1QA and C1QB are protective factors to significantly prolong the survival time and the risk factors<1. In a study about skin cutaneous melanoma, the researcher found that over-expression levels of C1QA, C1QB and C1QC are protective factors which improve survival time at different stages[21]. The results obtained are consistent with our results. Yang et al.[22] study on pancreatic cancer found that C1QA and C1QB were overexpressed in cancer tissues than in normal pancreatic tissues. They speculated that C1QA and C1QB might be a new predictor of pancreatic cancer disease. Thus, we can conclude that the overexpression levels of C1QA and C1QB are protective factors which improve survival in OAS.

Interestingly, combined analysis of C1QA and C1QB is a high risk factor which mean, the over-expression of C1QA and C1QB in the same time indicates poor prognosis. The cause of this situation is not clear. This may be related to C1Q as a complement and it can active the immune system. Kou et al.[23] through GEO database and the research on Idiopathic Pulmonary Fibrosis (IPF) and non-small cell lung cancer, they found that the prognosis of non-small cell lung cancer and IPF patients with high levels of C1Q is poor. Vidergar et al.[24] reported that C1Q promoted the growth of malignant pleural mesothelioma. Besides, in liver cancer, C1Q promoted migratory and invasive phenotypes[25]. From our results and other tumor research reports, we believe that the high expression of C1Q can promote tumors, but the high expression of C1QA and C1QB components in C1Q plays a protective role. C1QA and C1QB are closely related with OSA. It may be an effective factor to predict the prognosis of OSA.

Funding:

This work was supported by a grant from the Natural Science Foundation of Hunan province (No. 2020JJ6110).

Acknowledgements:

We would like to acknowledge that this study is very grateful to Dr. Yin Tao of Zhuzhou Central Hospital for his guidance.

Conflict of interests:

The authors declared no conflict of interest.

References