Samples
Blood serum samples from pediatric patients with Ast (Past (Pilocytic astrocytoma), n = 10 (four boys and six girls; DAst (Diffuse astrocytoma), n = 13 (five boys and eight girls)) were collected from 2017 to 2020 (Table 4), according to the approved protocol by the Ethics Committee of the CMNSXXI-IMSS. The samples were taken as part of the routine tests, with a vacutainer. The control children were boys and girls referred to the hospital for a cause other than cancer. All children included in this study were patients from the Hospital de Pediatría “Silvestre Frenk Freund”, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social (IMSS). The patient´s age range was 0–15 years old; they had no history of relatives with cancer. All children’s relatives or tutors signed an informed consent.
Ethics
The project had the approval of the Hospital de Pediatría “Silvestre Frenk Freund”, CMNSXXI-IMSS, Ethics in Security Research (R-2018-3603-048) and all the experimental procedures were performed in accordance to the Declaration of Helsinki (1964) and its later amendments or comparable ethical standards.
Table 4
Clinical data of pediatric patients
Histopathological grade | Age | Gender | Location | Outcome |
Pilocytic astrocytoma |
PAst 1 | 2 | F | Temporal lobe | Alive |
PAst 2 | 13 | F | Posterior fossa | Alive |
PAst 3 | 5 | F | Posterior fossa | Alive |
PAst 4 | 5 | M | Visual pathway optical | Alive |
PAst 5 | 6 | F | Posterior fossa | Alive |
PAst 6 | 14 | F | Infratentorial | Alive |
PAst 7 | 13 | M | Temporal lobe | Alive |
PAst 8 | 8 | F | Posterior fossa | Alive |
PAst 9 | 5 | M | Temporal lobe | Alive |
PAst 10 | 13 | M | Visual pathway optical | Dead |
Diffuse astrocytoma | | | | |
DAst 1 | 8 | F | Temporal lobe | Alive |
DAst 2 | 9 | M | Posterior fossa | Alive |
DAst 3 | 3 | M | Infratentorial | Alive |
DAst 4 | 13 | M | Temporal lobe | Dead |
DAst 5 | 4 | F | Thalamus | Alive |
DAst 6 | 2 | F | Posterior fossa | Dead |
DAst 7 | 16 | F | Posterior fossa | Alive |
DAst 8 | 5 | F | Temporal lobe | Dead |
DAst 9 | 9 | M | Posterior fossa | Dead |
DAst 10 | 7 | F | Visual pathway optical | Alive |
DAst 11 | 1 | M | Posterior fossa | Dead |
DAst 12 | 5 | F | Temporal lobe | Alive |
DAst 13 | 12 | F | Posterior fossa | Alive |
Exosome and RNA Isolation
Exosomes were isolated with the Total Isolation Exosome kit (from serum) (ThermoScientific) and the presence and integrity of exosomes was visualized by electron microscopy. For RNA isolation, exosomes were incubated for 15 h (4ºC) with TRIZOL® (this was to optimally resuspend exosomes) and later with 40 µL chloroform for 15 minutes at 4ºC. The aqueous phase (RNA) was transferred to a new tube and 500 µL isopropanol were added for RNA precipitation. Finally, RNA was washed three times with 75% ethanol and resuspended in 5 µL RNase-free water. RNA quantification was performed by spectrophotometry (Nanodrop 1000, Thermo Fisher Scientific).
GeneChip® Human Transcriptome Array 2.0
Three conditions (The Control Pull (n = 50), The Pilocytic Pull (n = 10), and The Diffuse Pull (n = 13) were used for the expression analysis. The HTA 2.0 arrays (ThermoScientific) determined the differential expression among arrays of distinct experimental conditions. The expression analysis was done by groups (Pulls) and not individually, therefore differences in the transcript expression for each sample could not be detected. With this array, it was possible to evaluate > 285,000 full-length covered transcripts, > 245,000 coding transcripts, > 40,000 non-coding transcripts, and > 339,000 probe sets covering exon-exon junctions. Median expression levels (all p < 0.001) were significant.
Bioinformatic analyses
To perform bioinformatic analyses, the web server interface (free access), belonging to the Iowa State University, was used (RNA-Protein Interaction Prediction Server “PRIDB: “Protein-RNA Interaction Database RPI-Seq Version” Dobbs Lab) [39, 40]. Sequences of the four human Y-RNAs were downloaded from the NCBI (National Center for Biotechnology Information) nucleotide databases.
Protein sequences of cell receptors from diverse protein families were obtained by using key words: Cell, Receptor, Brain, Human and "Homo sapiens"[txid9606], being in total 1319 sequences. Filtered applied: 0 TO 1200 A.A. length. Protein sequences of Toll-like cell receptors were downloaded from the NCBI protein database by using key words: Toll, Cell, Receptors, and Human and "Homo sapiens" [txid9606], being in total 119 not filtered sequence. The removal of the terminal segments in the 5´- and 3´- terminals, to generate transcripts only with the loop domain from the four human Y-RNAs, was performed as shown in Additional file 1.
Protein sequences of the Wnt cell receptors were downloaded by using key words: Wnt, Receptors, Human and "Homo sapiens" [txid9606], being in total 37 sequences. In the same way, protein sequences related to the Origin of Replication Complex (ORC) were retrieved using key words: (DNA, Polymerase, and Human) and "Homo sapiens" [txid9606], being in total 1286. Filtered applied: from 0 to 1200 A.A. length. All the sequences were checked manually to avoid redundant annotations or duplicated sequences and formatted before being entered into the web server: http://pridb.gdcb.iastate.edu/RPISeq.
In this work, data obtained with the RF algorithm were mainly discussed, since its predictions are the most validated at the experimental level. Only in the interactions of Y-RNAs with the proteins related to the ORC, the scores obtained with RF and SVM were shown. This was with the purpose of comparing the results obtained with both classifiers. Although values greater than 0.5 were taken as significant, those closest to 1 have the highest probability of occurring in vivo. Essentially, RPIseq exploits the aminoacidic composition of proteins sequences and the ribonucleotides composition from the RNA sequences to predict the probability of in vivo interactions of a pair (RNA-Protein). This web server RPIseq implements the RPIseq method developed by Muppirala [39, 40]. This algorithm takes a data pair of sequence belonging to RNA and protein as an input, and computes probabilities of interaction through the RF and SVM trained classifiers using the datasets from the RPI2241. This interface could accept many proteins against a specific RNA molecule or vice versa users could introduce a maximum of 100 sequences.
String Protein Network
Cytoscape is an open source software platform for visualizing molecular interaction networks and biological pathways and integrate these networks with annotations, gene expression profiles, and other state data. Although Cytoscape was originally designed for biological research, now it is a general platform for complex network analysis and visualization [41]. The String Networks were constructed with Cytoscape 3.8.2 for windows, using String Protein Query database.
Enrichment analyses
We used PANTHER GO-slim", which uses a selected set of terms from the Gene Ontology TM for classifications by molecular function, biological process, and cellular component. The PANTHER Protein Class ontology was adapted from the PANTHER/X molecular function ontology and includes commonly used classes of protein functions, many of which are not covered by GO molecular function. Download the classes and relationship information [42].