CN112185465A - Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology - Google Patents
Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology Download PDFInfo
- Publication number
- CN112185465A CN112185465A CN202010948773.8A CN202010948773A CN112185465A CN 112185465 A CN112185465 A CN 112185465A CN 202010948773 A CN202010948773 A CN 202010948773A CN 112185465 A CN112185465 A CN 112185465A
- Authority
- CN
- China
- Prior art keywords
- periodontitis
- gene
- spatial
- sequencing
- transcriptome
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 201000001245 periodontitis Diseases 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 title claims abstract description 31
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 86
- 238000012163 sequencing technique Methods 0.000 claims abstract description 47
- 230000014509 gene expression Effects 0.000 claims abstract description 28
- 230000003239 periodontal effect Effects 0.000 claims abstract description 24
- 238000004458 analytical method Methods 0.000 claims abstract description 15
- 230000003832 immune regulation Effects 0.000 claims abstract description 10
- 230000007246 mechanism Effects 0.000 claims abstract description 10
- 210000004027 cell Anatomy 0.000 claims description 33
- 239000003550 marker Substances 0.000 claims description 18
- 238000010201 enrichment analysis Methods 0.000 claims description 9
- 238000010186 staining Methods 0.000 claims description 7
- 239000002771 cell marker Substances 0.000 claims description 6
- 108020004999 messenger RNA Proteins 0.000 claims description 6
- 238000007405 data analysis Methods 0.000 claims description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 5
- 230000000007 visual effect Effects 0.000 claims description 5
- 201000010099 disease Diseases 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 230000031018 biological processes and functions Effects 0.000 claims description 3
- 238000001914 filtration Methods 0.000 claims description 3
- 230000004879 molecular function Effects 0.000 claims description 3
- 238000003068 pathway analysis Methods 0.000 claims description 3
- 230000009467 reduction Effects 0.000 claims description 3
- 230000019491 signal transduction Effects 0.000 claims description 3
- 108091034117 Oligonucleotide Proteins 0.000 claims description 2
- 230000001413 cellular effect Effects 0.000 claims description 2
- 210000003850 cellular structure Anatomy 0.000 claims description 2
- 239000011521 glass Substances 0.000 claims description 2
- 238000012360 testing method Methods 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 239000000470 constituent Substances 0.000 claims 1
- 238000013507 mapping Methods 0.000 claims 1
- 230000008569 process Effects 0.000 abstract description 6
- 230000009286 beneficial effect Effects 0.000 abstract description 4
- 238000011161 development Methods 0.000 abstract description 3
- 238000012216 screening Methods 0.000 abstract description 2
- 210000004261 periodontium Anatomy 0.000 abstract 1
- 210000001519 tissue Anatomy 0.000 description 37
- 101150049850 CXCL6 gene Proteins 0.000 description 5
- 239000011159 matrix material Substances 0.000 description 5
- 206010061218 Inflammation Diseases 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 208000007565 gingivitis Diseases 0.000 description 4
- 230000004054 inflammatory process Effects 0.000 description 4
- QWTDNUCVQCZILF-UHFFFAOYSA-N isopentane Chemical compound CCC(C)C QWTDNUCVQCZILF-UHFFFAOYSA-N 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 206010065687 Bone loss Diseases 0.000 description 3
- 238000003559 RNA-seq method Methods 0.000 description 3
- 230000000740 bleeding effect Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 230000008014 freezing Effects 0.000 description 3
- 238000007710 freezing Methods 0.000 description 3
- 210000002865 immune cell Anatomy 0.000 description 3
- 230000002757 inflammatory effect Effects 0.000 description 3
- 238000010606 normalization Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 244000052616 bacterial pathogen Species 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- AFABGHUZZDYHJO-UHFFFAOYSA-N dimethyl butane Natural products CCCC(C)C AFABGHUZZDYHJO-UHFFFAOYSA-N 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 210000004969 inflammatory cell Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 210000000214 mouth Anatomy 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 208000006386 Bone Resorption Diseases 0.000 description 1
- 102100036153 C-X-C motif chemokine 6 Human genes 0.000 description 1
- 208000034619 Gingival inflammation Diseases 0.000 description 1
- 101000947177 Homo sapiens C-X-C motif chemokine 6 Proteins 0.000 description 1
- 208000009019 Pericoronitis Diseases 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 238000001790 Welch's t-test Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000033289 adaptive immune response Effects 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 229940124599 anti-inflammatory drug Drugs 0.000 description 1
- 229940124350 antibacterial drug Drugs 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 230000024279 bone resorption Effects 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 208000037893 chronic inflammatory disorder Diseases 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000011841 epidemiological investigation Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000008004 immune attack Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 239000003507 refrigerant Substances 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000017423 tissue regeneration Effects 0.000 description 1
- 230000037303 wrinkles Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/10—Gene or protein expression profiling; Expression-ratio estimation or normalisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/30—Unsupervised data analysis
Landscapes
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Medical Informatics (AREA)
- Theoretical Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Data Mining & Analysis (AREA)
- Genetics & Genomics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Bioethics (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Public Health (AREA)
- Molecular Biology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses a method for analyzing periodontitis immune microenvironment by a space transcriptome technology, which comprises the steps of obtaining periodontium sections of different periodontitis patients and normal controls by a frozen section technology, sequencing each tissue sample by the space transcriptome technology, carrying out subsequent analysis by using space transcriptome data obtained by sequencing, grouping, identifying and effectively positioning cell types in the immune microenvironment of different regions of the periodontal tissues, and further clustering and differential analysis on the types and expression conditions of original genes of different regions to select key genes and related molecular signal paths in a periodontitis immune regulation and control mechanism. The invention is beneficial to screening out key genes and related molecular signal channels in the periodontitis immune regulation mechanism, and provides powerful and effective data support for finally understanding the occurrence and development processes of periodontitis.
Description
Technical Field
The invention belongs to the technical field of oral medicine and biological information, and particularly relates to a method for analyzing a periodontitis immune microenvironment by a space transcriptome technology.
Background
Periodontitis is a type of chronic inflammatory disease mainly characterized by periodontal tissue destruction, alveolar bone resorption and tooth loosening, and is one of the most common oral inflammatory diseases. Epidemiological investigation shows that the worldwide prevalence of periodontitis is as high as 11.2%, and reaches the peak of the disease around the age of 40.
When the periodontal tissues of the body are stimulated to inflammation by pathogens or pathogenic bacteria, i.e., periodontitis, the released inflammatory factors recruit immune cells to the site of inflammation. However, while attacking pathogens or pathogenic bacteria, immune cells can also attack normal tissue cells of the body, and the oral cavity is in a bacteria-carrying state for a long time as a bacteria-carrying environment, so that the inflammatory immune attack can also continuously exist for a long time, generate a large amount of inflammatory factors, seriously damage the normal tissues of the body and interfere with the tissue regeneration and repair process.
Studies have shown that there is a close link between bone tissue loss due to periodontitis and inflammation-induced immune responses. The process is not only related to the innate immune response mediated by mononuclear macrophages, fibroblasts and the like, but also closely related to the adaptive immune response mediated by T cells and B cells. The periodontitis immune microenvironment mainly refers to immune cells and immune related molecules in the periodontitis microenvironment, and classification of the immune environment in the periodontitis microenvironment is helpful for understanding how immune composition and immune state affect inflammatory cells and periodontitis treatment, but specific immune regulation and control mechanisms of periodontitis are not clear in the existing research. Therefore, the research on the periodontitis immune regulation mechanism has important practical significance and application prospect, and a new idea and a new method are urgently needed to carry out deep exploration research on the periodontitis immune microenvironment.
For RNA sequencing, the current major sequencing methods include Bulk transcriptome sequencing (Bulk RNA-seq), single-cell sequencing (single-cell RNA-seq), and spatial transcriptome sequencing. Population transcriptome sequencing is the detection of the expression of each gene in large cell populations, and is mainly used for comparative genomics, and can be used for comparing gene expression differences of the same tissues of different species; the single cell sequencing is used for determining the expression condition of each gene in a single cell, has microcosmic property, and can detect heterogeneous information which cannot be obtained by sequencing a mixed sample compared with the former method; and the spatial transcriptome sequencing additionally reserves the spatial position information of the cells on the basis of single cell sequencing, can project gene expression to the existing spatial information and know the relative position relationship between specific cells and tissue slices.
Disclosure of Invention
The invention aims to provide a method for analyzing a periodontitis immune microenvironment by a space transcriptome technology, and aims to solve the problem that the existing research method cannot clearly analyze and understand a periodontitis immune regulation and control mechanism.
The invention is realized by a method for analyzing periodontitis immune microenvironment by a spatial transcriptome technology, which comprises the following steps:
(1) obtaining periodontal tissue slices of different periodontitis patients and normal control by a frozen section technology,
(2) sequencing each tissue sample by a spatial transcriptome technology to obtain spatial transcriptome data;
(3) and performing standard data analysis according to the spatial transcriptome data, grouping, identifying and effectively positioning cell types in immune microenvironment of different regions of the periodontal tissue, and further clustering and differential analysis on types and expression conditions of original genes of different regions to select key genes and related molecular signal pathways in a periodontitis immune regulation and control mechanism.
Preferably, in step (2), the mRNA captured by the oligonucleotide chain in the capture region of the slide glass and labeled with the unique molecular identifier UMI and the special spatial barcode in each tissue section is sequenced by the spatial transcriptome technique to obtain spatial transcriptome data.
Preferably, in step (3), the standard data analysis according to the spatial transcriptome data comprises the following steps:
filtering spatial transcriptome data by adopting Fastp software to obtain sequencing data which can be directly used for subsequent analysis;
counting cell barcode information and corresponding counts contained in the filtered statistical sequencing data by adopting a barcode processing algorithm, so as to judge the actually detected lattice number in the sequencing sample and obtain real space transcriptome sequencing information;
comparing reads corresponding to cell barcodes in sequencing data to a genome corresponding to a known species, analyzing similarity and difference between a detected unknown sequence and a known sequence, and obtaining a bam file compared with the unknown sequence;
transforming the bam file containing the various information after genome comparison, merging the monomolecular labels compared to the same gene in the file, removing repeated UMI sequences in the monomolecular labels to obtain the number of UMI corresponding to each gene, counting to obtain the number of genes detected by each space lattice, and performing visual display in a space staining sheet.
Preferably, in step (3), the grouping, identifying and effectively locating all detected cell types in the immune microenvironment of different regions of the periodontal tissue comprises the following steps:
based on PCA algorithm, performing dimensionality reduction on spatial sequencing data to obtain two-dimensional/low-dimensional information, clustering spots with similarity in thousands of spatial lattices by using t-SNE algorithm and UMAP algorithm to further obtain a cluster subset (cluster), and performing visual display in a spatial staining sheet;
and finally identifying the Marker gene of each cluster by combining verified Marker genes in a cellMarker database through Welcht-tests on the log expression quantity of each gene between every two clusters to obtain the differential gene of top10 as a Marker gene candidate.
Preferably, in step (3), the further clustering and difference analysis of the types and expression conditions of the original genes in different regions comprises the following steps:
performing enrichment analysis on the Marker gene of the cluster and the gene in the cellMarker database by using a clusterProfiler software package, and observing the cell type in which a specific cluster is enriched so as to complete the identification of the cell type;
taking the genes with obvious differential expression in each cluster obtained by enrichment analysis as candidate gene groups, and analyzing by combining with each large database;
analyzing cell components, molecular functions and biological processes of the candidate genes through a Gene Ontology database;
performing Pathway Analysis on the candidate gene by using a KEGG database, and observing the signal paths on which the candidate gene is mainly enriched;
and combining the candidate gene with a DisGeNET database, and observing the close correlation between the candidate gene and certain diseases through enrichment analysis.
The invention overcomes the defects of the prior art and provides a method for analyzing periodontitis immune microenvironment by a space transcriptome technology, wherein periodontal tissue slices of different periodontitis patients and normal control are obtained by a frozen section technology, each tissue sample is sequenced by the space transcriptome technology, the space transcriptome data obtained by sequencing is utilized for subsequent analysis, then grouping, identifying and effectively positioning the cell types in the immune microenvironment of different regions of the periodontal tissue are detected, and the types and the expression conditions of original genes of different regions are further clustered and differentially analyzed, so that key genes and related molecular signal channels in the periodontitis immune regulation mechanism can be selected.
Compared with the defects and shortcomings of the prior art, the invention has the following beneficial effects: the method can obtain the transcription information of specific spatial positions in the complete tissue slices, is beneficial to screening out key genes and related molecular signal channels in the periodontitis immune regulation mechanism, and provides powerful and effective data support for finally understanding the occurrence and development processes of periodontitis.
Drawings
FIG. 1 is a flowchart illustrating the operation of the method for analyzing the immune microenvironment for periodontitis by the spatial transcriptome technique according to the present invention;
FIG. 2 is a schematic view showing the position of a periodontal tissue section of a patient suffering from periodontitis in an example of the present invention;
FIG. 3 is an exemplary presentation of counts number, dot matrix number, spatial transcriptome sequencing information in an embodiment of the invention;
FIG. 4 is an exemplary presentation after data normalization processing in an embodiment of the present invention;
FIG. 5 is a visual display of the clustered subpopulations in a spatially stained disc according to an embodiment of the present invention; FIG. 5A is a diagram showing that through UMAP analysis, all spots are clustered according to similarity to obtain 14 cluster clusters (cluster), and all the spots with the same color form one cluster; FIG. 5B is a diagram showing the restoration of the specific spatial positions corresponding to all the spots of the 14 clustered sub-groups onto the slice according to the spatial position information previously retained by each spot;
FIG. 6 shows the expression levels of CXCL6 gene at different sites in a periodontal tissue section in an example of the present invention; FIG. 6A is a sectional view of a periodontitis tissue; FIG. 6B shows the expression levels of CXCL6 gene at different sites in a periodontal tissue section, and the darker the color of the dot matrix in the figure, the higher the expression level of CXCL6 gene at that site; among them, CXCL6 was expressed in the highest amount in the lower right corner region of fig. 6B, and there was a greater accumulation of deeply stained inflammatory cells in the same region corresponding to the left figure, indicating that CXCL6 gene may be a periodontitis-associated gene.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention discloses a method for analyzing periodontitis immune microenvironment by a spatial transcriptome technology, which comprises the following steps of:
(1) obtaining periodontal tissue slices of different periodontitis patients and normal control by frozen section technology
In step (1), periodontal tissues of healthy controls, gingivitis and periodontitis patients were collected clinically. All subjects had good general health and were not taking antibacterial or anti-inflammatory drugs within 3 months prior to drawing the material. Inclusion of age-matched healthy controls, gingivitis and peridentitis patients prior to treatment all ensured that at least 16 teeth were present in the oral cavity and that periodontal treatment was not performed within 6 months prior to drawing the material.
1) The inclusion criteria for healthy controls were: gingival health without inflammation (no bleeding on probing, and probing depth <4mm), no loss of clinical attachment or bone loss, healthy periodontal tissue samples collected during coronal prolongation due to denture or orthodontic needs;
2) gingivitis inclusion criteria: gingival inflammation (exploratory bleeding, but no loss of clinical attachment or bone loss) exists in teeth that need to be extracted due to malposition, crowding, or pericoronitis, periodontal tissue samples are collected at the time of tooth extraction (gingivitis is an early stage in the development of periodontitis disease and is therefore also included in the sequencing samples);
3) inclusion criteria prior to periodontitis treatment: affected teeth requiring extraction for periodontitis (probing bleeding, with probing depth >5mm, bone loss > 60% of root length), collecting periodontal tissue samples during tooth extraction;
the collected samples were immediately placed in sterile tubes containing α MEM medium and transferred to the laboratory for study within 2 hours;
(2) and (3) freezing and storing isopentane: freezing the collected sample by using precooled isopentane (refrigerant), so that the space form of the periodontal tissue is maintained and the quality of RNA is protected;
(3) OCT embedding: the frozen periodontal tissue is placed in a mould, and a precooled OCT frozen section embedding medium is used for completely embedding the tissue block, so that bubbles are avoided in the embedding process. The embedding medium is beneficial to keeping the complete structure of the tissue and providing structural support for the tissue in the process of freezing and slicing so as to increase the continuity of the tissue and reduce wrinkles and fragmentation, thereby preparing for preparing subsequent slicing;
(4) preparing a slice: a shallow incision (about 1mm deep) is cut on the surface of the embedded tissue block by a blade to avoid damaging tissues by too deep incision, and then the tissue block is cut along the incision to keep the thickness of the section between 10 and 20 mu m, as shown in figure 2.
(2) Sequencing each tissue sample by a spatial transcriptome technology to obtain spatial transcriptome data
The step (2) comprises the following specific steps:
(i) data filtering
In order to make the data more effective, the embodiment of the invention needs to filter the data.
Low-quality reads (sequencing fragments), undetected reads, reads containing linkers, and the like in raw data (raw data) are filtered by adopting Fastp software, so that clean reads (clean data) which can be directly used for subsequent analysis are obtained, wherein the clean reads comprise cell barcode information and gene counts (counts).
(ii) Dot matrix judgment
The mRNA released from the tissue cells migrates to the spot (lattice) marked with a specific tag sequence (cell barcode) on the sequencing chip. The cell barcode information contained in the filtered statistical sequencing data (clean data) and the corresponding gene counting counts are counted by adopting a barcode processing algorithm, wherein the counts represent the occurrence frequency of the same reads/sequencing fragments detected in the sequencing process, namely the abundance corresponding to the reads, and reflect the transcription frequency of the gene, namely the expression quantity of the gene. And judging the actually detected lattice number (spots) in the sequencing sample according to the counts number to obtain the real space transcriptome sequencing information. An example of the results is shown in FIG. 3.
(iii) Data comparison
Comparing reads (limited by the current sequencing level, when the genome is sequenced, the genome needs to be firstly broken into DNA fragments, then library is built for sequencing, the reads are the read length/the length of the base sequence of the corresponding sequencing fragment after each sequencing) corresponding to the cell barcode in the sequencing data to the genome corresponding to the known species (human). Through comparison, the similarity and difference between the detected unknown sequence and the known sequence can be found, and a bam file compared with the unknown sequence can be obtained (the bam file is a binary plate of a sam file, stores various information after map/genome comparison after sequencing and greatly compresses the size of the file);
(iv) sequencing data normalization
For sequencing data, differences in sequencing depth, gene length, occur due to differences in the technology level. Therefore, in order to compare the expression levels of different genes in different samples, it is necessary to standardize the raw sequencing data.
In the embodiment of the present invention, the data normalization processing method is to transform the bam file containing the various information after genome alignment, merge the Unique Molecular Identifiers (UMIs) aligned to the same gene in the file, remove the repeated UMI sequences in the file, obtain the number of UMIs corresponding to each gene (each mRNA is labeled with a Unique Molecular Identifier UMI, which can be used to count mRNA molecules), and further obtain the basis factors (which reflect how many genes are detected in a spot (the number of genes that can be expressed in a cell is larger, the differentiation degree of the cell is lower)) and the number of mRNA molecules detected in each spatial lattice (spot) by statistics.
Staining of the spatial section, the deeper the staining at a point, the greater the number of genes expressed at that point, which may reflect a lower degree of differentiation of the cells at that point (relative to the number of cell types contained at that point), as shown, for example, in FIG. 4.
(3) Performing standard data analysis according to the space transcriptome data, grouping, identifying and effectively positioning cell types in immune microenvironment of different regions of periodontal tissue, and further clustering and differential analysis on types and expression conditions of original genes of different regions to select key genes and related molecular signal pathways in periodontitis immune regulation mechanism
In the embodiment of the invention, the step (3) comprises the following specific steps:
(v) lattice clustering
And performing dimensionality reduction on the spatial sequencing data based on a PCA algorithm to obtain two-dimensional/low-dimensional information. And clustering the spots with similarity in thousands of space lattices (spots) by utilizing a t-SNE algorithm and a UMAP algorithm to further obtain dozens or twenty clustering subgroups (cluster), visually displaying in a space staining sheet, and obtaining a result as shown in figure 5.
In the embodiment of the invention, the clustering aims at clustering thousands of spots according to the gene expression condition based on the t-SNE algorithm and the UMAP algorithm to obtain dozens or twenty clusters, and performing cell type identification and gene function analysis on the spots in the clusters according to the clustering condition by combining a public database at the later stage.
(vi) Marker analysis
Different cluster are obtained after the dot matrix clustering, and then a Marker gene can be searched for each cluster. Welch t-tests are carried out on the log expression quantity of each gene between every two clustering clusters (cluster), so that a differential gene of top10 is obtained to be used as a Marker gene candidate, and the Marker gene of each cluster is finally identified by combining the verified Marker genes in a cellMarker database. The expression distribution of the Marker gene is visualized on the section in a way of a gene expression space diagram, as shown in fig. 6, fig. 6 reflects the expression quantity of the CXCL6 gene at different parts of the periodontal tissue section, and the darker the color of the dot matrix in the diagram, the higher the expression quantity of the Marker gene at the position is.
The Marker gene can be used for subsequent judgment or identification of cell types through identification of the Marker gene.
(vii) Cell type identification
By means of a clusterProfiler (a software package of R language), the Marker gene of the cluster (cluster) and the gene in the cellMarker database are subjected to enrichment analysis, and the cell type enriched in a specific cluster is observed, so that the identification of the cell type is completed.
(viii) Difference analysis
And (3) taking the genes with obvious differential expression in each cluster obtained by enrichment analysis as a candidate gene group, and analyzing by combining with each large database. Analyzing cellular components, molecular functions and biological processes of the candidate genes through a GO (Gene ontology) database; performing Pathway Analysis on the candidate gene by using a KEGG database, and observing the signal paths on which the candidate gene is mainly enriched; in addition, the candidate gene is combined with a DisGeNET database, and the candidate gene is closely related to diseases through enrichment analysis. And combining the results to comprehensively and completely describe the related information of the differential expression genes.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (5)
1. A method of analyzing the immune microenvironment of periodontitis by spatial transcriptome techniques, the method comprising the steps of:
(1) obtaining periodontal tissue slices of different periodontitis patients and normal control by a frozen section technology,
(2) sequencing each tissue sample by a spatial transcriptome technology to obtain spatial transcriptome data;
(3) and performing standard data analysis according to the spatial transcriptome data, grouping, identifying and effectively positioning cell types in immune microenvironment of different regions of the periodontal tissue, and further clustering and differential analysis on types and expression conditions of original genes of different regions to select key genes and related molecular signal pathways in a periodontitis immune regulation and control mechanism.
2. The method for analyzing the microenvironment of periodontitis through the spatial transcriptome technique of claim 1, wherein in the step (2), the mRNA captured by the oligonucleotide chains in the capture region of the slide glass and labeled with the unique molecular identifier UMI and the specific spatial barcode in each tissue section is sequenced through the spatial transcriptome technique to obtain the spatial transcriptome data.
3. The method of analyzing a periodontitis immune microenvironment by a spatial transcriptome technique of claim 1, wherein in the step (3), the performing of the standard data analysis according to the spatial transcriptome data comprises the steps of:
filtering spatial transcriptome data by adopting Fastp software to obtain sequencing data which can be directly used for subsequent analysis;
counting cell barcode information and corresponding counts contained in the filtered statistical sequencing data by adopting a barcode processing algorithm, so as to judge the actually detected lattice number in the sequencing sample and obtain real space transcriptome sequencing information;
comparing reads corresponding to cellbarcode in sequencing data with a genome corresponding to a known species, analyzing similarity and difference between a detected unknown sequence and a known sequence, and obtaining a bam file compared with the unknown sequence;
transforming the bam file containing the various information after genome comparison, merging the monomolecular labels compared to the same gene in the file, removing repeated UMI sequences in the monomolecular labels to obtain the number of UMI corresponding to each gene, counting to obtain the number of genes detected by each space lattice, and performing visual display in a space staining sheet.
4. The method of claim 3, wherein in step (3), said grouping, identifying and efficiently mapping the cellular constituents of the immune microenvironment in different regions of the periodontal tissue detected comprises the steps of:
based on PCA algorithm, performing dimensionality reduction on spatial sequencing data to obtain two-dimensional/low-dimensional information, clustering spots with similarity in thousands of spatial lattices by using t-SNE algorithm and UMAP algorithm to further obtain a cluster subset (cluster), and performing visual display in a spatial staining sheet;
and finally identifying the Marker gene of each cluster by combining verified Marker genes in a cellMarker database through Welcht-tests on the log expression quantity of each gene between every two clusters to obtain the differential gene of top10 as a Marker gene candidate.
5. The method for analyzing the microenvironment of periodontitis through the spatial transcriptome technology of claim 4, wherein in the step (3), the further clustering and difference analysis of the species and expression of the original genes of different regions comprises the following steps:
performing enrichment analysis on the Marker gene of the cluster and the gene in the cellMarker database by using a clusterProfiler software package, and observing the cell type in which a specific cluster is enriched so as to complete the identification of the cell type;
taking the genes with obvious differential expression in each cluster obtained by enrichment analysis as candidate gene groups, and analyzing by combining with each large database;
analyzing cell components, molecular functions and biological processes of the candidate genes through a Gene Ontology database;
performing Pathway Analysis on the candidate gene by using a KEGG database, and observing the signal paths on which the candidate gene is mainly enriched;
and combining the candidate gene with a DisGeNET database, and observing the close correlation between the candidate gene and certain diseases through enrichment analysis.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010948773.8A CN112185465A (en) | 2020-09-10 | 2020-09-10 | Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010948773.8A CN112185465A (en) | 2020-09-10 | 2020-09-10 | Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112185465A true CN112185465A (en) | 2021-01-05 |
Family
ID=73920478
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010948773.8A Withdrawn CN112185465A (en) | 2020-09-10 | 2020-09-10 | Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112185465A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022220385A1 (en) * | 2021-04-15 | 2022-10-20 | 주식회사 포트래이 | Apparatus and method for predicting cell type enrichment from tissue images using spatially resolved gene expression data |
-
2020
- 2020-09-10 CN CN202010948773.8A patent/CN112185465A/en not_active Withdrawn
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022220385A1 (en) * | 2021-04-15 | 2022-10-20 | 주식회사 포트래이 | Apparatus and method for predicting cell type enrichment from tissue images using spatially resolved gene expression data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110313034B (en) | Method, machine-readable medium and computer system for sequencing nucleic acid molecules | |
CN104221022B (en) | A kind of copy number mutation detection method and system | |
CN111009286B (en) | Method and apparatus for microbiological analysis of a host sample | |
CN105209918B (en) | Biomarker authentication method and system | |
AU2015314114B2 (en) | Detecting repeat expansions with short read sequencing data | |
CN102952854B (en) | Single cell sorting and screening method and device thereof | |
US20150302042A1 (en) | Data analysis apparatus and data analysis method | |
EA028642B1 (en) | Method of prenatal diagnostics of fetal chromosomal aneuploidy | |
Utsuno et al. | Evaluation of sperm head shape at high magnification revealed correlation of sperm DNA fragmentation with aberrant head ellipticity and angularity | |
CN112599199A (en) | Analysis method suitable for 10x single cell transcriptome sequencing data | |
CN112185465A (en) | Method for analyzing periodontitis immune microenvironment through spatial transcriptome technology | |
CN109652513A (en) | The method and kit of liquid biopsy idiovariation are accurately detected based on two generation sequencing technologies | |
CN115989544A (en) | Method and system for visualizing short reads in repetitive regions of a genome | |
CN105528532B (en) | A kind of characteristic analysis method in rna editing site | |
US20210366569A1 (en) | Limit of detection based quality control metric | |
CN111192636B (en) | mRNA second-generation sequencing result analysis method suitable for oligo dT enrichment | |
CN105838720B (en) | PTPRQ gene mutation body and its application | |
JP2001149073A (en) | Method and apparatus for analyzing intestinal flora | |
CN114388066A (en) | Analysis device and method for atrial fibrillation myocardial fibrosis | |
CN114540482A (en) | Analysis device and method for improving atrial fibrillation myocardial fibrosis by Qibo pulse-activating composition based on space transcriptome technology | |
CN111676299A (en) | Method for identifying cell types in chick blastocyst paring disc | |
CN116144669B (en) | TCOF1 gene mutant and application thereof | |
CN111028889A (en) | Method for obtaining in-vivo nutritional type plant pathogenic oomycete pollution-free genome | |
CN109741790A (en) | The macro genome analytical method and system of microorganism two generations sequencing data | |
CN115948521B (en) | Method for detecting aneuploidy deletion chromosome information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20210105 |
|
WW01 | Invention patent application withdrawn after publication |