CN116559142B - Colony Raman spectrum rapid selection method based on HCA +' - Google Patents
Colony Raman spectrum rapid selection method based on HCA +' Download PDFInfo
- Publication number
- CN116559142B CN116559142B CN202310523753.XA CN202310523753A CN116559142B CN 116559142 B CN116559142 B CN 116559142B CN 202310523753 A CN202310523753 A CN 202310523753A CN 116559142 B CN116559142 B CN 116559142B
- Authority
- CN
- China
- Prior art keywords
- colony
- hca
- spectrum
- raman spectrum
- raman
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001237 Raman spectrum Methods 0.000 title claims abstract description 74
- 238000010187 selection method Methods 0.000 title claims abstract description 19
- 238000000034 method Methods 0.000 claims abstract description 33
- 238000001228 spectrum Methods 0.000 claims description 44
- 230000003595 spectral effect Effects 0.000 claims description 22
- 238000001069 Raman spectroscopy Methods 0.000 claims description 14
- 238000006073 displacement reaction Methods 0.000 claims description 13
- 238000010606 normalization Methods 0.000 claims description 10
- 230000009467 reduction Effects 0.000 claims description 8
- JJWKPURADFRFRB-UHFFFAOYSA-N carbonyl sulfide Chemical compound O=C=S JJWKPURADFRFRB-UHFFFAOYSA-N 0.000 claims description 6
- 238000005259 measurement Methods 0.000 claims description 6
- 230000003044 adaptive effect Effects 0.000 claims description 5
- 238000012937 correction Methods 0.000 claims description 5
- 230000008569 process Effects 0.000 claims description 5
- 238000012163 sequencing technique Methods 0.000 claims description 4
- 238000012800 visualization Methods 0.000 claims description 4
- 241000894006 Bacteria Species 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 238000012804 iterative process Methods 0.000 claims description 3
- 238000005192 partition Methods 0.000 claims description 3
- 238000007619 statistical method Methods 0.000 claims 1
- 230000001580 bacterial effect Effects 0.000 abstract description 20
- 238000001514 detection method Methods 0.000 abstract description 5
- 230000000877 morphologic effect Effects 0.000 abstract description 5
- 238000012216 screening Methods 0.000 abstract description 4
- 230000000295 complement effect Effects 0.000 abstract description 2
- 238000013461 design Methods 0.000 abstract description 2
- 238000005516 engineering process Methods 0.000 abstract description 2
- 244000005700 microbiome Species 0.000 description 3
- 230000006799 invasive growth in response to glucose limitation Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/62—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light
- G01N21/63—Systems in which the material investigated is excited whereby it emits light or causes a change in wavelength of the incident light optically excited
- G01N21/65—Raman scattering
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N21/00—Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
- G01N21/01—Arrangements or apparatus for facilitating the optical investigation
Landscapes
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)
Abstract
The invention provides a colony Raman spectrum rapid selection method based on HCA+, which uses Raman spectrum technology and an unsupervised clustering method to realize more mixed sample colony selection schemes with fewer selection times according to the complementary relation of different clustering methods on colony Raman spectrum adaptability; compared with the traditional morphological screening method or the derivative method based on the morphological screening method, the invention carries out fingerprint detection on the bacterial colony based on the Raman spectrum of the bacterial colony, realizes more accurate identification of the bacterial colony, and can obviously shorten the time consumption of bacterial selection and promote the bacterial colony selection efficiency on the premise of comprehensively and accurately taking the bacterial selection into account compared with the traditional bacterial selection without deviation; the invention provides a novel quick selection method for colony detection so as to improve colony selection efficiency, and the method is scientific and concise in design and suitable for popularization.
Description
Technical Field
The invention relates to the technical field of colony identification, in particular to a colony Raman spectrum rapid selection method based on HCA+.
Background
Colony detection and identification are an important part of bacterial research, and conventional bacterial identification comprises morphological, physiological, biochemical and genetic characteristics, and the aim of the identification is usually to divide pure culture monoclonal colonies; researchers pick colonies of interest by characterization of colony morphology (e.g., size, morphology, color, transparency, density, and edges), etc., and then conduct identification or other studies.
In reality, the microorganisms from different sources may have similar colony morphology, so that the task of selecting the microbial colonies without deviation of the species is often not performed efficiently, and this problem is particularly remarkable when the sampling amount is large.
The colony Raman spectrum is a fingerprint spectrum of the colony, contains abundant phenotypic information of the colony in a specific physiological state, has the advantages of no marking and no damage to a sample, and provides a new selection method for selecting the colony based on the colony Raman spectrum, so that researchers can quickly select and even determine specific colony species of the colony.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a colony Raman spectrum rapid selection method based on HCA < + >, which greatly improves the selection efficiency and sampling specificity of microorganism colony and reduces the time for acquiring specific microorganisms.
The technical scheme adopted for solving the technical problems is as follows: a colony raman spectrum rapid selection method based on hca+' comprising:
step one: inputting colony Raman spectrum information;
inputting Raman spectrum information of the colony into a spectrum projection module and an HCA+ bacteria picking scheme module;
as an illustration, the colony raman spectral information includes: colony culture dish information, colony information, and the like.
As an illustration, the colony culture dish information includes: number of culture dishes, number of culture dish colonies, coordinates of culture dish colonies, and the like.
As an illustration, the colony information further comprises: colony numbering, colony extraction position coordinates, the number of Raman spectra collected at colony picking positions, and the like.
Step two: spectral projection and colony distribution visualization;
carrying out spectrum projection on the input colony Raman spectrum information;
further, before the spectrum projection, spectrum preprocessing and displacement calibration processing are performed, including the following operations:
step1: spectral pretreatment;
first, removing cosmic rays for raman spectra of each colony;
secondly, carrying out self-adaptive baseline correction on Raman spectra of each colony;
finally, carrying out standardization treatment on the Raman spectrum of each colony so as to realize dimensionless treatment of the spectrum intensity;
as an illustration, the removal of cosmic rays may employ: a median fitting algorithm and/or a second order differential derivative method.
As an illustration, the adaptive baseline correction may employ: an adaptive iterative re-weighting penalty least squares algorithm (Adaptive Iterative Re-weighted Penalized Least Squares, airPLS) and/or an ALS alternating least squares algorithm (alternating least squares, ALS).
As an example, the normalization process includes: normalization methods and/or normalization methods project spectra between [0,1 ].
Step2: performing displacement calibration;
firstly, judging and intercepting a Raman shift intersection area of a colony Raman spectrum;
then, fitting and interpolating the spectra of the intersection region to ensure that the Raman spectra of the input colony have the same spectral resolution under the same displacement;
as an illustration, the fitting and interpolation operation includes: polynomial fitting interpolation and/or spline interpolation.
Step3: spectral projection;
performing nonlinear dimension reduction on the colony spectrum data subjected to spectrum pretreatment and displacement calibration treatment, wherein the nonlinear dimension reduction comprises the following steps:
firstly, converting the space distance of Raman spectrum data of any two colonies in a high-dimensional space into similarity probability;
then, replacing the conditional probability of random neighborhood embedding with the probability between the high-dimensional space data point and the analog data point of the corresponding low-dimensional space, and using a distribution strategy in the low-dimensional space to relieve the problem of data point crowding in the low-dimensional space;
as an illustration, the spatial distance includes: one or a combination of euclidean distance, manhattan distance, min Shi distance, or entropy of information.
As an illustration, the distribution strategy is: t distribution and/or normal distribution.
Step three: "HCA+" picking protocol;
an "HCA" moiety;
input:
(1) Colony Raman spectrum data set D= { x 11 ,x 12 ,...,x ab A represents the number of colonies, b represents the number of raman spectra collected for each colony, and the number of datasets n=a×b;
(2) The number of clusters is K 1 ,K 1 ≤a;
(3) Colony selection threshold of N 1 ,N 1 ≤b;
(4) Colony raman spectral space distance measurement
Wherein: p and q are the spectral spatial positions of two colonies;
and (3) outputting:
picking a colony number set: c (C) t ={C NO.1 ,C NO.2 ,…C NO.t };;
Clustering: initializing the number of clusters count=n, wherein count is used for controlling the iteration times of the clustering loop, i.e. the Raman spectrum in each colony is independently set as a cluster C i Wherein: i=1, 2,. -%, n;
step1: merging two clusters with the smallest distance in the current cluster number;
step2: the clusters are renumbered after merging, and the distances between the new numbers and other clusters are calculated;
step3: count=count-1, repeat Step1-2 until count=k 1 When the iterative process is finished, the current clustering result C= { C is output 1 ,C 2 ,…C k1 };
According to the current clustering result, K is adjusted 1 Value, re-executing Step1-3;
step4: statistics of intra-cluster C i The spectrum number of each colony is sequenced from big to small, and whether the spectrum number of the largest colony is larger than or equal to N is judged 1 If satisfied, will beMaximum colony number C NO.i Store to C t In the set, otherwise, traversing the next cluster until all K is traversed 1 Clustering clusters;
according to the current C t Statistics result, adjust N 1 Value, re-execute Step4, determine "HCA" part final C t 。
The "+" section, comprising the steps of:
input:
(1) The colony Raman spectrum data set is D;
(2) Number of clusters K 2 ,K 2 ≤a;
(3) Colony selection threshold N 2 ,N 2 ≤b;
(4) Picking colony number set C outputted by HCA t ;
And (3) outputting: picking colony number set C t ={C NO.1 ,C NO.2 ,…C NO.t };
Clustering:
s1: selecting a clustering method of non-HCA distance measurement dist (p, q), carrying out clustering statistics on a colony Raman spectrum dataset, and adjusting K according to a statistical result 2 Value, readjusting clustering result;
as an illustration, the cluster statistics includes: a partition-based cluster, a hierarchical-based cluster, a density-based cluster, and/or a grid-based cluster.
S2: counting the spectrum number of each colony in each cluster, sequencing from big to small, and judging whether the spectrum number of the largest colony is more than or equal to N 2 If the number is satisfied, the maximum colony number C is determined NO.i Whether or not it belongs to C t Aggregate, if not, C NO.i Store to C t A collection;
according to "HCA+" two-part C t Final statistics, N is adjusted 2 Value, re-executing step S2, and outputting final picked colony number set C t ;
Step four: picking colony information;
according to step threePicking colony number set C returned by HCA+' picking scheme t C is carried out by t The corresponding information of the colony culture dish and the colony coordinate information are sent to colony picking equipment for colony picking.
As an illustration, to further verify the picking accuracy, the colonies selected were placed in PCR tubes and identified by 16s-PCR sequencing.
The invention has the beneficial effects that:
according to the invention, a Raman spectrum technology and an unsupervised clustering method are used, and more mixed sample colony picking schemes are realized with fewer picking times according to the complementary relation of different clustering methods on the adaptability of the colony Raman spectrum.
Compared with the traditional morphological screening method or the derivative method based on the morphological screening method, the invention carries out fingerprint detection on the bacterial colony based on the Raman spectrum of the bacterial colony, realizes more accurate identification of the bacterial colony, and can obviously shorten the time consumption of bacterial selection and promote the bacterial colony selection efficiency on the premise of comprehensively and accurately taking the bacterial selection into account compared with the bacterial colony selection without deviation in the traditional bacterial selection process.
The invention provides a novel quick selection method for colony detection so as to improve colony selection efficiency, and the method is scientific and concise in design and suitable for popularization.
Drawings
FIG. 1 is a block diagram of the flow scheme of a colony Raman spectrum rapid selection method based on HCA+.
FIG. 2 is a graph of a record of colony Raman spectrum information from a "HCA+" based rapid colony Raman spectrum selection method of the present invention.
FIG. 3 is a visual result of spectral projection after nonlinear dimension reduction of colony spectral data of a colony Raman spectrum rapid selection method based on HCA+.
FIG. 4 shows the final clustering results of the "HCA" portion of a "HCA+" based colony Raman spectroscopy rapid selection method of the present invention.
FIG. 5 is a collection of "HCA" partial picking spectra of a "HCA+" based rapid selection method of colony Raman spectra in accordance with the present invention.
FIG. 6 shows Kmeans clustering results of a colony Raman spectrum quick selection method based on HCA+.
FIG. 7 is a collection of picked colony numbers for a "HCA+" based rapid selection method of colony Raman spectroscopy in accordance with the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Referring to fig. 1 to 7, a colony raman spectrum rapid picking method based on "hca+", comprising:
step one: inputting colony Raman spectrum information;
inputting Raman spectrum information of the colony into a spectrum projection module and an HCA+ bacteria picking scheme module;
as one illustration, the colony raman spectral information includes, but is not limited to: colony culture dish information, colony information, and the like.
As an illustration, as shown in fig. 2, the colony culture dish information includes, but is not limited to: number of culture dish (such as A2-1, A2-2, C2 …), number of colony of culture dish, colony coordinates (colony) of culture dish, etc.;
the colony information includes, but is not limited to: colony numbers (No. 1, no.2, no.3 and …), colony extraction position coordinates (loc 3), the number of Raman spectra collected at colony picking positions (9), and the like.
Step two: spectral projection and colony distribution visualization;
carrying out spectrum projection on the input colony Raman spectrum information;
further, before the spectrum projection, spectrum preprocessing and displacement calibration processing are performed, including the following operations:
step1: spectral pretreatment;
firstly, removing cosmic rays from the Raman spectrum of each colony, namely removing background signals of the Raman spectrum;
secondly, performing self-adaptive baseline correction;
finally, the normalization treatment is carried out to realize the dimensionless treatment of the spectrum intensity.
Further, the method of removing cosmic rays may use: a median fitting algorithm, a second-order difference derivation method and the like;
as an illustration, applying the airPLS algorithm based to remove background signals of raman spectra, lambda=100 is set, maximum number of iterations itermaxair pls=15.
As an example, the normalization process includes: normalization methods and/or normalization methods.
As an example, raman spectrum (SCRS) data for each colony was normalized using Min-Max, and the spectrum was projected between [0,1 ].
Step2: performing displacement calibration;
firstly, judging and intercepting a Raman shift intersection area of a colony Raman spectrum;
then, fitting and interpolating the spectra of the intersection region to ensure that the Raman spectra of the input colony have the same spectral resolution under the same displacement;
as an illustration, the fitting and interpolation operation includes: polynomial fitting interpolation and/or spline interpolation.
Step3: spectral projection;
performing nonlinear dimension reduction on the colony spectrum data subjected to spectrum pretreatment and displacement calibration treatment, wherein the nonlinear dimension reduction comprises the following steps:
firstly, converting the space distance of Raman spectrum data of any two colonies in a high-dimensional space into similarity probability;
then, replacing the conditional probability of random neighborhood embedding with the probability between the high-dimensional space data point and the analog data point of the corresponding low-dimensional space, and using a distribution strategy in the low-dimensional space to relieve the problem of data point crowding in the low-dimensional space;
as an illustration, the spatial distance includes: one or a combination of euclidean distance, manhattan distance, min Shi distance, or entropy of information.
As an illustration, the distribution strategy is: t distribution and/or normal distribution.
As an illustration, the T distribution refers to the nonlinear dimension reduction of colony spectrum data after the pretreatment and the displacement calibration treatment by using T-SNE (T-SNE: T-Distribution Stochastic Neighbour Embedding, T distribution random nearest neighbor embedding), and the visualization result after the spectrum projection is shown in fig. 3.
Step three: "HCA+" picking protocol;
an "HCA" moiety;
input:
(1) Colony Raman spectrum data set D= { x 11 ,x 12 ,...,x ab A represents the number of colonies, where the number of colonies is 7, b represents the number of raman spectra collected for each colony, where the number of raman spectra collected for a colony is 9, and the number of datasets n=a×b=63;
(2) The number of clusters is K 1 =3,K 1 ≤a;
(3) Colony selection threshold N 1 =7,N 1 ≤b;
(4) Colony raman spectral space distance measurement
Wherein: p and q are the spectral spatial positions of two colonies;
and (3) outputting:
picking a colony number set: c (C) t ={C NO.1 ,C NO.2 ,…C NO.t };
Clustering: initializing the number of clusters count=n, wherein count is used for controlling the iteration times of the clustering loop, i.e. the Raman spectrum in each colony is independently set as a cluster C i Wherein: i=1, 2,. -%, n;
step1: merging two clusters with the smallest distance in the current cluster number;
step2: the clusters are renumbered after merging, and the distances between the new numbers and other clusters are calculated;
step3: count=count-1, repeat Step1-2 until count=k 1 When the iterative process is finished, the current clustering result C= { C is output 1 ,C 2 ,…C K1 };
According to the current clustering result, K is adjusted 1 Value, re-executing Step1-3; the final clustering result is shown in fig. 4, and 3 different shape points represent 3 clustering results.
Step4: statistics of intra-cluster C i The spectrum number of each colony is sequenced from big to small, and whether the spectrum number of the largest colony is larger than or equal to N is judged 1 If satisfied, the maximum colony number C NO.i Store to C t In the set, otherwise, traversing the next cluster until all K is traversed 1 Clustering clusters;
according to the current C t Statistics result, adjust N 1 And (5) carrying out Step4 again, wherein fig. 5 shows a bacterial picking spectrum set of the HCA part, and the colony numbers corresponding to the set are used as output and fed back to the bacterial picking equipment for bacterial picking.
The "+" section, comprising the steps of:
input:
(1) The colony Raman spectrum data set is D;
(2) Number of clusters K 2 ,K 2 ≤a;
(3) Colony selection threshold N 2 ,N 2 ≤b;
(4) Picking colony number set C outputted by HCA t ;
And (3) outputting: picking colony number set C t ={C NO.1 ,C NO.2 ,…C NO.t };
Clustering:
s1: selecting Kmeans clustering method of non-HCA distance measurement dist (p, q), carrying out clustering statistics on colony Raman spectrum dataset, and adjusting K according to statistical result 2 =4 values, readjusting the clustering result; kmeans clustering results are shown in FIG. 6;
as an illustration, the cluster statistics includes: a partition-based cluster, a hierarchical-based cluster, a density-based cluster, and/or a grid-based cluster.
S2: counting the spectrum number of each colony in each cluster from large to smallSequencing, judging whether the spectrum number of the maximum colony is greater than or equal to N 2 =5, and if satisfied, the maximum colony number C is determined NO.i Whether or not it belongs to C t Aggregate, if not, C NO.i Store to C t A collection;
according to "HCA+" two-part C t Final statistics, N is adjusted 2 Value, re-executing step S2, and outputting final picked colony number set C t The method comprises the steps of carrying out a first treatment on the surface of the The final picking result is shown in 7.
Step four: picking colony information;
picking colony number set C returned according to the HCA+ picking scheme of step three t As shown in FIG. 7, C t The corresponding information of the colony culture dish and the colony coordinate information are sent to colony picking equipment for colony picking.
As an illustration, to further verify the picking accuracy, the colonies selected were placed in PCR tubes and identified by 16s-PCR sequencing.
The foregoing description of the preferred embodiments of the present invention has been presented only to facilitate the understanding of the principles of the invention and its core concepts, and is not intended to limit the scope of the invention in any way, however, any modifications, equivalents, etc. which fall within the spirit and principles of the invention should be construed as being included in the scope of the invention.
Claims (10)
1. A colony raman spectrum rapid selection method based on hca+ "comprising:
step one: inputting colony Raman spectrum information;
inputting Raman spectrum information of the colony into a spectrum projection module and an HCA+ bacteria picking scheme module;
step two: spectral projection and colony distribution visualization;
carrying out spectrum projection on the input colony Raman spectrum information;
step three: "HCA+" picking protocol;
an "HCA" moiety;
input:
(1) colony Raman spectrum data set D= { x 11 ,x 12 ,...,x ab A represents the number of colonies, b represents the number of raman spectra collected for each colony, and the number of datasets n=a×b;
(2) the number of clusters is K 1 ,K 1 ≤a;
(3) Colony selection threshold of N 1 ,N 1 ≤b;
(4) Colony raman spectral space distance measurement
Wherein: p and q are the spectral spatial positions of two colonies;
and (3) outputting:
picking a colony number set: c (C) t ={C NO.1 ,C NO.2 ,…C NO.t };
Clustering: initializing the number of clusters count=n, wherein count is used for controlling the iteration times of the clustering loop, i.e. the Raman spectrum in each colony is independently set as a cluster C i Wherein: i=1, 2,. -%, n;
step1: merging two clusters with the smallest distance in the current cluster number;
step2: the clusters are renumbered after merging, and the distances between the new numbers and other clusters are calculated;
step3: count=count-1, repeat Step1-2 until count=k 1 When the iterative process is finished, the current clustering result C= { C is output 1 ,C 2 ,…C k1 };
According to the current clustering result, K is adjusted 1 Value, re-executing Step1-3;
step4: statistics of intra-cluster C i The spectrum number of each colony is sequenced from big to small, and whether the spectrum number of the largest colony is larger than or equal to N is judged 1 If satisfied, the maximum colony number C NO.i Store to C t In the set, otherwise, traversing the next cluster until all K is traversed 1 Clustering clusters;
according to the current timeFront C t Statistics result, adjust N 1 Value, re-execute Step4, determine "HCA" part final C t ;
The "+" section includes the steps of:
input:
(1) the colony Raman spectrum data set is D;
(2) number of clusters K 2 ,K 2 ≤a;
(3) Colony selection threshold N 2 ,N 2 ≤b;
(4) Picking colony number set C outputted by HCA t ;
And (3) outputting: picking colony number set C t ={C NO.1 ,C NO.2 ,…C NO.t };
Clustering:
s1: selecting a clustering method of non-HCA distance measurement dist (p, q), carrying out clustering statistics on a colony Raman spectrum dataset, and adjusting K according to a statistical result 2 Value, readjusting clustering result;
s2: counting the spectrum number of each colony in each cluster, sequencing from big to small, and judging whether the spectrum number of the largest colony is more than or equal to N 2 If the number is satisfied, the maximum colony number C is determined NO.i Whether or not it belongs to C t Aggregate, if not, C NO.i Store to C t A collection;
according to "HCA+" two-part C t Final statistics, N is adjusted 2 Value, re-executing step S2, and outputting final picked colony number set C t ;
Step four: picking colony information;
picking colony number set C returned according to the HCA+ picking scheme of step three t C is carried out by t The corresponding information of the colony culture dish and the colony coordinate information are sent to colony picking equipment for colony picking.
2. The rapid colony raman spectrum selection method based on hca+ according to claim 1, wherein the spectrum pretreatment and displacement calibration process are performed before the spectrum projection, comprising the following operations:
step1: spectral pretreatment;
first, removing cosmic rays for raman spectra of each colony;
secondly, carrying out self-adaptive baseline correction on Raman spectra of each colony;
finally, carrying out standardization treatment on the Raman spectrum of each colony so as to realize dimensionless treatment of the spectrum intensity;
step2: performing displacement calibration;
firstly, judging and intercepting a Raman shift intersection area of a colony Raman spectrum;
then, fitting and interpolating the spectra of the intersection region to ensure that the Raman spectra of the input colony have the same spectral resolution under the same displacement;
step3: spectral projection;
performing nonlinear dimension reduction on the colony spectrum data subjected to spectrum pretreatment and displacement calibration treatment, wherein the nonlinear dimension reduction comprises the following steps:
firstly, converting the space distance of Raman spectrum data of any two colonies in a high-dimensional space into similarity probability;
then, the probability between the data points in the high-dimensional space and the simulated data points in the corresponding low-dimensional space is used for replacing the conditional probability of random neighborhood embedding, and a distribution strategy is used in the low-dimensional space, so that the problem of data point crowding in the low-dimensional space is solved.
3. A colony raman spectroscopy rapid picking method based on "hca+" according to claim 2, wherein the colony raman spectroscopy information comprises: colony culture dish information and colony information;
the colony culture dish information includes: the culture dish number, the number of culture dish colonies and the culture dish colony coordinates;
the colony information includes: the number of the colony, the coordinates of the extraction position of the colony and the number of the Raman spectrum collected by the picking position of the colony.
4. The method for rapid selection of colony raman spectrum based on hca+ according to claim 2, wherein the removing cosmic rays can be performed by: a median fitting algorithm and/or a second order differential derivative method.
5. A colony raman spectrum rapid picking method based on "hca+" according to claim 2, wherein the adaptive baseline correction is performed by: an adaptive iterative re-weighting penalty least squares algorithm and/or an ALS alternating least squares algorithm.
6. A colony raman spectroscopy rapid picking method based on "hca+" according to claim 2, wherein the normalization process comprises: normalization methods and/or normalization methods project spectra between [0,1 ].
7. A colony raman spectrum rapid picking method based on "hca+" according to claim 2, wherein said fitting and interpolating operations comprise: polynomial fitting interpolation and/or spline interpolation.
8. A colony raman spectroscopy rapid picking method based on "hca+" according to claim 2, wherein said spatial distance comprises: one of euclidean distance, manhattan distance, min Shi distance, or entropy of information.
9. The method of claim 2, wherein the distribution strategy is: t distribution and/or normal distribution.
10. The rapid colony raman spectrum selection method based on hca+ according to claim 1, wherein the clustering statistical method comprises: a partition-based cluster, a hierarchical-based cluster, a density-based cluster, and/or a grid-based cluster.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310523753.XA CN116559142B (en) | 2023-05-11 | 2023-05-11 | Colony Raman spectrum rapid selection method based on HCA +' |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310523753.XA CN116559142B (en) | 2023-05-11 | 2023-05-11 | Colony Raman spectrum rapid selection method based on HCA +' |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116559142A CN116559142A (en) | 2023-08-08 |
CN116559142B true CN116559142B (en) | 2023-11-21 |
Family
ID=87491144
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310523753.XA Active CN116559142B (en) | 2023-05-11 | 2023-05-11 | Colony Raman spectrum rapid selection method based on HCA +' |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116559142B (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108844941A (en) * | 2018-05-30 | 2018-11-20 | 武汉工程大学 | A kind of identification and classification method of the different grade phosphorus mines based on Raman spectrum and PCA-HCA |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009011565A1 (en) * | 2007-07-17 | 2009-01-22 | Erasmus University Medical Center Rotterdam | Method for typing and identification of micro-organisms |
US9170192B2 (en) * | 2009-10-09 | 2015-10-27 | Lawrence D. Ziegler | Systems and methods for identifying materials utilizing multivariate analysis techniques |
-
2023
- 2023-05-11 CN CN202310523753.XA patent/CN116559142B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108844941A (en) * | 2018-05-30 | 2018-11-20 | 武汉工程大学 | A kind of identification and classification method of the different grade phosphorus mines based on Raman spectrum and PCA-HCA |
Also Published As
Publication number | Publication date |
---|---|
CN116559142A (en) | 2023-08-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111709901B (en) | FCM cluster matching and Wallis filtering-based no-weight multi/hyperspectral remote sensing image color homogenizing method | |
CN101526994B (en) | Fingerprint image segmentation method irrelevant to collecting device | |
CN112522371A (en) | Analysis method of spatial transcriptome sequencing data | |
CN101271572A (en) | Image segmentation method based on immunity clone selection clustering | |
CN109584253B (en) | Oil abrasive particle image segmentation method | |
Shao et al. | Using the maximum between-class variance for automatic gridding of cDNA microarray images | |
CN108154158B (en) | Building image segmentation method for augmented reality application | |
CN113889192B (en) | Single-cell RNA-seq data clustering method based on deep noise reduction self-encoder | |
CN110490836A (en) | DPCR microarray images information processing method | |
CN109359604B (en) | Method for identifying instrument under shadow interference facing inspection robot | |
CN107533593A (en) | Method for identifying unknown microbiota subpopulations from the set with reference to subgroup by mass spectrography | |
CN112819299A (en) | Differential K-means load clustering method based on center optimization | |
CN116559142B (en) | Colony Raman spectrum rapid selection method based on HCA +' | |
CN111462157B (en) | Infrared image segmentation method based on genetic optimization threshold method | |
CN110751660B (en) | Color image segmentation method | |
CN104657949A (en) | Method for optimizing structural elements during denoising of coal slime flotation froth image | |
CN116721343A (en) | Cross-domain field cotton boll recognition method based on deep convolutional neural network | |
CN111563536B (en) | Bamboo strip color self-adaptive classification method based on machine learning | |
CN114612787A (en) | Urban green land deep learning extraction method supported by scale change strategy | |
CN105989248B (en) | Data processing method and device for multiple molecular signals | |
Shao et al. | cDNA microarray image segmentation with an improved moving k-means clustering method | |
CN111754533A (en) | Image segmentation method based on improved genetic algorithm and K-mean algorithm | |
CN110348452A (en) | A kind of image binaryzation processing method and its system | |
CN106097293A (en) | A kind of image severability evaluation method for image capturing system optimization design | |
CN111814550B (en) | Ceramic texture extraction method based on convolutional neural network and image processing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |