CN104484558B - The analysis report automatic generation method and system of biological information project - Google Patents

The analysis report automatic generation method and system of biological information project Download PDF

Info

Publication number
CN104484558B
CN104484558B CN201410742360.9A CN201410742360A CN104484558B CN 104484558 B CN104484558 B CN 104484558B CN 201410742360 A CN201410742360 A CN 201410742360A CN 104484558 B CN104484558 B CN 104484558B
Authority
CN
China
Prior art keywords
project
sub
analysis
sample
quality control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410742360.9A
Other languages
Chinese (zh)
Other versions
CN104484558A (en
Inventor
苏海桥
徐伟玲
石俊杰
邹利君
李卡麟
陈力春
梁绍光
李国庆
刘娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Technology Solutions Co Ltd
Original Assignee
BGI Technology Solutions Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Technology Solutions Co Ltd filed Critical BGI Technology Solutions Co Ltd
Priority to CN201410742360.9A priority Critical patent/CN104484558B/en
Publication of CN104484558A publication Critical patent/CN104484558A/en
Application granted granted Critical
Publication of CN104484558B publication Critical patent/CN104484558B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Investigating Or Analysing Biological Materials (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Disclose a kind of analysis report automatic generation method of biological information project, including step:When sub-project is an only filtering sub-project, i.e., according to default parameter configuration with unified filter criteria to being filtered and analyzed through machine data under the sample after the completion of sequencer, generation webpage version or/and the analysis result information of PDF editions;When sub-project is normalizer project, then the corresponding sample of the sub-project is while through sequencer, normal process analysis is created to the sample, and corresponding filtration parameter is inputted according to the sample situation of current sub-project by user in each normal process analytic process is created;After the completion of sequencer, sample segment data are first filtered out according to the filtration parameter of input, filtering is then carried out to remaining each sample data with analyzing generation webpage version or/and the analysis result information of PDF editions with created normal process analysis.

Description

The analysis report automatic generation method and system of biological information project
Technical field
The present invention relates to analysis of biological information field, more particularly to a kind of analysis report of biological information project to automatically generate Method and system.
Background technology
With the fast development of life science experimental technique, the automation of scientific instrument, intelligent level increasingly carry Height, data output capacity have qualitative leap.Meanwhile life science to analysis test requirement no matter sample size, Analytical cycle, analysis project and data accuracy etc. are proposed higher standard and the request, biology laboratory output Information increases by geometric progression.
In traditional biology laboratory, since data type is various, form differs, the preservations of data, exchange, inquiry, Analysis, maintenance are all very inconvenient, and the information seriously hindered between researcher is submitted.In large-scale parallel sequencing (under also crying Generation sequencing technologies, high-flux sequence, English:NGS, High-throughput Sequencing) experiment and biological information A variety of flows for arriving involved in credit analysis, as DNA library is built, gene order-checking, data processing, interpretation of result, achievement output, Multiple links such as data sharing, each link have different technical staff to participate in, therefore in links transmission or accepting The problems such as information loss or inefficiency etc. occurs.Sequencing especially in bioinformatics, high-performance calculation link, specially Industry sequencing laboratory needs to receive substantial amounts of sequencing order items, arranges sequencing experiment, the sequencing knot of timely processing high speed output Fruit.
In the link after the completion of sequencing in bioinformatics, all items have corresponding service line project team to do. Service line, which had not only been done, only to be filtered, but also is standardized, and does personalization.Also pass through 1. project managements project verification, 2. information Mans are responsible for People's examination & approval, 3. information executors confirm that 4. arrange to run flow, and 5. reports are filled in, and 6. deliver, the processes such as 7. project managements are linked up, Therefore the originally limited resource of service line is consumed.
In addition, in the prior art, after analysis process is completed, some are generated under linux similar under windows The file of txt, the inside include the data that whole concluding report needs.Data distribution in that file, it is necessary to 1, on linux roads That file is opened under footpath;2nd, some data are replicated manually, such as:Data volume ...;3 and then fill out our word versions In report template;4th, Quality Control is finally carried out again;And 5, client is transmitted to by project management.
The analysis result information generating mode of the prior art has the following disadvantages:
File is not as word document is in use under first, linux.Digital independent is more inconvenient.
Second, process is filled in the reproduction process of data, form modifying process (such as 3000000,3,000 must be extended this as, 000), it is filled in manually in addition, and the easy change property of word templates, be easy to cause the mistake of data.To fill in and the above-mentioned 4th Step Quality Control adds pressure.
3rd, take time and effort, filling in for a concluding report often just needs several hours with Quality Control.
The content of the invention
, can effectively certainly it is an object of the invention to provide the analysis report automatic generation method and system of a kind of biological information project It is dynamic to quickly generate webpage version or/and the analysis report of PDF editions, and since webpage version or/and the report of PDF editions are not easy to change, from And error is reduced, substantially eliminate the pressure of Quality Control work.
The present invention provides a kind of analysis report automatic generation method of biological information project, including step:
Step 1:Establishment project is simultaneously stored in business management system, and each project includes more sub-projects;And described in selecting Sub-project and mission bit stream in establishment project;The type of the sub-project includes only filtering sub-project and normalizer project;
Step 2:When the sub-project type for only filtering a sub-project, then according to the sub-project type and task of selection Information, obtains sample data corresponding and after sequencer from lower machine data management system successively;And often obtain One sample data, i.e., filtered and analyzed with unified filter criteria according to default parameter configuration, so as to filter not Meet the sample data of parameter configuration;And all samples data filtering to be obtained and after having analyzed, directly generate webpage version or/ Analysis result information with PDF editions reports that analysis result information report includes sub-project information and corresponding sample message;
Step 3:When the type of the sub-project is normalizer project, then the corresponding sample of the sub-project is through surveying While sequence instrument is sequenced, filter analysis, express spectra quantitative analysis are included to sample establishment, comparison in difference is analyzed, Cluster gathers One or more normal process analyses in alanysis, microRNA target prediction analysis, KOGO analyses and base editor analysis, and Create in each normal process analytic process and corresponding filtration parameter is inputted according to the sample situation of current sub-project by user;Through After the completion of sequencer, each sample data is filtered according to the filtration parameter of the input, is not met so as to remove The sample data of filtration parameter;Then each sample data to meeting filtration parameter is analyzed with created normal process again Analyzed, directly generate webpage version or/and the analysis result information of PDF editions, analysis result information include sub-project information and Corresponding sample message;
Step 4:Quality Control is carried out to the analysis result information, if Quality Control is by directly exporting the analysis result report Accuse;If Quality Control is by the way that and the gap of the analysis result information and quality control standard is in threshold range, then updates described Filtering and the analytic process of step 2 or step 3 are carried out after sample data or filtering/Quality Control parameter again, until analysis knot Fruit report passes through Quality Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then edits The sample and discarded correlation Lane, and place an order again in the business management system.
As the improvement of above-mentioned technical proposal, the summary info per sub-project include sub-project code, sub-project title, Whether sub-project type, be filtering, total sample number, executor, starting and end time, sub-project state and a son Project relevant operation.
As the improvement of above-mentioned technical proposal, the sample message includes sample ID, library title, Lane ID, sequencing Strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, Base distribution figure and base Quality Control distribution map.
As the improvement of above-mentioned technical proposal, further include:
Step 5:The analysis result information is subjected to storage backup.
As the improvement of above-mentioned technical proposal, in the step 4:If Quality Control not by and the analysis result information Gap with quality control standard in threshold range, update the sample data be can be with single sample data edition or sample Product batch is edited.
The invention also discloses a kind of analysis report automatic creation system of biological information project, including:
Creating unit, for creating project and being stored in business management system, each project includes more sub-projects;And select Select the sub-project and mission bit stream in the establishment project;The type of the sub-project includes only filtering sub-project and normalizer Project;
First filter analysis unit, is only filtering sub-project for the type when the sub-project, then according to the son of selection Item types and mission bit stream, obtain sample corresponding and after sequencer from lower machine data management system successively Data;And often obtain a sample data, i.e., filtered according to default parameter configuration with unified filter criteria with being divided Analysis, so as to filter the sample data for not meeting parameter configuration;And all samples data filtering to be obtained and after analyze, generate Webpage version or/and the analysis result information of PDF editions, analysis result information include sub-project information and corresponding sample message;
Second filter analysis unit, for when the type of the sub-project is normalizer project, then the sub-project pair For the sample answered while through sequencer, which, which is created, includes filter analysis, express spectra quantitative analysis, comparison in difference One or more marks in analysis, Cluster cluster analyses, microRNA target prediction analysis, KOGO analyses and base editor analysis Quasi- process analysis, and in each normal process analytic process is created by user according to the input pair of the sample situation of current sub-project The filtration parameter answered;After the completion of sequencer, each sample data is filtered according to the filtration parameter of the input, So as to remove the sample data for not meeting filtration parameter;Then analyzed again with created normal process to meeting filtration parameter Each sample data analyzed, generate webpage version or/and the analysis result information of PDF edition, analysis result information includes sub Project information and corresponding sample message;
Quality Control unit, for carrying out Quality Control to the analysis result information, if Quality Control is by directly exporting the analysis knot Fruit is reported;If Quality Control is by the way that and the gap of the analysis result information and quality control standard then updates in threshold range Filtering and the analytic process of step 2 or step 3 are carried out after the sample data or filtering/Quality Control parameter again, until point Analysis result report passes through Quality Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then The sample and discarded correlation Lane are edited, and is placed an order again in the business management system.
As the improvement of above-mentioned technical proposal, the summary info per sub-project include sub-project code, sub-project title, Whether sub-project type, be filtering, total sample number, executor, starting and end time, sub-project state and a son Project relevant operation.
As the improvement of above-mentioned technical proposal, the sample message includes sample ID, library title, Lane ID, sequencing Strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, Base distribution figure and base Quality Control distribution map.
As the improvement of above-mentioned technical proposal, further include:
Storage unit:For storage backup will to be carried out by the analysis result information of Quality Control.
As the improvement of above-mentioned technical proposal, in the Quality Control unit:If Quality Control not by and the analysis result report Accuse and the gap of quality control standard be in threshold range, update the sample data be can with single sample data edition or Sample batch is edited.
Compared with prior art, the analysis report automatic generation method of biological information project disclosed by the invention and system tool Have the advantages that:Can effectively quickly generate webpage version or/and the analysis report of PDF editions automatically, and due to webpage version or/and The report of PDF editions is not easy to change, so as to reduce error, substantially eliminates the pressure of Quality Control work.And the PDF editions reports than word editions Easily fixed propagation, webpage version can be seen online.
Brief description of the drawings
Fig. 1 is a kind of flow signal of analysis report automatic generation method of biological information project in the embodiment of the present invention Figure.
Fig. 2 shows the idiographic flow of the step S2 in Fig. 1.
Fig. 3 shows the idiographic flow of the step S3 in Fig. 1.
Fig. 4 shows the idiographic flow of the step S4 in Fig. 1.
Fig. 5 shows the idiographic flow of the step S5 in Fig. 1.
Fig. 6 is a kind of structural representation of the analysis report automatic creation system of biological information project in the embodiment of the present invention Figure.
Fig. 7 shows UI pages of one embodiment of the analysis report automatic creation system of the thing information project of the invention that grows directly from seeds The screenshot capture in face, the sectional drawing show the selective listing of sub-project.
Fig. 8 shows UI pages of one embodiment of the analysis report automatic creation system of the thing information project of the invention that grows directly from seeds The screenshot capture in face, the sectional drawing show the summary info of every sub-project.
Fig. 9 shows UI pages of one embodiment of the analysis report automatic creation system of the thing information project of the invention that grows directly from seeds The screenshot capture in face, the sectional drawing show the parameter setting interface for only filtering sub-project.
Figure 10 shows the UI of one embodiment of the analysis report automatic creation system of the thing information project of the invention that grows directly from seeds The screenshot capture of the page, the sectional drawing show parameter setting interface and the normal process analysis selection interface of normalizer project.
Embodiment
Below in conjunction with the attached drawing in the embodiment of the present invention, the technical solution in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other without creative efforts Embodiment, belongs to the scope of protection of the invention.
It is a kind of analysis report automatic generation method of biological information project provided in an embodiment of the present invention referring to Fig. 1 Structure diagram.The analysis report automatic generation method of the biological information project, including step:
S1:Establishment project is simultaneously stored in business management system, and each project includes more sub-projects;And select the establishment Sub-project and mission bit stream in project;The type of the sub-project includes only filtering sub-project and normalizer project;
In this step, the selected summary info per sub-project includes sub-project code, sub-project title, subitem Whether mesh type, be filtering, total sample number, executor, starting and end time, sub-project state and a sub-project Relevant operation.
S2:When the type of the sub-project is an only filtering sub-project, then according to the sub-project type and mission bit stream of selection, Sample data corresponding and after sequencer is obtained from lower machine data management system successively;And often obtain a sample Product data, i.e., filtered and analyzed with unified filter criteria according to default parameter configuration, and ginseng is not met so as to filter The sample data of number configuration;And all samples data filtering to be obtained and after having analyzed, directly generate webpage version or/and PDF The analysis result information of version, analysis result information include sub-project information and corresponding sample message;
In this step, the sample message include sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, ErrorRate, base point Butut and base Quality Control distribution map.
S3:When the type of the sub-project is normalizer project, then the corresponding sample of the sub-project is through sequenator While sequencing, which, which is created, includes filter analysis, express spectra quantitative analysis, comparison in difference analysis, Cluster clusters point One or more normal process analyses in analysis, microRNA target prediction analysis, KOGO analyses and base editor analysis, and creating Corresponding filtration parameter is inputted according to the sample situation of current sub-project by user in each normal process analytic process;Through sequencing After the completion of instrument sequencing, each sample data is filtered according to the filtration parameter of the input, filtering is not met so as to remove The sample data of parameter;Then each sample data for meeting filtration parameter is carried out with created normal process analysis again Analysis, directly generates webpage version or/and the analysis result information of PDF editions, analysis result information includes sub-project information and correspondence Sample message;
S4:Quality Control is carried out to the analysis result information, if Quality Control is by directly exporting the analysis result information;If Quality Control is by the way that and the gap of the analysis result information and quality control standard then updates the sample in threshold range Filtering and the analytic process of step S2 or step S3 are carried out after data or filtering/Quality Control parameter again, until analysis result report Announcement passes through Quality Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then described in editor Sample and discarded correlation Lane, and place an order again in the business management system.
S5:The analysis result information is subjected to storage backup.
The filter analysis of the present invention is to be distinguished according to the type of sub-project for only filtering sub-project or normalizer project Carry out, be described in detail separately below by Fig. 2 and Fig. 3.
As shown in Fig. 2, the process of filter analysis is carried out to sample message when the type of sub-project is only filters sub-project Including step:
S201:Detect corresponding one and machine under sample (sample) is only sequenced;
In this step, lower machine refers to the sample data for completing to obtain after sequencing by sequenator by sample data.
S202:Sample is sequenced to this according to default parameter configuration to be filtered and analyzed (run);
In the step, the lower machine data for each being only sequenced with unified filter analysis standard sample carry out, so that Non-compliant lower machine data are filtered out, and obtain analysis result.
S203:Determine whether all sequencing samples (sample) of the sub-project (project) filter and analysis (run) It is completeIf so, then carry out step S204, otherwise return to step S202;
S204:Directly generate webpage version or/and the analysis result information of PDF editions.
As shown in figure 3, the process of filter analysis is carried out to sample message when the type of sub-project is normalizer project Including step:
S301:Detect machine on a normalized sample (sample);
In this step, upper machine refers to sample data uploading to sequenator to be sequenced.
S302:One or more normal process analyses are created to the normalized sample, and create the same of normal process analysis When by the corresponding filtration parameter of user setting;The normal process analysis include but not limited to filter analysis, express spectra quantitative analysis, Comparison in difference analysis, Cluster cluster analyses, microRNA target prediction analysis, KOGO analyses and base editor analysis
S303:Machine under sample selected by detection (sample);
In this step, lower machine refers to the sample data for completing to obtain after sequencing by sequenator by sample data.
S304:The normalized sample is filtered according to the filtration parameter of user setting, so as to filter out the standardization Machine data under the sample of filtration parameter are not met in sample, are then analyzed with the normal process created to remaining sample after filtering Product data are analyzed;S305:Determine whether all normalized samples (sample) of the sub-project (project) are filtered with dividing It is complete to analyse (run)If so, then carry out step S204, otherwise return to step S202;
S306:Directly generate webpage version or/and the analysis result information of PDF editions.
It is the analysis knot to being obtained after any one sample data progress filter analysis in a sub-project with reference to figure 4 Fruit report carries out the process of Quality Control, and paying attention to Quality Control is just carried out after the sample data analysis of all samples of sub-project has been filtered , and Quality Control is carried out successively to each sample, specifically include step:
S401:Detect that some sample completes filter analysis, and directly generate webpage version or/and the analysis result of PDF editions Report;
S402:Quality Control is carried out to the analysis result information;
Specifically analysis result information is compared with quality control standard.
S403:Judge Quality Control whether by if Quality Control is by entering step S404, otherwise entering step S405;
S404:Export the analysis result information;
S405:Judge the analysis result information and quality control standard gap whether not in threshold range (i.e. gap is too Greatly), if otherwise entering step S406, otherwise it is transferred to step S408;
S406:Update the sample data or filtering/Quality Control parameter;
In this step, can be edited with single sample data edition or sample batch.
S407:The sample data is filtered and analyzed again according to sub-project type, directly generates webpage version again Or/and the analysis result information of PDF editions;And return to step S402;
S408:The sample and discarded correlation Lane are edited, and in BMS (Business Management System, industry Business management system) in place an order again;
S409:Wait machine under new sample data and the sub-project type according to sample and carry out corresponding filtering With analysis, webpage version or/and the analysis result information of PDF editions, and return to step S402 are directly generated;
Then, after all samples data of a sub-project carry out Quality Control, then a QC report is generated.
With reference to figure 5, the process that storage backup is carried out to the analysis result information of sample data specifically includes step:
S501:Sample data is analyzed;
S502:Judge whether the sample analysis is completedIf so, then entering step S503, otherwise continue step S501;
S503:Activation system device backup function is available;
S504:User confirms to back up, and clicks on " backup ";
S505:System prompt backup request is submitted;
S506:System copies data to delivery system;
S507:Judge whether copy succeedsIf so, then entering step S509, S508 is otherwise entered step;
S508:Prompt user ID error, and return to step S504.
S509:Prompt user ID success;And terminate.
As it can be seen that the analysis report automatic generation method of biological information project disclosed in the present embodiment, can be effectively automatic quick Webpage version or/and the analysis report of PDF editions are generated, and since webpage version or/and the report of PDF editions are not easy to change, so as to reduce Error, substantially eliminates the pressure of Quality Control work.And the PDF editions reports than word editions are easier fixed propagation, webpage version can be with See online.
Present invention also offers a kind of analysis report automatic creation system of biological information project, as shown in fig. 6, including wound Unit 10, the first filter analysis unit 20, the second filter analysis unit 30, Quality Control unit 40 and storage unit 50 are built, wherein Creating unit 10, the first filter analysis unit 20, the second filter analysis unit 30, Quality Control unit 40 and storage unit 50 can be with It is incorporated into a background server, and front end directly operates on webpage, is operated by user and input parameter, specifically 's:
Creating unit 10, for create project and be stored in business management system (Business Management System, BMS, sequencing and the distribution of information analysis task and management system, contain the organizational informations such as sub-project, person liable, data) in, often A project includes more sub-projects;And select the sub-project and mission bit stream in the establishment project;The type of the sub-project Including only filtering sub-project and normalizer project;
As shown in fig. 7, one embodiment for the analysis report automatic creation system of thing information project that grows directly from seeds for the present invention The screenshot capture of the UI pages, the sectional drawing show the selective listing of sub-project.More sub-projects are shown in the sub-project list, And it is labeled as a filtering items (Y) or standardization project (N) per sub-project.And Fig. 8 is to specifically show a sub-project Summary info.Per sub-project summary info include sub-project code, sub-project title, sub-project type, whether be only Filtering, total sample number, executor, starting and end time, sub-project state and sub-project relevant operation.
First filter analysis unit 20, is only filtering sub-project for the type when the sub-project, then according to selection Sub-project type and mission bit stream, successively from lower machine data management system (Data Management System, DMS, to sequencing The lower machine data completed carry out quality monitoring and data management) in obtain corresponding and after sequencer sample number According to;And often obtain a sample data, i.e., filtered and analyzed with unified filter criteria according to default parameter configuration, So as to filter the sample data for not meeting parameter configuration;And all samples data filtering to be obtained and after analyze, directly give birth to Into webpage version or/and the analysis result information of PDF editions, analysis result information includes sub-project information and corresponding sample message;
With reference to figure 9, be the present invention grow directly from seeds thing information project analysis report automatic creation system one embodiment UI The screenshot capture of the page, the sectional drawing show the parameter setting interface for only filtering sub-project.
Second filter analysis unit 30, for when the type of the sub-project is normalizer project, then the sub-project For corresponding sample while through sequencer, which, which is created, includes filter analysis, express spectra quantitative analysis, diversity ratio Compared with one or more in analysis, Cluster cluster analyses, microRNA target prediction analysis, KOGO analyses and base editor analysis Normal process is analyzed, and is inputted in each normal process analytic process is created by user according to the sample situation of current sub-project Corresponding filtration parameter;After the completion of sequencer, each sample data was carried out according to the filtration parameter of the input Filter, so as to remove the sample data for not meeting filtration parameter;Then again with created normal process analysis to meeting filtering Each sample data of parameter is analyzed, and directly generates webpage version or/and the analysis result information of PDF editions, analysis result report Announcement includes sub-project information and corresponding sample message;
Wherein, the sample message include sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution map.
With reference to figure 10, be the present invention grow directly from seeds thing information project analysis report automatic creation system one embodiment UI The screenshot capture of the page, the sectional drawing show parameter setting interface and the normal process analysis selection interface of normalizer project.
Quality Control unit 40, for carrying out Quality Control to the analysis result information, if Quality Control is by directly exporting the analysis As a result report;If Quality Control is by the way that and the gap of the analysis result information and quality control standard is then compiled again in threshold range The is carried out after volume (can be edited with single sample data edition or sample batch) described sample data or filtering/Quality Control parameter again The filtering of one filter analysis unit 20 or the second filter analysis unit 30 and analytic process, until analysis result information passes through matter Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then edits the sample and useless Related Lane is abandoned, and is placed an order again in the business management system;And
Storage unit 50:The analysis result information is backed up for storing.
As it can be seen that the analysis report automatic creation system of biological information project disclosed in the present embodiment, can be effectively automatic quick Webpage version or/and the analysis report of PDF editions are generated, and since webpage version or/and the report of PDF editions are not easy to change, so as to reduce Error, substantially eliminates the pressure of Quality Control work.And the PDF editions reports than word editions are easier fixed propagation, webpage version can be with See online.
The above is the preferred embodiment of the present invention, it is noted that for those skilled in the art For, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications are also considered as Protection scope of the present invention.

Claims (8)

1. a kind of analysis report automatic generation method of biological information project, it is characterised in that including step:
Step 1:Establishment project is simultaneously stored in business management system, and each project includes more sub-projects;And select the establishment Sub-project and mission bit stream in project;The type of the sub-project includes only filtering sub-project and normalizer project;Wherein, Per sub-project summary info include sub-project code, sub-project title, sub-project type, whether be only filtering, a total sample Product number, executor, starting and end time, sub-project state and sub-project relevant operation;
Step 2:When the type of the sub-project is an only filtering sub-project, then according to the sub-project type and mission bit stream of selection, Sample data corresponding and after sequencer is obtained from lower machine data management system successively;And often obtain a sample Product data, i.e., filtered and analyzed with unified filter criteria according to default parameter configuration, and ginseng is not met so as to filter The sample data of number configuration;And all samples data filtering to be obtained and after having analyzed, directly generate webpage version or/and PDF The analysis result information report of version, analysis result information report include sub-project information and corresponding sample message;
Step 3:When the type of the sub-project is normalizer project, then the corresponding sample of the sub-project is through sequenator While sequencing, which, which is created, includes filter analysis, express spectra quantitative analysis, comparison in difference analysis, Cluster clusters point One or more normal process analyses in analysis, microRNA target prediction analysis, KOGO analyses and base editor analysis, and creating Corresponding filtration parameter is inputted according to the sample situation of current sub-project by user in each normal process analytic process;Through sequencing After the completion of instrument sequencing, each sample data is filtered according to the filtration parameter of the input, filtering is not met so as to remove The sample data of parameter;Then each sample data for meeting filtration parameter is carried out with created normal process analysis again Analysis, directly generates webpage version or/and the analysis result information of PDF editions, analysis result information includes sub-project information and correspondence Sample message;
Step 4:Quality Control is carried out to the analysis result information, if Quality Control is by directly exporting the analysis result information;If Quality Control is by the way that and the gap of the analysis result information and quality control standard then updates the sample in threshold range Filtering and the analytic process of step 2 or step 3 are carried out after data or filtering/Quality Control parameter again, until analysis result report Announcement passes through Quality Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then described in editor Sample and discarded correlation Lane, and place an order again in the business management system.
2. the analysis report automatic generation method of biological information project as claimed in claim 1, it is characterised in that the sample Information include sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution map.
3. the analysis report automatic generation method of biological information project as claimed in claim 1, it is characterised in that further include:
Step 5:The analysis result information is subjected to storage backup.
4. the analysis report automatic generation method of biological information project as claimed in claim 1, it is characterised in that in the step In rapid four:If Quality Control not by and the gap of the analysis result information and quality control standard in threshold range, updating The sample data is can be edited with single sample data edition or sample batch.
A kind of 5. analysis report automatic creation system of biological information project, it is characterised in that including:
Creating unit, for creating project and being stored in business management system, each project includes more sub-projects;And select institute State the sub-project and mission bit stream in establishment project;The type of the sub-project includes only filtering sub-project and standardization subitem Mesh;Whether the summary info per sub-project includes sub-project code, sub-project title, sub-project type, is only filtering, total Sample number, executor, starting and end time, sub-project state and sub-project relevant operation;
First filter analysis unit, is only filtering sub-project for the type when the sub-project, then according to the sub-project of selection Type and mission bit stream, obtain sample number corresponding and after sequencer from lower machine data management system successively According to;And often obtain a sample data, i.e., filtered and analyzed with unified filter criteria according to default parameter configuration, So as to filter the sample data for not meeting parameter configuration;And all samples data filtering to be obtained and after analyze, generation net Page version or/and the analysis result information of PDF editions, analysis result information include sub-project information and corresponding sample message;
Second filter analysis unit, for when the type of the sub-project is normalizer project, then the sub-project to be corresponding For sample while through sequencer, which, which is created, includes filter analysis, express spectra quantitative analysis, comparison in difference point One or more standards in analysis, Cluster cluster analyses, microRNA target prediction analysis, KOGO analyses and base editor analysis Process analysis, and inputted and corresponded to according to the sample situation of current sub-project by user in each normal process analytic process is created Filtration parameter;After the completion of sequencer, each sample data is filtered according to the filtration parameter of the input, from And remove the sample data for not meeting filtration parameter;Then analyzed again with created normal process to meeting filtration parameter Each sample data is analyzed, and generating webpage version or/and the analysis result information of PDF editions, analysis result information includes subitem Mesh information and corresponding sample message;
Quality Control unit, for carrying out Quality Control to the analysis result information, if Quality Control is by directly exporting the analysis result report Accuse;If Quality Control is by the way that and the gap of the analysis result information and quality control standard is in threshold range, then updates described Filtering and the analytic process of step 2 or step 3 are carried out after sample data or filtering/Quality Control parameter again, until analysis knot Fruit report passes through Quality Control;If Quality Control is by the way that and the gap of the analysis result information and quality control standard exceedes threshold value, then edits The sample and discarded correlation Lane, and place an order again in the business management system.
6. the analysis report automatic creation system of biological information project as claimed in claim 5, it is characterised in that the sample Information include sample ID, library title, Lane ID, sequencing strategy, Flowcell ID, Raw data, Raw Reads, Read Length, GC%, Q20%, Q30%, Error Rate, base distribution figure and base Quality Control distribution map.
7. the analysis report automatic creation system of biological information project as claimed in claim 5, it is characterised in that further include:
Storage unit:For storage backup will to be carried out by the analysis result information of Quality Control.
8. the analysis report automatic creation system of biological information project as claimed in claim 5, it is characterised in that in the matter Control in unit:If Quality Control not by and the gap of the analysis result information and quality control standard in threshold range, compiling again It is that can be edited with single sample data edition or sample batch to collect the sample data.
CN201410742360.9A 2014-12-08 2014-12-08 The analysis report automatic generation method and system of biological information project Active CN104484558B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410742360.9A CN104484558B (en) 2014-12-08 2014-12-08 The analysis report automatic generation method and system of biological information project

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410742360.9A CN104484558B (en) 2014-12-08 2014-12-08 The analysis report automatic generation method and system of biological information project

Publications (2)

Publication Number Publication Date
CN104484558A CN104484558A (en) 2015-04-01
CN104484558B true CN104484558B (en) 2018-04-24

Family

ID=52759099

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410742360.9A Active CN104484558B (en) 2014-12-08 2014-12-08 The analysis report automatic generation method and system of biological information project

Country Status (1)

Country Link
CN (1) CN104484558B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106775998A (en) * 2016-11-09 2017-05-31 上海派森诺生物科技股份有限公司 High flux 16S sequencing data automatic processing methods
CN106557666A (en) * 2016-11-15 2017-04-05 上海派森诺生物科技股份有限公司 It is a kind of can be based on the data filtering method of Nextseq500 high-flux sequence platforms
CN107329941B (en) * 2017-07-05 2020-07-10 福建省特种设备检验研究院 Special equipment inspection report based editing and previewing system
CN108776748A (en) * 2018-05-16 2018-11-09 成都奇恩生物科技有限公司 A kind of gene detection system and its detection method
CN109063959A (en) * 2018-06-22 2018-12-21 深圳弘睿康生物科技有限公司 A kind of sample quality control analysis method and system
CN109712683A (en) * 2018-12-17 2019-05-03 上海派森诺医学检验所有限公司 A kind of examining report generation method of automation
CN110176276B (en) * 2019-04-12 2021-01-05 苏州赛美科基因科技有限公司 Biological information analysis process management method and system
CN110335638B (en) * 2019-05-22 2021-11-23 北京安智因生物技术有限公司 Automatic generation method and system for statin drug gene detection report
CN111424075B (en) * 2020-04-10 2021-01-15 西咸新区予果微码生物科技有限公司 Third-generation sequencing technology-based microorganism detection method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1711548A (en) * 2002-11-18 2005-12-21 西门子公司 Method for carrying out quality control on an analytical process and device for carrying out said method
CN101914628A (en) * 2010-09-02 2010-12-15 深圳华大基因科技有限公司 Method and system for detecting polymorphism locus of genome target region
CN102206704A (en) * 2011-03-02 2011-10-05 深圳华大基因科技有限公司 Method and device for assembling genome sequence
CN102477460A (en) * 2010-11-24 2012-05-30 深圳华大基因科技有限公司 Method for performing sequencing and cluster analysis on V6 hypervariable region of metagenomic 16S rDNA
CN103324866A (en) * 2013-03-26 2013-09-25 张弘 Ripple system
CN103714180A (en) * 2014-01-08 2014-04-09 浪潮(北京)电子信息产业有限公司 Bioinformatics database system and data processing method
CN103993069A (en) * 2014-03-21 2014-08-20 深圳华大基因科技服务有限公司 Virus integration site capture sequencing analysis method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1711548A (en) * 2002-11-18 2005-12-21 西门子公司 Method for carrying out quality control on an analytical process and device for carrying out said method
CN101914628A (en) * 2010-09-02 2010-12-15 深圳华大基因科技有限公司 Method and system for detecting polymorphism locus of genome target region
CN102477460A (en) * 2010-11-24 2012-05-30 深圳华大基因科技有限公司 Method for performing sequencing and cluster analysis on V6 hypervariable region of metagenomic 16S rDNA
CN102206704A (en) * 2011-03-02 2011-10-05 深圳华大基因科技有限公司 Method and device for assembling genome sequence
CN103324866A (en) * 2013-03-26 2013-09-25 张弘 Ripple system
CN103714180A (en) * 2014-01-08 2014-04-09 浪潮(北京)电子信息产业有限公司 Bioinformatics database system and data processing method
CN103993069A (en) * 2014-03-21 2014-08-20 深圳华大基因科技服务有限公司 Virus integration site capture sequencing analysis method

Also Published As

Publication number Publication date
CN104484558A (en) 2015-04-01

Similar Documents

Publication Publication Date Title
CN104484558B (en) The analysis report automatic generation method and system of biological information project
CN104484582B (en) The biological information project automatic analysis method and system realized by modularization selection
Nikolayeva et al. edgeR for differential RNA-seq and ChIP-seq analysis: an application to stem cell biology
Jafarpour et al. Bridging the timescales of single-cell and population dynamics
CN104503840B (en) The method and device that terminal resource is optimized
Saez-Rodriguez et al. Flexible informatics for linking experimental data to mathematical models via DataRail
CN106170762A (en) Management includes the interface used by DFD of subgraph
CN107766696A (en) Eucaryote alternative splicing analysis method and system based on RNA seq data
CN107077349A (en) Job creation with data preview
CN109448788A (en) On-line analysis platform architecture of microbiology of genomics and bioinformatics
Ockree et al. Integrating big data analytics into development planning optimization
CN104484375B (en) Establish the method and system of database automatically in project analysis flow
CN106528511A (en) Form analysis method and device
CN104484750B (en) The product parameters automatic matching method and system of biological information project
CN105653897B (en) LncRNA analysis system and method based on biological cloud platform
CN103136440B (en) Data processing method and device
CN112434032B (en) Automatic feature generation system and method
CN104484581B (en) The automated analysis method and system of biological information project
US20030004612A1 (en) Methods and computer program products for automated experimental design
CN111433610A (en) Laboratory instrument selection and configuration
CN113312529A (en) Data visualization method and device, computer equipment and storage medium
Van der Blom et al. Sparkle: Toward Accessible Meta-Algorithmics for Improving the State of the Art in Solving Challenging Problems
WO2016039011A1 (en) Simulation system, simulation method, and simulation program
CN113643757B (en) Microbiology data interaction analysis system
Ranjan et al. SNIKT: sequence-independent adapter identification and removal in long-read shotgun sequencing data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant