CN103258054A - Method and device for processing data - Google Patents

Method and device for processing data Download PDF

Info

Publication number
CN103258054A
CN103258054A CN2013102138242A CN201310213824A CN103258054A CN 103258054 A CN103258054 A CN 103258054A CN 2013102138242 A CN2013102138242 A CN 2013102138242A CN 201310213824 A CN201310213824 A CN 201310213824A CN 103258054 A CN103258054 A CN 103258054A
Authority
CN
China
Prior art keywords
data
dimension
metric
incidence relation
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013102138242A
Other languages
Chinese (zh)
Other versions
CN103258054B (en
Inventor
闫朝升
李丹
孙忠人
闫朝岐
姜德友
Original Assignee
闫朝升
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 闫朝升 filed Critical 闫朝升
Priority to CN201310213824.2A priority Critical patent/CN103258054B/en
Publication of CN103258054A publication Critical patent/CN103258054A/en
Application granted granted Critical
Publication of CN103258054B publication Critical patent/CN103258054B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and device for processing data, wherein the method includes the following steps: dimension data are obtained; a first data set corresponding to the dimension data is extracted from a first data bank, wherein the first data set comprises first data; aggregation calculation is carried out on the first data and a metric value is obtained; a first incidence relation is established between the metric value and the dimension data and a second data bank is generated. According to the method and device for processing the data, the effect that a complete ancient Chinese medicine case data bank is established rapidly and accurately is achieved. When a user inquires ancient Chinese medicine case data, the user can directly obtain a complete inquiry result by directly inquiring from the second data bank. Inquiry is rapid, accurate and high in efficiency.

Description

Data processing method and device
Technical field
The present invention relates to data processing field, in particular to a kind of data processing method and device.
Background technology
The ancient Chinese medicine case carrying thousands of year over theory of medicine and the practical experience of ancient Chinese medicine doctor, be richly stored with, unknown information, be the precious resources of Basic Theories of Chinese Medicine and clinical research.How realizing the digitizing storage of the traditional Chinese medical science case in the ancient literature resource, obtain ancient Chinese medicine diagnosis and treatment rule, is very necessary and urgent task.
Along with computer information technology is increasingly extensive in the application of tcm field, utilize computer technology to set up ancient Chinese medicine case data-storage system, realize the digitizing storage of ancient Chinese medicine case, obtain unknown, valuable information that contain, prior, seek ancient Chinese medicine treatment characteristics, become possibility, and attempted by some researchers, on the collection of traditional Chinese medical science case data, storage, the fundamentals of management, realized ancient Chinese medicine case Study on Data Processing in ancient times.But existing research exists problems.One, existing research has only realized simple statistics, data analysis only is confined to from single angle, the statistics frequency and frequency, as: based on the usefulness cave frequency and the frequency of specific passages through which vital energy circulates.Its two, the storage of existing data just realizes the simple storage of data, when such data were carried out complex query, existing storage system was difficult to realize multi-angle, multi-level, multi-sided inquiry and obtains accurate and visual and understandable Query Result.
At imperfect owing to existing ancient Chinese medicine case database in the prior art, thereby cause using inaccurate, the inefficiency of result of existing case data base querying data, and can't realize the more problem of complex query, effective solution is not proposed at present as yet.
Summary of the invention
Imperfect owing to existing ancient Chinese medicine case database at correlation technique, thereby cause using inaccurate, the inefficiency of result of existing case data base querying data, and can't realize the more problem of complex query, effective solution is not proposed at present as yet, for this reason, fundamental purpose of the present invention is to provide a kind of data processing method and device, to address the above problem.
To achieve these goals, according to an aspect of the present invention, provide a kind of data processing method, this method comprises: obtain dimension data; Extract first data acquisition corresponding with dimension data from first database, wherein, first data acquisition comprises first data; First data are assembled calculate metric; First incidence relation of setting up between metric and the dimension data generates second database.
Further, first data being assembled the step that calculates metric comprises: read standard vocabulary from the standard vocabulary storehouse; According to standard vocabulary first data are carried out standardization and be converted to second data; Second data are assembled calculate metric.
Further, after from first database, extracting first data acquisition corresponding with dimension data, data processing method comprises: whether the integrity degree that detects first data meets first threshold, integrity degree in first data does not meet under the situation of first threshold, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Or detect the 3rd data that whether exist in first data acquisition with first data consistent, and in first data acquisition, exist under the situation of the 3rd data, delete the 3rd data, first data acquisition after obtaining clearing up.
Further, the step of handling first data acquisition after obtaining clearing up is deleted or augmented to first data comprises: extract data of description from dimension data; Use the pick of description number that first data are augmented processing, so that the integrity degree of the first number pick meets first threshold, first data acquisition after obtaining clearing up.
Further, dimension data comprises: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, setting up the step that first incidence relation between metric and the dimension data generates second database comprises: set up the 3rd incidence relation between metric and the dimension element code data; Set up first incidence relation by metric and second incidence relation; Preserve dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
Further, setting up after first incidence relation between metric and the dimension data generates second database, data processing method also comprises: receive user's the first multidimensional data analysis request; The first multidimensional data analysis request is converted into the second multidimensional data analysis request; From second database, obtain the result data collection corresponding with the second multidimensional data analysis request; Output result data collection.
To achieve these goals, according to an aspect of the present invention, provide a kind of data processing equipment, this device comprises: acquisition module is used for obtaining dimension data; First extraction module is used for extracting first data acquisition corresponding with dimension data from first database; Computing module calculates metric for first data of first data acquisition are assembled; First processing module, first incidence relation that is used for setting up between metric and the dimension data generates second database.
Further, computing module comprises: first read module is used for reading standard vocabulary from the standard vocabulary storehouse; Second processing module is used for according to standard vocabulary first data being carried out standardization and is converted to second data; The first sub-computing module calculates metric for second data are assembled.
Further, data processing equipment also comprises: whether first detection module meets first threshold for detection of the integrity degree of first data; The 3rd processing module is used for not meeting under the situation of first threshold at the integrity degree of first data, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Second detection module is for detection of the 3rd data that whether exist in first data acquisition with first data consistent; The manages module everywhere, is used for existing under the situation of the 3rd data at first data acquisition, deletes the 3rd data, first data acquisition after obtaining clearing up.
Further, the 3rd processing module comprises: second extraction module is used for extracting data of description from dimension data; The 5th processing module is used for using data of description that first data are augmented processing, so that the integrity degree of first data meets first threshold, and first data acquisition after obtaining clearing up.
Further, dimension data comprises: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, first processing module comprises: the first sub-processing module is used for setting up the 3rd incidence relation between metric and the dimension element code data; The second sub-processing module is used for setting up first incidence relation by metric and second incidence relation; The 3rd sub-processing module is used for preserving dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
Adopt the present invention, by obtaining dimension data, from first database, extract first data acquisition corresponding with dimension data then, and first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect and data are unordered, thereby cause using the result of existing case data base querying data inaccurate, inefficiency, and can't realize the more problem of complex query, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Description of drawings
Accompanying drawing described herein is used to provide further understanding of the present invention, constitutes the application's a part, and illustrative examples of the present invention and explanation thereof are used for explaining the present invention, do not constitute improper restriction of the present invention.In the accompanying drawings:
Figure l is the structural representation according to the data processing equipment of the embodiment of the invention;
Fig. 2 is the structural representation according to data processing equipment embodiment illustrated in fig. 1;
Fig. 3 is the process flow diagram according to the data processing method of the embodiment of the invention;
Fig. 4 be according to embodiment illustrated in fig. 3 in second database schematic flow sheet of data query; And
Fig. 5 is the schematic flow sheet according to the creation method of initial multi-dimensional database embodiment illustrated in fig. 3.
Embodiment
Need to prove that under the situation of not conflicting, embodiment and the feature among the embodiment among the application can make up mutually.Describe the present invention below with reference to the accompanying drawings and in conjunction with the embodiments in detail.
Figure l is the structural representation according to the data processing equipment of the embodiment of the invention.Fig. 2 is the structural representation according to data processing equipment embodiment illustrated in fig. 1.
As depicted in figs. 1 and 2, this device can comprise: acquisition module 10 is used for obtaining dimension data; First extraction module 30 is used for extracting first data acquisition corresponding with dimension data from first database; Computing module 50 calculates metric for first data of first data acquisition are assembled; First processing module 70, first incidence relation that is used for setting up between metric and the dimension data generates second database.
Adopt the present invention, obtain dimension data by acquisition module, from first database, extract first data acquisition corresponding with dimension data by first extraction module then, and computing module first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data by first processing module generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect, thereby cause using the result of existing case data base querying data inaccurate, inefficiency, and can't realize the more problem of complex query, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Particularly, the scheme in the above embodiment of the present invention is applicable to different data base management system (DBMS)s and different operating system.
According to the abovementioned embodiments of the present invention, computing module 50 can comprise: the first read module 5l is used for reading standard vocabulary from the standard vocabulary storehouse; Second processing module 53 is used for according to standard vocabulary first data being carried out standardization and is converted to second data; The first sub-computing module 55 calculates metric for second data are assembled.
Particularly, can carry out standardization to first data by the standard vocabulary that second processing module 53 reads according to first read module 51 and be converted to second data.For example, various ways such as the title of dynasty dimension data may life period, publication time, such as being: different forms such as clear, Qing Dynasty, the Qing Dynasty need according to standard vocabulary so with its unified value form that is treated to " Qing Dynasty ".
In the above embodiment of the present invention, device can also comprise: whether first detection module 90 meets first threshold for detection of the integrity degree of first data; The 3rd processing module 110 is used for not meeting under the situation of first threshold at the integrity degree of first data, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Second detection module 130 is for detection of second data that whether exist in first data acquisition with first data consistent; The manages module 150 everywhere, is used for deleting second data, first data acquisition after obtaining clearing up under there is situation with second data of first data consistent in first data acquisition.
According to the abovementioned embodiments of the present invention, the 3rd processing module 110 can comprise: second extraction module is used for extracting data of description from dimension data; The 5th processing module is used for using data of description that first data are augmented processing, so that the integrity degree of first data meets first threshold, and first data acquisition after obtaining clearing up.
Particularly, in this data processing equipment, the 3rd processing module 110 can also comprise the data removing module, is used for data value mistake and loss situation according to described first data, carries out data dump; As: the integrity degree that detects first data does not meet first threshold, thinks that then these first data lack the main disease to be treated data of case, then delete this first data by the data removing module; Second detection module 130 and the is managed module 150 everywhere and can be arranged on data and merge in the module, particularly, second detection module 130 can also detect the situation that whether has a plurality of row storage identical datas in first data acquisition, if exist then a plurality of row merged, particularly, adopt and delete unnecessary row or the data that the number pick in the different row is modified in the row are integrated processing, obtain first data acquisition after the cleaning, as: the acupuncture points on the human body data are stored in respectively in master point and adjunct acupuncture points two row, the data of two row can be merged so.
In the above embodiment of the present invention, dimension data can comprise: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, first processing module comprises: the first sub-processing module is used for setting up the 3rd incidence relation between metric and the dimension element code data; The second sub-processing module is used for setting up first incidence relation by metric and second incidence relation; The 3rd sub-processing module is used for preserving dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
In the above embodiment of the present invention, this device can also comprise: receiver module 170 is used for receiving user's the first multidimensional data analysis request; First modular converter 190 is used for the first multidimensional data analysis request is converted into the second multidimensional data analysis request; The 6th processing module 210 is used for obtaining the result data collection corresponding with the second multidimensional data analysis request from second database; And output module 230 is used for output result data collection.
Particularly, after the first multidimensional data analysis request that receives the user by receiver module 170, first modular converter 190 of system can ask to be converted to the second multidimensional data analysis request according to the first multidimensional data analysis request, wherein, ask conversion to comprise according to the first multidimensional data analysis request request of data classification and generated query statement or direct generated query statement are carried out in the first multidimensional data analysis request, from second database, obtain the result data collection corresponding with the second multidimensional data analysis request by the 6th processing module 210 then, then by output module 230 these result data collection of output.Wherein, output module 230 can show this result data collection according to the mode that user in the second multidimensional data analysis request asks to show the result data collection, wherein, can the result data collection be converted to visualized data by second modular converter, and by display module the result data collection be shown.Wherein, receiver module 170 and output module 230 can be arranged on the front-end server of system; The mode that shows the result data collection comprises: tables of data display format and figure display format.
Fig. 3 is the process flow diagram according to the data processing method of the embodiment of the invention, and this method comprises the steps: as shown in Figure 3
Step s102 obtains dimension data.
Step s104 extracts first data acquisition corresponding with dimension data from first database, wherein, first data acquisition comprises first data.
Step S106 assembles to calculate metric to first data.
Step s108, first incidence relation of setting up between metric and the dimension data generates second database.
At first obtain dimension data, from first database, extract first data acquisition corresponding with dimension data then, and first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect, thereby cause using the result of existing case data base querying data inaccurate, and can't obtain the problem of visual and understandable analysis result fast from existing database, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Wherein, the above embodiment of the present invention can be applied to the processing of ancient Chinese medicine case data.Store the incidence relation between field of medicaments target standard vocabulary and the standard vocabulary in first database; Assemble being calculated as the statistical calculation that first data that get access to are carried out, wherein statistical calculation comprises the calculating of first data being sued for peace, average and asked counting; Metric comprises the observation index value according to a plurality of data analysis angles.Said counting just refers to the calculating of metering number, for example, Baihui acupoint occurs and once just counts a number, and the counting that twice pair of Baihui acupoint occur is 2, occurs three times and is 3.
Dimension data can be kept in the dimension table in the multi-dimensional database, and multi-dimensional database can be preserved one group of dimension table and second database, can preserve dimension data in each dimension table.Wherein, dimension is the angle of data analysis, dimension hierarchy is the description of the logical organization relation on the dimension, the dimension element data are for being used for the concrete data on the storage dimension hierarchy, the dimension element code data can be one group of character code of unique identification dimension element data, can also preserve the data of description of the logical organization relation on the dimension in the dimension table, and the logical organization relation has reflected the division of the different level of detail on the dimension.
More specifically, can also comprise first database in the multi-dimensional database, first data can comprise the data of description of data analysis angle, and also preserve the 5th incidence relation between first data and the dimension element data in first database, can from first database, read first data acquisition corresponding with dimension data by the 5th incidence relation.
In the above embodiment of the present invention, at first owing to storing the data of description of data analysis angle in the dimension table, storing the observation index value according to a plurality of data analysis angles in second database, wherein the data of storing in second database also can the use center preservation form of table be stored in the multi-dimensional database, set up contacting between center table and the dimension table according to first incidence relation and the 3rd incidence relation, make up ancient Chinese medicine case multi-dimensional database, for multidimensional data analysis provides data source science, reliable, that accurately be suitable for.
In the above-described embodiments, by first incidence relation and the 3rd incidence relation, set up contacting between second database and the dimension table, so that center table and dimension table are linked, be convenient to multidimensional data analysis result's inquiry and demonstration.For example, by the dimension element data in the dimension table and dimension element code data, a plurality of dimension data of description in the first multidimensional data analysis request that gets access to are converted to the element code combination that corresponding dimension element code data is formed, and utilize this contact from the table of center, to obtain element code and make up corresponding metric; Otherwise, utilize this contact, the corresponding element code of metric in the table of center is made up the data of description that is converted to corresponding a plurality of dimensions, and then the concrete implication of level of understanding value.
In the above-described embodiments, the method for execution in step s106 can comprise:
(1) according to the element combinations on the minimum dimension hierarchy between all dimensions, utilize the dimension data in the dimension table that first data in first data acquisition are assembled calculating and generated second data acquisition, as the foundation passages through which vital energy circulates, human body branch, disease, therapy, Qucha cave on the minimum dimension hierarchy between dynasty five dimensions, before/the side Head Section, hemiplegia, acupuncture manipulation, the dimension data of the element combinations of the Ming Dynasty, first data in first data acquisition are carried out the concrete statistics that acupuncture points on the human body uses frequency, and generate shape as (Qucha cave element code, before/side Head Section element code, the hemiplegia element code, the acupuncture manipulation element code, Ming Dynasty's element code, acupuncture points on the human body use frequency) second data acquisition;
Owing to may there be a plurality of elements of the same name and these elements corresponding different father's element on the upper level level on a certain dimension hierarchy of some specific dimension; For example, professional knowledge according to tcm field, different syndromes may comprise same symptoms, be to have a plurality of elements of the same name and these elements corresponding different father's element on the syndrome level on the symptom level of disease dimension, as: according to embodiments of the invention, the hemiplegia symptom can be under the jurisdiction of a plurality of syndromes; At above-mentioned situation, pattern design dimension element code according to the first pre-conditioned utilization " segmentation, succession type ", namely by father's element code section of a hierarchical elements code, can realize the differentiation of element of the same name, for example, utilize father's element (being the element on the syndrome level) of the code of hemiplegia element to carry out the differentiation of the symptom of the same name of different syndromes, and then realize assembling and calculate; In addition, use frequency and frequency of utilization statistics at specific acupoint, acupuncture points on the human body in can the establishing criteria lexicon and " affiliated specific acupoint " contact attribute between the specific acupoint, in the statistic processes of the use frequency of acupuncture points on the human body and frequency of utilization, synchronously finish the use frequency of specific acupoint and the statistics of frequency of utilization.
Wherein, first pre-conditioned can the setting in conjunction with ordinal relation between the different elements in the same level according to the tcm field standard vocabulary in the relation of the logical organization between the dimension hierarchy and the traditional Chinese medical science standard vocabulary storehouse.(there is father's element in even some elements to " segmentation, the succession type " pattern of utilization, then the coding of this element is made up of father's element coding and self coding of this element, otherwise, this element will be by absolute coding), element on the dimension hierarchy that dimension is contained is encoded, above-mentioned coding mode is convenient to contact between the element on label layer minor element and the upper level level, improves the data analysis treatment effeciency; As: at the passages through which vital energy circulates dimension, the element (namely all) on the top layer time be encoded to JM; Two elements on the one-level level (i.e. decent and eight extra-channel) are encoded to respectively: JMO and JMl, and above-mentioned coding is formed by two sections, i.e. father's element encoded radio JM of element and the respective element serial number (0 and 1) on same level; On the secondary level, seriously to be example, its two daughter elements (being yang channel and yin channel) are encoded to respectively: JM00 and JM01, and above-mentioned coding is made up of decent father's element encoded radio JMO and the serial number (0 and 1) of respective element on same level of element; In like manner, on tertiary level, be example with the yang channel, its two daughter elements (three Yang channels of hand and the three yang channels of foots) are encoded to respectively: JM000 and JM001; On the level Four level, be example with the three Yang channels of hand, its three daughter element (large intesting channel of Hand-Yangmings, the Tri-Jiao channels of Hand-Shaoyang and small intestine meridian) be encoded to respectively: JMOOOLI, JMOOOSJ and JM000SI, above-mentioned coding is made up of decent father's element encoded radio JM000 of element and the standard code of respective element, wherein, standard code is according to the classical works (as: " science of acupuncture and moxibustion " of science of acupuncture and moxibustion standard vocabulary with reference to China authority standard (standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ") and science of acupuncture and moxibustion field, " science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook etc.); On the Pyatyi level, be example with the large intesting channel of Hand-Yangming, its daughter element (as: shangyang acupoint, two ask cave, three ask cave etc.) is encoded to respectively: JMOOOLIl, JMOOOLl2, JMOOOLl3 etc., above-mentioned coding by decent father's element encoded radio JM000LI of element and and the serial number of respective element on same level form; Wherein, serial number according to science of acupuncture and moxibustion standard vocabulary with reference to the classical works in China authority standard (standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ") and science of acupuncture and moxibustion field (as: " science of acupuncture and moxibustion ", "=the science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook etc.)
(2) according to the element combinations on the higher dimension hierarchy between the dimension and the set membership between the element on the different dimensions level, existing data in the table of center are assembled calculating and generated the 3rd data acquisition.As: according to the element combinations of the urinary bladder channel of foot-Taiyang between passages through which vital energy circulates, human body branch, disease, therapy, dynasty five dimensions, head, hemiplegia, acupuncture manipulation, the Ming Dynasty, and according to the daughter element situation of urinary bladder channel of foot-Taiyang element and header elements, data in the center table are carried out the statistics that acupuncture points on the human body uses frequency, and generate shape as the combined data of (urinary bladder channel of foot-Taiyang element code, header elements code, hemiplegia element code, acupuncture manipulation element code, Ming Dynasty's element code, acupuncture points on the human body use frequency).
(3) with data importing to the second database in second data acquisition and the 3rd data acquisition.
In the above-described embodiments, be one group of dimension determining according to the actual needs of ancient Chinese medicine case data analysis, thereby guaranteed the multi-angle of data analysis; And according to the professional knowledge of tcm field, determine the element on dimension hierarchy and the dimension hierarchy, guaranteed the multi-stratification of data analysis; According to the multidimensional data analysis action type, carry out the data analysis of various ways, realize many sides property of data analysis.
After getting access to second database, multi-dimensional database is created and is finished, owing to not only storing first database that gathers in advance in the multi-dimensional database, and storing second database, therefore, in multi-dimensional database, only need in multi-dimensional database, carry out disposable visit or simple statistics during data query, just can obtain the multidimensional data analysis result, avoided the complexity on a large amount of raw data of a plurality of raw data bases, repeat, lot of data query manipulation and statistical operation, shortened system response time, improve the data analysis work efficiency, possessed fast data analyzing and processing ability.
Before from multi-dimensional database, reading dimension data, from raw data base, read raw data, determine dimension and tolerance according to raw data then, and according to the professional knowledge of tcm field, determine dimension element data and dimension hierarchy data, can obtain the dimension element code data according to the first pre-conditioned unique identification of determining each dimension element data afterwards.Wherein, raw data base also can be kept in the multi-dimensional database.
Wherein, data in dimension data and first database all can be kept in the raw data base, raw data in the raw data base can comprise the 4th incidence relation between first basic data, second basic data and each basic data of ancient Chinese medicine case, as, can comprise first basic datas such as symptom data, acupuncture points on the human body data and thorn moxibustion method data of ancient Chinese medicine case in the raw data base, second basic data can comprise data such as case title, case type, disease, author, source, dynasty.
According to the abovementioned embodiments of the present invention, first data being assembled the step that calculates metric can comprise: read standard vocabulary from the standard vocabulary storehouse; According to standard vocabulary first data are carried out standardization and be converted to second data; Second data are assembled calculate metric.
Store the standard vocabulary of tcm field in the standard vocabulary storehouse in above-described embodiment, particularly, standard vocabulary can comprise Basic Theories of Chinese Medicine standard vocabulary, traditional Chinese medical science disease standard vocabulary, science of acupuncture and moxibustion standard vocabulary etc.Wherein, the Basic Theories of Chinese Medicine standard vocabulary is with reference to China authority standard (as: standard No. be GB/T20348-2006's " Basic Theories of Chinese Medicine term ") and relevant classical works (as: " Basic Theories of Chinese Medicine "), traditional Chinese medical science disease standard vocabulary with reference to China authority standard (as: standard No. be GB/T14396-1993's " classification of diseases and code ", standard No. be GB/T15657-1995's " traditional Chinese medical science disease classification and code ", standard No. be GB/T16751.1-1997's " tcm clinical practice diagnosis and treatment term disease part ", standard No. be GB/T16751.2-1997's " tcm clinical practice diagnosis and treatment term syndrome part ", standard No. be GB/T16751.3-1997's " tcm clinical practice diagnosis and treatment term therapy part " etc.) and the classical works (as: " traditional Chinese medical science voluminous dictionary " of China's tcm field, " Diagnostics of Chinese Medicine ", " Chinese Internal Medicine " etc.), science of acupuncture and moxibustion standard vocabulary is with reference to the World Health Organization (WHO) and China authority standard (as: " acupoint international standard book number " of World Health Organization's issue, standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ", standard No. be 6812346-90's " through the position, cave " etc.) and the classical works (as: " science of acupuncture and moxibustion " in science of acupuncture and moxibustion field, " science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook, " the logical mirror in Chinese acupuncture and moxibustion acupuncture point " etc.).In addition, the standard vocabulary storehouse also is used for the contact attribute between the storage standards vocabulary.For example, owing to exist between acupuncture points on the human body and the passages through which vital energy circulates and be subordinate to and contact (i.e. acupuncture points on the human body belong to a passages through which vital energy circulates and a passages through which vital energy circulates comprises a plurality of acupuncture points on the human bodys), so " affiliated passages through which vital energy circulates " between the acupuncture points on the human body of preserving in the standard vocabulary storehouse and the passages through which vital energy circulates is used for representing the contact attribute between acupuncture points on the human body standard vocabulary and the passages through which vital energy circulates standard vocabulary.The standard vocabulary storehouse not only provides foundation for obtaining of raw data, and when first data being augmented processing, the contact attribute between the standard vocabulary of storing in the standard vocabulary storehouse can also be as augmenting foundation; When first data being carried out the standardization conversion, the contact attribute between the standard vocabulary of storing in the standard vocabulary storehouse can also be as the conversion foundation of standardization conversion.
Particularly, the standard vocabulary in the establishing criteria lexicon and the incidence relation between the standard vocabulary carry out data-switching to the non-standardization data in first data, obtain second data.For example, may there be the form of another name in the data value of acupuncture points on the human body in first data, and this just need be converted to standard name with another name according to the standard vocabulary of acupuncture points on the human body title.
In the above embodiment of the present invention, after from first database, extracting first data acquisition corresponding with dimension data, method can comprise: whether the integrity degree that detects first data meets first threshold, integrity degree in first data does not meet under the situation of first threshold, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Or detect second data that whether exist in first data acquisition with first data consistent, in first data acquisition, under the situation of second data of existence and first data consistent, delete second data, first data acquisition after obtaining clearing up.
According to the abovementioned embodiments of the present invention, the step of handling first data acquisition after obtaining clearing up is deleted or augmented to first data can comprise: extract data of description from dimension data; Use data of description that first data are augmented processing, so that the integrity degree of first data meets first threshold, first data acquisition after obtaining clearing up.
Particularly, in the above embodiment of the present invention, to the detection of first data in first data acquisition, that is, and the Data Detection of whether being undertaken by a plurality of row storage conditions, data rows design and data value setting situation according to the data value mistake in first data and loss situation, data value.Specific to traditional Chinese medical science case field, integrity degree in first data does not meet under the situation of first threshold, whether the integrity degree that detects first data is lower than second threshold value, integrity degree in first data is not less than under the situation of second threshold value, according to the contact attribute between the dimension element data that lack in first data and the standard vocabulary first data are carried out data and augment, first data acquisition after obtaining clearing up; Integrity degree in the first number pick is lower than under the situation of second threshold value, deletes first data, first data acquisition after obtaining clearing up.
In the above-described embodiments, lack of standardization in view of existing data value in the raw data base, data value mistake and losing, whether data value is by a plurality of row storages, many situations such as data rows design and data value setting, and then influence data statistics result's accuracy, also may exist the situation that lacks dimension data in the raw data source, and then influence the feasibility of multidimensional data analysis, the above embodiment of the present invention is deleted first data, augment processing first data acquisition is cleared up first data acquisition after obtaining clearing up, thereby avoided above-mentioned situation, thereby guaranteed the feasible degree of multidimensional data analysis, promoted data analysis result's accuracy.
In the above embodiment of the present invention, dimension data can comprise: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, setting up the step that first incidence relation between metric and the dimension data generates second database can comprise: set up the 3rd incidence relation between metric and the dimension element code data; Set up first incidence relation by metric and second incidence relation; Preserve dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
In the above embodiment of the present invention, to set up after first incidence relation between metric and the dimension data generates second database, said method can also comprise: receive user's the first multidimensional data analysis request; The first multidimensional data analysis request is converted into the second multidimensional data analysis request; From second database, obtain the result data collection corresponding with the second multidimensional data analysis request; Output result data collection.
Particularly, after the first multidimensional data analysis request that receives the user by receiver module 170, first modular converter 190 of system can ask to be converted to the second multidimensional data analysis request according to the first multidimensional data analysis request, wherein, ask conversion to comprise according to the first multidimensional data analysis request request of data classification and generated query statement or direct generated query statement are carried out in the first multidimensional data analysis request, from second database, obtain the result data collection corresponding with the second multidimensional data analysis request by the 6th processing module 210 then, then by this number of results pick collection of output module 230 outputs.Wherein, output module 230 can show this result data collection according to the mode that user in the second multidimensional data analysis request asks to show the result data collection, wherein, can the result data collection be converted to visualized data by second modular converter, and by display module the result data collection be shown.Wherein, receiver module 170 and output module 230 can be arranged on the front-end server of system; The mode that shows the result data collection comprises: tables of data display format and figure display format.
As shown in Figure 4, execution in step S202: receive user's the first multidimensional data analysis request.Particularly, the user can import the first multidimensional data analysis request by input equipment, and wherein, the first multidimensional data analysis request can comprise element and the tolerance on dimension, dimension order, dimension hierarchy, the dimension hierarchy.For example, analysis request for " acupuncture points on the human body of acupuncture case in ancient times on decent and eight extra-channel of the crooked symptom of dispute for the treatment of apoplexy uses frequency ", the user can carry out disease dimension and passages through which vital energy circulates dimension at the input end of system, " disease → passages through which vital energy circulates " dimension order, " dispute is crooked " element on all elements on " apoplexy " element → syndrome layer on the disease layer in the disease dimension (being all syndromes of apoplexy) → msq layer, the selection operation of " seriously " on the secondary level in the passages through which vital energy circulates dimension and " eight extra-channel " element and " acupuncture points on the human body use frequency " tolerance, and then generating the first multidimensional data analysis request that machine can be identified, analysis result is with reference to shown in the table 1.
Table 1
Figure BDA00003285908700101
After receiving the first multidimensional data analysis request, can execution in step S204: by first modular converter 190 the first multidimensional data analysis request is converted into the second multidimensional data analysis request.Particularly, if the first multidimensional data analysis request is based on the existing formed further analysis request of analysis result, then according to the multidimensional data analysis action type the first multidimensional data analysis request is classified and the generated query statement, otherwise direct generated query statement; According to the multidimensional data analysis action type the first multidimensional data analysis request is classified and the generated query statement effectively, simplified the complexity of data analysis request, shorten the execution time of data analysis request, and then improved the treatment effeciency of data analysis.
Raw data in the raw data base in the above-described embodiments can real-time update, can guarantee so can not obtain successively different results based on the analysis operation of the same data in the raw data base, thereby guarantee data analysis result's consistance.In addition, the invention provides Analysis Service and foreground service technology scheme, can generate data analysis request and information display requirement easily, after receiving the first multidimensional data analysis request, carry out flexibly data manipulation intuitively by 190 pairs of multidimensional data analysis requests of first modular converter and existing analysis result, and generate further data analysis request, reduced the complexity of the first multidimensional data analysis request generative process, realize polytype data analysis operation, obtain more interested analysis result, and with visual and understandable form demonstration object information, and then improved the interactivity of data analysis.
In the above-described embodiments, according to the multidimensional data analysis action type the first multidimensional data analysis request is classified effectively and the process of generated query statement in, the multidimensional data analysis action type comprises: bore down, last volume is cut into slices stripping and slicing and rotation; Following brill obtains detailed metric for increasing number of dimensions or changing higher level into lower level according to the relation of the logical organization between the dimension hierarchy on the specific dimension; Last volume obtains rough metric for reducing number of dimensions or changing lower level into higher level according to the relation of the logical organization between the dimension hierarchy on the specific dimension; Section is obtained the metric on other dimensions under the concrete element restriction on one or more dimensions; Stripping and slicing is under the interval restriction of the element on one or more dimensions, obtains the metric on other dimensions; Rotate to be the ordering that changes between the dimension, obtain new metric.
Particularly, first modular converter 190 can carry out deep analysis to existing analysis result, further generates the second multidimensional data analysis request after analyzing; For example, with reference to shown in the table 1, the user can click " seriously " viewing area in this tables of data, generate the second multidimensional data analysis request, be used for obtaining all elements (i.e. " yang channel " and " yin channel ") the corresponding metric of decent element on the next stage dimension hierarchy of dimension hierarchy of living in, analysis result is with reference to shown in the table 2.
Table 2
Figure BDA00003285908700111
In addition, may exist set membership between the element on the different dimensions level, the dimension hierarchy affiliated as if an element comprises the next stage level and this element exists and another element that logical relation is arranged self at the next stage level, the former is called father's element of the latter, the latter is called the former daughter element, and father's element can comprise a plurality of daughter elements, and a daughter element can only be under the jurisdiction of father's element; With reference to shown in the table 1, "+" sign of " seriously " front is used for expression and concerns according to the logical organization between the dimension hierarchy on the dimension under this element, dimension hierarchy under this element comprises next stage level and this element and comprises the concrete condition that daughter element and existing analysis result do not show this element corresponding daughter element on the next stage level as yet at the next stage level, and the user can carry out drill down operator at this element; Otherwise "-" sign is used for the expression user can carry out the volume operation at this element; In addition, the element front does not have above-mentioned sign to be used for representing that the affiliated dimension hierarchy of this element does not comprise the next stage dimension hierarchy or this element does not comprise daughter element on the next stage dimension hierarchy.
Obtaining analysis result (also namely obtaining the result data collection) afterwards, output analysis result (being the result data collection).The way of output of analysis result can have different modes according to user's demand.For example, can be multiple message stream modes such as tables of data, histogram, pie chart.
The step of carrying out the second multidimensional data analysis request comprises: obtain the multidimensional data corresponding with the second multidimensional data analysis request from second database, and form the result data collection.Concrete step is as follows:
Particularly, the multidimensional data analysis action type comprises volume, brill, section, stripping and slicing and five kinds of operations of rotation down, for example, with reference to shown in the table 2, the user can click " seriously " viewing area in this table to import the first multidimensional data analysis request, be designated "-" owing to " seriously " front, then show the dimension data of having analyzed with respect to decent lower level, then can obtain new metric (as shown in table 1) by the operation of last volume, so from the level still less attainment degree value distribution situation; The specific implementation process that should go up the volume operation is: the data on the row of corresponding tolerance zone, this viewing area gathered, generates after the new metric, carry out the adjustment of tables of data, form and show new analysis result data table, and as shown in table 1; Said process need not the query manipulation in the multi-dimensional database, has improved treatment effeciency; In addition, drill down operator is the inverse process of going up the volume operation.
Further, the user can add the dynasty dimension, obtain more detailed metric by drill down operator, if the one-level level namely on the dynasty dimension comprises " before bright " and " bright back " two elements (wherein, " bright back " element comprises the Ming Dynasty, this divides according to being: the Ming Dynasty is the maturity stage of acupuncture case development in ancient times), can on existing data analysis basis, further obtain the metric distribution situation of " before bright " and " bright back " so, and then from more angle attainment degree value distribution situation, understand in depth " the acupuncture case in ancient times for the treatment of dispute crooked symptom is on decent and eight extra-channel; bright before with bright after acupuncture points on the human body use frequency " situation, obtain interested, abundanter result.
More specifically, the user can also adjust the sequencing of passages through which vital energy circulates dimension data and dynasty dimension data, obtain new metric by rotary manipulation, namely understand " the acupuncture case in ancient times for the treatment of dispute crooked symptom bright before and after bright, the use of the acupuncture points on the human body on decent and eight extra-channel frequency " situation in depth, and then from new side attainment degree value distribution situation.
Step S206: from second database, obtain the result data collection corresponding with the second multidimensional data analysis request.
Particularly, carry out the above-mentioned query statement that generates in the second multidimensional data analysis request that the first multidimensional data analysis request is being converted into, and generate the result data collection.Wherein, at first link with the foundation of second database, afterwards according to the querying condition in the statement, in second database, carry out disposable inquiry or simple statistics, generate the result data collection.
Wherein, the query statement that changes into of the first multidimensional data analysis request may carry out simple tabulate statistics in second database; Be example with the related analysis request of table 1, the center of second database table has been stored " apoplexy " element on the disease layer in the disease dimension, this element is concrete syndrome daughter element on the syndrome layer, " acupuncture points on the human body use frequency " metric of above-mentioned syndrome daughter element " dispute is crooked " daughter element on msq layer, if the query statement of this request of execution, then need in second database, inquire about and gather " the syndrome daughter element that the apoplexy element comprises; and this daughter element comprises the crooked daughter element of dispute " " acupuncture points on the human body use frequency " metric, and then generate the result data collection.
Step S208, output result data collection.Particularly, can receive and show the result data collection in above-described embodiment, be converted into visual information, and it is showed the user by display device.
Fig. 5 is the schematic flow sheet according to the creation method of initial multi-dimensional database embodiment illustrated in fig. 3.According to embodiment shown in Figure 5, concrete steps are as follows:
Step S302 is according to second pre-conditioned explicit data analytic angle and the observation index, to determine dimension and tolerance.
Particularly, for example, acupuncture points on the human body is the core data of acupuncture case, and the case data analysis often need be observed the acupuncture points on the human body operating position, and the acupuncture points on the human body operating position can comprise acupuncture points on the human body frequency of usage and frequency; In addition, the case data analysis can be observed the acupuncture points on the human body operating position from symptom, passages through which vital energy circulates, human body branch, therapy, dynasty equal angles, and then obtains the rule of hiding in the case; Therefore, dimension can be defined as: dynasty, passages through which vital energy circulates, human body branch, therapy and five dimensions of disease, tolerance can be defined as: acupuncture points on the human body frequency of usage and acupuncture points on the human body frequency of utilization, thus can realize the case data analysis of multi-angle according to above-mentioned a plurality of dimension attainment degree values.In addition, because specific acupoint is the acupuncture points on the human body that has the special treatment effect and sort out by specific title, in the clinical practice of the science of acupuncture and moxibustion very important meaning is arranged; Therefore, use frequency and the frequency of utilization of specific acupoint have outstanding value, also should be set at the tolerance of multi-dimensional database.
Step S304 obtains the dimension element on dimension hierarchy and the dimension hierarchy.Particularly, according to the professional knowledge of tcm field, determine the element on dimension hierarchy and the dimension hierarchy.For example, according to tcm field knowledge, the passages through which vital energy circulates dimension can comprise top layer (be passages through which vital energy circulates itself, can be described as whole), one-level level (comprising decent and two elements of eight extra-channel), secondary level (as: the decent element in the upper level level comprises yin channel and two daughter elements of yang channel at this level), tertiary level (as: the yin channel element in the upper level level comprises the Three Yin channels of hand and two daughter elements of Three Yin Channels of Foot at this level), (as: Three Yin channels of hand element in the upper level level comprises meridian of Hand-TAI YIN at this level to the level Four level, three daughter elements of meridian of Hand-SHAO YIN and meridian of Hand-JUE YIN) and the Pyatyi level (as: meridian of Hand-TAI YIN in the upper level level comprises zhongfu point at this level, Yunmen cave, acupuncture points on the human body daughter elements such as cave, the self-sufficient and strategically located region).
According to above-described embodiment as can be known: time to minimum level, the description degree of passages through which vital energy circulates dimension is become more and more thinner by the top layer of passages through which vital energy circulates dimension; Otherwise the description degree becomes more and more thicker.The user can be according to the passages through which vital energy circulates dimension, is analysis path with " whole one a decent yin channel one Three Yin channels of hand is Taiyin channels one zhongfu point on the other hand ", progressively obtains more and more carefully or more and more thicker metric, and then realizes multi-level data analysis.In addition, exist notional logical organization relation between the dimension hierarchy, namely an element from the angle upper level dimension hierarchy of concept may comprise the one or more daughter elements on the next stage dimension hierarchy, otherwise, an element on the next stage dimension hierarchy is under the jurisdiction of the some father's elements on the upper level dimension hierarchy, as: conceptive, the yang channel element comprises the three Yang channels of hand and two elements of the three yang channels of foots, otherwise three Yang channels of hand element is under the jurisdiction of the yang channel element.
Step S306 uses first pre-conditionedly dimension element is carried out encode to handle, and obtains the dimension element code data.
Step S308 generates dimension table according to dimension element code data and dimension element.
Step S310 is according to dimension element code data and metric data generation center table.Particularly, can form the sky table of a center table, also can in the table of center, store some known complete dimension data and metrics.
Step S312 saves as multi-dimensional database with dimension table and center table.
Particularly, after execution in step S312, can also be with the data importing multi-dimensional database in the raw data base, and follow-up can the execution is kept at operation in the multi-dimensional database with second database.
Need to prove, can in the computer system such as one group of computer executable instructions, carry out in the step shown in the process flow diagram of accompanying drawing, and, though there is shown logical order in flow process, but in some cases, can carry out step shown or that describe with the order that is different from herein.
From above description, as can be seen, the present invention has realized following technique effect: at first obtain dimension data, from first database, extract first data acquisition corresponding with dimension data then, and first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect and data are unordered, thereby cause using the result of existing case data base querying data inaccurate, inefficiency, and can't realize the more problem of complex query, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Obviously, those skilled in the art should be understood that, above-mentioned each module of the present invention or each step can realize with the general calculation device, they can concentrate on the single calculation element, perhaps be distributed on the network that a plurality of calculation elements form, alternatively, they can be realized with the executable program code of calculation element, thereby, they can be stored in the memory storage and be carried out by calculation element, perhaps they are made into each integrated circuit modules respectively, perhaps a plurality of modules in them or step are made into the single integrated circuit module and realize.Like this, the present invention is not restricted to any specific hardware and software combination.
The above is the preferred embodiments of the present invention only, is not limited to the present invention, and for a person skilled in the art, the present invention can have various changes and variation.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (11)

1. a data processing method is characterized in that, comprising:
Obtain dimension data;
Extract first data acquisition corresponding with described dimension data from first database, wherein, described first data acquisition comprises first data;
Described first data are assembled calculate metric;
First incidence relation of setting up between described metric and the described dimension data generates second database.
2. according to the described data processing method of claim l, it is characterized in that, described first data assembled the step that calculates metric comprise:
From the standard vocabulary storehouse, read standard vocabulary;
According to described standard vocabulary described first data are carried out standardization and be converted to second data;
Described second data are assembled calculate metric.
3. according to the described data processing method of claim l, it is characterized in that extract first data acquisition corresponding with described dimension data from first database after, described data processing method comprises:
Whether the integrity degree that detects described first data meets first threshold, does not meet under the situation of described first threshold at the integrity degree of described first data, described first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Or
Detect the 3rd data that whether exist in described first data acquisition with described first data consistent, in described first data acquisition, exist under the situation of described the 3rd data, delete described the 3rd data, obtain first data acquisition after the described cleaning.
4. data processing method according to claim 3 is characterized in that, the step of handling first data acquisition after obtaining clearing up is deleted or augmented to described first data comprise:
From described dimension data, extract data of description;
Use described data of description that described first data are augmented processing, so that the integrity degree of described first data meets described first threshold, obtain first data acquisition after the described cleaning.
5. data processing method according to claim 1, it is characterized in that, described dimension data comprises: second incidence relation between dimension element data, dimension element code data and described dimension element data and the described dimension element code data, wherein, setting up the step that first incidence relation between described metric and the described dimension data generates second database comprises:
Set up the 3rd incidence relation between described metric and the described dimension element code data;
Set up described first incidence relation by described metric and described second incidence relation;
Preserve described dimension data, described metric, described first incidence relation and described the 3rd incidence relation, to generate described second database.
6. according to the described data processing method of claim l, it is characterized in that setting up after first incidence relation between described metric and the described dimension data generates second database, described data processing method also comprises:
Receive user's the first multidimensional data analysis request;
The described first multidimensional data analysis request is converted into the second multidimensional data analysis request;
From described second database, obtain the result data collection corresponding with the described second multidimensional data analysis request;
Export described result data collection.
7. a data processing equipment is characterized in that, comprising:
Acquisition module is used for obtaining dimension data;
First extraction module is used for extracting first data acquisition corresponding with described dimension data from first database;
Computing module calculates metric for first data of described first data acquisition are assembled;
First processing module, first incidence relation that is used for setting up between described metric and the described dimension data generates second database.
8. data processing equipment according to claim 7 is characterized in that, described computing module comprises:
First read module is used for reading standard vocabulary from the standard vocabulary storehouse;
Second processing module is used for according to described standard vocabulary described first data being carried out standardization and is converted to second data;
The first sub-computing module calculates metric for described second data are assembled.
9. data processing equipment according to claim 7 is characterized in that, described data processing equipment also comprises:
Whether first detection module meets first threshold for detection of the integrity degree of described first data;
The 3rd processing module is used for not meeting under the situation of described first threshold at the integrity degree of described first data, described first data is deleted or augmented first data acquisition of handling after obtaining clearing up;
Second detection module is for detection of the 3rd data that whether exist in described first data acquisition with described first data consistent;
The manages module everywhere, is used for existing under the situation of described the 3rd data at described first data acquisition, deletes described the 3rd data, obtains first data acquisition after the described cleaning.
10. data processing equipment according to claim 9 is characterized in that, described the 3rd processing module comprises:
Second extraction module is used for extracting data of description from described dimension data;
The 5th processing module be used for to use described data of description that described first data are augmented processing, so that the integrity degree of described first data meets described first threshold, obtains first data acquisition after the described cleaning.
11. data processing equipment according to claim 7, it is characterized in that, described dimension data comprises: second incidence relation between dimension element data, dimension element code data and described dimension element data and the described dimension element code data, wherein, described first processing module comprises:
The first sub-processing module is used for setting up the 3rd incidence relation between described metric and the described dimension element code data;
The second sub-processing module is used for setting up described first incidence relation by described metric and described second incidence relation;
The 3rd sub-processing module is used for preserving described dimension data, described metric, described first incidence relation and described the 3rd incidence relation, to generate described second database.
CN201310213824.2A 2013-05-31 2013-05-31 Method and device for processing data Expired - Fee Related CN103258054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310213824.2A CN103258054B (en) 2013-05-31 2013-05-31 Method and device for processing data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310213824.2A CN103258054B (en) 2013-05-31 2013-05-31 Method and device for processing data

Publications (2)

Publication Number Publication Date
CN103258054A true CN103258054A (en) 2013-08-21
CN103258054B CN103258054B (en) 2017-01-25

Family

ID=48961971

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310213824.2A Expired - Fee Related CN103258054B (en) 2013-05-31 2013-05-31 Method and device for processing data

Country Status (1)

Country Link
CN (1) CN103258054B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440333A (en) * 2013-09-05 2013-12-11 深圳好视网络科技有限公司 Data processing method and device
CN103870712A (en) * 2014-03-31 2014-06-18 张东升 Traditional Chinese medicine knowledge data intelligent response method and device
CN104166738A (en) * 2014-09-04 2014-11-26 北京国双科技有限公司 Database query processing method and device
CN104182546A (en) * 2014-09-09 2014-12-03 北京国双科技有限公司 Method and device for querying data in databases
CN105224534A (en) * 2014-05-29 2016-01-06 腾讯科技(深圳)有限公司 A kind of method and device of asking response
WO2018033052A1 (en) * 2016-08-18 2018-02-22 腾讯科技(深圳)有限公司 Method and system for evaluating user portrait data
CN107861996A (en) * 2017-10-16 2018-03-30 中国医学科学院医学信息研究所 A kind of medicine evidence-based database Diagrams automatic creation system
CN116991901A (en) * 2023-09-25 2023-11-03 深圳市秦丝科技有限公司 Data control system and method based on multidimensional database query

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599088A (en) * 2008-11-18 2009-12-09 北京美智医疗科技有限公司 The mining multi-dimensional data system and method for medical information system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101599088A (en) * 2008-11-18 2009-12-09 北京美智医疗科技有限公司 The mining multi-dimensional data system and method for medical information system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
李丹等: "基于数据库技术的古代中医治疗数据采集方法的研究", 《中医药学报》 *
王勋: "《数据库***原理》", 30 November 2010 *
鲁均云: "重复和不完整数据的清理方法研究及应用", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440333A (en) * 2013-09-05 2013-12-11 深圳好视网络科技有限公司 Data processing method and device
CN103870712A (en) * 2014-03-31 2014-06-18 张东升 Traditional Chinese medicine knowledge data intelligent response method and device
CN105224534A (en) * 2014-05-29 2016-01-06 腾讯科技(深圳)有限公司 A kind of method and device of asking response
CN104166738A (en) * 2014-09-04 2014-11-26 北京国双科技有限公司 Database query processing method and device
CN104182546A (en) * 2014-09-09 2014-12-03 北京国双科技有限公司 Method and device for querying data in databases
CN104182546B (en) * 2014-09-09 2017-10-27 北京国双科技有限公司 The data query method and device of database
WO2018033052A1 (en) * 2016-08-18 2018-02-22 腾讯科技(深圳)有限公司 Method and system for evaluating user portrait data
CN107767155A (en) * 2016-08-18 2018-03-06 腾讯科技(深圳)有限公司 A kind of method and system for assessing user's representation data
US10915540B2 (en) 2016-08-18 2021-02-09 Tencent Technology (Shenzhen) Company Limited Method and system for evaluating user persona data
CN107767155B (en) * 2016-08-18 2021-06-18 腾讯科技(深圳)有限公司 Method and system for evaluating user portrait data
CN107861996A (en) * 2017-10-16 2018-03-30 中国医学科学院医学信息研究所 A kind of medicine evidence-based database Diagrams automatic creation system
CN116991901A (en) * 2023-09-25 2023-11-03 深圳市秦丝科技有限公司 Data control system and method based on multidimensional database query

Also Published As

Publication number Publication date
CN103258054B (en) 2017-01-25

Similar Documents

Publication Publication Date Title
CN103258054A (en) Method and device for processing data
CN109410185B (en) A kind of image partition method, device and storage medium
CN108717441B (en) Method and device for determining predicates corresponding to problem templates
CN104834826B (en) Established and optimization method and system based on the clinical path of data mining and graph-theory techniques
CN104572675A (en) Similar medical history searching system and method
WO2021151295A1 (en) Method, apparatus, computer device, and medium for determining patient treatment plan
CN106933859B (en) Medical data migration method and device
CN115497631A (en) Clinical scientific research big data analysis system
Haynes et al. High performance analysis of big spatial data
CN107193996B (en) Similar medical record matching and retrieving system
CN104331469A (en) Multi-disease image feature library system for medical association
CN106126943B (en) The state of an illness acquisition method derived based on association symptom
CN108647340A (en) A kind of multidimensional data real-time analysis method based on dynamic crosstab
CN110827990B (en) Typhoid fever syndrome differentiation reasoning system based on knowledge graph
Nguyen et al. Insights into pharmacotherapy management for Parkinson's disease patients using wearables activity data
CN110010231A (en) A kind of data processing system and computer readable storage medium
CN113223725A (en) Medical data application visualization page display system and method
Li et al. Big data is essential for further development of integrative medicine
CN110364265A (en) A kind of data value generation and implementation method based on health data bank
CN107908660B (en) Data division and organization method for data open sharing
CN112700832A (en) Personalized electronic case generation method and system
CN113032476A (en) Method for acquiring and monitoring dynamic data based on visualization
Bokhari et al. A framework for clustering dental patients' records using unsupervised learning techniques
CN110853745A (en) Skin disease patient standardization system
CN115995301A (en) Data recovery method and system based on artificial intelligence

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Yan Chaosheng

Inventor after: Li Dan

Inventor after: Zheng Nan

Inventor after: Sun Zhongren

Inventor after: Yan Chaoqi

Inventor after: Jiang Deyou

Inventor before: Yan Chaosheng

Inventor before: Li Dan

Inventor before: Sun Zhongren

Inventor before: Yan Chaoqi

Inventor before: Jiang Deyou

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: YAN CHAOSHENG LI DAN SUN ZHONGREN YAN CHAOQI JIANG DEYOU TO: YAN CHAOSHENG LI DAN ZHENG NAN SUN ZHONGREN YAN CHAOQI JIANG DEYOU

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170125

Termination date: 20180531