Embodiment
Need to prove that under the situation of not conflicting, embodiment and the feature among the embodiment among the application can make up mutually.Describe the present invention below with reference to the accompanying drawings and in conjunction with the embodiments in detail.
Figure l is the structural representation according to the data processing equipment of the embodiment of the invention.Fig. 2 is the structural representation according to data processing equipment embodiment illustrated in fig. 1.
As depicted in figs. 1 and 2, this device can comprise: acquisition module 10 is used for obtaining dimension data; First extraction module 30 is used for extracting first data acquisition corresponding with dimension data from first database; Computing module 50 calculates metric for first data of first data acquisition are assembled; First processing module 70, first incidence relation that is used for setting up between metric and the dimension data generates second database.
Adopt the present invention, obtain dimension data by acquisition module, from first database, extract first data acquisition corresponding with dimension data by first extraction module then, and computing module first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data by first processing module generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect, thereby cause using the result of existing case data base querying data inaccurate, inefficiency, and can't realize the more problem of complex query, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Particularly, the scheme in the above embodiment of the present invention is applicable to different data base management system (DBMS)s and different operating system.
According to the abovementioned embodiments of the present invention, computing module 50 can comprise: the first read module 5l is used for reading standard vocabulary from the standard vocabulary storehouse; Second processing module 53 is used for according to standard vocabulary first data being carried out standardization and is converted to second data; The first sub-computing module 55 calculates metric for second data are assembled.
Particularly, can carry out standardization to first data by the standard vocabulary that second processing module 53 reads according to first read module 51 and be converted to second data.For example, various ways such as the title of dynasty dimension data may life period, publication time, such as being: different forms such as clear, Qing Dynasty, the Qing Dynasty need according to standard vocabulary so with its unified value form that is treated to " Qing Dynasty ".
In the above embodiment of the present invention, device can also comprise: whether first detection module 90 meets first threshold for detection of the integrity degree of first data; The 3rd processing module 110 is used for not meeting under the situation of first threshold at the integrity degree of first data, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Second detection module 130 is for detection of second data that whether exist in first data acquisition with first data consistent; The manages module 150 everywhere, is used for deleting second data, first data acquisition after obtaining clearing up under there is situation with second data of first data consistent in first data acquisition.
According to the abovementioned embodiments of the present invention, the 3rd processing module 110 can comprise: second extraction module is used for extracting data of description from dimension data; The 5th processing module is used for using data of description that first data are augmented processing, so that the integrity degree of first data meets first threshold, and first data acquisition after obtaining clearing up.
Particularly, in this data processing equipment, the 3rd processing module 110 can also comprise the data removing module, is used for data value mistake and loss situation according to described first data, carries out data dump; As: the integrity degree that detects first data does not meet first threshold, thinks that then these first data lack the main disease to be treated data of case, then delete this first data by the data removing module; Second detection module 130 and the is managed module 150 everywhere and can be arranged on data and merge in the module, particularly, second detection module 130 can also detect the situation that whether has a plurality of row storage identical datas in first data acquisition, if exist then a plurality of row merged, particularly, adopt and delete unnecessary row or the data that the number pick in the different row is modified in the row are integrated processing, obtain first data acquisition after the cleaning, as: the acupuncture points on the human body data are stored in respectively in master point and adjunct acupuncture points two row, the data of two row can be merged so.
In the above embodiment of the present invention, dimension data can comprise: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, first processing module comprises: the first sub-processing module is used for setting up the 3rd incidence relation between metric and the dimension element code data; The second sub-processing module is used for setting up first incidence relation by metric and second incidence relation; The 3rd sub-processing module is used for preserving dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
In the above embodiment of the present invention, this device can also comprise: receiver module 170 is used for receiving user's the first multidimensional data analysis request; First modular converter 190 is used for the first multidimensional data analysis request is converted into the second multidimensional data analysis request; The 6th processing module 210 is used for obtaining the result data collection corresponding with the second multidimensional data analysis request from second database; And output module 230 is used for output result data collection.
Particularly, after the first multidimensional data analysis request that receives the user by receiver module 170, first modular converter 190 of system can ask to be converted to the second multidimensional data analysis request according to the first multidimensional data analysis request, wherein, ask conversion to comprise according to the first multidimensional data analysis request request of data classification and generated query statement or direct generated query statement are carried out in the first multidimensional data analysis request, from second database, obtain the result data collection corresponding with the second multidimensional data analysis request by the 6th processing module 210 then, then by output module 230 these result data collection of output.Wherein, output module 230 can show this result data collection according to the mode that user in the second multidimensional data analysis request asks to show the result data collection, wherein, can the result data collection be converted to visualized data by second modular converter, and by display module the result data collection be shown.Wherein, receiver module 170 and output module 230 can be arranged on the front-end server of system; The mode that shows the result data collection comprises: tables of data display format and figure display format.
Fig. 3 is the process flow diagram according to the data processing method of the embodiment of the invention, and this method comprises the steps: as shown in Figure 3
Step s102 obtains dimension data.
Step s104 extracts first data acquisition corresponding with dimension data from first database, wherein, first data acquisition comprises first data.
Step S106 assembles to calculate metric to first data.
Step s108, first incidence relation of setting up between metric and the dimension data generates second database.
At first obtain dimension data, from first database, extract first data acquisition corresponding with dimension data then, and first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect, thereby cause using the result of existing case data base querying data inaccurate, and can't obtain the problem of visual and understandable analysis result fast from existing database, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Wherein, the above embodiment of the present invention can be applied to the processing of ancient Chinese medicine case data.Store the incidence relation between field of medicaments target standard vocabulary and the standard vocabulary in first database; Assemble being calculated as the statistical calculation that first data that get access to are carried out, wherein statistical calculation comprises the calculating of first data being sued for peace, average and asked counting; Metric comprises the observation index value according to a plurality of data analysis angles.Said counting just refers to the calculating of metering number, for example, Baihui acupoint occurs and once just counts a number, and the counting that twice pair of Baihui acupoint occur is 2, occurs three times and is 3.
Dimension data can be kept in the dimension table in the multi-dimensional database, and multi-dimensional database can be preserved one group of dimension table and second database, can preserve dimension data in each dimension table.Wherein, dimension is the angle of data analysis, dimension hierarchy is the description of the logical organization relation on the dimension, the dimension element data are for being used for the concrete data on the storage dimension hierarchy, the dimension element code data can be one group of character code of unique identification dimension element data, can also preserve the data of description of the logical organization relation on the dimension in the dimension table, and the logical organization relation has reflected the division of the different level of detail on the dimension.
More specifically, can also comprise first database in the multi-dimensional database, first data can comprise the data of description of data analysis angle, and also preserve the 5th incidence relation between first data and the dimension element data in first database, can from first database, read first data acquisition corresponding with dimension data by the 5th incidence relation.
In the above embodiment of the present invention, at first owing to storing the data of description of data analysis angle in the dimension table, storing the observation index value according to a plurality of data analysis angles in second database, wherein the data of storing in second database also can the use center preservation form of table be stored in the multi-dimensional database, set up contacting between center table and the dimension table according to first incidence relation and the 3rd incidence relation, make up ancient Chinese medicine case multi-dimensional database, for multidimensional data analysis provides data source science, reliable, that accurately be suitable for.
In the above-described embodiments, by first incidence relation and the 3rd incidence relation, set up contacting between second database and the dimension table, so that center table and dimension table are linked, be convenient to multidimensional data analysis result's inquiry and demonstration.For example, by the dimension element data in the dimension table and dimension element code data, a plurality of dimension data of description in the first multidimensional data analysis request that gets access to are converted to the element code combination that corresponding dimension element code data is formed, and utilize this contact from the table of center, to obtain element code and make up corresponding metric; Otherwise, utilize this contact, the corresponding element code of metric in the table of center is made up the data of description that is converted to corresponding a plurality of dimensions, and then the concrete implication of level of understanding value.
In the above-described embodiments, the method for execution in step s106 can comprise:
(1) according to the element combinations on the minimum dimension hierarchy between all dimensions, utilize the dimension data in the dimension table that first data in first data acquisition are assembled calculating and generated second data acquisition, as the foundation passages through which vital energy circulates, human body branch, disease, therapy, Qucha cave on the minimum dimension hierarchy between dynasty five dimensions, before/the side Head Section, hemiplegia, acupuncture manipulation, the dimension data of the element combinations of the Ming Dynasty, first data in first data acquisition are carried out the concrete statistics that acupuncture points on the human body uses frequency, and generate shape as (Qucha cave element code, before/side Head Section element code, the hemiplegia element code, the acupuncture manipulation element code, Ming Dynasty's element code, acupuncture points on the human body use frequency) second data acquisition;
Owing to may there be a plurality of elements of the same name and these elements corresponding different father's element on the upper level level on a certain dimension hierarchy of some specific dimension; For example, professional knowledge according to tcm field, different syndromes may comprise same symptoms, be to have a plurality of elements of the same name and these elements corresponding different father's element on the syndrome level on the symptom level of disease dimension, as: according to embodiments of the invention, the hemiplegia symptom can be under the jurisdiction of a plurality of syndromes; At above-mentioned situation, pattern design dimension element code according to the first pre-conditioned utilization " segmentation, succession type ", namely by father's element code section of a hierarchical elements code, can realize the differentiation of element of the same name, for example, utilize father's element (being the element on the syndrome level) of the code of hemiplegia element to carry out the differentiation of the symptom of the same name of different syndromes, and then realize assembling and calculate; In addition, use frequency and frequency of utilization statistics at specific acupoint, acupuncture points on the human body in can the establishing criteria lexicon and " affiliated specific acupoint " contact attribute between the specific acupoint, in the statistic processes of the use frequency of acupuncture points on the human body and frequency of utilization, synchronously finish the use frequency of specific acupoint and the statistics of frequency of utilization.
Wherein, first pre-conditioned can the setting in conjunction with ordinal relation between the different elements in the same level according to the tcm field standard vocabulary in the relation of the logical organization between the dimension hierarchy and the traditional Chinese medical science standard vocabulary storehouse.(there is father's element in even some elements to " segmentation, the succession type " pattern of utilization, then the coding of this element is made up of father's element coding and self coding of this element, otherwise, this element will be by absolute coding), element on the dimension hierarchy that dimension is contained is encoded, above-mentioned coding mode is convenient to contact between the element on label layer minor element and the upper level level, improves the data analysis treatment effeciency; As: at the passages through which vital energy circulates dimension, the element (namely all) on the top layer time be encoded to JM; Two elements on the one-level level (i.e. decent and eight extra-channel) are encoded to respectively: JMO and JMl, and above-mentioned coding is formed by two sections, i.e. father's element encoded radio JM of element and the respective element serial number (0 and 1) on same level; On the secondary level, seriously to be example, its two daughter elements (being yang channel and yin channel) are encoded to respectively: JM00 and JM01, and above-mentioned coding is made up of decent father's element encoded radio JMO and the serial number (0 and 1) of respective element on same level of element; In like manner, on tertiary level, be example with the yang channel, its two daughter elements (three Yang channels of hand and the three yang channels of foots) are encoded to respectively: JM000 and JM001; On the level Four level, be example with the three Yang channels of hand, its three daughter element (large intesting channel of Hand-Yangmings, the Tri-Jiao channels of Hand-Shaoyang and small intestine meridian) be encoded to respectively: JMOOOLI, JMOOOSJ and JM000SI, above-mentioned coding is made up of decent father's element encoded radio JM000 of element and the standard code of respective element, wherein, standard code is according to the classical works (as: " science of acupuncture and moxibustion " of science of acupuncture and moxibustion standard vocabulary with reference to China authority standard (standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ") and science of acupuncture and moxibustion field, " science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook etc.); On the Pyatyi level, be example with the large intesting channel of Hand-Yangming, its daughter element (as: shangyang acupoint, two ask cave, three ask cave etc.) is encoded to respectively: JMOOOLIl, JMOOOLl2, JMOOOLl3 etc., above-mentioned coding by decent father's element encoded radio JM000LI of element and and the serial number of respective element on same level form; Wherein, serial number according to science of acupuncture and moxibustion standard vocabulary with reference to the classical works in China authority standard (standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ") and science of acupuncture and moxibustion field (as: " science of acupuncture and moxibustion ", "=the science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook etc.)
(2) according to the element combinations on the higher dimension hierarchy between the dimension and the set membership between the element on the different dimensions level, existing data in the table of center are assembled calculating and generated the 3rd data acquisition.As: according to the element combinations of the urinary bladder channel of foot-Taiyang between passages through which vital energy circulates, human body branch, disease, therapy, dynasty five dimensions, head, hemiplegia, acupuncture manipulation, the Ming Dynasty, and according to the daughter element situation of urinary bladder channel of foot-Taiyang element and header elements, data in the center table are carried out the statistics that acupuncture points on the human body uses frequency, and generate shape as the combined data of (urinary bladder channel of foot-Taiyang element code, header elements code, hemiplegia element code, acupuncture manipulation element code, Ming Dynasty's element code, acupuncture points on the human body use frequency).
(3) with data importing to the second database in second data acquisition and the 3rd data acquisition.
In the above-described embodiments, be one group of dimension determining according to the actual needs of ancient Chinese medicine case data analysis, thereby guaranteed the multi-angle of data analysis; And according to the professional knowledge of tcm field, determine the element on dimension hierarchy and the dimension hierarchy, guaranteed the multi-stratification of data analysis; According to the multidimensional data analysis action type, carry out the data analysis of various ways, realize many sides property of data analysis.
After getting access to second database, multi-dimensional database is created and is finished, owing to not only storing first database that gathers in advance in the multi-dimensional database, and storing second database, therefore, in multi-dimensional database, only need in multi-dimensional database, carry out disposable visit or simple statistics during data query, just can obtain the multidimensional data analysis result, avoided the complexity on a large amount of raw data of a plurality of raw data bases, repeat, lot of data query manipulation and statistical operation, shortened system response time, improve the data analysis work efficiency, possessed fast data analyzing and processing ability.
Before from multi-dimensional database, reading dimension data, from raw data base, read raw data, determine dimension and tolerance according to raw data then, and according to the professional knowledge of tcm field, determine dimension element data and dimension hierarchy data, can obtain the dimension element code data according to the first pre-conditioned unique identification of determining each dimension element data afterwards.Wherein, raw data base also can be kept in the multi-dimensional database.
Wherein, data in dimension data and first database all can be kept in the raw data base, raw data in the raw data base can comprise the 4th incidence relation between first basic data, second basic data and each basic data of ancient Chinese medicine case, as, can comprise first basic datas such as symptom data, acupuncture points on the human body data and thorn moxibustion method data of ancient Chinese medicine case in the raw data base, second basic data can comprise data such as case title, case type, disease, author, source, dynasty.
According to the abovementioned embodiments of the present invention, first data being assembled the step that calculates metric can comprise: read standard vocabulary from the standard vocabulary storehouse; According to standard vocabulary first data are carried out standardization and be converted to second data; Second data are assembled calculate metric.
Store the standard vocabulary of tcm field in the standard vocabulary storehouse in above-described embodiment, particularly, standard vocabulary can comprise Basic Theories of Chinese Medicine standard vocabulary, traditional Chinese medical science disease standard vocabulary, science of acupuncture and moxibustion standard vocabulary etc.Wherein, the Basic Theories of Chinese Medicine standard vocabulary is with reference to China authority standard (as: standard No. be GB/T20348-2006's " Basic Theories of Chinese Medicine term ") and relevant classical works (as: " Basic Theories of Chinese Medicine "), traditional Chinese medical science disease standard vocabulary with reference to China authority standard (as: standard No. be GB/T14396-1993's " classification of diseases and code ", standard No. be GB/T15657-1995's " traditional Chinese medical science disease classification and code ", standard No. be GB/T16751.1-1997's " tcm clinical practice diagnosis and treatment term disease part ", standard No. be GB/T16751.2-1997's " tcm clinical practice diagnosis and treatment term syndrome part ", standard No. be GB/T16751.3-1997's " tcm clinical practice diagnosis and treatment term therapy part " etc.) and the classical works (as: " traditional Chinese medical science voluminous dictionary " of China's tcm field, " Diagnostics of Chinese Medicine ", " Chinese Internal Medicine " etc.), science of acupuncture and moxibustion standard vocabulary is with reference to the World Health Organization (WHO) and China authority standard (as: " acupoint international standard book number " of World Health Organization's issue, standard No. be GB/T12346-2006's " acupuncture points on the human body title and location ", standard No. be 6812346-90's " through the position, cave " etc.) and the classical works (as: " science of acupuncture and moxibustion " in science of acupuncture and moxibustion field, " science of channels and collaterals ", " science of acupuncture points ", " meridional point ", " acupuncture cave name international standard handbook, " the logical mirror in Chinese acupuncture and moxibustion acupuncture point " etc.).In addition, the standard vocabulary storehouse also is used for the contact attribute between the storage standards vocabulary.For example, owing to exist between acupuncture points on the human body and the passages through which vital energy circulates and be subordinate to and contact (i.e. acupuncture points on the human body belong to a passages through which vital energy circulates and a passages through which vital energy circulates comprises a plurality of acupuncture points on the human bodys), so " affiliated passages through which vital energy circulates " between the acupuncture points on the human body of preserving in the standard vocabulary storehouse and the passages through which vital energy circulates is used for representing the contact attribute between acupuncture points on the human body standard vocabulary and the passages through which vital energy circulates standard vocabulary.The standard vocabulary storehouse not only provides foundation for obtaining of raw data, and when first data being augmented processing, the contact attribute between the standard vocabulary of storing in the standard vocabulary storehouse can also be as augmenting foundation; When first data being carried out the standardization conversion, the contact attribute between the standard vocabulary of storing in the standard vocabulary storehouse can also be as the conversion foundation of standardization conversion.
Particularly, the standard vocabulary in the establishing criteria lexicon and the incidence relation between the standard vocabulary carry out data-switching to the non-standardization data in first data, obtain second data.For example, may there be the form of another name in the data value of acupuncture points on the human body in first data, and this just need be converted to standard name with another name according to the standard vocabulary of acupuncture points on the human body title.
In the above embodiment of the present invention, after from first database, extracting first data acquisition corresponding with dimension data, method can comprise: whether the integrity degree that detects first data meets first threshold, integrity degree in first data does not meet under the situation of first threshold, first data is deleted or augmented first data acquisition of handling after obtaining clearing up; Or detect second data that whether exist in first data acquisition with first data consistent, in first data acquisition, under the situation of second data of existence and first data consistent, delete second data, first data acquisition after obtaining clearing up.
According to the abovementioned embodiments of the present invention, the step of handling first data acquisition after obtaining clearing up is deleted or augmented to first data can comprise: extract data of description from dimension data; Use data of description that first data are augmented processing, so that the integrity degree of first data meets first threshold, first data acquisition after obtaining clearing up.
Particularly, in the above embodiment of the present invention, to the detection of first data in first data acquisition, that is, and the Data Detection of whether being undertaken by a plurality of row storage conditions, data rows design and data value setting situation according to the data value mistake in first data and loss situation, data value.Specific to traditional Chinese medical science case field, integrity degree in first data does not meet under the situation of first threshold, whether the integrity degree that detects first data is lower than second threshold value, integrity degree in first data is not less than under the situation of second threshold value, according to the contact attribute between the dimension element data that lack in first data and the standard vocabulary first data are carried out data and augment, first data acquisition after obtaining clearing up; Integrity degree in the first number pick is lower than under the situation of second threshold value, deletes first data, first data acquisition after obtaining clearing up.
In the above-described embodiments, lack of standardization in view of existing data value in the raw data base, data value mistake and losing, whether data value is by a plurality of row storages, many situations such as data rows design and data value setting, and then influence data statistics result's accuracy, also may exist the situation that lacks dimension data in the raw data source, and then influence the feasibility of multidimensional data analysis, the above embodiment of the present invention is deleted first data, augment processing first data acquisition is cleared up first data acquisition after obtaining clearing up, thereby avoided above-mentioned situation, thereby guaranteed the feasible degree of multidimensional data analysis, promoted data analysis result's accuracy.
In the above embodiment of the present invention, dimension data can comprise: second incidence relation between dimension element data, dimension element code data and dimension element data and the dimension element code data, wherein, setting up the step that first incidence relation between metric and the dimension data generates second database can comprise: set up the 3rd incidence relation between metric and the dimension element code data; Set up first incidence relation by metric and second incidence relation; Preserve dimension data, metric, first incidence relation and the 3rd incidence relation, to generate second database.
In the above embodiment of the present invention, to set up after first incidence relation between metric and the dimension data generates second database, said method can also comprise: receive user's the first multidimensional data analysis request; The first multidimensional data analysis request is converted into the second multidimensional data analysis request; From second database, obtain the result data collection corresponding with the second multidimensional data analysis request; Output result data collection.
Particularly, after the first multidimensional data analysis request that receives the user by receiver module 170, first modular converter 190 of system can ask to be converted to the second multidimensional data analysis request according to the first multidimensional data analysis request, wherein, ask conversion to comprise according to the first multidimensional data analysis request request of data classification and generated query statement or direct generated query statement are carried out in the first multidimensional data analysis request, from second database, obtain the result data collection corresponding with the second multidimensional data analysis request by the 6th processing module 210 then, then by this number of results pick collection of output module 230 outputs.Wherein, output module 230 can show this result data collection according to the mode that user in the second multidimensional data analysis request asks to show the result data collection, wherein, can the result data collection be converted to visualized data by second modular converter, and by display module the result data collection be shown.Wherein, receiver module 170 and output module 230 can be arranged on the front-end server of system; The mode that shows the result data collection comprises: tables of data display format and figure display format.
As shown in Figure 4, execution in step S202: receive user's the first multidimensional data analysis request.Particularly, the user can import the first multidimensional data analysis request by input equipment, and wherein, the first multidimensional data analysis request can comprise element and the tolerance on dimension, dimension order, dimension hierarchy, the dimension hierarchy.For example, analysis request for " acupuncture points on the human body of acupuncture case in ancient times on decent and eight extra-channel of the crooked symptom of dispute for the treatment of apoplexy uses frequency ", the user can carry out disease dimension and passages through which vital energy circulates dimension at the input end of system, " disease → passages through which vital energy circulates " dimension order, " dispute is crooked " element on all elements on " apoplexy " element → syndrome layer on the disease layer in the disease dimension (being all syndromes of apoplexy) → msq layer, the selection operation of " seriously " on the secondary level in the passages through which vital energy circulates dimension and " eight extra-channel " element and " acupuncture points on the human body use frequency " tolerance, and then generating the first multidimensional data analysis request that machine can be identified, analysis result is with reference to shown in the table 1.
Table 1
After receiving the first multidimensional data analysis request, can execution in step S204: by first modular converter 190 the first multidimensional data analysis request is converted into the second multidimensional data analysis request.Particularly, if the first multidimensional data analysis request is based on the existing formed further analysis request of analysis result, then according to the multidimensional data analysis action type the first multidimensional data analysis request is classified and the generated query statement, otherwise direct generated query statement; According to the multidimensional data analysis action type the first multidimensional data analysis request is classified and the generated query statement effectively, simplified the complexity of data analysis request, shorten the execution time of data analysis request, and then improved the treatment effeciency of data analysis.
Raw data in the raw data base in the above-described embodiments can real-time update, can guarantee so can not obtain successively different results based on the analysis operation of the same data in the raw data base, thereby guarantee data analysis result's consistance.In addition, the invention provides Analysis Service and foreground service technology scheme, can generate data analysis request and information display requirement easily, after receiving the first multidimensional data analysis request, carry out flexibly data manipulation intuitively by 190 pairs of multidimensional data analysis requests of first modular converter and existing analysis result, and generate further data analysis request, reduced the complexity of the first multidimensional data analysis request generative process, realize polytype data analysis operation, obtain more interested analysis result, and with visual and understandable form demonstration object information, and then improved the interactivity of data analysis.
In the above-described embodiments, according to the multidimensional data analysis action type the first multidimensional data analysis request is classified effectively and the process of generated query statement in, the multidimensional data analysis action type comprises: bore down, last volume is cut into slices stripping and slicing and rotation; Following brill obtains detailed metric for increasing number of dimensions or changing higher level into lower level according to the relation of the logical organization between the dimension hierarchy on the specific dimension; Last volume obtains rough metric for reducing number of dimensions or changing lower level into higher level according to the relation of the logical organization between the dimension hierarchy on the specific dimension; Section is obtained the metric on other dimensions under the concrete element restriction on one or more dimensions; Stripping and slicing is under the interval restriction of the element on one or more dimensions, obtains the metric on other dimensions; Rotate to be the ordering that changes between the dimension, obtain new metric.
Particularly, first modular converter 190 can carry out deep analysis to existing analysis result, further generates the second multidimensional data analysis request after analyzing; For example, with reference to shown in the table 1, the user can click " seriously " viewing area in this tables of data, generate the second multidimensional data analysis request, be used for obtaining all elements (i.e. " yang channel " and " yin channel ") the corresponding metric of decent element on the next stage dimension hierarchy of dimension hierarchy of living in, analysis result is with reference to shown in the table 2.
Table 2
In addition, may exist set membership between the element on the different dimensions level, the dimension hierarchy affiliated as if an element comprises the next stage level and this element exists and another element that logical relation is arranged self at the next stage level, the former is called father's element of the latter, the latter is called the former daughter element, and father's element can comprise a plurality of daughter elements, and a daughter element can only be under the jurisdiction of father's element; With reference to shown in the table 1, "+" sign of " seriously " front is used for expression and concerns according to the logical organization between the dimension hierarchy on the dimension under this element, dimension hierarchy under this element comprises next stage level and this element and comprises the concrete condition that daughter element and existing analysis result do not show this element corresponding daughter element on the next stage level as yet at the next stage level, and the user can carry out drill down operator at this element; Otherwise "-" sign is used for the expression user can carry out the volume operation at this element; In addition, the element front does not have above-mentioned sign to be used for representing that the affiliated dimension hierarchy of this element does not comprise the next stage dimension hierarchy or this element does not comprise daughter element on the next stage dimension hierarchy.
Obtaining analysis result (also namely obtaining the result data collection) afterwards, output analysis result (being the result data collection).The way of output of analysis result can have different modes according to user's demand.For example, can be multiple message stream modes such as tables of data, histogram, pie chart.
The step of carrying out the second multidimensional data analysis request comprises: obtain the multidimensional data corresponding with the second multidimensional data analysis request from second database, and form the result data collection.Concrete step is as follows:
Particularly, the multidimensional data analysis action type comprises volume, brill, section, stripping and slicing and five kinds of operations of rotation down, for example, with reference to shown in the table 2, the user can click " seriously " viewing area in this table to import the first multidimensional data analysis request, be designated "-" owing to " seriously " front, then show the dimension data of having analyzed with respect to decent lower level, then can obtain new metric (as shown in table 1) by the operation of last volume, so from the level still less attainment degree value distribution situation; The specific implementation process that should go up the volume operation is: the data on the row of corresponding tolerance zone, this viewing area gathered, generates after the new metric, carry out the adjustment of tables of data, form and show new analysis result data table, and as shown in table 1; Said process need not the query manipulation in the multi-dimensional database, has improved treatment effeciency; In addition, drill down operator is the inverse process of going up the volume operation.
Further, the user can add the dynasty dimension, obtain more detailed metric by drill down operator, if the one-level level namely on the dynasty dimension comprises " before bright " and " bright back " two elements (wherein, " bright back " element comprises the Ming Dynasty, this divides according to being: the Ming Dynasty is the maturity stage of acupuncture case development in ancient times), can on existing data analysis basis, further obtain the metric distribution situation of " before bright " and " bright back " so, and then from more angle attainment degree value distribution situation, understand in depth " the acupuncture case in ancient times for the treatment of dispute crooked symptom is on decent and eight extra-channel; bright before with bright after acupuncture points on the human body use frequency " situation, obtain interested, abundanter result.
More specifically, the user can also adjust the sequencing of passages through which vital energy circulates dimension data and dynasty dimension data, obtain new metric by rotary manipulation, namely understand " the acupuncture case in ancient times for the treatment of dispute crooked symptom bright before and after bright, the use of the acupuncture points on the human body on decent and eight extra-channel frequency " situation in depth, and then from new side attainment degree value distribution situation.
Step S206: from second database, obtain the result data collection corresponding with the second multidimensional data analysis request.
Particularly, carry out the above-mentioned query statement that generates in the second multidimensional data analysis request that the first multidimensional data analysis request is being converted into, and generate the result data collection.Wherein, at first link with the foundation of second database, afterwards according to the querying condition in the statement, in second database, carry out disposable inquiry or simple statistics, generate the result data collection.
Wherein, the query statement that changes into of the first multidimensional data analysis request may carry out simple tabulate statistics in second database; Be example with the related analysis request of table 1, the center of second database table has been stored " apoplexy " element on the disease layer in the disease dimension, this element is concrete syndrome daughter element on the syndrome layer, " acupuncture points on the human body use frequency " metric of above-mentioned syndrome daughter element " dispute is crooked " daughter element on msq layer, if the query statement of this request of execution, then need in second database, inquire about and gather " the syndrome daughter element that the apoplexy element comprises; and this daughter element comprises the crooked daughter element of dispute " " acupuncture points on the human body use frequency " metric, and then generate the result data collection.
Step S208, output result data collection.Particularly, can receive and show the result data collection in above-described embodiment, be converted into visual information, and it is showed the user by display device.
Fig. 5 is the schematic flow sheet according to the creation method of initial multi-dimensional database embodiment illustrated in fig. 3.According to embodiment shown in Figure 5, concrete steps are as follows:
Step S302 is according to second pre-conditioned explicit data analytic angle and the observation index, to determine dimension and tolerance.
Particularly, for example, acupuncture points on the human body is the core data of acupuncture case, and the case data analysis often need be observed the acupuncture points on the human body operating position, and the acupuncture points on the human body operating position can comprise acupuncture points on the human body frequency of usage and frequency; In addition, the case data analysis can be observed the acupuncture points on the human body operating position from symptom, passages through which vital energy circulates, human body branch, therapy, dynasty equal angles, and then obtains the rule of hiding in the case; Therefore, dimension can be defined as: dynasty, passages through which vital energy circulates, human body branch, therapy and five dimensions of disease, tolerance can be defined as: acupuncture points on the human body frequency of usage and acupuncture points on the human body frequency of utilization, thus can realize the case data analysis of multi-angle according to above-mentioned a plurality of dimension attainment degree values.In addition, because specific acupoint is the acupuncture points on the human body that has the special treatment effect and sort out by specific title, in the clinical practice of the science of acupuncture and moxibustion very important meaning is arranged; Therefore, use frequency and the frequency of utilization of specific acupoint have outstanding value, also should be set at the tolerance of multi-dimensional database.
Step S304 obtains the dimension element on dimension hierarchy and the dimension hierarchy.Particularly, according to the professional knowledge of tcm field, determine the element on dimension hierarchy and the dimension hierarchy.For example, according to tcm field knowledge, the passages through which vital energy circulates dimension can comprise top layer (be passages through which vital energy circulates itself, can be described as whole), one-level level (comprising decent and two elements of eight extra-channel), secondary level (as: the decent element in the upper level level comprises yin channel and two daughter elements of yang channel at this level), tertiary level (as: the yin channel element in the upper level level comprises the Three Yin channels of hand and two daughter elements of Three Yin Channels of Foot at this level), (as: Three Yin channels of hand element in the upper level level comprises meridian of Hand-TAI YIN at this level to the level Four level, three daughter elements of meridian of Hand-SHAO YIN and meridian of Hand-JUE YIN) and the Pyatyi level (as: meridian of Hand-TAI YIN in the upper level level comprises zhongfu point at this level, Yunmen cave, acupuncture points on the human body daughter elements such as cave, the self-sufficient and strategically located region).
According to above-described embodiment as can be known: time to minimum level, the description degree of passages through which vital energy circulates dimension is become more and more thinner by the top layer of passages through which vital energy circulates dimension; Otherwise the description degree becomes more and more thicker.The user can be according to the passages through which vital energy circulates dimension, is analysis path with " whole one a decent yin channel one Three Yin channels of hand is Taiyin channels one zhongfu point on the other hand ", progressively obtains more and more carefully or more and more thicker metric, and then realizes multi-level data analysis.In addition, exist notional logical organization relation between the dimension hierarchy, namely an element from the angle upper level dimension hierarchy of concept may comprise the one or more daughter elements on the next stage dimension hierarchy, otherwise, an element on the next stage dimension hierarchy is under the jurisdiction of the some father's elements on the upper level dimension hierarchy, as: conceptive, the yang channel element comprises the three Yang channels of hand and two elements of the three yang channels of foots, otherwise three Yang channels of hand element is under the jurisdiction of the yang channel element.
Step S306 uses first pre-conditionedly dimension element is carried out encode to handle, and obtains the dimension element code data.
Step S308 generates dimension table according to dimension element code data and dimension element.
Step S310 is according to dimension element code data and metric data generation center table.Particularly, can form the sky table of a center table, also can in the table of center, store some known complete dimension data and metrics.
Step S312 saves as multi-dimensional database with dimension table and center table.
Particularly, after execution in step S312, can also be with the data importing multi-dimensional database in the raw data base, and follow-up can the execution is kept at operation in the multi-dimensional database with second database.
Need to prove, can in the computer system such as one group of computer executable instructions, carry out in the step shown in the process flow diagram of accompanying drawing, and, though there is shown logical order in flow process, but in some cases, can carry out step shown or that describe with the order that is different from herein.
From above description, as can be seen, the present invention has realized following technique effect: at first obtain dimension data, from first database, extract first data acquisition corresponding with dimension data then, and first data in first data acquisition are assembled calculate metric after, first incidence relation of setting up between metric and the dimension data generates second database, can obtain first data acquisition by dimension data, after the foundation first data acquisition attainment degree value, set up first incidence relation between metric and the dimension data, generate second database by first incidence relation, import metric by first incidence relation and generate second database, and then provide accurately for multidimensional data analysis, the data source that is suitable for, thereby solved because existing ancient Chinese medicine case database is imperfect and data are unordered, thereby cause using the result of existing case data base querying data inaccurate, inefficiency, and can't realize the more problem of complex query, realized setting up rapidly and accurately the effect of complete ancient Chinese medicine case database, when the user inquires about ancient Chinese medicine case data, directly from second database, inquire about, can directly obtain complete Query Result, inquiry fast, accurately, efficient height.
Obviously, those skilled in the art should be understood that, above-mentioned each module of the present invention or each step can realize with the general calculation device, they can concentrate on the single calculation element, perhaps be distributed on the network that a plurality of calculation elements form, alternatively, they can be realized with the executable program code of calculation element, thereby, they can be stored in the memory storage and be carried out by calculation element, perhaps they are made into each integrated circuit modules respectively, perhaps a plurality of modules in them or step are made into the single integrated circuit module and realize.Like this, the present invention is not restricted to any specific hardware and software combination.
The above is the preferred embodiments of the present invention only, is not limited to the present invention, and for a person skilled in the art, the present invention can have various changes and variation.Within the spirit and principles in the present invention all, any modification of doing, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.