CN1787527A - Apparatus and method for languaging automatic digging of distributed isomeric data - Google Patents

Apparatus and method for languaging automatic digging of distributed isomeric data

Info

Publication number
CN1787527A
Authority
CN
China
Prior art keywords
module
data
information
metadata
layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200510111604
Other languages
Chinese (zh)
Inventor
吴国文
徐汝兴
张亮
张佩毅
孙建兵
孙华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI JINXIN COMPUTER SYSTEM ENGINEERING Co Ltd
Fudan University
Shanghai Jiaotong University
Original Assignee
SHANGHAI JINXIN COMPUTER SYSTEM ENGINEERING Co Ltd
Fudan University
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI JINXIN COMPUTER SYSTEM ENGINEERING Co Ltd, Fudan University, Shanghai Jiaotong University
Priority to CN 200510111604
Publication of CN1787527A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a device and a method for the semantic automatic mining of distributed heterogeneous data, intended for the long-term resource processing and management work of libraries. The device consists of a system of computers, a network and servers. The method relies on a metadata specification standard to collect, analyze and process metadata of different structures, and to integrate, process, publish, use and manage resources of various types and distributed resources and exchange information about them.

Description

Device and method for semantic automatic mining of distributed heterogeneous data
Technical field
The invention relates to a device and method in the field of digital asset management and integration. In particular, it relates to a device and method for semantic automatic mining of distributed heterogeneous data, mainly for use in libraries, whose long-term resource processing and management work involves a large body of data that is difficult to standardize with DC (Dublin Core).
Background art
The OAI (Open Archives Initiative) protocol was proposed in October 1999 to address the problem that resource repositories are independent of one another, so that related data, or data from different fields, is stored in scattered locations, cannot be integrated uniformly, and circulates poorly between repositories. Built on HTTP (Hypertext Transfer Protocol), the protocol defines six information exchange instructions and the format of the information each instruction returns, and uses XML to encapsulate and store the information.
With this structure, an OAI system can collect distributed resources and store them in a unified way, and can carry out value-added processing of the resources on top of the uniformly stored data, thereby providing effective value-added information services and raising the use value of the information.
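As an illustration of the six HTTP-based instructions mentioned above, the following minimal sketch builds a request URL for an OAI-PMH repository. The verb names are those of the OAI-PMH specification; the base URL `http://example.org/oai` and the helper name `build_oai_request` are assumptions made for the example and do not come from the patent.

```python
from urllib.parse import urlencode

# The six request verbs defined by the OAI-PMH specification.
OAI_VERBS = {
    "Identify", "ListMetadataFormats", "ListSets",
    "ListIdentifiers", "ListRecords", "GetRecord",
}


def build_oai_request(base_url: str, verb: str, **params: str) -> str:
    """Return the HTTP GET URL for one OAI request (hypothetical helper)."""
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI verb: {verb}")
    # The verb and its arguments travel as ordinary query parameters.
    query = urlencode({"verb": verb, **params})
    return f"{base_url}?{query}"


# Example: ask a (hypothetical) repository for all records in Dublin Core.
url = build_oai_request("http://example.org/oai", "ListRecords",
                        metadataPrefix="oai_dc")
```

The repository answers such a request with an XML document, which is what the later validation steps operate on.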
At present, the protocol has the following problems.
The OAI protocol collects data through communication instructions built on HTTP, but this collection process has several main problems:
1. What is collected is the metadata of an information record; the digital object or full-text data that the metadata describes is not managed. As a result, a user of an OAI system can learn only the description of a resource that the system has collected and cannot obtain the described resource itself. Because the system has no means of managing the resources themselves, it also cannot distinguish copies of the same resource that come from different data sources, let alone manage resource versions.
2. The data the system collects comes from different data sources, and each source may encode its information differently. The OAI protocol specifies only the XML structure of requests and responses; it gives no clear definition of the structure of the information itself, that is, the metadata specification (it only says that the DC standard must be supported). Consequently, once metadata has been collected locally it cannot be analyzed or processed, which limits the use of OAI systems.
3. OAI systems are very weak at integrating paid resources. The OAI protocol was proposed for collecting and processing information from large numbers of free resources. In domestic applications, however, an integrated system inevitably faces a large number of paid systems, and making good use of their resources is a major problem.
Summary of the invention
To overcome the above shortcomings, the main object of the invention is to provide a device and method for semantic automatic mining of distributed heterogeneous data that collects distributed resources and stores them in a unified way, carries out value-added processing of the resources on top of the uniformly stored data, and supports the collection, analysis and processing of metadata of various structures as well as the use and management of resources and information communication.
The technical problems to be solved are: how to collect distributed resources and store them uniformly, and carry out value-added processing on the uniformly stored data; how to provide effective value-added information services and raise the use value of information; and how to integrate, process, publish, use and manage resources of various types and distributed resources and exchange information about them.
The technical solution adopted is as follows. The device is formed by a hardware environment of computers, a network and a server system. The input/output terminals of the parallel computers are each connected to the input/output of the metadata publishing service module. The output signal of the metadata publishing service module is sent over the network, in wireless transmission form, to the metadata collection server module. The output of the metadata collection server module is connected to the input of a computer, and the output of that computer is connected to the input of the metadata processing server module. The system architecture takes the form of a layered structure.
The layered architecture of the semantic automatic mining device for distributed heterogeneous data consists, in order, of a data source management layer, a data encoding layer, an instruction layer and a data exchange layer, which exchange data over the network, wherein:
The data exchange layer is the bottom layer. Its signal transmission comprises URL (Uniform Resource Locator) redirection, access environment setup, interrupt handling, and providing interface signals to the instruction layer;
The instruction layer is the protocol support layer. Its signal transmission comprises instruction generation, handing generated instructions to the data exchange layer for information exchange, instruction validity confirmation, and the data acquisition signals passed to the data encoding layer;
The data encoding layer is the metadata information layer. Its signal transmission comprises encoding and decoding of metadata information, confirmation of the encoding scheme, separation of control information from the data information of the metadata, the information submitted to the data source management layer, acquisition of valid parameters, and feedback of data information;
The data source management layer is the resource management and use layer. Its signal transmission comprises the storage of data and management information, direct information exchange with the database, simulation of data source requests, and support information for paid resources.
A method for semantic automatic mining of distributed heterogeneous data. The method uses a metadata specification standard, a processing flow for information collection, and a layered communication method in which each lower-layer program provides a service interface to the layer above it, in order to collect, analyze and process metadata of various structures and to integrate, process, publish, use and manage resources of various types and distributed resources and exchange information about them. The working principle and specific steps of the metadata specification standard are:
Step 1. Encoding specification
An encoding specification is adopted; it may partially reuse several existing encoding specifications, or it may be an entirely new, self-defined encoding specification;
Step 2. Information analysis and integration
RDF (Resource Description Framework) and RDFS (Resource Description Framework Schema) are used to describe each information item of the metadata. RDF, a model description standard built on XML (Extensible Markup Language), provides the semantic description of the information; RDFS, acting as a schema, provides a means of validating metadata during information collection;
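The role of RDF as the description model and of a schema as the validation means can be sketched as follows. This is only an illustration under stated assumptions: the record is a small hand-written RDF/XML document, and the set `ALLOWED_PROPERTIES` is a hypothetical stand-in for the property declarations that a real RDFS schema would supply; full RDFS processing is not attempted.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"

# Hypothetical vocabulary: properties that the (custom) schema declares.
ALLOWED_PROPERTIES = {
    f"{{{DC_NS}}}title",
    f"{{{DC_NS}}}creator",
    f"{{{DC_NS}}}date",
}

# A made-up metadata record expressed in RDF/XML.
RECORD = f"""<rdf:RDF xmlns:rdf="{RDF_NS}" xmlns:dc="{DC_NS}">
  <rdf:Description rdf:about="http://example.org/item/1">
    <dc:title>Old Shanghai street scene</dc:title>
    <dc:creator>Unknown</dc:creator>
  </rdf:Description>
</rdf:RDF>"""


def undeclared_properties(rdf_xml: str) -> list[str]:
    """Return the property tags in the record that the schema does not declare."""
    root = ET.fromstring(rdf_xml)
    bad = []
    for description in root.findall(f"{{{RDF_NS}}}Description"):
        for prop in description:
            if prop.tag not in ALLOWED_PROPERTIES:
                bad.append(prop.tag)
    return bad
```

A record whose properties are all declared yields an empty list; any other property is reported so that the collecting system can reject or log the record.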
The specific steps of the system's processing flow for information collection are:
Step 1. Send the acquisition instruction
The output signal of the send-acquisition-instruction module is passed to the module that verifies the XML validity of the response;
Step 2. Validity check A
The output signal of the module that verifies the XML validity of the response is passed to the validity check A module;
If the result is valid, the flow proceeds to the extract-metadata-record module;
If the result is invalid, the flow proceeds to the module that reports a system error and records the error message;
Step 3. Validity check B
The output signal of the extract-metadata-record module is passed to the validity check B module;
If the result is valid, the flow proceeds to the metadata processing module;
If the result is invalid, the flow proceeds to the module that reports a system error and records the error message.
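The three steps above can be sketched as a single function. This is a minimal sketch, not the patented implementation: the callables `fetch`, `is_valid_record` and `process` and the element name `record` are assumptions, and validity check A is reduced here to an XML well-formedness check, whereas a real system would validate the response against the schema prescribed by the protocol.

```python
import xml.etree.ElementTree as ET


def harvest(fetch, is_valid_record, process, errors):
    """Run one collection pass and return the number of records processed."""
    response = fetch()                      # step 1: send the acquisition instruction
    try:
        root = ET.fromstring(response)      # step 2: validity check A (well-formedness only)
    except ET.ParseError as exc:
        errors.append(f"invalid response: {exc}")   # report and record the error
        return 0
    handled = 0
    for record in root.iter("record"):      # extract the metadata records
        if is_valid_record(record):         # step 3: validity check B
            process(record)                 # metadata processing
            handled += 1
        else:
            errors.append("invalid record") # report and record the error
    return handled


# Usage with stand-in callables: one record carries an id, one does not.
errors, processed = [], []
count = harvest(
    lambda: "<response><record id='1'/><record/></response>",
    lambda rec: "id" in rec.attrib,
    processed.append,
    errors,
)
```

Both failure branches end in the same error-recording step, which mirrors the single error module shared by checks A and B in the flow.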
The layered communication method of the semantic automatic mining method for distributed heterogeneous data exchanges data over the network. The specific steps of its workflow are:
Step 1. Data source access request
The data source access request signal is passed to the data source management layer module;
Step 2. Data source information encoding request and data acquisition request
The data source information encoding request output by the data source management layer module is passed to the data encoding layer module, and the data acquisition request output by the data encoding layer module is passed to the data source management layer module;
Step 3. Instruction generation request and instruction parameter decoding request
The instruction generation request output by the data encoding layer module is passed to the instruction layer module, and the instruction parameter decoding request output by the instruction layer module is passed to the data encoding layer module;
Step 4. Data source access request and instruction verification request
The data source access request output by the instruction layer module is passed to the data exchange layer module, and the instruction verification request output by the data exchange layer module is passed to the instruction layer module.
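The outgoing half of this layered communication (from the data source access request down to the network) can be sketched with four small classes, each holding a reference only to the layer directly below it. All class and method names are hypothetical, and the bottom layer returns a placeholder string instead of performing real HTTP traffic.

```python
class ExchangeLayer:
    """Bottom layer: would perform the HTTP exchange; here it returns a placeholder."""

    def send(self, url, trace):
        trace.append("exchange")
        return f"<response to='{url}'/>"


class InstructionLayer:
    """Protocol support layer: turns parameters into an instruction (a URL)."""

    def __init__(self, below):
        self.below = below

    def request(self, params, trace):
        trace.append("instruction")
        query = "&".join(f"{key}={value}" for key, value in params.items())
        return self.below.send("http://example.org/oai?" + query, trace)


class EncodingLayer:
    """Metadata information layer: encodes the request parameters."""

    def __init__(self, below):
        self.below = below

    def collect(self, fields, trace):
        trace.append("encoding")
        return self.below.request({"verb": "ListRecords", **fields}, trace)


class SourceManagementLayer:
    """Top layer: receives the data source access request."""

    def __init__(self, below):
        self.below = below

    def access(self, source, trace):
        trace.append("source-management")
        return self.below.collect({"metadataPrefix": source["prefix"]}, trace)


# Wire the layers so that each one knows only its direct neighbour below.
stack = SourceManagementLayer(EncodingLayer(InstructionLayer(ExchangeLayer())))
trace = []
reply = stack.access({"prefix": "oai_dc"}, trace)
```

Because no class holds a reference to anything but its neighbour, a call cannot skip a layer, which is the property the step sequence is meant to guarantee.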
The beneficial effects of the invention are as follows. It provides distributed collection and unified storage of distributed resources and carries out value-added processing of the resources on top of the uniformly stored data, thereby providing effective value-added information services and raising the use value of information. Through the metadata specification standard, the system can effectively collect, analyze and process metadata of various structures. To improve the system's use and management of the resource objects that the metadata describes, metadata is collected and standardized in the METS manner, which supplies the system with more management information about each record in addition to the metadata itself.
Description of drawings
The invention is further described below with reference to the drawings and embodiments.
Figure 1 is a block diagram of the hardware environment of the invention;
Figure 2 is a block diagram of the layered architecture of the invention;
Figure 3 is a flow diagram of the information collection processing of the invention;
Figure 4 is a flow diagram of the information communication method of the invention;
Figure 5 is a schematic diagram of the application system of one embodiment of the invention;
Explanation of reference numerals:
1 - computer;
2 - metadata publishing service;
3 - network;
4 - metadata collection server;
5 - metadata processing server;
10 - data source management layer;
11 - data encoding layer;
12 - instruction layer;
13 - data exchange layer;
20 - send acquisition instruction;
21 - verify XML validity of the response;
22 - validity check A;
23 - extract metadata records;
24 - validity check B;
25 - metadata processing;
26 - report system error and record error message;
30 - data source access request;
31 - data source information encoding request;
32 - data acquisition request;
33 - instruction parameter decoding request;
34 - instruction generation request;
35 - data source access request;
36 - instruction verification request;
40 - journal management department;
41 - internal management process A;
42 - data publishing A;
50 - audio management department;
51 - internal management process B;
52 - data publishing B;
60 - image management department;
61 - internal management process C;
62 - data publishing C;
70 - network service management department;
71 - information collection server;
72 - data publishing service;
Embodiment:
Referring to Figure 1, the device of the invention is formed by a hardware environment of computers, a network and a server system. The input/output terminals of the parallel computers (1) are each connected to the input/output of the metadata publishing service (2) module. The output signal of the metadata publishing service (2) module is sent over the network (3), in wireless transmission form, to the metadata collection server (4) module. The output of the metadata collection server (4) module is connected to the input of a computer (1), and the output of that computer (1) is connected to the input of the metadata processing server (5) module. The system architecture takes the form of a layered structure.
Referring to Figure 2, the layered architecture of the device consists, in order, of the data source management layer (10), the data encoding layer (11), the instruction layer (12) and the data exchange layer (13), which exchange data over the network (3), wherein:
The data exchange layer (13) is the bottom layer. Its signal transmission comprises URL (Uniform Resource Locator) redirection, access environment setup, interrupt handling, and providing interface signals to the instruction layer;
The instruction layer (12) is the protocol support layer. Its signal transmission comprises instruction generation, handing generated instructions to the data exchange layer (13) for information exchange, instruction validity confirmation, and the data acquisition signals passed to the data encoding layer (11);
The data encoding layer (11) is the metadata information layer. Its signal transmission comprises encoding and decoding of metadata information, confirmation of the encoding scheme, separation of control information from the data information of the metadata, the information submitted to the data source management layer (10), acquisition of valid parameters, and feedback of data information;
The data source management layer (10) is the resource management and use layer. Its signal transmission comprises the storage of data and management information, direct information exchange with the database, simulation of data source requests, and support information for paid resources.
Referring to Figures 3 and 4, a method for semantic automatic mining of distributed heterogeneous data uses a metadata specification standard, a processing flow for information collection, and a layered communication method in which each lower-layer program provides a service interface to the layer above it, in order to collect, analyze and process metadata of various structures and to integrate, process, publish, use and manage resources of various types and distributed resources and exchange information about them, wherein:
The working principle and specific steps of the metadata specification standard are:
Step 1. Encoding specification
An encoding specification is adopted; it may partially reuse several existing encoding specifications, or it may be an entirely new, self-defined encoding specification;
Step 2. Information analysis and integration
RDF (Resource Description Framework) and RDFS (Resource Description Framework Schema) are used to describe each information item of the metadata. RDF, a model description standard built on XML (Extensible Markup Language), provides the semantic description of the information; RDFS, acting as a schema, provides a means of validating metadata during information collection;
The specific steps of the system's processing flow for information collection are:
Step 1. Send the acquisition instruction (20)
The output signal of the send-acquisition-instruction (20) module is passed to the module that verifies the XML validity of the response (21);
Step 2. Validity check A (22)
The output signal of the module that verifies the XML validity of the response (21) is passed to the validity check A (22) module;
If the result is valid, the flow proceeds to the extract-metadata-record (23) module;
If the result is invalid, the flow proceeds to the module that reports a system error and records the error message (26);
Step 3. Validity check B (24)
The output signal of the extract-metadata-record (23) module is passed to the validity check B (24) module;
If the result is valid, the flow proceeds to the metadata processing (25) module;
If the result is invalid, the flow proceeds to the module that reports a system error and records the error message (26).
Referring to Figure 4, the layered communication method exchanges data over the network. The specific steps of its workflow are:
Step 1. Data source access request (30)
The data source access request (30) signal is passed to the data source management layer (10) module;
Step 2. Data source information encoding request (31) and data acquisition request (32)
The data source information encoding request (31) output by the data source management layer (10) module is passed to the data encoding layer (11) module, and the data acquisition request (32) output by the data encoding layer (11) module is passed to the data source management layer (10) module;
Step 3. Instruction generation request (34) and instruction parameter decoding request (33)
The instruction generation request (34) output by the data encoding layer (11) module is passed to the instruction layer (12) module, and the instruction parameter decoding request (33) output by the instruction layer (12) module is passed to the data encoding layer (11) module;
Step 4. Data source access request (35) and instruction verification request (36)
The data source access request (35) output by the instruction layer (12) module is passed to the data exchange layer (13) module, and the instruction verification request (36) output by the data exchange layer (13) module is passed to the instruction layer (12) module.
Without changing the overall structure of the OAI protocol, the system of the invention proposes the following improvements:
1. Metadata specification
In the long-term resource processing and management work of libraries there is a large body of data that is difficult to standardize with DC (Dublin Core). If the system is to support the integration of these resources, using DC as the default resource specification is clearly not workable. The OAI protocol does allow other metadata encoding specifications to be used for formatting data, but there is no way to convert data formats between different encoding specifications; a typical example is the conversion between MARC (Machine Readable Catalogue) data and DC data.
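To make the conversion problem concrete, the sketch below maps a few MARC fields to Dublin Core elements through a fixed lookup table (a "crosswalk"). This is an illustration of the problem rather than the mechanism of the invention: the three mappings are a small illustrative subset, the tuple-based record format is an assumption, and the `unmapped` list shows how information that has no DC counterpart falls out of a fixed table.

```python
# Illustrative crosswalk only: (MARC tag, subfield) -> Dublin Core element.
CROSSWALK = {
    ("245", "a"): "title",
    ("260", "b"): "publisher",
    ("650", "a"): "subject",
}


def marc_to_dc(marc_fields):
    """Convert (tag, subfield, value) tuples into a DC dictionary.

    Returns the DC record and the list of fields that had no mapping.
    """
    dc, unmapped = {}, []
    for tag, subfield, value in marc_fields:
        element = CROSSWALK.get((tag, subfield))
        if element is None:
            unmapped.append((tag, subfield))
        else:
            dc.setdefault(element, []).append(value)
    return dc, unmapped


# A made-up record; field 999 stands for a locally defined field.
record = [
    ("245", "a", "Shanghai pictorial"),
    ("650", "a", "Shanghai (China)"),
    ("650", "a", "Periodicals"),
    ("999", "x", "local shelf note"),
]
dc_record, lost = marc_to_dc(record)
```

Locally defined or collection-specific fields end up in `lost`, which is the kind of gap that motivates describing each information item semantically instead of relying on a fixed target format.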
We consider that the data publishing service of an OAI system may adopt any encoding specification, which may partially reuse several existing encoding specifications or be an entirely new, self-defined one. The system is then almost unrestricted in the information encoding of the resources it can integrate, but this raises a problem of information analysis and integration. To solve it, we require that RDF and RDFS be used to describe each information item of the metadata. RDF, as a model description standard built on XML, is well suited to the semantic description of information, and RDFS, as a schema, provides a means of validating metadata during information collection.
By standardizing the metadata specification in this way, the system can effectively collect, analyze and process metadata of various structures. To improve the system's use and management of the resource objects that the metadata describes, we also suggest collecting and standardizing metadata in the METS (Metadata Encoding and Transmission Standard) manner, which supplies the system with more management information about each record in addition to the metadata itself.
RDF, RDFS, MARC and DC are standards that libraries already have; they are used in different areas such as information description and information encapsulation. OAI requires support for DC, but that requirement alone weakens the system's basic ability to handle non-standard resources. The innovation of our platform is to combine the above standards, thereby meeting the system's basic requirements for special-collection resources and non-standardized resources.
2. System architecture
The system's processing flow for information collection is shown in Figure 3.
The system is designed as a layered structure. Following the information processing flow of Figure 3, the layers of the system are as shown in Figure 2, wherein:
Data exchange layer (13)
This layer exchanges data over the network using HTTP and handles the problems associated with that exchange, including URL redirection, access environment setup and interrupt handling. It provides an interface to the instruction layer.
This layer is the foundation of data exchange for the whole system and is its lowest-level structure. Communication between the SP (service provider) and the DP (data provider) is handled by this layer's module, which forms network packets according to the HTTP standard. The module also records how the access parameters change during network access, including browser parameters, session parameters, cookie parameters, and the parameters of network address routing and gateways. These parameters are the essential basis of a network access and are called the "scene" in the system. When an access is interrupted, these parameters are saved; when the interrupted operation is resumed, the system restores the scene from them so that the operation can continue normally.
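Saving and restoring the "scene" of an interrupted access can be sketched as follows. This is a minimal sketch under stated assumptions: the `AccessScene` fields are a hypothetical selection of the parameters listed above, and the in-memory `SceneStore` stands in for whatever persistent storage a real system would use.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class AccessScene:
    """Snapshot of the parameters of one network access (hypothetical fields)."""
    url: str
    headers: dict = field(default_factory=dict)   # browser parameters
    cookies: dict = field(default_factory=dict)   # cookie parameters
    session_id: str = ""                          # session parameter


class SceneStore:
    """Keeps one saved scene per interrupted task."""

    def __init__(self):
        self._saved = {}

    def save(self, task_id: str, scene: AccessScene) -> None:
        # Deep copy so later changes to the live scene do not alter the snapshot.
        self._saved[task_id] = copy.deepcopy(scene)

    def restore(self, task_id: str) -> AccessScene:
        return copy.deepcopy(self._saved[task_id])


# An access is interrupted, its scene is saved, and later restored.
live = AccessScene("http://example.org/oai", cookies={"token": "abc"}, session_id="s1")
store = SceneStore()
store.save("harvest-42", live)
live.cookies["token"] = "expired"      # the live scene changes after the interruption
resumed = store.restore("harvest-42")  # the saved parameters are recovered unchanged
```

Copying on both save and restore keeps the snapshot independent of the running access, so resuming always starts from the parameters that were valid at the moment of interruption.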
Instruction layer (12)
This layer is mainly responsible for verifying data correctness and organizing data according to the OAI protocol, and it calls the interface of the data exchange layer to carry out data interaction. The objects it manages are URLs and the XML returned as a response. Its management of URLs, combined with the system log, helps implement incremental data collection from a given data source. When it receives a valid URL instruction, this layer also organizes the corresponding response file in the format the protocol prescribes. On the receiving side, the work of this layer is to verify that the received URL conforms to the OAI standard and to validate the response data against the schema that the OAI protocol prescribes.
The instruction layer is the protocol support layer of the whole system. At the SP end, this layer's module generates access instructions, hands them to the data exchange layer for information exchange and, once a response is obtained, validates the format of the response as the OAI protocol requires. At the DP end, it confirms the validity of the instructions accepted by the data exchange layer; once validity is confirmed, the system passes the valid information in the instruction to the data encoding layer to obtain the data. After the data is obtained, the layer organizes the response and hands it to the data exchange layer, which returns it to the requester.
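Incremental collection driven by the system log can be sketched with the `from` argument that OAI-PMH defines for selective harvesting by datestamp. The shape of `harvest_log` (a mapping from a data source identifier to the date of its last successful collection) and the function name are assumptions made for this example.

```python
from urllib.parse import urlencode


def incremental_request(base_url, harvest_log, source_id, prefix="oai_dc"):
    """Build a ListRecords URL that asks only for records changed since the last run."""
    params = {"verb": "ListRecords", "metadataPrefix": prefix}
    last_run = harvest_log.get(source_id)
    if last_run:
        # OAI-PMH selective harvesting: restrict the request by datestamp.
        params["from"] = last_run
    return f"{base_url}?{urlencode(params)}"


# Hypothetical log: the journals source was last collected on 2005-01-01.
log = {"journals": "2005-01-01"}
first_time = incremental_request("http://example.org/oai", log, "audio")
follow_up = incremental_request("http://example.org/oai", log, "journals")
```

A source with no log entry receives a full request, while a known source receives only what has changed since the logged date.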
Data encoding layer (11)
This layer is mainly responsible for encoding and decoding metadata information. Encoding and decoding follow the content specified in RDFS. For information encoded in the METS manner, the system automatically separates the control information from the data information of the metadata according to the meaning given by RDF.
At the SP end, the main work of the data encoding layer is to encode the parameters of an access instruction in a form that the DP can accept and, when the response arrives, to extract the information according to RDFS. At the DP end, its work is to confirm the encoding scheme of the parameters received in the access request and, once the parameters are confirmed to be valid, to submit a request to the data source management layer to obtain the information. After the data is obtained, it forms data information in RDF structure according to RDFS and returns it.
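The separation of control information from descriptive metadata in a METS document can be sketched as follows. The text above says the separation follows the meaning given by RDF; this sketch instead uses a simpler rule, stated here as an assumption: the standard METS section `dmdSec` is treated as descriptive metadata and every other top-level section as control or management information. The sample document is made up.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"

# Assumption for this sketch: only dmdSec carries descriptive metadata.
DESCRIPTIVE_SECTIONS = {"dmdSec"}

# A skeletal METS document containing only empty top-level sections.
SAMPLE = f"""<mets xmlns="{METS_NS}">
  <metsHdr/>
  <dmdSec ID="d1"/>
  <amdSec/>
  <fileSec/>
  <structMap/>
</mets>"""


def split_mets(mets_xml: str):
    """Return (control_sections, descriptive_sections) as lists of local names."""
    root = ET.fromstring(mets_xml)
    control, descriptive = [], []
    for section in root:
        local_name = section.tag.split("}")[-1]   # drop the namespace part
        if local_name in DESCRIPTIVE_SECTIONS:
            descriptive.append(local_name)
        else:
            control.append(local_name)
    return control, descriptive


control, descriptive = split_mets(SAMPLE)
```

In a fuller version the functions would return the section elements themselves, so that the descriptive part can be handed to metadata processing and the rest to resource management.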
Data source management layer (10)
This layer uses DOIs (Digital Object Identifiers). From the parameters that the administrator sets when registering a data source (mainly the user authentication parameters for resource access) and the management parameters that the system collects automatically, it builds a virtual resource at this layer, and remote resources are managed and used through this virtual resource.
A strict layered structure requires that communication within the system never skips a layer; that is, each lower-layer program provides a service interface to the layer above it. In the OAI system, information communication proceeds as shown in Figure 4.
3. Support for paid resources
Because the data source management layer exists, the system can simulate the required data source locally. When the administrator registers such a resource and holds the legal right to use it, the resource management of this layer can, by simulating the access environment, serve simultaneous requests from multiple users for that resource. For paid resources, considering the ways in which HTTP-based systems authenticate users (user name plus password, or IP address), the system, once it has logged in successfully and obtained the legal right of use, automatically saves the network environment parameters of the legitimate access (addresses, parameters, cookies and so on), and in this way makes legal use of the paid information that the resource provides.
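The idea of a locally registered virtual resource that serves several users through one saved access environment can be sketched as follows. Everything here is hypothetical: the class name, its fields, the example DOI and URL, and the dictionary returned by `request_for` are illustrative choices, and no real authentication or network access is performed.

```python
from dataclasses import dataclass, field


@dataclass
class VirtualResource:
    """Local stand-in for a remote, possibly paid, data source (hypothetical)."""
    doi: str
    remote_url: str
    access_env: dict = field(default_factory=dict)  # saved cookies and parameters
    licensed: bool = False                          # set when legal use is confirmed

    def request_for(self, user: str, path: str) -> dict:
        """Describe the request that would be sent on behalf of one local user."""
        if not self.licensed:
            raise PermissionError("no legal right to use this resource")
        return {
            "url": self.remote_url + path,
            "cookies": dict(self.access_env),       # reuse the saved access environment
            "on_behalf_of": user,
        }


# The administrator registers the resource after a successful, licensed login.
resource = VirtualResource(
    doi="10.1234/example",
    remote_url="http://paid.example.org",
    access_env={"session": "xyz"},
    licensed=True,
)
first = resource.request_for("reader-a", "/article/1")
second = resource.request_for("reader-b", "/article/2")
```

Both requests carry the same saved environment, which is how several local users can share one legitimately obtained access.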
Seeing also shown in the accompanying drawing 5, below is one of embodiment of success of the present invention:
In the resource consolidation project in library, the dissimilar resource that library users is a large amount of, and exist at each resource some maturation but the characteristic resources (image, phonotape and videotape, picture, literal etc.) of non-standardization.These resources are difficult to adopt a kind of general metadata specification to be encoded, but can obtain a large amount of value-added services when these resources combine.(in the real system, by setting up the association between the resource, can reflect the history of Shanghai well after these resources are gathered) in the eighties of last century twenty or thirty age.Its application system is as shown in Figure 5:
This system comprises a magazine management department (40), an audio management department (50), an image management department (60), a network service management department (70), and so on, wherein:
The magazine management department (40) consists, in order, of internal control process A (41), a computer (1), and a data publishing A (42) module. The output signal of internal control process A (41) is passed to the computer (1), the output signal of the computer (1) is passed to the data publishing A (42) module, and the output signal of the data publishing A (42) module is passed to the information collection server (71) module;
The audio management department (50) consists, in order, of internal control process B (51), a computer (1), and a data publishing B (52) module. The output signal of internal control process B (51) is passed to the computer (1), the output signal of the computer (1) is passed to the data publishing B (52) module, and the output signal of the data publishing B (52) module is passed to the information collection server (71) module;
The image management department (60) consists, in order, of internal control process C (61), a computer (1), and a data publishing C (62) module. The output signal of internal control process C (61) is passed to the computer (1), the output signal of the computer (1) is passed to the data publishing C (62) module, and the output signal of the data publishing C (62) module is passed to the information collection server (71) module;
The network service management department (70) consists, in order, of the information collection server (71), a computer (1), and a data publishing service (72) module. The output signal of the information collection server (71) is passed to the computer (1), and the output signal of the computer (1) is passed to the data publishing service (72) module.
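The data flow of FIG. 5 can be illustrated with a short sketch in which each department publishes its records and the network service management department collects and republishes them. The class names, the record fields (`title`, `topic`), and the grouping by topic are assumptions made for illustration; the patent does not specify how the associations between resources are built.

```python
from typing import Dict, List


class Department:
    """A department whose internal process feeds a data publishing module."""

    def __init__(self, name: str, records: List[dict]) -> None:
        self.name = name
        self._records = records

    def publish(self) -> List[dict]:
        # Tag each published record with its originating department.
        return [{**record, "source": self.name} for record in self._records]


class NetworkServiceDepartment:
    """Collects what the departments publish and republishes the merged set."""

    def __init__(self) -> None:
        self.collected: List[dict] = []

    def collect(self, departments: List[Department]) -> None:
        # Plays the role of the information collection server (71).
        for department in departments:
            self.collected.extend(department.publish())

    def publish_by_topic(self) -> Dict[str, List[str]]:
        # Plays the role of the data publishing service (72): group the
        # sources by a shared topic so related resources are linked.
        index: Dict[str, List[str]] = {}
        for record in self.collected:
            index.setdefault(record["topic"], []).append(record["source"])
        return index


magazines = Department("magazines", [{"title": "Pictorial", "topic": "Shanghai 1920s"}])
audio = Department("audio", [{"title": "Radio recording", "topic": "Shanghai 1920s"}])
images = Department("images", [{"title": "Street photograph", "topic": "Shanghai 1930s"}])

service = NetworkServiceDepartment()
service.collect([magazines, audio, images])
index = service.publish_by_topic()
```

Here `index` maps each topic to the departments that hold material on it, a deliberately simple stand-in for the associations between resources mentioned in the embodiment.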

Claims (4)

1. A device for semantic automatic mining of distributed heterogeneous data, the device having a hardware environment of computers, a network, and servers, characterized in that: the input/output terminal of each parallel computer (1) is connected to the input/output of a metadata publishing service (2) module; the output signal of the metadata publishing service (2) module is transmitted over the network (3), in wireless form, to a metadata acquisition server (4) module; the output of the metadata acquisition server (4) module is connected to the input of a computer (1); the output of that computer (1) is connected to the input of a metadata processing server (5) module; and the system architecture is hierarchical.
2. The device for semantic automatic mining of distributed heterogeneous data according to claim 1, characterized in that: the hierarchical structure of the system architecture consists, in order, of a data source management layer (10), a data encoding layer (11), an instruction layer (12), and a data exchange layer (13), which exchange data with one another through the network (3), wherein:
The data exchange layer (13) is the bottom layer, and its signal transmission comprises: URL redirection, access environment setup, interrupt handling, and providing interface signals to the instruction layer;
The instruction layer (12) is the protocol support layer, and its signal transmission comprises: instruction generation, passing the generated instructions to the data exchange layer (13), instruction validation, and the data acquisition signals passed to the data encoding layer (11);
The data encoding layer (11) is the metadata information layer, and its signal transmission comprises: encoding and decoding at the information level, confirmation of the encoding scheme, filtering of the control information and data information of the metadata, the information submitted to the data source management layer (10), acquisition of actual parameters, and feedback of data information;
The data source management layer (10) is the resource management application layer, and its signal transmission comprises: storage of data and management information, direct information exchange with databases, data source simulation requests, and support information for fee-based resources.
3. A method for semantic automatic mining of distributed heterogeneous data, characterized in that:
The method uses a metadata scheme specification, the system's processing flow for information acquisition, and a layered communication method in which each lower-layer program provides a service interface to the layer above, in order to collect, analyze, and process metadata of various structures and to integrate, process, publish, use, manage, and communicate resources of all types, including distributed resources, wherein:
The working principle and specific steps of the metadata scheme specification are:
Step 1. Encoding specification
An encoding specification is adopted; it either partially reuses several existing encoding specifications or is an entirely new, custom-defined encoding specification;
Step 2. Information parsing and integration
RDF and RDFS are used to describe each information item of the metadata: RDF, a model description standard built on XML, provides a semantic description of the information, and RDFS, serving as a schema, provides a means of validating the metadata during information acquisition;
The specific steps of the system's processing flow for information acquisition are:
Step 1. Send acquisition instruction (20)
The output signal of the send acquisition instruction (20) module is passed to the module that verifies the XML validity of the feedback result (21);
Step 2. Validity check A (22)
The output signal of the module that verifies the XML validity of the feedback result (21) is passed to the validity check A (22) decision module;
If the result is valid, the flow proceeds to the extract metadata record (23) module;
If the result is invalid, the flow proceeds to the system error reporting and error logging (26) module;
Step 3. Validity check B (24)
The output signal of the extract metadata record (23) module is passed to the validity check B (24) module;
If the result is valid, the flow proceeds to the metadata processing (25) module;
If the result is invalid, the flow proceeds to the system error reporting and error logging (26) module.
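The acquisition flow in claim 3 (send instruction, check the XML, extract records, check each record, then process or log an error) can be sketched as below. This is a simplified illustration under stated assumptions: the function name `harvest`, the `record` element name, and the `REQUIRED_FIELDS` rule are hypothetical, and a plain required-field check is swapped in for the RDFS-based validation that the claim describes.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List

REQUIRED_FIELDS = ("identifier", "title")  # assumed validity rule, not from the patent


def harvest(send_instruction: Callable[[], str], errors: List[str]) -> List[dict]:
    # Step 1 (20): send the acquisition instruction and receive the feedback.
    feedback = send_instruction()

    # Step 2 (21, 22): check that the feedback is well-formed XML.
    try:
        root = ET.fromstring(feedback)
    except ET.ParseError as exc:
        errors.append(f"invalid XML: {exc}")  # error logging (26)
        return []

    processed = []
    # Extract metadata records (23) from the feedback.
    for element in root.iter("record"):
        record = {child.tag: (child.text or "").strip() for child in element}
        # Step 3 (24): check each extracted record before processing it.
        missing = [name for name in REQUIRED_FIELDS if not record.get(name)]
        if missing:
            errors.append(f"invalid record, missing {missing}")  # error logging (26)
            continue
        processed.append(record)  # stands in for metadata processing (25)
    return processed


sample = (
    "<records>"
    "<record><identifier>1</identifier><title>A</title></record>"
    "<record><identifier>2</identifier></record>"
    "</records>"
)
log: List[str] = []
records = harvest(lambda: sample, log)
```

In this run the first record is processed and the second is logged as invalid because it has no title; malformed feedback would instead be rejected at step 2 before any record is extracted.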
4. The method for semantic automatic mining of distributed heterogeneous data according to claim 3, characterized in that: the layered communication method exchanges data through the network, and the specific steps of its workflow are:
Step 1. Data source access request (30)
The data source access request (30) signal is passed to the data source management layer (10) module;
Step 2. Data source information encoding request (31) and data acquisition request (32)
The data source information encoding request (31) output by the data source management layer (10) module is passed to the data encoding layer (11) module, and the data acquisition request (32) output by the data encoding layer (11) module is passed to the data source management layer (10) module;
Step 3. Instruction generation request (34) and instruction parameter decoding request (33)
The instruction generation request (34) output by the data encoding layer (11) module is passed to the instruction layer (12) module, and the instruction parameter decoding request (33) output by the instruction layer (12) module is passed to the data encoding layer (11) module;
Step 4. Data source access request (35) and instruction verification request (36)
The data source access request (35) output by the instruction layer (12) module is passed to the data exchange layer (13) module, and the instruction verification request (36) output by the data exchange layer (13) module is passed to the instruction layer (12) module.
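The request sequence of claim 4 can be written down as data and checked for one property that the claim implies, namely that every request travels between neighbouring layers. The reference numerals follow the claim; the `Message` type, the use of `0` for the external caller, and the helper functions are assumptions introduced only for this sketch.

```python
from typing import List, NamedTuple

LAYERS = {
    10: "data source management layer",
    11: "data encoding layer",
    12: "instruction layer",
    13: "data exchange layer",
}


class Message(NamedTuple):
    number: int    # reference numeral of the request in the claim
    name: str
    sender: int    # 0 denotes the external caller (an assumption)
    receiver: int


WORKFLOW: List[Message] = [
    Message(30, "data source access request", 0, 10),
    Message(31, "data source information encoding request", 10, 11),
    Message(32, "data acquisition request", 11, 10),
    Message(34, "instruction generation request", 11, 12),
    Message(33, "instruction parameter decoding request", 12, 11),
    Message(35, "data source access request", 12, 13),
    Message(36, "instruction verification request", 13, 12),
]


def crosses_adjacent_layers_only(workflow: List[Message]) -> bool:
    # Each message must travel between neighbouring layers; the entry
    # message from the external caller is exempt from the check.
    return all(
        m.sender == 0 or abs(m.sender - m.receiver) == 1 for m in workflow
    )


def describe(m: Message) -> str:
    sender = LAYERS.get(m.sender, "caller")
    return f"({m.number}) {m.name}: {sender} -> {LAYERS[m.receiver]}"


trace = [describe(m) for m in WORKFLOW]
```

Printing `trace` line by line gives a readable walk through steps 1 to 4, with each upper layer calling down and each lower layer answering back to its immediate neighbour.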
CN 200510111604 2005-12-16 2005-12-16 Apparatus and method for languaging automatic digging of distributed isomeric data Pending CN1787527A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200510111604 CN1787527A (en) 2005-12-16 2005-12-16 Apparatus and method for languaging automatic digging of distributed isomeric data

Publications (1)

Publication Number Publication Date
CN1787527A true CN1787527A (en) 2006-06-14

Family

ID=36784830

Country Status (1)

Country Link
CN (1) CN1787527A (en)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication