CN101216851A - Ontology data administrative system and method - Google Patents

Ontology data administrative system and method Download PDF

Info

Publication number
CN101216851A
CN101216851A CNA2008100560987A CN200810056098A CN101216851A CN 101216851 A CN101216851 A CN 101216851A CN A2008100560987 A CNA2008100560987 A CN A2008100560987A CN 200810056098 A CN200810056098 A CN 200810056098A CN 101216851 A CN101216851 A CN 101216851A
Authority
CN
China
Prior art keywords
module
data
ontology
importing
data management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008100560987A
Other languages
Chinese (zh)
Inventor
孟小峰
陈妍
蒋瑜
欧建波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA2008100560987A priority Critical patent/CN101216851A/en
Publication of CN101216851A publication Critical patent/CN101216851A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an ontology data management system which comprises a data storage module for organizing and managing the disc data pages and providing storage management service for the upper layers by taking a logic page as a unit; a data management module for ontology data management in a triple form, including the storage organizing, access and update of the triple in the data page; a data processing module for importing, querying and updating the ontology data and a user interface module for the interaction between users and the system.

Description

A kind of ontology data (Ontology) management system and method
Technical field
The present invention relates to technical field of the computer network, especially relate to a kind of ontology data (Ontology) management method.
Background technology
WWW is a huge information resource database, and text message covering extensive fields is not only arranged, and colourful multimedia messages is arranged, and no matter when and where, people can obtain the information of oneself wanting easily.Yet the blast of information makes current WWW can't bear the heavy load, and wants to find the information of oneself wanting to look for a needle in a haystack especially from this huge resources bank.Search engine such as number of site such as yahoo, *** can provide some helps, but these also are nowhere near.Many times, the user not only wants to search a webpage, and it is more that they also want to allow computing machine do.For example, user's plan goes to the Caribbean to spend the vacation of a romance, want the Italian hotel of a predetermined medium-priced on Web before going, and this hotel has only the distance of five blocks from a famous cinema.These information may be provided by different websites respectively, and under machine can't the situation of identifying information, all things all needed the user to do in person.The user need go the information that these websites obtain to be needed respectively, and then selects suitable content according to the needs of oneself.This demand is that the search engine of current popular can't satisfy, so a lot of research work just occurred about Web information integration aspect, but the work of this respect is general all at a specific application, can't corresponding different data type, therefore can't be applied directly on the complicated applications based on Web information.
For the information from the local data source of difference is integrated easily, need be by the common understanding of an association area knowledge.Ontology provides a public vocabulary of supporting to share and reuse knowledge.Ontology describes the semanteme of real world by the relation between description things and the things.
Ontology can be used as the instrument of knowledge sharing and Knowledge Discovery: it can be used as the communication media between the different system on the one hand, describes by existing Ontology on the other hand and can know more knowledge by inference.Semantic network is the important application scene of Ontology, and it makes machine can discern and handle information on the network automatically, and people use and the ability of network resource administration thereby improve.
WWW has become people to obtain the main source of information, along with development of internet technology, people issue oneself on network information has become a more and more simple thing, but present Web depends on user's technology, can only be understood by the people, machine but can't be discerned automatically, and Web only is responsible for providing information to the user, remaining thing all needs user oneself to go to handle, and this has been with great inconvenience to the user.Semantic net is arisen at the historic moment under this environment just.Semantic net (Semantic Web) is the notion that a Di Mubainasi-Li by World Wide Web Consortium (Tim Berners-Lee) proposed in 1998, its core is: by adding the metadata (Meta data) that can be understood by computing machine to the document on the WWW (as: HTML), thereby make whole internet become a general message exchange media.Semantic web is expanded the ability of WWW by use standard, markup language and relevant handling implement.
Though the information on the WWW is machine-readable, not machine understandable, cause online information to be difficult to be handled automatically thus by computing machine.The approach that solves this difficult problem is exactly to adopt metadata to come information on the Web-indexing, uses the RDF resource description framework to come relation between descriptive metadata and metadata and the metadata then.The RDF[3 that proposes by W3C] and RDFS become the standard language of description Internet resources semanteme at present.The full name of RDF is a resource description framework, is used for the representation element data.RDF tlv triple (Subject, Property, Object) represent relation between resource and the resource, with compare with document or data-centered XML, RDF is to be the method at center with knowledge, be to be what Web, especially semantic net designed specially, and XML provide unified syntax format for message exchange.The aspect that RDF is better than XML is unique expression mode that RDF figure has described resource, and same content XML is had various ways.RDF has only defined the framework that is used to describe resource, but which metadata of definition use is not explained resource, but allows the definition metadata to describe specific resource.The metadata that this definition is come out is called as RDF Schema, is called for short RDFS.RDFS (RDF Schema) can be used to represent the vocabulary in the specific area, compare with XMLDTD or XML Schema, RDFS does not have strict constraint to data, for example, we can use in RDFS the attribute of not definition or class or the like, metadata among the RDFS is divided into two classes: a class is Class, is used to describe the type of resource; Can have inheritance between the Class, can define large-scale split catalog by inheritance, classification at length helps the accurate description to resource.Another kind of is Property, is used to describe the attribute of resource and the relation between the resource.Property itself has Domain and two attributes of Range, and Domain is used for representing the affiliated type of subject of RDF ternary relation, and Range represents the type under the object in the RDF ternary relation.
OrientX is the Native XML data management system by the autonomous research and development in Renmin University of China WAMDM laboratory.Now, the OrientX system is still in the middle of improving.OrientX has kept the tree structure of XML data with Native mode storing X ML data.Buffer management, physical page management, file read-write management function also are provided in the storage subsystem, have supported for the upper layer data administration module provides storage efficiently.
The OrientX/Ontology system is based on the peripheral system that the OrientX system extension obtains, the storage of special disposal Ontology data and management; Can be regarded as the specific versions of OrientX for Ontology.Traditional data management is restricted owing to lacking semantic the support, and the semantic information that Ontology provides has remedied this deficiency.But the management of Ontology data itself becomes a new problem again.The pattern differentials of relation schema and Ontology data is excessive, is not suitable for the Ontology data management; The correlation technique of wishing to use for reference in the XML data management provides more effective Ontology data management.
Summary of the invention
In order to solve above-mentioned traditional problem, so one object of the present invention has proposed a kind of ontology data management system and method exactly.
In one aspect of the invention, a kind of ontology data management system comprises: data memory module, and being used for the data in magnetic disk page or leaf is organized and managed and provides with the logical page (LPAGE) to the upper strata is the mobile sms service of unit; Data management module is used to carry out the ontology data management of triple form, and it comprises storage organization, visit and the renewal of tlv triple in data page; Data processing module is used for being responsible for importing, the inquiry of ontology data and upgrades processing; And subscriber interface module, be used for being responsible for the mutual of user and system.
In aspect this, data memory module comprises: memory management module is used for the management to logical page (LPAGE) and buffer zone; The data set administration module is used for the management to the relevant information of data set; And document management module is used to create file and to the various operations of file.
In aspect this, data management module comprises: the class administration module is used for class is managed picture; Attribute management module is used for various attributes are managed; The same item administration module is used for identical class is managed picture; And the inhomogeneity administration module, be used for inhomogeneity is managed picture.
In aspect this, data processing module comprises importing module and enquiry module.
In aspect this, import module and comprise: the RDF resolver is used to resolve the RDF document; Import inference engine, be used for when importing, partial data being carried out reasoning; And the importing execution module, be used for the data of resolving well and finish reasoning are stored.
In aspect this, enquiry module comprises: the SPASQL language parser is used to resolve the SPASQL query statement; The inquiry inference engine is used for the data that data reasoning is obtained implying; The inquiry execution module is used to choose the data that satisfy condition.
In aspect this, subscriber interface module is made of API and two kinds of interfaces of GUI, and the user both can use the application program on API exploitation upper strata, also can carry out data management by graphical interfaces.
In another aspect of this invention, a kind of introduction method that is used in the ontology data management system has been proposed, this management system comprises data memory module, data management module, by importing the data processing module that module and enquiry module constitute, and subscriber interface module, wherein this data processing module comprises the class administration module, attribute management module, the same item administration module, and inhomogeneity administration module, this importing module comprises the RDF resolver, import inference engine, and importing execution module, this enquiry module comprises the SPASQL language parser, the inquiry inference engine, and the inquiry execution module, this introduction method comprises step: the user submits the RDF document to; Resolve by the RDF resolver, obtain the memory model of body; Import execution module and from memory model, extract tlv triple, pass to the importing inference engine and carry out reasoning; Tlv triple that reasoning obtains and original tlv triple are transferred to data management module and are handled together; Data management module is responsible for according to the corresponding sub-administration module in different semantic selection class administration modules, attribute management module, same item administration module and the inhomogeneity administration module of tlv triple; Described corresponding sub-administration module is organized into data recording with tlv triple, gives memory management module and writes logical page (LPAGE); And memory management module is finally write data in the data file by document management module.
In still another aspect of the invention, a kind of querying method that is used in the ontology data management system has been proposed, this management system comprises data memory module, data management module, by importing the data processing module that module and enquiry module constitute, and subscriber interface module, wherein this data processing module comprises the class administration module, attribute management module, the same item administration module, and inhomogeneity administration module, this importing module comprises the RDF resolver, import inference engine, and importing execution module, this enquiry module comprises the SPASQL language parser, the inquiry inference engine, and the inquiry execution module, this querying method comprises step: the user submits the SPARQL inquiry to; By the SPARQL resolver query statement is resolved; The query engine module receives the inquiry after resolving and delivers to data management module; Data management module carries out the inquiry of single tlv triple, and intermediate result is handled, and obtains final result and returns to the user.
Description of drawings
In conjunction with accompanying drawing subsequently, what may be obvious that from following detailed description draws above-mentioned and other purpose of the present invention, feature and advantage.In the accompanying drawings:
Fig. 1 has provided the system construction drawing according to OrientX/Ontology of the present invention;
Fig. 2 has provided the importing process flow diagram of the method according to this invention; And
Fig. 3 has provided the querying flow figure of the method according to this invention.
Embodiment
At first, with reference to figure 1, the system architecture according to OrientX/Ontology of the present invention is described in detail.Fig. 1 has provided the system construction drawing of OrientX/Ontology.
As shown in Figure 1, this system is bottom-up can be divided into four parts, and it is respectively data memory module, data management module, data processing module and subscriber interface module.Briefly introduce the effect of each layer below.
Data memory module is made of these three modules of memory management module, data set administration module and document management module, and it is the mobile sms service of unit that this accumulation layer is used for the data in magnetic disk page or leaf is organized and managed and provides with the logical page (LPAGE) to the upper strata, and wherein memory management module is used for the management to logical page (LPAGE) and buffer zone; The data set administration module is used for the management to the relevant information of data set; Document management module is used to create file and to the various operations of file.
Data management module is made of class administration module, attribute management module, same item administration module and inhomogeneity administration module, and this data management module is used to carry out the ontology data management of triple form, it comprises storage organization, visit and the renewal of tlv triple in data page, wherein the class administration module is used for class is managed picture, attribute management module is used for various attributes are managed, the same item administration module is used for identical class is managed picture, and the inhomogeneity administration module is used for inhomogeneity is managed picture.These four different administration modules have defined storage organization separately to picture, visit and updating submodule for different separately respectively.
Data processing module is made of data importing module and enquiry module, and this data processing module is responsible for importing, the inquiry of ontology data and is upgraded processing.In the middle of the importing module, comprise the RDF resolver, import inference engine and import execution module.Wherein the RDF resolver is used to resolve the RDF document; Importing inference engine is used for when importing partial data being carried out reasoning; Importing execution module is used for the data of resolving well and finish reasoning are stored.In the middle of enquiry module, comprise SPASQL language parser, inquiry inference engine and inquiry execution module.Wherein the SPASQL language parser is used to resolve the SPASQL query statement; The inquiry inference engine is used for data that data reasoning is obtained implying; The inquiry execution module is used to choose the data that satisfy condition.
Subscriber interface module is made of API and two kinds of interfaces of GUI, is responsible for the mutual of user and system.The user both can use the application program on API exploitation upper strata, also can carry out data management by graphical interfaces.
Flowing of data can be classified as two treatment schemees in the system, is respectively to import flow process and querying flow, simply introduces these two flow processs below respectively.
Fig. 2 has explained the process of the importing flow process of the inventive method: the user submits RDF (resource description framework) document to, resolves by the RDF resolver, obtains the memory model of body; Import execution module and from memory model, extract tlv triple, pass to the importing inference engine and carry out reasoning; Tlv triple that reasoning obtains and original tlv triple are transferred to data management module and are handled together; Data management module is responsible for according to the corresponding sub-administration module in different semantic selection class administration modules, attribute management module, same item administration module and the inhomogeneity administration module of tlv triple; Described corresponding sub-administration module is organized into data recording with tlv triple, gives memory management module and writes logical page (LPAGE); Memory management module is finally write data in the data file by document management module.
Fig. 3 has explained the process of the querying flow of the inventive method: the user submits SPARQL (query language on the RDF) inquiry to, resolve by the SPARQL resolver, the query engine module receives the inquiry after resolving, pass to data management module and carry out the inquiry of single tlv triple, and intermediate result handled, obtain final result and return to the user.
From the above description as can be known, the invention is characterized in the storage means of the Native that has designed the suitable Ontology data characteristics of a cover, inquiry and reasoning all are to be based upon on the storage basis of Native.The Ontology data can be represented with figure, therefore directly represent to bring the difficulty of conversion with relation, thereby search efficiency is affected.Therefore, our Native of OrientX/Ontology system design Storage method is:
1, the RDF tlv triple is organized into the structure of tree.
2, the Ontology data model is a directed acyclic graph, and for the node that a plurality of fathers are arranged, we split this node, and one is major node, and another is the Copy node.
What may be obvious that for the person of ordinary skill of the art draws other advantages and modification.Therefore, the present invention with wider aspect is not limited to shown and described specifying and exemplary embodiment here.Therefore, under situation about not breaking away from, can make various modifications to it by the spirit and scope of claim and the defined general inventive concept of equivalents thereof subsequently.

Claims (9)

1. ontology data management system comprises:
Data memory module, being used for the data in magnetic disk page or leaf is organized and managed and provides with the logical page (LPAGE) to the upper strata is the mobile sms service of unit;
Data management module is used to carry out the ontology data management of triple form, and it comprises storage organization, visit and the renewal of tlv triple in data page;
Data processing module is used for being responsible for importing, the inquiry of ontology data and upgrades processing;
And
Subscriber interface module is used for being responsible for the mutual of user and system.
2. according to the ontology data management system of claim 1, wherein data memory module comprises:
Memory management module is used for the management to logical page (LPAGE) and buffer zone;
The data set administration module is used for the management to the relevant information of data set; And
Document management module is used to create file and to the various operations of file.
3. according to the ontology data management system of claim 1, wherein data management module comprises:
The class administration module is used for class is managed picture;
Attribute management module is used for various attributes are managed;
The same item administration module is used for identical class is managed picture; And
The inhomogeneity administration module is used for inhomogeneity is managed picture.
4. according to the ontology data management system of claim 1, wherein data processing module comprises importing module and enquiry module.
5. according to the ontology data management system of claim 4, wherein import module and comprise:
The RDF resolver is used to resolve the RDF document;
Import inference engine, be used for when importing, partial data being carried out reasoning; And
Import execution module, be used for the data of resolving well and finish reasoning are stored.
6. according to the ontology data management system of claim 4, wherein enquiry module comprises:
The SPASQL language parser is used to resolve the SPASQL query statement;
The inquiry inference engine is used for the data that data reasoning is obtained implying;
The inquiry execution module is used to choose the data that satisfy condition.
7. according to the ontology data management system of claim 1, wherein subscriber interface module is made of API and two kinds of interfaces of GUI, and the user both can use the application program on API exploitation upper strata, also can carry out data management by graphical interfaces.
8. introduction method that is used in the ontology data management system, this management system comprises data memory module, data management module, by importing the data processing module that module and enquiry module constitute, and subscriber interface module, wherein this data processing module comprises the class administration module, attribute management module, the same item administration module, and inhomogeneity administration module, this importing module comprises the RDF resolver, import inference engine, and importing execution module, this enquiry module comprises the SPASQL language parser, the inquiry inference engine, and the inquiry execution module, this introduction method comprises step:
The user submits the RDF document to;
Resolve by the RDF resolver, obtain the memory model of body;
Import execution module and from memory model, extract tlv triple, pass to the importing inference engine and carry out reasoning;
Tlv triple that reasoning obtains and original tlv triple are transferred to data management module and are handled together;
Data management module is responsible for according to the corresponding sub-administration module in different semantic selection class administration modules, attribute management module, same item administration module and the inhomogeneity administration module of tlv triple;
Described corresponding sub-administration module is organized into data recording with tlv triple, gives memory management module and writes logical page (LPAGE); And
Memory management module is finally write data in the data file by document management module.
9. querying method that is used in the ontology data management system, this management system comprises data memory module, data management module, by importing the data processing module that module and enquiry module constitute, and subscriber interface module, wherein this data processing module comprises the class administration module, attribute management module, the same item administration module, and inhomogeneity administration module, this importing module comprises the RDF resolver, import inference engine, and importing execution module, this enquiry module comprises the SPASQL language parser, the inquiry inference engine, and the inquiry execution module, this querying method comprises step:
The user submits the SPARQL inquiry to;
By the SPARQL resolver query statement is resolved;
The query engine module receives the inquiry after resolving and delivers to data management module;
Data management module carries out the inquiry of single tlv triple, and intermediate result is handled, and obtains final result and returns to the user.
CNA2008100560987A 2008-01-11 2008-01-11 Ontology data administrative system and method Pending CN101216851A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008100560987A CN101216851A (en) 2008-01-11 2008-01-11 Ontology data administrative system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008100560987A CN101216851A (en) 2008-01-11 2008-01-11 Ontology data administrative system and method

Publications (1)

Publication Number Publication Date
CN101216851A true CN101216851A (en) 2008-07-09

Family

ID=39623283

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008100560987A Pending CN101216851A (en) 2008-01-11 2008-01-11 Ontology data administrative system and method

Country Status (1)

Country Link
CN (1) CN101216851A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102722569A (en) * 2012-05-31 2012-10-10 浙江理工大学 Knowledge discovery device based on path migration of RDF (Resource Description Framework) picture and method
CN101853251B (en) * 2009-03-30 2012-11-21 华为技术有限公司 Method, device and system for protecting sensitive resource description framework data
CN102929898A (en) * 2011-08-12 2013-02-13 微软公司 Semantic query engine of structured database
CN102999563A (en) * 2012-11-01 2013-03-27 无锡成电科大科技发展有限公司 Network resource semantic retrieval method and system based on resource description framework
CN104205092A (en) * 2012-03-28 2014-12-10 国际商业机器公司 Building an ontology by transforming complex triples
CN105447156A (en) * 2015-11-30 2016-03-30 北京航空航天大学 Resource description framework distributed engine and incremental updating method
CN105760425A (en) * 2016-01-17 2016-07-13 曲阜师范大学 Ontology data storage method
CN109101550A (en) * 2018-07-09 2018-12-28 广州极天信息技术股份有限公司 Semantic network management system, method, equipment and storage medium

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101853251B (en) * 2009-03-30 2012-11-21 华为技术有限公司 Method, device and system for protecting sensitive resource description framework data
CN102929898A (en) * 2011-08-12 2013-02-13 微软公司 Semantic query engine of structured database
CN102929898B (en) * 2011-08-12 2017-08-15 微软技术许可有限责任公司 The semantic query engine of structured database
CN104205092B (en) * 2012-03-28 2017-03-01 国际商业机器公司 Set up the method and system of body by the complicated tlv triple of conversion
CN104205092A (en) * 2012-03-28 2014-12-10 国际商业机器公司 Building an ontology by transforming complex triples
CN102722569A (en) * 2012-05-31 2012-10-10 浙江理工大学 Knowledge discovery device based on path migration of RDF (Resource Description Framework) picture and method
CN102722569B (en) * 2012-05-31 2014-10-22 浙江理工大学 Knowledge discovery device based on path migration of RDF (Resource Description Framework) picture and method
CN102999563A (en) * 2012-11-01 2013-03-27 无锡成电科大科技发展有限公司 Network resource semantic retrieval method and system based on resource description framework
CN105447156A (en) * 2015-11-30 2016-03-30 北京航空航天大学 Resource description framework distributed engine and incremental updating method
CN105760425A (en) * 2016-01-17 2016-07-13 曲阜师范大学 Ontology data storage method
CN105760425B (en) * 2016-01-17 2018-12-04 曲阜师范大学 A kind of ontology data storage method
CN109101550A (en) * 2018-07-09 2018-12-28 广州极天信息技术股份有限公司 Semantic network management system, method, equipment and storage medium
CN109101550B (en) * 2018-07-09 2022-05-03 广州极天信息技术股份有限公司 Semantic web management system, method, device and storage medium

Similar Documents

Publication Publication Date Title
Bizer et al. Dbpedia-a crystallization point for the web of data
Weissenberg et al. An ontology-based approach to personalized situation-aware mobile service supply
Hyvönen Semantic portals for cultural heritage
CN101216851A (en) Ontology data administrative system and method
Cole et al. Library marc records into linked open data: Challenges and opportunities
Yoose et al. The linked open data landscape in libraries and beyond
Orgel et al. A metadata model and mapping approach for facilitating access to heterogeneous cultural heritage assets
Hoang et al. Retracted: Semantic information integration with linked data mashups approaches
Tachmazidis et al. A Hypercat-enabled semantic Internet of Things data hub
Simon et al. Peripleo: a tool for exploring heterogenous data through the dimensions of space and time
Cayzer Semantic blogging: Spreading the semantic web meme
Singer Linked library data now!
Salas et al. Stdtrip: Promoting the reuse of standard vocabularies in open government data
Lu et al. Language engineering for the Semantic Web: A digital library for endangered languages
Orlandi et al. Semantic search on heterogeneous wiki systems
Hou et al. Modeling and representation of intangible cultural heritage knowledge using linked data and ontology
Stergiou et al. OpenAPI QL: searching in OpenAPI service catalogs
Piotrowski et al. The Labeling System: A New Approach to Overcome the Vocabulary Bottleneck
Zhao et al. Heterogeneous Social Linked Data Integration and Sharing for Public Transportation
Fakhre Alam et al. A comparative study of RDF and topic maps development tools and APIs
Kim et al. RDF based linked open data management as a DaaS platform
Cortese et al. Extending the Digital Archives of Italian Psychology with Semantic Data.
Mitchell et al. LA meta (data): Exploring vocabulary interoperability in libraries, Archives and Museums
Le et al. Vietnamese tourism linked open data
Plewe A qualified assertion database for the history of places

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20080709