KR20100070084A - Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology - Google Patents

Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology Download PDF

Info

Publication number
KR20100070084A
KR20100070084A KR1020080128681A KR20080128681A KR20100070084A KR 20100070084 A KR20100070084 A KR 20100070084A KR 1020080128681 A KR1020080128681 A KR 1020080128681A KR 20080128681 A KR20080128681 A KR 20080128681A KR 20100070084 A KR20100070084 A KR 20100070084A
Authority
KR
South Korea
Prior art keywords
path
ontology
graph
knowledge
query
Prior art date
Application number
KR1020080128681A
Other languages
Korean (ko)
Inventor
최은정
최호준
Original Assignee
주식회사 케이티
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 케이티 filed Critical 주식회사 케이티
Priority to KR1020080128681A priority Critical patent/KR20100070084A/en
Publication of KR20100070084A publication Critical patent/KR20100070084A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to an apparatus and method for retrieving knowledge related to user queries in a large volume ontology in real time. The present invention relates to a large-scale ontology representing real-world knowledge in the form of a node-arc graph and to each node in the converted ontology graph. Searching the knowledge related to user query in large-scale ontology in real time by managing partial pairs and generating partial graphs by searching and integrating the paths related to user query when user query comes in. An apparatus and a method thereof are provided.

To this end, the present invention provides a knowledge retrieval apparatus, comprising: ontology graph converting means for converting a large-scale ontology into an ontology graph; Path generation means for generating a path from the ontology graph converted by the ontology graph conversion means; Path storage means for storing a path generated by said path generation means; Route retrieving means for retrieving a user query and a related route from the route storing means; Partial graph generating means for generating a partial graph by processing the path searched by said path searching means; And triple converting means for converting the partial graph generated by the partial graph generating means into a triple set.

Description

Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology}

The present invention relates to an apparatus and method for retrieving knowledge related to user queries in a large-scale ontology in real time, and more particularly, to a large-scale ontology expressing real-world knowledge in the form of a triple (sub-predicate-object). After converting the subject and the object of each triple into an ontology graph having a descriptor as an arc and managing the paths obtained for each node pair in the converted ontology graph, the user The present invention relates to an apparatus and method for retrieving knowledge related to a user query in a large-scale ontology by generating a partial graph by searching and integrating paths related to the user query when a query comes in.

In the following embodiment, a method of converting an ontology into a graph, a method of generating paths for each node pair in an ontology graph, a method of storing the generated paths, and searching for a part related to a user query Although a method of generating a graph is described as an example, it will be apparent that the present invention is not limited thereto.

Ontology is receiving great attention as it is recognized as a core technology of the Semantic Web, which is emerging as the next generation web. Ontology is a systematic expression of knowledge and semantic relationships between them. Ontologies enable you to obtain not only the information you need but also other relevant information, so you can use intelligent QnA systems, intelligent search engines, etc. It can be used for the same application.

Looking at the related patents and research papers to use the ontology in search, the overall focus is on providing information on the ontology that is directly related to the user search word.

First, in the paper "Multimedia Information Retrieval Using Meaningful Relevance" (Park Chang-Seop, Korean Society for Internet Information, Vol. 8, pp. 67-79), the concepts that are directly related to the user's search terms in multimedia retrieval are discussed. It describes a methodology for providing a variety of multimedia content as a search result. To this end, we proposed an algorithm that numerically calculates the semantic relation between user search terms and concepts included in ontology.

Next, the Korean Patent "Browse system and method for browsing information using ontologies (Patent Registration No. 10-0820746)" provides the ontology on the screen in the form of a node-arc on the screen for the user's search convenience, the user of the graph Browsing systems and methods are described in which selecting a node moves the center of the graph and presents it back to the user. This prior patent also has a limitation in displaying ontologies on the screen, so it is shown mainly on nodes directly connected to the node selected by the user.

On the other hand, in recent years, attempts have been made to find information that is directly related to a user's search term, and even hidden information, that is, information that is indirectly related. Recently, academia has been conducting research to find out the relation between people who have no acquaintance by constructing a human network that expresses the relationship between two people as nodes and arcs. In the same context, studies have been conducted to find the hidden (indirect) linkage between two nodes after converting ontology map node-arc graphs. For example, while Student A and Student B, who are attending the same university, have no acquaintance with each other, the relationship between the nodes in the ontology graph indicates that they are taking different subjects of Professor C. In order to find the hidden (indirect) association between any two nodes on the ontology graph, researches on algorithms for quickly searching the path between the two nodes have been conducted.

However, these studies limit the number of search terms (number of nodes selected) to two, excluding the method of finding partial graphs (hidden associations) connected between search terms when there are three or more search terms.

In addition, as the ontology becomes larger in size, it is almost impossible in terms of time complexity of an algorithm to provide a search result to a user in real time by performing an algorithm simultaneously with inputting a search word.

Therefore, there is an urgent need for an algorithm that finds hidden (indirect) associations among three or more search terms without limiting the number of user search terms, as well as a semantic search system that can search knowledge in real time on a large scale ontology. .

As described above, in the ontology search, an algorithm for finding hidden (indirect) associations among three or more search terms without limiting the number of user search terms is required, and the core of the Semantic Web, which is attracting attention as the next generation web environment, is required. As the ontology of the technology becomes larger, the problem of how to quickly search the knowledge described in the ontology in real time has been encountered, and it is an object of the present invention to solve such a problem and meet the needs.

Accordingly, the present invention provides an apparatus and method for retrieving knowledge related to a user query in real time on a large scale ontology for retrieving and providing a path related to a user query in real time after converting and storing the large ontology in a path form. The purpose is.

That is, the present invention converts a large-scale ontology representing the real world knowledge into a node-arc graph form and manages paths obtained for each node pair in the converted ontology graph. It is an object of the present invention to provide an apparatus and method for retrieving knowledge related to user query in real time by generating partial graphs by searching and integrating paths related to user query.

The objects of the present invention are not limited to the above-mentioned objects, and other objects and advantages of the present invention which are not mentioned above can be understood by the following description, and will be more clearly understood by the embodiments of the present invention. Also, it will be readily appreciated that the objects and advantages of the present invention may be realized by the means and combinations thereof indicated in the claims.

An apparatus of the present invention for achieving the above object is a knowledge retrieval apparatus, comprising: an ontology graph converting means for converting a large capacity ontology into an ontology graph; Path generation means for generating a path from the ontology graph converted by the ontology graph conversion means; Path storage means for storing a path generated by said path generation means; Route retrieving means for retrieving a user query and a related route from the route storing means; Partial graph generating means for generating a partial graph by processing the path searched by said path searching means; And triple converting means for converting the partial graph generated by the partial graph generating means into a triple set.

On the other hand, the method of the present invention for achieving the above object, in a knowledge search method, comprising: converting a triple-based ontology into an ontology graph in the form of a node-arc; Generating a path for each node pair of the converted ontology graph and indexing and storing the path in a path repository; A route retrieval step of retrieving a route associated with a user query from the route repository; Generating a partial graph by incorporating the searched paths or removing the meaningless paths; And converting the generated partial graph into a triple set.

The present invention as described above, there is an effect that can search in real time the knowledge associated with the user query in a large capacity ontology.

That is, the present invention converts a large-scale ontology representing the real world knowledge into a node-arc graph form and manages paths obtained for each node pair in the converted ontology graph. When the partial graph is generated by searching and integrating the paths related to the user query, it is possible to search the knowledge related to the user query in real time.

In addition, the present invention can find a hidden (indirect) association between three or more search terms without limiting the number of user search terms.

In addition, the present invention can be applied to a variety of applications (eg, semantic web-based search engine, etc.) that require retrieving knowledge in real time from a large-scale ontology, thereby improving the quality of the search.

BRIEF DESCRIPTION OF THE DRAWINGS The above and other objects, features and advantages of the present invention will become more apparent from the following detailed description of the present invention when taken in conjunction with the accompanying drawings, It can be easily carried out. In addition, in describing the present invention, when it is determined that the detailed description of the known technology related to the present invention may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

First, the semantic web technology will be described in more detail to help understanding of the present invention.

Tim Berners-Lee first proposed the World Wide Web in 1989, marking the existing client-server architecture and HyperText Markup Language (HTML). The language allows users to post personal information anywhere on the Internet and to have a shared infrastructure for accessing that information through a browser. As a result, a great deal of information has been put on the Internet and distributed, and a great deal of information has existed on the Internet. The sharing of this information promotes social and technological development, and as a result, leads the innovation of the information society. Became.

However, the enormous amount of information has led to more and more efforts to find the information they want, and the emergence of various applications and services using the web has made it difficult and effective to find and use. Many difficulties have arisen.

In particular, the existing web-based search method is mainly searched by keywords, and the method of determining the priority of web documents using frequency of words or lexical information is mainly used to find a desired web document. There is a limit. In addition, it is very difficult to expand, integrate and share related web documents. This problem is because the existing web and markup languages are human-centered and focus on the expression technology of the web browser for humans to see and understand. As a result, the existing web is a human-centered information processing technology that does not provide enough functions for a computer to effectively extract, interpret, and process necessary information on its own.

Then, the Semantic Web is a technology that can extend the existing web to realize semantic interoperability based on well-defined meanings that computers can understand and to build an effective cooperative system between humans and computers. This appeared.

Tim Berners-Lee is not the concept of a new web that is completely different from the existing web, but rather extends the current web to give well-defined meaning to the information on the web, thereby helping computers and people work collaboratively. The role is defined as a paradigm that can be performed. The Semantic Web is designed to understand the meaning of information on the web, not only by people, but also by machines (computers) to provide intelligent services that meet the needs of users, or to facilitate collaboration between people and machines. It is a web with automatic service.

In other words, the semantic web is a next-generation web technology that enables a computer to understand, automate, integrate, and reuse the meaning of information resources.

1) Ontology

Ontology is a formal specification system for shared conceptualization and provides semantic information of domain vocabulary. Ontology is a kind of knowledge expression, and the computer can understand the concept represented by the ontology and process the knowledge. In order to deal with inferences, the ontology's axiom and rule system are needed.

2) semantically annotated web

A semantically annotated web is an ontology annotated web, which is a knowledge base. The Semantic Web can build a huge knowledge base that semantically integrates the distributed information resources of the Internet. In a narrow sense, it may be possible to build a knowledge base of information resources of a company or institution.

3) agent

An agent is an intelligent agent that collects, retrieves and infers information resources on behalf of a person (user), and exchanges information with other agents. Intelligent agents are the core of semantic web-based application systems.

The semantic web realizes semantic interoperability by using ontology and agent technology, and thus, the semantic web can leap from the information-based web to the knowledge-based semantic web.

1 is a block diagram of an embodiment of a device for searching for user query related knowledge in real time and a semantic search system using the same in a large-scale ontology according to the present invention.

As shown in FIG. 1, the semantic search system to which the present invention is applied includes a web document repository 10 storing web documents collected through a web content provider or an automatic web crawler, and the web document repository. Triple-based ontology repository 20 storing ontology constructed by indexing the contents of the web document stored in (10) in the form of triple (predicate-description-object), and receiving and processing query information from a search user terminal. An ontology knowledge retrieval unit 30 for retrieving knowledge (triples) related to a search user query from the query information processing unit 40 from the triple-based ontology repository 20 in real time; Searching the web document repository 10 for web documents having contents of the triples searched by the ontology knowledge search unit 30 as contents A web document searcher 50, and a search result processor for classifying or prioritizing web documents (search results) searched by the web document searcher 50 according to content similarity, 60).

Next, the components of the ontology knowledge search unit 30 will be described in more detail as follows.

As shown in FIG. 1, the ontology knowledge retrieval unit 30 according to the present invention has an ontology stored in the triple-based ontology repository 20, that is, an ontology representing a real world knowledge in a triple form in a node-arc form. An ontology graph converter 31 for converting the ontology graph of the path, a path generator 32 for generating paths for each node pair of the ontology graph converted by the ontology graph converter 31, the path A path storage 33 for storing paths generated by the generation unit 32, and a path search for retrieving paths related to the query from the path storage 33 with a user query received from the query information processing unit 40; The unit 34 generates partial graphs by simply integrating the paths searched by the path search unit 34 or by removing some paths determined to be meaningless. A subgraph generator 35 for converting the subgraphs generated by the subgraph generator 35 into a triple set, and a triple converter 36 for transmitting the subgraph graph to the web document searcher 50. do.

Next, the operation of the ontology knowledge search unit 30 and its specific embodiments will be described in more detail with reference to FIGS. 2 to 4B.

2 is an exemplary view of the ontology graph obtained from the ontology graph conversion unit according to the present invention, Figure 3 is an exemplary view of the table structure of the path storage according to the present invention, Figure 4a is a view of the path storage according to the present invention 4B is a diagram illustrating a structure, and FIG. 4B is a diagram for describing an operation of a path search unit and a partial graph generator according to the present invention.

First, the ontology graph converting unit 31 converts an ontology that expresses real world knowledge in the form of a triple (predicate-predicate-object) to an ontology graph having each of the triplets, an object as a node, and a descriptor as an arc. For example, the knowledge that 'Kim Dong Gun majored in business administration' can be expressed as triple 'Kim Dong Gun-major-business administration', and 'Kim Dong Gun' and 'Business Administration' are nodes and 'major' It can be represented by an arc connecting the

FIG. 2 shows an example of converting ontology graphs into ontology graphs using an ontology language called a resource description framework (RDF) and an RDF schema. In this case, when there is a correlation between two nodes, it is represented by a solid line arc with bidirectionality. Otherwise, it is represented by a unidirectional dotted arc.

When the ontology is converted into an ontology graph in this way, weights can be given to the arcs of each triple. Simply all arcs can be weighted equally equal to 1, or they can be weighted differently depending on the importance of the triples. For example, a method of differentially assigning weights can give a low weight (penalty) when the corresponding triple has appeared a lot lately. Random triple

Figure 112008086771360-PAT00001
Given, arc weights
Figure 112008086771360-PAT00002
Is shown in Equation 1 below.

Figure 112008086771360-PAT00003

Here, t is a parameter for time and means a date from one day before to m days before from the present. If you consider only one year, m is 365.

And

Figure 112008086771360-PAT00004
to be. only,
Figure 112008086771360-PAT00005
Is the number of all triples that occurred in web documents published t days before now,
Figure 112008086771360-PAT00006
Triples occur on web documents published on a date less than t days ago.
Figure 112008086771360-PAT00007
Is the frequency of

And

Figure 112008086771360-PAT00008
to be. only,
Figure 112008086771360-PAT00009
Is the number of all web documents published more than t days before now,
Figure 112008086771360-PAT00010
Is the triple among web documents published on
Figure 112008086771360-PAT00011
Is the number of documents included.

The path generation unit 32 generates the paths for each node pair with the ontology graph received from the ontology graph converter 31. Well known shortest path algorithms can be used to create a path between any two nodes. The shortest path algorithms include the "dijstra" algorithm, the "bellman-ford" algorithm, the "Floyd-warshall" algorithm, and the shortest path algorithm using a directed acyclic graph (DAG). If you want to create multiple paths between two nodes, you can use the k-shortest path algorithm that finds k paths in the shortest path order. However, it should be noted that when using the shortest path algorithm on the ontology graph with arc weights differentially assigned, the higher the importance of the triple, the lower the weight value of the arc.

On the other hand, considering that it is an ontology graph converted from a large ontology, the number of paths for each node pair may be very large. For example, if there are 100,000 nodes, the number of node pairs may be 10 billion, so the number of paths may be 10 billion or more. However, the paths between all node pairs are not meaningful. Therefore, it is not necessary to generate paths for every node pair, and it is reasonable to only generate paths for node pairs that are expected to be meaningful. For example, each node can only generate paths between itself and nodes within a certain number of hops.

In addition, the ontology graph may change as new knowledge is added or changed over time. Then, the path generation unit 32 must obtain new paths for every node pair on the changed ontology graph. However, considering that the ontology graph of the large-scale ontology is very large, it is very unnecessary and time-consuming to regenerate the paths for the small changes of the frequently occurring ontology graph. Therefore, the path generation for the entire ontology graph is performed at regular intervals (for example, every day). When a small change occurs in the ontology graph, the path is generated for the corresponding partial graph each time. For example, when new knowledge is added to the ontology, that is, when new nodes and arcs are added to the ontology graph, the problem is solved by only creating paths between the old and new nodes.

The path storage 33 indexes and stores paths of each node pair generated through the path generation unit 32. An example of a path indexing method for the present invention is to store nodes at both ends, path length (sum of arc number or arc weight), and the like as index information. For example, the path 'Kim Dong-Geun-Major-Business Administration' obtained from the ontology graph of FIG. 2 may be indexed as '(Kim Dong-Gun, Business Administration), 1'. Another path 'Kim Dong-gun-Own-Ontologytech-type-company' can be indexed as '(Kim Dong-gun, Company), 2'. If there is a meaningless node among both nodes, it can be replaced with a meaningful arc (attribute). That is, since 34 is a literal value in the path 'Kim Dong Gun-Age-34', it may be indexed as '(Kim Dong Gun, age), 1'. The following shows an example of indexing several possible paths in the same way. Indexes for each path are underlined.

(Kim Dong Gun, Business Administration), 1 : Kim Dong Gun-Major-Business Administration

(Kim Dong Gun, age), 1 : Kim Dong Gun-Age-34

(Kim Dong Gun, Ontology Tech ), 1 : Kim Dong Gun-Own-Ontology Tech

(Kim Dong Gun, Employer), 1 : Kim Dong Gun-type-employer

(Kim Dong Gun, Company), 2 : Kim Dong Gun-Own-Ontology Tech-type-Company

( Ontology Tech , http://www.ontologytech.co.kr), 1 : Ontology Tech-Homepage-http: //www.ontologytech.co.kr

The indexed paths may be stored in a path table as shown in FIG. 3, and the table may be distributed and stored in a plurality of computers as shown in FIG. 4. As mentioned above, the number of paths could be significantly reduced by only generating paths of node pairs that are expected to be meaningful, but the number of paths may still be large, requiring a large amount of path storage 33. Thus, distributed storage of multiple PCs at a lower cost can solve storage capacity and cost problems.

In addition, as shown in FIG. 4B, when a user query is received later, the same query information is transmitted to each individual computer, and each individual computer simultaneously searches for a relatively small number of paths related to the query. Providing search results can significantly reduce route search time.

The route retrieving unit 34 retrieves the relevant routes from the route storage 33 with the user query received from the query information processing unit 40. First, the query information processor 40 extracts key noun words as a keyword from the user query and transmits the keywords to the path searcher 34. For example, when a user query, 'What company does Kim Dong Gun operate?', The query information processing unit 40 extracts a key noun type keyword 'Kim Dong Gun, company' and transmits it to the path search unit 34. Therefore, the query considered by the path search unit 34 is a noun type, and the number thereof is not limited.

Next, the path searching unit 34 searches for a path based on the number of noun words based on the above-described indexing method.

(1) When one noun word is received from the query information processing unit 40, the path search searches for the shortest paths among the paths including the word in the indexed node information. For example, when the search term is 'Kim Dong Gun', the first or second field of the route table of FIG. 3 is 'Kim Dong Gun' and the path length is 1 to search for paths 1, 2, 3, and 4, respectively. The graph generator 35 transmits the result.

(2) When two noun-type words are received from the query information processing unit 40, the path search finds the shortest path including both words in the indexed node information. For example, when the search term is 'Kim Dong Gun', a route having a route number of 5 may be searched in the route table of FIG. 3. The searched route number 5 is transmitted to the partial graph generator 35.

(3) When three noun-type words are received from the query information processing unit 40, the path search is performed with the first two words as in (2), and then each node and the remaining one of the shortest paths obtained from (2). The shortest path with the word is found and transmitted to the partial graph generator 40. For example, when the search term is 'Kim Dong Gun Company Homepage', first, find the shortest path (Path No. 5) of 'Kim Dong Gun Company'. Next, find the shortest path between nodes of path number 5 'Kim Dong Gun, Ontology Tech, Company' and 'Homepage'. In other words, the shortest path (path number 6) for 'Kim Dong Gun', 'Ontology Tech Homepage' and 'Company Homepage' is searched. Finally, the route numbers 5 and 6 are transmitted to the partial graph generator 35.

(4) When four noun words are received from the query information processing unit 40, the path search is performed on the first three words as in (3), and then all the nodes of the shortest path obtained from the (3) and the remaining ones. The shortest paths with one word are found and transmitted to the partial graph generator 35.

In addition, even when five or more noun-type words are received from the query information processing unit 40, the above-described method may be extended and applied.

The partial graph generator 35 generates a partial graph by simply integrating the paths found by the path searcher 34 or by removing some of the paths determined to be meaningless, and then converts the generated partial graph information into the triple converter 36. To pass). Paths that are determined to be meaningless include, for example, paths that have only one 'type' as an arc. These paths are often knowledge that most people know about (eg, 'Hyo-ri-type-singer'), so removing them doesn't affect search results much. Thus, in the case of the example (1), one partial graph can be generated by simply integrating the paths 1, 2, 3, and 4, or after removing the path 4, the partial graphs are integrated by combining the paths 1, 2, and 3 You can also create

FIG. 5 is a flowchart illustrating a method for searching user query related knowledge in real time in a large-scale ontology according to an embodiment of the present invention. Since the specific embodiments are the same as described above, the technical gist of the operation method is briefly described herein. Let's explain.

First, the ontology graph converter 31 converts the ontology stored in the triple-based ontology repository 20, that is, the ontology in which the knowledge of the real world is expressed in the triple form (501). In this case, the converted ontology graph is an ontology graph in the form of a node-arc.

Thereafter, the path generator 32 generates paths for each node pair of the ontology graph converted by the ontology graph converter 31 and stores the paths in the path storage 33 (502).

Thereafter, the path search unit 34 retrieves the paths related to the query from the path store 33 with the user query from the query information processing unit 40 (503).

Subsequently, the partial graph generator 35 processes the paths searched by the path searcher 34 to generate partial graphs (504). In this case, partial graphs are generated by simply integrating the searched paths or by removing some of the paths determined to be meaningless.

Thereafter, the triple converter 36 converts the subgraphs generated by the subgraph generator 35 into a triple set and transmits the subgraphs to the web document searcher 50 (505).

On the other hand, the method of the present invention as described above can be written in a computer program. And the code and code segments constituting the program can be easily inferred by a computer programmer in the art. In addition, the written program is stored in a computer-readable recording medium (information storage medium), and read and executed by a computer to implement the method of the present invention. The recording medium may include any type of computer readable recording medium.

The present invention described above is capable of various substitutions, modifications, and changes without departing from the technical spirit of the present invention for those skilled in the art to which the present invention pertains. It is not limited by the drawings.

The present invention solves the problem that it is difficult to search the knowledge described in the ontology in real time as the ontology becomes large, so that various applications (eg, semantic web-based search engines, etc.) systems that require retrieval of the knowledge from the large-scale ontology are active. Can be utilized.

1 is a configuration diagram of an apparatus for searching for user query related knowledge in real time and a semantic search system using the same in a large-scale ontology according to the present invention;

2 is an exemplary view of an ontology graph obtained from an ontology graph converter according to the present invention;

3 is an exemplary diagram of a table structure of a path store according to the present invention;

Figure 4a is an exemplary view showing the structure of the path storage according to the present invention,

4B is a diagram for describing an operation of a path search unit and a partial graph generation unit according to the present invention;

5 is a flowchart illustrating a method for searching for user query related knowledge in real time in a large-scale ontology according to the present invention.

* Explanation of symbols for the main parts of the drawings

10: Web Document Repository 20: Triple Based Ontology Repository

30: ontology knowledge search unit 31: ontology graph conversion unit

32: path generation unit 33: path storage

34: path search unit 35: partial graph generation unit

36: triple conversion unit 40: query information processing unit

50: web document search unit 60: search result processing unit

Claims (7)

In the knowledge search apparatus, Ontology graph converting means for converting the large-capacity ontology into an ontology graph; Path generation means for generating a path from the ontology graph converted by the ontology graph conversion means; Path storage means for storing a path generated by said path generation means; Route retrieving means for retrieving a user query and a related route from the route storing means; Partial graph generating means for generating a partial graph by processing the path searched by said path searching means; And Triple conversion means for converting the subgraph generated by the subgraph generating means into a triple set Knowledge searching device comprising a. The method of claim 1, The route generating means, And knowledge path generating means for each node pair of the ontology graph converted by the ontology graph converting means. The method of claim 2, The route storage means, And indexing and storing the paths of the pairs of nodes generated by the path generation means, and storing both nodes and path lengths (sum of arc numbers or arc weights) as index information. 4. The method according to any one of claims 1 to 3, The ontology graph conversion means, Knowledge retrieval device that converts the ontology stored in the external triple-based ontology repository into an ontology graph in the form of a node-arc. The method of claim 4, wherein The partial graph generating means, And generating partial graphs by integrating the paths searched by the path search means or by removing the meaningless paths. In the knowledge search method, Converting the triple based ontology into an ontology graph in the form of a node-arc; Generating a path for each node pair of the converted ontology graph and indexing and storing the path in a path repository; A route retrieval step of retrieving a route associated with a user query from the route repository; Generating a partial graph by incorporating the searched paths or removing the meaningless paths; And Converting the generated subgraph into a triple set; Knowledge search method comprising a. The method of claim 6, The path search step, A first step of searching for the shortest path among paths including the one word in indexed end node information when one query word is received; A second process of searching for the shortest path including the two query terms in the indexed node information when two query terms are received; When three queries are received, after performing the second process with the first two query terms of the three queries, the shortest path between each node of the shortest path obtained from the second process and the other one of the three queries A third process of searching for a route; When four queries are received, all nodes of the shortest path obtained from the third process and the other one of the four queries are performed after performing the third process on the first three query terms of the four query words. A fourth process of searching for the shortest path of the; And When five or more query terms are received, all nodes of the shortest path obtained from the immediately preceding procedure and the last one query word are performed after the previous process is performed on the query words except the last one of the query words. The fifth process of performing expansion by searching the shortest path with Knowledge search method comprising a.
KR1020080128681A 2008-12-17 2008-12-17 Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology KR20100070084A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020080128681A KR20100070084A (en) 2008-12-17 2008-12-17 Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020080128681A KR20100070084A (en) 2008-12-17 2008-12-17 Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology

Publications (1)

Publication Number Publication Date
KR20100070084A true KR20100070084A (en) 2010-06-25

Family

ID=42367959

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020080128681A KR20100070084A (en) 2008-12-17 2008-12-17 Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology

Country Status (1)

Country Link
KR (1) KR20100070084A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101226162B1 (en) * 2012-07-30 2013-01-24 한국과학기술정보연구원 Method and apparatus for converting ontology date to graph data
KR101648011B1 (en) * 2015-06-30 2016-08-12 경희대학교 산학협력단 Method and apparatus for frequent subgraph mining using embedding overlapped relationships
KR102079289B1 (en) * 2019-04-23 2020-04-07 주식회사 비닛 Wine recommendation system and method
KR102147854B1 (en) * 2020-06-08 2020-08-25 한화시스템(주) Battlefield situation multiple reasoning system and method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101226162B1 (en) * 2012-07-30 2013-01-24 한국과학기술정보연구원 Method and apparatus for converting ontology date to graph data
KR101648011B1 (en) * 2015-06-30 2016-08-12 경희대학교 산학협력단 Method and apparatus for frequent subgraph mining using embedding overlapped relationships
KR102079289B1 (en) * 2019-04-23 2020-04-07 주식회사 비닛 Wine recommendation system and method
KR102147854B1 (en) * 2020-06-08 2020-08-25 한화시스템(주) Battlefield situation multiple reasoning system and method

Similar Documents

Publication Publication Date Title
Jain et al. Ontology development and query retrieval using protégé tool
Maali et al. Re-using Cool URIs: Entity Reconciliation Against LOD Hubs.
Dong et al. A survey in semantic search technologies
JP2015212947A (en) Apparatus and method for web page access
Singh et al. Ontology development using Hozo and Semantic analysis for information retrieval in Semantic Web
Aksac et al. A novel semantic web browser for user centric information retrieval: PERSON
Xie et al. An evolvable and transparent data as a service framework for multisource data integration and fusion
KR20100070084A (en) Apparatus and method for in real time retrieving knowledge relevant to user's query from a large-scale ontology
Belozerov et al. Semantic web technologies: Issues and possible ways of development
KR20070065774A (en) System and method for managing a semantic blog using the ontology
Marx et al. Exploring term networks for semantic search over RDF knowledge graphs
Angele et al. Semantic Web empowered E-tourism
Chaudhary et al. A novel ontology design and comparative analysis of various retrieval schemes on education domain in protégé
Kettouch et al. Using semantic similarity for schema matching of semi-structured and linked data
Kim et al. The index organizations for RDF and RDF schema
Konstantinou et al. Deploying linked open data: Methodologies and software tools
KR20100003084A (en) Apparatus and method for extracting partial ontology graph, and apparatus and method for semantic matching between user's question and ontology using thereof
TWI442249B (en) Domain Knowledge Network Construction Method and Its System
Mullins et al. Treelicious: a system for semantically navigating tagged web pages
Chun et al. Semantic annotation and search for deep web services
Nešić et al. Publishing agro-environmental resources as linked data
Uppal et al. Semantic web mining and semantic search engine: A review
Soza et al. Web ontology language applied to the tourism sector
Goel et al. Semantic Web Engineering: Boon or Bane
Movva et al. Noesis: a semantic search engine and resource aggregator for atmospheric science

Legal Events

Date Code Title Description
WITN Withdrawal due to no request for examination