KR970049752A - Korean Natural Language Query Information Retrieval Using Verb Information - Google Patents

Korean Natural Language Query Information Retrieval Using Verb Information Download PDF

Info

Publication number
KR970049752A
KR970049752A KR1019950052681A KR19950052681A KR970049752A KR 970049752 A KR970049752 A KR 970049752A KR 1019950052681 A KR1019950052681 A KR 1019950052681A KR 19950052681 A KR19950052681 A KR 19950052681A KR 970049752 A KR970049752 A KR 970049752A
Authority
KR
South Korea
Prior art keywords
verb
word
keyword
file
index
Prior art date
Application number
KR1019950052681A
Other languages
Korean (ko)
Other versions
KR0160919B1 (en
Inventor
장호욱
박세영
Original Assignee
양승택
한국전자통신연구원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 양승택, 한국전자통신연구원 filed Critical 양승택
Priority to KR1019950052681A priority Critical patent/KR0160919B1/en
Publication of KR970049752A publication Critical patent/KR970049752A/en
Application granted granted Critical
Publication of KR0160919B1 publication Critical patent/KR0160919B1/en

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

본 발명은 한국어 자연어 질의에 대해 키워드와 함께 동사 정보를 이용하여 문서 순위 리스트를 제공하는 정보검색 방법에 관한 것으로, 입력 텍스트 화일에서 키워드, 키워드 + 동사 기본형로 이루어진 색인 화일을 작성하는 제1단계; 상기 제1단계와 동일한 방법으로 사용자 자연어 질의에서 비교 대상이 되는 색인어들을 추출하여 질의어 색인 리스트를 작성하는 제2단계; 및 상기 색인 화일과 질의어 색인 리스트를 비교하여 빈도수와 중요도에 따라 해당 문서를 우선 순위에 따라 보이는 제3단계로 수행되는 것을 특징으로 하며, 키워드만을 사용하는 경우보다 상세하고 정확하게 검색 대상 텍스트와 자연어 질의어를 표현하여 정보검색 과정을 수행함으로써 키워드만을 사용하는 것보다 정확한 문서 순위 리스트를 제공할 수 있는 효과가 있다.The present invention relates to an information retrieval method for providing a document ranking list using verb information together with keywords for a Korean natural language query, comprising: a first step of creating an index file consisting of a keyword, a keyword + a verb basic form in an input text file; A second step of extracting index words to be compared in a user natural language query and creating a query word index list in the same manner as the first step; And comparing the index file with the query index list and performing the third step of displaying the document according to the frequency and the importance according to the priority. The searched text and the natural language query word are more precisely and accurately than the case of using only the keyword. By expressing the information retrieval process, it is possible to provide an accurate document ranking list than using only keywords.

Description

동사정보를 이용한 한국어 자연어 질의 정보검색 방법Korean Natural Language Query Information Retrieval Using Verb Information

본 내용은 요부공개 건이므로 전문내용을 수록하지 않았음Since this is an open matter, no full text was included.

제1도는 본 발명에 의한 정보검색 방법의 수행 과정을 나타낸 흐름도.1 is a flowchart illustrating a process of performing an information retrieval method according to the present invention.

Claims (2)

입력 텍스트 화일에서 "키워드", "키워드 + 동사 기본형"로 이루어진 색인 화일을 작성하는 제1단계; 상기 제1단계와 동일한 방법으로 사용자 자연어 질의에서 비교 대상이 되는 색인어들을 추출하여 질의어 색인 리스트를 작성하는 제2단계; 및 상기 색인 화일과 질의어 색인 리스트를 비교하여 빈도수와 중요도에 따라 해당 문서를 우선 순위에 따라 보이는 제3단계로 수행되는 것을 특징으로 하는 한국어 자연어 질의 정보검색 방법.A first step of creating an index file consisting of "keyword", "keyword + verb basic form" in the input text file; A second step of extracting index words to be compared in a user natural language query and creating a query word index list in the same manner as the first step; And comparing the index file with the query word index list and performing a third step of displaying the corresponding documents in order of priority according to frequency and importance. 제1항에 있어서, 상기 제1단계는 입력 텍스트 화일의 단어들을 하나씩 읽고, 화일의 끝인지를 검사하는 제1과정; 상기 제1과정의 검사 결과, 화일의 끝이면 종료하고, 화일의 끝이 아니면 동사 사전에 일치하는 동사 파생형 단어가 있는지 검사하는 제2과정; 상기 제2과정의 검사 결과, 일치하는 동사 파생형 단어가 있는 경우에는 해당 동사의 기본형을 가져오며, 이전 단어가 명사를 포함하고 있는지 검사하여 이전 단어가 명사를 포함하고 있으면 이전 단어까지 만든 키워드와 "키워드 + 동사" 기본형을 색인화일에 등록하고, 이전 단어가 명사를 포함하지 않는 경우에는 동사 정보를 색인에 등록하지 않는 제3과정; 및 상기 제2과정의 검사 결과, 일치하는 동사 파생형 단어가 없는 경우에는 현재의 단어가 명사를 포함하는 단어인지를 검사하며, 명사를 포함하지 않는 경우에는 이전 단어까지 추출된 키워드 여부를 검사하여 추출된 "키워드"를 색인화일에 등록하고, 현재 단어가 명사를 포함하는 경우 이전 단어까지 추출된 키워드와 함께 키워드를 만들어 가는 제4과정으로 수행되는 것을 특징으로 하는 한국어 자연어 질의 정보검색 방법.The method of claim 1, wherein the first step comprises: a first step of reading words of an input text file one by one and checking whether the file ends; A second step of checking whether the verb-derived word is found in the verb dictionary if the end of the file ends, and if the end of the file is not the end of the file; As a result of the check in the second process, if there is a matching verb-derived word, the basic form of the verb is taken. If the previous word includes a noun, and if the previous word includes a noun, A third step of registering the basic form of "keyword + verb" in the index file and not registering the verb information in the index if the previous word does not include a noun; And if there is no matching verb-derived word, the current word is a word containing a noun, and if it does not include a noun, the keyword is extracted to the previous word. And a fourth process of registering the extracted "keyword" in the index file and creating a keyword together with the extracted keyword up to the previous word if the current word includes a noun. ※ 참고사항 : 최초출원 내용에 의하여 공개하는 것임.※ Note: The disclosure is based on the initial application.
KR1019950052681A 1995-12-20 1995-12-20 Korean language query information search method KR0160919B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1019950052681A KR0160919B1 (en) 1995-12-20 1995-12-20 Korean language query information search method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1019950052681A KR0160919B1 (en) 1995-12-20 1995-12-20 Korean language query information search method

Publications (2)

Publication Number Publication Date
KR970049752A true KR970049752A (en) 1997-07-29
KR0160919B1 KR0160919B1 (en) 1998-12-15

Family

ID=19441857

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1019950052681A KR0160919B1 (en) 1995-12-20 1995-12-20 Korean language query information search method

Country Status (1)

Country Link
KR (1) KR0160919B1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19990078925A (en) * 1999-08-20 1999-11-05 강준모 Internet Browsing System For Searching with Usual Words
KR20030006201A (en) * 2001-07-12 2003-01-23 서정연 Integrated Natural Language Question-Answering System for Automatic Retrieving of Homepage
KR100419263B1 (en) * 2001-08-17 2004-02-19 주식회사 벤컴디지탈 Method of automatically journalizing for slip and apparatus thereof
KR101371318B1 (en) * 2010-07-29 2014-03-11 네이버 주식회사 System and method for determining rank of document

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100455439B1 (en) * 2000-09-21 2004-11-08 (주)하우앤와이 Internet resource retrieval and browsing method based on expanded web site map and expanded natural domain names assigned to all web resources

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR19990078925A (en) * 1999-08-20 1999-11-05 강준모 Internet Browsing System For Searching with Usual Words
KR20030006201A (en) * 2001-07-12 2003-01-23 서정연 Integrated Natural Language Question-Answering System for Automatic Retrieving of Homepage
KR100419263B1 (en) * 2001-08-17 2004-02-19 주식회사 벤컴디지탈 Method of automatically journalizing for slip and apparatus thereof
KR101371318B1 (en) * 2010-07-29 2014-03-11 네이버 주식회사 System and method for determining rank of document

Also Published As

Publication number Publication date
KR0160919B1 (en) 1998-12-15

Similar Documents

Publication Publication Date Title
US5794177A (en) Method and apparatus for morphological analysis and generation of natural language text
Perkins Python text processing with NLTK 2.0 cookbook
Fletcher Making the web more useful as a source for linguistic corpora
US20170235841A1 (en) Enterprise search method and system
US6957213B1 (en) Method of utilizing implicit references to answer a query
US20140324819A1 (en) Efficient forward ranking in a search engine
CN105045852A (en) Full-text search engine system for teaching resources
WO1997004405A9 (en) Method and apparatus for automated search and retrieval processing
KR101709055B1 (en) Apparatus and Method for Question Analysis for Open web Question-Answering
KR101095866B1 (en) Triple indexing and searching scheme for efficient information retrieval
Merkel et al. Knowledge-lite extraction of multi-word units with language filters and entropy thresholds.
US6907562B1 (en) Hypertext concordance
JP3594701B2 (en) Key sentence extraction device
Lim et al. Automatic genre detection of web documents
KR970049752A (en) Korean Natural Language Query Information Retrieval Using Verb Information
KR20210032253A (en) System and method for searching documents and providing an answer to a natural language question
Fatima et al. STEMUR: An automated word conflation algorithm for the Urdu language
KR100452024B1 (en) Searching engine and searching method
KR100376931B1 (en) A Method of Database System Implementation for Korean-English Translation Using Information Retrieval Techniques
KR100374114B1 (en) Hyperlink generator for korean language terminology based HTML
KR20200122089A (en) Apparatus and Method for Electronic Document Retrieval using Local Indexing
Laitinen et al. Google Books: A shortcut to studying language variability?
KR970017008A (en) Document ranking method using keyword network
JP4635585B2 (en) Question answering system, question answering method, and question answering program
Ababneh et al. Enhanced Arabic Information Retrieval by Using Arabic Slang

Legal Events

Date Code Title Description
A201 Request for examination
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20070730

Year of fee payment: 10

LAPS Lapse due to unpaid annual fee