JP2002230007A

JP2002230007A - Method for generating information for retrieval, and device and method for retrieving information

Info

Publication number: JP2002230007A
Application number: JP2001026891A
Authority: JP
Inventors: Takeshi Shimizu; 剛清水; Yasushi Seiki; 康清木
Original assignee: BLINK COM KK; Blinkcom Kk
Current assignee: BLINK COM KK; Blinkcom Kk
Priority date: 2001-02-02
Filing date: 2001-02-02
Publication date: 2002-08-16

Abstract

PROBLEM TO BE SOLVED: To provide a method for generating information for retrieval and an information retrieving device and method for allowing a user to easily reach his or her desired information by using his or her own sensitivity as a retrieval condition. SOLUTION: This method for generating information for retrieval comprises a step for receiving one or more descriptors among a plurality of specific descriptors assigned for expressing specific contents from a plurality of users who are respectively given those descriptors corresponding to specific addresses indicating the specific contents and a step for deciding the descriptor of the address based on the assigned descriptor.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、一般に、電気的デ
ジタルデータ処理装置に係り、特に、インターネット等
のネットワークを利用する不特定多数の利用者を対象と
する分散情報検索装置及び方法、並びに、当該情報検索
に使用される（問い合わせ）情報を生成する方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention generally relates to an electronic digital data processing apparatus, and more particularly, to a distributed information search apparatus and method for an unspecified number of users using a network such as the Internet, and The present invention relates to a method of generating (inquiry) information used for the information search.

【０００２】[0002]

【従来の技術】インターネットを活用した様々な情報提
供サービスがＷＷＷと呼ばれる分散情報システムを介し
て一般社会に急速に浸透している。ＷＷＷのクライアン
トにファイルやデータを提供するソフトウェアはＷＷＷ
サーバと呼ばれ、ＷＷＷサーバはＵＲＬ（Ｕｎｉｆｏｒ
ｍＲｅｓｏｕｒｃｅＬｏｃａｔｏｒ）によってアドレ
スが指定される。ユーザクライアントはインターネット
・エクスプローラやネットスケープ・ナビゲータなどの
商業的に入手可能なＷＷＷブラウザを使用してＵＲＬを
指定することによって対応するサイトのコンテンツ（即
ち、ＷＷＷサーバが提供する情報）を閲覧することがで
きる。2. Description of the Related Art Various information providing services utilizing the Internet are rapidly penetrating the general society through a distributed information system called WWW. Software that provides files and data to WWW clients is WWW
Server, and the WWW server is a URL (Unifor
The address is specified by “m Resource Locator”. The user client can browse the contents of the corresponding site (ie, information provided by the WWW server) by specifying the URL using a commercially available WWW browser such as Internet Explorer or Netscape Navigator. it can.

【０００３】一方、インターネット上の膨大な情報から
必要な情報を含むサイトを検索する検索エンジンも公知
である。インターネット上では様々なＷＷＷサイトとの
間にリンクを張って情報の効率化が図られている。単に
ＵＲＬを記載するリンク行為は、リンク先のコンテンツ
の複製ではないために著作権法その他の知的所有権法上
の問題は生じないと一般に解されてインターネット上で
は頻繁に行われている。従来の検索エンジンは、キーワ
ード、音声及び／又は画像を問い合わせデータ（クエリ
ー）として要求するもの、カテゴリー毎に階層的に分類
されているもの、両者が混在しているものが知られてい
る。キーワードは、単語、文書、ＵＲＬの一部などであ
り、所定の検索条件（例えば、ＡＮＤ、ＯＲ、ＮＯＴな
どの論理演算、サイトの登録日時範囲（例えば、２００
０年１月１日以降など））で組み合わされる。かかる条
件の下で検索開始に応答して、サーバは各ページ又はペ
ージ全文に対して検索を行い、所定の表示条件（１０件
毎、１００件毎などの表示件数、要約表示の有無、パー
センテージ順の表示）に従って検索結果を表示する。ア
スタ・ビスタ、ヤフー、ライコス、インフォシークなど
の既知の検索エンジンはインターネットにアクセスした
ときに最初に訪れるポータルサイトとして利用されてい
る場合も多い。On the other hand, a search engine for searching a site containing necessary information from a vast amount of information on the Internet is also known. On the Internet, links are established between various WWW sites to improve information efficiency. It is generally understood that a linking operation that simply describes a URL does not cause a problem in copyright law or other intellectual property law because it is not a copy of the linked content, and is frequently performed on the Internet. Conventional search engines are known to request keywords, voice and / or images as query data (query), to be classified hierarchically for each category, and to include both. The keyword is a word, a document, a part of a URL, or the like, and includes a predetermined search condition (for example, a logical operation such as AND, OR, NOT, etc.,
Etc.). In response to the start of the search under these conditions, the server performs a search on each page or the full text of the page, and sets a predetermined display condition (the number of display items such as every ten items, every 100 items, the presence or absence of a summary display, the percentage order). Display) to display search results. Known search engines such as Asta Vista, Yahoo, Lycos, and Infoseek are often used as the first portal sites to visit when accessing the Internet.

【０００４】[0004]

【発明が解決しようとする課題】しかし、従来の検索エ
ンジンは、検索用情報（例えば、キーワード）をコンテ
ンツに含まれる言語等に限定しているので、ユーザの感
性に合致したコンテンツ（例えば、情熱的なジャズを聴
きたいなど）を効率的に検索することはできない。ま
た、ある共通性を有するユーザの集合（コミュニティ）
をサーチする場合でも自分と同様の感性を有する他のユ
ーザ又はコミュニティ（例えば、同様の音楽性を嗜好す
るアマチュアバンドなど）を効率的に検索することはで
きなかった。更に、従来の検索エンジンで上述のことを
実現しようと思えば管理者が自らの主観に従う感性デー
タをコンテンツに付与することになるであろうが、管理
者はユーザ総体を代表していないために検索用情報とし
て必ずしも適切ではない。例えば、あるユーザが情熱的
であると思う音楽も管理者が当該音楽を情熱的とは思わ
ず、その結果、情熱的という感性データをそれに付与し
なければ、ユーザが「情熱的」というキーワードを入力
しても当該音楽を得ることはできない。このため、検索
用情報であるクエリーを効果的に生成する方法も存在し
なかった。However, conventional search engines limit search information (for example, keywords) to a language or the like included in the content, so that a content (for example, passion You want to listen to typical jazz). In addition, a set of users who have a certain commonality (community)
Is not able to efficiently search for other users or communities that have the same sensibility as their own (for example, an amateur band that likes the same musicality). Furthermore, if the administrator wants to realize the above with a conventional search engine, the administrator will give sensibility data according to his own subjectivity to the content, but since the administrator does not represent the whole user, It is not always appropriate as search information. For example, even if a certain user thinks that the music is passionate, the manager does not consider the music passionate, and as a result, if the passion data of passion is not given to it, the user inputs the keyword “passionate”. Even so, the music cannot be obtained. For this reason, there is no method for effectively generating a query that is search information.

【０００５】そこで、本発明は、このような従来の課題
を解決する新規かつ有用な情報検索装置及び方法、並び
に、検索用情報の生成方法を提供することを概括的目的
とする。Accordingly, it is a general object of the present invention to provide a new and useful information retrieval apparatus and method, and a method for generating retrieval information, that solve such conventional problems.

【０００６】より特定的には、本発明は、ユーザが容易
に自分の感性を検索条件として所望の情報に到達するこ
とを可能にする情報検索装置及び方法、並びに、当該情
報検索に使用される検索用情報の生成方法を提供するこ
とを例示的目的とする。More specifically, the present invention is used for an information retrieval apparatus and method for enabling a user to easily reach desired information using his / her sensibility as a retrieval condition, and is used for the information retrieval. An exemplary object is to provide a method for generating search information.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するため
に、本発明の一側面としての検索用情報の生成方法は、
特定のコンテンツを指す特定のアドレスに対して、複数
の既定の記述子が各々与えられた複数のユーザが、前記
コンテンツを表現するために割り当てた前記複数の記述
子のうちの一以上の記述子を前記ユーザから受け取るス
テップと、前記割り当てられた記述子に基づいて前記ア
ドレスの記述子を決定するステップとを有する。かかる
方法は、複数のユーザがコンテンツを表現するために割
り当てた複数の記述子から前記アドレスの記述子を決定
する。In order to achieve the above object, a method for generating search information according to one aspect of the present invention includes:
For a particular address pointing to particular content, one or more of the plurality of descriptors assigned by a plurality of users, each given a plurality of predefined descriptors, to represent the content Receiving from the user, and determining a descriptor for the address based on the assigned descriptor. Such a method determines a descriptor for the address from a plurality of descriptors assigned by a plurality of users to represent the content.

【０００８】前記受け取りステップは、それぞれ一以上
の前記記述子によって特徴付けられて特定の概念に対応
するフォルダを前記ユーザから受け取ることによって前
記記述子を受け取ってもよい。従って、フォルダを受け
取ることによって一以上の記述子を同時に受け取ること
ができる。[0008] The receiving step may receive the descriptor by receiving from the user a folder each characterized by one or more of the descriptors and corresponding to a particular concept. Thus, by receiving a folder, one or more descriptors can be received simultaneously.

【０００９】前記決定ステップは、多くのユーザが共通
に割り当てた前記記述子を所定数だけ前記アドレスの前
記記述子に選択してもよい。これにより、より多くのユ
ーザが選択した記述子がコンテンツの内容を説明する記
述子となる。[0009] In the determining step, a predetermined number of the descriptors commonly assigned by many users may be selected as the descriptors at the address. As a result, the descriptor selected by more users becomes the descriptor describing the content of the content.

【００１０】前記方法は、前記割り当てられた記述子を
重み付けして評価付けするステップを更に有し、前記決
定ステップは、評価の高い前記記述子を所定数だけ前記
アドレスの前記記述子に選択してもよい。これにより、
アドレスの記述子は、単により多くのユーザが選択した
という事実によって決定されるのではなく、例えば、ユ
ーザの職業、性別などによって重み付けを変更してコン
テンツに詳しいユーザの意見を反映することができる。[0010] The method may further comprise the step of weighting and assigning the assigned descriptor, and the determining step selecting a predetermined number of the descriptors having a high evaluation as the descriptors of the address. You may. This allows
The address descriptor is not determined solely by the fact that more users have selected it, but can be weighted according to the user's occupation, gender, etc. to reflect the user's opinion of the content. .

【００１１】前記方法は、前記記述子を所定の辞書を用
いてベクトル化するステップと、前記アドレスに割り当
てられた全ての記述子のベクトルを合成して前記アドレ
スのベクトルを生成するステップとを更に有してもよ
い。かかる方法は、辞書を参照することによってコンテ
ンツをベクトル化する。The method may further comprise the steps of: vectorizing the descriptor using a predetermined dictionary; and generating a vector of the address by combining vectors of all descriptors assigned to the address. May have. Such a method vectorizes the content by referring to a dictionary.

【００１２】本発明の別の側面としての検索用情報の生
成方法は、複数の既定の記述子が与えられ、情報検索装
置にネットワーク接続された複数のユーザが、特定のコ
ンテンツを指す特定のアドレスに対して、前記コンテン
ツを表現するために前記記述子を割り当てて前記情報検
索装置に送信するステップと、前記情報検索装置が、前
記割り当てられた記述子に基づいて前記アドレスの記述
子を決定するステップとを有する。かかる方法によれ
ば、ユーザと情報検索装置がオンラインで結ばれた環境
において、特定のコンテンツを表現する記述子を、複数
のユーザが付与した複数の記述子に基づいて決定するシ
ステムを提供することができる。According to another aspect of the present invention, there is provided a method for generating search information, wherein a plurality of predetermined descriptors are provided, and a plurality of users connected to the information search apparatus via a network are provided with a specific address indicating specific content. Assigning the descriptor to represent the content and transmitting the descriptor to the information search device; and the information search device determines a descriptor of the address based on the assigned descriptor. And steps. According to such a method, in an environment in which a user and an information search device are connected online, a system for determining a descriptor expressing specific content based on a plurality of descriptors assigned by a plurality of users is provided. Can be.

【００１３】本発明の更に別の側面としての検索用情報
の生成方法は、特定のコンテンツを指す特定のアドレス
に対して、複数の既定の記述子が与えられた複数のユー
ザが、前記コンテンツを表現するために割り当てた前記
複数の記述子のうちの一以上の記述子を前記ユーザから
受け取るステップと、前記複数のユーザの中で、前記ア
ドレスに対して、類似する記述子を割り当てたユーザを
グループ化するステップとを有する。かかる方法は、同
一のコンテンツに対して共通の記述子を割り当てた複数
のユーザをグループ化してコミュニティを形成すること
ができる。[0013] According to still another aspect of the present invention, there is provided a method for generating search information, wherein a plurality of users given a plurality of predetermined descriptors for a specific address pointing to a specific content download the content. Receiving from the user one or more of the plurality of descriptors allocated for representation from the user; and identifying, among the plurality of users, a user who has allocated a similar descriptor to the address. Grouping. Such a method can form a community by grouping a plurality of users who have assigned a common descriptor to the same content.

【００１４】本発明の更に別の側面としての検索用情報
の生成方法は、複数の既定の記述子が与えられ、情報検
索装置にネットワーク接続された複数のユーザが、特定
のコンテンツを指す特定のアドレスに対して、前記コン
テンツを表現するために前記記述子を割り当てて前記情
報検索装置に送信するステップと、前記情報検索装置
が、前記複数のユーザの中で、前記アドレスに対して、
類似する記述子を割り当てたユーザをグループ化するス
テップとを有する。かかる方法によれば、ユーザと情報
検索装置がオンラインで結ばれた環境において、複数の
ユーザの中から同一のコンテンツに対して共通の記述子
を割り当てたユーザをグループ化してコミュニティを形
成するシステムを提供することができる。According to still another aspect of the present invention, there is provided a method for generating search information, wherein a plurality of predetermined descriptors are provided, and a plurality of users network-connected to the information search apparatus can specify a specific content pointing to specific content. For the address, assigning the descriptor to represent the content and transmitting to the information search device, the information search device, among the plurality of users, for the address,
Grouping users who have assigned similar descriptors. According to such a method, in an environment in which a user and an information search device are connected online, a system for forming a community by grouping users who have assigned a common descriptor to the same content from a plurality of users is provided. Can be provided.

【００１５】本発明の更に別の側面としての検索用情報
の生成方法は、特定の概念に対応するフォルダに対して
複数のユーザによって割り当てられ、当該フォルダを特
徴付ける一以上の記述子を前記ユーザから受け取るステ
ップと、前記割り当てられた記述子に基づいて前記フォ
ルダの記述子を決定するステップとを有する。かかる方
法は、フォルダを特徴付ける記述子を複数のユーザが割
り当てた複数の記述子から決定する。前記決定ステップ
は、多くの利用者が共通に割り当てた前記記述子を所定
数だけ前記フォルダの前記記述子に選択することができ
る。これにより、より多くのユーザが選択した記述子が
フォルダを特徴付ける記述子となる。前記方法は、前記
割り当てられた記述子を重み付けして評価付けするステ
ップを更に有し、前記決定ステップは、評価の高い記述
子を所定数だけ前記フォルダの前記記述子に選択する。
これにより、フォルダを特徴付ける記述子は、単により
多くのユーザが選択したという事実によって決定される
のではなく、例えば、ユーザの職業、性別などによって
重み付けを変更してフォルダの内容に詳しいユーザの意
見を反映することができる。According to still another aspect of the present invention, there is provided a method for generating search information, wherein a folder corresponding to a specific concept is assigned by a plurality of users, and one or more descriptors characterizing the folder are provided from the user. Receiving and determining a descriptor for the folder based on the assigned descriptor. In this method, a descriptor characterizing a folder is determined from a plurality of descriptors assigned by a plurality of users. In the determining step, a predetermined number of the descriptors commonly assigned by many users can be selected as the descriptors of the folder. As a result, the descriptor selected by more users becomes the descriptor characterizing the folder. The method further comprises the step of weighting and assigning the assigned descriptors, wherein the determining step selects a predetermined number of highly evaluated descriptors as the descriptors of the folder.
Thus, the descriptor characterizing the folder is not determined by the fact that it has been selected by more users, but rather by, for example, changing the weight according to the user's occupation, gender, etc. Can be reflected.

【００１６】本発明の更に別の側面としての検索用情報
の生成方法は、情報検索装置にネットワーク接続された
複数のユーザが、特定の概念に対応するフォルダに当該
フォルダを特徴付ける一以上の記述子を割り当てて前記
情報検索装置に送信するステップと、前記情報検索装置
が、前記割り当てられた記述子に基づいて前記フォルダ
の記述子を決定するステップとを有する。かかる方法に
よれば、ユーザと情報検索装置がオンラインで結ばれた
環境において、フォルダの内容を表す記述子を、複数の
ユーザが付与した複数の記述子に基づいて決定するシス
テムを提供することができる。According to still another aspect of the present invention, there is provided a method for generating search information, wherein a plurality of users network-connected to an information search apparatus include one or more descriptors that characterize a folder corresponding to a specific concept. And transmitting the information to the information search device, and the information search device determines a descriptor of the folder based on the assigned descriptor. According to this method, it is possible to provide a system that determines a descriptor representing the contents of a folder based on a plurality of descriptors assigned by a plurality of users in an environment where the user and the information search device are connected online. it can.

【００１７】本発明の更に別の側面としての検索用情報
の生成方法は、特定のコンテンツを指すアドレスを複数
含む第１の集合と、複数のユーザの各々が予め作成した
アドレスを複数含む第２の集合とを比較するステップ
と、前記複数のユーザのうち、前記第１の集合に含まれ
る前記アドレスを前記第２の集合に多く含むユーザを所
定数だけ選択するステップとを有する。かかる方法は、
同一のコンテンツに対して興味を示す複数のユーザをグ
ループ化してコミュニティを形成することができる。前
記選択ステップは、前記第１の集合と前記第２の集合と
の間の共通性を判断する際に重み付けして評価付けする
ステップを含んでもよい。これにより、アドレスの共通
性だけではなく、ユーザ個人の情報（例えば、出身地
等）を考慮したコミュニティを形成することができる。According to still another aspect of the present invention, there is provided a method for generating search information, comprising: a first set including a plurality of addresses indicating specific contents; and a second set including a plurality of addresses created in advance by each of a plurality of users. And selecting a predetermined number of users among the plurality of users who include the addresses included in the first set in the second set in large numbers. Such a method
A plurality of users who are interested in the same content can be grouped to form a community. The selection step may include a step of weighting and evaluating when determining the commonality between the first set and the second set. As a result, it is possible to form a community that takes into account not only the commonality of addresses but also personal information of the user (for example, hometown).

【００１８】本発明の更に別の側面としての情報検索方
法は、情報検索装置が、特定のコンテンツを指すアドレ
スが複数含まれた集合を、前記情報検索装置にネットワ
ーク接続された複数のユーザの各々から受け取るステッ
プと、前記情報検索装置が、前記集合を互いに比較し、
前記複数のユーザのうち、共通する前記アドレスを多く
含む集合に対応するユーザを所定数だけ選択するステッ
プとを有する。かかる方法は、同一のコンテンツに対し
て興味を示す複数のユーザをグループ化してコミュニテ
ィを形成するシステムを提供することができる。[0018] According to still another aspect of the present invention, there is provided an information search method, wherein the information search apparatus stores a set including a plurality of addresses indicating specific contents for each of a plurality of users network-connected to the information search apparatus. Receiving from the, the information retrieval device compares the sets with each other,
Selecting a predetermined number of users corresponding to a set including many common addresses among the plurality of users. Such a method can provide a system for forming a community by grouping a plurality of users who are interested in the same content.

【００１９】本発明の更に別の側面としての情報検索装
置は、ネットワーク接続されたユーザから受け取った、
一以上の記述子によって特徴付けられる問い合わせデー
タに対応する情報を提供する情報検索装置であって、前
記記述子をベクトル化する辞書を記憶するメモリと、前
記問い合わせデータを特徴付ける一以上の前記記述子に
対応する第１のベクトルを前記メモリを参照して決定す
ると共に、当該第１のベクトルと同一又は類似する第２
のベクトルを有する前記情報を決定する処理装置とを有
する。かかる情報検索装置は、問い合わせデータを特徴
付ける一以上の記述子を、辞書を参照して第１のベクト
ルにベクトル化し、当該第１のベクトルと同一又は類似
する第２のベクトルを有する情報をユーザに提供するこ
とができる。前記問い合わせデータは前記記述子を複数
有する場合には、前記処理装置は、前記複数の記述子の
それぞれをベクトル化して合成することによって前記第
１のベクトルを決定する。これにより、情報検索装置
は、複数の記述子全てを反映する一のベクトルに基づい
て問い合わせ情報に対応する情報を提供することができ
る。前記情報は、前記ユーザ及び前記情報検索装置に接
続されているネットワークのアドレスを含む。これによ
り、情報検索装置は、ネットワーク上に存在するアドレ
スが指すコンテンツを問い合わせ情報に対応する情報と
してユーザに提供することができる。[0019] According to still another aspect of the present invention, there is provided an information retrieving apparatus which receives from a network-connected user,
An information retrieval device for providing information corresponding to query data characterized by one or more descriptors, comprising: a memory storing a dictionary for vectorizing the descriptors; and one or more descriptors characterizing the query data. Is determined with reference to the memory, and a second vector that is the same as or similar to the first vector is determined.
And a processing device for determining the information having the following vector. Such an information retrieval apparatus converts one or more descriptors characterizing inquiry data into a first vector by referring to a dictionary, and provides a user with information having a second vector that is the same as or similar to the first vector. Can be provided. When the inquiry data has a plurality of the descriptors, the processing device determines the first vector by vectorizing and combining each of the plurality of descriptors. Accordingly, the information search device can provide information corresponding to the inquiry information based on one vector reflecting all of the plurality of descriptors. The information includes an address of a network connected to the user and the information search device. Thus, the information search device can provide the user with the content indicated by the address existing on the network as information corresponding to the inquiry information.

【００２０】本発明の更に別の側面としての情報検索装
置は、特定のコンテンツを指す特定のアドレスに対して
複数のユーザが前記コンテンツを表現するために割り当
てた複数の既定の記述子と、前記複数のユーザとを記憶
するメモリと、前記複数のユーザの中で、前記アドレス
に対して、共通する記述子を割り当てたユーザをグルー
プ化することを可能とする処理装置とを有する。かかる
情報検索装置によれば、同一のコンテンツに対して共通
する記述子を割り当てた複数のユーザをグループ化して
コミュニティを形成することができる。According to still another aspect of the present invention, there is provided an information retrieval apparatus comprising: a plurality of predetermined descriptors assigned by a plurality of users to a specific address indicating a specific content in order to express the content; A memory that stores a plurality of users; and a processing device that enables a group of users assigned a common descriptor to the address among the plurality of users. According to such an information search device, a community can be formed by grouping a plurality of users who have assigned a common descriptor to the same content.

【００２１】本発明の他の目的及び更なる特徴は、以
下、添付図面を参照して説明される実施例により明らか
にされるであろう。Other objects and further features of the present invention will become apparent from the embodiments described below with reference to the accompanying drawings.

【００２２】[0022]

【発明の実施の形態】以下、添付図面を参照して、本発
明の一側面としての情報検索システム１及び情報検索シ
ステム１に適用可能な本発明の別の側面としての情報検
索装置１００について説明する。ここで、図１は、情報
検索システム１のシステム構成図である。図２は、情報
検索装置１００のデータ記憶装置２００に格納される例
示的な各種データベースのブロック図である。情報検索
システム１は、複数の（ユーザ）クライアント１０と、
複数のサイト２０と、情報検索装置１００とがネットワ
ーク接続（本実施例ではインターネット３０を介して接
続）されている。なお、本実施例では情報検索装置１０
０の管理者をシステムの管理者という場合もある。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, an information retrieval system 1 as one aspect of the present invention and an information retrieval apparatus 100 as another aspect of the present invention applicable to the information retrieval system 1 will be described with reference to the accompanying drawings. I do. Here, FIG. 1 is a system configuration diagram of the information search system 1. FIG. 2 is a block diagram of exemplary various databases stored in the data storage device 200 of the information search device 100. The information search system 1 includes a plurality of (user) clients 10,
The plurality of sites 20 and the information search device 100 are connected to a network (in this embodiment, connected via the Internet 30). In this embodiment, the information retrieval device 10
The administrator of 0 may be the system administrator.

【００２３】クライアント１０は、個人、法人を問わ
ず、また設置場所の国内外を問わないが、典型的には、
個人又は企業ユーザが操作するプラットフォーム又はそ
れに格納されたソフトウェアを指す。各クライアント１
０には一のユーザが割り当てられており、本出願ではユ
ーザの意味に使用される場合もある。また、本発明は、
各クライアント１０に同一のクライアント１０に複数の
ユーザが割り当てられる場合も含む。プラットフォーム
は情報を送受信、加工及び格納する機器としてパーソナ
ルコンピュータ（ＰＣ）のみならず、デジタルテレビ、
パーソナルディジタルアシスタンツ（ＰＤＡ）、携帯電
話、ゲーム機などを広く含む。本実施例では、クライア
ント１０は、マイクロフォンなどの音声入力装置やその
他のバイオメトリック情報（指紋、アイリスなど）の入
力装置を有するか、これに接続されていてもよい。クラ
イアント１０は、専用回線又はインターネットサービス
プロバイダ（ＩＳＰ）を介してインターネット３０に接
続されるが、図１においては、ＩＳＰは省略されてい
る。The client 10 may be an individual or a corporation, regardless of whether it is installed in Japan or overseas.
Refers to a platform operated by an individual or company user or software stored therein. Each client 1
0 is assigned to one user, and is sometimes used in the present application to mean the user. Also, the present invention
This includes the case where a plurality of users are assigned to the same client 10 for each client 10. The platform is not only a personal computer (PC) but also a digital TV,
Widely includes personal digital assistants (PDAs), mobile phones, game consoles, and the like. In the present embodiment, the client 10 may have or be connected to a voice input device such as a microphone or an input device for other biometric information (fingerprint, iris, etc.). The client 10 is connected to the Internet 30 via a dedicated line or an Internet Service Provider (ISP), but the ISP is omitted in FIG.

【００２４】サイト２０は、独自のＵＲＬが割り当てら
れ、本実施例では、ＨＴＭＬ文書やＦＴＰ（Ｆｉｌｅ
ＴｒａｎｓｆｅｒＰｒｏｔｏｃｏｌ）ファイルを有す
る各種の情報源である。特定のＵＲＬはサイト２０が提
供する特定のコンテンツを指している。ＵＲＬはＷＷＷ
上のアクセス単位となっているウェブ上のリソース及び
オブジェクトに付与される固有アドレスであるが、本発
明がインターネット以外のネットワークで使用された場
合には当該ネットワーク上のアドレスになる。クライア
ント１０との関係では各サイト２０もサーバであり、音
楽、映画、写真、インテリア、ファッション、スポー
ツ、コンピュータ、政治、経済などの各種の情報やサー
ビスを提供する。サイト２０は、専用回線又はＩＳＰを
介してインターネット３０に接続されるが、図１におい
ては、ＩＳＰは省略されている。後述するように、ネッ
トワークがインターネット３０以外であれば、サイト２
０が格納する情報はＨＴＭＬやＦＴＰフォーマットに従
う必要がないことはいうまでもない。The site 20 is assigned a unique URL. In this embodiment, the site 20 uses an HTML document or an FTP (File).
Transfer Protocol (Transfer Protocol) files. The specific URL points to specific content provided by the site 20. URL is WWW
The unique address given to the resource and the object on the web, which is the above access unit, is an address on the network when the present invention is used in a network other than the Internet. In relation to the client 10, each site 20 is also a server, and provides various information and services such as music, movies, photos, interiors, fashion, sports, computers, politics, and economy. The site 20 is connected to the Internet 30 via a dedicated line or an ISP, but the ISP is omitted in FIG. As described later, if the network is other than the Internet 30, the site 2
It goes without saying that the information stored in 0 does not need to follow the HTML or FTP format.

【００２５】インターネット３０はネットワークの典型
例であるが、本発明は、ＬＡＮ（ＬｏｃａｌＡｒｅａ
Ｎｅｔｗｏｒｋ）、ＭＡＮ（Ｍｅｔｒｏｐｏｌｉｔａｎ
ＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒ
ｅａＮｅｔｗｏｒｋ）その他のネットワークに使用さ
れることを妨げるものではない。Although the Internet 30 is a typical example of a network, the present invention employs a LAN (Local Area).
Network), MAN (Metropolitan)
Area Network), WAN (Wide Ar)
ea Network) or other networks.

【００２６】情報検索装置１００は本発明の情報検索シ
ステム１に適用され、クライアント１０に対してインタ
ーネット３０を介してＷＷＷサーバとして機能する。情
報検索装置１００は、検索に使用される情報を生成する
と共に、クライアント１０の検索操作に応答して、クラ
イアント１０が所望するコンテンツ及びその付随情報、
当該コンテンツを有する又は有する可能性の高いサイト
２０のアドレス、クライアント１０が希望する条件を満
足する又は満足する可能性の高い他のクライアント１０
及び／又はコミュニティ、クライアント１０自身につい
て登録されている情報その他の情報を提供する。検索用
情報は、クエリーとして要求されるキーワード、音声及
び／又は画像と、指定されるべき階層的に分類されたカ
テゴリーに加えて、本発明との関係では、後述するデー
タ記憶装置２００に格納されるデータ（例えば、記述子
としてのメタデータやフォルダなど）を含む。また、情
報検索装置１００は、これらの様々な情報を提供するた
めに、クライアント１０からデータを受信し、当該クラ
イアント１０の管理者が作成した基準に従ってクライア
ント１０の検索を可能にする後述の各種の処理を行う。The information retrieval device 100 is applied to the information retrieval system 1 of the present invention, and functions as a WWW server for the client 10 via the Internet 30. The information search device 100 generates information used for the search and, in response to a search operation of the client 10, content desired by the client 10 and its accompanying information,
Address of site 20 that has or is likely to have the content, other client 10 that satisfies or is likely to satisfy the conditions desired by client 10
And / or provides information registered with the community and the client 10 itself and other information. The search information is stored in a data storage device 200 described later in relation to the present invention, in addition to the keyword, voice and / or image requested as a query, and the hierarchically classified category to be designated. (E.g., metadata and folders as descriptors). In addition, the information search device 100 receives data from the client 10 and provides various types of information described below that enable the search of the client 10 according to criteria created by the administrator of the client 10 in order to provide these various types of information. Perform processing.

【００２７】情報検索装置１００は、ＣＰＵ１１０と、
通信ポート１２０と、ＲＡＭ１３０と、ＲＯＭ１４０
と、データ記憶装置２００とを有する。なお、図１にお
いては、情報検索装置１００に付随する入出力装置（キ
ーボード、各種のポインティングデバイス（マウス、ト
ラックボール、ジョイスティック、タブレット、デジタ
イザなど））及び表示装置（ディスプレイなど）は省略
されている。必要があれば、情報検索装置１００は、ク
ライアント１０の音声入力装置から入力された音声情報
を認識する音声認識装置や、セキュリティを強化するた
めのバイオメトリック装置（指紋照合装置、声紋照合装
置、アイリス照合装置など）も有することができる。同
様に、情報検索装置１００は、セキュリティを維持する
ための暗号システムを利用することができる。情報検索
装置１００は、ＷＷＷサーバとして機能するだけでな
く、メールサーバやニュースサーバとしても機能するこ
とができる。The information retrieval device 100 includes a CPU 110,
Communication port 120, RAM 130, ROM 140
And a data storage device 200. In FIG. 1, input / output devices (a keyboard, various pointing devices (a mouse, a trackball, a joystick, a tablet, a digitizer, etc.)) and a display device (a display, etc.) accompanying the information retrieval device 100 are omitted. . If necessary, the information search device 100 may include a voice recognition device for recognizing voice information input from a voice input device of the client 10 or a biometric device for enhancing security (a fingerprint collation device, a voiceprint collation device, an iris). Verification device, etc.). Similarly, the information search device 100 can use an encryption system for maintaining security. The information search device 100 can function not only as a WWW server but also as a mail server or a news server.

【００２８】ＣＰＵ１１０は、ＭＰＵなど名称の如何を
問わない処理装置を広く含み、情報検索装置１００の各
部を制御する。必要があれば、情報検索装置１００は、
ＬＡＮその他のネットワークを介して他のコンピュータ
に接続され、ＣＰＵ１１０はそれらのコンピュータと通
信することができる。ＣＰＵ１１０は、本発明との関係
では、データ記憶装置２００に格納されている各種デー
タベースの構築と、当該データベースとクライアント１
０の要求に応じてクライアント１０に各種のサービスを
提供することができる。代替的に、情報検索装置１００
は、ＣＰＵ１１０により制御され、データ記憶装置２０
０の各種データベースを処理する専用の処理装置を含ん
でもよい。The CPU 110 includes a wide range of processing devices having any name such as MPU, and controls each unit of the information search device 100. If necessary, the information retrieval device 100
Connected to other computers via a LAN or other network, the CPU 110 can communicate with those computers. In relation to the present invention, the CPU 110 constructs various databases stored in the data storage device 200,
0 can provide various services to the client 10 in response to the request of 0. Alternatively, the information retrieval device 100
Is controlled by the CPU 110 and the data storage device 20
0 may include a dedicated processing device for processing various databases.

【００２９】通信ポート１２０は、インターネット３０
に（必要があればＩＳＰを介して）接続される公衆電話
回線網、ＩＳＤＮ、各種専用線にモデム、ターミナルア
ダプタ（ＴＡ）などを介して接続可能なＵＳＢポートや
ＩＥＥＥ１３９４ポートなどを含む。また、通信ポート
３０は、情報検索装置１００がＬＡＮに接続される場合
には、ハブやルーターなども含むものである。The communication port 120 is connected to the Internet 30
And a public telephone line network, an ISDN, and various dedicated lines connected via a modem, a terminal adapter (TA), and the like to a public telephone line (IEDP if necessary), an IEEE 1394 port, and the like. When the information search device 100 is connected to a LAN, the communication port 30 also includes a hub and a router.

【００３０】ＲＡＭ１３０は、ＲＯＭ１４０やデータ記
憶装置２００などから読み出すデータ又はデータ記憶装
置２００などへ書き込むデータを一時的に記憶する。Ｒ
ＯＭ１４０は、ＣＰＵ１１０の動作に必要な各種ソフト
ウェアやファームウェアその他のソフトウェアを格納す
る。Ｉデータ記憶装置２００以下、データ記憶装置２００について、構造、登録など
の詳細について説明する。Ａ．データ記憶装置２００の構造図２を参照するに、データ記憶装置２００は、ユーザ管
理データベース２１０、メタデータ（ＭＤ）管理データ
ベース２２０、フォルダ管理データベース２３０、ユー
ザＵＲＬ（Ｕ−ＵＲＬ）管理データベース２４０、シス
テムＵＲＬ（Ｓ−ＵＲＬ）管理データベース２５０、Ｕ
−ＵＲＬ−ＭＤ管理データベース２６０、Ｓ−ＵＲＬ−
ＭＤ管理データベース２７０、特定ＵＲＬに対するユー
ザ別ＭＤ管理データベース２８０、及び、辞書管理デー
タベース２９０を含んでいるが、これらに限定されずに
他のデータベース（例えば、課金データベース、翻訳デ
ータベース、キーワードやカテゴリー等の検索用情報を
格納するデータベース）を含んでもよく、また、これら
の一又は複数は選択的に省略されてもよい。The RAM 130 temporarily stores data to be read from the ROM 140 or the data storage device 200 or data to be written to the data storage device 200 or the like. R
The OM 140 stores various software, firmware, and other software necessary for the operation of the CPU 110. I Data Storage Device 200 Hereinafter, details of the data storage device 200, such as its structure and registration, will be described. A. Structure of Data Storage Device 200 Referring to FIG. 2, the data storage device 200 includes a user management database 210, a metadata (MD) management database 220, a folder management database 230, a user URL (U-URL) management database 240, a system URL (S-URL) management database 250, U
-URL-MD management database 260, S-URL-
An MD management database 270, a user-specific MD management database 280 for a specific URL, and a dictionary management database 290 include, but are not limited to, other databases (eg, a billing database, a translation database, keywords and categories, etc.). (A database that stores search information), and one or more of these may be selectively omitted.

【００３１】ユーザ管理データベース２１０は、ユーザ
の氏名、住所、国籍、旧姓、電話番号、ファックス番
号、電子メールアドレス、当該ユーザ用のフォルダ（Ｆ
ｏｌｄｅｒ）、ユーザ名、パスワード、年齢、銀行口座
番号、クレジットカード口座番号、性別、生年月日、暗
号用の鍵、情報検索システム１に登録された日、システ
ム１を利用可能な有効期限その他のＩＤ情報を格納す
る。クライアント１０の登録はオンライン（即ち、図１
に示すシステム１を介して）でもオフライン（即ち、電
話、郵送、ファックスなどを介して）でも行うことがで
きる。The user management database 210 stores a user's name, address, nationality, maiden name, telephone number, fax number, e-mail address, and a folder (F
older), user name, password, age, bank account number, credit card account number, gender, date of birth, encryption key, date registered in the information retrieval system 1, expiration date when the system 1 can be used, and the like. The ID information is stored. Registration of the client 10 is online (ie, FIG. 1).
) Or offline (i.e., via telephone, mail, fax, etc.).

【００３２】図３に、ユーザ管理データベース２１０に
格納されるユーザデータテーブル２１１の一例を示す。
ユーザデータテーブル２１１は、本実施例では、ユーザ
ＩＤ（ＵＩＤ）２１２、ユーザ名２１３、説明２１４、
ホーム（ＨＯＭＥ）フォルダ２１５及びユーザプロファ
イル情報２１６という５つのフィールドからなる。ＵＩ
Ｄ２１２は、本システム内の各ユーザに対応して設定さ
れる認識子（例えば、ａｂｃ１２３４５）を格納するフ
ィールドである。ユーザ名２１３は、本システム内での
ユーザ識別名（イチロウ）を格納するフィールドであ
る。説明２１４は、ユーザに対する説明文（例えば、
「背が高い」、「アマチュアバンドを組んでいる」等）
を表すフィールドである。FIG. 3 shows an example of the user data table 211 stored in the user management database 210.
In this embodiment, the user data table 211 includes a user ID (UID) 212, a user name 213, a description 214,
It consists of five fields: a home (HOME) folder 215 and user profile information 216. UI
D212 is a field for storing a recognizer (for example, abc12345) set corresponding to each user in the present system. The user name 213 is a field for storing a user identification name (Ichirou) in the present system. The description 214 is a description for the user (for example,
"Tall", "has an amateur band", etc.)
Is a field representing

【００３３】ＨＯＭＥフォルダ２１５は、ユーザのホー
ムフォルダへのポインタを格納するフィールドであり、
ホームフォルダはユーザが自分のフォルダ階層をたどる
ときのトップにあるフォルダである。このように、情報
検索装置１００は、各ユーザに対して、独自のフォルダ
（ファイルやプログラムを収納する場所）を割り当てて
いる。ユーザは、後述するように、所望するコンテンツ
を指すＵＲＬその他の情報をフォルダに格納することが
できる。フォルダは階層（又はツリー）構造を採ること
ができ、ホームフォルダはその階層のトップフォルダで
ある。各フォルダの名称はユーザ及び／又はシステム管
理者が自由に設定することができる。ＨＯＭＥフォルダ
２１５及びそれに含まれるフォルダは、後述するよう
に、他のクライアント及びコミュニティを検索するのに
使用される。The HOME folder 215 is a field for storing a pointer to the user's home folder.
The home folder is the top folder when the user follows his or her folder hierarchy. As described above, the information search device 100 assigns a unique folder (a place for storing files and programs) to each user. The user can store URLs and other information indicating desired contents in the folder, as described later. The folder can have a hierarchical (or tree) structure, and the home folder is the top folder of the hierarchy. The name of each folder can be freely set by the user and / or the system administrator. The HOME folder 215 and the folders contained therein are used to search for other clients and communities, as described below.

【００３４】ユーザプロファイル情報２１６は、ユーザ
の本名（田中一郎）、メールアドレス、職業、住所、電
話番号などの情報を格納するフィールド又はこれらを格
納するファイルへのポインタである。The user profile information 216 is a field for storing information such as the user's real name (Ichiro Tanaka), mail address, occupation, address, and telephone number, or a pointer to a file for storing these.

【００３５】ＭＤ管理データベース２２０は、記述子と
してのメタデータ（ＭＤ）とメタデータの特徴を表す属
性群としてのフィーチャーとの関係を示すテーブル（以
下、「ＭＤエントリテーブル」という。）を格納する。
ＭＤは名詞、形容詞、形容動詞などの記述子であり、具
体的には、フォルダ、各ＵＲＬが指し示すＷＷＷペー
ジ、リソース(即ち、コンピュータが稼動するのに必要
なハードウェア)及びオブジェクト（即ち、プログラム
を構成する基本的なソフトウェア）についてその内容及
び特徴を表す単語である。The MD management database 220 stores a table (hereinafter, referred to as an “MD entry table”) indicating a relationship between metadata (MD) as a descriptor and a feature as an attribute group representing characteristics of the metadata. .
MD is a descriptor such as a noun, an adjective, an adjective verb, etc. Specifically, a folder, a WWW page indicated by each URL, resources (that is, hardware necessary for the computer to operate) and objects (that is, programs) Are words representing the contents and characteristics of the basic software that constitutes.

【００３６】図４に、ＭＤ管理データベース２２０に格
納されるＭＤエントリテーブル２２１の一例を示す。Ｍ
Ｄエントリテーブル２２１は、ＭＤ−ＩＤ２２２とフィ
ーチャー（Ｆｅａｔｕｒｅ）リスト２２３の組から構成
される。ＭＤエントリテーブル２２１は、辞書管理デー
タベース２９０を参照してシステム管理者が手作業入力
によって又はＣＰＵ１１０が自動的に作成する。前者の
場合であっても、辞書管理データベース２９０の辞書を
参照するために、ＭＤエントリテーブル２２１にはシス
テム管理者の主観が入り込む余地はない。FIG. 4 shows an example of the MD entry table 221 stored in the MD management database 220. M
The D entry table 221 is composed of a set of an MD-ID 222 and a feature (Feature) list 223. The MD entry table 221 is created manually by a system administrator with reference to the dictionary management database 290 or automatically by the CPU 110. Even in the former case, there is no room for the subjectivity of the system administrator in the MD entry table 221 to refer to the dictionary in the dictionary management database 290.

【００３７】ＭＤは記述子であり、一般的には、辞書に
出ている名詞や形容詞などの単語の全部又は一部であ
る。ＭＤ−ＩＤ２２２は、それぞれのＭＤに付与される
ＩＤを格納するフィールドであり、図４においては最も
左の一列である。図４においては、ＭＤ−ＩＤ２２２に
は、便宜上ＭＤ１乃至ＭＤＮが割り当てられている。Ｍ
Ｄ１は、例えば、「明るい」を意味し、ＭＤ２は、例え
ば、「音楽」を表す。MD is a descriptor, and is generally or partially a word such as a noun or adjective appearing in a dictionary. The MD-ID 222 is a field for storing an ID assigned to each MD, and is the leftmost column in FIG. In FIG. 4, MD1 to MDN are assigned to the MD-ID 222 for convenience. M
D1 means, for example, "bright", and MD2 means, for example, "music".

【００３８】フィーチャーリスト２２３は複数のフィー
チャーの集合であり、各フィーチャーは後述するように
辞書の基本語に相当する。辞書は、例えば、２０００語
の基本語を使用して辞書に出ている全ての単語（ＭＤに
相当）を説明するように構成されており、この場合には
２０００語の基本語のそれぞれがフィーチャーに相当す
る。フィーチャーリスト２２３は例示的に各２ビットの
ｎ個のデータ列である。図４に示すように、例えば、フ
ィーチャーリスト２２３はｆ１乃至ｆｎからなり、上述
の辞書の場合にはｎ＝２０００である。フィーチャーと
しての基本語はアルファベット順を含む任意の配列で並
べられる。ＭＤをフィーチャー化するためには、辞書で
ＭＤを調べて当該ＭＤを表現するのに各フィーチャーが
使用されていれば１、不使用ならば０を割り当てればよ
い。かかる表示により、各ＭＤはフィーチャーを軸とす
るベクトルで表示される。代替的に、各フィーチャーに
は既定の意味がなくてもよい。The feature list 223 is a set of a plurality of features, and each feature corresponds to a basic word of a dictionary as described later. The dictionary is configured, for example, to describe all words (corresponding to MD) appearing in the dictionary using 2000 basic words, in which case each of the 2000 basic words is featured. Is equivalent to The feature list 223 is, for example, n data strings of 2 bits each. As shown in FIG. 4, for example, the feature list 223 includes f1 to fn, and in the case of the above dictionary, n = 2000. The basic words as features are arranged in any arrangement including alphabetical order. In order to make the MD a feature, it is sufficient to look up the MD in the dictionary and assign 1 if each feature is used to represent the MD, and 0 if it is not used. With this display, each MD is displayed as a vector having the feature as an axis. Alternatively, each feature may have no default meaning.

【００３９】フォルダ管理データベース２３０はフォル
ダデータテーブルを格納する。フォルダは、ある特定の
分類に従って、複数のＵＲＬを格納して、それらのＵＲ
Ｌが表すＷＷＷページ群、リソース又はオブジェクトに
共通する特徴、性質やカテゴリーなどを表すデータ構造
である。後述するように、フォルダにはシステムがＭＤ
を付与する場合とユーザがＭＤを付与する場合がある。
即ち、フォルダはシステムから最初にＭＤ割り当てに応
じて各ユーザに提供されてもよく、また、ユーザが任意
で作成し、命名し、ＭＤを付与してもよい。従って、フ
ォルダはユーザ毎に異なってユーザ毎に所有されてもよ
い。図５に、フォルダデータテーブルの２３１の一例を
示す。フォルダデータテーブル２３１は、例示的に、フ
ォルダＩＤ２３２、フォルダ名２３３、所有者ＩＤ２３
４、メタデータリスト２３５、ＵＲＬリスト２３６、説
明２３７、親フォルダリスト２３８ａ、子フォルダリス
ト２３８ｂ、その他の属性２３９のフィールドを含む。The folder management database 230 stores a folder data table. A folder stores a plurality of URLs according to a specific classification, and stores the URLs.
This is a data structure that represents features, properties, categories, and the like common to a group of WWW pages, resources, or objects represented by L. As described later, the folder contains the system MD.
And the user may provide an MD.
That is, the folder may be initially provided from the system to each user according to the MD assignment, or the user may arbitrarily create, name, and assign an MD. Therefore, the folder may be different for each user and owned by each user. FIG. 5 shows an example of the folder data table 231. The folder data table 231 includes, for example, a folder ID 232, a folder name 233, and an owner ID 23.
4, metadata list 235, URL list 236, description 237, parent folder list 238a, child folder list 238b, and other attribute 239 fields.

【００４０】フォルダＩＤ２３２は、フォルダを独自に
識別するＩＤを格納するフィールドである。フォルダ名
２３３は、フォルダの名称（例えば、ジャズ）を格納す
るフィールドである。所有者ＩＤ２３４は、フォルダの
所有者のＩＤ（例えば、任意のユーザ又は管理者のＩ
Ｄ）を格納するフィールドである。The folder ID 232 is a field for storing an ID for uniquely identifying a folder. The folder name 233 is a field for storing the name of the folder (for example, jazz). The owner ID 234 is the ID of the owner of the folder (for example, the ID of any user or administrator).
D) is a field for storing D).

【００４１】ＭＤリスト２３５は、フォルダに割り当て
られたＭＤのＩＤのリストを格納するフィールドであ
り、ＵＲＬがなくてもフォルダに付随する。即ち、ＭＤ
リスト２３５はフォルダの内容を特徴付けるＭＤである
点で、後述するページＭＤ２５５と相違する。後述する
ページＭＤ２５５はＵＲＬが指すコンテンツを表すＭＤ
である。従って、ＭＤリスト２３５は、後述するＵＲＬ
リスト２３６に含まれるＵＲＬに割り当てられたＭＤと
は独立に設定され、後述するＵＲＬリスト２３６に含ま
れるＵＲＬのページＭＤ２５５にＭＤリスト２３５は影
響を与えない。The MD list 235 is a field for storing a list of MD IDs assigned to the folder, and is attached to the folder even if there is no URL. That is, MD
The list 235 is different from a page MD 255 described later in that the list 235 is an MD characterizing the contents of the folder. A page MD 255 described later is an MD representing the content indicated by the URL.
It is. Therefore, the MD list 235 includes a URL described later.
The MD list 235 is set independently of the MD assigned to the URL included in the list 236, and does not affect the page MD 255 of the URL included in the URL list 236 described later.

【００４２】上述したように、フォルダデータ２３１
は、システムがフォルダのＭＤセットを割り当てる場合
（システムデフォルト型）と、ユーザがフォルダのＭＤ
を割り当てる場合（ユーザ依存型）の２種類を少なくと
も含む。システムがフォルダのＭＤセットを割り当てる
場合であってもＣＰＵ１１０は、後述するように、複数
のユーザの意見を反映することができる。システムデフ
ォルト型のフォルダデータ２３１においては、システム
管理者が各フォルダに属性値として複数のＭＤを予め割
り当てる。ユーザは管理者が作成、命名、ＭＤを付与し
たフォルダセットの雛型をコピーして使用する。即ち、
ユーザ群は同様なツリー構造のフォルダ構造を初期段階
では共有している。但し、異なるフォルダＩＤ２３２が
付与されているためフォルダのオブジェクト自体は異な
る。ユーザはかかるフォルダにＵＲＬを割り当てる。一
方、ユーザ依存型のフォルダデータ２３１においては、
フォルダはユーザが任意に作成、命名、ＭＤを付与す
る。ユーザは必要に応じてフォルダを作成、命名の上、
適当なＭＤを選択して付与することができる。As described above, the folder data 231
When the system assigns the MD set of the folder (system default type), and when the user
Is assigned (user-dependent type). Even when the system assigns the MD set of the folder, the CPU 110 can reflect opinions of a plurality of users as described later. In the system default type folder data 231, the system administrator assigns a plurality of MDs as attribute values to each folder in advance. The user copies and uses the template of the folder set created, named, and given the MD by the administrator. That is,
The user group shares a similar folder structure of a tree structure at an early stage. However, since different folder IDs 232 are assigned, the folder objects themselves are different. The user assigns a URL to such a folder. On the other hand, in the user-dependent folder data 231,
The folder is arbitrarily created, named, and MD-assigned by the user. Users create and name folders as needed,
An appropriate MD can be selected and provided.

【００４３】ＵＲＬリスト２３６は、フォルダに割り当
てられたＵＲＬのＩＤのリストを格納するフィールドで
ある。説明２３７は、フォルダの内容を記述した一般的
な説明（例えば、「情熱的なジャズ」）を格納するフィ
ールドである。親フォルダリスト２３８ａは、当該フォ
ルダが子として上位概念その他（例えば、ＴＯＰフォル
ダであるＨＯＭＥフォルダ）を表す親フォルダ（例え
ば、「音楽」のフォルダ）のＩＤのリストを格納するフ
ィールドである。子フォルダリスト２３８ｂは、当該フ
ォルダが親としてポイントして下位概念を表す子フォル
ダ（例えば、特定アーチスト名のフォルダ）のＩＤのリ
ストを格納するフィールドである。その他の属性２３９
は、フォルダの作成日時、アクセス権設定（例えば、特
定のコミュニティにのみアクセスを許容する場合）など
の情報を格納するフィールドである。The URL list 236 is a field for storing a list of URL IDs assigned to the folder. The description 237 is a field for storing a general description (for example, “passionate jazz”) describing the contents of the folder. The parent folder list 238a is a field for storing a list of IDs of parent folders (for example, a folder of “music”) representing a higher-level concept or the like (for example, a HOME folder that is a TOP folder) as a child. The child folder list 238b is a field for storing a list of IDs of child folders (for example, folders with a specific artist name) that point to the parent as a parent and represent a lower concept. Other attributes 239
Is a field for storing information such as the date and time of folder creation and access right setting (for example, when only a specific community is allowed to access).

【００４４】Ｕ−ＵＲＬ管理データベース２４０は、ユ
ーザ個別のＵＲＬに対するデータ構造を示すＵ−ＵＲＬ
データテーブル２４１を格納する。図６に、Ｕ−ＵＲＬ
データテーブル２４１の一例を示す。同テーブル２４１
は、例示的に、Ｕ−ＵＲＬ−ＩＤ２４２、Ｓ−ＵＲＬ−
ＩＤ２４３、タイトル２４４及び説明２４５のフィール
ドを含む。Ｕ−ＵＲＬ−ＩＤ２４２は、このデータ構造
の識別ＩＤを格納するフィールドである。Ｓ−ＵＲＬ−
ＩＤ２４３は、システムが管理するＵＲＬ情報のＩＤへ
の参照を格納するフィールドであり、Ｓ−ＵＲＬ−ＩＤ
２４３は図７を参照して後述されるＳ−ＵＲＬ−ＩＤ２
５２と同一である。タイトル２４４は、ユーザが付与し
たＵＲＬに対するタイトルを格納するフィールドであ
る。説明２４５は、ユーザが付与したＵＲＬに対する説
明を格納するフィールドである。Ｕ−ＵＲＬデータテー
ブル２４１は、このように、各ユーザが選択したＵＲＬ
と当該ＵＲＬのコンテンツを特徴付けるＭＤが関連付け
ている。The U-URL management database 240 has a U-URL indicating the data structure of the URL for each user.
The data table 241 is stored. In FIG. 6, the U-URL
4 shows an example of a data table 241. Table 241
Is, for example, U-URL-ID 242, S-URL-
It includes fields of ID 243, title 244, and description 245. The U-URL-ID 242 is a field for storing the identification ID of this data structure. S-URL-
The ID 243 is a field for storing a reference to the ID of the URL information managed by the system, and is stored in the S-URL-ID.
243 is an S-URL-ID2 described later with reference to FIG.
Same as 52. The title 244 is a field for storing a title for the URL assigned by the user. The description 245 is a field for storing a description for the URL given by the user. The U-URL data table 241 thus stores the URL selected by each user.
Is associated with the MD characterizing the content of the URL.

【００４５】Ｓ−ＵＲＬ管理データベース２５０は、情
報検索装置１００が管理するＵＲＬとＭＤとの関係を示
すＳ−ＵＲＬデータテーブル２５１を格納する。図７
に、Ｓ−ＵＲＬデータテーブル２５１の一例を示す。同
テーブル２５１は、例示的に、Ｓ−ＵＲＬ−ＩＤ２５
２、ＵＲＬ２５３、タイトル２５４及びページＭＤ２５
５のフィールドを含む。Ｓ−ＵＲＬ−ＩＤ２５２は、こ
のデータ構造のＩＤを格納するフィールドである。ＵＲ
Ｌ２５３は、対応するＵＲＬを格納するフィールドであ
る。タイトル２５４は、ＵＲＬが指すページに付けられ
たオリジナルタイトルを格納するフィールドである。ペ
ージＭＤ２５５は、ＵＲＬが指すページ（即ち、コンテ
ンツ）を特徴付けるＭＤを格納するフィールドである。
ＭＤはシステム管理者が設定してもよいし、後述するよ
うにクライアント１０によって生成されてもよい。ペー
ジＭＤ２５５は、上述したように、フォルダの内容を特
徴付けるＭＤリスト２３５とは無関係に設定される。従
って、ユーザが特定のＵＲＬを当該ＵＲＬのページＭＤ
２５５とは異なるＭＤリスト２３５を有する特定のフォ
ルダに割り当ててもページＭＤ２５５は影響を受けな
い。The S-URL management database 250 stores an S-URL data table 251 indicating the relationship between the URL managed by the information search device 100 and the MD. FIG.
Shows an example of the S-URL data table 251. The table 251 includes, for example, the S-URL-ID 25
2, URL 253, title 254 and page MD25
Includes 5 fields. The S-URL-ID 252 is a field for storing the ID of this data structure. UR
L253 is a field for storing the corresponding URL. The title 254 is a field for storing the original title given to the page indicated by the URL. The page MD 255 is a field for storing an MD characterizing the page (that is, content) indicated by the URL.
The MD may be set by a system administrator or may be generated by the client 10 as described later. As described above, the page MD 255 is set independently of the MD list 235 that characterizes the contents of the folder. Therefore, the user sets the specific URL to the page MD of the URL.
Assigning to a specific folder having an MD list 235 different from 255 does not affect page MD 255.

【００４６】Ｕ−ＵＲＬ−ＭＤ管理データベース２６０
は、ユーザ毎にＵＲＬに付与されたＭＤを管理するＵ−
ＵＲＬ−ＭＤ管理テーブル２６１を格納する。図８に、
Ｕ−ＵＲＬ−ＭＤ管理テーブル２６１の一例を示す。同
テーブル２６１は、図８に示すように、ユーザＵＲＬ２
６２及びＭＤリスト２６３のフィールドを含む。ユーザ
ＵＲＬ（Ｕ−ＵＲＬ）２６２は、ユーザと当該ユーザが
フォルダに割り当てたＵＲＬとを識別し、ユーザがＵＲ
Ｌをフォルダに割り当てる毎に形成される。ＭＤリスト
（又はＭＤセット）２６３は、ユーザが特定のＵＲＬ
を、既にＭＤリスト２３５において特徴付けられたフォ
ルダに格納した場合に、当該フォルダのＭＤリスト２３
５のＭＤセットを格納するフィールドである。従って、
ＭＤリスト２６３は、各ＷＷＷページ、リソース、オブ
ジェクトに対応してシステムではなくユーザが設定した
ＭＤの集合（例えば、情緒的及び若者向き）である。U-URL-MD management database 260
Manages the MD assigned to the URL for each user.
The URL-MD management table 261 is stored. In FIG.
5 shows an example of a U-URL-MD management table 261. The table 261 stores the user URL 2 as shown in FIG.
62 and MD list 263 fields. The user URL (U-URL) 262 identifies a user and a URL assigned to the folder by the user, and the user
It is formed every time L is assigned to a folder. The MD list (or MD set) 263 stores the URL specified by the user.
Is stored in the folder already characterized in the MD list 235, the MD list 23 of the folder is stored.
5 is a field for storing an MD set. Therefore,
The MD list 263 is a set of MDs (for example, for emotional and young people) set by the user instead of the system corresponding to each WWW page, resource, and object.

【００４７】Ｓ−ＵＲＬ−ＭＤ管理データベース２７０
は、ＵＲＬに付与されたＭＤを管理するＳ−ＵＲＬ−Ｍ
Ｄ管理テーブル２７１を格納する。同テーブル２７１
は、システムが全ユーザのＵ−ＵＲＬ−ＭＤ管理テーブ
ル２６１から収集した同一のＳ−ＵＲＬ−ＩＤを有する
Ｕ−ＵＲＬに付与されたＭＤを集計し、統計処理を施し
た結果を保持する。図９に、Ｓ−ＵＲＬ−ＭＤ管理テー
ブル２７１の一例を示す。同テーブル２７１は、図９に
示すように、システムＵＲＬ（Ｓ−ＵＲＬ）２７２及び
ＭＤリスト２７３のフィールドを含む。Ｓ−ＵＲＬ２７
２は、システムが管理するＵＲＬ情報のＩＤへの参照で
あるＳ−ＵＲＬ−ＩＤ（１）乃至（Ｎ）を格納するフィ
ールドである。ＭＤリスト（又はＭＤセット）２７３
は、各ユーザがＵＲＬをフォルダに割り当てるたびにＭ
Ｄリスト２６３として割り当てられたＭＤを、当該ＵＲ
Ｌに関して全ユーザ分集計したＭＤセットを格納するフ
ィールドである。S-URL-MD management database 270
Is the S-URL-M that manages the MD assigned to the URL
The D management table 271 is stored. Table 271
The system collects the MDs assigned to U-URLs having the same S-URL-ID collected from the U-URL-MD management table 261 of all users by the system, and holds the result of performing statistical processing. FIG. 9 shows an example of the S-URL-MD management table 271. The table 271 includes fields of a system URL (S-URL) 272 and an MD list 273 as shown in FIG. S-URL27
Reference numeral 2 denotes a field for storing S-URL-IDs (1) to (N), which are references to IDs of URL information managed by the system. MD list (or MD set) 273
Means that each time each user assigns a URL to a folder,
The MD assigned as the D list 263 is
This is a field for storing an MD set for L for all users.

【００４８】特定ＵＲＬに対するユーザ別ＭＤ管理デー
タベース２８０は、各ユーザが選択したＵＲＬと当該Ｕ
ＲＬに割り当てたメタデータセットの関係を示すテーブ
ル２８１を格納する。図１０に、同テーブル２８１の一
例を示す。同テーブル２８１は、特定のＵＲＬに対して
複数のユーザ（図１０においてはＮ人）の各々が割り当
てたＭＤを整理して保持している。なお、図１０は、便
宜上、ユーザ１乃至４のみが任意に付与したＭＤのみを
示している。The user-specific MD management database 280 for a specific URL stores the URL selected by each user and the
A table 281 indicating the relationship between the metadata sets assigned to the RL is stored. FIG. 10 shows an example of the table 281. The table 281 organizes and holds MDs assigned to a specific URL by a plurality of users (N in FIG. 10). FIG. 10 shows only MDs arbitrarily given by only the users 1 to 4 for convenience.

【００４９】辞書管理データベース２９０は、辞書とフ
ィーチャーとを格納する。辞書は、比較的少数の基本語
で全ての単語を説明している英英辞典などが選択され
る。例えば、「ＬｏｎｇｍａｎＤｉｃｔｉｏｎａｒｙ
ｏｆＣｏｎｔｅｍｐｏｒａｒｙＥｎｇｌｉｓｈ」（以
下、「ＬＤＣＥ」と略す。）という英英辞典は２０００
語の基本語を有し、これらの基本語によって全ての単語
（又は見出し語）を説明している。また、別の英英辞典
である「ＴｈｅＧｅｎｅｒａｌＢａｓｉｃＥｎｇｌｉ
ｓｈＤｉｃｔｉｏｎａｒｙ（以下、「ＧＢＥＤ」と略
す。）は８５０語の基本語を有する。そこで、例えば、
ＬＤＣＥを辞書として選択し、２０００語の基本語をフ
ィーチャーに選択するとＬＤＣＥに単語として出ている
全てのＭＤはフィーチャーによって表現することができ
る。同様に、ＧＢＥＤを辞書として選択し、８５０語の
基本語をフィーチャーに選択すると、ＧＢＥＤに単語と
して出ている全てのＭＤはフィーチャーによって表現す
ることができる。図４を参照して上述したように、ＭＤ
管理データベース２２０は辞書に出ている全ての単語を
ＭＤに選択してテーブル２２１を作成してもよいし、適
当な数のＭＤのみ選択してテーブル２２１を作成しても
よい。後者の場合には、ＣＰＵ１１０は、辞書管理デー
タベース２９０の辞書を参照することによって新規のＭ
ＤをいつでもＭＤ管理データベース２２０に加えること
ができる。Ｂ．データ記憶装置２００へのデータの登録次に、データ記憶装置２００の上述した各種データベー
スにデータを登録、更新及び登録の抹消の方法及び時期
について説明する。The dictionary management database 290 stores dictionaries and features. As the dictionary, an English-English dictionary that explains all words with a relatively small number of basic words is selected. For example, "Longman Dictionary"
The English dictionary of "Contemporary English" (hereinafter abbreviated as "LDCE") is 2000.
It has the base words of the words, which describe all the words (or headwords). Also, another English-English dictionary, "The General BasicEngli"
sh Dictionary (hereinafter abbreviated as "GBED") has 850 base words. So, for example,
When LDCE is selected as a dictionary and 2,000 basic words are selected as features, all MD appearing as words in LDCE can be represented by features. Similarly, if GBED is selected as a dictionary and 850 basic words are selected as features, all MDs appearing as words in GBED can be represented by features. As described above with reference to FIG.
The management database 220 may create the table 221 by selecting all words appearing in the dictionary as MDs, or may create the table 221 by selecting only an appropriate number of MDs. In the latter case, the CPU 110 refers to the dictionary in the dictionary management database 290 to create a new M
D can be added to the MD management database 220 at any time. B. Registration of Data in Data Storage Device 200 Next, a method and timing of registering, updating, and deleting registration in the above-described various databases of the data storage device 200 will be described.

【００５０】ユーザ管理データベース２１０のデータは
クライアント１０が情報検索装置１００にユーザ登録を
する際にユーザによって新規に作成される。但し、ＵＩ
Ｄ２１２、ＨＯＭＥフォルダ２１５はユーザによるデー
タ入力後に、システム管理者の手作業入力により又はＣ
ＰＵ１１０により自動的に、割り当てられる。クライア
ント１０は、ＣＰＵ１１０と交信することによって何時
でも、自己の情報（ユーザ名、プロファイル情報など）
を変更することができる。かかる変更は、住所、電話番
号を変更した場合、結婚して名字が変更した場合その他
の場合に行われるであろう。クライアント１０はＣＰＵ
１１０と交信することによって何時でも、若しくは、Ｃ
ＰＵ１１０により自動的に又はシステム管理者による手
作業入力により、所定の場合（例えば、クライアント１
０が偽造カードを使用した場合など）に、データの登録
を抹消することができる。なお、ＣＰＵ１１０により自
動的に又はシステム管理者による手作業入力により、所
定の場合にデータの登録を抹消される場合であっても、
ＣＰＵ１１０は、メモリ（例えば、ＲＯＭ１４０）の所
定の領域に、再発防止などセキュリティ上の目的から、
当該クライアント１０の情報を格納してもよい。従っ
て、本実施例では、「登録の抹消」は、このようにデー
タを完全に消去しない場合も含む。また、本実施例では
「ＣＰＵ１１０と更新することによって」は、クライア
ント１０がシステム管理者とオフラインで更新する場合
も含む。The data in the user management database 210 is newly created by the user when the client 10 performs user registration in the information search device 100. However, UI
D212, the HOME folder 215 is manually input by the system administrator after the user inputs data or C
It is automatically assigned by the PU 110. The client 10 communicates with the CPU 110 at any time to obtain its own information (user name, profile information, etc.).
Can be changed. Such changes may be made if the address, telephone number, marriage, surname, etc. change. Client 10 is CPU
At any time by communicating with 110 or C
In a predetermined case (for example, the client 1) automatically by the PU 110 or by manual input by the system administrator.
(For example, when 0 uses a counterfeit card), the registration of data can be deleted. Note that even if the registration of data is deleted in a predetermined case automatically by the CPU 110 or by manual input by a system administrator,
The CPU 110 stores a predetermined area of the memory (for example, the ROM 140) in a predetermined area for security purposes such as prevention of recurrence.
The information of the client 10 may be stored. Therefore, in the present embodiment, "deletion of registration" includes a case where data is not completely deleted. In the present embodiment, “by updating with the CPU 110” also includes a case where the client 10 updates offline with the system administrator.

【００５１】ＭＤ管理データベース２２０のデータは、
システム管理者が辞書を参照して手作業入力により又は
ＣＰＵ１１０が辞書管理データベース２９０を参照して
自動的に、ユーザ登録とは無関係な時期に作成される。
当該データは、辞書が変更された場合、フィーチャーの
数や割り当てが変更された場合などに更新される。ＭＤ
管理データベース２２０の主要なデータは、通常は、い
ずれのユーザも登録する前に形成されるであろう。新し
いＭＤが造語等として社会的に形成されればＣＰＵ１１
０は随時ＩＤを当該ＭＤに割り当て、システム管理者は
辞書を参照してフィーチャーリスト２２３を割り当て
る。ＭＤ管理データベース２２０のデータの登録は通常
抹消されないが、二重入力の場合、システム管理者が特
定のＭＤを社会的に死語であると考えた場合などには抹
消される場合がある。The data of the MD management database 220 is
It is created by a manual input by the system administrator referring to the dictionary or automatically by the CPU 110 by referring to the dictionary management database 290 at a time independent of user registration.
The data is updated when the dictionary is changed or when the number or allocation of features is changed. MD
The main data in the management database 220 will typically be formed before any users register. If a new MD is socially formed as a coined word, etc., the CPU 11
0 assigns an ID to the MD at any time, and the system administrator assigns the feature list 223 by referring to the dictionary. The registration of the data in the MD management database 220 is not usually deleted, but in the case of double input, it may be deleted when the system administrator considers a specific MD to be a dead word socially.

【００５２】フォルダ管理データベース２３０のデータ
は、システムデフォルト型であれば、システム管理者の
手作業入力により又はＣＰＵ１１０により自動的に、原
則としていずれのユーザも登録する前に形成される。但
し、図１３を参照して後述されるように、複数のユーザ
に対してフォルダにＭＤを割り当てることを依頼するこ
ともできる。一方、フォルダ管理データベース２３０
は、ユーザ依存型であれば、クライアント１０がユーザ
登録をした時に形成される（但し、フォルダＩＤ２３２
等の一部のデータは、システム管理者の手作業入力によ
り又はＣＰＵ１１０により自動的に、形成される）。フ
ォルダ管理データベース２３０は、特に、ユーザ依存型
について、ＵＲＬ及び／又は説明を追加及び変更した場
合、親子フォルダを作成した場合などは作成時に親子フ
ォルダリスト２３８ａ及びｂのデータが変更される。特
定のユーザやコミュニティにアクセスを制限した場合は
アクセス権を設定した際に属性値２３９のデータが更新
される。クライアント１０はＣＰＵ１１０と交信するこ
とによって何時でも、若しくは、ＣＰＵ１１０はシステ
ム管理者による入力によって所定の場合（例えば、クラ
イアント１０の登録を抹消することによって対応するフ
ォルダデータも抹消する場合など）に、データの登録を
抹消することができる。If the data of the folder management database 230 is a system default type, it is formed by manual input of the system administrator or automatically by the CPU 110 before any users are registered in principle. However, as described later with reference to FIG. 13, a plurality of users can be requested to assign an MD to a folder. On the other hand, the folder management database 230
Is formed when the client 10 registers the user if the user is of a user-dependent type (however, the folder ID 232
Some data is formed by manual input of a system administrator or automatically by the CPU 110). The folder management database 230 changes the data of the parent-child folder lists 238a and 238b when the URL and / or description is added and changed, and when a parent-child folder is created, especially when the user-dependent type is created. When the access is restricted to a specific user or community, the data of the attribute value 239 is updated when the access right is set. The client 10 communicates with the CPU 110 at any time, or in a predetermined case (for example, when the corresponding folder data is deleted by deleting the registration of the client 10) by an input from the system administrator. Registration can be deleted.

【００５３】Ｕ−ＵＲＬ管理データベース２４０は、ユ
ーザがＵＲＬをフォルダに割り当てた場合（例えば、フ
ォルダ管理データベースのデータ作成時に）ＣＰＵ１１
０により自動的に作成され、フォルダ管理データベース
のデータ更新時に更新される。Ｕ−ＵＲＬ管理データベ
ース２４０はユーザ登録の抹消時に抹消される。The U-URL management database 240 is used when the user assigns a URL to a folder (for example, when creating data in the folder management database).
0 automatically created and updated when the data in the folder management database is updated. The U-URL management database 240 is deleted when the user registration is deleted.

【００５４】Ｓ−ＵＲＬ管理データベース２５０は、シ
ステム管理者の手作業入力によって、又は、後述するよ
うにユーザによる割り当て処理を通じてＣＰＵ１１０に
よって自動的に、作成される。Ｓ−ＵＲＬ管理データベ
ース２５０は、例えば、Ｓ−ＵＲＬ−ＭＤ管理データベ
ース２７０に合わせて更新されてもよい。例えば、ＵＲ
Ｌが指すコンテンツが更新されるなどしてページＭＤ２
５５が古くなった場合に、最近多くのユーザが割り当て
たＭＤリスト２７３にページＭＤ２５５を置き換えるな
どである。Ｓ-ＵＲＬ管理データベース２５０のデータ
は、ＵＲＬの抹消時に抹消される。The S-URL management database 250 is created by manual input by a system administrator or automatically by the CPU 110 through an allocation process by a user as described later. The S-URL management database 250 may be updated, for example, in accordance with the S-URL-MD management database 270. For example, UR
Page MD2, for example, when the content pointed to by L is updated
When 55 becomes old, the page MD 255 is replaced with the MD list 273 recently assigned by many users. The data in the S-URL management database 250 is deleted when the URL is deleted.

【００５５】以下、図１１を参照して、複数のユーザが
特定のＵＲＬに対して当該ＵＲＬが指すコンテンツを表
現するＭＤを割り当て、かかる処理を通じてＳ−ＵＲＬ
管理データベース２５０が形成される場合について説明
する。ここで、図１１はクライアント１０と情報検索装
置１００との間で行われる、本発明の特定のＵＲＬに対
するＭＤの割り当て方法の一例を示すフローチャートで
ある。Hereinafter, with reference to FIG. 11, a plurality of users assign an MD representing a content indicated by the URL to a specific URL, and perform S-URL
The case where the management database 250 is formed will be described. Here, FIG. 11 is a flowchart illustrating an example of a method of allocating an MD to a specific URL according to the present invention, which is performed between the client 10 and the information search device 100.

【００５６】まず、情報検索装置１００のＣＰＵ１１０
が、通信ポート１２０、インターネット３０を介し、複
数のクライアント１０に予め一以上のＭＤ（即ち、メタ
データセット）が付与された複数のフォルダと特定のＵ
ＲＬを提供する（ステップ２００２）。なお、提供方法
は、代替的に、ファックス、郵送その他の手段を使用す
るオフライン方法であってもよい。各フォルダには、当
該フォルダを特徴付ける一以上の共通又は相違するＭＤ
が予め割り当てられている。フォルダに対するＭＤの割
り当ては、システム管理者が行ってもよいし、図１３を
参照して後述されるように、クライアント１０が行って
もよい。なお代替的に、フォルダを使用する代わりにメ
タデータのみを使用することもできる。First, the CPU 110 of the information search device 100
However, via the communication port 120 and the Internet 30, a plurality of folders in which one or more MDs (that is, a metadata set) are added in advance to a plurality of clients 10 and a specific U
An RL is provided (step 2002). Alternatively, the providing method may be an off-line method using fax, mail, or other means. Each folder has one or more common or different MDs that characterize the folder.
Are assigned in advance. The assignment of the MD to the folder may be performed by the system administrator, or may be performed by the client 10 as described later with reference to FIG. Alternatively, instead of using folders, only metadata can be used.

【００５７】フォルダを提供された複数のクライアント
１０の各々はＵＲＬをキーボードによって入力するか、
ハイパーリンクされたＵＲＬをマウスでクリックするこ
とによってＵＲＬが指すコンテンツを取得して検討する
（ステップ１００２）。Each of the plurality of clients 10 provided with the folder inputs the URL with the keyboard, or
By clicking the hyperlinked URL with the mouse, the content pointed to by the URL is acquired and examined (step 1002).

【００５８】次に、ユーザは、コンテンツの内容を表す
フォルダを、情報検索装置１００から取得したフォルダ
の中から選択して割り当て、情報検索装置１００に割り
当てたフォルダを送信する（ステップ１００４）。これ
により、クライアント１０は、フォルダを当該ＵＲＬに
割り当てると共にかかるフォルダに含まれるメタデータ
を当該ＵＲＬに割り当てることができる。フォルダを使
用する代わりにメタデータのみを使用する場合にはクラ
イアント１０は一以上のメタデータを割り当てることに
なるであろう。Next, the user selects and assigns a folder representing the contents of the content from the folders obtained from the information search device 100, and transmits the assigned folder to the information search device 100 (step 1004). This allows the client 10 to assign a folder to the URL and to assign metadata included in the folder to the URL. If only metadata were used instead of folders, the client 10 would allocate one or more metadata.

【００５９】ＣＰＵ１１０は、複数のユーザから複数の
フォルダを取得するが、各フォルダをメタデータに分解
し（ステップ２００４）、図１０に示す特定ＵＲＬに対
するユーザ別ＭＤデータテーブル２８１を形成する。図
１０においては、ユーザ１はＭＤ１、ＭＤ４、ＭＤ５及
びＭＤ６を有するフォルダを選択し、ユーザ２はＭＤ
２、ＭＤ４、ＭＤ６及びＭＤ７を有するフォルダを選択
し、ユーザ３はＭＤ３、ＭＤ４、ＭＤ５及びＭＤ７を有
するフォルダを選択し、ユーザ４はＭＤ８を有するフォ
ルダを選択している。上述したように、説明の便宜上、
その他のユーザとそのユーザが選択したＭＤについては
図１０において図示が省略されている。The CPU 110 obtains a plurality of folders from a plurality of users, decomposes each folder into metadata (step 2004), and forms a user-specific MD data table 281 for the specific URL shown in FIG. In FIG. 10, user 1 selects a folder having MD1, MD4, MD5 and MD6, and user 2
2, a folder having MD4, MD6 and MD7, a user 3 has selected a folder having MD3, MD4, MD5 and MD7, and a user 4 has selected a folder having MD8. As described above, for convenience of explanation,
Other users and the MD selected by the user are not shown in FIG.

【００６０】選択的に、ＣＰＵ１１０は次に重み付けを
行う（ステップ２００６）。重み付けは特定のユーザの
データを他のユーザのデータよりも高く評価する処理で
ある。例えば、ユーザ１だけ所定量（例えば、１．５
倍）だけ重み付けることができる。どのような重み付け
を行うかはシステムの管理者が自由に設定することがで
きるが、通常は、コンテンツに関する知識、利用頻度な
どを基準にする。例えば、コンテンツが特定の遺跡を表
していれば考古学者ユーザが割り当てたメタデータを他
のユーザよりも重み付けしたり、コンテンツが着物を表
していれば女性ユーザが割り当てたメタデータを重み付
けしたりするなどである。Optionally, CPU 110 next weights (step 2006). Weighting is a process of evaluating data of a specific user higher than data of other users. For example, a predetermined amount (for example, 1.5
Times). The type of weighting can be freely set by the system administrator, but is usually based on knowledge about the content, usage frequency, and the like. For example, if the content represents a specific archeological site, the metadata assigned by the archeologist user may be weighted more than other users, and if the content represents a kimono, the metadata assigned by the female user may be weighted. And so on.

【００６１】次に、ＣＰＵ１１０は、評価付け又はラン
キングを行う（ステップ２００８）。例えば、ＣＰＵ１
１０はユーザ毎に重み付けしてＭＤ毎に得点を集計す
る。具体的には、ユーザ１を１．５倍に重み付けして、
ユーザ１が選択したＭＤ１、ＭＤ４、ＭＤ５及びＭＤ６
を１．５点、その他のユーザが選択したＭＤを１点等と
して各ＭＤが取得した得点を計算し、特典の高い順にＭ
Ｄを並べ替える。Next, the CPU 110 performs evaluation or ranking (step 2008). For example, CPU1
Numeral 10 weights each user and totals scores for each MD. Specifically, user 1 is weighted 1.5 times,
MD1, MD4, MD5 and MD6 selected by user 1
Is calculated as 1.5 points, and the MD selected by other users is set as one point, and the score obtained by each MD is calculated.
Rearrange D.

【００６２】次に、ランキング上位（即ち、得点の高い
所定数）のＭＤを当該ＵＲＬ用のＭＤとして選択する
（ステップ２０１０）。選択されるＭＤの数はシステム
管理者が任意に設定することができる。例えば、最高得
点を有するＭＤ４のみをＣＰＵ１１０は選択することが
できる。次に、ＣＰＵ１１０は、選択されたＭＤを図７
に示すＳ−ＵＲＬデータテーブル２５１のページＭＤ２
５５に格納してＭＤ割り当て処理を終了する（ステップ
２０１２）。Next, the MD with the highest ranking (that is, a predetermined number with a high score) is selected as the MD for the URL (step 2010). The number of MDs to be selected can be arbitrarily set by a system administrator. For example, the CPU 110 can select only the MD4 having the highest score. Next, the CPU 110 stores the selected MD in FIG.
Page MD2 of the S-URL data table 251 shown in FIG.
55, and the MD allocation process ends (step 2012).

【００６３】このように、かかる割り当て方法によれ
ば、ＣＰＵ１１０は、各クライアント１０が特定のＵＲ
Ｌが指す特定のコンテンツを表現するために割り当てた
一以上のＭＤを（フォルダ単位で）複数受け取り、かか
るＭＤに基づいてＵＲＬのＭＤを決定する。従って、Ｍ
Ｄを決定する主体はシステム管理者ではなくてシステム
を利用して情報検索を行うユーザである。また、ＣＰＵ
１１０は、複数のユーザから複数のＭＤを取得して、そ
の中からＵＲＬのＭＤを決定するために、決定されたＭ
Ｄはユーザ総体の意見を反映している。このため、かか
るＭＤは検索用情報として適している。As described above, according to this allocation method, the CPU 110 determines that each client 10 has a specific UR
A plurality of one or more MDs (in folder units) assigned to represent the specific content pointed to by L are received, and the MD of the URL is determined based on the MDs. Therefore, M
The subject who determines D is not a system administrator but a user who searches for information using the system. Also, CPU
110 obtains a plurality of MDs from a plurality of users, and determines the determined MD in order to determine the MD of the URL from among them.
D reflects the opinion of the entire user. Therefore, such MD is suitable as search information.

【００６４】なお、上述の割り当て処理の結果、Ｓ−Ｕ
ＲＬ管理データベース２５０のデータが形成されるが、
この時Ｕ−ＵＲＬ管理データベース２４０のデータは形
成されない。なぜなら、ユーザは当該ＵＲＬにＭＤを割
り当てるために協力したに過ぎず、自己が所望するＵＲ
Ｌとして自己のフォルダに加えたわけではないからであ
る。Note that, as a result of the above-described assignment processing, the SU
The data of the RL management database 250 is formed,
At this time, the data of the U-URL management database 240 is not formed. Because the user merely cooperated in assigning the MD to the URL, the user did not
This is because L is not added to its own folder.

【００６５】Ｕ−ＵＲＬ−ＭＤ管理データベース２６０
は、ユーザがＵＲＬをフォルダに割り当てるたびに形成
され、ＭＤリスト２６３にはＭＤリスト２３５と同一の
ＭＤが割り当てられる。Ｓ−ＵＲＬ−ＭＤ管理テーブル
２７１は、Ｕ−ＵＲＬ−ＭＤ管理テーブル２６１が形成
されるたびに更新される。特定ＵＲＬに対するユーザ別
ＭＤ管理データベース２８０は、例えば、上述した特定
のＵＲＬにＭＤを割り当てる処理時やＵ−ＵＲＬ−ＭＤ
管理テーブル２６１の作成時に形成される。また、管理
データベース２６０乃至２８０のデータは、システム管
理者又はＣＰＵ１１０が当該ＵＲＬの選択を削除した場
合、当該ＵＲＬがインターネット３０（より詳細には対
応するＩＳＰ）から削除された場合、全てのクライアン
ト１０が当該ＵＲＬに関して登録を抹消した場合などに
抹消される。The U-URL-MD management database 260
Is formed every time a user assigns a URL to a folder, and the same MD as the MD list 235 is assigned to the MD list 263. The S-URL-MD management table 271 is updated each time the U-URL-MD management table 261 is formed. The user-specific MD management database 280 for the specific URL may be used, for example, at the time of processing for allocating an MD to the specific URL or the
It is formed when the management table 261 is created. The data in the management databases 260 to 280 are stored in all the clients 10 when the system administrator or the CPU 110 deletes the selection of the URL, or when the URL is deleted from the Internet 30 (more specifically, the corresponding ISP). Is deleted when the registration of the URL is deleted.

【００６６】以下、図１３を参照して、特定のフォルダ
に、当該フォルダを特徴付けるＭＤの割り当てについて
説明する。ここで、図１３はクライアント１０と情報検
索装置１００との間で行われる、本発明の特定のフォル
ダに対するメタデータの割り当て方法の一例を示すフロ
ーチャートである。Referring to FIG. 13, the assignment of an MD characterizing a particular folder to a specific folder will be described below. Here, FIG. 13 is a flowchart illustrating an example of a method of assigning metadata to a specific folder according to the present invention, which is performed between the client 10 and the information search device 100.

【００６７】まず、情報検索装置１００のＣＰＵ１１０
が、通信ポート１２０、インターネット３０を介し、複
数のクライアント１０に特定のフォルダを提供する（ス
テップ２１０２）。なお、提供方法は、代替的に、ファ
ックス、郵送その他の手段を使用するオフライン方法で
あってもよい。フォルダにはフォルダ名（例えば、「学
校」）は付与されているが、ＭＤが付与されていない。First, the CPU 110 of the information search device 100
Provides a specific folder to a plurality of clients 10 via the communication port 120 and the Internet 30 (step 2102). Alternatively, the providing method may be an off-line method using fax, mail, or other means. The folder is given a folder name (for example, “school”), but is not given an MD.

【００６８】次に、フォルダを提供されたクライアント
１０は、フォルダ名から連想するＭＤ（例えば、「勉
強」、「部活」、「登校」、「楽しい」等）を当該フォ
ルダに割り当て、情報検索装置１００に割り当てたフォ
ルダを送信する（ステップ１１０２）。なお、本実施例
ではクライアント１０が割り当てるＭＤは情報検索装置
１００がサポートしている範囲内であり（換言すれば、
当該ＭＤは図４に示すＭＤ管理テーブル２２１に載って
おり）、クライアント１０が割り当てたＭＤをＣＰＵ１
１０は理解することができるものとする。クライアント
１０が割り当てるＭＤを情報検索装置１００がサポート
していない場合には、後述するように、ＣＰＵ１１０は
当該ＭＤを、辞書管理データベース２９０の辞書を参照
してベクトル化し、当該ベクトルと最も近いサポートし
ている（即ち、図４に示すＭＤ管理テーブル２２１に載
っている）ＭＤに置換する。当該ＭＤが辞書に出ていな
ければＣＰＵ１１０はクライアント１０に別の単語の再
入力を促すか当該単語を無視する。Next, the client 10 provided with the folder assigns an MD (for example, “study”, “club activity”, “school”, “fun”, etc.) associated with the folder name to the folder, The folder assigned to 100 is transmitted (step 1102). In the present embodiment, the MD assigned by the client 10 is within the range supported by the information search device 100 (in other words,
The MD is listed in the MD management table 221 shown in FIG. 4), and the MD assigned by the client 10 is stored in the CPU 1
10 shall be understandable. If the information search device 100 does not support the MD assigned by the client 10, the CPU 110 converts the MD into a vector with reference to the dictionary of the dictionary management database 290 and supports the MD closest to the vector, as described later. (Ie, listed in the MD management table 221 shown in FIG. 4). If the MD does not appear in the dictionary, CPU 110 prompts client 10 to re-enter another word or ignores the word.

【００６９】ＣＰＵ１１０は、複数のユーザから複数の
ＭＤを取得し、選択的に上述した重み付けを行う（ステ
ップ２１０４）。例えば、フォルダ名が「学校」であれ
ば教師ユーザ及び学生ユーザが割り当てたメタデータを
他のユーザよりも重み付けするなどである。次に、ＣＰ
Ｕ１１０は、評価付け又はランキングを行い（ステップ
２１０６）、ランキング上位（即ち、得点の高い所定
数）のＭＤを当該フォルダ用のＭＤとして選択する（ス
テップ２１０８）。選択されるＭＤの数はシステム管理
者が任意に設定することができる。次に、ＣＰＵ１１０
は、選択されたＭＤを図５に示すフォルダデータテーブ
ル２５１のＭＤリスト２３５に格納してＭＤ割り当て処
理を終了する（ステップ２０１０）。The CPU 110 acquires a plurality of MDs from a plurality of users and selectively performs the above-mentioned weighting (step 2104). For example, if the folder name is “school”, the metadata assigned by the teacher user and the student user is weighted more than other users. Next, CP
U110 performs the evaluation or ranking (step 2106), and selects the MD with the highest ranking (that is, a predetermined number with a high score) as the MD for the folder (step 2108). The number of MDs to be selected can be arbitrarily set by a system administrator. Next, the CPU 110
Stores the selected MD in the MD list 235 of the folder data table 251 shown in FIG. 5, and ends the MD assignment processing (step 2010).

【００７０】このように、かかる割り当て方法によれ
ば、ＣＰＵ１１０は、各クライアント１０が特定のフォ
ルダを特徴付けるために割り当てた一以上のＭＤを複数
受け取り、かかるＭＤに基づいてフォルダのＭＤを決定
する。従って、ＭＤを決定する主体はシステム管理者で
はなくてシステムを利用して情報検索を行うユーザであ
る。また、ＣＰＵ１１０は、複数のユーザから複数のＭ
Ｄを取得して、その中からフォルダのＭＤを決定するた
めに、決定されたＭＤはユーザ総体の意見を反映してい
る。このため、かかるＭＤは検索用情報として適してい
る。Ｃ．特定のＵＲＬのベクトル化以下、図１４を参照して、特定のＵＲＬのベクトル化に
ついて説明する。ここで、図１４は、情報検索装置１０
０によって行われる、本発明の特定のＵＲＬのベクトル
化方法の一例を示すフローチャートである。情報検索装
置１００は常に全てのＵＲＬ、ＭＤ、フォルダなどをベ
クトル化するわけではなく、必要に応じてかかる処理を
行うだけである。As described above, according to the assignment method, the CPU 110 receives a plurality of one or more MDs assigned by each client 10 to characterize a specific folder, and determines the MD of the folder based on the MD. Therefore, the subject who determines the MD is not the system administrator but the user who searches for information using the system. In addition, the CPU 110 transmits a plurality of Ms from a plurality of users.
In order to obtain D and determine the MD of the folder from among them, the determined MD reflects the opinion of the entire user. Therefore, such MD is suitable as search information. C. Vectorization of Specific URL Hereinafter, vectorization of a specific URL will be described with reference to FIG. Here, FIG.
6 is a flowchart illustrating an example of a method of vectorizing a specific URL according to the present invention, which is performed by 0. The information search apparatus 100 does not always vectorize all URLs, MDs, folders, and the like, but only performs such processing as necessary.

【００７１】まず、ＣＰＵ１１０は、特定のＵＲＬを特
徴付ける各ＭＤ（即ち、ページＭＤ２５５のＭＤ）を、
ＭＤ管理データベース２２０のＭＤ管理テーブル２２１
を参照してベクトル化する（ステップ２２０２）。特定
のＵＲＬを特徴付ける各ＭＤは、システム管理者によっ
て又は図１１を参照して説明した割り当て方法によって
予め割り当てられているものとする。次いで、ＣＰＵ１
１０は、各ＭＤに対応するベクトルを合成して当該特定
のＵＲＬを表現する一のベクトルを決定する（ステップ
２２０４）。例えば、ＭＤセットがＭＤ１及びＭＤ２と
いう２つのＭＤからなり、それぞれのフィーチャーリス
トが、ＭＤ１＝（ｆ１_a，ｆ２_a，…，ｆｎ_a）、ＭＤ２
＝（ｆ１_b，ｆ２_b，…，ｆｎ_b）で表現される場合、
合成ベクトルＭはＭ＝（ｆ１_a＋ｆ１_b，ｆ２_a＋ｆ
２_b，…，ｆｎ_a＋ｆｎ_b）となる。First, the CPU 110 stores each MD characterizing a specific URL (that is, the MD of the page MD255)
MD management table 221 of MD management database 220
To make a vector (step 2202). It is assumed that each MD characterizing a specific URL has been assigned in advance by the system administrator or by the assignment method described with reference to FIG. Then, CPU1
10 determines one vector expressing the specific URL by combining the vectors corresponding to the MDs (step 2204). For example, MD set consists of two MD of MD1 and MD2, each feature _{list, MD1 = (f1 a, f2} a, ..., fn a), MD2
= (F1 _b , f2 _b ,..., Fn _b )
Resultant vector M is _{_{M = (f1 a + f1 b}} , f2 a + f
2 _b, ..., the fn _a + fn _b).

【００７２】なお、ＭＤのベクトル化は、通常、ＭＤ管
理データベース２２０を参照して行われるが、ＭＤ管理
データベース２２０にないＭＤであれば辞書管理データ
ベース２９０を参照してフィーチャー化及びベクトル化
し、最も近いベクトルを有するＭＤをＭＤ管理データベ
ース２２０から選択してこれと置換する。また、メタデ
ータセットによって特徴付けられているフォルダのベク
トル化も同様に行うことができる。The vectorization of the MD is usually performed with reference to the MD management database 220. If the MD is not in the MD management database 220, the MD is vectorized by referring to the dictionary management database 290, An MD having a close vector is selected from the MD management database 220 and replaced. Also, vectorization of folders characterized by a metadata set can be performed similarly.

【００７３】ＵＲＬ、フォルダ及びＭＤのベクトル化
は、ユーザに、当該ＵＲＬと同一又は類似のベクトルを
有する他のＵＲＬ、他のユーザ及びコミュニティを提供
する際の検索用感性データとして機能する。即ち、ユー
ザは自己が選択したＵＲＬをクエリーとし、その他に多
くのデータを入力することなく、当該ＵＲＬと同種のＵ
ＲＬ、当該ＵＲＬに興味を有する他のユーザやコミュニ
ティを取得することができる。ＩＩＩ情報検索以下、本発明の情報検索装置１００が行う幾つかの例示
的な情報検索方法について説明する。クライアント１０
が情報検索装置１００を利用して検索して取得すること
ができる情報は、クライアント１０が所望するコンテン
ツ及びその付随情報、当該コンテンツを有する又は有す
る可能性の高いサイト２０のＵＲＬ、クライアント１０
が希望する条件を満足する又は満足する可能性の高い他
のクライアント１０及び／又はコミュニティ、クライア
ント１０自身について登録されている情報を含むが、こ
れらに限定されるものではない。また、情報検索装置１
００は、検索された情報に派生する情報やサービスを広
く提供することができる。これらのサービスは、例え
ば、クライアント１０が所望する画家の作品を展示して
いる美術館に関する情報、アーチストのライブチケッ
ト、コミュニティへの申し込み、書籍の予約などを含
む。以下に、代表的な検索を幾つか説明する。Ａ．他のクライアント及びコミュニティの検索情報検索装置１００は、類似の感性を有する一又は複数
の他のクライアント１０（即ち、集団としてのコミュニ
ティ）をクライアント１０に提供することができる。コ
ミュニティは、コミュニティを構成する複数のクライア
ント１０として識別されてもよいし、コミュニティ自体
（例えば、音楽バンド名）として識別されてもよい。情
報検索装置１００は、システム管理者が自発的に又はク
ライアント１０からの要求に応じてコミュニティ（ユー
ザグループ）を生成する。前者の場合は、コミュニティ
とみなされる複数のユーザに対して、興味があると思わ
れる情報や宣伝広告（例えば、明るいミュージカルに興
味があるユーザに新作ミュージカルを案内するなど）を
提供するのに便宜であろう。以下では、コミュニティの
検索について説明するが他のクライアントの検索も同様
である。The vectorization of the URL, the folder, and the MD functions as search sensitivity data when the user is provided with another URL, another user, and a community having the same or similar vector as the URL. That is, the user uses the URL selected by the user as a query, and inputs the same type of URL as the URL without inputting much data.
RL and other users and communities interested in the URL can be obtained. III Information Retrieval Hereinafter, some exemplary information retrieval methods performed by the information retrieval device 100 of the present invention will be described. Client 10
The information that can be searched and obtained by using the information search device 100 includes the content desired by the client 10 and its accompanying information, the URL of the site 20 that has or is likely to have the content, the client 10
Includes, but is not limited to, information registered about other clients 10 and / or communities that satisfy or are likely to satisfy desired conditions, and the clients 10 themselves. In addition, the information search device 1
00 can widely provide information and services derived from the searched information. These services include, for example, information about museums exhibiting works of the painter desired by the client 10, live tickets for artists, applications to the community, book reservations, and the like. Hereinafter, some typical searches will be described. A. Searching for Other Clients and Communities The information search device 100 can provide the client 10 with one or more other clients 10 (ie, a community as a group) having a similar sensibility. The community may be identified as a plurality of clients 10 constituting the community, or may be identified as the community itself (for example, a music band name). The information search device 100 generates a community (user group) spontaneously by a system administrator or in response to a request from the client 10. In the former case, it is convenient to provide a plurality of users considered as a community with information and advertisements that are considered to be interesting (for example, to guide a user interested in a bright musical to a new musical). Will. Hereinafter, a search for a community will be described, but the same applies to a search for another client.

【００７４】ＣＰＵ１１０は、例えば、図１１に示す処
理やＵ−ＵＲＬ―ＭＤ管理テーブル２６１の作成を通じ
て図１０に示す特定ＵＲＬに対するユーザ別ＭＤ管理テ
ーブル２６１を取得する。この結果、ＣＰＵ１１０は、
テーブル２６１に基づいてユーザ１乃至３をメンバーと
して有するコミュニティを生成する。かかる様子は、図
１２を参照すれば容易に理解できるであろう。ここで、
図１２は、図１０に示すテーブル２６１を集合として表
した図である。The CPU 110 acquires the user-specific MD management table 261 for the specific URL shown in FIG. 10, for example, through the processing shown in FIG. 11 and the creation of the U-URL-MD management table 261. As a result, the CPU 110
Based on the table 261, a community having users 1 to 3 as members is generated. Such a situation can be easily understood with reference to FIG. here,
FIG. 12 is a diagram showing the table 261 shown in FIG. 10 as a set.

【００７５】本実施例では、少なくとも一の共通するＭ
Ｄを割り当てたクライアント１０を同一のコミュニティ
に含めているために重み付けは行っていない。性別、年
齢、出身地によってコミュニティの範囲を制限する必要
がある場合には目的としない対象（例えば、男性や２０
歳以上）のユーザにゼロを掛けて排除するなど重み付け
を行ってもよいが、情報検索装置１００は通常のキーワ
ードによる検索も行えるので必要があればクライアント
１０はキー入力により「女性のみ」などの検索条件を入
力してコミュニティの範囲を狭めることができる。ま
た、「共通するＭＤ」の範囲はシステム管理者及び／又
はクライアント１０が自由に設定することができる。例
えば、「高い」というＭＤと「やや高い」というＭＤを
共通するとみなしてもよいし、共通しないとみなしても
よい。ステップ２３０４における所定数もシステム管理
者及び／又はクライアント１０が自由に設定することが
できる。所定数が一であれば一の他のクライアント１０
の検索となる。In this embodiment, at least one common M
Weighting is not performed because the client 10 assigned D is included in the same community. If you need to limit the scope of your community by gender, age, or place of origin, do not target
Weighting may be performed, such as excluding a user over the age of 0) by multiplying by zero. However, since the information search device 100 can also perform a search using a normal keyword, if necessary, the client 10 can use a key input to select “women only”. You can enter search conditions to narrow the scope of the community. Further, the range of the “common MD” can be freely set by the system administrator and / or the client 10. For example, the MD of “high” and the MD of “somewhat high” may be regarded as common or not. The predetermined number in step 2304 can also be freely set by the system administrator and / or the client 10. If the predetermined number is one, one other client 10
Search.

【００７６】次に、クライアント１０が他のクライアン
ト又はコミュニティを検索する別の方法について説明す
る。まず、クライアント１０は、自分のプラットフォー
ムとブラウザを介して情報検索装置１００にアクセスし
て、ユーザ名やパスワード等の入力後に、情報検索装置
１００の図示しない検索ボタンをダブルクリックする。
その後、図示しない検索画面において「他のユーザ又は
コミュニティ」の検索をダブルクリックする。その後、
図示しないユーザ検索画面において、クライアント１０
０は、自己が選択したＵＲＬ、当該ＵＲＬを格納したフ
ォルダ（サンプルフォルダ）、その他のデータ（希望す
るユーザの性別、名前、職業、住所、ＭＤその他のキー
ワード、画像及び／又は音声データ、ＵＩＤ等）を問い
合わせ（クエリー）データとして入力する。本実施例で
は、クライアント１０は、音楽バンドを組むために同一
の感性を有する他のユーザ１０ａを検索するものとす
る。Next, another method in which the client 10 searches for another client or community will be described. First, the client 10 accesses the information search apparatus 100 via its own platform and browser, and after inputting a user name, a password, and the like, double-clicks a search button (not shown) of the information search apparatus 100.
Thereafter, a search for "other users or communities" is double-clicked on a search screen (not shown). afterwards,
In a user search screen (not shown), the client 10
0 is a URL selected by the user, a folder storing the URL (sample folder), other data (gender, name, occupation, address, MD and other keywords of desired user, image and / or audio data, UID, etc.) ) Is input as query data. In this embodiment, the client 10 searches for another user 10a having the same sensitivity to form a music band.

【００７７】これに応答して、ＣＰＵ１１０は、当該Ｕ
ＲＬのページＭＤ２５５、ＭＤリスト２６３又は２７
３、若しくは、サンプルフォルダのＭＤリスト２３４と
同一又は類似するＭＤリスト２３５を有するフォルダを
有する他のユーザ又はコミュニティ、当該ＵＲＬをフォ
ルダに割り当てている他のユーザ又はコミュニティを、
Ｓ−ＵＲＬ−ＩＤ２５２やベクトルの比較などによって
取得する。その後、ＣＰＵ１１０は取得したクライアン
ト１０ａのＵＩＤ、ｅメールアドレスなどをクライアン
ト１０に送信し、これにより、クライアント１０は自分
と同一又は類似の感性を有する他のクライアント１０ａ
を取得する。In response, CPU 110 determines that U
RL page MD255, MD list 263 or 27
3 or another user or community having a folder with an MD list 235 that is the same as or similar to the MD list 234 of the sample folder, or another user or community that has assigned the URL to the folder,
It is obtained by comparing the S-URL-ID 252 and the vector. After that, the CPU 110 transmits the obtained UID, e-mail address, and the like of the client 10a to the client 10, whereby the client 10 has the same or similar sensitivity as the other client 10a.
To get.

【００７８】このように、ＣＰＵ１１０、システム管理
者の主観とは無関係な公平無私な事項（例えば、辞書、
ＵＲＬのＩＤ、ユーザによるＵＲＬに対するＭＤの割り
当て）を基準として類否判断を行っている。そのため、
類否判断の結果はあらゆるクライアント１０に対して公
平である。また、主観的な感性の類否判断を客観的なベ
クトルを基準に行っているので判断対象の範囲も広く判
断の信頼性も高い。Ｂ．ＵＲＬの検索次に、クライアント１０が所望のコンテンツのＵＲＬを
情報検索装置１００を使用して検索する場合について説
明する。まず、クライアント１０は、自分のプラットフ
ォームとブラウザを介して情報検索装置１００にアクセ
スして、ユーザ名やパスワード等の入力後に、情報検索
装置１００の図示しない検索ボタンをダブルクリックす
る。その後、図示しない検索画面において「ＵＲＬ」の
検索をダブルクリックする。その後、図示しないＵＲＬ
検索画面において、クライアント１００は、自己が選択
したＵＲＬ、当該ＵＲＬを格納したフォルダ（サンプル
フォルダ）、その他のデータ（希望するユーザの性別、
名前、職業、住所、ＭＤその他のキーワード、画像及び
／又は音声データ、ＵＩＤ等）を問い合わせ（クエリ
ー）データとして入力する。As described above, the CPU 110 and fair and selfless matters unrelated to the subjectivity of the system administrator (for example, a dictionary,
The similarity determination is performed based on the URL ID and the user's assignment of the MD to the URL. for that reason,
The result of the similarity judgment is fair for all clients 10. In addition, since the subjective sensibility similarity determination is performed based on an objective vector, the range of the determination target is wide and the reliability of the determination is high. B. Next, a case where the client 10 searches for the URL of the desired content using the information search device 100 will be described. First, the client 10 accesses the information search apparatus 100 via its own platform and browser, and after inputting a user name, a password, and the like, double-clicks a search button (not shown) of the information search apparatus 100. Then, on the search screen (not shown), the search for “URL” is double-clicked. Then, a URL (not shown)
On the search screen, the client 100 selects the URL selected by itself, a folder storing the URL (sample folder), and other data (gender of a desired user,
Name, occupation, address, MD, other keywords, image and / or audio data, UID, etc.) are input as query data.

【００７９】これに応答して、ＣＰＵ１１０は、当該Ｕ
ＲＬのページＭＤ２５５、ＭＤリスト２６３又は２７
３、若しくは、サンプルフォルダのＭＤリスト２３４と
同一又は類似するページＭＤ２５５又はＭＤリスト２７
３を有するＵＲＬを必要があれば両者のベクトルを比較
することによって取得する。その後、ＣＰＵ１１０は取
得したＵＲＬ２をクライアント１０に送信し、これによ
り、クライアント１０は自己が所望する可能性の高いコ
ンテンツを指す別のＵＲＬ２を取得する。In response, CPU 110 determines that U
RL page MD255, MD list 263 or 27
3 or a page MD255 or MD list 27 that is the same as or similar to the MD list 234 of the sample folder
If necessary, a URL having 3 is obtained by comparing the two vectors. After that, the CPU 110 transmits the obtained URL2 to the client 10, whereby the client 10 obtains another URL2 pointing to the content which is likely to be desired by itself.

【００８０】このように、ＣＰＵ１１０、システム管理
者の主観とは無関係な公平無私な事項（例えば、辞書や
ＭＤ）を基準として類否判断を行っている。そのため、
類否判断の結果はあらゆるクライアント１０に対して公
平である。また、主観的な感性の類否判断を客観的なベ
クトルを基準に行っているので判断対象の範囲も広く判
断の信頼性も高い。As described above, the similarity is determined based on fair and selfless matters (for example, dictionaries and MDs) irrelevant to the subjectivity of the CPU 110 and the system administrator. for that reason,
The result of the similarity judgment is fair for all clients 10. In addition, since the subjective sensibility similarity determination is performed based on an objective vector, the range of the determination target is wide and the reliability of the determination is high.

【００８１】以上、本発明の好ましい実施例を説明した
が、本発明はその要旨の範囲内で様々な変形や変更が可
能である。While the preferred embodiment of the present invention has been described above, the present invention can be variously modified or changed within the scope of the gist.

【００８２】[0082]

【発明の効果】本発明の検索用情報の生成方法、並び
に、情報検索装置及び方法は、複数のユーザが割り当て
た複数の記述子の中からコンテンツを表現する記述子を
決定するため、情報検索を行うユーザ総体の意見を反映
すると共に客観的である。従って、かかる記述子は信頼
性の高い情報検索を提供することができる。また、デー
タを特徴付ける記述子を、辞書を参照してベクトル化し
ているので、管理者の主観を用いるよりも公平無私であ
る。さらに本発明によれば、記述子の割り当て作業の結
果に基づき、感性や認識が居通または類似するユーザグ
ループを同定することができる。According to the method for generating search information and the information search apparatus and method of the present invention, information retrieval is performed to determine a descriptor representing a content from a plurality of descriptors assigned by a plurality of users. Is objective and reflects the opinion of the entire user who performs Thus, such a descriptor can provide a reliable information search. In addition, since descriptors characterizing data are vectorized with reference to dictionaries, the method is fairer than using the subjectivity of an administrator. Further, according to the present invention, it is possible to identify a user group whose sensibility and recognition are common or similar based on the result of the assignment work of the descriptor.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の一側面としての情報検索システムの
システム構成図である。FIG. 1 is a system configuration diagram of an information search system according to one aspect of the present invention.

【図２】図１に示す情報検索システムの情報検索装置
に使用されるデータ記憶装置の構造を示すブロック図で
ある。FIG. 2 is a block diagram showing a structure of a data storage device used in the information search device of the information search system shown in FIG.

【図３】図２に示すデータ記憶装置のユーザ管理デー
タベースに格納されるユーザデータテーブルの一例であ
る。FIG. 3 is an example of a user data table stored in a user management database of the data storage device shown in FIG. 2;

【図４】図２に示すデータ記憶装置のメタデータ（Ｍ
Ｄ）管理データベースに格納されるメタデータエントリ
テーブルの一例である。4 is a diagram illustrating metadata (M) of the data storage device shown in FIG. 2;
D) An example of a metadata entry table stored in the management database.

【図５】図２に示すデータ記憶装置のフォルダ管理デ
ータベースに格納されるフォルダデータテーブルの一例
である。FIG. 5 is an example of a folder data table stored in a folder management database of the data storage device shown in FIG. 2;

【図６】図２に示すデータ記憶装置のユーザＵＲＬ
（Ｕ−ＵＲＬ）管理データベースに格納されるＵ−ＵＲ
Ｌデータテーブルの一例である。FIG. 6 is a user URL of the data storage device shown in FIG. 2;
(U-URL) U-UR stored in the management database
It is an example of an L data table.

【図７】図２に示すデータ記憶装置のシステムＵＲＬ
（Ｓ−ＵＲＬ）管理データベースに格納されるＳ−ＵＲ
Ｌデータテーブルの一例である。FIG. 7 is a system URL of the data storage device shown in FIG. 2;
(S-URL) S-UR stored in the management database
It is an example of an L data table.

【図８】図２に示すデータ記憶装置のＵ−ＵＲＬ−Ｍ
Ｄ管理データベースに格納されるＵ−ＵＲＬ−ＭＤ管理
テーブルの一例を示す。FIG. 8 is a U-URL-M of the data storage device shown in FIG. 2;
4 shows an example of a U-URL-MD management table stored in a D management database.

【図９】図２に示すデータ記憶装置のＳ−ＵＲＬ−Ｍ
Ｄ管理データベースに格納されるＳ−ＵＲＬ−ＭＤ管理
テーブルの一例を示す。9 is an S-URL-M of the data storage device shown in FIG.
4 shows an example of an S-URL-MD management table stored in a D management database.

【図１０】図２に示すデータ記憶装置の特定ＵＲＬに
対するユーザ別ＭＤ管理データベースに格納されるテー
ブルの一例を示す。10 shows an example of a table stored in a user-specific MD management database for a specific URL in the data storage device shown in FIG. 2;

【図１１】図１に示す複数のクライアントと情報検索
装置との間で行われる、本発明の特定のＵＲＬに対する
ＭＤの割り当て方法の一例を示すフローチャートであ
る。11 is a flowchart illustrating an example of a method of allocating an MD to a specific URL according to the present invention, which is performed between a plurality of clients illustrated in FIG. 1 and an information search device.

【図１２】図１０に示すテーブルを集合として表した
図である。12 is a diagram showing the table shown in FIG. 10 as a set.

【図１３】図１に示すクライアントと情報検索装置と
の間で行われる、本発明の特定のフォルダに対するメタ
データの割り当て方法の一例を示すフローチャートであ
る。13 is a flowchart illustrating an example of a method of assigning metadata to a specific folder according to the present invention, which is performed between the client illustrated in FIG. 1 and the information search device.

【図１４】情報検索装置によって行われる、本発明の
特定のＵＲＬのベクトル化方法の一例を示すフローチャ
ートである。FIG. 14 is a flowchart illustrating an example of a method of vectorizing a specific URL according to the present invention, which is performed by the information search device.

【符号の説明】[Explanation of symbols]

１情報検索システム１００情報検索装置１１０ＣＰＵ２００データ記憶装置２１０ユーザ管理データベース２２０ＭＤ管理データベース２３０フォルダ管理データベース２４０Ｕ−ＵＲＬ管理データベース２５０Ｓ-ＵＲＬ管理データベース２６０Ｕ−ＵＲＬ−ＭＤ管理データベース２７０Ｓ−ＵＲＬ−ＭＤ管理データベース２８０特定ＵＲＬに対するユーザ別ＭＤ管理
データベース２９０辞書管理データベース1 Information Retrieval System 100 Information Retrieval Device 110 CPU 200 Data Storage Device 210 User Management Database 220 MD Management Database 230 Folder Management Database 240 U-URL Management Database 250 S-URL Management Database 260 U-URL-MD Management Database 270 S-URL -MD management database 280 User-specific MD management database 290 for specific URL Dictionary management database

───────────────────────────────────────────────────── フロントページの続き (72)発明者清木康茨城県土浦市永国1148−24 Ｆターム(参考） 5B075 NK02 NR06 NR12 ────────────────────────────────────────────────── ─── Continued on the front page (72) Inventor Yasushi Kiyoshi 1148-24 Eikuni, Tsuchiura-shi, Ibaraki F-term (reference) 5B075 NK02 NR06 NR12

Claims

【特許請求の範囲】[Claims]

【請求項１】特定のコンテンツを指す特定のアドレス
に対して、複数の既定の記述子が各々与えられた複数の
ユーザが、前記コンテンツを表現するために割り当てた
前記複数の記述子のうちの一以上の記述子を前記ユーザ
から受け取るステップと、前記割り当てられた記述子に基づいて前記アドレスの記
述子を決定するステップとを有する検索用情報の生成方
法。1. For a specific address pointing to specific content, a plurality of users each given a plurality of predetermined descriptors, wherein the plurality of descriptors are assigned to represent the content. A method for generating search information, comprising: receiving one or more descriptors from the user; and determining a descriptor of the address based on the assigned descriptor.

【請求項２】前記受け取りステップは、それぞれ一以
上の前記記述子によって特徴付けられて特定の概念に対
応するフォルダを前記ユーザから受け取ることによって
前記記述子を受け取る請求項１記載の方法。2. The method of claim 1, wherein the receiving step receives the descriptor by receiving from the user a folder each characterized by one or more of the descriptors and corresponding to a particular concept.

【請求項３】前記決定ステップは、多くのユーザが共
通に割り当てた前記記述子を所定数だけ前記アドレスの
前記記述子に選択する請求項１記載の方法。3. The method according to claim 1, wherein the determining step selects a predetermined number of the descriptors commonly assigned by many users as the descriptors at the address.

【請求項４】前記方法は、前記割り当てられた記述子
を重み付けして評価付けするステップを更に有し、前記決定ステップは、評価の高い前記記述子を所定数だ
け前記アドレスの前記記述子に選択する請求項１記載の
方法。4. The method further comprises the step of weighing and assigning the assigned descriptor, and wherein the determining step includes a predetermined number of the highly evaluated descriptors being assigned to the descriptor at the address. The method of claim 1, wherein said method is selected.

【請求項５】前記記述子を所定の辞書を用いてベクト
ル化するステップと、前記アドレスに割り当てられた全ての記述子のベクトル
群を合成して前記アドレスのベクトルを生成するステッ
プとを更に有する請求項１記載の方法。5. The method according to claim 1, further comprising the steps of: vectorizing the descriptor using a predetermined dictionary; and generating a vector of the address by combining a vector group of all the descriptors assigned to the address. The method of claim 1.

【請求項６】複数の既定の記述子が与えられ、情報検
索装置にネットワーク接続された複数のユーザが、特定
のコンテンツを指す特定のアドレスに対して、前記コン
テンツを表現するために前記記述子を割り当てて前記情
報検索装置に送信するステップと、前記情報検索装置が、前記割り当てられた記述子に基づ
いて前記アドレスの記述子を決定するステップとを有す
る検索用情報の生成方法。6. A system in which a plurality of predetermined descriptors are provided, and a plurality of users network-connected to the information retrieval apparatus are configured to express the descriptor in order to express the content to a specific address pointing to specific content. And transmitting the information to the information search device; and the information search device determining a descriptor of the address based on the assigned descriptor.

【請求項７】特定のコンテンツを指す特定のアドレス
に対して、複数の既定の記述子が与えられた複数のユー
ザが、前記コンテンツを表現するために割り当てた前記
複数の記述子のうちの一以上の記述子を前記ユーザから
受け取るステップと、前記複数のユーザの中で、前記アドレスに対して、類似
する記述子を割り当てたユーザをグループ化するステッ
プとを有する検索用情報の生成方法。7. For a specific address pointing to specific content, a plurality of users given a plurality of predetermined descriptors may select one of the plurality of descriptors assigned to represent the content. A method for generating search information, comprising: receiving the above descriptor from the user; and grouping, among the plurality of users, users who have assigned similar descriptors to the address.

【請求項８】複数の既定の記述子が与えられ、情報検
索装置にネットワーク接続された複数のユーザが、特定
のコンテンツを指す特定のアドレスに対して、前記コン
テンツを表現するために前記記述子を割り当てて前記情
報検索装置に送信するステップと、前記情報検索装置が、前記複数のユーザの中で、前記ア
ドレスに対して、共通する記述子を割り当てたユーザを
グループ化するステップとを有する検索用情報の生成方
法。8. A system in which a plurality of predetermined descriptors are provided, and a plurality of users network-connected to the information retrieval apparatus are configured to express the content in order to express the content to a specific address pointing to a specific content. And transmitting the information to the information search device, and the information search device groups, among the plurality of users, users who have assigned a common descriptor to the address. How to generate usage information.

【請求項９】特定の概念に対応するフォルダに対して
複数のユーザによって割り当てられ、当該フォルダを特
徴付ける一以上の記述子を前記ユーザから受け取るステ
ップと、前記割り当てられた記述子に基づいて前記フォルダの記
述子を決定するステップとを有する検索用情報の生成方
法。9. A method for receiving one or more descriptors from a user assigned to a folder corresponding to a particular concept by the plurality of users and characterizing the folder, the folder based on the assigned descriptor. Determining a descriptor of the search information.

【請求項１０】前記決定ステップは、多くの利用者が
共通に割り当てた前記記述子を所定数だけ前記フォルダ
の前記記述子に選択する請求項９記載の方法。10. The method according to claim 9, wherein the determining step selects a predetermined number of the descriptors commonly assigned by many users as the descriptors of the folder.

【請求項１１】前記方法は、前記割り当てられた記述
子を重み付けして評価付けするステップを更に有し、前記決定ステップは、評価の高い記述子を所定数だけ前
記フォルダの前記記述子に選択する請求項９記載の方
法。11. The method further comprises the step of weighting and assigning the assigned descriptors, wherein the determining step selects a predetermined number of highly evaluated descriptors as the descriptors of the folder. 10. The method of claim 9, wherein the method comprises:

【請求項１２】情報検索装置にネットワーク接続され
た複数のユーザが、特定の概念に対応するフォルダに当
該フォルダを特徴付ける一以上の記述子を割り当てて前
記情報検索装置に送信するステップと、前記情報検索装置が、前記割り当てられた記述子に基づ
いて前記フォルダの記述子を決定するステップとを有す
る検索用情報の生成方法。12. A method in which a plurality of users network-connected to the information search device allocate one or more descriptors characterizing the folder to a folder corresponding to a specific concept and transmit the information to the information search device; A search device determining a descriptor of the folder based on the assigned descriptor.

【請求項１３】特定のコンテンツを指すアドレスを複
数含む第１の集合と、複数のユーザの各々が予め作成し
たアドレスを複数含む第２の集合とを比較するステップ
と、前記複数のユーザのうち、前記第１の集合に含まれる前
記アドレスを前記第２の集合に多く含むユーザを所定数
だけ選択するステップとを有する情報検索方法。13. A step of comparing a first set including a plurality of addresses indicating specific contents with a second set including a plurality of addresses previously created by each of a plurality of users; And selecting a predetermined number of users who include a large number of the addresses included in the first set in the second set.

【請求項１４】前記選択ステップは、前記第１の集合
と前記第２の集合との間の共通性を判断する際に重み付
けして評価付けするステップを含む請求項１３記載の方
法。14. The method of claim 13, wherein said selecting step includes the step of weighting and evaluating when determining commonality between said first set and said second set.

【請求項１５】情報検索装置が、特定のコンテンツを
指すアドレスが複数含まれた集合を、前記情報検索装置
にネットワーク接続された複数のユーザの各々から受け
取るステップと、前記情報検索装置が、前記集合を互いに比較し、前記複
数のユーザのうち、共通する前記アドレスを多く含む集
合に対応するユーザを所定数だけ選択するステップとを
有する情報検索方法。15. The information search device receives, from each of a plurality of users network-connected to the information search device, a set including a plurality of addresses indicating specific contents, the information search device comprising: Comparing sets with each other and selecting a predetermined number of users corresponding to the set including many common addresses among the plurality of users.

【請求項１６】ネットワーク接続されたユーザから受
け取った、一以上の記述子によって特徴付けられる問い
合わせデータに対応する情報を提供する情報検索装置で
あって、前記記述子をベクトル化する辞書を記憶するメモリと、前記問い合わせデータを特徴付ける一以上の前記記述子
に対応する第１のベクトルを前記メモリを参照して決定
すると共に、当該第１のベクトルと同一又は類似する第
２のベクトルを有する前記情報を決定する処理装置とを
有する情報検索装置。16. An information retrieval apparatus for providing information corresponding to query data received from a networked user and characterized by one or more descriptors, wherein the information retrieval apparatus stores a dictionary for vectorizing the descriptors. A memory, a first vector corresponding to one or more of the descriptors characterizing the inquiry data is determined with reference to the memory, and the information having a second vector that is the same as or similar to the first vector. And a processing device for determining the information retrieval.

【請求項１７】前記問い合わせデータは前記記述子を
複数有し、前記処理装置は、前記複数の記述子のそれぞれをベクト
ル化して合成することによって前記第１のベクトルを決
定する請求項１６記載の情報検索装置。17. The query data according to claim 16, wherein the inquiry data has a plurality of the descriptors, and the processing device determines the first vector by vectorizing and combining each of the plurality of descriptors. Information retrieval device.

【請求項１８】前記情報は、前記ユーザ及び前記情報
検索装置に接続されているネットワークのアドレスを含
む請求項１６記載の情報検索装置。18. The information search device according to claim 16, wherein the information includes an address of a network connected to the user and the information search device.

【請求項１９】特定のコンテンツを指す特定のアドレ
スに対して複数のユーザが前記コンテンツを表現するた
めに割り当てた複数の既定の記述子と、前記複数のユー
ザとを記憶するメモリと、前記複数のユーザの中で、前記アドレスに対して、共通
する記述子を割り当てたユーザをグループ化する処理装
置とを有する情報検索装置。19. A memory storing a plurality of predetermined descriptors assigned by a plurality of users to represent the content to a specific address pointing to a specific content; a memory storing the plurality of users; And a processing device for grouping, among the users, users who have assigned a common descriptor to the address.