CN104484379A - Method and device for determining relation among musical entities and inquiry processing method and device - Google Patents

Method and device for determining relation among musical entities and inquiry processing method and device Download PDF

Info

Publication number
CN104484379A
CN104484379A CN201410749432.2A CN201410749432A CN104484379A CN 104484379 A CN104484379 A CN 104484379A CN 201410749432 A CN201410749432 A CN 201410749432A CN 104484379 A CN104484379 A CN 104484379A
Authority
CN
China
Prior art keywords
music
information
music property
webpage
relation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410749432.2A
Other languages
Chinese (zh)
Other versions
CN104484379B (en
Inventor
雷小强
田振雷
王森
鲁晓莹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201410749432.2A priority Critical patent/CN104484379B/en
Publication of CN104484379A publication Critical patent/CN104484379A/en
Application granted granted Critical
Publication of CN104484379B publication Critical patent/CN104484379B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the invention provides a method and a device for determining relation among musical entities. The method includes acquiring an original webpage from a network side; extracting a first webpage from the original webpage, wherein the first webpage refers to a webpage including musical relation keywords; judging whether a sample sentence exists in the first webpage, wherein the sample sentence refers to a sentence including at least two musical entities; determining the relation among the musical entities by performing semantic analysis on the sample sentence. By the method, determination of the relation among the musical entities is realized.

Description

Determine the method and apparatus of music property relation and inquiry processing method and device
Technical field
The embodiment of the present invention relates to information search technique field, particularly relate to a kind of determine music property relation method and apparatus and inquiry processing method and device.
Background technology
Along with the development of search engine technique, people are also got more and more by the demand of search engine search music.
In prior art, search engine only provides the list of each version for the search of music, and do not provide the relation between each version, as which is original singer, that turns over to sing.Wherein, turn over to sing and be mainly divided into two types: 1) lyrics are consistent with the music score of Chinese operas, but singer is different, and this situation often different singer has sung same a piece of music in the different periods, thus creates multiple different version; 2) music score of Chinese operas is identical, but the lyrics are different, this situation is the music score of Chinese operas that domestic singer uses the music of external singer greatly, the version formed through again composing a poem to a given tune of ci, especially a lot of classical music is all use the external music score of Chinese operas, then again composes a poem to a given tune of ci and renames and sing.
The search engine of current comparatively main flow is as Baidu, and 360 grades all provide the function of music searching, all simply show list related.For Fig. 1, user's query music in Baidu's search engine " is asked ", wherein " asks " as musical designation, simply show the list of each version that music " is asked " in the Search Results that Baidu's search engine represents.
Summary of the invention
The embodiment of the present invention provides a kind of method and apparatus determining music property relation, to determine the relation between different music property.
The embodiment of the present invention also provides a kind of inquiry processing method and device, with when a certain music of user search, the relation between the music of different editions is also supplied to user.
First aspect, embodiments provides a kind of method determining music property relation, comprising:
Original web page is obtained from network side;
From described original web page, extract the first webpage, described first webpage is the webpage including musical relations keyword;
Judge whether there is sample sentence in described first webpage, described sample sentence is for including the sentence of the information of at least two music property;
By carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
Second aspect, embodiments provides a kind of device determining music property relation, comprising:
Webpage acquisition module, for obtaining original web page from network side;
First extraction module, for extracting the first webpage from described original web page, described first webpage is the webpage including musical relations keyword;
Sample judge module, for judging whether there is sample sentence in described first webpage, described sample sentence is for including the sentence of the information of at least two music property;
Relationship determination module, for by carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
The embodiment of the present invention provides the method and apparatus of the happy entity relationship of accordatura really, after acquisition original web page, by musical relations keyword, original web page is filtered, obtain the webpage including musical relations keyword, by the information of music property, the sentence obtaining the information including different music property can be identified from the webpage including musical relations keyword, and analyzed by sentence semantics, from include different music property information sentence excavate the relation obtained between corresponding music property, namely the relation between different music property can be determined, relation between the music can determining different editions.
The third aspect, embodiments provides a kind of inquiry processing method, comprising:
Receive problem to be retrieved;
When including music information in described problem, from music property relation storehouse, search corresponding music property relation information according to described music information, wherein, music property relation stock contains the relation information between music property data and music property;
Return the music property relation information of described correspondence.
Fourth aspect, embodiments provides a kind of query processing device, comprising:
Problem receiver module, for receiving problem to be retrieved;
Relation searches module, for when including music information in described problem, from music property relation storehouse, search corresponding music property relation information according to described music information, wherein, music property relation stock contains the relation information between music property data and music property;
Return module, for returning the music property relation information of described correspondence.
The inquiry processing method that the embodiment of the present invention provides and device, after receiving problem to be retrieved, when including music information in described problem, including the relation information between the music property adopting any embodiment of the present invention to provide the method for the happy entity relationship of accordatura really to be formed, and mate the music information comprised in described problem in the music property relation storehouse including music property data, the music property relation information corresponding with the music information in described problem can be provided.
Accompanying drawing explanation
In order to be illustrated more clearly in the present invention, introduce doing one to the accompanying drawing used required in the present invention simply below, apparently, accompanying drawing in the following describes is some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 be in prior art user in a search engine query music time, the Search Results schematic diagram that search engine represents;
A kind of process flow diagram determining the method for music property relation that Fig. 2 a provides for the embodiment of the present invention one;
Fig. 2 b provides the process flow diagram extracting the first webpage in the method for the happy entity relationship of accordatura really from described original web page for the embodiment of the present invention;
Fig. 3 a provides in the method for the happy entity relationship of accordatura really the process flow diagram judging whether to there is sample sentence in described first webpage for the embodiment of the present invention;
A kind of process flow diagram setting up music libraries that Fig. 3 b provides for the embodiment of the present invention two;
A kind of structural representation determining the device of music property relation that Fig. 4 provides for the embodiment of the present invention three;
The process flow diagram of a kind of inquiry processing method that Fig. 5 a provides for the embodiment of the present invention four;
A kind of Search Results exploded view that Fig. 5 b provides for the embodiment of the present invention four;
The another kind of Search Results exploded view that Fig. 5 c provides for the embodiment of the present invention four;
Another Search Results exploded view that Fig. 5 d provides for the embodiment of the present invention four;
Another Search Results exploded view that Fig. 5 e provides for the embodiment of the present invention four;
Another Search Results exploded view that Fig. 5 f provides for the embodiment of the present invention four;
The structural representation of a kind of query processing device that Fig. 6 provides for the embodiment of the present invention five.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, be described in further detail the technical scheme in the embodiment of the present invention below in conjunction with accompanying drawing, obviously, described embodiment is the present invention's part embodiment, instead of whole embodiments.Be understandable that; specific embodiment described herein is only for explaining the present invention; but not limitation of the invention; based on the embodiment in the present invention; those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.It also should be noted that, for convenience of description, illustrate only part related to the present invention in accompanying drawing but not full content.
Embodiment one
The present embodiment provides the method for the happy entity relationship of accordatura really can be performed by the device being configured to hardware and/or the software simulating happy entity relationship of accordatura really, and this implement device is typically configured in the system such as search engine that music searching can be provided to serve.
Refer to Fig. 2 a, what the present embodiment provided a kind ofly determines that the method for music property relation comprises: operation 210 ~ operation 240.
In operation 210, obtain original web page from network side.
In operation 220, from described original web page, extract the first webpage, described first webpage is the webpage including musical relations keyword.
Wherein, musical relations corresponds to multiple music property, and the relation between multiple music property is musical relations.
Relation between music property belongs to the wherein one in problem answers class.Concrete, the retrieval type including music property information that problem correspondence is inputted by search engine by user, the relation between the music property represented in the corresponding Search Results of answer.
User is when the answer finding certain problem; usual meeting is directly searched in a search engine; but this original web page of the Search Results that search engine provides cannot meet the search need of user, also namely Search Results directly can not represent answer corresponding to problem usually.The webpage of existing a lot of question and answer type (such as, Baidu is known, Sina likes to ask) and professional forum comprise the problem of user's proposition usually, and the answer corresponding with problem that other users provide.
Similarly, for the relation between music property, when a lot of user directly cannot find the relation between music property from search engine, be also put question to by delivering the modes such as model, demand is in the answer of other users.Therefore, in the webpage that the relation between most music property is all present in question and answer type and forum Web pages, the webpage of this two type known all includes musical relations keyword.
Therefore, this operation specifically utilizes musical relations keyword to filter original web page, obtains the webpage including musical relations keyword.
In operation 230, judge whether there is sample sentence in described first webpage, described sample sentence is for including the sentence of the information of at least two music property.
In this operation, the sentence that can comprise described first webpage carries out described judgement operation respectively.
Music property refers to music object itself, and form can be * .mp3, * .mp4, * .wma or * .wav etc., and wherein, * represents arbitrary string, is generally musical designation.
The information of described music property can comprise at least one information in musical designation, bent author, word author, singer and player.
Because the information of music property can have various ways, the sentence including the information of at least two music property accordingly can have various ways, also namely sample sentence can have various ways, can be the sentence of the musical designation including at least two music property, can also be include the musical designation of at least two music property and the sentence of singer.Like this, being sung for turning over the different music property that still musical designation is constant, can be able to distinguish by adding singer.Sample sentence can also be comprise other information of music property or the sentence of information combination, and the present embodiment does not limit this.
In operation 240, by carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
In this operation, can extract the feature of sample sentence, described feature mainly comprises following a few class: 1) grammar property, mainly refers to the interdependent path in the middle of two music property; 2) lexical characteristics, comprise the word of setting quantity and the part of speech of correspondence on the music property left side and the right, this quantity can adjust according to actual needs; 3) other features, mainly comprise the descriptor of the page, the features such as the structural similarity of upper and lower sentence.Based on these features, use machine learning algorithm to train a model, just can determine the relation described in sample sentence between at least two music property.
The technical scheme that the present embodiment provides, after acquisition original web page, by musical relations keyword, original web page is filtered, obtain the webpage including musical relations keyword, and by finding out the sentence of the information including different music property from the webpage including musical relations keyword, carry out semantic analysis, determine the relation between different music property that sentence comprises, thus the relation can excavated between the different music property that provide in network, determine the relation between related music property, and be supplied to user.
From described original web page, extract a kind of preferred implementation of first this operation of webpage, refer to Fig. 2 b, specifically comprise: operation 221 ~ operation 222.
In operation 221, from original web page, identify question and answer webpage and forum Web pages.
As previously mentioned, in the webpage that relation between most music property is all present in question and answer type and forum Web pages, therefore this operation by identifying question and answer webpage and forum Web pages from original web page, can obtain excavating the web page resources needed for relation between music property.
The question and answer webpage provided due to same question and answer website has one or more identical web page templates, therefore the arbitrary question and answer webpage had in question and answer webpage corresponding to certain same web page template can be chosen, utilize the html parser of sing on web kit (browser engine of increasing income) if HTMLParser is to HTML (the Hyper Text Mark-up Language of this question and answer webpage, HTML (Hypertext Markup Language)) source code resolves, obtain DOM (the Document Object Model that meets World Wide Web Consortium (W3C) standard, document dbject model) tree.Wherein, dom tree is a kind of method for expressing of webpage.
Then, that extracts dom tree structure cuts the features such as word feature, structure repeated characteristic and tag attributes semantic feature, machine learning algorithm is utilized to train these features, obtain the model of the question and answer webpage corresponding with this kind of same web page template, then predict with this model, all question and answer webpages corresponding with this kind of same web page template can be identified from original web page.
Similarly, the arbitrary forum Web pages had in forum Web pages corresponding to certain same web page template can be chosen, by resolving the html source code of this forum Web pages, the dom tree corresponding with this forum Web pages can be obtained, that extracts the structure of dom tree structure cuts word feature, the features such as structure repeated characteristic and tag attributes semantic feature, machine learning algorithm is utilized to train these features, obtain the model of the forum Web pages corresponding with this kind of same web page template, then predict with this model, the all forum Web pages corresponding with this kind of same web page template can be identified from original web page.
It should be noted that, original web page is filtered relative to employing keyword, obtain question and answer webpage and forum Web pages, technical difficulty is, because the information category of question and answer webpage and forum Web pages every page is various, therefore be difficult to determine suitable keyword, cause bringing impurity in the webpage screened, the accuracy of filtering question and answer webpage and the forum Web pages obtained is low.And dom tree make use of the general character web page template feature of the webpage that same website provides, such as, structure repeated characteristic and tag attributes semantic feature, to avoid in the webpage making screening because keyword is improper with impurity, improve the accuracy of question and answer webpage and the forum Web pages obtained.
In operation 222, from described question and answer webpage and forum Web pages, search described musical relations keyword, will the question and answer webpage of described musical relations keyword or forum Web pages be included as described first webpage.
It should be noted that, from original web page, identifying question and answer webpage and forum Web pages by operating 221, can obtain excavating the web page resources needed for relation between music property, achieve the filtration from original web page to question and answer webpage and forum Web pages; This operation is the further meticulous screening to the question and answer webpage recognized or forum Web pages, achieves from question and answer webpage or forum Web pages to the fine filtering of webpage including musical relations keyword.
This preferred embodiment, by filtering from original web page to the preliminary identification of question and answer webpage and forum Web pages, can obtain excavating the web page resources needed for relation between music property, utilize musical relations keyword, to the further meticulous screening of the question and answer webpage recognized or forum Web pages, achieve from question and answer webpage or forum Web pages to the fine filtering of webpage including musical relations keyword, navigated to the web page resources including musical relations keyword exactly.
After operation 240, the method for the determination application entity relation that the embodiment of the present invention provides can also comprise:
Sing for turning between described at least two music property, original singer, reorganization or original work bent relation time, by the solid data of described at least two music property and relation information corresponding stored, set up musical relations storehouse.
With original singer with turn over that to sing pass be that example is described.Suppose that first music property is " asking " this first song that Liang Jingru sings, the information of this music property comprises musical designation and " asks " and singer " Liang Jingru "; Second music property is " the asking " that Chen Shuhua sings, and the information of this music property comprises musical designation and " asks " and singer " Chen Shuhua "; 3rd music property is " the asking " that Lin Yilian sings, and the information of this music property comprises musical designation and " asks " and singer " Lin Yilian ".Identify from the webpage including musical relations keyword " original singer " and obtain puing question to sentence " original singer that song is asked " and answering sentence " Chen Shuhua ", and analyzed by sentence semantics, thus the excavation pass obtained between these three music property is: " asking " that Chen Shuhua sings is the original singer of " asking ", the turning over of " asking " that " asking " that other people sing sings for Chen Shuhua is sung, thus " ask .mp3 " by solid data corresponding for these three music property, " ask .wma " and " asking .wma " and relation information " " asking " that Chen Shuhua sings is original singer " corresponding stored, join in musical relations storehouse.Wherein, the form of list or graph of a relation can be adopted to carry out corresponding stored.
It should be noted that, in musical relations storehouse, the solid data of music property can contain a large amount of existing music property.
The musical relations storehouse that present embodiment provides can be configured in special music application, and also can be configured in search engine, present embodiment does not limit this.
Embodiment two
The present embodiment, on the basis of above-described embodiment, provides the preferred version judging whether to there is this operation of sample sentence in described first webpage.
Refer to Fig. 3 a, the present embodiment provides in the method for the happy entity relationship of accordatura really the flow process judging whether to exist in described first webpage sample sentence specifically to comprise: operation 3a1 ~ operation 3a3.
In operation 3a1, be sentence by the text dividing in described first webpage.
In this operation, if the rule of cutting sentence can not take full line for the right side of a line text, then this style of writing this export as a sentence; Also using punctuation works symbol if fullstop, branch and exclamation etc. are as the cutting segmentation symbol of sentence, text sentence cutting can be carried out.
In operation 3a2, mated with the music property in music libraries by described sentence, wherein, music libraries stores music property data and music property information.
Here multimode matching algorithm can be used to be mated with the music property in music libraries by sentence.Wherein, music property data refer to music object itself, and form can be * .mp3, * .mp4, * .wma or * .wav etc., and wherein, * represents arbitrary string, is generally musical designation.
Described music property information can comprise at least one information in musical designation, bent author, word author, singer and player.
In operation 3a3, when described sentence matches at least two music property, judge to there is sample sentence in described first webpage.
Such as, the sentence simultaneously including two music property information directly can match two music property in music libraries; Sentence also can indirect matching at least two music property, such as, include the sentence of the word such as " original singer ", " turn over and sing " or " reorganization ", only give one of them music property information.
The technical scheme of the present embodiment, after extract the webpage including musical relations keyword from original web page, because music property relation is encompassed in the sentence of the webpage including musical relations keyword, therefore by carrying out sentence cutting to the text in the webpage extracted, in units of sentence, utilize each sentence of music property information matches in music libraries, the sentence of the information including at least two music property can be obtained, navigated to the sentence resource including music property relation exactly.
The foundation of above-mentioned music libraries refers to Fig. 3 b, specifically comprises: operation 3b1 ~ operation 3b3.
In operation 3b1, from described original web page, extract music property data and information.
In internet, music property data and information spinner will provide on the website of music service various with the formal distribution of structural data, the webpage provided due to same website has one or more identical web page templates, therefore for each website, the arbitrary webpage had in webpage corresponding to certain same web page template of this website can be chosen, utilize the html source code of the HTMLParser of sing on web kit to this webpage to resolve, obtain the dom tree that meets W3C standard.Utilize dom tree to obtain the xpath set of specific data sets of fields, specific data field can be singer, musical designation, word author or bent author etc., thus can music property data in webpage corresponding to quick position this kind of same web page template and information.
Specifically first can obtain the set of the web page template that same website provides, the i.e. set of dom tree, and from web page template set, search the dom tree obtaining mating with the current web page of this website, by traveling through this dom tree, can splice the xpath of dom node in ergodic process, by comparison xpath, music property data and information can be obtained.
In operation 3b2, duplicate removal and fusion treatment are carried out to the described music property data extracted and information, obtains solid data and the entity information of each music property.
In internet, music property data and information spinner will provide in the webpage of the website of music service various with the formal distribution of structural data, an independent website is not had to cover all music property data and information, and have overlap between the music property data that provide of each website and information, therefore after extracting music property data and information, need to carry out described duplicate removal and fusion treatment, to make, the solid data of each music property obtained and entity information are not heavy not to be leaked, improve information integrity, and reduce information redundance.
Particularly, for the same music of different website, when the entity information of music is identical, select arbitrarily the entity information of a piece of music solid data and correspondence; When the entity information of music is incomplete same, get the entity information of the higher webpage music entity information of site quality as this music property.When the entity information of music has disappearance, then supplemented by the value of the identical entity information name of other site page.
In operation 3b3, set up the index of described each music property, generate described music libraries.
Present embodiment, music property data and information is extracted by the original web page that gets from network side, because music property data and information spinner will provide in the webpage of the website of music service various with the formal distribution of structural data, an independent website is not had to cover all music property data and information, and have overlap between the music property data that provide of each website and information, therefore after extracting music property data and information, by duplicate removal and fusion treatment, the solid data of each music property in the music libraries obtained and entity information are not weighed do not leak, improve information integrity, and reduce information redundance.
Embodiment three
Refer to Fig. 4, what the present embodiment provided a kind ofly determines that the device of music property relation comprises: webpage acquisition module 410, first extraction module 420, sample judge module 430 and relationship determination module 440.
Wherein, webpage acquisition module 410 is for obtaining original web page from network side; First extraction module 420 for extracting the first webpage from described original web page, and described first webpage is the webpage including musical relations keyword; Sample judge module 430 is for judging whether there is sample sentence in described first webpage, and described sample sentence is for including the sentence of the information of at least two music property; Relationship determination module 440 for by carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
The technical scheme of the present embodiment, after acquisition original web page, by musical relations keyword, original web page is filtered, obtain the webpage including musical relations keyword, by the information of music property, the sentence obtaining the information including different music property can be identified from the webpage including musical relations keyword, and be analyzed by sentence semantics, thus can from include different music property information sentence excavate the relation obtained between corresponding music property.
In such scheme, described first extraction module 420 specifically may be used for:
Question and answer webpage and forum Web pages is identified from original web page;
From described question and answer webpage and forum Web pages, search described musical relations keyword, will the question and answer webpage of described musical relations keyword or forum Web pages be included as described first webpage.
In such scheme, described sample judge module 430 specifically may be used for:
Be sentence by the text dividing in described first webpage;
Mated with the music property in music libraries by described sentence, wherein, described music libraries stores music property data and music property information;
When described sentence matches at least two music property, judge to there is sample sentence in described first webpage.
In such scheme, described device can also comprise: the second extraction module, data processing module and first set up module.
Wherein, the second extraction module is used for extracting music property data and information from described original web page; Data processing module is used for carrying out duplicate removal and fusion treatment to the described music property data extracted and information, obtains solid data and the entity information of each music property; First sets up module for setting up the index of described each music property, generates described music libraries.
In such scheme, described device can also comprise: second sets up module, for singing for turning between at least two music property described in determining when described relationship determination module, original singer, reorganization or original work bent relation time, by the solid data of described at least two music property and relation information corresponding stored, set up musical relations storehouse.
The embodiment of the present invention provide the device of the happy entity relationship of accordatura really can perform any embodiment of the present invention the method for the happy entity relationship of accordatura is really provided, possess the corresponding function module and the beneficial effect of manner of execution.
Embodiment four
The method of the embodiment of the present invention can be performed by the query processing device being configured to hardware and/or software simulating, and this implement device is typically configured in the system such as search engine that music searching can be provided to serve.
Refer to Fig. 5 a, a kind of inquiry processing method that the present embodiment provides comprises: operation 510 ~ operation 530.
In operation 510, receive problem to be retrieved.
Such as, after terminal receives the problem that user inputs in the search interactive interface of browser, generating messages sends to server end or search engine, and server or search engine receive problem to be retrieved.Wherein, problem to be retrieved is carried in the message of transmission.
In operation 520, when including music information in described problem, from music property relation storehouse, search corresponding music property relation information according to described music information, wherein, music property relation stock contains the relation information between music property data and music property.
Wherein, the relation in music property storehouse between music property can provide the method for the happy entity relationship of accordatura really to obtain by embodiment one or embodiment two, repeats no more herein.
Wherein, the relations such as the relation between music property can be sung for turning over, original singer, reorganization or original work are bent.
This operation can have numerous embodiments, such as, comprise following wherein a kind of:
According to musical designation search the music property corresponding with described musical designation original singer, turn over sing, the information of the bent or arrangement of original work.
The original work song of the music property corresponding with described musical designation and composer or the information of arrangement is searched according to musical designation and composer.
Search the original singer of the song corresponding with described musical designation and composer according to song title and singer or turn over the information sung.
In operation 530, return the music property relation information of described correspondence.
After the music property relation information returning described correspondence, can also be illustrated in Search Results further, for user provides the music property relation information corresponding with the retrieval type including music information.
After search corresponding music property relation information from music property relation storehouse according to described music information, can also comprise: return the music property data that the music property relation information of described correspondence is corresponding.
Further, the music property relation information of described correspondence can be shown, and the music property data of correspondence.Specifically existing Search Results is carried out assembled with described corresponding music property relation information, show front end user.
The technical scheme of the present embodiment, after receiving problem to be retrieved, when including music information in described problem, including the relation information between the music property adopting any embodiment of the present invention to provide the method for the happy entity relationship of accordatura really to be formed, and mate the music information comprised in described problem in the music property relation storehouse including music property data, the music property relation information corresponding with the music information in described problem can be obtained.
Below in conjunction with Fig. 5 b-Fig. 5 f, different exhibition methods is described respectively.
The first exhibition method, when user directly searches the original singer of music, directly shows that in Search Results corresponding with original singer and user can the card of audition.
In Fig. 5 b, when user is by Baidu's search engine search " the flowers are in blossom for Cai Guoqing journey original singer ", in Search Results, show that musical designation is “ Hua Misaki く trip road ", singer is the audition card of " reason ", user can know that the musical designation associated with the music of search is " road that the flowers are in blossom " intuitively, singer is the music version of " Cai Guoqing " and musical designation is " flower Misaki く trip road ", singer is the relation between the music version of " reason ", specifically, the former is original singer not, the latter is only original singer, illustrate abundant and music property relation accurately, be conducive to Search Results user being guided to user's needs.
The second exhibition method, when user directly search music turn over sing time, directly show in Search Results with turn over sing corresponding and user can the card of audition.This exhibition method is similar with the first exhibition method.
In Fig. 5 c, when user is by Baidu's search engine search " following the turning over of へ of Kiroro is sung ", in Search Results, show that musical designation is " afterwards ", singer is the audition card of " Liu Ruoying ", user can know that the musical designation associated with the music of search is " future " intuitively, singer is the music version of " Kiroro " and musical designation is " afterwards ", singer is the relation between the music version of " Liu Ruoying ", specifically, the latter turns over the former one to sing, illustrate abundant and music property relation accurately, be conducive to Search Results user being guided to user's needs.
The third exhibition method, when certain music of user search contains multiple version, shows each version with list, and identifies original singer in Search Results.Wherein, described multiple version can the lyrics all identical with the music score of Chinese operas, but singer is different, also can the music score of Chinese operas identical, but the lyrics are different, also namely undertaken turning over singing by again composing a poem to a given tune of ci.
In Fig. 5 d, when user is by Baidu's search engine search " asking ", Search Results represents multiple version with tabular form, and original singer's version of music " being asked " directly indicates in Search Results, concrete, the singer that music " is asked " is " Chen Shuhua " corresponding original singer's version, have " original singer " in this version correspondence position mark, user can be known and the relation between multiple versions that the music of search associates intuitively, illustrate abundant and music property relation accurately, be conducive to Search Results user being guided to user's needs.
4th kind of exhibition method, when the song of user search has an original singer, audition card provides the link of original singer, and user can direct audition original singer on card, after audition completes, directly turns back to turn over and sings.
Refer to Fig. 5 e and Fig. 5 f.In Fig. 5 e, when user is by Baidu's search engine search " Old Boy's song ", audition card in Search Results provides the link " audition original singer version " あ り Ga と う " bridge Zhuo Mi " of original singer, user can direct audition original singer on card, after audition completes, audition card provides link " be back to " Old Boy " chopsticks brother " (as shown in figure 5f) of turning over and singing, can directly turn back to turn over and sing.Illustrate abundant and music property relation accurately in audition card, be conducive to Search Results user being guided to user's needs.
Embodiment five
Referring to Fig. 6, is the structural representation of a kind of query processing device that the embodiment of the present invention five provides.This device comprises: problem receiver module 610, relation are searched module 620 and returned module 630.
Wherein, problem receiver module 610 is for receiving problem to be retrieved; Relation searches module 620 for when including music information in described problem, from music property relation storehouse, corresponding music property relation information is searched according to described music information, wherein, music property relation stock contains the relation information between music property data and music property; Return module 630 for returning the music property relation information of described correspondence.
The technical scheme of the present embodiment, after receiving problem to be retrieved, when including music information in described problem, including the relation information between the music property adopting any embodiment of the present invention to provide the method for the happy entity relationship of accordatura really to be formed, and mate the music information comprised in described problem in the music property relation storehouse including music property data, the music property relation information corresponding with the music information in described problem can be obtained.
In such scheme, described relation is searched module 620 and specifically be may be used for:
According to musical designation search the music property corresponding with described musical designation original singer, turn over sing, the information of the bent or arrangement of original work;
Or,
The original work song of the music property corresponding with described musical designation and composer or the information of arrangement is searched according to musical designation and composer;
Or,
Search the original singer of the song corresponding with described musical designation and composer according to song title and singer or turn over the information sung.
In such scheme, described in return module 630 and can also be used for: return the music property data that the music property relation information of described correspondence is corresponding.
The query processing device that the embodiment of the present invention provides can perform the inquiry processing method that any embodiment of the present invention provides, and possesses the corresponding function module and the beneficial effect of manner of execution.
Last it is noted that above each embodiment is only for illustration of technical scheme of the present invention, but not be limited; In embodiment preferred embodiment, be not limited, to those skilled in the art, the present invention can have various change and change.All do within spirit of the present invention and principle any amendment, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (16)

1. determine a method for music property relation, it is characterized in that, comprising:
Original web page is obtained from network side;
From described original web page, extract the first webpage, described first webpage is the webpage including musical relations keyword;
Judge whether there is sample sentence in described first webpage, described sample sentence is for including the sentence of the information of at least two music property;
By carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
2. method according to claim 1, is characterized in that, extracts the first webpage, comprising from described original web page:
Question and answer webpage and forum Web pages is identified from original web page;
From described question and answer webpage and forum Web pages, search described musical relations keyword, will the question and answer webpage of described musical relations keyword or forum Web pages be included as described first webpage.
3. method according to claim 1, is characterized in that, judges whether there is sample sentence in described first webpage, comprising:
Be sentence by the text dividing in described first webpage;
Mated with the music property in music libraries by described sentence, wherein, described music libraries stores music property data and music property information;
When described sentence matches at least two music property, judge to there is sample sentence in described first webpage.
4. method according to claim 3, is characterized in that, the foundation of described music libraries, comprising:
Music property data and information is extracted from described original web page;
Duplicate removal and fusion treatment are carried out to the described music property data extracted and information, obtains solid data and the entity information of each music property;
Set up the index of described each music property, generate described music libraries.
5., according to the arbitrary described method of claim 1-4, it is characterized in that, by carrying out semantic analysis to sample sentence, after the relation described in determining between at least two music property, also comprising:
Sing for turning between described at least two music property, original singer, reorganization or original work bent relation time, by the solid data of described at least two music property and relation information corresponding stored, set up musical relations storehouse.
6. an inquiry processing method, is characterized in that, comprising:
Receive problem to be retrieved;
When including music information in described problem, from music property relation storehouse, search corresponding music property relation information according to described music information, wherein, music property relation stock contains the relation information between music property data and music property;
Return the music property relation information of described correspondence.
7. method according to claim 6, is characterized in that, searches corresponding music property relation information, comprising according to described music information from music property relation storehouse:
According to musical designation search the music property corresponding with described musical designation original singer, turn over sing, the information of the bent or arrangement of original work;
Or,
The original work song of the music property corresponding with described musical designation and composer or the information of arrangement is searched according to musical designation and composer;
Or,
Search the original singer of the song corresponding with described musical designation and composer according to song title and singer or turn over the information sung.
8. the method according to claim 6 or 7, is characterized in that, after search corresponding music property relation information from music property relation storehouse according to described music information, described method also comprises:
Return the music property data that the music property relation information of described correspondence is corresponding.
9. determine a device for music property relation, it is characterized in that, comprising:
Webpage acquisition module, for obtaining original web page from network side;
First extraction module, for extracting the first webpage from described original web page, described first webpage is the webpage including musical relations keyword;
Sample judge module, for judging whether there is sample sentence in described first webpage, described sample sentence is for including the sentence of the information of at least two music property;
Relationship determination module, for by carrying out semantic analysis to sample sentence, the relation described in determining between at least two music property.
10. device according to claim 9, is characterized in that, described first extraction module specifically for:
Question and answer webpage and forum Web pages is identified from original web page;
From described question and answer webpage and forum Web pages, search described musical relations keyword, will the question and answer webpage of described musical relations keyword or forum Web pages be included as described first webpage.
11. devices according to claim 9, is characterized in that, described sample judge module specifically for:
Be sentence by the text dividing in described first webpage;
Mated with the music property in music libraries by described sentence, wherein, described music libraries stores music property data and music property information;
When described sentence matches at least two music property, judge to there is sample sentence in described first webpage.
12. devices according to claim 11, is characterized in that, described device also comprises:
Second extraction module, for extracting music property data and information from described original web page;
Data processing module, for carrying out duplicate removal and fusion treatment to the described music property data extracted and information, obtains solid data and the entity information of each music property;
First sets up module, for setting up the index of described each music property, generates described music libraries.
13. according to the arbitrary described device of claim 9-12, and it is characterized in that, described device also comprises:
Second sets up module, for singing for turning between at least two music property described in determining when described relationship determination module, original singer, reorganization or original work bent relation time, by the solid data of described at least two music property and relation information corresponding stored, set up musical relations storehouse.
14. 1 kinds of query processing devices, is characterized in that, comprising:
Problem receiver module, for receiving problem to be retrieved;
Relation searches module, for when including music information in described problem, from music property relation storehouse, search corresponding music property relation information according to described music information, wherein, music property relation stock contains the relation information between music property data and music property;
Return module, for returning the music property relation information of described correspondence.
15. devices according to claim 14, is characterized in that, described relation search module specifically for:
According to musical designation search the music property corresponding with described musical designation original singer, turn over sing, the information of the bent or arrangement of original work;
Or,
The original work song of the music property corresponding with described musical designation and composer or the information of arrangement is searched according to musical designation and composer;
Or,
Search the original singer of the song corresponding with described musical designation and composer according to song title and singer or turn over the information sung.
16. devices according to claims 14 or 15, is characterized in that, described in return module also for: return the music property data that the music property relation information of described correspondence is corresponding.
CN201410749432.2A 2014-12-09 2014-12-09 Determine the method and apparatus of music property relationship and inquiry processing method and device Active CN104484379B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410749432.2A CN104484379B (en) 2014-12-09 2014-12-09 Determine the method and apparatus of music property relationship and inquiry processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410749432.2A CN104484379B (en) 2014-12-09 2014-12-09 Determine the method and apparatus of music property relationship and inquiry processing method and device

Publications (2)

Publication Number Publication Date
CN104484379A true CN104484379A (en) 2015-04-01
CN104484379B CN104484379B (en) 2018-06-12

Family

ID=52758920

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410749432.2A Active CN104484379B (en) 2014-12-09 2014-12-09 Determine the method and apparatus of music property relationship and inquiry processing method and device

Country Status (1)

Country Link
CN (1) CN104484379B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717062A (en) * 2018-07-11 2020-01-21 阿里巴巴集团控股有限公司 Music searching and vehicle-mounted music playing method, device, equipment and storage medium
CN111552778A (en) * 2020-04-26 2020-08-18 北京达佳互联信息技术有限公司 Audio resource management method, device, computer readable storage medium and equipment
CN112948603A (en) * 2021-03-08 2021-06-11 北方自动控制技术研究所 Transportation delivery knowledge question-answering method based on transfer learning
CN113609309A (en) * 2021-08-16 2021-11-05 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
CN110245197B (en) * 2019-05-20 2022-01-28 北京百度网讯科技有限公司 Whole-network entity association method and system
WO2023040808A1 (en) * 2021-09-18 2023-03-23 华为技术有限公司 Webpage retrieval method and related device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101071422A (en) * 2006-06-15 2007-11-14 腾讯科技(深圳)有限公司 Musicfile search processing system and method
US20090106203A1 (en) * 2007-10-18 2009-04-23 Zhongmin Shi Method and apparatus for a web search engine generating summary-style search results
CN102708100A (en) * 2011-03-28 2012-10-03 北京百度网讯科技有限公司 Method and device for digging relation keyword of relevant entity word and application thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101071422A (en) * 2006-06-15 2007-11-14 腾讯科技(深圳)有限公司 Musicfile search processing system and method
US20090106203A1 (en) * 2007-10-18 2009-04-23 Zhongmin Shi Method and apparatus for a web search engine generating summary-style search results
CN102708100A (en) * 2011-03-28 2012-10-03 北京百度网讯科技有限公司 Method and device for digging relation keyword of relevant entity word and application thereof

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717062A (en) * 2018-07-11 2020-01-21 阿里巴巴集团控股有限公司 Music searching and vehicle-mounted music playing method, device, equipment and storage medium
CN110717062B (en) * 2018-07-11 2024-03-22 斑马智行网络(香港)有限公司 Music search and vehicle-mounted music playing method, device, equipment and storage medium
CN110245197B (en) * 2019-05-20 2022-01-28 北京百度网讯科技有限公司 Whole-network entity association method and system
CN111552778A (en) * 2020-04-26 2020-08-18 北京达佳互联信息技术有限公司 Audio resource management method, device, computer readable storage medium and equipment
CN111552778B (en) * 2020-04-26 2024-05-14 北京达佳互联信息技术有限公司 Audio resource management method, device, computer readable storage medium and equipment
CN112948603A (en) * 2021-03-08 2021-06-11 北方自动控制技术研究所 Transportation delivery knowledge question-answering method based on transfer learning
CN112948603B (en) * 2021-03-08 2023-05-05 北方自动控制技术研究所 Transport delivery knowledge question-answering method based on transfer learning
CN113609309A (en) * 2021-08-16 2021-11-05 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
CN113609309B (en) * 2021-08-16 2024-02-06 脸萌有限公司 Knowledge graph construction method and device, storage medium and electronic equipment
WO2023040808A1 (en) * 2021-09-18 2023-03-23 华为技术有限公司 Webpage retrieval method and related device

Also Published As

Publication number Publication date
CN104484379B (en) 2018-06-12

Similar Documents

Publication Publication Date Title
CN108829858B (en) Data query method and device and computer readable storage medium
US7739257B2 (en) Search engine
US8381095B1 (en) Automated document revision markup and change control
CN104484379A (en) Method and device for determining relation among musical entities and inquiry processing method and device
US10423649B2 (en) Natural question generation from query data using natural language processing system
Papadakis et al. Stavies: A system for information extraction from unknown web data sources through automatic web wrapper generation using clustering techniques
Arendarenko et al. Ontology-based information and event extraction for business intelligence
CN101192234A (en) Searching system and method based on web page extraction
CN111831911A (en) Query information processing method and device, storage medium and electronic device
US7853595B2 (en) Method and apparatus for creating a tool for generating an index for a document
Kumar Apache Solr search patterns
Chieze et al. An automatic system for summarization and information extraction of legal information
CN103020311A (en) Method and system for processing user search terms
Cuculovic et al. Semantics to the rescue of document‐based XML diff: A JATS case study
Adrian et al. Epiphany: Adaptable rdfa generation linking the web of documents to the web of data
KR102298397B1 (en) Citation Relationship Analysis Method and System Based on Citation Type
Yoon et al. A conference paper exploring system based on citing motivation and topic
CN103870590A (en) Webpage identification method and device with error-reported characteristic
YesuRaju et al. A language independent web data extraction using vision based page segmentation algorithm
Neubert Leveraging SKOS to Trace the Overhaul of the STW Thesaurus for Economics
Chou et al. Mining features for web ner model construction based on distant learning
Francom et al. Creating a web-based lexical corpus and information-extraction tools for the Semitic language Maltese
Tian et al. AutoCom: Automatic Comment Generation for C Code.
AU2012200686B2 (en) Improved search engine
AU2006200426B2 (en) Improved search engine

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant