CN103678675A - Method, server and system for achieving searching through Pinyin - Google Patents

Method, server and system for achieving searching through Pinyin Download PDF

Info

Publication number
CN103678675A
CN103678675A CN201310727117.5A CN201310727117A CN103678675A CN 103678675 A CN103678675 A CN 103678675A CN 201310727117 A CN201310727117 A CN 201310727117A CN 103678675 A CN103678675 A CN 103678675A
Authority
CN
China
Prior art keywords
word
phonetic
group
keyword
individual character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310727117.5A
Other languages
Chinese (zh)
Inventor
乔磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LeTV Information Technology Beijing Co Ltd
Original Assignee
LeTV Information Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LeTV Information Technology Beijing Co Ltd filed Critical LeTV Information Technology Beijing Co Ltd
Priority to CN201310727117.5A priority Critical patent/CN103678675A/en
Publication of CN103678675A publication Critical patent/CN103678675A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a method, server and system for achieving searching through Pinyin. The method comprises the steps that a Pinyin keyword comprising at least one Pinyin character is received, and matching is conducted in a Pinyin word bank through the Pinyin keyword, wherein the Pinyin word bank comprises a plurality of groups, each group corresponds to a character and comprises at least one Pinyin character, and relevant polyphone Pinyin exists between more than two Pinyin characters as for the group including more than two Pinyin characters; according to the groups matched with the Pinyin keyword, characters corresponding to the matched groups are obtained, and a character keyword corresponding to the Pinyin keyword is composed of the characters corresponding to the groups matched with the Pinyin keyword, wherein as for the Pinyin keyword matched with more than two groups, each Pinyin character in the Pinyin keyword is matched with one group. Even when mistakenly-used polyphone Pinyin is included in an input Pinyin keyword, the method, server and system for achieving searching through Pinyin can still obtain accurate characters through matching, and then resource searching is finished.

Description

The method of searching for by phonetic, server and system
Technical field
The application relates to streaming media playing field, is specifically related to a kind of method of searching for by phonetic, server and system.
Background technology
Nowadays, video website generally all supports user to search for results for video at the direct input Pinyin of search box.The word that the server of video website can come automatic adaptation phonetic to reflect according to the phonetic of input, then search for corresponding results for video according to these words.Therefore as long as the phonetic of user's input is correct, often just can find the video of thinking search.
But have a large amount of polyphones in Chinese character, a polyphone is to there being a plurality of pronunciations, and these different pronunciations can not be used with, and the collocation of different linguistic context need to be used corresponding pronunciation, otherwise entanglement can occur, and cannot obtain the meaning that really will express.If comprise the polyphone phonetic that this entanglement is used in the phonetic key word of inputting when user search, obviously a lot of errors will occur.For example, if user wishes search " discriminating Huan passes ", need input Pinyin " zhen huan zhuan ", " but biography " itself has two pronunciations, respectively " zhuan " (sound " is write ") and " chuan " (sound " ship "), and grasp not so good user for some mandarin pronunciations, the problem of input is obscured polyphone in as easy as rolling off a log generation, what for example these users wished input is " discriminating Huan passes ", but likely because polyphone is obscured, Pinyin Input is become to " zhuan huan chuan ", the result that so just cannot correctly find user to wish.
Therefore,, while comprising the polyphone phonetic of entanglement use in the phonetic key word of user's input, how the Obtaining Accurate correct Search Results corresponding with phonetic key word just becomes technical matters urgently to be resolved hurrily.
Summary of the invention
The application's technical matters to be solved is to provide a kind of method of searching for by phonetic, server and system, while comprising the polyphone phonetic of entanglement use in the phonetic keyword of user's input, still can obtain exactly the correct Search Results corresponding with phonetic key word.
In order to address the above problem, the application has disclosed a kind of method of searching for by phonetic, comprise: receive the phonetic keyword that comprises at least one phonetic word, according to described phonetic keyword, in pinyin lexicon, mate, wherein, in described pinyin lexicon, there are a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic; According to the group of mating with described phonetic keyword, obtain the word corresponding to group of described coupling, according to word corresponding to the described group matching, form the word keyword that described phonetic keyword is corresponding, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group; According to described word keyword, form information and be sent to client demonstration, or, the resource corresponding with described word keyword according to described word keyword search.
Further, described pinyin lexicon, comprising: word phonetic word bank and individual character phonetic word bank, and wherein, the group in described word phonetic word bank is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank is individual character group, the corresponding single word of each individual character group.
Further, the operation of mating in pinyin lexicon according to described phonetic keyword, specifically comprise: according to the orientation from left to right of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the first number; According to the orientation from right to left of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the second number; If described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword; If described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword.
Further, according to the group of mating with described phonetic keyword, obtain the word corresponding to group of described coupling, according to word corresponding to the described group matching, form the operation of the word keyword that described phonetic keyword is corresponding, specifically comprise: when group that described phonetic keyword matches comprises the word group of described the first number and the individual character group of described the 3rd number, according to the word group of described the first number, obtain the word word of corresponding the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number, the word keyword that forms described phonetic keyword according to the single word of the word word of described the first number and the 3rd number, when group that described phonetic keyword matches comprises the word group of described the second number and the individual character group of described the 4th number, according to the word group of described the second number, obtain the word word of corresponding the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
Further, each group in described pinyin lexicon also has respectively different weights; According to described phonetic keyword when described pinyin lexicon is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.
In order to address the above problem, the application has also disclosed a kind of server of searching for by phonetic, comprise: receiver module, matching module, acquisition module, composite module, execution module and pinyin lexicon, wherein, described receiver module, for receiving the phonetic keyword that comprises at least one phonetic word; Described matching module, for mating in pinyin lexicon according to described phonetic keyword, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group; Described acquisition module, for according to the group of mating with described phonetic keyword, obtains the word corresponding to group of described coupling; Described composite module, forms for word corresponding to group matching described in basis the word keyword that described phonetic keyword is corresponding; Described execution module, shows for forming information according to described word keyword and being sent to client, or, the resource corresponding with described word keyword according to described word keyword search; Described pinyin lexicon, for preserving a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic.
Further, described pinyin lexicon, comprising: word phonetic word bank and individual character phonetic word bank, and wherein, the group in described word phonetic word bank is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank is individual character group, the corresponding single word of each individual character group.
Further, described matching module specifically for according to the orientation from left to right of described phonetic keyword, mates described phonetic keyword in described word phonetic word bank, matches the word group of the first number; According to the orientation from right to left of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the second number; If described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword; If described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword.
Further, described acquisition module, when group that described phonetic keyword matches comprises the word group of described the first number and the individual character group of described the 3rd number, specifically for obtain the word word of corresponding the first number according to the word group of described the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number; When group that described phonetic keyword matches comprises the word group of described the second number and the individual character group of described the 4th number, specifically for obtain the word word of corresponding the second number according to the word group of described the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number; Described composite module, specifically for form the word keyword of described phonetic keyword according to the single word of the word word of described the first number and the 3rd number, or, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
Further, described pinyin lexicon, specifically for configuring respectively different weights for each group wherein; Described matching module, specifically for according to described phonetic keyword when described pinyin lexicon is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.
In order to address the above problem, the application has also disclosed a kind of system of searching for by phonetic, comprising: one or more client and servers, and wherein, described server, comprises the foregoing server of searching for by phonetic; Described one or more client, for identifying the phonetic keyword that comprises at least one phonetic word of user's input, and sends to described server.
Further, described one or more client, periphery specifically for the display position at described phonetic keyword arranges a viewing area or opens new page, according to described word keyword, forms information and described information is presented to described viewing area or new page; Receive the click triggering of user to described information, to described server, send trigger message so that described server is retrieved in search engine according to word keyword.
Compared with prior art, the application can obtain and comprise following technique effect:
1) when user error, obscured the pronunciation of polyphone, and while using the pronunciation input Pinyin of this mistake, the application can be organized into phonetic storehouse by various polyphone pronunciations in group's mode by realizing, even if comprise like this polyphone phonetic that entanglement is used in input Pinyin keyword, use phonetic storehouse coupling can match word accurately, thereby complete the search of resource.
2) when phonetic keyword is mated, from left to right and from right to left two orientations are mated respectively can to use phonetic keyword, selection matches the more direction of word group and continues coupling as the direction that meets user and input original idea, further improve the accuracy of coupling, thereby be conducive to the search of resource.
3) for phonetic keyword, matching after accurate word, can also be shown to user by the mode of prompting, so that user is confirmed whether the result of coupling and meets the original idea of user's input, guaranteed substantially in the situation that polyphone pronunciation obscure the accuracy that use is mated, thereby complete the search of resource.
It should be noted that, above technique effect might not reach simultaneously.
Accompanying drawing explanation
Accompanying drawing described herein is used to provide further understanding of the present application, forms the application's a part, and the application's schematic description and description is used for explaining the application, does not form the improper restriction to the application.In the accompanying drawings:
Fig. 1 is the method flow diagram of the application the first embodiment.
Fig. 2 is the method flow diagram of the application the second embodiment.
Fig. 3 is the server architecture figure of the application the 3rd embodiment.
Fig. 4 is the system architecture diagram of the application the 4th embodiment.
Embodiment
To coordinate drawings and Examples to describe the application's embodiment in detail below, by this application's implementation procedure how application technology means solve technical matters and reach technology effect can be fully understood and be implemented according to this.
As used some vocabulary in the middle of instructions and claim, censure specific components.Those skilled in the art should understand, and hardware manufacturer may be called same assembly with different nouns.This specification and claims are not used as distinguishing the mode of assembly with the difference of title, but the difference in function is used as the criterion of distinguishing with assembly.If " comprising " mentioned in the middle of instructions and claim is in the whole text an open language, therefore should be construed to " comprise but be not limited to "." roughly " refer to that in receivable error range, those skilled in the art can solve the technical problem within the scope of certain error, reach described technique effect substantially.In addition, " couple " word and comprise directly any and electric property coupling means indirectly at this.Therefore, if describe a first device in literary composition, be coupled to one second device, represent that described first device can directly be electrically coupled to described the second device, or be indirectly electrically coupled to described the second device by other devices or the means that couple.Instructions subsequent descriptions is for implementing the application's better embodiment, and right described description is to illustrate that the application's rule is object, not in order to limit the application's scope.The application's protection domain is when being as the criterion depending on the claims person of defining.
embodiment mono-
As shown in Figure 1, the method flow diagram for the application the first embodiment, comprising:
Step S100, receives the phonetic keyword that comprises at least one phonetic word;
User after input Pinyin keyword, clicks search in the search box of client or browser, and server receives phonetic keyword;
In phonetic keyword, can comprise one or more phonetic words, certainly, user is when input and do not know whether phonetic keyword comprises phonetic word, angle from pinyin lexicon coupling, phonetic keyword comprises a phonetic word to I haven't seen you for ages, certainly, may be also a plurality of.
Step S102, according to described phonetic keyword, in pinyin lexicon, mate, wherein, in described pinyin lexicon, there are a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic;
For instance, because " biography " is polyphone, have the pronunciation of " zhuan " (sound " is write ") and " chuan " (sound " ship "), the phonetic word of " discriminating Huan passes " can be just " zhen huan zhuan " and " zhen huan chuan ".Therefore " zhen huan zhuan " and " zhen huan chuan " formed to Yi Ge group, the word " discriminating Huan passes " that this group is corresponding.Like this, no matter user's input is the phonetic that correct Zhen Huan passes, or the wrong phonetic that has used polyphone pronunciation, can correspond to the word of " discriminating Huan passes ".
The phonetic keyword of user's input must be comprised of one or more phonetic words, by coupling, just phonetic keyword can be decomposed into a plurality of phonetic words and hit group, this process is coupling on the one hand, also be equivalent on the other hand phonetic keyword to carry out cutting, cutting is a plurality of groups.Certainly, a complete noun has plenty of in the group of these couplings, is likely also a word.In order to mate more accurately, when building pinyin lexicon, just according to noun and word, set up dictionary respectively---described pinyin lexicon, comprise: word phonetic word bank and individual character phonetic word bank, wherein, group in described word phonetic word bank is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank is individual character group, the corresponding single word of each individual character group.By internet and relevant dictionary, collect phonetic word corresponding to ordinary word Chinese language word and obtain primary word phylum group, be kept at word phonetic word bank; Meanwhile, for some special application, for example video class field, also obtains group by collecting the phonetic word of movie and television play title, star's name especially, is kept at word phonetic word bank.
For example: in pinyin lexicon, preserve " discriminate Huan pass " group, " discriminating Huan " group, " in " group and " " group, " discriminating Huan passes " phonetic word corresponding to group comprises " zhen huan zhuan " and " zhen huan chuan ", phonetic word corresponding to group comprises " zhen huan " " to discriminate Huan ", " in " phonetic word corresponding to group comprise " zhong ", " " phonetic word corresponding to group comprise " de " and " di ".The phonetic keyword of user input be " zhen huan zhuan zhong de zhen huan " (discriminating Zhen Huan of Huan crossing), just obtains above-mentioned Si Ge group after like this phonetic keyword being mated.
According to described phonetic keyword when pinyin lexicon is mated, to from orientation and the orientation from right to left from left to right of described phonetic keyword, carry out coupling twice respectively, described phonetic keyword is because from different orientations, semantic combination changes possibly, therefore can match different result, certainly, so-called result mainly refers to the Different matching result of word group here, does not consider individual character group here.In two orientations, with a fairly large number of direction of word group of the coupling that obtains, be as the criterion, according to this direction, will by the remainder of word group matches, do not mated again, the coupling of this remainder is mainly the coupling of individual character group.Using the word group matching in this direction and individual character group as matching result.This place can be with reference to embodiment bis-.
In addition, each group in described pinyin lexicon also has respectively different weights; According to described phonetic keyword when described pinyin lexicon is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.For example: " pleasure " has pronunciation " le " (sound " is strangled ") and " yue " (sound " is pleased "), and " moon " has pronunciation " yue " (sound " is pleased ").Like this, " le " and " yue " forms Yi Ge group, the word " pleasure " that this group is corresponding; " yue " forms Yi Ge group, the word " moon " that this group is corresponding.Obviously, in " pleasure " group and " moon " group, have identical phonetic word " yue ", Er Zheliangge group is the different word of correspondence respectively.At this moment in the phonetic key word of user input, during " yue " coupling group, can match " pleasure " group and " moon " group simultaneously, now need the weight in conjunction with Zhe Liangge group, the large group of weighted value is as the group of mating with " yue ".
Step S104, according to the group of mating with described phonetic keyword, obtain the word corresponding to group of described coupling, according to word corresponding to the described group matching, form the word keyword that described phonetic keyword is corresponding, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group;
Described phonetic keyword only matches Yi Ge group after coupling, and at this moment word corresponding to this group is exactly the word keyword that described phonetic keyword is corresponding;
In described phonetic keyword, different piece is after coupling, each part matches respectively Yi Ge group, at this moment word corresponding to group each being matched, gets up according to each several part sequential combination in phonetic keyword, has just obtained the word keyword that described phonetic keyword is corresponding.
For example: in pinyin lexicon, preserve " discriminate Huan pass " group, " discriminating Huan " group, " in " group and " " group, " discriminating Huan passes " phonetic word corresponding to group comprises " zhen huan zhuan " and " zhen huan chuan ", phonetic word corresponding to group comprises " zhen huan " " to discriminate Huan ", " in " phonetic word corresponding to group comprise " zhong ", " " phonetic word corresponding to group comprise " de " and " di ".The phonetic keyword of user's input is " zhen huan zhuan zhong de zhenhuan ", like this to just obtaining above-mentioned Si Ge group after phonetic keyword coupling, the word corresponding according to each group, just can obtain word key word according to phonetic key word and " discriminate the Zhen Huan of Huan crossing ".
Step S106, forms information and is sent to client demonstration according to described word keyword, or, the resource corresponding with described word keyword according to described word keyword search.
The word keyword that can not obtain according to phonetic keyword user's prompting here, but directly according to word keyword, in search engine, retrieve, obtain the resource corresponding with described word keyword, these resources are shown to user;
Certainly, also after can being the word keyword obtaining according to phonetic keyword, client arranges a viewing area or opens new page at the periphery of the display position of described phonetic keyword, form information and described information is presented to described viewing area or thereby new page facilitates user to confirm according to described word keyword, receive the click triggering of user to described information, client triggers described server and retrieves in search engine according to word keyword, obtain the resource corresponding with described word keyword, these resources are shown to user.
embodiment bis-
As shown in Figure 2, be the method flow diagram of the application the second embodiment, mainly for step S102, carry out refinement, comprising:
Step S200 according to the orientation from left to right of described phonetic keyword, mates described phonetic keyword in described word phonetic word bank, matches the word group of the first number;
Step S202 according to the orientation from right to left of described phonetic keyword, mates described phonetic keyword in described word phonetic word bank, matches the word group of the second number;
The execution sequence of step S200 and S202 in no particular order.Described phonetic keyword is because from different orientations, and semantic combination changes possibly, therefore can match different result, and certainly, so-called result mainly refers to the Different matching result of word group here, does not consider individual character group here.
Here it should be noted that, coupling itself be not according in described phonetic keyword one by one letter consider, but phonetic word in described phonetic keyword is mated, therefore according to the orientation from right to left of described phonetic keyword, refer to the orientation of phonetic word.For example: by orientation from left to right, phonetic keyword is " zhen huan chuan ", by orientation from right to left, phonetic keyword is " chuan huan zhen ".
Step S204, more described the first number and the second number, according to comparative result execution step S206 or step S210;
Step S206, if described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword;
If the first number is greater than the second number, illustrate according to the orientation from left to right of described phonetic keyword and matched more word group, remaining individual character is relatively less, the original idea of the input of can being more close to the users like this.So select to match the more direction of word group, i.e. orientation is from left to right as the direction that phonetic keyword is mated, after word group matches, remaining part is a small amount of individual character, according to individual character phonetic word bank, mate, just can obtain finally several word groups of matching by orientation from left to right and a small amount of individual character group.
Step S208, according to the word group of described the first number, obtain the word word of corresponding the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number, according to the single word of the word word of described the first number and the 3rd number, form the word keyword of described phonetic keyword;
Step S210, if described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword;
If the second number is greater than the first number, illustrate according to the orientation from right to left of described phonetic keyword and matched more word group, remaining individual character is relatively less, the original idea of the input of can being more close to the users like this.So select to match the more direction of word group, i.e. orientation is from right to left as the direction that phonetic keyword is mated, after word group matches, remaining part is a small amount of individual character, according to individual character phonetic word bank, mate, just can obtain finally several word groups of matching by orientation from left to right and a small amount of individual character group.
Step S212, according to the word group of described the second number, obtain the word word of corresponding the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
Certainly, below also likely there is the situation that the second number equals the first number, because this programme is mainly used in video search field, user's input Pinyin keyword is with movie and television play title, movie star's name is called master, and movie and television play title, movie star's title is often all comparatively unique, and movie and television play title, movie star's title has often all been carried out whole pinyin marking and has been formed corresponding phonetic word in word phonetic word bank, therefore no matter be which orientation is mated, all be not easy to occur the variation of semantic combination, so when the second number equals the first number, the phonetic word of the first number often matching and the phonetic word of the second number are identical, now no matter using which orientation as the direction that phonetic keyword is mated, last matching result is all the same.
In addition, according to take in step S206 and S210 mode, to match the many orientations of word group be matching direction, can reduce widely the possibility that has deviation with the original idea of user's input, certainly, can not eliminate this possibility completely.Even but still there is deviation, can also by the information of step S106, allow user confirm, if user thinks that deviation is larger, can also revise by the mode of input characters.
embodiment tri-
As shown in Figure 3, be the server architecture figure of the application the 3rd embodiment.The server of searching for by phonetic, comprising: receiver module 30, matching module 31, acquisition module 32, composite module 33, execution module 34 and pinyin lexicon 35, wherein,
Described receiver module 30, for receiving the phonetic keyword that comprises at least one phonetic word;
Described matching module 31, couple with receiver module 30, pinyin lexicon 35 respectively, for mating in pinyin lexicon according to described phonetic keyword, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group;
Described acquisition module 32, couples with matching module 31, for according to the group of mating with described phonetic keyword, obtains the word corresponding to group of described coupling;
Described composite module 33, couples with acquisition module 32, for word corresponding to group matching described in basis, forms the word keyword that described phonetic keyword is corresponding;
Described execution module 34, couples with composite module 33, for forming information according to described word keyword and being sent to client, shows, or, the resource corresponding with described word keyword according to described word keyword search;
Described pinyin lexicon 35, for preserving a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic; Described pinyin lexicon 35, comprising: word phonetic word bank 351 and individual character phonetic word bank 352, and wherein, the group in described word phonetic word bank 351 is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank 352 is individual character group, the corresponding single word of each individual character group.
Described pinyin lexicon 35, specifically for configuring respectively different weights for each group wherein;
Described matching module 31, specifically for according to described phonetic keyword when described pinyin lexicon 35 is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.
Described matching module 31 specifically for according to the orientation from left to right of described phonetic keyword, mates described phonetic keyword in described word phonetic word bank 351, matches the word group of the first number; According to the orientation from right to left of described phonetic keyword, in described word phonetic word bank 351, described phonetic keyword is mated, match the word group of the second number; If described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank 352, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword; If described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank 352, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword.
Described acquisition module, when group that described phonetic keyword matches comprises the word group of described the first number and the individual character group of described the 3rd number, specifically for obtain the word word of corresponding the first number according to the word group of described the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number; When group that described phonetic keyword matches comprises the word group of described the second number and the individual character group of described the 4th number, specifically for obtain the word word of corresponding the second number according to the word group of described the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number;
Described composite module 32, specifically for form the word keyword of described phonetic keyword according to the single word of the word word of described the first number and the 3rd number, or, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
embodiment tetra-
As shown in Figure 4, the system architecture diagram for the application the 4th embodiment, comprising: one or more clients 600 and server 620, and described one or more clients 600 are by server described in network insertion 620, wherein,
Described server 620, comprises that reference example three, does not repeat them here as in Fig. 3 and the shown server of searching for by phonetic of embodiment tri-.
Described client 600, for identifying the phonetic keyword that comprises at least one phonetic word of user's input, and sends to described server 620; Also the periphery for the display position at described phonetic keyword arranges a viewing area or opens new page, form information and described information is presented to described viewing area or thereby new page facilitates user to confirm according to described word keyword, receive the click triggering of user to described information, to server, send trigger message so that described server is retrieved in search engine according to word keyword.
Described client 600 can be Web TV, panel computer, PC, Internet video player (network machine top box), mobile phone etc., and the application is not limited to this.
Also it should be noted that, term " comprises ", " comprising " or its any other variant are intended to contain comprising of nonexcludability, thereby make to comprise that the commodity of a series of key elements or system not only comprise those key elements, but also comprise other key elements of clearly not listing, or be also included as this commodity or the intrinsic key element of system.The in the situation that of more restrictions not, the key element being limited by statement " comprising ... ", and be not precluded within and comprise in the commodity of described key element or system and also have other identical element.
Above-mentioned explanation has illustrated and has described some preferred embodiments of the application, but as previously mentioned, be to be understood that the application is not limited to disclosed form herein, should not regard the eliminating to other embodiment as, and can be used for various other combinations, modification and environment, and can, in invention contemplated scope described herein, by technology or the knowledge of above-mentioned instruction or association area, change.And the spirit and scope that the change that those skilled in the art carry out and variation do not depart from the application, all should be in the protection domain of the application's claims.

Claims (12)

1. a method of searching for by phonetic, is characterized in that, comprising:
Reception comprises the phonetic keyword of at least one phonetic word, according to described phonetic keyword, in pinyin lexicon, mate, wherein, in described pinyin lexicon, there are a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic;
According to the group of mating with described phonetic keyword, obtain the word corresponding to group of described coupling, according to word corresponding to the described group matching, form the word keyword that described phonetic keyword is corresponding, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group;
According to described word keyword, form information and be sent to client demonstration, or, the resource corresponding with described word keyword according to described word keyword search.
2. the method for claim 1, is characterized in that,
Described pinyin lexicon, comprising: word phonetic word bank and individual character phonetic word bank, and wherein, the group in described word phonetic word bank is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank is individual character group, the corresponding single word of each individual character group.
3. method as claimed in claim 2, is characterized in that, the operation of mating in pinyin lexicon according to described phonetic keyword, specifically comprises:
According to the orientation from left to right of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the first number; According to the orientation from right to left of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the second number;
If described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword;
If described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword.
4. method as claimed in claim 3, it is characterized in that, according to the group of mating with described phonetic keyword, obtain the word corresponding to group of described coupling, according to word corresponding to the described group matching, form the operation of the word keyword that described phonetic keyword is corresponding, specifically comprise:
When group that described phonetic keyword matches comprises the word group of described the first number and the individual character group of described the 3rd number, according to the word group of described the first number, obtain the word word of corresponding the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number, according to the single word of the word word of described the first number and the 3rd number, form the word keyword of described phonetic keyword;
When group that described phonetic keyword matches comprises the word group of described the second number and the individual character group of described the 4th number, according to the word group of described the second number, obtain the word word of corresponding the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
5. the method for claim 1, is characterized in that,
Each group in described pinyin lexicon also has respectively different weights;
According to described phonetic keyword when described pinyin lexicon is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.
6. a server of searching for by phonetic, is characterized in that, comprising: receiver module, matching module, acquisition module, composite module, execution module and pinyin lexicon, wherein,
Described receiver module, for receiving the phonetic keyword that comprises at least one phonetic word;
Described matching module, for mating in pinyin lexicon according to described phonetic keyword, wherein, for the described phonetic keyword that matches two above groups, each the phonetic word in described phonetic keyword matches Yi Ge group;
Described acquisition module, for according to the group of mating with described phonetic keyword, obtains the word corresponding to group of described coupling;
Described composite module, forms for word corresponding to group matching described in basis the word keyword that described phonetic keyword is corresponding;
Described execution module, shows for forming information according to described word keyword and being sent to client, or, the resource corresponding with described word keyword according to described word keyword search;
Described pinyin lexicon, for preserving a plurality of groups, the corresponding word of each group, each group comprises at least one phonetic word, for the group that comprises two above phonetic words, between described two above phonetic words, has relevant polyphone phonetic.
7. server as claimed in claim 6, is characterized in that,
Described pinyin lexicon, comprising: word phonetic word bank and individual character phonetic word bank, and wherein, the group in described word phonetic word bank is word group, the corresponding word word of each word group; Group in described individual character phonetic word bank is individual character group, the corresponding single word of each individual character group.
8. server as claimed in claim 7, is characterized in that,
Described matching module specifically for according to the orientation from left to right of described phonetic keyword, mates described phonetic keyword in described word phonetic word bank, matches the word group of the first number; According to the orientation from right to left of described phonetic keyword, in described word phonetic word bank, described phonetic keyword is mated, match the word group of the second number; If described the first number is greater than the second number, by in described phonetic keyword not by the word group matches of described the first number to remainder form the first remainder, in described individual character phonetic word bank, described the first remainder is mated, match the individual character group of three number corresponding with phonetic word in described the first remainder, the group that the individual character group of the word group of described the first number and described the 3rd number is matched as described phonetic keyword; If described the second number is greater than the first number, to in described phonetic keyword not by the word group matches of described the second number to remainder form the second remainder, in described individual character phonetic word bank, described the second remainder is mated, match the individual character group of four number corresponding with phonetic word in described the second remainder, the group that the individual character group of the word group of described the second number and described the 4th number is matched as described phonetic keyword.
9. server as claimed in claim 8, is characterized in that,
Described acquisition module, when group that described phonetic keyword matches comprises the word group of described the first number and the individual character group of described the 3rd number, specifically for obtain the word word of corresponding the first number according to the word group of described the first number, according to the individual character group of described the 3rd number, obtain the single word of the 3rd number; When group that described phonetic keyword matches comprises the word group of described the second number and the individual character group of described the 4th number, specifically for obtain the word word of corresponding the second number according to the word group of described the second number, according to the individual character group of described the 4th number, obtain the single word of the 4th number;
Described composite module, specifically for form the word keyword of described phonetic keyword according to the single word of the word word of described the first number and the 3rd number, or, according to the single word of the word word of described the second number and the 4th number, form the word keyword of described phonetic keyword.
10. server as claimed in claim 6, is characterized in that,
Described pinyin lexicon, specifically for configuring respectively different weights for each group wherein;
Described matching module, specifically for according to described phonetic keyword when described pinyin lexicon is mated, a phonetic word in described phonetic keyword matches a plurality of groups of corresponding different literals respectively simultaneously, according to weight separately of described a plurality of groups, from described a plurality of groups, select the group of a phonetic word match in a conduct and described phonetic keyword.
11. 1 kinds of systems of searching for by phonetic, is characterized in that, comprising: one or more client and servers, wherein,
Described server, comprises the server of searching for by phonetic as described in any one in claim 6-10;
Described one or more client, for identifying the phonetic keyword that comprises at least one phonetic word of user's input, and sends to described server.
12. systems as claimed in claim 11, is characterized in that,
Described one or more client, periphery specifically for the display position at described phonetic keyword arranges a viewing area or opens new page, according to described word keyword, forms information and described information is presented to described viewing area or new page; Receive the click triggering of user to described information, to described server, send trigger message so that described server is retrieved in search engine according to word keyword.
CN201310727117.5A 2013-12-25 2013-12-25 Method, server and system for achieving searching through Pinyin Pending CN103678675A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310727117.5A CN103678675A (en) 2013-12-25 2013-12-25 Method, server and system for achieving searching through Pinyin

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310727117.5A CN103678675A (en) 2013-12-25 2013-12-25 Method, server and system for achieving searching through Pinyin

Publications (1)

Publication Number Publication Date
CN103678675A true CN103678675A (en) 2014-03-26

Family

ID=50316219

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310727117.5A Pending CN103678675A (en) 2013-12-25 2013-12-25 Method, server and system for achieving searching through Pinyin

Country Status (1)

Country Link
CN (1) CN103678675A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106815197A (en) * 2015-11-27 2017-06-09 北京国双科技有限公司 The determination method and apparatus of text similarity
CN107967352A (en) * 2017-12-15 2018-04-27 四川汉科计算机信息技术有限公司 Information knowledge wisdom recommends method
CN108170293A (en) * 2017-12-29 2018-06-15 北京奇虎科技有限公司 Input the personalized recommendation method and device of association
CN109741755A (en) * 2018-12-25 2019-05-10 苏州思必驰信息科技有限公司 Voice wakes up word threshold management device and manages the method that voice wakes up word threshold value
CN113094470A (en) * 2021-04-08 2021-07-09 蔡堃 Text searching method and system
US11100921B2 (en) 2018-04-19 2021-08-24 Boe Technology Group Co., Ltd. Pinyin-based method and apparatus for semantic recognition, and system for human-machine dialog

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106815197A (en) * 2015-11-27 2017-06-09 北京国双科技有限公司 The determination method and apparatus of text similarity
CN107967352A (en) * 2017-12-15 2018-04-27 四川汉科计算机信息技术有限公司 Information knowledge wisdom recommends method
CN108170293A (en) * 2017-12-29 2018-06-15 北京奇虎科技有限公司 Input the personalized recommendation method and device of association
US11100921B2 (en) 2018-04-19 2021-08-24 Boe Technology Group Co., Ltd. Pinyin-based method and apparatus for semantic recognition, and system for human-machine dialog
CN109741755A (en) * 2018-12-25 2019-05-10 苏州思必驰信息科技有限公司 Voice wakes up word threshold management device and manages the method that voice wakes up word threshold value
CN109741755B (en) * 2018-12-25 2021-04-16 苏州思必驰信息科技有限公司 Voice awakening word threshold management device and method for managing voice awakening word threshold
CN113094470A (en) * 2021-04-08 2021-07-09 蔡堃 Text searching method and system
CN113094470B (en) * 2021-04-08 2022-05-24 蔡堃 Text searching method and system

Similar Documents

Publication Publication Date Title
US11636146B2 (en) Content analysis to enhance voice search
US11817013B2 (en) Display apparatus and method for question and answer
US8626486B2 (en) Automatic spelling correction for machine translation
CN103678675A (en) Method, server and system for achieving searching through Pinyin
CN103678674A (en) Method, device and system for achieving error correction searching through Pinyin
KR102047200B1 (en) Real-time natural language processing of datastreams
US8799307B2 (en) Cross-language information retrieval
US20140122407A1 (en) Chatbot system and method having auto-select input message with quality response
US9152717B2 (en) Search engine suggestion
US20160162555A1 (en) Accessing Messaging Applications in Search
US11487757B2 (en) Assistive browsing using context
US8515731B1 (en) Synonym verification
KR20100009520A (en) Transliteration for query expansion
WO2008151465A1 (en) Dictionary word and phrase determination
CN102770859A (en) Cross-language search options
CN103678560A (en) Multimedia resource error correction searching method and system and multimedia resource server
EP2846554A1 (en) A method, an electronic device, and a computer program
KR101446468B1 (en) System and method for prividing automatically completed query
US8868587B1 (en) Determining correction of queries with potentially inaccurate terms
WO2011106087A1 (en) Method for processing auxilary information for topic generation
RU2726009C1 (en) Method and system for correcting incorrect word set due to input error from keyboard and/or incorrect keyboard layout
RU2595531C2 (en) Method and system for generating definition of word based on multiple sources
CN105632500A (en) Voice recognition apparatus and method of controlling the same
US20170308519A1 (en) Learning semantic parsing
EP2016486A2 (en) Processing of query terms

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned
AD01 Patent right deemed abandoned

Effective date of abandoning: 20180126