WO2019109663A1 - 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置 - Google Patents

一种跨语言搜索方法和装置、一种用于跨语言搜索的装置 Download PDF

Info

Publication number
WO2019109663A1
WO2019109663A1 PCT/CN2018/100990 CN2018100990W WO2019109663A1 WO 2019109663 A1 WO2019109663 A1 WO 2019109663A1 CN 2018100990 W CN2018100990 W CN 2018100990W WO 2019109663 A1 WO2019109663 A1 WO 2019109663A1
Authority
WO
WIPO (PCT)
Prior art keywords
source language
search term
source
translation
translation result
Prior art date
Application number
PCT/CN2018/100990
Other languages
English (en)
French (fr)
Inventor
许静芳
张嘉成
翟飞飞
戴磊
刘祺
方菲
丁彦卓
Original Assignee
北京搜狗科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京搜狗科技发展有限公司 filed Critical 北京搜狗科技发展有限公司
Publication of WO2019109663A1 publication Critical patent/WO2019109663A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3337Translation of the query language, e.g. Chinese to English

Definitions

  • the present application relates to the field of information search technology, and in particular, to a cross-language search method and apparatus, and a device for cross-language search.
  • search term (query) input by the user is “March”, and the search in the Chinese database may not satisfy the user's needs to the greatest extent, and the English database originating from the European and American websites may have better and more search results.
  • Cross-language search technology combines information retrieval technology with machine translation technology.
  • the existing cross-language search process may specifically include: firstly, converting the source language search words into search words in the target language form through machine translation technology, and then respectively searching for the search words according to the source language search words and the target language forms, respectively.
  • the information retrieval in the language database is performed to obtain multi-language search results, wherein the multi-language search results may include: search results in the form of source language and search results of the target language.
  • the existing solution can translate the target language search result and provide the translated source language translation result to the user.
  • users often need to carefully interpret in order to obtain the key information needed from the source language translation results, which will affect the efficiency of the user to obtain information from the source language translation results.
  • embodiments of the present application have been proposed in order to provide a cross-language search method, a cross-language search device, and a device for cross-language search that overcome the above problems or at least partially solve the above problems, and the embodiments of the present application can improve the target.
  • the coverage rate of the red content in the source language translation result corresponding to the language search term can further improve the efficiency of obtaining information from the source language translation result corresponding to the target language search term.
  • a cross-language search method which includes:
  • a cross-language search device including:
  • a search term translation module configured to translate a source language search term into a target language search term
  • a result obtaining module configured to search for a target language search result for the target language, and obtain a corresponding target language search result and a source language translation result; wherein the source language translation result is obtained by translating the target language search result;
  • a keyword acquisition module configured to obtain, from the source language translation result, a keyword that matches the target language search term
  • a mark display module configured to mark and display the keyword in the source language translation result.
  • an embodiment of the present application discloses an apparatus for cross-language search, including a memory, and one or more programs, wherein one or more programs are stored in a memory and configured to be Execution of the one or more programs by the one or more processors includes instructions for performing the following operations:
  • embodiments of the present application disclose one or more machine-readable media having stored thereon instructions that, when executed by one or more processors, cause a device to perform one or more of the aforementioned cross-language searches method.
  • the embodiment of the present application obtains a keyword matching the target language search term from the source language translation result, and displays the keyword in the source language translation result, thereby improving the source language corresponding to the target language search term.
  • the coverage of the red content in the translation result can improve the efficiency of obtaining information from the source language translation results corresponding to the target language search term.
  • FIG. 1 is a schematic diagram of an application environment of a cross-language search method of the present application
  • FIG. 2 is a flow chart showing the steps of a first embodiment of a cross-language search method according to the present application
  • FIG. 3 is a flow chart of steps of a second embodiment of a cross-language search method according to the present application.
  • FIG. 4 is a flow chart showing the steps of a third embodiment of a cross-language search method according to the present application.
  • FIG. 5 is a flow chart of steps of a fourth embodiment of a cross-language search method according to the present application.
  • FIG. 6 is a structural block diagram of an embodiment of a cross-language search device of the present application.
  • FIG. 7 is a block diagram of a device 900 for cross-language search as a terminal of the present application.
  • FIG. 8 is a schematic structural diagram of a device for cross-language search as a server of the present application.
  • the inventor found in the process of implementing the embodiment of the present application that the existing solution can translate the target language search result into the source language, and mark the content of the translated source language translation result that is the same as the source language search term. For example, if the source language search term is "Trump" and the corresponding target language search term is "Trump”, the "Trump” appearing in the source language translation result corresponding to "Trump” can be marked red. This allows the user to quickly obtain key information included in a source language translation result, which in turn allows the user to quickly determine whether a source language translation result meets his or her information needs.
  • the present invention provides a cross-language search solution, which can translate a source language search term into a target language search term, in view of the technical problem of low coverage of the red content in the source language translation result in the existing solution.
  • a target language search term obtaining a corresponding target language search result and a source language translation result; wherein the source language translation result may be obtained by translating the target language search result; obtaining the source language translation result a keyword matching the target language search term; and displaying the keyword in the source language translation result.
  • the embodiment of the present application obtains a keyword matching the target language search term from the source language translation result, and displays the keyword in the source language translation result, thereby improving the source language corresponding to the target language search term.
  • the coverage of the red content in the translation result can improve the efficiency of obtaining information from the source language translation results corresponding to the target language search term.
  • the search result corresponding to the target language search term may include: a target language search result and a source language translation result, wherein the source language translation result may be obtained by translating the target language search result, and the target language search result is obtained.
  • the translation results with the source language can correspond to the same search results (such as the same web page, video, picture, music, etc.), one of the differences between the two is the difference in language form.
  • the source language search term is “March”
  • the target language search term corresponding to “March” is “March”
  • the target language search result corresponding to “March” is included in the target language search result.
  • March-Wikipedia the translation result of the source language for the “March-Wikipedia” translation is "March-Wikipedia”
  • the embodiment of the present application can translate the result from the source language of "March” "March-Wikipedia”
  • the keyword "March” matching "March” is determined, and "March” in the source language translation result "March” is displayed in "March-Wikipedia", thereby improving the source.
  • the embodiment of the present application can be applied to a platform environment with a cross-language search function, such as a search APP (application), a search website (such as a search engine), etc., to improve the efficiency of obtaining information from search results corresponding to target language search words.
  • a cross-language search function such as a search APP (application), a search website (such as a search engine), etc.
  • the cross-language search method of the embodiment of the present application is mainly described by using the search APP as an example, and the cross-language search method corresponding to other platform environments such as the search website may be cross-referenced.
  • the cross-language search method provided by the embodiment of the present application can be applied to the application environment shown in FIG. 1.
  • the client 100 and the server 200 are located in a wired or wireless network, and the client is connected through the wired or wireless network. 100 performs data interaction with the server 200.
  • the cross-language search process of the embodiment of the present application may be performed by any one or combination of the client 100 and the server 200:
  • the client 100 may receive the source language search term input by the user, and send the source language search term to the server 200; after receiving the source language search term, the server 200 may translate the source language search term into the target language search term.
  • Obtaining a target language search result corresponding to the target language search term performing a translation from the target language to the source language on the target language search result to obtain a source language translation result; and obtaining a context from the source language translation result
  • the target language search term matches the keyword
  • the source language translation result and the keyword are sent to the client 100, so that the client 100 displays the source language translation result, and in the source language translation result
  • the keywords are displayed in a mark.
  • the client 100 may receive the source language search term input by the user, and after receiving the source language search term, first translate the source language search term into the target language search term, and then obtain the target corresponding to the target language search term. a language search result, then performing a translation from the target language to the source language on the target language search result to obtain a source language translation result; and obtaining a key matching the target language search term from the source language translation result a word; and then displaying the source language translation result, and displaying the keyword in the source language translation result.
  • the advantage of the resource richness of the server 200 can be exerted, and the source language translation result and the source language translation result are improved.
  • Word acquisition efficiency and accuracy For example, a cloud server can be deployed with a large number of highly configured computing devices, so the use of these computing devices for source language translation results, as well as the acquisition of keywords in the source language translation results, to improve the source language translation results, as well as the key words in the source language translation results.
  • the word acquisition efficiency and the acquisition accuracy rate can be saved; at the same time, the computing resources on the client side 100 can be saved, and the performance of the terminal corresponding to the client 100 can be improved.
  • the process of obtaining the keywords in the source language translation result and the source language translation result may be performed by any one of the server 200 and the client 100, and the method for acquiring the keywords in the source language translation result and the source language translation result in the embodiment of the present application
  • the corresponding specific execution subject is not limited.
  • the client 100 can run on the terminal, where the terminal specifically includes but is not limited to: a smart phone, a tablet computer, an e-book reader, and an MP3 (motion image expert compresses the standard audio layer 3, and the Moving Picture Experts Group Audio Layer III Player, MP4 (Moving Picture Experts Group Audio Layer IV) player, laptop portable computer, car computer, desktop computer, set-top box, smart TV, wearable device, etc.
  • the terminal specifically includes but is not limited to: a smart phone, a tablet computer, an e-book reader, and an MP3 (motion image expert compresses the standard audio layer 3, and the Moving Picture Experts Group Audio Layer III Player, MP4 (Moving Picture Experts Group Audio Layer IV) player, laptop portable computer, car computer, desktop computer, set-top box, smart TV, wearable device, etc.
  • MP3 motion image expert compresses the standard audio layer 3
  • MP4 Motion Picture Experts Group Audio Layer IV player
  • FIG. 2 a flow chart of the steps of the first embodiment of the cross-language search method of the present application is shown, which may specifically include the following steps:
  • Step 201 Translate a source language search term into a target language search term
  • Step 202 Obtain a corresponding target language search result and a source language translation result for the target language search term; wherein the source language translation result may be obtained by translating the target language search result;
  • Step 203 Obtain a keyword that matches the target language search term from the source language translation result.
  • Step 204 Mark the keyword in the source language translation result.
  • the source language search term may be a search term input by the user using the source language.
  • a client searching for an APP or a search website can provide a UI (User Interface), and the user can submit a source language search term to the client through a search box, a voice interface, or the like on the UI.
  • the client may display the received source language search term in the search box. Therefore, in the embodiment of the present application, the source language search term input by the user may include : The source language search term submitted by the user to the client in any way. It can be understood that the specific manner of obtaining the source language search term is not limited in the embodiment of the present application.
  • the source language and the target language may be used to represent different two languages.
  • the source language and the target language may be preset by a user, or may be analyzed by a search APP or a search website by analyzing a user's search behavior and/or browsing behavior. get.
  • the search APP or the search website may use the language most commonly used by the user as the source language, and the language used in addition to the source language as the target language.
  • the user's search behavior indicates that the search term used by the user is a Chinese search term, and the source language can be determined to be Chinese; the user's browsing behavior also indicates that the user has visited the translated website and has conducted Chinese and through the translation website.
  • the mutual translation behavior between English, so it can be determined that the target language is English.
  • the number of target languages in the embodiment of the present application may be one or more.
  • the source language may be Chinese
  • the target language may be English, Japanese, Korean, German.
  • the cross-language search method of the embodiment of the present application is mainly described by using the source language as the Chinese language and the target language as the English as an example.
  • the cross-language search methods corresponding to the other source languages and the target language may be cross-referenced.
  • step 201 may translate the source language search term into a target language search term by the client or server.
  • a plurality of different translation results may be obtained.
  • the one with the highest degree of confidence may be selected from a plurality of different translation results.
  • a target language search term it can be understood that the specific process of translating the source language search term into the target language search term is not limited. In an application example of the present application, if the source language search term is “March”, the target language search term may be “March”. In another application example of the present application, if the source language search term is “Disney”, the target language search term may be “disney”.
  • step 202 may search for a target language search result according to the target language search word obtained in step 201 to obtain a target language search result. It can be understood that the specific manner of obtaining the target language search result is not limited in the embodiment of the present application.
  • step 202 may perform a translation from the target language to the source language on the target language search result to obtain the source language translation result.
  • the target language search result and the source language translation result may correspond to the same search result (such as the same webpage, video, picture, music, etc.), and one of the differences is that the language form is different.
  • the obtaining process of the source language translation result may include: translating the target language search result from a target language to a source language by using a translation model; wherein the translation model may be a translation model commonly used in the translation field.
  • the translation model specific to the search result may also be used.
  • the embodiment of the present application does not limit the specific translation model and the specific translation process from the target language to the source language for the target language search result.
  • the translation process of the target language in the source language translation result may be consistent with the source language search term by improving the translation process of the target language search result from the target language to the source language. Sex, in this way, can improve the coverage of the red content.
  • the obtaining process of the source language translation result may include: translating the target content in the target language search result into the source in the process of translating the target language search result from the target language to the source language The translation result of the language search term is consistent; wherein the target content is the same content in the target language search result as the target language search term.
  • the translation result of the target search word "Trump” may include: “Champ”, “Trump”, “Ace”, “Good Man”, “Honest Person”, etc.
  • the embodiment of the present application can improve the coverage of the target content by improving the consistency between the translation result of the target content in the source language translation result and the source language search term.
  • the target language search term “Trump” can be translated into “Trump”, and the target language search result is All "Trump” included are translated as “Trump”; or, if the source language search word corresponding to the target language search word "Trump” is "Champ”, the target language search word "Trump” can be translated into “Champ” and translates all "Trump” included in the target language search result into “Champ”; thereby improving the coverage of the red content.
  • Step 203 Obtain, from the source language translation result, a keyword that matches the target language search term.
  • the language form corresponding to the keyword may be the source language form, that is, the language form corresponding to the keyword is consistent with the language form corresponding to the translation result of the source language.
  • the keyword matches the target language search term, which may include: the semantics corresponding to the keyword match the semantics corresponding to the target language search term, or the effect and target language generated by the keyword to the source language reader.
  • the search term matches the effect produced by the target language reader.
  • the step 203 of acquiring, by the source language translation result, a keyword that matches the target language search term may include: acquiring a key that matches the target language search term according to the translation dictionary. a word and finding the same content as the keyword from the source language translation result.
  • the translation dictionary may provide a corresponding source language vocabulary for the target language vocabulary.
  • the source language vocabulary provided for the target language vocabulary “March” may include “March,” “March,” “Forward,” and the like.
  • a keyword matching the target language search term may be obtained from the source language translation result.
  • step 203 may obtain a keyword that matches the target language search term from the source language translation result by a word alignment technique.
  • Word alignment refers to the correspondence between the source language vocabulary and the target language vocabulary in the pair of statement pairs.
  • the two-sentence pair is "I play game in the classroom”
  • the word alignment technology can be "I”, “in”, “classroom”, “play”, “game” and " I", "in”, “classroom”, “play”, “game” correspond. Since the above word alignment technique considers the alignment relationship between the lexical pairs of the translations in the double sentences, the accuracy of the keywords can be improved.
  • the process of obtaining, by using the word alignment technology, a keyword that matches the target language search term from the source language translation result may include: establishing a target language vocabulary included in the target language search result and the source a mapping relationship between source language vocabularies included in the language translation result; searching for the search term according to the target language, and performing a search in the mapping relationship to obtain the search result included in the source language and the target language search term Matching keywords.
  • the mapping relationship between the target language vocabulary included in the target language search result and the source language vocabulary included in the source language translation result may be established by using a word alignment technique, and specifically, the target language search result and the source may be acquired.
  • a word alignment model may be used to establish a mapping relationship between a target language vocabulary included in the target language search result and a source language vocabulary included in the source language translation result.
  • the word alignment model can be obtained by using bilingual alignment corpus training, so that the word alignment model includes statistical information for determining a pair of words in the double sentence that are mutually translated, since the word alignment model is a related art in the field, Do not repeat them.
  • the source language search term is “March”
  • the target language search term corresponding to “March” is “March”
  • the target language search result corresponding to “March” is assumed.
  • the embodiment of the present application can adopt a word alignment model for the target language. The search results are aligned with the double sentences corresponding to the source language translation results.
  • the above process of obtaining a keyword matching the target language search term from the translation result of the source language by using a translation dictionary or by a word alignment technique is only an optional embodiment, and in fact, those skilled in the art
  • Other processes for obtaining keywords matching the target language search words from the source language translation results may be adopted according to actual application requirements, for example, the attention mechanism of the neural network machine translation model may be utilized to determine the target language search words. Which word is translated into the source language form, etc., wherein the attention mechanism is a selection mechanism for assigning limited information processing capabilities. Attention is focused on selectively focusing on certain perceived information. This information may be objective or subjective, while ignoring other information received at the same time, this cognitive process.
  • the specific embodiment of the present application does not limit the specific process of obtaining keywords that match the target language search term from the source language translation result.
  • the presentation content corresponding to the source language translation result may include: a title, a summary, a promotion content, and the like
  • the step 204 may display the keyword in the presentation content corresponding to the source language translation result.
  • the keyword included in the title or the abstract may be displayed in a mark
  • the keyword included in the promotional content may be displayed in a mark.
  • the keyword may be marked and displayed by marking a font or a color of the keyword; for example, the font of the keyword may be bolded, underlined, tilted, stereoscopic, set to red, etc. This can improve the recognition of the keyword with respect to the presentation content of the normal font or the normal color.
  • the identification of the keyword is displayed while the keyword is displayed. For example, a corresponding icon or the like may be displayed at a position such as a superscript or a subscript of the keyword. It can be understood that the specific marking display process is not limited in the embodiment of the present application.
  • the search result corresponding to the target language search term may be used as the first search result, and the second search result corresponding to the source language search term may be obtained, and the target language form and/or source may be obtained.
  • the first search result of the language form is mixed with the second search result; thus, the search result derived from the multi-language database can be provided to the user.
  • the searching may be performed in a database of the source language to obtain a second search result corresponding to the source language search term.
  • the cross-language search method of the embodiment of the present application obtains keywords matching the target language search words from the source language translation results, and displays the keywords in the source language translation results.
  • the coverage rate of the red content in the source language translation result corresponding to the target language search term is improved, thereby improving the efficiency of obtaining information from the source language translation result corresponding to the target language search term.
  • FIG. 3 a flow chart of the steps of the second embodiment of the cross-language search method of the present application is shown, which may specifically include the following steps:
  • Step 301 Translate a source language search term into a target language search term
  • Step 302 Search for a target language search result and a source language translation result for the target language search term; wherein the source language translation result may be obtained by translating the target language search result;
  • Step 303 Obtain a keyword that matches the target language search term from the source language translation result.
  • Step 304 If the matching degree between the keyword and the source language search term meets a preset matching degree condition, mark the keyword in the source language translation result.
  • the process of displaying the keyword in the source language translation result in the second embodiment of the method shown in FIG. 3 may include: if the keyword and the source If the matching degree between the language search words meets the preset matching degree condition, the keywords in the source language translation result are marked and displayed. Since the matching degree between the keyword displayed by the mark and the source language search word conforms to the preset matching degree condition, the association between the keyword displayed by the mark and the source language search word can be improved, thereby making the key of the mark display Words are more in line with the user's information needs.
  • the matching degree between the keyword and the source language search term may include: a similarity between the keyword and the source language search term, and the preset matching degree condition may include : The similarity exceeds the similarity threshold.
  • the first word vector corresponding to the keyword and the second word vector corresponding to the source language search term may be separately determined, and then the similarity measure is performed by an Euclidean distance, an included cosine, and the like. The similarity between a word vector and a second word vector.
  • the matching degree of the keyword “March” and the source language search term “March” may be determined, and if the matching degree meets the preset matching degree condition, the source language may be The keyword "March” in the translation result is marked.
  • the matching degree may be determined by the keyword “Disney” and the source language search word “Disney”. If the matching degree meets the preset matching degree condition, the source language may be used. The keyword “Disney” in the translation results is marked.
  • the method may not be used in the source language translation result.
  • the keywords are displayed in a mark to avoid that the keywords with lower correlations with the source language search words are displayed by the mark.
  • FIG. 4 a flow chart of the steps of the third embodiment of the cross-language search method of the present application is shown, which may specifically include the following steps:
  • Step 401 Translate a source language search term into a target language search term
  • Step 402 Search for a target language search result and a source language translation result for the target language search term; wherein the source language translation result may be obtained by translating the target language search result;
  • Step 403 Determine whether the source language translation result includes the source language search term.
  • Step 404 If the source language translation result does not include the source language search term, obtain a keyword that matches the target language search term from the source language translation result;
  • Step 405 Mark the keyword in the source language translation result.
  • the third embodiment of the method shown in FIG. 4 can determine the source language translation before acquiring the keyword matching the target language search term from the source language translation result. Whether the result includes the source language search term, and if not, obtaining a keyword matching the target language search term from the source language translation result, and the keyword in the source language translation result The mark display is performed; thereby, the markup redness of the source language translation result can be realized without the source language search word in the source language translation result.
  • the embodiment of the present application does not limit the execution condition corresponding to “obtaining a keyword matching the target language search term from the source language translation result”.
  • a keyword matching the target language search term may also be acquired from the source language translation result, thereby realizing a source language.
  • the source language search words and keywords are simultaneously marked red.
  • the method of the embodiment of the present application may further include: if the source language translation result includes the source language search term, performing markup display on the source language search term in the source language translation result.
  • FIG. 5 a flow chart of the steps of the fourth embodiment of the cross-language search method of the present application is shown, which may specifically include the following steps:
  • Step 501 Translate a source language search term into a target language search term
  • Step 502 Search for a target language search result and a source language translation result for the target language search term; wherein the source language translation result may be obtained by translating the target language search result;
  • Step 503 Obtain, from the source language translation result, a keyword that matches the target language search term.
  • Step 504 Mark the keyword in the source language translation result.
  • the fourth embodiment of the method shown in FIG. 5 may further include:
  • Step 505 Acquire at least two translation results corresponding to the target language search term.
  • Step 506 Mark content displayed in the source language translation result that matches at least two translation results corresponding to the target language search term.
  • the source language translation result is obtained by translating the target language search result, the source language translation result often has multiple possibilities, so that only the content of the source language translation result and the source language search term are marked. Red, resulting in lower coverage of the red content.
  • the source language translation results may include: “Trump”, “Champ” and other translation possibilities, if the source language search term is "Trump” "Only, it is possible to realize the possible redemption for the translation of "Trump”, and it is impossible to achieve the possible redemption for other translations such as "Champ”, so the coverage of the red content is low.
  • the embodiment of the present application can obtain at least two translation results corresponding to the target language search term, and corresponding to the target language search term in the source language translation result.
  • the content of at least two translations matching the content is marked for display. Since at least two translation results can reflect at least two translation possibilities of the target language search term to the source language, the embodiment of the present application can implement at least two translation possible redoxes through the source language translation result, thereby improving the source language translation result.
  • the coverage of the winning red content can further improve the efficiency of obtaining information from the search results corresponding to the target language search term.
  • the corresponding target language search term is “Trump”
  • the target language search result corresponding to the target language search term “Trump” is included.
  • the corresponding source language translation results may include: “Trump”, “Champ” and other translation possibilities, then the embodiment of the present application can obtain “Trump” and "Champ” At least two kinds of translations are possible, and at least two possible translations are realized by the source language translation result, so that the coverage of the red content can be improved.
  • the embodiment of the present application may provide the following acquisition scheme:
  • the acquisition scheme 1 may perform translation from the target language to the source language on the target language search term, and filter the translation results of at least two source languages from the translated results.
  • the translation model may be used to translate the target language search term into multiple translation results, and at least two of the results obtained from the translation are selected.
  • the screening result of the at least two source languages is obtained by the above-mentioned translation result, and the method may include: selecting a confidence level from the translation result to meet a preset confidence level.
  • the translation results of at least two source languages of the condition For example, you can get multiple translation results based on “Trump” translation: “Champ”, “Trump”, “Ace”, “Good Man”, “Honest Man”, “Means”, “Beyond”, etc., and For the multiple translation results corresponding to Trump, select at least two translation results with higher confidence, such as "Champ” and "Trump".
  • the confidence of the source language translation result may be a translation probability of the target language search term to the source language translation result, and the translation probability may be calculated by the translation model. It can be understood that the confidence of the embodiment language in the source language translation result is There are no restrictions on how to get it.
  • the screening result of the at least two source languages is obtained by the above-mentioned translation result
  • the method may include: searching for the part of the word according to the source language, and obtaining the translation from the translation
  • At least two translation results were obtained by screening the results.
  • the optional embodiment can search for the part of speech according to the source language, and filter at least two translation results to obtain a translation result consistent with the part-of-speech of the source language search word, thereby improving the accuracy of the translation result.
  • the verb "beyond” can be filtered according to "noun”; or, There are nouns, filtering out the verb "Beyond” and non-professional nouns "Ace”, “Good Man”, “Honest Man”, “Means”.
  • the obtaining scheme 2 may record, in the process of translating the target language search result from the target language to the source language, the translation result corresponding to the target content in the target language search result, to obtain the target language search term. Corresponding at least two translation results; wherein the target content is the same content in the target language search result as the target language search term.
  • the background and context of the target language search results cause the same content in the target language search results to correspond to different translation results in different contexts and contexts.
  • “Apple” can be translated into “fruit”. It can also be translated as “Apple Phone” and can also be translated as “Bomb”.
  • the acquisition scheme 2 may correspond to the target content in the search result in the process of translating the target language search result from the target language to the source language.
  • the translation result is recorded; since the target content is the same content as the target language search term in the target language search result, after the translation is completed, at least the target language search term corresponding to the target content may be obtained from the recorded content. Two translation results.
  • the corresponding target content is also "Trump”
  • the translation model can be used to record why "Trump" included in the target language search result is translated, for example, corresponding to different contexts or contexts, Translated into “Trump”, “Champ”, or “Ace”, etc., all occurrences of translation results are recorded to obtain at least two translation results corresponding to the target language search term.
  • the recording the translation result corresponding to the target content in the search result may include: recording each translation result corresponding to the target content, or converting the current translation result corresponding to the target content with the recorded translation result. For comparison, if they are different, record them. If they are the same, they are not recorded.
  • the process of obtaining the at least two translation results corresponding to the target language search term is described in detail by the acquisition scheme 1 and the acquisition scheme 2, and it can be understood that those skilled in the art can adopt the acquisition scheme 1 and acquire according to actual application requirements. Any one or combination of the schemes 2, or other acquisition schemes, may be used. It can be understood that the specific process of obtaining the at least two translation results corresponding to the target language search term is not limited in the embodiment of the present application.
  • the cross-language search method in the embodiment of the present application can perform translation from the target language to the source language on the target language search result to obtain the source language translation result, and search for the source language translation result and the target language.
  • the content corresponding to the at least two translation results of the word is marked and displayed; since the at least two translation results can reflect at least two translation possibilities of the target language search word to the source language, the embodiment of the present application can translate the result through the source language.
  • the corresponding target language translation result is the same as the red, which can improve the coverage of the red content in the source language translation result of the source language, thereby improving the efficiency of the user to obtain information from the source language translation result.
  • FIG. 6 a structural block diagram of an embodiment of a cross-language search device of the present application is shown, which may specifically include: a search term translation module 601 , a result acquisition module 602 , a keyword acquisition module 603 , and a mark display module 603 .
  • the search term translation module 601 is configured to translate the source language search term into a target language search term
  • a result obtaining module 602 configured to search for a target language search result and a source language translation result for the target language search term; wherein the source language translation result is obtained by translating the target language search result;
  • the keyword obtaining module 603 is configured to acquire, from the source language translation result, a keyword that matches the target language search term;
  • the tag display module 604 is configured to perform tag display on the keywords in the source language translation result.
  • the keyword obtaining module 603 may include:
  • mapping establishing submodule configured to establish a mapping relationship between a target language vocabulary that can be included in the target language search result and a source language vocabulary that the source language translation result can include
  • mapping search submodule configured to search for a word according to the target language, and perform a search in the mapping relationship to obtain a keyword that may be included in the source language translation result and that matches the target language search term.
  • the mark display module 604 is configured to: if the matching degree between the keyword and the source language search term meets a preset matching degree condition, the method in the source language translation result Keywords are marked for display.
  • the device may further include:
  • a determining module configured to determine whether the source language translation result may include the source language search before the keyword obtaining module acquires a keyword matching the target language search term from the source language translation result a word, if the source language translation result does not include the source language search term, triggering the keyword acquisition module to acquire a keyword that matches the target language search term from the source language translation result.
  • the device may further include:
  • a second tag display module configured to mark the source language search term in the source language translation result if the source language translation result may include the source language search term.
  • the device may further include:
  • a translation result obtaining module configured to acquire at least two translation results corresponding to the target language search term
  • the third mark display module is configured to perform mark display on the content of the source language translation result that matches at least two translation results corresponding to the target language search term.
  • the source language translation result obtaining submodule may include:
  • a screening unit configured to perform translation from the target language to the source language on the target language search term, and filter the translation results of the at least two source languages from the translated results
  • the screening unit may include:
  • a first screening subunit configured to select, from the results obtained by the translation, a translation result of at least two source languages whose confidence meets a preset confidence condition;
  • the second screening subunit is configured to search for the part of speech of the word according to the source language, and screen at least two translation results from the result obtained by the translation.
  • the description is relatively simple, and the relevant parts can be referred to the description of the method embodiment.
  • Embodiments of the present application disclose an apparatus for cross-language search, including a memory, and one or more programs, wherein one or more programs are stored in a memory and configured to be comprised by one or more processors
  • Executing the one or more programs includes instructions for: translating a source language search term into a target language search term; obtaining, for the target language search term, a corresponding target language search result and a source language translation result Wherein the source language translation result is obtained by translating the target language search result; obtaining a keyword matching the target language search term from the source language translation result; and translating the source language translation result
  • the keywords in the tag are displayed.
  • the obtaining, by the source language translation result, a keyword that matches the target language search term includes:
  • Searching according to the target language performing a search in the mapping relationship to obtain a keyword that is included in the source language translation result and matches the target language search term.
  • the flag display comprises:
  • the keyword in the source language translation result is marked and displayed.
  • the apparatus is further configured to execute, by the one or more processors, the one or more programs to include instructions for:
  • the source language translation result does not include the source language search term, performing the step of acquiring a keyword that matches the target language search term from the source language translation result.
  • the apparatus is further configured to execute, by the one or more processors, the one or more programs to include instructions for:
  • the source language search term in the source language translation result is marked and displayed.
  • the apparatus is further configured to execute, by the one or more processors, the one or more programs to include instructions for:
  • the obtaining the at least two translation results corresponding to the target language search term includes:
  • the translation results obtained by translating the results obtained from the translation into at least two source languages include:
  • At least two translation results are screened from the results obtained by the translation.
  • FIG. 7 is a block diagram of a device 900 for cross-language search as a terminal, according to an exemplary embodiment.
  • device 900 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
  • apparatus 900 can include one or more of the following components: processing component 902, memory 904, power component 906, multimedia component 908, audio component 910, input/output (I/O) interface 912, sensor component 914, And a communication component 916.
  • Processing component 902 typically controls the overall operation of device 900, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • Processing component 902 can include one or more processors 920 to execute instructions to perform all or part of the steps described above.
  • processing component 902 can include one or more modules to facilitate interaction between component 902 and other components.
  • processing component 902 can include a multimedia module to facilitate interaction between multimedia component 908 and processing component 902.
  • Memory 904 is configured to store various types of data to support operation at device 900. Examples of such data include instructions for any application or method operating on device 900, contact data, phone book data, messages, pictures, videos, and the like.
  • the memory 904 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read only memory
  • EPROM Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Disk Disk or Optical Disk.
  • Power component 906 provides power to various components of device 900.
  • Power component 906 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for device 900.
  • the multimedia component 908 includes a screen between the device 900 and the user that provides an output interface.
  • the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
  • the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor may sense not only the boundary of the touch or sliding motion action, but also the duration and pressure associated with the touch or slide operation.
  • the multimedia component 908 includes a front camera and/or a rear camera. When the device 900 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
  • the audio component 910 is configured to output and/or input an audio signal.
  • audio component 910 includes a microphone (MIC) that is configured to receive an external audio signal when device 900 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
  • the received audio signal may be further stored in memory 904 or transmitted via communication component 916.
  • the audio component 910 also includes a speaker for outputting an audio signal.
  • the I/O interface 912 provides an interface between the processing component 902 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
  • Sensor assembly 914 includes one or more sensors for providing device 900 with various aspects of status assessment.
  • sensor component 914 can detect an open/closed state of device 900, relative positioning of components, such as the display and keypad of device 900, and sensor component 914 can also detect a change in position of one component of device 900 or device 900. The presence or absence of user contact with device 900, device 900 orientation or acceleration/deceleration, and temperature variation of device 900.
  • Sensor assembly 914 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
  • Sensor assembly 914 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 914 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • Communication component 916 is configured to facilitate wired or wireless communication between device 900 and other devices.
  • the device 900 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof.
  • the communication component 916 receives a broadcast signal or broadcast associated information from an external broadcast management system via a broadcast channel.
  • the communication component 916 also includes a near field communication (NFC) module to facilitate short range communication.
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • device 900 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor, or other electronic component implementation for performing the above methods.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable A gate array
  • controller microcontroller, microprocessor, or other electronic component implementation for performing the above methods.
  • non-transitory computer readable storage medium comprising instructions, such as a memory 904 comprising instructions executable by processor 920 of apparatus 900 to perform the above method.
  • the non-transitory computer readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.
  • FIG. 8 is a block diagram of a device for cross-language search as a server, according to an exemplary embodiment.
  • the server 1900 can vary considerably depending on configuration or performance, and can include one or more central processing units (CPUs) 1922 (eg, one or more processors) and memory 1932, one or one The above storage medium 1942 or storage medium 1930 of data 1944 (eg, one or one storage device in Shanghai).
  • the memory 1932 and the storage medium 1930 may be short-term storage or persistent storage.
  • the program stored on storage medium 1930 may include one or more modules (not shown), each of which may include a series of instruction operations in the server.
  • central processor 1922 can be configured to communicate with storage medium 1930, which performs a series of instruction operations in storage medium 1930.
  • Server 1900 may also include one or more power sources 1926, one or more wired or wireless network interfaces 1950, one or more input and output interfaces 1958, one or more keyboards 1956, and/or one or more operating systems 1941.
  • power sources 1926 For example, Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
  • a non-transitory computer readable storage medium when instructions in the storage medium are executed by a processor of a device (terminal or server), enabling a terminal to perform a cross-language search method, the method comprising: The language search term is translated into a target language search term; for the target language search term, the corresponding target language search result and the source language translation result are obtained; wherein the source language translation result is a translation of the target language search result Obtaining: obtaining, from the source language translation result, a keyword that matches the target language search term; and displaying the keyword in the source language translation result.
  • a cross-language search method, a cross-language search device, and a device for cross-language search are provided in detail.
  • the specific examples are used herein to apply the principles and implementation manners of the present application. It is to be noted that the description of the above embodiments is only for helping to understand the method of the present application and its core ideas; at the same time, for those of ordinary skill in the art, according to the idea of the present application, there will be changes in specific embodiments and applications. In the above, the contents of the present specification should not be construed as limiting the present application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

本申请实施例提供了一种跨语言搜索方法和装置、一种用于跨语言搜索的装置,其中的方法具体包括:将源语言搜索词翻译为目标语言搜索词;针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;对所述源语言翻译结果中的所述关键词进行标记显示。本申请实施例可以提高目标语言搜索词对应的源语言翻译结果中标红内容的覆盖率,进而可以提高从目标语言搜索词对应的源语言翻译结果中获取信息的效率。

Description

一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
本申请要求在2017年12月8日提交中国专利局、申请号为201711298752.0、发明名称为“一种跨语言搜索方法和装置、一种用于跨语言搜索的装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及信息搜索技术领域,特别是涉及一种跨语言搜索方法和装置、一种用于跨语言搜索的装置。
背景技术
随着互联网信息的不断增长,用户对于信息搜索提出了更高的要求,不再满足于在一种语种数据库中搜索,而要求获取多种语种数据。例如,用户输入的搜索词(query)为“3月”,中文数据库中的搜索可能并不能最大程度地满足用户需求,源自欧美网站的英文数据库中可能具有更优、更多的搜索结果。
跨语言搜索技术结合了信息检索技术和机器翻译技术。现有的跨语言搜索过程具体可以包括:首先,通过机器翻译技术将源语言搜索词转换为目标语言形式的搜索词,然后,分别依据源语言搜索词和目标语言形式的搜索词,分别在对应语言的数据库中进行信息检索,以得到多语言的搜索结果,其中,多语言的搜索结果可以包括:源语言形式的搜索结果和目标语言搜索结果。
为了满足不具备目标语言阅读能力、或者目标语言的阅读能力有限的用户的需求,现有方案可以对目标语言搜索结果进行翻译,并将翻译得到的源语言翻译结果提供给用户。然而,在实际应用中,用户通常需要认真解读才能从源语言翻译结果中获取所需的关键信息,这将影响用户从源语言翻译结果中获取信息的效率。
发明内容
鉴于上述问题,提出了本申请实施例以便提供一种克服上述问题或者至少部分地解决上述问题的跨语言搜索方法、跨语言搜索装置及用于跨语言搜索的装置,本申请实施例可以提高目标语言搜索词对应的源语言翻译结果中标红内容的覆盖率,进而可以提高从目标语言搜索词对应的源语言翻译结果中获取信息的效率。
为了解决上述问题,本申请实施例公开了一种跨语言搜索方法,包 括:
将源语言搜索词翻译为目标语言搜索词;
针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
对所述源语言翻译结果中的所述关键词进行标记显示。
另一方面,本申请实施例公开了一种跨语言搜索装置,包括:
搜索词翻译模块,配置为将源语言搜索词翻译为目标语言搜索词;
结果获取模块,配置为针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
关键词获取模块,配置为从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;以及
标记显示模块,配置为对所述源语言翻译结果中的所述关键词进行标记显示。
再一方面,本申请实施例公开了一种用于跨语言搜索的装置,包括有存储器,以及一个或者一个以上的程序,其中一个或者一个以上程序存储于存储器中,且经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
将源语言搜索词翻译为目标语言搜索词;
针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
对所述源语言翻译结果中的所述关键词进行标记显示。
又一方面,本申请实施例公开了一个或多个机器可读介质,其上存储有指令,当由一个或多个处理器执行时,使得装置执行前述一个或多个所述的跨语言搜索方法。
本申请实施例包括以下优点:
本申请实施例从源语言翻译结果中获取与目标语言搜索词相匹配的关键词,并对所述源语言翻译结果中的所述关键词进行标记显示,可以提高目标语言搜索词对应的源语言翻译结果中标红内容的覆盖率,进而可以提高从目标语言搜索词对应的源语言翻译结果中获取信息的效率。
附图说明
图1是本申请的一种跨语言搜索方法的应用环境的示意图;
图2是本申请的一种跨语言搜索方法实施例一的步骤流程图;
图3是本申请的一种跨语言搜索方法实施例二的步骤流程图;
图4是本申请的一种跨语言搜索方法实施例三的步骤流程图;
图5是本申请的一种跨语言搜索方法实施例四的步骤流程图;
图6是本申请的一种跨语言搜索装置实施例的结构框图;
图7是本申请的一种用于跨语言搜索的装置900作为终端时的框图;及
图8是本申请的一种用于跨语言搜索的装置作为服务器时的结构示意图。
具体实施方式
为使本申请的上述目的、特征和优点能够更加明显易懂,下面结合附图和具体实施方式对本申请作进一步详细的说明。
发明人在实施本申请实施例的过程中发现,现有方案可以将目标语言搜索结果翻译到源语言,并对翻译得到的源语言翻译结果中、与源语言搜索词相同的内容进行标红。例如,若源语言搜索词为“特朗普”,对应的目标语言搜索词为“Trump”,则可以将“Trump”对应的源语言翻译结果中出现的“特朗普”进行标红,由此可以使用户快速获取某条源语言翻译结果中包括的关键信息,进而可以使用户快速判断某条源语言翻译结果是否符合自身的信息需求。
然而,在实际应用中,翻译质量等因素导致源语言翻译结果中可能不存在源语言搜索词;此种情况下,将无法实现对于源语言翻译结果的标红,使得标红内容的覆盖率较低,进而影响用户对于某条源语言翻译结果包括的关键信息的获取效率。例如,假设源语言搜索词为“3月”,假设“3月”对应的目标语言搜索词为“March”,假设“March”对应的目标语言搜索结果中包括“March-Wikipedia”,假设针对“March-Wikipedia”翻译得到的源语言翻译结果为“三月-***”,可见源语言翻译结果中不存在源语言搜索词“3月”,因此无法实现对于源语言翻译结果中关键信息的标红。
针对现有方案中源语言翻译结果中标红内容的覆盖率低的技术问题,本申请实施例提供了一种跨语言搜索方案,该方案可以将源语言搜索词翻译为目标语言搜索词;针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到;从所述源语言翻译结果中获取与所述目标 语言搜索词相匹配的关键词;并对所述源语言翻译结果中的所述关键词进行标记显示。
本申请实施例从源语言翻译结果中获取与目标语言搜索词相匹配的关键词,并对所述源语言翻译结果中的所述关键词进行标记显示,可以提高目标语言搜索词对应的源语言翻译结果中标红内容的覆盖率,进而可以提高从目标语言搜索词对应的源语言翻译结果中获取信息的效率。
本申请实施例中,目标语言搜索词对应的搜索结果可以包括:目标语言搜索结果和源语言翻译结果,其中,源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到,目标语言搜索结果和源语言翻译结果可以对应相同的搜索结果(如相同的网页、视频、图片、音乐等),两者的区别之一在于语言形式的不同。
在本申请的一种应用示例中,假设源语言搜索词为“3月”,假设“3月”对应的目标语言搜索词为“March”,假设“March”对应的目标语言搜索结果中包括“March-Wikipedia”,针对“March-Wikipedia”翻译得到的源语言翻译结果为“三月-***”,则本申请实施例可以从“March”对应的源语言翻译结果“三月-***”中确定出与“March”相匹配的关键词“三月”,并对“March”对应的源语言翻译结果“三月-***”中的“三月”进行标记显示,由此可以提高源语言翻译结果中标红内容的覆盖率。
本申请实施例可以应用于搜索APP(应用程序,Application)、搜索网站(如搜索引擎)等具有跨语言搜索功能的平台环境中,以提高从目标语言搜索词对应的搜索结果中获取信息的效率。本申请实施例主要以搜索APP为例对本申请实施例的跨语言搜索方法进行说明,搜索网站等其他平台环境对应的跨语言搜索方法相互参照即可。
本申请实施例提供的跨语言搜索方法可应用于图1所示的应用环境中,如图1所示,客户端100与服务器200位于有线或无线网络中,通过该有线或无线网络,客户端100与服务器200进行数据交互。
本申请实施例的跨语言搜索流程可由客户端100和服务器200中的任一或者组合执行:
例如,客户端100可以接收用户输入的源语言搜索词,并向服务器200发送该源语言搜索词;服务器200在接收该源语言搜索词后,可以将源语言搜索词翻译为目标语言搜索词,获取所述目标语言搜索词对应的目标语言搜索结果,对所述目标语言搜索结果进行从目标语言到源语言的翻译,以得到源语言翻译结果;以及从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;并向客户端100发送上述源语言翻译结果和上述关键词,以使客户端100展现所述源语言翻译结果,并对所述源语言翻译结果 中的所述关键词进行标记显示。
又如,客户端100可以接收用户输入的源语言搜索词,并在接收该源语言搜索词后,首先将源语言搜索词翻译为目标语言搜索词,接着获取所述目标语言搜索词对应的目标语言搜索结果,然后对所述目标语言搜索结果进行从目标语言到源语言的翻译,以得到源语言翻译结果;以及从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;进而展现所述源语言翻译结果,并对所述源语言翻译结果中的所述关键词进行标记显示。
其中,在源语言翻译结果、以及源语言翻译结果中关键词的获取过程由服务器200执行的情况下,能够发挥服务器200计算资源丰富的优势,提高源语言翻译结果、以及源语言翻译结果中关键词的获取效率和获取准确率。例如,云服务器可以部署有众多高配置的计算设备,故利用这些计算设备进行源语言翻译结果、以及源语言翻译结果中关键词的获取,以提高源语言翻译结果、以及源语言翻译结果中关键词的获取效率和获取准确率;同时可以节省客户端100侧的运算资源,提高客户端100所对应终端的性能。当然,源语言翻译结果和源语言翻译结果中关键词的获取过程可由服务器200和客户端100中的任一执行,本申请实施例对于源语言翻译结果和源语言翻译结果中关键词的获取过程对应的具体执行主体不加以限制。
可选地,客户端100可以运行在终端上,上述终端具体包括但不限:智能手机、平板电脑、电子书阅读器、MP3(动态影像专家压缩标准音频层面3,Moving Picture Experts Group Audio Layer III)播放器、MP4(动态影像专家压缩标准音频层面4,Moving Picture Experts Group Audio Layer IV)播放器、膝上型便携计算机、车载电脑、台式计算机、机顶盒、智能电视机、可穿戴设备等等。
方法实施例一
参照图2,示出了本申请的一种跨语言搜索方法实施例一的步骤流程图,具体可以包括如下步骤:
步骤201、将源语言搜索词翻译为目标语言搜索词;
步骤202、针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到;
步骤203、从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
步骤204、对所述源语言翻译结果中的所述关键词进行标记显示。
本申请实施例中,源语言搜索词可以为用户使用源语言输入的搜索词。在实际应用中,搜索APP或者搜索网站的客户端可以提供UI(用户界 面,User Interface),则用户可以通过该UI上的搜索框、语音接口等方式向客户端提交源语言搜索词。无论用户通过何种方式向客户端提交源语言搜索词,客户端均可以将接收到的源语言搜索词显示在搜索框中,因此,本申请实施例中,用户输入的源语言搜索词可以包括:用户通过任意方式向客户端提交的源语言搜索词。可以理解,本申请实施例对于源语言搜索词的具体获取方式不加以限制。
本申请实施例中,源语言和目标语言可用于表示不同的两种语言,上述源语言和目标语言可由用户预置得到,也可由搜索APP或者搜索网站通过分析用户的搜索行为和/或浏览行为得到。可选地,搜索APP或者搜索网站可以将用户最常用的语言作为源语言,将除了源语言之外使用过的语言作为目标语言。例如,用户的搜索行为表明,用户之前使用的搜索词均为中文搜索词,则可以确定源语言为中文;用户的浏览行为还表明,用户访问过翻译网站,并通过该翻译网站进行过中文和英文之间的相互翻译行为,故可以确定目标语言为英文。可以理解,本申请实施例的目标语言的数量可以为一种或者多种,例如,对于以中文为母语的用户而言,源语言可以为中文,目标语言可以为英文、日文、韩文、德文、法文中的一种或者组合。本申请实施例主要以源语言为中文、目标语言为英文为例对本申请实施例的跨语言搜索方法进行说明,其他源语言和目标语言对应的跨语言搜索方法相互参照即可。
在实际应用中,步骤201可由客户端或者服务器将源语言搜索词翻译为目标语言搜索词。可选地,在将源语言搜索词翻译为目标语言搜索词的过程中,可能会得到多种不同的翻译结果,此种情况下,可以从多种不同的翻译结果中选择置信度最高的一种翻译结果作为目标语言搜索词,可以理解,本申请实施例对于将源语言搜索词翻译为目标语言搜索词的具体过程不加以限制。在本申请的一种应用示例中,若源语言搜索词为“3月”,则目标语言搜索词可以为“March”。在本申请的另一种应用示例中,若源语言搜索词为“迪士尼”,则目标语言搜索词可以为“disney”。
在实际应用中,步骤202可以依据步骤201得到的目标语言搜索词,在目标语言的数据库中进行检索,以得到目标语言搜索结果。可以理解,本申请实施例对于目标语言搜索结果的具体获取方式不加以限制。
由于步骤202中目标语言搜索结果表现为目标语言形式,该目标语言一般为该搜索用户的非母语语言或者为该搜索用户不熟悉的语言,这将影响用户从搜索结果中获取信息的效率。为了提高用户从搜索结果中获取信息的效率,步骤202可以对所述目标语言搜索结果进行从目标语言到源语言的翻译,以得到源语言翻译结果。其中,目标语言搜索结果和源语言翻译结果可以对应相同的搜索结果(如相同的网页、视频、图片、音乐等),两者 的区别之一在于语言形式的不同。
在实际应用中,源语言翻译结果的获取过程可以包括:利用翻译模型,对所述目标语言搜索结果进行从目标语言到源语言的翻译;其中,该翻译模型可以为翻译领域内通用的翻译模型,也可以为搜索结果专用的翻译模型,本申请实施例对于具体的翻译模型、以及对所述目标语言搜索结果进行从目标语言到源语言的具体翻译过程不加以限制。
在本申请的一种可选实施例中,可以通过改进对所述目标语言搜索结果从目标语言到源语言的翻译过程,实现源语言翻译结果中目标内容的翻译结果与源语言搜索词的一致性,这样,可以提高标红内容的覆盖率。相应地,源语言翻译结果的获取过程可以包括:在对所述目标语言搜索结果进行从目标语言到源语言的翻译的过程中,将所述目标语言搜索结果中目标内容翻译为与所述源语言搜索词一致的翻译结果;其中,所述目标内容为所述目标语言搜索结果中与所述目标语言搜索词相同的内容。例如,虽然目标搜索词“Trump”的翻译结果可以包括:“川普”、“特朗普”、“王牌”、“好人”、“老实人”等,然而,由于标红的对象为所述关键词,故本申请实施例可以通过提高源语言翻译结果中目标内容的翻译结果与源语言搜索词的一致性,提高标红内容的覆盖率。这样,若目标语言搜索词“Trump”对应的源语言搜索词为“特朗普”,则可以将目标语言搜索词“Trump”翻译为“特朗普”,并将所述目标语言搜索结果中包括的所有的“Trump”均翻译为“特朗普”;或者,若目标语言搜索词“Trump”对应的源语言搜索词为“川普”,则可以将目标语言搜索词“Trump”翻译为“川普”,并将所述目标语言搜索结果中包括的所有“Trump”均翻译为“川普”;由此可以提高标红内容的覆盖率。
步骤203可以从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词。其中,关键词对应的语言形式可以为源语言形式,也即关键词对应的语言形式与源语言翻译结果对应的语言形式一致。
本申请实施例中,关键词与目标语言搜索词相匹配,可以包括:关键词对应的语义与目标语言搜索词对应的语义相匹配,或者,关键词对源语言读者所产生的效果与目标语言搜索词对目标语言读者所产生的效果相匹配。
根据一种实施例,步骤203从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的过程,可以包括:根据翻译词典获取与所述目标语言搜索词相匹配的关键词,并从所述源语言翻译结果中查找与所述关键词相同的内容。其中,上述翻译词典可以针对目标语言词汇提供对应的源语言词汇,如针对目标语言词汇“March”提供的源语言词汇可以包括:“三月”、“3月”、“前进”等,由此可以从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词。
根据另一种实施例,步骤203可以通过词对齐技术,从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词。词对齐是指将双语句对中的源语言词汇和目标语言词汇对应。例如,双语句对为“我在教室玩游戏”、“I play game in the classroom”,词对齐技术能够将“我”、“在”、“教室”、“玩”、“游戏”分别与“I”、“in”、“classroom”、“play”、“game”对应起来。由于上述词对齐技术考虑了双语句中互为译文的词汇对之间的对齐关系,故可以提高关键词的准确率。
上述通过词对齐技术,从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的过程,可以包括:建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系;依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果包括的、与所述目标语言搜索词相匹配的关键词。
其中,可以通过词对齐技术,建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系,具体地,可以获取目标语言搜索结果和源语言翻译结果中包括的双语句,并在双语句中互为译文的目标语言词汇与源语言词汇之间建立映射关系。
在实际应用中,可以利用词对齐模型,建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系。其中,可以采用双语对齐语料训练得到上述词对齐模型,以使该词对齐模型包含用于确定双语句中互为译文的词汇对的统计信息,由于词对齐模型为本领域的相关技术,在此不作赘述。
在本申请实施例的一种应用示例中,假设源语言搜索词为“3月”,假设“3月”对应的目标语言搜索词为“March”,假设“March”对应的目标语言搜索结果中包括“March-Wikipedia”,假设针对“March-Wikipedia”翻译得到的、“March”对应的源语言翻译结果包括“三月-***”,则本申请实施例可以采用词对齐模型,对目标语言搜索结果和源语言翻译结果对应的双语句进行对齐,例如,对于双语句“March-Wikipedia”和“三月-***”,可以将“March”、“Wikipedia”分别与“三月”、“***”对应起来,由此,可以得到源语言翻译结果“三月-***”中包括的与“March”相匹配的关键词“三月”。
可以理解,上述通过翻译词典、或者通过词对齐技术、从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的过程只是作为可选实施例,实际上本领域技术人员可以根据实际应用需求,采用从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的其他过程,例如可以利用神经网络机器翻译模型的注意力机制,判断目标语言搜索词被翻译成了源语言形式的哪个词,等等,其中,注意力机制是一个用来分配有限的 信息处理能力的选择机制,注意力是指,选择性地专注在某些感受到的信息上,这些信息可能是客观或主观的,同时忽视同一时刻收到的其他信息,这一个认知过程。本申请实施例对于从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的具体过程不加以限制。
在实际应用中,源语言翻译结果对应的展现内容可以包括:标题、摘要、推广内容等内容,则步骤204可以对所述源语言翻译结果对应的展现内容中的所述关键词进行标记显示。例如,可以对标题或者摘要中包括的所述关键词进行标记展示,或者,可以对推广内容中包括的所述关键词进行标记展示。
可选地,可以通过标记所述关键词的字体或者颜色,对所述关键词进行标记显示;例如,可以将所述关键词的字体加粗、加下划线、倾斜、立体化、设为红色等,这相对于正常字体或者正常颜色的展现内容,能够提高所述关键词的辨识度。或者,在展示所述关键词的同时,展示所述所述关键词的标识。例如,可以在所述关键词的上标、下标等位置,展现对应的图标等,可以理解,本申请实施例对于具体的标记显示过程不加以限制。
在本申请的一些实施例中,可以将所述目标语言搜索词对应的搜索结果作为第一搜索结果,还可以获取源语言搜索词对应的第二搜索结果,并将目标语言形式和/或源语言形式的第一搜索结果与第二搜索结果进行混合展现;由此可以向用户提供源自多种语言数据库的搜索结果。可选地,可以在源语言的数据库中进行检索,以得到源语言搜索词对应的第二搜索结果。
综上,本申请实施例的跨语言搜索方法,从源语言翻译结果中获取与目标语言搜索词相匹配的关键词,并对所述源语言翻译结果中的所述关键词进行标记显示,可以提高目标语言搜索词对应的源语言翻译结果中标红内容的覆盖率,进而可以提高从目标语言搜索词对应的源语言翻译结果中获取信息的效率。
方法实施例二
参照图3,示出了本申请的一种跨语言搜索方法实施例二的步骤流程图,具体可以包括如下步骤:
步骤301、将源语言搜索词翻译为目标语言搜索词;
步骤302、针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到;
步骤303、从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
步骤304、若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
相对于图2所示方法实施例一,图3所示方法实施例二中对所述源语言翻译结果中的所述关键词进行标记显示的过程可以包括:若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。由于标记显示的关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,故可以提高标记显示的关键词与源语言搜索词之间的关联,进而可以使标记显示的关键词更加符合用户的信息需求。
本申请实施例中,所述关键词与所述源语言搜索词之间的匹配度可以包括:所述关键词与所述源语言搜索词之间的相似度等,预置匹配度条件可以包括:相似度超过相似度阈值。在实际应用中,可以分别确定所述关键词对应的第一词向量、以及所述源语言搜索词对应的第二词向量,然后,通过欧氏距离、夹角余弦等相似度度量,计算第一词向量与第二词向量之间的相似度。
当然,上述相似度只是作为匹配度的一种可选实施例,实际上,本领域技术人员可以根据实际应用需求采用其他匹配度,如语义相关度等,本申请实施例对于具体的匹配度不加以限制。
在本申请实施例的一种应用示例中,可以对关键词“三月”与源语言搜索词“3月”进行匹配度判断,若该匹配度符合预置匹配度条件,则可以对源语言翻译结果中的关键词“三月”进行标记显示。
在本申请实施例的另一种应用示例中,可以对关键词为“迪士尼”与源语言搜索词“迪斯尼”进行匹配度判断,若该匹配度符合预置匹配度条件,则可以对源语言翻译结果中的关键词“迪士尼”进行标记显示。
可以理解,在本申请的其他实施例中,若所述关键词与所述源语言搜索词之间的匹配度不符合预置匹配度条件,则可以不对所述源语言翻译结果中的所述关键词进行标记显示,以避免与源语言搜索词之间的关联较低的关键词被标记显示。
方法实施例三
参照图4,示出了本申请的一种跨语言搜索方法实施例三的步骤流程图,具体可以包括如下步骤:
步骤401、将源语言搜索词翻译为目标语言搜索词;
步骤402、针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到;
步骤403、判断所述源语言翻译结果是否包括所述源语言搜索词;
步骤404、若所述源语言翻译结果不包括所述源语言搜索词,则从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
步骤405、对所述源语言翻译结果中的所述关键词进行标记显示。
相对于图2所示方法实施例一,图4所示方法实施例三在从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,可以判断所述源语言翻译结果是否包括所述源语言搜索词,若否,则从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,并对所述源语言翻译结果中的所述关键词进行标记显示;由此可以在源语言翻译结果中不存在源语言搜索词的情况下,实现对于源语言翻译结果的标红。
可以理解,上述在所述源语言翻译结果不包括所述源语言搜索词的情况下、从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,只是作为可选实施例,实际上,本申请实施例对于“从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词”对应的执行条件不加以限制。例如,在所述源语言翻译结果包括所述源语言搜索词的情况下,也可以从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,由此可以实现源语言翻译结果中源语言搜索词和关键词的同时标红。相应地,本申请实施例的方法还可以包括:若所述源语言翻译结果包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
方法实施例四
参照图5,示出了本申请的一种跨语言搜索方法实施例四的步骤流程图,具体可以包括如下步骤:
步骤501、将源语言搜索词翻译为目标语言搜索词;
步骤502、针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果可以为对所述目标语言搜索结果进行翻译得到;
步骤503、从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
步骤504、对所述源语言翻译结果中的所述关键词进行标记显示;
相对于图2所示方法实施例一,图5所示方法实施例四还可以包括:
步骤505、获取所述目标语言搜索词对应的至少两种翻译结果;
步骤506、对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
由于源语言翻译结果为对所述目标语言搜索结果进行翻译得到,故源语言翻译结果往往具有多种可能性,这样,若仅仅将源语言翻译结果中、与源语言搜索词相同的内容进行标红,则导致标红内容的覆盖率较低。例如,对于目标语言搜索结果包含的内容“Trump”而言,其源语言翻译结果 可能包括:“特朗普”、“川普”等多种翻译可能,若源语言搜索词为“特朗普”,则仅仅能够实现对于“特朗普”这一种翻译可能的标红,而无法实现对于“川普”等其他翻译可能的标红,故导致标红内容的覆盖率较低。
针对上述标红内容的覆盖率低的技术问题,本申请实施例可以获取目标语言搜索词对应的的至少两种翻译结果,并对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。由于至少两种翻译结果可以反映目标语言搜索词到源语言的至少两种翻译可能,故本申请实施例能够通过源语言翻译结果实现至少两种翻译可能的标红,因此能够提高源语言翻译结果中标红内容的覆盖率,进而能够提高从目标语言搜索词对应的搜索结果中获取信息的效率。
在本申请的一种应用示例中,若源语言搜索词为“特朗普”,对应的目标语言搜索词为“Trump”,而对于目标语言搜索词“Trump”对应的目标语言搜索结果包含的内容“Trump”而言,其对应的源语言翻译结果可能包括:“特朗普”、“川普”等多种翻译可能,则本申请实施例可以获取“特朗普”、“川普”等至少两种翻译可能,并通过源语言翻译结果实现至少两种翻译可能的标红,因此能够提高标红内容的覆盖率。
对于获取所述目标语言搜索词对应的至少两种翻译结果,本申请实施例可以提供如下的获取方案:
获取方案1
获取方案1可以对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果。可选地,可以采用翻译模型将目标语言搜索词翻译为多个翻译结果,并从翻译得到的结果中筛选得到其中的至少两种。
在本申请的一种可选实施例中,上述从翻译得到的结果中筛选得到至少两种源语言的翻译结果,具体可以包括:从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果。例如,可以依据“Trump”翻译得到多个翻译结果:“川普”、“特朗普”、“王牌”、“好人”、“老实人”、“手段”、“超越”等,并针对“Trump”对应的多个翻译结果,选择置信度较高的至少两种翻译结果,例如“川普”、“特朗普”。可选地,源语言翻译结果的置信度可以为目标语言搜索词到源语言翻译结果的翻译概率,该翻译概率可由翻译模型计算得到,可以理解,本申请实施例对于源语言翻译结果的置信度及其获取方式不加以限制。
在本申请的另一种可选实施例中,上述从翻译得到的结果中筛选得到至少两种源语言的翻译结果,具体可以包括:依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种翻译结果。以“Trump”对 应的翻译结果为例,其中可以包括:名词“川普”、“特朗普”、“王牌”、“好人”、“老实人”、“手段”、以及动词“超越”等,本可选实施例可以依据源语言搜索词的词性,对至少两种翻译结果进行筛选,以得到与源语言搜索词的词性一致的翻译结果,因此能够提高翻译结果的准确度。例如,源语言搜索词为“特朗普”,源语言搜索词的词性为“名词”或者“专有名词”,则可以依据“名词”,过滤掉动词“超越”;或者,可以依据“专有名词”,过滤掉动词“超越”和非专有名词“王牌”、“好人”、“老实人”、“手段”。
获取方案2
获取方案2可以在对所述目标语言搜索结果进行从目标语言到源语言的翻译过程中,对所述目标语言搜索结果中的目标内容对应的翻译结果进行记录,以得到所述目标语言搜索词对应的至少两种翻译结果;其中,所述目标内容为所述目标语言搜索结果中与所述目标语言搜索词相同的内容。
在实际应用中,目标语言搜索结果的背景、上下文等因素,导致目标语言搜索结果中同一内容在不同背景和上下文中可以对应不同的翻译结果,例如,“Apple”可以被翻译为“水果”,也可以被翻译为“苹果手机”,还可以被翻译为“炸弹”等。
为了获得目标语言搜索结果中的目标内容对应的翻译结果,获取方案2可以在对所述目标语言搜索结果进行从目标语言到源语言的翻译过程中,对所述搜索结果中的目标内容对应的翻译结果进行记录;由于该目标内容为所述目标语言搜索结果中与所述目标语言搜索词相同的内容,因此,在翻译完成后,可以从所述记录内容中获得目标语言搜索词对应的至少两种翻译结果。以目标语言搜索词“Trump”为例,对应的目标内容也为“Trump”,可以通过翻译模型记录目标语言搜索结果中包括的“Trump”被翻译为什么,例如对应不同的语境或上下文,可能被翻译为了“特朗普”、“川普”、或者“王牌”等等,记录所有出现的翻译结果,以得到所述目标语言搜索词对应的至少两种翻译结果。可选地,上述,对所述搜索结果中目标内容对应的翻译结果进行记录可以包括:对目标内容对应的每一个翻译结果进行记录,或者,将目标内容对应的当前翻译结果与记录的翻译结果进行比较,若不同,则进行记录,若相同,则不记录。
以上通过获取方案1和获取方案2对获取所述目标语言搜索词对应的至少两种翻译结果的过程进行了详细介绍,可以理解,本领域技术人员可以根据实际应用需求,采用获取方案1和获取方案2中的任一或者组合,或者,还可以采用其他获取方案,可以理解,本申请实施例对于获取所述目标语言搜索词对应的至少两种翻译结果的具体过程不加以限制。
综上,本申请实施例的跨语言搜索方法,可以对目标语言搜索结果进行 从目标语言到源语言的翻译,以得到源语言翻译结果,并对所述源语言翻译结果中、与目标语言搜索词所对应至少两种翻译结果相匹配的内容进行标记显示;由于上述至少两种翻译结果可以反映目标语言搜索词到源语言的至少两种翻译可能,故本申请实施例能够通过源语言翻译结果实现至少两种翻译可能的标红,因此不仅可以实现对源语言翻译结果中与源语言搜索词相同内容的标红,还能实现对源语言翻译结果中与源语言搜索词相关内容(形式不同,但对应的目标语言翻译结果相同)的标红,从而能够提高源语言形式的源语言翻译结果中标红内容的覆盖率,进而能够提高用户从源语言翻译结果中获取信息的效率。
需要说明的是,对于方法实施例,为了简单描述,故将其都表述为一系列的运动动作组合,但是本领域技术人员应该知悉,本申请实施例并不受所描述的运动动作顺序的限制,因为依据本申请实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的运动动作并不一定是本申请实施例所必须的。
装置实施例
参照图6,示出了本申请的一种跨语言搜索装置实施例的结构框图,具体可以包括:搜索词翻译模块601、结果获取模块602、关键词获取模块603和标记显示模块603。
其中,搜索词翻译模块601,配置为将源语言搜索词翻译为目标语言搜索词;
结果获取模块602,配置为针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
关键词获取模块603,配置为从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
标记显示模块604,配置为对所述源语言翻译结果中的所述关键词进行标记显示。
可选地,所述关键词获取模块603可以包括:
映射建立子模块,配置为建立所述目标语言搜索结果中可以包括的目标语言词汇与所述源语言翻译结果可以包括的源语言词汇之间的映射关系;以及
映射查找子模块,配置为依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果可以包括的、与所述目标语言搜索词目匹配的关键词。
可选地,所述标记显示模块604,具体配置为若所述关键词与所述源语 言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
可选地,所述装置还可以包括:
判断模块,配置为在所述关键词获取模块从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,判断所述源语言翻译结果是否可以包括所述源语言搜索词,若所述源语言翻译结果不可以包括所述源语言搜索词,则触发所述关键词获取模块从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词。
可选地,所述装置还可以包括:
第二标记显示模块,配置为若所述源语言翻译结果可以包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
可选地,所述装置还可以包括:
翻译结果获取模块,配置为获取所述目标语言搜索词对应的至少两种翻译结果;
第三标记显示模块,配置为对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
可选地,所述源语言翻译结果获取子模块可以包括:
筛选单元,配置为对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果;
其中,所述筛选单元可以包括:
第一筛选子单元,配置为从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果;或者,
第二筛选子单元,配置为依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种翻译结果。
对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。
本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。
本申请实施例公开了一种用于跨语言搜索的装置,包括有存储器,以及一个或者一个以上的程序,其中一个或者一个以上程序存储于存储器中,且经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:将源语言搜索词翻译为目标语言搜索词;针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其 中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;对所述源语言翻译结果中的所述关键词进行标记显示。
可选地,所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,包括:
建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系;
依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果包括的、与所述目标语言搜索词相匹配的关键词。
可选地,所述对所述源语言翻译结果中的所述关键词进行标记显示,包括:
若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
可选地,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
在所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,判断所述源语言翻译结果是否包括所述源语言搜索词;
若所述源语言翻译结果不包括所述源语言搜索词,则执行所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的步骤。
可选地,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
若所述源语言翻译结果包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
可选地,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
获取所述目标语言搜索词对应的至少两种翻译结果;
对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
可选地,所述获取所述目标语言搜索词对应的至少两种翻译结果,包括:
对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果;
其中,所述从翻译得到的结果中筛选得到至少两种源语言的翻译结果包括:
从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果;或者,
依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种翻译结果。
图7是根据一示例性实施例示出的一种用于跨语言搜索的装置900作为终端时的框图。例如,装置900可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等。
参照图7,装置900可以包括以下一个或多个组件:处理组件902,存储器904,电源组件906,多媒体组件908,音频组件910,输入/输出(I/O)的接口912,传感器组件914,以及通信组件916。
处理组件902通常控制装置900的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理元件902可以包括一个或多个处理器920来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件902可以包括一个或多个模块,便于处理组件902和其他组件之间的交互。例如,处理组件902可以包括多媒体模块,以方便多媒体组件908和处理组件902之间的交互。
存储器904被配置为存储各种类型的数据以支持在设备900的操作。这些数据的示例包括用于在装置900上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器904可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。
电源组件906为装置900的各种组件提供电力。电源组件906可以包括电源管理***,一个或多个电源,及其他与为装置900生成、管理和分配电力相关联的组件。
多媒体组件908包括在所述装置900和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动运动动作的边界,而且还检测与所述触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件908包括一个前置摄像头和/或后置摄像头。当设备900处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜***或具有焦距和光学变焦能力。
音频组件910被配置为输出和/或输入音频信号。例如,音频组件910 包括一个麦克风(MIC),当装置900处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器904或经由通信组件916发送。在一些实施例中,音频组件910还包括一个扬声器,用于输出音频信号。
I/O接口912为处理组件902和***接口模块之间提供接口,上述***接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。
传感器组件914包括一个或多个传感器,用于为装置900提供各个方面的状态评估。例如,传感器组件914可以检测到设备900的打开/关闭状态,组件的相对定位,例如所述组件为装置900的显示器和小键盘,传感器组件914还可以检测装置900或装置900一个组件的位置改变,用户与装置900接触的存在或不存在,装置900方位或加速/减速和装置900的温度变化。传感器组件914可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件914还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件914还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。
通信组件916被配置为便于装置900和其他设备之间有线或无线方式的通信。装置900可以接入基于通信标准的无线网络,如WiFi,2G或3G,或它们的组合。在一个示例性实施例中,通信部件916经由广播信道接收来自外部广播管理***的广播信号或广播相关信息。在一个示例性实施例中,所述通信部件916还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。
在示例性实施例中,装置900可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器904,上述指令可由装置900的处理器920执行以完成上述方法。例如,所述非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。
图8是根据一示例性实施例示出的一种用于跨语言搜索的装置作为服务器时的框图。该服务器1900可因配置或性能不同而产生比较大的差异,可以包括一个或一个以***处理器(central processing units,CPU)1922(例如,一个或一个以上处理器)和存储器1932,一个或一个以上存储应用程序 1942或数据1944的存储介质1930(例如一个或一个以上海量存储设备)。其中,存储器1932和存储介质1930可以是短暂存储或持久存储。存储在存储介质1930的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括对服务器中的一系列指令操作。更进一步地,中央处理器1922可以设置为与存储介质1930通信,在服务器1900上执行存储介质1930中的一系列指令操作。
服务器1900还可以包括一个或一个以上电源1926,一个或一个以上有线或无线网络接口1950,一个或一个以上输入输出接口1958,一个或一个以上键盘1956,和/或,一个或一个以上操作***1941,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM等等。
一种非临时性计算机可读存储介质,当所述存储介质中的指令由装置(终端或服务器)的处理器执行时,使得终端能够执行一种跨语言搜索方法,所述方法包括:将源语言搜索词翻译为目标语言搜索词;针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;对所述源语言翻译结果中的所述关键词进行标记显示。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本申请的真正范围和精神由下面的权利要求指出。
应当理解的是,本申请并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求来限制
以上所述仅为本申请的较佳实施例,并不用以限制本申请,凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。
以上对本申请所提供的一种跨语言搜索方法、一种跨语言搜索装置和一种用于跨语言搜索的装置,进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。

Claims (22)

  1. 一种跨语言搜索方法,其特征在于,包括:
    将源语言搜索词翻译为目标语言搜索词;
    针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
    从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
    对所述源语言翻译结果中的所述关键词进行标记显示。
  2. 根据权利要求1所述的方法,其特征在于,所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,包括:
    建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系;
    依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果包括的、与所述目标语言搜索词相匹配的关键词。
  3. 根据权利要求1或2所述的方法,其特征在于,所述对所述源语言翻译结果中的所述关键词进行标记显示,包括:
    若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
  4. 根据权利要求1或2所述的方法,其特征在于,在所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,所述方法还包括:
    判断所述源语言翻译结果是否包括所述源语言搜索词;
    若所述源语言翻译结果不包括所述源语言搜索词,则执行所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的步骤。
  5. 根据权利要求1或2所述的方法,其特征在于,所述方法还包括:
    若所述源语言翻译结果包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
  6. 根据权利要求1或2所述的方法,其特征在于,所述方法还包括:
    获取所述目标语言搜索词对应的至少两种翻译结果;
    对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
  7. 根据权利要求6所述的方法,其特征在于,所述获取所述目标语言搜索词对应的至少两种翻译结果,包括:
    对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果;
    其中,所述从翻译得到的结果中筛选得到至少两种源语言的翻译结果包括:
    从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果;或者,
    依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种翻译结果。
  8. 一种跨语言搜索装置,其特征在于,包括:
    搜索词翻译模块,配置为将源语言搜索词翻译为目标语言搜索词;
    结果获取模块,配置为针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
    关键词获取模块,配置为从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;以及
    标记显示模块,配置为对所述源语言翻译结果中的所述关键词进行标记显示。
  9. 根据权利要求8所述的装置,其特征在于,所述关键词获取模块包括:
    映射建立子模块,配置为建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系;以及
    映射查找子模块,配置为依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果包括的、与所述目标语言搜索词相匹配的关键词。
  10. 根据权利要求8或9所述的装置,其特征在于,所述标记显示模块,具体配置为若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
  11. 根据权利要求8或9所述的装置,其特征在于,所述装置还包括:
    判断模块,配置为在所述关键词获取模块从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,判断所述源语言翻译结果是否包括所述源语言搜索词,若所述源语言翻译结果不包括所述源语言搜索词,则触发所述关键词获取模块从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词。
  12. 根据权利要求8或9所述的装置,其特征在于,所述装置还包括:
    第二标记显示模块,配置为若所述源语言翻译结果包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
  13. 根据权利要求8或9所述的装置,其特征在于,所述装置还包括:
    翻译结果获取模块,配置为获取所述目标语言搜索词对应的至少两种翻译结果;
    第三标记显示模块,配置为对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
  14. 根据权利要求13所述的装置,其特征在于,所述源语言翻译结果获取子模块包括:
    筛选单元,配置为对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果;
    其中,所述筛选单元包括:
    第一筛选子单元,配置为从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果;或者,
    第二筛选子单元,配置为依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种源语言翻译结果。
  15. 一种用于跨语言搜索的装置,其特征在于,包括有存储器,以及一个或者一个以上的程序,其中一个或者一个以上程序存储于存储器中,且经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
    将源语言搜索词翻译为目标语言搜索词;
    针对所述目标语言搜索词,获取其对应的目标语言搜索结果和源语言翻译结果;其中,所述源语言翻译结果为对所述目标语言搜索结果进行翻译得到;
    从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词;
    对所述源语言翻译结果中的所述关键词进行标记显示。
  16. 根据权利要求15所述的装置,其特征在于,所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词,包括:
    建立所述目标语言搜索结果中包括的目标语言词汇与所述源语言翻译结果包括的源语言词汇之间的映射关系;
    依据所述目标语言搜索词,在所述映射关系中进行查找,以得到所述源语言翻译结果包括的、与所述目标语言搜索词相匹配的关键词。
  17. 根据权利要求15或16所述的装置,其特征在于,所述对所述源语言翻译结果中的所述关键词进行标记显示,包括:
    若所述关键词与所述源语言搜索词之间的匹配度符合预置匹配度条件,则对所述源语言翻译结果中的所述关键词进行标记显示。
  18. 根据权利要求15或16所述的装置,其特征在于,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
    在所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词之前,判断所述源语言翻译结果是否包括所述源语言搜索词;
    若所述源语言翻译结果不包括所述源语言搜索词,则执行所述从所述源语言翻译结果中获取与所述目标语言搜索词相匹配的关键词的步骤。
  19. 根据权利要求15或16所述的装置,其特征在于,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
    若所述源语言翻译结果包括所述源语言搜索词,则对所述源语言翻译结果中的所述源语言搜索词进行标记显示。
  20. 根据权利要求15或16所述的装置,其特征在于,所述装置还经配置以由一个或者一个以上处理器执行所述一个或者一个以上程序包含用于进行以下操作的指令:
    获取所述目标语言搜索词对应的至少两种翻译结果;
    对所述源语言翻译结果中、与所述目标语言搜索词对应的至少两种翻译结果相匹配的内容进行标记显示。
  21. 根据权利要求20所述的装置,其特征在于,所述获取所述目标语言搜索词对应的至少两种翻译结果,包括:
    对所述目标语言搜索词进行从目标语言到源语言的翻译,从翻译得到的结果中筛选得到至少两种源语言的翻译结果;
    其中,所述从翻译得到的结果中筛选得到至少两种源语言的翻译结果包括:
    从所述翻译得到的结果中选择置信度符合预置置信度条件的至少两种源语言的翻译结果;或者,
    依据所述源语言搜索词的词性,从所述翻译得到的结果中筛选得到至少两种翻译结果。
  22. 一个或多个机器可读介质,其上存储有指令,当由一个或多个处理器执行时,使得装置执行如权利要求1至7中一个或多个所述的跨语言搜索方法。
PCT/CN2018/100990 2017-12-08 2018-08-17 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置 WO2019109663A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711298752.0 2017-12-08
CN201711298752.0A CN108255940A (zh) 2017-12-08 2017-12-08 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置

Publications (1)

Publication Number Publication Date
WO2019109663A1 true WO2019109663A1 (zh) 2019-06-13

Family

ID=62722397

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/100990 WO2019109663A1 (zh) 2017-12-08 2018-08-17 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置

Country Status (2)

Country Link
CN (1) CN108255940A (zh)
WO (1) WO2019109663A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108255940A (zh) * 2017-12-08 2018-07-06 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN109408822A (zh) * 2018-10-30 2019-03-01 中译语通科技股份有限公司 一种跨语言图书相关度分析方法及***
CN112528129B (zh) * 2019-08-28 2024-03-22 北京搜狗科技发展有限公司 多语种翻译***语种搜索方法及装置
CN111126087B (zh) * 2019-12-25 2023-08-29 北京百度网讯科技有限公司 领域翻译处理方法、装置及设备
CN112163433B (zh) * 2020-09-29 2022-04-05 北京字跳网络技术有限公司 关键词汇的匹配方法、装置、电子设备及存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140149445A1 (en) * 2012-11-26 2014-05-29 Inventec Corporation Webpage querying system and querying method thereof
CN106599206A (zh) * 2016-12-15 2017-04-26 北京小米移动软件有限公司 搜索信息的方法及装置
CN106919642A (zh) * 2017-01-13 2017-07-04 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN108255940A (zh) * 2017-12-08 2018-07-06 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN108304412A (zh) * 2017-01-13 2018-07-20 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN108334526A (zh) * 2017-01-20 2018-07-27 北京搜狗科技发展有限公司 搜索结果项的展示方法和装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140149445A1 (en) * 2012-11-26 2014-05-29 Inventec Corporation Webpage querying system and querying method thereof
CN106599206A (zh) * 2016-12-15 2017-04-26 北京小米移动软件有限公司 搜索信息的方法及装置
CN106919642A (zh) * 2017-01-13 2017-07-04 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN108304412A (zh) * 2017-01-13 2018-07-20 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN108334526A (zh) * 2017-01-20 2018-07-27 北京搜狗科技发展有限公司 搜索结果项的展示方法和装置
CN108255940A (zh) * 2017-12-08 2018-07-06 北京搜狗科技发展有限公司 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置

Also Published As

Publication number Publication date
CN108255940A (zh) 2018-07-06

Similar Documents

Publication Publication Date Title
WO2019109663A1 (zh) 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
WO2019109664A1 (zh) 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
JP2018504727A (ja) 参考文書の推薦方法及び装置
CN108304412B (zh) 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN111128183B (zh) 语音识别方法、装置和介质
WO2019165832A1 (zh) 文字信息处理方法、装置及终端
CN110019675B (zh) 一种关键词提取的方法及装置
CN111708943B (zh) 一种搜索结果展示方法、装置和用于搜索结果展示的装置
WO2018120447A1 (zh) 一种医案信息的处理方法、装置和设备
WO2023078414A1 (zh) 相关文章搜索方法、装置、电子设备和存储介质
CN111538830B (zh) 法条检索方法、装置、计算机设备及存储介质
CN106919642B (zh) 一种跨语言搜索方法和装置、一种用于跨语言搜索的装置
CN107424612B (zh) 处理方法、装置和机器可读介质
CN108345625A (zh) 一种信息挖掘方法和装置、一种用于信息挖掘的装置
WO2018018912A1 (zh) 一种搜索方法、装置及电子设备
JP7208968B2 (ja) 情報処理方法、装置および記憶媒体
CN110020106B (zh) 一种推荐方法、推荐装置和用于推荐的装置
CN108628461B (zh) 一种输入方法和装置、一种更新词库的方法和装置
CN107784037B (zh) 信息处理方法和装置、用于信息处理的装置
CN113033163A (zh) 一种数据处理方法、装置和电子设备
US20230267282A1 (en) Poetry generation
CN108241614B (zh) 信息处理方法和装置、用于信息处理的装置
CN111752436A (zh) 一种推荐方法、装置和用于推荐的装置
CN109992790B (zh) 数据处理方法和装置、用于数据处理的装置
CN110929122B (zh) 一种数据处理方法、装置和用于数据处理的装置

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18885996

Country of ref document: EP

Kind code of ref document: A1