CN106681523A - Library configuration method, library configuration device and call handling method of input method - Google Patents

Library configuration method, library configuration device and call handling method of input method Download PDF

Info

Publication number
CN106681523A
CN106681523A CN201510752419.7A CN201510752419A CN106681523A CN 106681523 A CN106681523 A CN 106681523A CN 201510752419 A CN201510752419 A CN 201510752419A CN 106681523 A CN106681523 A CN 106681523A
Authority
CN
China
Prior art keywords
contact person
word
user
everyday words
intersection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510752419.7A
Other languages
Chinese (zh)
Inventor
王务志
王军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201510752419.7A priority Critical patent/CN106681523A/en
Publication of CN106681523A publication Critical patent/CN106681523A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • G06F3/0237Character input methods using prediction or retrieval techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a library configuration method, a library configuration device and a call handling method of an input method. The library configuration method includes: looking up and extracting chat content between a user and all contact persons according to the contact persons in all social software in user terminal; converting all the chat content into a text collection and performing associated storage on the generated text collection and the corresponding contact persons; performing word segmentation on the text collection, extracting common words from word segmentation and storing the common words with use frequency greater than predetermined value in word segmentation libraries of the contact persons associated with the text collection. The word segmentation libraries can be specially configured for the different contact persons according to the chat content between the contact persons and the user, and the word segmentation libraries associated with the contact persons can be called during chatting of the user and the contact persons, so that the input method is more suitable for chat habits of the user on the different contact persons, and the hit rate of the common words suitable for the chat habits of the user and the contact persons can be greatly increased.

Description

A kind of character library collocation method of input method, device and calling processing method
Technical field
The application belongs to character library processing technology field, specifically, is related to the character library collocation method and device of a kind of input method.
Background technology
With the fast development of internet, the species of social software becomes increasingly to enrich, compare main flow including QQ, wechat, MSN etc., the instant messaging between user can be realized by internet.Generally, social software can all provide chatting object's list, similar to the address list of mobile phone, user may be selected in chatting object list someone chatted.
There is the people of social networks in chatting object's list with user comprising relative, friend, classmate, colleague etc., find through statistics, user can adopt different chat modes for the chatting object of different social networks, for example, when chatting with colleague, it is related to the related affairs of work mostly, the content of chat is more serious, and entry is more rigorous;When chatting with household, domestic trivia is related to mostly, the content of chat is more random, entry also compares colloquial style, or even some dialects;And when chatting with friend, the content of chat is related to amusement and recreation mostly, entry then can be relatively more active, light, even with some network hot words.
So, user is when using social software, for different chatting objects, commonly used word is different, and existing input method generally only includes a character library, no matter whom chatting object is, what input method was used is all same character library and for entry temperature uses same standard, it is impossible to for the character library that different chatting objects provides personalization.
The content of the invention
In view of this, technical problems to be solved in this application there is provided a kind of character library collocation method, device and the calling processing method of input method, exclusive point character library is configured for different contact persons, so that input method more conforms to user for the chat of different contact persons is accustomed to.
In order to solve above-mentioned technical problem, this application discloses a kind of character library collocation method of input method, including:
According to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and extracted;
The chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature.
Preferably, according to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and is extracted, further include:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
Preferably, the chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further included:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further included:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
Preferably, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature, further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
Preferably, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
Preferably, the chat content is converted to into word intersection, is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Preferably, participle is carried out to the word intersection and extracts everyday words, further included:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Preferably, point character library to contact person association will be preserved more than the everyday words of predetermined value using temperature, is further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
Disclosed herein as well is a kind of character library configuration device of input method, including:
Inquiry unit, for according to the contact person of each social software in user terminal, searching and extracting the chat content between user and the contact person;
Conversion equipment, for the chat content to be converted to into word intersection, and is associated preservation by the word intersection of generation with corresponding contact person;
Participle storage device, for participle to be carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection using temperature more than the everyday words of predetermined value.
Preferably, the inquiry unit, is further used for:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
Preferably, the conversion equipment, is further used for:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the conversion equipment, is further used for:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
Preferably, the participle storage device, is further used for:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
Preferably, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
Preferably, the conversion equipment, is further used for:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Preferably, the participle storage device, is further used for:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Preferably, the participle storage device, is further used for:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
Disclosed herein as well is a kind of character library calling processing method of input method, including:
Start input method, the contact person of current social software in identifying user terminal;
Point character library of contact person's association is called, wherein, the configuration process of point character library of contact person's association is:According to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and extracted;The chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;Participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature;
Obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by being included in interface of input method using temperature order.
Preferably, according to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and is extracted, further include:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
Preferably, the chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further included:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further included:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Preferably, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
Preferably, point character library of contact person's association is called, is further included:
According to the unifying identifier of the contact person of the current social software, point character library of contact person's association is searched and transferred.
Preferably, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature, further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
Preferably, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
Preferably, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, further including using temperature order:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, and by the everyday words related to the input method coding of the user input by being included in the interface of input method using temperature order.
Preferably, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, further including using temperature order:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, if not finding the everyday words related to the input method coding of the user input, then the input method coding of the user input is disassembled, according to the multigroup input method coding for disassembling gained, search the coding of the everyday words in the analysis storehouse of contact person association respectively, and by the everyday words related to first group of input method coding for disassembling gained by being included in the interface of input method using temperature order;
After selecting the related everyday words of first group of input method coding for disassembling gained, by the everyday words related to second group of input method coding for disassembling gained by being included in the interface of input method using temperature order.
Preferably, if not finding the everyday words related to the input method coding of the user input, the input method coding of the user input is disassembled, according to the input method coding for disassembling gained, the coding of the everyday words is searched in the analysis storehouse of contact person association, is further included:
If not finding the everyday words related to the input method coding for disassembling gained in the library of molecules that the contact person is associated, then continue the input method coding to be disassembled for single coding, and the individual character related to the single coding for disassembling gained is included in the interface of input method.
Preferably, the chat content is converted to into word intersection, is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Preferably, participle is carried out to the word intersection and extracts everyday words, further included:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Preferably, point character library to contact person association will be preserved more than the everyday words of predetermined value using temperature, is further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
Preferably, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, also including using temperature order before:
The open command of monitoring interface of input method;
After interface of input method open command is monitored, and before the input method coding for getting user input, by the everyday words in point character library that contact person's association is shown using temperature order in the interface of input method.
Compared with prior art, the application can be obtained including following technique effect:
1) it is that different contact persons configure exclusive point character library, point character library that the contact person can be called to associate when user chats with contact person, so that input method more conforms to user for the chat of different contact persons is accustomed to according to contact person and the chat content of user.
2) the same contact person in multiple social softwares is multiplexed same point of character library, not only saves the space resources that point character library takes, and is also capable of achieving the centralized and unified management to contact person in social software.
3) when user and contact person chat, point character library of contact person's association can be directly invoked, and according to the coding of user input, by in point character library match related everyday words according to using temperature include in interface of input method for user select, the hit rate for meeting user and the everyday words of contact person chat custom is substantially increased, the input time of chat content is saved.
Certainly, implementing arbitrary product of the application must be not necessarily required to while reaching all the above technique effect.
Description of the drawings
Accompanying drawing described herein is used for providing further understanding of the present application, constitutes the part of the application, and the schematic description and description of the application is used to explain the application, does not constitute the improper restriction to the application.In the accompanying drawings:
Fig. 1 is the flow chart of the embodiment of the present application one;
Fig. 2 is the structural representation of the embodiment of the present application two;
Fig. 3 is the flow chart of the embodiment of the present application three.
Specific embodiment
Describe presently filed embodiment in detail below in conjunction with drawings and Examples, thereby to the application how application technology means come solve technical problem and reach technology effect realize that process can fully understand and implement according to this.
The application provides a kind of character library collocation method of input method, device and call method, can be applicable to the contact person in user terminal for each social software and set up exclusive point character library, and call corresponding point of character library of the contact person when user chats with contact person, so as to word that can be used in interface of input method in corresponding point of character library of the contact person, input method is set more to meet word custom of the user for different contact persons.
In addition, during the application is except being applied to the chat scenario of above-mentioned social software, applying also for other application occasion, this is not construed as limiting.
As shown in figure 1, embodiments herein one provides a kind of character library collocation method of input method, including:
Step 101, according to the contact person of each social software in user terminal, searches and extracts the chat content between user and the contact person;
Step 102, by the chat content word intersection is converted to, and the word intersection of generation is associated into preservation with corresponding contact person;
Step 103, participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature.
Wherein, in step 101, when contact person and the user of each social software chat, chat content can be produced, the chat content can be the chat data that the chat record that each social software is automatically saved, or user terminal are actively preserved, and here is not limited.In addition, searching and extract chat content can be only in accordance with chat content currently stored in user terminal, also periodically chat content can be made a look up and is extracted, and at any time subsequent step can be performed according to all chat contents that are newest extraction or repeatedly extracting, to update point character library of contact person.
In step 102, due to the data type that chat content is included it is varied, can be before conversion word intersection action be performed, the data filtering that picture in chat content, expression, animation, video etc. cannot be converted to word is fallen, to avoid mess code occur in word intersection, so as to ensure the regular of word intersection, step 103 is facilitated to carry out participle.
Some participles can be obtained after the participle that step 103 performs to word intersection, everyday words is extracted from these participles, and temperature sequence is carried out to everyday words, just can be admitted in the participle storehouse of contact person's association more than the everyday words of predetermined value using temperature.
Due to the social software in user terminal may comprising multiple, and the contact person in social software be also it is multiple, in such a scenario:
In the present embodiment, step 101 is further included:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
Step 101 not only needs the chat content for searching and extracting each contact person and user in single social software, also need will to search and extract the action of chat content and be applied in each social software, so as to all of chat content of each contact person for carrying out chatting with user in user terminal can be obtained, the chat content that the contact person occurs in multiple different social softwares with user may be included in all of chat content of the contact person.
In the present embodiment, the chat content occurred with user in different social softwares for same contact person is converted to the problem of word intersection, and step 102 provides various settling modes, such as:
Mode one:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection for merging generation is associated into preservation with the identical contact person and its unifying identifier.
Mode two:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Same contact person is possibly comprised in multiple social softwares, such as " Zhang San " is both the contact person in user terminal in QQ, it is also the contact person in wechat, but No. QQ of " Zhang San " and micro-signal may be not consistent, under this situation, the Subscriber Number in social software cannot know which essence is same contact person.By setting unifying identifier for contact person, but identical contact person in each social software is identified by unifying identifier, although that is, No. QQ of " Zhang San " and micro-signal and different, but the unifying identifier of " Zhang San " is identical in QQ and wechat.
According to mode one, the actual identity that unifying identifier identifies respectively " Zhang San " in QQ and wechat can be passed through, then the chat content by " Zhang San " in QQ is converted to word intersection one, chat content by it in wechat is converted to word intersection two, and word intersection one and word intersection two are merged into into the final word intersection of " Zhang San ";According to mode two, the chat content of each contact person in each social software can be converted to word intersection, and word intersection and contact person are associated into preservation, so word intersection establishes binding relationship with contact person, again identical contact person in each social software identified by unifying identifier afterwards, the word intersection bound in same correspondents merge after be the final word intersection of the contact person.Finally, the unifying identifier of the final word intersection of the contact person of acquisition and contact person and contact person is associated preservation by mode one and mode two, for example, the final word intersection of " Zhang San " and " Zhang San " and its unifying identifier are associated into preservation.
Unifying identifier due to setting contact person in step 102, for convenience centralized and unified management is carried out to the contact person in each social software, step 102 can also set up a unification address list according to the unifying identifier, contact person in the unified address list covers the All Contacts in each social software, and have recorded the corresponding relation of the contact person in the unified address list and the contact person in each social software, correspondingly, point character library being subsequently generated directly can carry out corresponding association and preserve with the contact person in unified address list.The unified address list can be the address list carried in user terminal, such as cell phone address book, can also individually set up in the user terminal, it is not limited thereto, contact person's scope in unified address list can be more than but must include the scope of All Contacts in each social software, and unifying identifier can be added in the details of unified contact person in address list.So user can easily check which the social software belonging to certain contact person has in unified address list, and can quickly find out corresponding point of character library with the corresponding relation of contact person in social software according to contact person in address list is unified when follow-up input method calls point character library.
Preferably, the unifying identifier includes user terminal number, contact name, remark names etc..For above-mentioned unified address list, user terminal number (cell-phone number of such as contact person) can be set as unifying identifier, by the way that the cell-phone number of contact person is associated with its Subscriber Number in each social software, you can realize the centralized and unified management to contact person.Contact person " Zhang San " as escribed above, the cell-phone number of " Zhang San " can be associated with its No. QQ and micro-signal in unified address list, step 102 when searching and extracting the word intersection of " Zhang San " by its chat content in QQ and its chat content in wechat are quickly recognized according to the cell-phone number of " Zhang San ", and the word intersection for generating is associated into preservation with the cell-phone number of " Zhang San ".
In the present embodiment, for convenience follow-up input method is called to point character library of contact person, and step 103 is further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
When preservation everyday words extremely divides character library, everyday words itself can not only be preserved, also the corresponding various codings of the everyday words can be obtained according to coding methods such as phonetic, shape code, the combination of sound shape, ISNs, and the coding for obtaining is saved in participle storehouse with the conventional word association.So, each everyday words is associated with the codings such as its Pinyin coding, shape code coding in participle storehouse, during using input method, can quickly find corresponding everyday words and show user according to the coding of user input.
As described above, the data type for including in chat content is varied, wherein contains most important chat content in voice and word, in order to these important chat contents are converted to into word intersection.
In the present embodiment, step 102 is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Wherein, the audio parameter includes time interval between modulation in tone, the word of sound etc., voice is split as by multiple phonetic elements according to audio parameter, phonetic element can be individual character either word or phrase, afterwards according to default phonetic element and the mapping relations of word, word is converted speech into.The default phonetic element can be stored as user terminal locally with the mapping relations of word, in may be alternatively stored in the webserver.
Step 103 is further included:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Wherein, carrying out semantic analysis to the word intersection includes, analyze noun, adjective etc., and whether analysis individual character can depend on word composition phrase of its front or behind etc., multiple participles can be obtained by semantic analysis, necessarily contain in the plurality of participle it is a large amount of " ", auxiliary words of mood or the non-personalized word such as " I ", these participles are classified as into stop words, and filter out from multiple participles, just obtain everyday words.
For the everyday words preserved into point character library, each everyday words is different in the service condition that user chats in custom, in order to distinguish the use temperature of everyday words, and more reasonably push order is provided during user's use, and step 103 is further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
The usage frequency can be counted in participle, that is, count the number of times that everyday words occurs in participle, and according to usage frequency word frequency list is generated;And for the chat custom of user, be not the high everyday words of usage frequency be exactly that the word that uses most is wanted when user chats, in addition it is also necessary to the chatting object based on user considers the weight of everyday words.For example, " Zhang San " is the colleague of user, according to word frequency list, " yes " word frequency is far longer than the word frequency of " Media Stream ", but for colleague, " Media Stream " obviously has higher professional and specific aim, therefore the weight of " Media Stream " can be adjusted by weight so as to which the position in using temperature list is in the front of " yes ".
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
The conventional degree of more professional everyday words on the contrary can be lower, and the weight of everyday words can be in inverse ratio with its conventional degree under this scene;Or, everyday words is set as into that the number of times for being placed in first place is directly proportional using during input method according to user, user also can in this way manually adjust the weight of everyday words.
Shown in Fig. 2, the embodiment of the present application two provides a kind of character library configuration device of input method, including:
Inquiry unit 11, for according to the contact person of each social software in user terminal, searching and extracting the chat content between user and the contact person;
Conversion equipment 12, for the chat content to be converted to into word intersection, and is associated preservation by the word intersection of generation with corresponding contact person;
Participle storage device 13, for participle to be carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection using temperature more than the everyday words of predetermined value.
Wherein, when the contact person of each social software chats with user, chat content can be produced, the chat content can be the chat data that the chat record that each social software is automatically saved, or user terminal are actively preserved, and here is not limited.In addition, chat content is searched and extracted in inquiry unit 11 can be only in accordance with chat content currently stored in user terminal, also periodically chat content can be made a look up and is extracted, and at any time subsequent step can be performed according to all chat contents that are newest extraction or repeatedly extracting, to update point character library of contact person.
Due to the data type that chat content is included it is varied, inquiry unit 11 can be additionally used in before conversion word intersection action is performed, the data filtering that picture in chat content, expression, animation, video etc. cannot be converted to word is fallen, to avoid mess code occur in word intersection, so as to ensure the regular of word intersection, participle storage device 13 is facilitated to carry out participle.
Some participles can be obtained after the participle that participle storage device 13 performs to word intersection, everyday words is extracted from these participles, and temperature sequence is carried out to everyday words, just can be admitted in the participle storehouse of contact person's association more than the everyday words of predetermined value using temperature.
Due to the social software in user terminal may comprising multiple, and the contact person in social software be also it is multiple, in such a scenario:
In the present embodiment, inquiry unit 11 is additionally operable to:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
Inquiry unit 11 not only needs the chat content for searching and extracting each contact person and user in single social software, also need will to search and extract the action of chat content and be applied in each social software, so as to all of chat content of each contact person for carrying out chatting with user in user terminal can be obtained, the chat content that the contact person occurs in multiple different social softwares with user may be included in all of chat content of the contact person.
In the present embodiment, the chat content occurred with user in different social softwares for same contact person is converted to the problem of word intersection, and conversion equipment 12 provides various settling modes, such as:
Mode one:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection for merging generation is associated into preservation with the identical contact person and its unifying identifier.
Mode two:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Same contact person is possibly comprised in multiple social softwares, such as " Zhang San " is both the contact person in user terminal in QQ, it is also the contact person in wechat, but No. QQ of " Zhang San " and micro-signal may be not consistent, under this situation, the Subscriber Number in social software cannot know which essence is same contact person.By setting unifying identifier for contact person, but identical contact person in each social software is identified by unifying identifier, although that is, No. QQ of " Zhang San " and micro-signal and different, but the unifying identifier of " Zhang San " is identical in QQ and wechat.
According to mode one, the actual identity that unifying identifier identifies respectively " Zhang San " in QQ and wechat can be passed through, then the chat content by " Zhang San " in QQ is converted to word intersection one, chat content by it in wechat is converted to word intersection two, and word intersection one and word intersection two are merged into into the final word intersection of " Zhang San ";According to mode two, the chat content of each contact person in each social software can be converted to word intersection, and word intersection and contact person are associated into preservation, so word intersection establishes binding relationship with contact person, again identical contact person in each social software identified by unifying identifier afterwards, the word intersection bound in same correspondents merge after be the final word intersection of the contact person.Finally, the unifying identifier of the final word intersection of the contact person of acquisition and contact person and contact person is associated preservation by mode one and mode two, for example, the final word intersection of " Zhang San " and " Zhang San " and its unifying identifier are associated into preservation.
Unifying identifier due to having preset contact person, for convenience centralized and unified management is carried out to the contact person in each social software, conversion equipment 12 can also set up a unification address list according to the unifying identifier, contact person in the unified address list covers the All Contacts in each social software, and have recorded the corresponding relation of the contact person in the unified address list and the contact person in each social software, correspondingly, point character library being subsequently generated directly can carry out corresponding association and preserve with the contact person in unified address list.The unified address list can be the address list carried in user terminal, such as cell phone address book, can also individually set up in the user terminal, it is not construed as limiting again, contact person's scope in unified address list can be more than but must include the scope of All Contacts in each social software, and unifying identifier can be added in the details of unified contact person in address list.So user can easily check which the social software belonging to certain contact person has in unified address list, and can quickly find out corresponding point of character library with the corresponding relation of contact person in social software according to contact person in address list is unified when follow-up input method calls point character library.
Preferably, the unifying identifier includes user terminal number, contact name, remark names etc..For above-mentioned unified address list, user terminal number (cell-phone number of such as contact person) can be set as unifying identifier, by the way that the cell-phone number of contact person is associated with its Subscriber Number in each social software, you can realize the centralized and unified management to contact person.Contact person " Zhang San " as escribed above, the cell-phone number of " Zhang San " can be associated with its No. QQ and micro-signal in unified address list, step 102 when searching and extracting the word intersection of " Zhang San " by its chat content in QQ and its chat content in wechat are quickly recognized according to the cell-phone number of " Zhang San ", and the word intersection for generating is associated into preservation with the cell-phone number of " Zhang San ".
In the present embodiment, for convenience follow-up input method is called to point character library of contact person, and participle storage device 13 is further used for:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
When preservation everyday words extremely divides character library, everyday words itself can not only be preserved, also the corresponding various codings of the everyday words can be obtained according to coding methods such as phonetic, shape code, the combination of sound shape, ISNs, and the coding for obtaining is saved in participle storehouse with the conventional word association.So, each everyday words is associated with the codings such as its Pinyin coding, shape code coding in participle storehouse, during using input method, can quickly find corresponding everyday words and show user according to the coding of user input.
As described above, the data type for including in chat content is varied, wherein contains most important chat content in voice and word, in order to these important chat contents are converted to into word intersection.
In the present embodiment, conversion equipment 12 is further used for:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Wherein, the audio parameter includes time interval between modulation in tone, the word of sound etc., voice is split as by multiple phonetic elements according to audio parameter, phonetic element can be individual character either word or phrase, afterwards according to default phonetic element and the mapping relations of word, word is converted speech into.The default phonetic element can be stored as user terminal locally with the mapping relations of word, in may be alternatively stored in the webserver.
Participle storage device 13 is further used for:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Wherein, carrying out semantic analysis to the word intersection includes, analyze noun, adjective etc., and whether analysis individual character can depend on word composition phrase of its front or behind etc., multiple participles can be obtained by semantic analysis, necessarily contain in the plurality of participle it is a large amount of " ", auxiliary words of mood or the non-personalized word such as " I ", these participles are classified as into stop words, and filter out from multiple participles, just obtain everyday words.
For the everyday words preserved into point character library, service condition of each everyday words in user's chat custom is different, in order to distinguish the use temperature of everyday words, and more reasonably push order is provided during user's use, participle storage device 13 is further used for:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
The usage frequency can be counted in participle, that is, count the number of times that everyday words occurs in participle, and according to usage frequency word frequency list is generated;And for the chat custom of user, be not the high everyday words of usage frequency be exactly that the word that uses most is wanted when user chats, in addition it is also necessary to the chatting object based on user considers the weight of everyday words.For example, " Zhang San " is the colleague of user, according to word frequency list, " yes " word frequency is far longer than the word frequency of " Media Stream ", but for colleague, " Media Stream " obviously has higher professional and specific aim, therefore the weight of " Media Stream " can be adjusted by weight so as to which the position in using temperature list is in the front of " yes ".
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
The conventional degree of more professional everyday words on the contrary can be lower, and the weight of everyday words can be in inverse ratio with its conventional degree under this scene;Or, everyday words is set as into that the number of times for being placed in first place is directly proportional using during input method according to user, user also can in this way manually adjust the weight of everyday words.
As shown in figure 3, the embodiment of the present application three provides a kind of character library calling processing method of input method, including:
Step 201, starts input method, the contact person of current social software in identifying user terminal;
Step 202, calls point character library of contact person's association, wherein, the configuration process of point character library of contact person's association is:According to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and extracted;The chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;Participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature;
Step 203, obtains the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by being included in interface of input method using temperature order.
Wherein, in step 201, when user is in social software in the chat interface of certain contact person, input method is started by triggering command, after input method starts, the corresponding contact person in current chat interface is recognized first, then according to contact person and the incidence relation of point character library, corresponding point character library is found.
In afterwards the step of 202, input method can call point character library that the contact person for finding associates, with using the word in described point of character library.Wherein described point character library is realized configuring according to above step 101~103.
In step 203, input method monitors the coding of user input, the coding can be phonetic, shape code, the combination of sound shape or ISN, the everyday words of correlation can be matched in point character library of the contact person association that step 202 is called according to the coding of user input, the everyday words for matching may have multiple, when the everyday words for matching is multiple, it is displayed in interface of input method according to the use temperature order of multiple everyday words, is selected for user.
In the present embodiment, in order to realize that input method successfully calls point character library of contact person's association, described point of character library can be pre-configured with step 202, specifically include herein below:
The scene of multiple social softwares in order to be suitable for user terminal, the process of step 202 configuration point character library is further included:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
The multiplexing of point character library of the same correspondents in order to realize multiple social softwares, step 202 employs various ways, such as in point character library of configuration contact person:
Mode one:Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Mode two:For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
Same contact person is possibly comprised in multiple social softwares, such as " Zhang San " is both the contact person in user terminal in QQ, it is also the contact person in wechat, but No. QQ of " Zhang San " and micro-signal may be not consistent, under this situation, the Subscriber Number in social software cannot know which essence is same contact person.By setting unifying identifier for contact person, but identical contact person in each social software is identified by unifying identifier, although that is, No. QQ of " Zhang San " and micro-signal and different, but the unifying identifier of " Zhang San " is identical in QQ and wechat.
According to mode one, the actual identity that unifying identifier identifies respectively " Zhang San " in QQ and wechat can be passed through, then the chat content by " Zhang San " in QQ is converted to word intersection one, chat content by it in wechat is converted to word intersection two, and word intersection one and word intersection two are merged into into the final word intersection of " Zhang San ";According to mode two, the chat content of each contact person in each social software can be converted to word intersection, and word intersection and contact person are associated into preservation, so word intersection establishes binding relationship with contact person, again identical contact person in each social software identified by unifying identifier afterwards, the word intersection bound in same correspondents merge after be the final word intersection of the contact person.Finally, the unifying identifier of the final word intersection of the contact person of acquisition and contact person and contact person is associated preservation by mode one and mode two, for example, the final word intersection of " Zhang San " and " Zhang San " and its unifying identifier are associated into preservation.
Unifying identifier due to having preset contact person, for convenience centralized and unified management is carried out to the contact person in each social software, step 202 can also set up a unification address list according to the unifying identifier, contact person in the unified address list covers the All Contacts in each social software, and have recorded the corresponding relation of the contact person in the unified address list and the contact person in each social software, correspondingly, point character library of generation directly can carry out corresponding association preservation with the contact person in unified address list.The unified address list can be the address list carried in user terminal, such as cell phone address book, can also individually set up in the user terminal, it is not construed as limiting again, contact person's scope in unified address list can be more than but must include the scope of All Contacts in each social software, and unifying identifier can be added in the details of unified contact person in address list.So user can easily check which the social software belonging to certain contact person has in unified address list, and can quickly find out corresponding point of character library with the corresponding relation of contact person in social software according to contact person in address list is unified when input method calls point character library.
Preferably, the unifying identifier includes user terminal number, contact name, remark names etc..For above-mentioned unified address list, user terminal number (cell-phone number of such as contact person) can be set as unifying identifier, by the way that the cell-phone number of contact person is associated with its Subscriber Number in each social software, you can realize the centralized and unified management to contact person.Contact person " Zhang San " as escribed above, the cell-phone number of " Zhang San " can be associated with its No. QQ and micro-signal in unified address list, step 102 when searching and extracting the word intersection of " Zhang San " by its chat content in QQ and its chat content in wechat are quickly recognized according to the cell-phone number of " Zhang San ", and the word intersection for generating is associated into preservation with the cell-phone number of " Zhang San ".
In order to rapidly search and call contact person associate participle storehouse, step 202 configuration point character library process also include:
According to the unifying identifier of the contact person of the current social software, point character library of contact person's association is searched and transferred.According to the unifying identifier of contact person, corresponding contact person directly can be found in above-mentioned unified address list, and the contact person in unified address list finds point character library of its association.
In order to set up the rational sorting of everyday words in point character library, to ensure to be selected the front portion that input method is arranged in using the high everyday words of temperature for user when input method calls participle storehouse, the process of step 202 configuration point character library is further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
As described above, the data type for including in chat content is varied, wherein contains most important chat content in voice and word, and in order to these important chat contents are converted to into word intersection, the process of step 202 configuration point character library is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
Wherein, the audio parameter includes time interval between modulation in tone, the word of sound etc., voice is split as by multiple phonetic elements according to audio parameter, phonetic element can be individual character either word or phrase, afterwards according to default phonetic element and the mapping relations of word, word is converted speech into.The default phonetic element can be stored as user terminal locally with the mapping relations of word, in may be alternatively stored in the webserver.
The extraction of everyday words in order to realize point character library, the process of step 202 configuration point character library is further included:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
Wherein, carrying out semantic analysis to the word intersection includes, analyze noun, adjective etc., and whether analysis individual character can depend on word composition phrase of its front or behind etc., multiple participles can be obtained by semantic analysis, necessarily contain in the plurality of participle it is a large amount of " ", auxiliary words of mood or the non-personalized word such as " I ", these participles are classified as into stop words, and filter out from multiple participles, just obtain everyday words.
For the everyday words preserved into point character library, service condition of each everyday words in user's chat custom is different, in order to distinguish the use temperature of everyday words, and more reasonably push order is provided during user's use, the process of step 202 configuration point character library is further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
The usage frequency can be counted in participle, that is, count the number of times that everyday words occurs in participle, and according to usage frequency word frequency list is generated;And for the chat custom of user, be not the high everyday words of usage frequency be exactly that the word that uses most is wanted when user chats, in addition it is also necessary to the chatting object based on user considers the weight of everyday words.For example, " Zhang San " is the colleague of user, according to word frequency list, " yes " word frequency is far longer than the word frequency of " Media Stream ", but for colleague, " Media Stream " obviously has higher professional and specific aim, therefore the weight of " Media Stream " can be adjusted by weight so as to which the position in using temperature list is in the front of " yes ".
Preferably, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
The conventional degree of more professional everyday words on the contrary can be lower, and the weight of everyday words can be in inverse ratio with its conventional degree under this scene;Or, everyday words is set as into that the number of times for being placed in first place is directly proportional using during input method according to user, user also can in this way manually adjust the weight of everyday words.
So far, step 202 completes the configuration process of point character library.
In the present embodiment, rule is shown in order to provide the user more reasonably everyday words in interface of input method, step 203 is further included:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, and by the everyday words related to the input method coding of the user input by being included in the interface of input method using temperature order.
Wherein described use temperature is already recorded in the use temperature list generated according to the word frequency and weight of everyday words in point character library configuration process, after the coding of user input is monitored, when matching multiple related everyday words in library of molecules, to be ranked the first using temperature highest everyday words, remaining order is arranged.
Because the everyday words in participle storehouse is not unlimited, therefore, the input method coding of user input possibly cannot match the everyday words of correlation in participle storehouse, and when the coding of user input is long, corresponding short sentence also cannot be matched in participle storehouse.Under this scene, step 203 is further included:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, if not finding the everyday words related to the input method coding of the user input, then the input method coding of the user input is disassembled, according to the multigroup input method coding for disassembling gained, search the coding of the everyday words in the analysis storehouse of contact person association respectively, and by the everyday words related to first group of input method coding for disassembling gained by being included in the interface of input method using temperature order;
After selecting the related everyday words of first group of input method coding for disassembling gained, by the everyday words related to second group of input method coding for disassembling gained by being included in the interface of input method using temperature order.
The rule that the input method coding of the user input is disassembled, can be according to the separator added during user input, or alternate manner, as long as the long sentence of user input coding can be disassembled as word coding, after disassembling, can be continued to match everyday words in point character library according to relatively low coding, if still cannot match can proceed with disassembled.
Preferably, if not finding the everyday words related to the input method coding for disassembling gained in the library of molecules that the contact person is associated, then continue the input method coding to be disassembled for single coding, and the individual character related to the single coding for disassembling gained is included in the interface of input method.Equally, after user is selected the corresponding individual character of first list coding, individual character or everyday words that follow-up coding shows correlation are continued as.
In order to further improve Consumer's Experience, also include before step 203:
The open command of monitoring interface of input method;
After interface of input method open command is monitored, and before the input method coding for getting user input, by the everyday words in point character library that contact person's association is shown using temperature order in the interface of input method.
Namely, after input method starts, if not monitoring the coding of user input, temperature highest everyday words ranks the first used in point character library for directly associating the contact person, thereafter everyday words is arranged according to using temperature order, such user can obtain the recommendation of everyday words without the need for input.For example, " Zhang San " for user good friend, " sound of sighing cry feed " is to use temperature highest everyday words in its corresponding point of character library, as long as then user starts input method, " sound of sighing cry feed " just comes the first place of input method, and user can directly choose and use.Greatly accelerate the typing speed of user.
Hereinafter the technical scheme of the application is elaborated by taking mobile phone terminal as an example.
Cell-phone number comprising multiple contact persons in cell phone address book, the cell-phone number of wherein " Zhang San " is associated with its No. QQ and micro-signal, in order to set up point character library of " Zhang San ", No. QQ and the micro-signal of " Zhang San " are found respectively in QQ programs and wechat program by the cell-phone number of " Zhang San ", and extract the QQ chat records and wechat chat record of " Zhang San " and user respectively, the QQ chat records and wechat chat record of extraction are respectively converted into word intersection and are merged, and the cell-phone number for merging word intersection and " Zhang San " for generating is associated into preservation, afterwards participle is carried out to the word intersection of " Zhang San " and extract everyday words, to be stored in point character library of " Zhang San " more than the everyday words of predetermined value using temperature.So far the configuration of point character library of " Zhang San " is completed.
After point character library is configured, when user initiates again the chat with " Zhang San " in QQ or wechat, starting input method can directly invoke point character library of " Zhang San " association, according to the coding that user is input in interface of input method, corresponding everyday words can be matched in point character library of " Zhang San ", when multiple everyday words are matched, the everyday words of correlation is shown in descending order according to the use temperature in described point of character library for everyday words setting, user can rapidly hit the everyday words for meeting its custom of chatting with " Zhang San ", substantially increase chat speed.
In a typical configuration, computing device includes one or more processors (CPU), input/output interface, network interface and internal memory.
Internal memory potentially includes the volatile memory in computer-readable medium, the such as form such as random access memory (RAM) and/or Nonvolatile memory, read-only storage (ROM) or flash memory (flash RAM).Internal memory is the example of computer-readable medium.
Computer-readable medium includes that permanent and non-permanent, removable and non-removable media can realize information Store by any method or technique.Information can be computer-readable instruction, data structure, the module of program or other data.The example of the storage medium of computer includes, but it is not limited to phase transition internal memory (PRAM), static RAM (SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc read-only storage (CD-ROM), digital versatile disc (DVD) or other optical storages, magnetic cassette tape, tape magnetic rigid disk is stored or other magnetic storage apparatus or any other non-transmission medium, can be used to store the information that can be accessed by a computing device.Define according to herein, computer-readable medium does not include non-temporary computer readable media (transitory media), the such as data-signal and carrier wave of modulation.
As in specification and claim some vocabulary used in censuring specific components.Those skilled in the art are, it is to be appreciated that hardware manufacturer may call same component with different nouns.This specification and claims are not in the way of the difference of title is used as distinguishing component, but the difference with component functionally is used as the criterion distinguished.If the "comprising" in specification in the whole text and claim mentioned in is an open language, therefore should be construed to " include but be not limited to "." substantially " refer in receivable error range, those skilled in the art can solve the technical problem in the range of certain error, basically reach the technique effect.Additionally, " coupling " word here includes any directly and indirectly electric property coupling means.Therefore, if a first device is coupled to a second device described in text, representing the first device can directly be electrically coupled to the second device, or be electrically coupled to the second device indirectly by other devices or coupling means.Specification subsequent descriptions are to implement the better embodiment of the application, and so the description is for the purpose of the rule for illustrating the application, to be not limited to scope of the present application.The protection domain of the application ought be defined depending on the claims person of defining.
It can further be stated that, term " including ", "comprising" or its any other variant are intended to including for nonexcludability, so that not only including those key elements including the commodity or system of a series of key elements, but also including other key elements being not expressly set out, or also include the key element intrinsic for this commodity or system.In the absence of more restrictions, the key element for being limited by sentence "including a ...", it is not excluded that also there is other identical element in the commodity or system including the key element.
Described above illustrates and describes some preferred embodiments of the application, but as previously mentioned, it should be understood that the present invention is not limited to form disclosed herein, it is not to be taken as the exclusion to other embodiment, and can be used for various other combinations, modification and environment, and can be modified by the technology or knowledge of above-mentioned teaching or association area in invention contemplated scope described herein.And change that those skilled in the art are carried out and change be without departing from the spirit and scope of the present invention, then all should be in the protection domain of claims of the present invention.
This application discloses the character library collocation method of A1, a kind of input method, including:
According to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and extracted;
The chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature.
A2, the method as described in A1, according to the contact person of each social software in user terminal, search and extract the chat content between user and the contact person, further include:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
A3, the method as described in A asks 2, by the chat content word intersection is converted to, and the word intersection of generation is associated into preservation with corresponding contact person, is further included:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
A4, the method as described in A2, by the chat content word intersection is converted to, and the word intersection of generation is associated into preservation with corresponding contact person, is further included:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
A5, the method as described in A3 or A4, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
A6, the method as described in A1, will be preserved into point character library of the contact person associated with the word intersection using temperature more than the everyday words of predetermined value, be further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
A7, the scheme as described in A6, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
A8, the method as described in A1, by the chat content word intersection is converted to, and is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
A9, the method as described in A1, carry out participle and extract everyday words to the word intersection, further include:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
A10, the method as described in A1, point character library to contact person association will be preserved using temperature more than the everyday words of predetermined value, are further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
A11, the method as described in A9, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
This application discloses the character library configuration device of B12, a kind of input method, including:
Inquiry unit, for according to the contact person of each social software in user terminal, searching and extracting the chat content between user and the contact person;
Conversion equipment, for the chat content to be converted to into word intersection, and is associated preservation by the word intersection of generation with corresponding contact person;
Participle storage device, for participle to be carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection using temperature more than the everyday words of predetermined value.
B13, the device as described in B12, the inquiry unit is further used for:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
B14, the device as described in B13, the conversion equipment is further used for:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
B15, the device as described in B13, the conversion equipment is further used for:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
B16, the device as described in B14 or B15, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
B17, the device as described in B12, the participle storage device, are further used for:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
B18, the device as described in B17, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
B19, the device as described in B12, the conversion equipment is further used for:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
B20, the device as described in B12, the participle storage device, are further used for:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
B21, the device as described in B12, the participle storage device, are further used for:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
B22, the device as described in B21, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
Disclosed herein as well is the character library calling processing method of C23, a kind of input method, including:
Start input method, the contact person of current social software in identifying user terminal;
Point character library of contact person's association is called, wherein, the configuration process of point character library of contact person's association is:According to the contact person of each social software in user terminal, the chat content between user and the contact person is searched and extracted;The chat content is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;Participle is carried out to the word intersection and everyday words is extracted from participle, will be preserved into point character library of the contact person associated with the word intersection more than the everyday words of predetermined value using temperature;
Obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by being included in interface of input method using temperature order.
C24, the method as described in C23, according to the contact person of each social software in user terminal, search and extract the chat content between user and the contact person, further include:
For each social software in user terminal, according to the social software searching contact person and extract the chat content between user and each contact person.
C25, the method as described in C24, by the chat content word intersection is converted to, and the word intersection of generation is associated into preservation with corresponding contact person, is further included:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to and is merged after word intersection, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
C26, the method as described in C24, by the chat content word intersection is converted to, and the word intersection of generation is associated into preservation with corresponding contact person, is further included:
For each social software, the chat content wherein between each contact person and user is converted to into word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and the word intersection of generation is associated into preservation with the identical contact person and its unifying identifier.
C27, the method as described in C25 or C26, the unifying identifier includes the following combination of one or more:User terminal number, contact name, remark names.
C28, the method as described in C25 or C26, call point character library of contact person's association, further include:
According to the unifying identifier of the contact person of the current social software, point character library of contact person's association is searched and transferred.
C29, the method as described in C23, will be preserved into point character library of the contact person associated with the word intersection using temperature more than the everyday words of predetermined value, be further included:
According to the coding method of input method, parsing obtains the coding of the everyday words preserved in point character library, and by described be stored in point character library corresponding with the everyday words is encoded.
C30, the scheme as described in C29, the coding method, including:Phonetic, shape code, sound shape are combined, ISN.
C31, method as described in C29, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, further including using temperature order:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, and by the everyday words related to the input method coding of the user input by being included in the interface of input method using temperature order.
C32, the method as described in C29, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, further including using temperature order:
According to the input method coding of user input, the coding of the everyday words is searched in the analysis storehouse of contact person association, if not finding the everyday words related to the input method coding of the user input, then the input method coding of the user input is disassembled, according to the multigroup input method coding for disassembling gained, search the coding of the everyday words in the analysis storehouse of contact person association respectively, and by the everyday words related to first group of input method coding for disassembling gained by being included in the interface of input method using temperature order;
After selecting the related everyday words of first group of input method coding for disassembling gained, by the everyday words related to second group of input method coding for disassembling gained by being included in the interface of input method using temperature order.
C33, the method as described in C32, if not finding the everyday words related to the input method coding of the user input, then the input method coding of the user input is disassembled, according to the input method coding for disassembling gained, the coding of the everyday words is searched in the analysis storehouse of contact person association, is further included:
If not finding the everyday words related to the input method coding for disassembling gained in the library of molecules that the contact person is associated, then continue the input method coding to be disassembled for single coding, and the individual character related to the single coding for disassembling gained is included in the interface of input method.
C34, the method as described in C23, by the chat content word intersection is converted to, and is further included:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element and the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
C35, the method as described in C23, carry out participle and extract everyday words to the word intersection, further include:
Semantic analysis is carried out to the word intersection, multiple participles are obtained;
The stop words in the plurality of participle is filtered, everyday words is obtained.
C36, the method as described in C23, point character library to contact person association will be preserved using temperature more than the everyday words of predetermined value, are further included:
The word frequency of the everyday words is marked according to usage frequency, and generates word frequency list;
The weight of the everyday words is set according to preset rules, and adjusts the word frequency list to generate using temperature list according to the weight of the everyday words.
C37, the method as described in C36, the preset rules include:
The everyday words is set as into that the number of times for being placed in first place is proportional in inverse ratio or with during user input word with the common degree of the everyday words.
C38, the method as described in C23, obtain the input method coding of user input, in point character library of the contact person association matching everyday words related to the input method coding of user input and by the everyday words by including in interface of input method, also including using temperature order before:
The open command of monitoring interface of input method;
After interface of input method open command is monitored, and before the input method coding for getting user input, by the everyday words in point character library that contact person's association is shown using temperature order in the interface of input method.

Claims (10)

1. the character library collocation method of a kind of input method, it is characterised in that include:
According to the contact person of each social software in user terminal, search and extract user and the contact person it Between chat content;
The chat content is converted to into word intersection, and by the word intersection for generating and corresponding contact person It is associated preservation;
Participle is carried out to the word intersection and everyday words is extracted from participle, will be using temperature more than predetermined The everyday words of value is preserved into point character library of the contact person associated with the word intersection.
2. the method for claim 1, it is characterised in that soft according to each social activity in user terminal The contact person of part, searches and extracts the chat content between user and the contact person, further includes:
For each social software in user terminal, according to the social software searching contact person and extract Chat content between user and each contact person.
3. method as claimed in claim 2, it is characterised in that the chat content is converted to into text Word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further include:
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein Identical contact person has identical unifying identifier in each social software;
Chat content described in each social software between identical contact person and user is converted to into text Merge after word intersection, and by generate word intersection and the identical contact person and its unifying identifier It is associated preservation.
4. method as claimed in claim 2, it is characterised in that the chat content is converted to into text Word intersection, and the word intersection of generation is associated into preservation with corresponding contact person, further include:
For each social software, the chat content wherein between each contact person and user is converted to into text Word intersection, and the word intersection of generation is associated into preservation with corresponding contact person;
Identical contact person in each social software is recognized in the user terminal by unifying identifier, wherein Identical contact person has identical unifying identifier in each social software;
To merge after the word intersection of identical contact person described in each social software, and will generate Word intersection be associated preservation with the identical contact person and its unifying identifier.
5. the method as described in claim 3 or 4, it is characterised in that
The unifying identifier includes the following combination of one or more:User terminal number, contact name, Remark names.
6. the method for claim 1, it is characterised in that will be using temperature more than predetermined value Everyday words is preserved into point character library of the contact person associated with the word intersection, is further included:
According to the coding method of input method, parsing obtains the volume of the everyday words preserved in point character library Code, by described be stored in point character library corresponding with the everyday words is encoded.
7. scheme as claimed in claim 6, it is characterised in that the coding method, including:Spell Sound, shape code, sound shape are combined, ISN.
8. the method for claim 1, it is characterised in that the chat content is converted to into text Word intersection, further includes:
Extract the voice and word in the chat content;
The voice is split as by multiple phonetic elements according to audio parameter, and according to default phonetic element With the mapping relations of word, the voice is converted to into word;
The word that conversion is obtained is merged into into word intersection with the word in the chat record.
9. the character library configuration device of a kind of input method, it is characterised in that include:
Inquiry unit, for according to the contact person of each social software in user terminal, searching and extracting user With the chat content between the contact person;
Conversion equipment, for the chat content to be converted to into word intersection, and the word intersection that will be generated Preservation is associated with corresponding contact person;
Participle storage device, for participle to be carried out to the word intersection and everyday words is extracted from participle, To be preserved to the contact person associated with the word intersection more than the everyday words of predetermined value using temperature Point character library in.
10. the character library calling processing method of a kind of input method, it is characterised in that include:
Start input method, the contact person of current social software in identifying user terminal;
Point character library of contact person's association is called, wherein, the configuration of point character library of contact person's association Process is:According to the contact person of each social software in user terminal, search and extract user and contact with described Chat content between people;The chat content is converted to into word intersection, and the word intersection that will be generated Preservation is associated with corresponding contact person;Participle is carried out to the word intersection and is extracted from participle normal Word, will be preserved to described with what the word intersection was associated using temperature more than the everyday words of predetermined value Be people point character library in;
The input method coding of user input is obtained, matching and user in point character library of contact person association The related everyday words of the input method coding of input and by the everyday words by being included defeated using temperature order In entering method interface.
CN201510752419.7A 2015-11-06 2015-11-06 Library configuration method, library configuration device and call handling method of input method Pending CN106681523A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510752419.7A CN106681523A (en) 2015-11-06 2015-11-06 Library configuration method, library configuration device and call handling method of input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510752419.7A CN106681523A (en) 2015-11-06 2015-11-06 Library configuration method, library configuration device and call handling method of input method

Publications (1)

Publication Number Publication Date
CN106681523A true CN106681523A (en) 2017-05-17

Family

ID=58863017

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510752419.7A Pending CN106681523A (en) 2015-11-06 2015-11-06 Library configuration method, library configuration device and call handling method of input method

Country Status (1)

Country Link
CN (1) CN106681523A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729420A (en) * 2017-09-27 2018-02-23 维沃移动通信有限公司 A kind of update method and mobile terminal of input method dictionary
CN108133018A (en) * 2017-12-23 2018-06-08 廖赟 A kind of data evidence obtaining recommendation method based on association polymerization
CN108932068A (en) * 2018-06-30 2018-12-04 珠海市魅族科技有限公司 A kind of input method application method and device
CN109213777A (en) * 2017-06-29 2019-01-15 杭州九阳小家电有限公司 A kind of voice-based recipe processing method and system
WO2020168763A1 (en) * 2019-02-19 2020-08-27 苏州龙信信息科技有限公司 Data classification and storage method and apparatus of application program, device, and storage medium
CN113127685A (en) * 2019-12-31 2021-07-16 广州科技贸易职业学院 Video retrieval device and method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101697098A (en) * 2009-09-29 2010-04-21 深圳华为通信技术有限公司 Method and terminal for inputting character
CN102508554A (en) * 2011-10-02 2012-06-20 上海量明科技发展有限公司 Input method with communication association, personal repertoire and system
CN102722483A (en) * 2011-03-29 2012-10-10 百度在线网络技术(北京)有限公司 Method, apparatus and equipment for determining candidate-item sequence of input method
CN103167172A (en) * 2013-02-08 2013-06-19 广州三星通信技术研究有限公司 Integration method and system for variety of chat records
CN103324627A (en) * 2012-03-21 2013-09-25 宇龙计算机通信科技(深圳)有限公司 Terminal and input processing method
CN103870001A (en) * 2012-12-11 2014-06-18 百度国际科技(深圳)有限公司 Input method candidate item generating method and electronic device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101697098A (en) * 2009-09-29 2010-04-21 深圳华为通信技术有限公司 Method and terminal for inputting character
CN102722483A (en) * 2011-03-29 2012-10-10 百度在线网络技术(北京)有限公司 Method, apparatus and equipment for determining candidate-item sequence of input method
CN102508554A (en) * 2011-10-02 2012-06-20 上海量明科技发展有限公司 Input method with communication association, personal repertoire and system
CN103324627A (en) * 2012-03-21 2013-09-25 宇龙计算机通信科技(深圳)有限公司 Terminal and input processing method
CN103870001A (en) * 2012-12-11 2014-06-18 百度国际科技(深圳)有限公司 Input method candidate item generating method and electronic device
CN103167172A (en) * 2013-02-08 2013-06-19 广州三星通信技术研究有限公司 Integration method and system for variety of chat records

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109213777A (en) * 2017-06-29 2019-01-15 杭州九阳小家电有限公司 A kind of voice-based recipe processing method and system
CN107729420A (en) * 2017-09-27 2018-02-23 维沃移动通信有限公司 A kind of update method and mobile terminal of input method dictionary
CN108133018A (en) * 2017-12-23 2018-06-08 廖赟 A kind of data evidence obtaining recommendation method based on association polymerization
CN108133018B (en) * 2017-12-23 2020-09-01 廖赟 Data evidence obtaining recommendation method based on association aggregation
CN108932068A (en) * 2018-06-30 2018-12-04 珠海市魅族科技有限公司 A kind of input method application method and device
WO2020168763A1 (en) * 2019-02-19 2020-08-27 苏州龙信信息科技有限公司 Data classification and storage method and apparatus of application program, device, and storage medium
CN113127685A (en) * 2019-12-31 2021-07-16 广州科技贸易职业学院 Video retrieval device and method

Similar Documents

Publication Publication Date Title
US10575037B2 (en) Video recommending method, server, and storage media
CN106681523A (en) Library configuration method, library configuration device and call handling method of input method
CN104735468B (en) A kind of method and system that image is synthesized to new video based on semantic analysis
KR102081495B1 (en) How to add accounts, terminals, servers, and computer storage media
CN108520046B (en) Method and device for searching chat records
WO2016101577A1 (en) Voice recognition method, client and terminal device
US10108698B2 (en) Common data repository for improving transactional efficiencies of user interactions with a computing device
US20140164371A1 (en) Extraction of media portions in association with correlated input
CN107798143A (en) A kind of information search method, device, terminal and readable storage medium storing program for executing
CN106559314A (en) A kind of cross-platform communication processing method, device
CN108133058B (en) Video retrieval method
CN108121455A (en) Identify method and device for correcting
WO2014054025A1 (en) Method and system for recommending multimedia contents through a multimedia platform
US11922209B2 (en) Invoking functions of agents via digital assistant applications using address templates
CN106559555A (en) A kind of construction method of address list, device
US20170169102A1 (en) Method and electronic device for controlling data query
US20230035155A1 (en) Automated Recording Highlights for Conferences
US20230280974A1 (en) Rendering visual components on applications in response to voice commands
US9454568B2 (en) Method, apparatus and computer storage medium for acquiring hot content
CN106558311A (en) Voice content reminding method and device
KR102135077B1 (en) System for providing topics of conversation in real time using intelligence speakers
CN108509059B (en) Information processing method, electronic equipment and computer storage medium
CN107992758B (en) Dynamic management method and device for security mechanism
CN107918606B (en) Method and device for identifying avatar nouns and computer readable storage medium
CN106559554A (en) A kind of communication processing method, device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170517

RJ01 Rejection of invention patent application after publication