CN111353099A - Input state analysis method and system in multi-language input method - Google Patents

Input state analysis method and system in multi-language input method Download PDF

Info

Publication number
CN111353099A
CN111353099A CN202010113216.4A CN202010113216A CN111353099A CN 111353099 A CN111353099 A CN 111353099A CN 202010113216 A CN202010113216 A CN 202010113216A CN 111353099 A CN111353099 A CN 111353099A
Authority
CN
China
Prior art keywords
node
use data
statistical
state
input state
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010113216.4A
Other languages
Chinese (zh)
Other versions
CN111353099B (en
Inventor
黄奕桐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Aoe Network Technology Co ltd
Original Assignee
Shenzhen Aoe Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Aoe Network Technology Co ltd filed Critical Shenzhen Aoe Network Technology Co ltd
Priority to CN202010113216.4A priority Critical patent/CN111353099B/en
Publication of CN111353099A publication Critical patent/CN111353099A/en
Application granted granted Critical
Publication of CN111353099B publication Critical patent/CN111353099B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The input state analysis method in the multi-language input method provided by the invention extracts the use data of different users in the use process in real time, wherein the use data comprises the current input state and the last input state; filtering the use data according to preset filtering conditions; positioning the node position of the filtered use data in the state transition statistical graph; counting each node in the historical state conversion statistical graph to obtain statistical information of each node; and obtaining the context of each node according to the statistical information of each node, and updating the state transition statistical graph. The method can automatically generate the state transition statistical chart and has high accuracy.

Description

Input state analysis method and system in multi-language input method
Technical Field
The invention belongs to the technical field of multi-language input methods, and particularly relates to an input state analysis method and system in a multi-language input method.
Background
The multi-language input method contains a plurality of languages, the users who use the multi-language input method relate to a plurality of countries, and the multi-language input method can generate a plurality of versions in the process of continuous iterative updating. These new versions will have new characteristics, such as: add functions, modify functions, remove functions, and the like.
During the use process of the user, related staff (such as product personnel, testing personnel or operation personnel) need to check various information and data according to different requirements. For example: how many active users are today? New versions on the week, how many users have updated to the versions? How many users in the united states use a new function? In addition to this, these workers are also interested in statistical information and data, such as: how do the domestic users use the input method in the week? How well these functions are used? Which are used much more? Which are used less? Today, what are the problems with the new spanish version in the united states? How to discover these anomalies? In the prior art, the staff obtains the statistical information by the following method:
first, the dimension and the statistical information in the statistical information are obtained. Dimensions include time (week), country (usa), language (spanish), version (1.9.15), source (Facebook advertising), platform (*** play), and so forth. The statistical information includes input states (e.g., pinyin syllable input, stroke input, associative selection candidates, stroke selection symbols, pinyin selection candidates, scrolling, etc., different input state names). Then, the developer extracts corresponding information in the user using process according to the dimensions and the statistical information to obtain table 1:
table 1:
Figure BDA0002390699070000011
Figure BDA0002390699070000021
and after the background server receives the tables, a specific statistical table is obtained by filtering and summarizing dimension conditions according to specific problems. For example: how do the domestic users use the input method for the week? How well these functions are used? Which are used much more? The statistical table of which uses less is shown in table 2:
table 2:
Figure BDA0002390699070000022
the worker found the following problems in conjunction with table 2, for example: the number of times/users of stroke selection symbols is reduced and the number of users of stroke input is increased. And the staff restores the state transition statistical chart shown in the figure 1 according to the sequence of the input states and the association between the input states according to the table 2.
This method of analyzing statistical information has the following disadvantages: 1. each analysis needs repeated work to obtain a state transition statistical chart, the repeated work is too much, and when the input state is more, the workload is larger. 2. The staff who knows the product very much can know the sequence of the input states and the conversion connection among the input states, and the staff who knows the product not clearly can easily make mistakes or make the input states reverse. 3. Further statistical information is missing, such as: three state transition connections into the associative candidate selection are connected, which connect respective number of users, number of times/person?
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides an input state analysis method and system in a multi-language input method, which can automatically generate a state conversion statistical chart and have high accuracy.
In a first aspect, a method for analyzing input states in a multi-language input method includes the following steps:
extracting use data in the use process of different users in real time, wherein the use data comprises a current input state and a last input state;
filtering the use data according to preset filtering conditions;
positioning the node position of the filtered use data in the state transition statistical graph;
counting each node in the historical state conversion statistical graph to obtain statistical information of each node;
and obtaining the context of each node according to the statistical information of each node, and updating the state transition statistical graph.
Preferably, the filtering the usage data according to a preset filtering condition specifically includes:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
Preferably, the locating the node position of the filtered usage data in the state transition statistical graph specifically includes:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
Preferably, the counting the nodes in the historical state transition statistical graph, and the obtaining the statistical information of the nodes specifically includes:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
Preferably, the obtaining the context of each node according to the statistical information of each node, and the updating the state transition statistical chart specifically includes:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical graph according to the context of all the nodes.
In a second aspect, an input state analysis system in a multi-language input method includes:
a collecting unit: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring use data of different users in use in real time, and the use data comprises a current input state and a last input state;
a filtering unit: the data processing device is used for filtering the use data according to preset filtering conditions;
a positioning unit: the node position of the filtered use data in the state transition statistical graph is positioned;
a statistic unit: the statistical information acquisition module is used for carrying out statistics on each node in the historical state conversion statistical graph to obtain statistical information of each node;
an update unit: and the statistical graph is used for obtaining the context of each node according to the statistical information of each node and updating the state transition statistical graph.
Preferably, the filtration unit is particularly adapted to:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
Preferably, the positioning unit is specifically configured to:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
Preferably, the statistical unit is specifically configured to:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
Preferably, the updating unit is specifically configured to:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical graph according to the context of all the nodes.
According to the technical scheme, the input state analysis method and the input state analysis system in the multi-language input method can automatically generate the state conversion statistical chart, and are high in accuracy.
Drawings
In order to more clearly illustrate the detailed description of the invention or the technical solutions in the prior art, the drawings that are needed in the detailed description of the invention or the prior art will be briefly described below. Throughout the drawings, like elements or portions are generally identified by like reference numerals. In the drawings, elements or portions are not necessarily drawn to scale.
Fig. 1 is a state transition statistical diagram generated by a conventional method in the background art.
Fig. 2 is a flowchart of an input state analysis method according to an embodiment of the present invention.
Fig. 3 is a flow chart of the filtering method of fig. 2.
Fig. 4 is a flowchart of the positioning method in fig. 2.
FIG. 5 is a flow chart of the statistical method of FIG. 2.
Fig. 6 is a flowchart of the updating method in fig. 2.
FIG. 7 is a generated state transition statistical chart according to an embodiment of the present invention.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and therefore are only examples, and the protection scope of the present invention is not limited thereby. It is to be noted that, unless otherwise specified, technical or scientific terms used herein shall have the ordinary meaning as understood by those skilled in the art to which the invention pertains.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
It is also to be understood that the terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the specification of the present invention and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise.
It should be further understood that the term "and/or" as used in this specification and the appended claims refers to and includes any and all possible combinations of one or more of the associated listed items.
As used in this specification and the appended claims, the term "if" may be interpreted contextually as "when", "upon" or "in response to a determination" or "in response to a detection". Similarly, the phrase "if it is determined" or "if a [ described condition or event ] is detected" may be interpreted contextually to mean "upon determining" or "in response to determining" or "upon detecting [ described condition or event ]" or "in response to detecting [ described condition or event ]".
The state transition statistical diagram comprises the following basic graphic elements: a plurality of input states and a connection relationship between the input states. The input state refers to the state from thick to thin in the input process, such as 9-key pinyin, 9-key pinyin-pinyin input-spelling syllables, 9-key pinyin-pinyin input-selection syllables, 9-key pinyin-candidate on-screen, 9-key pinyin-word association-page down, 9-key pinyin-word association-page up and the like.
The first embodiment is as follows:
referring to fig. 2, an input state analysis method in a multilingual input method includes the steps of:
s1: extracting use data in the use process of different users in real time, wherein the use data comprises a current input state and a last input state;
specifically, the usage data includes certain statistical information, such as the number of state occurrences, the number of transition occurrences, the transition rate, and the like, in addition to the current input state and the last input state. Because the user data collected in the existing method lacks the conversion information of the input state, the conversion relation between the previous input state and the current input state cannot be maintained, so that the method saves the previous input state when collecting the use data, which is equivalent to saving the conversion information of the input state. For example, the usage data shown in table 3 was collected:
table 3:
Figure BDA0002390699070000081
in addition, the input state is not repeated, if the current input states of two pieces of use data are consistent, the two pieces of use data are the same input state, and therefore, one piece of use data can uniquely confirm the current input state and the last input state of the user.
S2: filtering the use data according to preset filtering conditions; referring to fig. 3, the method specifically includes:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
In particular, dimensions include users, languages, countries, versions, sources, platforms, and the like. When a worker restores the state transition statistical chart, the worker usually puts forward requirements including defined dimension conditions. These requirements thus constitute a filter condition in which there is a "and" or "relationship between the dimensions. Therefore, when the method collects the use data, the collected use data needs to be filtered through the filtering condition, for each piece of use data, the dimension of the filtering condition is used for matching, and if at least one dimension is not met, the piece of use data is filtered. When the using data are sequentially matched with the dimensions in the filtering condition, the using data are firstly matched with the first dimension in the filtering condition, when the using data do not accord with the dimensions, the using data are filtered, when the using data accord with the dimensions, the using data are matched with the next dimension in the filtering condition, and the steps are repeated until all the dimensions are matched.
S3: positioning the node position of the filtered use data in the state transition statistical graph; referring to fig. 4, the method specifically includes:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
Specifically, at the very beginning, there are no elements on the state transition statistical graph: including nodes, connections, and any statistical information, the initial state transition statistical map is an empty map. When a current input state is received, whether a corresponding node exists or not is firstly found on the state transition statistical graph, because all the input states are not repeated, if the node exists on the state transition statistical graph, the corresponding node position can be located, and if the node does not exist, a new node named by the name of the current input state is created on the state transition statistical graph. And similarly, the node position of the last input state is positioned by adopting the method. When the positions of the nodes in the current input state and the previous input state are well positioned, a connection relationship from the node corresponding to the previous input state to the node corresponding to the current input state is made to represent the context of state conversion, the context between the nodes in the state conversion statistical graph is only preliminarily confirmed according to a piece of use data, and the context is not necessarily true or accurate.
S4: counting each node in the historical state conversion statistical graph to obtain statistical information of each node; referring to fig. 5, the method specifically includes:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
Specifically, after the method completes positioning, statistics is performed on data of the nodes and the connection relations to obtain statistical information (the number of times of use and the number of unrepeated users) of the nodes or the connection relations. And then carrying out statistics based on the number of users and statistics based on the times according to the statistical information. For example: the association state can be switched to pinyin input and stroke input respectively, and 72% of users enter pinyin input from the association state and 28% of users enter stroke input from the association state through statistics based on the number of users. 82% of times can be obtained through statistics based on times, and the pinyin input is carried out from the association state, and 18% of times is carried out from the association state to the stroke input.
S5: and obtaining the context of each node according to the statistical information of each node, and updating the state transition statistical graph. Referring to fig. 6, the method specifically includes:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical chart according to the context of all the nodes, wherein the updated state transition statistical chart is shown in FIG. 7.
Specifically, the method has so far restored most of the information of the state transition statistical map, and only lacks accurate context between the respective input states. Since the input states are convertible, for example, the pinyin input state may enter the association selection state, and the association selection state may also enter the pinyin input state, it is not possible to accurately determine which input state is the preceding input state and which input state is the following input state by using only one piece of usage data in step S3.
In the state transition histogram, all other input states except the initial input state have the input state of the source. Therefore, based on this principle, the statistical information (number of times, number of people) of the initial input state is certainly the largest, the statistical information (number of times, number of people) of the later input state is smaller than that of the previous input state, and if the statistical information of two connected nodes is completely consistent, it shows that the two nodes do not have strict context, and they only show 1:1 interconversion.
On the state transition statistical diagram, if two nodes are not connected, the two nodes are not in a transition relation, the two nodes do not have a sequential meaning, and the front-back relation of the two nodes is not required to be embodied on the state transition statistical diagram.
When the statistical information is used for judging the front-back relation of the state transition statistical chart, the using times are compared preferentially, and then the number of users is compared, namely the nodes before the using times are larger, and the nodes before the using times are larger.
The method comprises the steps of firstly collecting enough use data, then filtering the use data, positioning the node positions of the filtered use data in a state transition diagram according to diagram position information of an input state, then counting and counting on the input state and a corresponding connection relation, finally restoring the front-back relation between nodes in the state transition diagram according to the statistical results of a large amount of use data, and updating the state transition statistical diagram, so that the state transition statistical diagram can be automatically generated, and the accuracy is high.
Example two:
an input state analysis system in a multi-language input method, comprising:
a collecting unit: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring use data of different users in use in real time, and the use data comprises a current input state and a last input state;
a filtering unit: the data processing device is used for filtering the use data according to preset filtering conditions;
a positioning unit: the node position of the filtered use data in the state transition statistical graph is positioned;
a statistic unit: the statistical information acquisition module is used for carrying out statistics on each node in the historical state conversion statistical graph to obtain statistical information of each node;
an update unit: and the statistical graph is used for obtaining the context of each node according to the statistical information of each node and updating the state transition statistical graph.
Preferably, the filtration unit is particularly adapted to:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
Preferably, the positioning unit is specifically configured to:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
Preferably, the statistical unit is specifically configured to:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
Preferably, the updating unit is specifically configured to:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical graph according to the context of all the nodes.
The system firstly collects enough use data, then filters the use data, positions the node positions of the filtered use data in the state transition diagram according to diagram position information of an input state, counts and counts on the input state and the corresponding connection relation, and finally restores the front-back relation between the nodes in the state transition diagram according to the statistical results of a large amount of use data and updates the state transition statistical diagram, so that the state transition statistical diagram can be automatically generated, and the accuracy is high.
For the sake of brief description, the system provided by the embodiment of the present invention may refer to the corresponding content in the foregoing method embodiments.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; such modifications and substitutions do not depart from the spirit and scope of the present invention, and they should be construed as being included in the following claims and description.

Claims (10)

1. An input state analysis method in a multi-language input method is characterized by comprising the following steps:
extracting use data in the use process of different users in real time, wherein the use data comprises a current input state and a last input state;
filtering the use data according to preset filtering conditions;
positioning the node position of the filtered use data in the state transition statistical graph;
counting each node in the historical state conversion statistical graph to obtain statistical information of each node;
and obtaining the context of each node according to the statistical information of each node, and updating the state transition statistical graph.
2. The method for analyzing input state in multi-lingual input method according to claim 1, wherein the filtering the usage data according to the predetermined filtering condition specifically comprises:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
3. The method for analyzing an input state in a multi-lingual input method according to claim 1, wherein the locating the node position of the filtered usage data in the state transition statistical graph specifically comprises:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
4. The method for analyzing input state in multi-language input method according to claim 3, wherein said counting each node in the historical state transition statistical graph to obtain statistical information of each node specifically comprises:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
5. The method according to claim 4, wherein the obtaining of the context of each node based on the statistical information of each node and the updating of the state transition statistical chart specifically comprises:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical graph according to the context of all the nodes.
6. An input state analysis system in a multilingual input method, comprising:
a collecting unit: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring use data of different users in use in real time, and the use data comprises a current input state and a last input state;
a filtering unit: the data processing device is used for filtering the use data according to preset filtering conditions;
a positioning unit: the node position of the filtered use data in the state transition statistical graph is positioned;
a statistic unit: the statistical information acquisition module is used for carrying out statistics on each node in the historical state conversion statistical graph to obtain statistical information of each node;
an update unit: and the statistical graph is used for obtaining the context of each node according to the statistical information of each node and updating the state transition statistical graph.
7. The system for analyzing input state in a multilingual input method of claim 6, wherein the filtering unit is specifically configured to:
setting a filtering condition comprising a plurality of dimensions;
matching the use data with dimensions in the filtering condition in sequence;
when the usage data conforms to all dimensions in the filtering condition, retaining the usage data;
the usage data is filtered when the usage data does not conform to at least one dimension of the filtering criteria.
8. The system for analyzing an input state in a multilingual input method of claim 6, wherein the locating unit is specifically configured to:
acquiring a historical state conversion statistical chart corresponding to the filtering condition;
acquiring a current input state in use data, and judging whether a node which is consistent with the current input state in the use data exists in a historical state conversion statistical graph or not; if yes, defining the node as the current node of the usage data; if the current node does not exist, a new node is created in the historical state conversion statistical graph according to the current input state in the use data and is used as the current node of the use data;
acquiring a previous input state in use data, judging whether a node which is consistent with the previous input state in the use data exists in a historical state conversion statistical graph or not, if so, defining the node as the previous node of the use data, and if not, creating a new node in the historical state conversion statistical graph according to the previous input state in the use data to serve as the previous node of the use data;
and establishing the connection relationship from the last node to the current node in the historical state transition statistical graph.
9. The system for analyzing input state in a multilingual input method of claim 8, wherein the statistical unit is specifically configured to:
recording the use times and user ids of each node, and adding one to the number of users of the node when a new user id appears in the node to obtain the statistical information of the node;
recording the use times and user id of the connection relationship, and adding one to the number of users of the connection relationship when a new user id appears in the connection relationship to obtain the statistical information of the connection relationship;
and carrying out statistics based on the number of users and statistics based on the times according to the statistical information of each node and the connection relation.
10. The system for analyzing an input state in a multilingual input method of claim 9, wherein the updating unit is specifically configured to:
acquiring two nodes which are connected with each other in a historical state conversion statistical graph;
defining the node with the maximum use times or the maximum user quantity in the statistical information of the two nodes as a front node and the other node as a rear node;
determining the front-back relationship of the two nodes according to the front node and the back node;
and updating the state transition statistical graph according to the context of all the nodes.
CN202010113216.4A 2020-02-24 2020-02-24 Input state analysis method and system in multi-language input method Active CN111353099B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010113216.4A CN111353099B (en) 2020-02-24 2020-02-24 Input state analysis method and system in multi-language input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010113216.4A CN111353099B (en) 2020-02-24 2020-02-24 Input state analysis method and system in multi-language input method

Publications (2)

Publication Number Publication Date
CN111353099A true CN111353099A (en) 2020-06-30
CN111353099B CN111353099B (en) 2023-12-12

Family

ID=71195758

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010113216.4A Active CN111353099B (en) 2020-02-24 2020-02-24 Input state analysis method and system in multi-language input method

Country Status (1)

Country Link
CN (1) CN111353099B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101984389A (en) * 2010-11-19 2011-03-09 王博 Method, device and terminal device for inputting Chinese characters
CN105808923A (en) * 2016-02-29 2016-07-27 北京航空航天大学 Anomaly detection method and device of data sequence

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101984389A (en) * 2010-11-19 2011-03-09 王博 Method, device and terminal device for inputting Chinese characters
CN105808923A (en) * 2016-02-29 2016-07-27 北京航空航天大学 Anomaly detection method and device of data sequence

Also Published As

Publication number Publication date
CN111353099B (en) 2023-12-12

Similar Documents

Publication Publication Date Title
US7634741B2 (en) Method and apparatus for managing a selection list based on previous entries
EP3832488A2 (en) Method and apparatus for generating event theme, device and storage medium
US7313514B2 (en) Validating content of localization data files
CN104036004B (en) Search for error correction method and search error correction device
KR20160030351A (en) Method and device for processing information
CN111224843B (en) Resource link monitoring method, device, equipment and storage medium
CN111125438A (en) Entity information extraction method and device, electronic equipment and storage medium
CN111767334A (en) Information extraction method and device, electronic equipment and storage medium
CN115576974B (en) Data processing method, device, equipment and medium
EP4224338A1 (en) Data labeling method, apparatus, and system, device, and storage medium
CN110046086B (en) Expected data generation method and device for test and electronic equipment
CN111353099A (en) Input state analysis method and system in multi-language input method
CN111522863A (en) Topic concept mining method, device, equipment and storage medium
CN116074183B (en) C3 timeout analysis method, device and equipment based on rule engine
CN111949820A (en) Video associated interest point processing method and device and electronic equipment
CN111026916A (en) Text description conversion method and device, electronic equipment and storage medium
CN112560459B (en) Sample screening method, device, equipment and storage medium for model training
CN111860526B (en) Image-based question judging method and device, electronic equipment and computer storage medium
CN115600592A (en) Method, device, equipment and medium for extracting key information of text content
CN109785099B (en) Method and system for automatically processing service data information
CN107818177B (en) Business intelligent model building method and building device
CN111737398A (en) Method and device for searching sensitive words in text, electronic equipment and storage medium
CN111339756A (en) Text error detection method and device
KR102526873B1 (en) Electronic map search method, device, facility and medium
CN110929502A (en) Text error detection method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant