CN1525388A - Hanzi processing equipment and method - Google Patents

Info

Publication number
CN1525388A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2004100072858A
Other languages
Chinese (zh)
Inventor
刘辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp

Landscapes

  • Document Processing Apparatus (AREA)
  • Character Discrimination (AREA)

Abstract

To provide an apparatus, a method, and a program for character input that enable a user to input hard-to-pronounce multilingual characters without relying on a word dictionary and a character input program prepared for each language. The apparatus comprises a tablet 16, which receives handwritten input in one language (the first language) and an instruction to convert that input into another language (the second language); a kanji conversion program 22, which retrieves the character code in the first language that corresponds to the handwritten character data input from the tablet 16; and a kanji code table 24, which stores, for each character shared by a plurality of languages, the character code used in each language, and which is used to convert the retrieved character code into the character code for the second language. (C) 2004, JPO & NCIPI.

Description

Chinese character processing apparatus and Chinese character processing method
Technical field
The present invention relates to a character input apparatus and method for inputting Chinese characters in multiple languages (simplified and traditional Chinese characters, Japanese kanji, Korean hanja, and other Chinese characters).
Background art
Several modern languages make regular use of the symbols known as Chinese characters. The major languages whose writing systems use Chinese characters include Japanese, Korean, and the Chinese languages (Standard Chinese, Cantonese, and other dialects).
Methods for inputting Chinese characters into a computer include the "phonetic input method", in which the reading of a Chinese character entered from a keyboard or the like is converted into the corresponding Chinese character, and the "handwriting input method", in which a character pattern written with a handwriting input device such as a tablet or mouse is converted into a character code.
In the prior art, Japanese Patent Application Publication No. 8-137885 describes a method based on the phonetic input method in which a multilingual character string containing Japanese, Korean, and Chinese is entered as a sequence of phonetic symbols and converted with reference to a dictionary.
In the method described in Japanese Patent Application Publication No. 8-137885, a character input program for the language to be processed is loaded according to an input language flag, and the loaded program converts the phonetic symbols entered together with that language flag into the character codes of the corresponding language.
The prior art thus uses the phonetic input method: phonetic symbols (for example, Roman characters) are entered from a keyboard and then converted into Chinese characters. Consequently, conversion to Chinese characters is impossible unless the phonetic symbols are entered. With this method, when an operator enters a document written in a foreign language (Chinese, Japanese, or Korean), the Chinese characters cannot be input or converted if the operator cannot enter the corresponding phonetic symbols accurately or does not know the readings. For example, a Japanese operator entering Chinese characters (simplified or traditional) has difficulty because some of those characters do not exist in Japanese.
In addition, a character input program must be prepared for each language. A dictionary must also be prepared for each language that contains a large number of words and gives the correspondence between input character strings and display character strings. In Japanese, for instance, a single Chinese character has several readings, so a dictionary is needed that contains a large number of words and maps each input character string (the phonetic symbols of a word) to its display character string (Chinese characters, kana, and so on). It is therefore difficult to input multilingual character strings on a small portable device whose memory has only limited capacity.
Summary of the invention
An object of the present invention is to provide a character input apparatus and method with which characters of unknown pronunciation in multiple languages can be input without preparing a character input program and a dictionary for each language.
According to one embodiment of the present invention, there is provided a Chinese character processing apparatus comprising: a unit for specifying a first language and a second language; a unit for inputting data on a handwritten character; a unit for generating the character code in the first language that corresponds to the data on the handwritten character; and a unit for converting the character code generated by the generating unit into a character code in the second language.
Description of drawings
Fig. 1 is a block diagram of the system configuration of a computer 1 that implements a character processing apparatus according to an embodiment of the present invention;
Fig. 2 is a block diagram of the functional configuration realized by executing programs in the system configuration according to the present invention;
Fig. 3 shows the external configuration of the computer 1 and an example of a display screen used to input characters in multiple languages according to the present invention;
Fig. 4 is a table showing an example of the multilingual Chinese character code table 24 according to the present invention;
Fig. 5 shows a character code composed of a language attribute part and a number part according to the present embodiment;
Fig. 6 is a table showing an example of the multilingual compound word code table 25 according to the present embodiment;
Fig. 7 is a table of correspondences between pinyin codes and compound words according to the present embodiment;
Figs. 8A and 8B are tables of correspondences between Japanese compound words and (simplified) Chinese compound words that have the same meaning, according to the present invention;
Fig. 9 is a flowchart showing the character input operation according to the present embodiment;
Fig. 10 is a flowchart showing the character input operation according to the present embodiment;
Fig. 11 is a flowchart showing the character input operation according to the present embodiment;
Fig. 12 shows an example of inputting (converting) a kanji by handwriting according to the present embodiment;
Fig. 13 shows an example in which a handwritten kanji is converted into a simplified Chinese character and input according to the present embodiment;
Fig. 14 shows an example in which a handwritten simplified Chinese character is converted into a kanji and input according to the present embodiment;
Fig. 15 shows an example in which a handwritten Chinese compound word (simplified) is converted into a Japanese compound word and input according to the present embodiment;
Fig. 16 shows an example (1) in which a handwritten Japanese compound word is converted into a Chinese compound word (simplified) and input according to the embodiment;
Fig. 17 shows an example (2) in which a handwritten Japanese compound word is converted into a Chinese compound word (simplified) and input according to the embodiment;
Fig. 18 shows an example (2) in which a handwritten Japanese compound word is converted into a Chinese compound word (simplified) and input according to the embodiment;
Fig. 19 shows an example in which a handwritten Chinese compound word (simplified) is converted into a Japanese compound word and input according to the present embodiment; and
Fig. 20 is a schematic diagram of an example in which a handwritten Chinese compound word (simplified) is converted into a Japanese compound word and input according to the present embodiment.
Embodiment
Embodiments of the present invention are described below with reference to the accompanying drawings.
Fig. 1 is a block diagram of the system configuration of a computer 1 that implements a character processing apparatus according to an embodiment of the present invention (only the main components are shown). The computer 1 loads a program recorded on a recording medium such as a semiconductor memory, CD-ROM, DVD, or magnetic disk, and its operation is controlled by that program. The computer 1 is implemented, for example, as a small portable device such as a personal digital assistant (PDA). The computer 1 has functions for accepting Chinese characters of multiple languages by handwriting, recognizing those characters, and converting them into other languages.
As shown in Fig. 1, the computer 1 (the character processing apparatus according to the present embodiment) is equipped with a CPU 10, a memory 12, a display panel 14, a tablet 16 (with a pen 19), and an input device 18.
The CPU 10 controls each part according to the various programs recorded in the memory 12. When characters are input, the CPU 10 controls character input according to the handwritten character recognition program 20 (together with the global IME 21) and the character input programs (the Chinese character conversion program 22 and the compound word conversion program 23) stored in the memory 12.
The computer 1 according to the present embodiment uses the handwriting input method, in which characters are input on the basis of handwritten character data entered from the tablet 16. By processing the handwritten character data according to the character input programs, a (Chinese) character can be input in any chosen language, and a handwritten character can be converted into the same (Chinese) character in any other chosen language, that is, into a Chinese character that has the same origin as the input character but slightly different strokes.
In addition, the character input programs can convert not only single characters but also words that each consist of a plurality of (Chinese) characters (hereinafter called compound words) as a unit. In this case, for a compound word that has the same meaning but is written with different characters in different languages, the handwritten characters are converted into the compound word that has the correct meaning and uses the different Chinese characters.
The memory 12 records the various programs executed by the CPU 10 and their data. The recorded programs and data include not only a basic control program (OS) but also the programs related to character input, namely the handwritten character recognition program 20, the global IME 21, and the character input programs (the Chinese character conversion program 22 and the compound word conversion program 23), as well as the Chinese character code table 24 (see Fig. 4, described later) and the compound word code table 25 (see Fig. 6, described later), which are used in executing the Chinese character conversion program 22 and the compound word conversion program 23, respectively.
The display panel 14 displays various screens when the CPU 10 executes the programs. When characters are input by handwriting, the display panel 14 provides the input interface used for handwriting (see Fig. 3, described later).
The tablet 16 serves as a pointing device. Its input surface overlaps the display panel 14. With the pen 19, direct instructions can be given on the input surface of the tablet 16 to objects (keys and buttons) shown on the display panel 14. In addition, a character pattern (a sequence of coordinate data) can be input by handwriting a character with the pen 19 in a designated area of the display.
The input device 18 comprises mechanical switches and buttons, namely a power key, cursor keys, and other function keys and buttons.
Fig. 2 is a block diagram of the functional configuration that the CPU 10 realizes in the system configuration shown in Fig. 1 by executing programs including the character input programs. As shown in Fig. 2, the computer 1 (character processing apparatus) has a handwriting input unit 30, a mode selection unit 31, a handwritten character recognition unit 32, a Chinese character conversion unit 33, a compound word conversion unit 35, and a Chinese character display unit 37.
The handwriting input unit 30 receives, through the tablet 16, data on a handwritten character, that is, sequences of coordinate data representing the strokes that make up the character.
The mode selection unit 31 receives, through the tablet 16, mode selections and similar designations in the form of position instructions on the keys provided in the input interface shown on the display panel 14. The mode selection unit 31 has a first input unit, a second input unit, a third input unit, and a conversion instruction input unit. The first input unit receives the input mode selection, which specifies the language of the handwritten characters (the first language). The second input unit receives the conversion mode selection, which specifies the target language (the second language) into which characters in the language specified by the input mode selection are converted. The third input unit receives the compound word conversion designation, which specifies that the input is a compound word made up of a plurality of (Chinese) characters.
The handwritten character recognition unit 32 recognizes the character from the handwritten character data input through the handwriting input unit 30.
The Chinese character conversion unit 33 searches the Chinese character code table 24 for the character code used in the language specified by the input mode selection, that is, the code corresponding to the character recognized by the handwritten character recognition unit 32. Each character code recorded in the Chinese character code table 24 consists of a number part, which represents the code common to all languages, and a language attribute part, which indicates the language (see Fig. 5). The Chinese character conversion unit 33 then converts the character code retrieved from the Chinese character code table 24 into the character code used in the language specified by the conversion mode selection.
If the compound word conversion designation has been input, the compound word conversion unit 35 converts the plurality of character codes obtained by the Chinese character conversion unit 33 into the character code sequence of the compound word (character string) recorded in the compound word code table 25 for the language specified by the conversion mode selection. If no corresponding compound word is recorded in the compound word code table 25, the compound word conversion unit 35 converts the plurality of character codes obtained by the Chinese character conversion unit 33 into a pinyin code sequence (a common code sequence), searches the character code sequences for a compound word that corresponds to this pinyin code sequence, and outputs it as the conversion result. Pinyin denotes the phonetic symbols used for the pronunciation of Chinese characters, and a pinyin code is a dedicated code assigned to the corresponding Chinese characters.
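The two-stage lookup performed by the compound word conversion unit 35 can be illustrated with a minimal Python sketch. It is one possible reading of the description, not the patented implementation: the table layout, the function name convert_compound, and every code value are assumptions invented for illustration.

```python
from typing import Optional, Tuple

Codes = Tuple[str, ...]

# Hypothetical compound word code table: each entry holds one shared pinyin
# code sequence and a character code sequence per language. All values are
# invented for illustration.
COMPOUND_TABLE = [
    {"pinyin": ("Ke", "Zhang"),
     "codes": {"JP": ("JP1001", "JP1002"), "CJ": ("CJ1003", "CJ1002")}},
    {"pinyin": ("Gong", "Chang"),
     "codes": {"CJ": ("CJ3001", "CJ3002")}},
]

# Hypothetical pinyin code for each four-digit number part.
PINYIN_BY_NUMBER = {"1001": "Ke", "1002": "Zhang", "1003": "Ke",
                    "3001": "Gong", "3002": "Chang"}


def convert_compound(codes: Codes, source: str, target: str) -> Optional[Codes]:
    """Convert a per-character code sequence into the target-language compound."""
    # Stage 1: the sequence is recorded directly for the source language.
    for entry in COMPOUND_TABLE:
        if entry["codes"].get(source) == codes and target in entry["codes"]:
            return entry["codes"][target]
    # Stage 2: fall back to the shared pinyin code sequence.
    pinyin = tuple(PINYIN_BY_NUMBER.get(code[2:]) for code in codes)
    if None in pinyin:
        return None
    for entry in COMPOUND_TABLE:
        if entry["pinyin"] == pinyin and target in entry["codes"]:
            return entry["codes"][target]
    return None
```

Because several compound words can share one pinyin code sequence, the fallback in this sketch simply returns the first match; a real device would presumably need some way to choose among candidates.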
The Chinese character display unit 37 displays Chinese characters on the display panel 14 in the language specified by the conversion mode selection, according to the character code obtained by the Chinese character conversion unit 33 or the character code sequence obtained by the compound word conversion unit 35.
Fig. 3 shows the external configuration of the computer 1 and an example of the display screen (input interface) used to input multilingual characters.
As shown in Fig. 3, the computer 1 is configured as a small portable device. Its top surface carries the buttons 18a included in the input device 18 and an input/display section in which the display surface of the display panel 14 and the input surface of the tablet 16 overlap.
The input/display section in Fig. 3 shows an example of the display screen (input interface) used to input multilingual characters by handwriting. As shown in Fig. 3, the input/display section has a handwriting area 40 in which characters are handwritten; a plurality of language selection keys 41 to 44 for selecting the input mode and designating the language; a language conversion key 45 for instructing execution of language conversion; a compound word key 46 for designating compound word conversion; a character display area 50 that displays handwritten characters after they have been converted into the language specified by the mode selection; and a backspace (BS) key 47 and a delete (Del) key 48 for editing the character string (sentence) in the character display area 50.
The handwriting area 40 may be fixed, or it may pop up when handwriting input is needed. Fig. 3 shows three boxes, each of which accepts one character; however, the form of the handwriting area 40 can be set freely. The language selection keys 41 to 44 of the present embodiment designate four languages: a Japanese (JP) key 41, a Korean (KR) key 42, a simplified Chinese (CJ) key 43, and a traditional Chinese (CF) key 44.
Providing the language selection keys 41 to 44 ("CJ", "CF", "JP", and "KR"), the language conversion key 45, the compound word key 46, and the other instruction keys on the tablet 16 makes it easy to switch the language for both the input mode selection and the conversion mode selection. In addition, providing the compound word key 46, which instructs conversion of a compound (word) containing two or more Chinese characters, allows compound words to be input quickly and accurately.
Suppose, for example, that a handwritten Chinese character is to be converted into the Chinese character of the same concept and origin in another language because the user cannot write the character in the target language. The user first presses one of the language selection keys 41 to 44 ("CJ", "CF", "JP", or "KR") to switch to the input mode for a language that the user can write by hand. The user then presses another of the language selection keys 41 to 44 in the same way to designate the target language of the conversion, and presses the language conversion key 45 to instruct execution of the conversion. In this state, when the user handwrites a character on the tablet 16, the character is automatically converted into the Chinese character used in the designated target language, and the resulting character is displayed (input).
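The key sequence described above can be modelled as a small state object. The following Python sketch is only illustrative: the class ModeState, the key names "CONVERT" and "COMPOUND", and the rule that the first language key sets the input language and a later one sets the target language are assumptions, since the text does not say how the device tells the two presses apart.

```python
from dataclasses import dataclass
from typing import Optional

LANGUAGE_KEYS = {"JP", "KR", "CJ", "CF"}  # keys 41 to 44


@dataclass
class ModeState:
    input_language: Optional[str] = None   # first language (input mode)
    target_language: Optional[str] = None  # second language (conversion mode)
    convert: bool = False                  # language conversion key 45 pressed
    compound: bool = False                 # compound word key 46 pressed

    def press(self, key: str) -> None:
        if key in LANGUAGE_KEYS:
            # Assumed rule: first language key = input, later key = target.
            if self.input_language is None:
                self.input_language = key
            else:
                self.target_language = key
        elif key == "CONVERT":
            # Conversion only makes sense once a target language is chosen.
            self.convert = self.target_language is not None
        elif key == "COMPOUND":
            self.compound = True
        else:
            raise ValueError(f"unknown key: {key!r}")

    def output_language(self) -> Optional[str]:
        """Language in which the handwritten character will be displayed."""
        return self.target_language if self.convert else self.input_language
```

Pressing "JP", "CJ", and "CONVERT" in that order yields simplified Chinese output for Japanese handwriting, while pressing "JP" alone leaves the output in Japanese.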
If, on the other hand, a compound word in one language is to be converted into a compound word in another language, the user additionally presses the compound word key 46. This allows a compound word to be input and converted correctly (for example, into a compound word that has the same meaning but different characters).
An example of the multilingual Chinese character code table 24 is now described with reference to Fig. 4.
For each Chinese character common to multiple languages, the Chinese character code table 24 holds the character code used in each language and a code common to those character codes (here, a pinyin code); the character codes and the common code are associated with one another across the languages. Fig. 4 shows only simplified Chinese characters, traditional Chinese characters, and kanji as example languages.
Chinese, Japanese, and Korean (CJK) together contain approximately 21,000 Chinese characters. The main basic character sets are GB 2312 (simplified Chinese) and Big5 (traditional Chinese) for Chinese, JIS X 0208 and JIS X 0212 for Japanese, and KS C 5601 for Korean. Even Chinese characters with the same concept and origin (that is, Chinese characters common to multiple languages) often differ slightly in shape (in the direction or length of strokes) among these countries for historical and cultural reasons.
Chinese characters that have the same origin but slightly different strokes are represented by character codes, each of which combines a language attribute part indicating the language (CJ, CF, JP, or KR) with a four-digit number part (0000 to 9999).
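The code format just described, a two-letter language attribute part followed by a four-digit number part, can be sketched in Python as follows. The names CharCode and parse_code are hypothetical; only the format itself comes from the description.

```python
import re
from typing import NamedTuple


class CharCode(NamedTuple):
    language: str  # language attribute part: CJ, CF, JP, or KR
    number: int    # number part, 0000 to 9999

    def __str__(self) -> str:
        # Zero-pad so that the number part is always four digits.
        return f"{self.language}{self.number:04d}"


_CODE_RE = re.compile(r"(CJ|CF|JP|KR)(\d{4})")


def parse_code(text: str) -> CharCode:
    """Split a code such as 'JP2040' into its two parts."""
    match = _CODE_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"not a valid character code: {text!r}")
    return CharCode(match.group(1), int(match.group(2)))
```

Keeping the two parts separate makes it straightforward to compare characters across languages by number part alone.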
In addition, Chinese characters of the same origin are pronounced differently in different languages, and a single Chinese character may have several readings. Each Chinese character is therefore recorded together with a pinyin code, which is common across languages to the Chinese characters of the same origin and consists of the phonetic symbols used in Standard Chinese. In Chinese, a given character is pronounced the same whether it is written in simplified or traditional form, and it has a fixed pronunciation regardless of the region in which it is used. "Pinyin" refers to these phonetic symbols. To input a Chinese character by pinyin code, the user enters the pronunciation of the character in Roman characters, which are then converted into the Chinese character. In the Chinese character code table 24, each Chinese character corresponds one-to-one with its character code; a single pinyin code, however, corresponds to a plurality of Chinese characters.
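The relationships in the Chinese character code table 24 (one code per character, many characters per pinyin code) might be represented as in the Python sketch below. The row layout and function names are assumptions, and the glyphs and their pairing with number parts are illustrative values, not entries taken from Fig. 4.

```python
from typing import List, Optional

# Hypothetical rows: (number part, pinyin code, character per language).
ROWS = [
    ("2040", "Shi", {"CJ": "实", "CF": "實", "JP": "実"}),
    ("2041", "Shi", {"CJ": "时", "CF": "時", "JP": "時"}),
]


def code_for(char: str, language: str) -> Optional[str]:
    """Return the character code of `char` in `language`, or None."""
    for number, _pinyin, glyphs in ROWS:
        if glyphs.get(language) == char:
            return language + number
    return None


def chars_for_pinyin(pinyin: str, language: str) -> List[str]:
    """Return every character in `language` recorded under `pinyin`."""
    return [glyphs[language] for _number, p, glyphs in ROWS
            if p == pinyin and language in glyphs]
```

A lookup by character yields exactly one code, whereas a lookup by pinyin code yields a list, which mirrors the one-to-one and one-to-many relationships stated above.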
An example of the multilingual compound word code table 25 is now described with reference to Fig. 6.
For each character string (compound word) common to multiple languages, the compound word code table 25 holds the character code sequence used in each language and a code sequence common to those sequences (the same pinyin code sequence); the character code sequences and the common code sequence are associated with one another across the languages. Fig. 6 shows only simplified Chinese characters, traditional Chinese characters, and kanji as example languages.
In the compound word code table 25, compound words of different languages that have similar pronunciations in Chinese are recorded in association with the same pinyin code sequence. For example, for the Chinese (Japanese) compound words shown in Fig. 7, the different compound words are all recorded in association with the pinyin code sequence shown on the left of the figure. For example, the pinyin code sequence "Shi Jian" can be associated with the different compound words (written in Japanese) “ Time Inter ", “ real tramples ", " incident ", and “ Shi Inter ".
In each language, the Chinese characters used in a compound word may have the same origin with slightly different strokes, may have entirely different origins, or may appear in reverse order.
For example, as shown in Figs. 8A and 8B, for the Japanese compounds (words) “ Lesson Long ", " Meter picture ", " bodyguard Sample ", " worker Games ", “ Yu Measuring ", “ Inertia Xi ", “ Shi Let ", “ Sales Buy ", " fortification ", and “ Fu Juan person ", the Chinese (simplified) compound words with the same meanings are shown in the corresponding adjacent fields. As Figs. 8A and 8B show, compound words with the same meaning may nevertheless contain different Chinese characters. For example, " section " is the (simplified) Chinese character corresponding to the Japanese " Lesson " in the word " Lesson Long ". The pinyin code "Ke", however, corresponds to both “ Lesson " and " section ". Therefore, as shown in Fig. 7, the pinyin code sequence corresponding to “ Lesson Long " is "Ke Zhang" in both Japanese and Chinese (simplified). In other words, these words are associated with each other through the pinyin code.
By thus recording in the compound word code table 25 the character code sequences and pinyin code sequences for compound words in multiple languages, a compound word in one language can be converted simply and accurately into a compound word in another language, without using a language conversion program or the like that holds a large amount of data based on equivalences of meaning between languages.
Fig. 6 shows only compounds (words) of two Chinese characters each. Character code sequences and pinyin code sequences can likewise be recorded for compound words of three or more characters.
The character input operation according to the present embodiment is now described with reference to the flowcharts shown in Figs. 9, 10, and 11.
(1) Fig. 12 shows an example of inputting (converting) a kanji by handwriting.
First, when a (Chinese) character is to be input by handwriting, the input mode is selected (step A1). Here, to select the input mode, the user designates the Japanese (JP) key 41 on the tablet 16 with the pen 19 (201). The CPU 10 starts the handwritten character recognition program 20 and the global IME 21 and waits for a handwritten character to be input (step A21).
Next, when a character is handwritten in the handwriting area 40 of the tablet 16, the data on that handwritten character is input. The example in Fig. 12 shows the kanji “ real " being input by handwriting (202).
When a predetermined time (for example, 2 seconds) has elapsed since a character was handwritten in one box of the handwriting area 40, or when the user begins handwriting in the next box, the handwritten character recognition program 20 determines that the data for one handwritten character has been input completely. The handwritten character recognition program 20 then performs the process of recognizing the handwritten Chinese character on the input data (step A22) (203).
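The completion rule above (a timeout, or a pen-down in a different box) can be written as a single predicate. The Python sketch below is an assumption-laden illustration: the function name, the parameter names, and the use of box indices are invented, and only the 2-second example value comes from the text.

```python
from typing import Optional

TIMEOUT_SECONDS = 2.0  # example value given in the description


def character_complete(last_stroke_time: float, now: float,
                       current_box: int, touched_box: Optional[int]) -> bool:
    """Decide whether the character in `current_box` is finished.

    `touched_box` is the box in which the pen has just come down,
    or None if the pen is not touching the handwriting area.
    """
    # Starting to write in another box ends the current character at once.
    if touched_box is not None and touched_box != current_box:
        return True
    # Otherwise the character ends once the pen has been idle long enough.
    return now - last_stroke_time >= TIMEOUT_SECONDS
```

Checking the box change first means that a fast writer is never forced to wait for the timeout between characters.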
If the handwritten character cannot be recognized, a message is displayed prompting the user, for example, to re-enter the handwritten character. After the data on the handwritten character has been input again, the character recognition process is performed in the same way.
If, on the other hand, the character is recognized successfully, the CPU 10 starts the Chinese character conversion program 22 (step A23). According to the Chinese character conversion program 22, the CPU 10 retrieves from the Chinese character code table 24 the character code (JP2040) that corresponds to the recognized character in the language specified by the input mode selection (Japanese in this case) (205).
Finally, the CPU 10 displays the character corresponding to the character code (JP2040) obtained from the Chinese character code table 24 in the handwriting area 40 where the character was handwritten and in the character display area 50 (step A24) (204).
By thus designating a language through the input mode selection and handwriting a character, a (Chinese) character in the designated language can be input.
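The flow of example (1), recognition followed by either a retry prompt or a code lookup and display, can be sketched in Python as below. The recognizer is passed in as a stub because the description does not specify the recognition algorithm; InputResult, input_character, the message strings, and the table contents are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

Stroke = List[Tuple[int, int]]  # one stroke as a sequence of coordinates


@dataclass
class InputResult:
    ok: bool
    display: str                # character to show, or a message for the user
    code: Optional[str] = None  # character code when ok is True


def input_character(
    strokes: List[Stroke],
    language: str,
    recognize: Callable[[List[Stroke], str], Optional[str]],
    code_table: Dict[Tuple[str, str], str],
) -> InputResult:
    # Recognition step (step A22 in the description).
    char = recognize(strokes, language)
    if char is None:
        return InputResult(False, "Please write the character again")
    # Code lookup for the input language (step A23).
    code = code_table.get((language, char))
    if code is None:
        return InputResult(False, "No code recorded for this character")
    # The caller displays `display` in both areas (step A24).
    return InputResult(True, char, code)
```

For instance, with a stub recognizer that always returns one kanji and a one-entry table, the function returns that kanji together with its code, and it returns the retry message when the stub returns None.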
(2) Figure 13 shows by kanji hand-written and input and is converted into the example that simplified Hanzi is imported.
At first, if by handwriting input (Chinese) character, then select input pattern.If also import this hand-written character behind the character in being converted into the language that is different from language under this hand-written character, then the user selects translative mode and command execution conversion (steps A 1).In the case, for selecting input pattern, the user utilizes pen 19 to specify Japanese (JP) key 41 on graphic tablet 16.In addition, the user specifies simplified Chinese (CJ) key 43 to be used for object language, then utilizes language conversion key 45 command execution conversions (301).CPU 10 starts Handwritten Digits Recognition program 20 and global IME and waits for an input hand-written character (step S31).
Then, when in the hand-written character district 40 of graphic tablet 16 during a hand-written character, be transfused to about the data of this hand-written character.Example signal shown in Figure 13 is by handwriting input kanji “ real " (302).
When the schedule time (is for example gone in a zone that is imported into from hand-written character in the hand-written character district 40,2 seconds) time, or when the user began inputting characters by handwriting to next zone, Handwritten Digits Recognition program 20 was determined the complete data of having imported about a hand-written character.The data of 20 pairs of inputs of Handwritten Digits Recognition program are carried out the process (steps A 32) (303) of identification handwritten Chinese character.
If the handwritten character cannot be recognized, a message is displayed that prompts the user, for example, to re-enter the handwritten character. After data on the handwritten character has been input again, the character recognition process is executed in the same manner.
On the other hand, if the character is successfully recognized, the CPU 10 starts the kanji conversion program (step A33). In accordance with the kanji conversion program 22, the CPU 10 retrieves from the kanji code table 24 the kanji code (JP2040) for the language specified by the input mode selection (Japanese in this case) that corresponds to the recognized character (304). Also in accordance with the kanji conversion program 22, the CPU 10 retrieves from the kanji code table 24 the kanji code for the language specified by the conversion mode selection that corresponds to the kanji code already retrieved from the kanji code table 24 (step A34).
If the kanji code for the corresponding language cannot be retrieved, the CPU 10 displays, for example, the message "No match found" and waits for the next kanji to be input (steps A35, A37).
On the other hand, if the kanji code for the corresponding language is successfully retrieved from the kanji code table 24, the kanji code is converted (305). In this case, the kanji code (JP2040) for the kanji "real" is converted into the kanji code (CJ2040) for simplified Chinese.
Finally, the CPU 10 causes the character corresponding to the kanji code (CJ2040) converted for simplified Chinese to be displayed in the handwriting area 40, where the character was handwritten, and in the character display area 50 (step A36) (306).
By specifying the languages through mode selection in this way and handwriting a character, the user can input a (Chinese) character in the specified language.
If the kanji code table 24 records all the kanji codes for each language in association with the corresponding kanji, only the language attribute part of a kanji code needs to be converted to the language specified by the conversion mode selection. That is, because the number part (four digits) of the kanji code is common to the different languages, it suffices, for example, to change "JP" in the language attribute part to "CJ".
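The attribute swap just described can be sketched in Python as follows. This is a minimal illustration and not the implementation of the embodiment: the names `split_code`, `convert_code`, and `KNOWN_CODES` (a stand-in for the contents of table 24) are hypothetical, and the sketch assumes every code is a language attribute followed by exactly four digits.

```python
# Hypothetical stand-in for the codes recorded in table 24.
KNOWN_CODES = {"JP2040", "CJ2040"}


def split_code(code):
    """Split a code such as 'JP2040' into its language attribute and number part."""
    return code[:-4], code[-4:]


def convert_code(code, target_language, known_codes=KNOWN_CODES):
    """Replace the language attribute of a code with the target language.

    Returns None when the target language has no code with the same number
    part, which corresponds to the "no match" case of steps A35 and A37.
    """
    _, number = split_code(code)
    converted = target_language + number
    return converted if converted in known_codes else None


print(convert_code("JP2040", "CJ"))  # CJ2040
print(convert_code("JP2040", "CF"))  # None, no traditional Chinese code recorded
```

Keeping the number part fixed means the lookup reduces to a membership test on the target-language code.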
If handwritten Japanese kanji are to be converted into traditional Chinese characters and input, a process similar to the one shown in Figure 13 (steps A1 to A37 in Figure 9) is executed, so its detailed description is omitted. In this case, the Japanese (JP) key 41 is used to select Japanese as the input mode, and the traditional Chinese (CF) key 44 is used to select traditional Chinese as the conversion mode. As described above, traditional Chinese characters are input by handwriting Japanese kanji in the handwriting area 40 of the tablet 16.
(3) Figure 14 shows an example in which handwritten simplified Chinese characters are converted into Japanese kanji and input.
In this case, a process similar to the one shown in Figure 13 (steps A1 to A37 in Figure 9) is executed, so its detailed description is omitted. The simplified Chinese (CJ) key 43 is used to select simplified Chinese as the input mode, and the Japanese (JP) key 41 is used to select Japanese as the conversion mode. As described above, Japanese kanji are input (406) by handwriting simplified Chinese characters (402) in the handwriting area 40 of the tablet 16.
If handwritten traditional Chinese characters are to be converted into Japanese kanji and input, a process similar to the one shown in Figure 14 (steps A1 to A37 in Figure 9) is executed, so its detailed description is omitted. In this case, the traditional Chinese (CF) key 44 is used to select traditional Chinese as the input mode, and the Japanese (JP) key 41 is used to select Japanese as the conversion mode. As described above, Japanese kanji are input by handwriting traditional Chinese characters in the handwriting area 40 of the tablet 16.
(4) Figure 15 shows an example in which a handwritten Chinese compound word (simplified Chinese characters) is converted into a Japanese compound word and input.
First, when a compound word is to be input by handwriting, the input mode is selected. If the handwritten compound word is to be input after being converted into a compound word in a language different from the language to which it belongs, the user selects the conversion mode and instructs conversion of the compound word. In this case, to select the input mode, the user uses the pen 19 to specify the simplified Chinese (CJ) key 43 on the tablet 16. The user also specifies the Japanese (JP) key 41 for the target language and then uses the compound word key 46 to instruct conversion of the compound word (501). The CPU 10 starts the handwritten character recognition program 20 and the global IME and waits for a handwritten character to be input (step A42).
Then, when a compound word (character string) is handwritten in the handwriting area 40 of the tablet 16, data on the handwritten characters is input. The example shown in Figure 15 illustrates a compound word consisting of two simplified Chinese characters being input by handwriting (502).
Then, when the conversion key 45 is operated to instruct execution of the conversion, the handwritten character recognition program 20 executes the process of recognizing the handwritten characters on the data of the characters handwritten in the corresponding regions of the handwriting area 40 (step A43) (503).
If a handwritten character cannot be recognized, a message is displayed that prompts the user, for example, to re-enter the handwritten character. When data on the handwritten character has been input again, the character recognition process is executed in the same manner.
On the other hand, if the characters are successfully recognized, the CPU 10 starts the kanji conversion program. In accordance with the kanji conversion program 22, the CPU 10 retrieves from the kanji code table 24 the kanji codes (CJ2040 and CJ1255) for the language specified by the input mode selection (simplified Chinese in this case) that correspond to the recognized characters (504). Also in accordance with the kanji conversion program 22, the CPU 10 retrieves from the kanji code table 24 the kanji codes for the language specified by the conversion mode selection (Japanese in this case) that correspond to the kanji codes already retrieved from the kanji code table 24 (step A44). In this example, the kanji codes (CJ2040 and CJ1255) for the simplified Chinese characters are converted into the corresponding kanji codes (JP2040 and JP1255) for the Japanese kanji (505).
Finally, the CPU 10 causes the character string corresponding to the kanji codes (JP2040 and JP1255) converted for the Japanese compound word to be displayed in the handwriting area 40, where the characters were handwritten, and in the character display area 50 (step A52) (507).
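The character-by-character conversion of a compound word described above might be sketched as follows. The table layout (one row per common character, keyed by language attribute) and the names `KANJI_CODE_TABLE` and `convert_sequence` are assumptions made for illustration; only the code values are taken from the example of Figure 15.

```python
# Hypothetical model of table 24: each row associates the codes that the
# different languages use for one common character.
KANJI_CODE_TABLE = [
    {"JP": "JP2040", "CJ": "CJ2040"},
    {"JP": "JP1255", "CJ": "CJ1255"},
]


def convert_sequence(codes, source, target, table=KANJI_CODE_TABLE):
    """Convert each code of a compound word from the source to the target language.

    Returns the converted list, or None if any code has no counterpart.
    """
    # Index the rows by their source-language code for direct lookup.
    index = {row[source]: row.get(target) for row in table if source in row}
    converted = [index.get(code) for code in codes]
    return None if None in converted else converted


print(convert_sequence(["CJ2040", "CJ1255"], "CJ", "JP"))  # ['JP2040', 'JP1255']
```

Returning None for the whole sequence when a single code is missing is a design choice of this sketch; the description only states what happens for individual characters.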
(5) Figure 16 shows an example in which a handwritten Japanese compound word is converted into a Chinese compound word (simplified Chinese characters) and input.
In this case, a process similar to the one shown in Figure 13 (steps A42 to A47 in Figure 10) is executed, so its detailed description is omitted. The Japanese (JP) key 41 is used to select Japanese as the input mode, and the simplified Chinese (CJ) key 43 is used to select simplified Chinese as the conversion mode. As described above, a Chinese compound word (simplified Chinese characters) is input (607) by handwriting a Japanese compound word in the handwriting area 40 of the tablet 16.
(6) Figures 17 and 18 show a second example in which a handwritten Japanese compound word is converted into a Chinese compound word (simplified Chinese characters) and input.
In the example described with reference to Figure 16, the kanji code sequence obtained by conversion with the kanji conversion program 22 is retrieved from the compound word code table 25. Figures 17 and 18, however, show an example in which the relevant kanji code sequence cannot be retrieved from the compound word code table 25.
Here, it is assumed that the character string "課長", a Japanese compound word, has been input by handwriting (702). The kanji code sequence (JP1010 and JP2580) obtained by processing this handwritten character string (compound word) is converted, in accordance with the kanji conversion program 22, into the kanji code sequence (CJ1010 and CJ2580) for a Chinese compound word (simplified Chinese characters) (705). This process is executed in the same manner as steps A42 to A46 described above.
Then, in accordance with the compound word conversion program 23, the simplified Chinese entries in the compound word code table 25 are searched to determine whether a simplified Chinese compound word consisting of the two kanji codes (CJ1010 and CJ2580) actually exists (step A47) (706). As a result, the kanji code sequence (CJ1010 and CJ2580) for the Chinese compound word cannot be retrieved from the compound word code table 25.
In this case, in accordance with the compound word conversion program 23, the character string is further converted, on the basis of the kanji code sequence (CJ1010 and CJ2580) for the Chinese compound word, into Pinyin codes representing the Chinese pronunciation of the respective kanji. That is, the Pinyin codes corresponding to the respective kanji codes are obtained from the kanji code table 24 to generate a Pinyin code sequence. In this case, the Pinyin code sequence "Ke Zhang" corresponds to the kanji codes (CJ1010 and CJ2580). The list of Pinyin codes in the compound word code table 25 is then searched for the generated Pinyin code sequence (step A50) (707). That is, by searching the Pinyin code sequences recorded in the compound word code table 25, it is determined whether the combination of Pinyin codes generated from the kanji codes actually exists as a Chinese compound word.
If the relevant Pinyin code sequence is not recorded in the compound word code table 25, the message "No match found" is displayed (step A57).
On the other hand, if only one Pinyin code sequence matches, the Chinese compound word of the matching kanji code sequence is displayed as characters (step A57).
On the other hand, if a plurality of Pinyin code sequences match, the two kanji codes of each kanji code combination are retrieved. For example, Figure 17 shows that the Pinyin code sequence ("Ke Zhang") corresponds to two kanji code sequences, (CJ1355 and CJ2600) and (CJ1360 and CJ2580). The kanji code sequences corresponding to these Pinyin code sequences are recorded as a candidate list.
Then, in accordance with the compound word conversion program 23, the kanji code sequences in the candidate list, which were retrieved on the basis of the Pinyin code sequence, are checked against the kanji code sequence obtained by the conversion. For example, if the check of the kanji codes (steps A53 and A54) finds only one matching compound word, its kanji code sequence is determined as the conversion result (steps A53, A54, and A55) (708). In this case, the corresponding kanji code sequence (CJ1360 and CJ2580) is determined as the conversion result.
If, for example, the check of the kanji codes (steps A53 and A54) finds a plurality of compound word candidates, the kanji code sequence of, for example, the first compound word candidate is determined as the search result (steps A53, A54, and A56).
On the other hand, if no relevant kanji code sequence is recorded in the candidate list, the message "No match" is displayed (step A57).
Finally, the CPU 10 causes the character string corresponding to the kanji codes (CJ1360 and CJ2580) converted for the simplified Chinese compound word to be displayed in the handwriting area 40, where the characters were handwritten, and in the character display area 50 (step A55).
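The Pinyin-based fallback walked through above might be sketched as follows. This is an illustration under stated assumptions, not the embodiment itself: the names `PINYIN`, `COMPOUND_TABLE`, and `resolve_compound` are hypothetical; the per-code Pinyin values for CJ1355, CJ2600, and CJ1360 are inferred from the statement that both candidate sequences read "Ke Zhang"; and the rule used to check candidates (a candidate matches if it shares a kanji code with the converted sequence at the same position) is an assumption, since the description does not spell out the comparison.

```python
# Hypothetical Pinyin code per kanji code (Pinyin column of table 24).
PINYIN = {
    "CJ1010": "Ke", "CJ2580": "Zhang",
    "CJ1355": "Ke", "CJ2600": "Zhang",
    "CJ1360": "Ke",
}

# Hypothetical compound words recorded in table 25, as kanji code sequences.
COMPOUND_TABLE = [("CJ1355", "CJ2600"), ("CJ1360", "CJ2580")]


def to_pinyin(codes):
    """Map each kanji code to its Pinyin code (None if unknown)."""
    return [PINYIN.get(code) for code in codes]


def resolve_compound(converted):
    """Return the recorded compound word for a converted code sequence, or None."""
    converted = tuple(converted)
    # Direct search for the converted sequence (step A47).
    if converted in COMPOUND_TABLE:
        return converted
    # Search by Pinyin code sequence instead (step A50).
    target = to_pinyin(converted)
    if None in target:
        return None
    candidates = [seq for seq in COMPOUND_TABLE if to_pinyin(seq) == target]
    if len(candidates) <= 1:
        return candidates[0] if candidates else None
    # Several candidates: keep those sharing a code with the converted sequence
    # at the same position (assumed rule) and take the first one found.
    matches = [seq for seq in candidates
               if any(a == b for a, b in zip(seq, converted))]
    return matches[0] if matches else None


print(resolve_compound(["CJ1010", "CJ2580"]))  # ('CJ1360', 'CJ2580')
```

With the data of Figure 17, both recorded sequences read "Ke Zhang", and only (CJ1360 and CJ2580) shares the code CJ2580 with the converted sequence, so it is selected.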
In step A48, if no matching kanji code sequence is found in the compound word code table 25 by the search in step A47, the compound word code table 25 can be searched again with the order of the kanji codes reversed (if the compound word consists of two characters). Thus, even for compound words that have the same meaning but a different character order in different languages, for example the simplified Chinese compound words corresponding to the Japanese words "慣習" and "施設" shown in Figure 8B, the corresponding kanji code sequence can be obtained as a candidate by searching the compound word code table 25.
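A minimal sketch of this reversed-order retry is given below. The function name `find_compound` and the code values in the usage lines are placeholders invented for illustration; the description does not give the codes of the reversed-order examples.

```python
def find_compound(codes, compound_table):
    """Look up a code sequence, retrying in reverse order for two-character words.

    Returns the recorded sequence, or None when neither order is recorded.
    """
    sequence = tuple(codes)
    if sequence in compound_table:
        return sequence
    # Retry with the codes reversed, but only for two-character compound words.
    if len(sequence) == 2:
        reversed_sequence = sequence[::-1]
        if reversed_sequence in compound_table:
            return reversed_sequence
    return None


# Placeholder codes: the recorded word uses the opposite character order.
table = {("CJ0002", "CJ0001")}
print(find_compound(["CJ0001", "CJ0002"], table))  # ('CJ0002', 'CJ0001')
```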
Thus, if the compound word code table 25 does not contain a compound word corresponding to the kanji code sequence obtained by conversion with the kanji conversion program 22, a kanji code sequence (compound word) can be obtained as the final conversion result as follows. The compound word is converted into a Pinyin code sequence. The compound word code table 25 is then searched for matching kanji code sequences. Finally, the matching kanji code sequences are compared with the kanji code sequence obtained by conversion with the kanji conversion program 22.
(7) Figures 19 and 20 show an example in which a handwritten simplified Chinese compound word is converted into a Japanese compound word and input.
In this case, a process similar to the one shown in Figures 17 and 18 (steps A42 to A53 in Figures 10 and 11) is executed, so its detailed description is omitted. The simplified Chinese (CJ) key 43 is used to select simplified Chinese as the input mode, and the Japanese (JP) key 41 is used to select Japanese as the conversion mode. In addition, the compound word key 46 is used to instruct conversion of a compound word. As described above, Japanese is input (809) by handwriting Chinese (simplified) in the handwriting area 40 of the tablet 16.
In this case, the compound word code table 25 is searched on the basis of the Pinyin code sequence into which the kanji code sequence (JP1360 and JP2580) is converted (807). As a result, the kanji code sequence (JP1360 and JP2580) indicating the Japanese word "課長" is retrieved and displayed.
In the above description, the languages for the input mode and the conversion mode are specified before characters are handwritten, and the conversion key 45 is operated to instruct execution of the conversion. However, the instruction to execute the conversion may be given after the character string to be converted into a compound word (character string) in a different language has been input. For example, to input a simplified Chinese character string (compound word) by handwriting a two-character Japanese character string (compound word), the user first operates the appropriate keys to specify Japanese as the input mode and simplified Chinese as the conversion mode and to instruct conversion of a compound word. The user then handwrites the two kanji and operates the conversion key 45. The process is thus executed in accordance with the kanji conversion program 22 and the compound word conversion program 23 using the two-character string as the unit of conversion. When a plurality of character strings are input in succession, the conversion key is operated each time a character string to be converted has been handwritten. The handwritten character string is then processed in accordance with the previously specified input mode and conversion mode and converted into a character string in the specified language. For example, if character strings each consisting of three characters are input in succession, the conversion key 45 may be operated each time three characters have been input. Accordingly, when a plurality of character strings are input in succession, they can be input efficiently because the user need only operate the conversion key 45 in addition to handwriting the character strings.
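The idea that the characters handwritten since the last key operation form the unit of conversion might be sketched as follows. The class `HandwritingSession`, its method names, and the helper `swap_to_cj` (a stand-in for programs 22 and 23 that merely replaces a two-letter language attribute) are hypothetical and serve only to illustrate the buffering behaviour.

```python
def swap_to_cj(codes):
    """Stand-in converter: replace the two-letter language attribute with 'CJ'."""
    return ["CJ" + code[2:] for code in codes]


class HandwritingSession:
    """Buffers recognized characters until the conversion key is operated."""

    def __init__(self, convert_string):
        self.convert_string = convert_string  # converter for one character string
        self.pending = []    # codes handwritten since the last conversion
        self.converted = []  # conversion results, one entry per character string

    def handwrite(self, code):
        """Record the code of one recognized handwritten character."""
        self.pending.append(code)

    def press_conversion_key(self):
        """Convert everything handwritten since the last key press as one unit."""
        if not self.pending:
            return None
        result = self.convert_string(self.pending)
        self.converted.append(result)
        self.pending = []
        return result


session = HandwritingSession(swap_to_cj)
session.handwrite("JP1010")
session.handwrite("JP2580")
print(session.press_conversion_key())  # ['CJ1010', 'CJ2580']
```

Because the input and conversion modes are fixed once per session in this sketch, each further character string costs the user only one additional key operation.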
Thus, even for a foreign-language character or compound word whose pronunciation, writing, or meaning is unknown, the operator can, simply by handwriting the kanji or the character string of the compound word, have it converted into a character or compound word in a language known to the operator and displayed. The present invention can also handle cases in which the characters used in a compound word have the same origin but slightly different strokes, cases in which they have entirely different origins, and cases in which the character order is reversed between languages. Accordingly, by handwriting a compound word in a language known to the operator, the operator can have it accurately converted into the compound word in the target language and input. In the present embodiment, a combination of kanji codes and Pinyin codes (symbols for the Chinese pronunciation of kanji) is used to convert a kanji or compound word used in one country into a kanji or compound word used in another country. This eliminates the need for a multilingual translation program containing a large amount of data for multiple languages. Moreover, merely by recording the compound words to be converted in the compound word code table 25, a compound word in one language can be efficiently converted into a compound word in another language.
Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details and representative embodiments shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.

Claims (16)

1. A Chinese character processing apparatus, characterized by comprising:
means (31) for specifying first and second languages;
means (30) for inputting data on a handwritten character;
means (32, 33) for generating a kanji code of the first language corresponding to the data on the handwritten character; and
means (33) for converting the kanji code generated by the generating means into a kanji code of the second language.
2. The apparatus according to claim 1, characterized by further comprising a character code table (24) which stores, for characters common to the first and second languages, the kanji codes of the first and second languages in association with each other, and
characterized in that the converting means (33) converts the kanji code generated by the generating means by using the character code table.
3. The apparatus according to claim 1 or 2, characterized by further comprising means (35) for converting a plurality of kanji codes obtained by the converting means into a plurality of kanji codes of the second language.
4. The apparatus according to claim 3, characterized by further comprising a character string code table (25) which stores, for character strings of characters common to the first and second languages, the character strings of kanji codes of the first and second languages in association with each other, and
characterized in that the converting means (33) converts the plurality of kanji codes of the first language by using the character string code table.
5. The apparatus according to claim 1, characterized by further comprising means for inputting an instruction to execute conversion, and
characterized in that the converting means converts the kanji code after the instruction has been input by the instruction inputting means.
6. The apparatus according to claim 1, characterized in that the inputting means inputs the data on the handwritten character from a tablet.
7. The apparatus according to claim 1, characterized in that attribute data indicating a language is added to the kanji code generated by the generating means.
8. The apparatus according to claim 7, characterized in that the converting means converts the attribute data added to the character code into attribute data of the second language.
9. A Chinese character processing apparatus, characterized by comprising:
means (31) for specifying first and second languages;
means (30) for inputting data on a handwritten character;
means (32, 33) for generating a kanji code of the first language corresponding to the data on the handwritten character;
first converting means (33) for converting the kanji code generated by the generating means into a kanji code of the second language;
a character code table (24) for storing a plurality of code sets, each code set including a kanji code of the first language, a kanji code of the second language, and a common code shared by the kanji codes of the first and second languages;
a character string code table (25) for storing a plurality of kanji code sets, each code set including a plurality of kanji codes of the first language, a plurality of kanji codes of the second language, and a plurality of common codes shared by the kanji codes of the first and second languages;
second converting means (35) for converting a plurality of character codes obtained by the first converting means into a plurality of first common codes by using the character code table;
means (35) for searching the character string code table for a plurality of second common codes corresponding to the plurality of first common codes; and
means (35) for obtaining a plurality of character codes of the second language corresponding to the plurality of second common codes.
10. The apparatus according to claim 9, characterized in that the common code is a Pinyin code.
11. The apparatus according to claim 9, characterized by further comprising means for inputting an instruction to execute conversion, and
characterized in that the first converting means converts the character code after the instruction has been input by the instruction inputting means.
12. The apparatus according to claim 9, characterized in that the inputting means inputs the data on the handwritten character from a tablet.
13. The apparatus according to claim 9, characterized in that attribute data indicating a language is added to the kanji code generated by the generating means.
14. The apparatus according to claim 13, characterized in that the converting means converts the attribute data added to the character code into attribute data of the second language.
15. A Chinese character processing method, characterized by comprising:
specifying first and second languages (A1);
inputting data on a handwritten character (A31);
generating a kanji code of the first language corresponding to the input data on the handwritten character (A32); and
converting the generated kanji code into a kanji code of the second language (A34).
16. The method according to claim 15, characterized by further comprising:
converting a plurality of kanji codes converted into the second language into a kanji code sequence of the second language (A46).
CNA2004100072858A 2003-02-28 2004-02-27 Hanzi processing equipment and method Pending CN1525388A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP054682/2003 2003-02-28
JP2003054682A JP2004265136A (en) 2003-02-28 2003-02-28 Apparatus, method and program for input characters

Publications (1)

Publication Number Publication Date
CN1525388A true CN1525388A (en) 2004-09-01

Family

ID=33118949

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2004100072858A Pending CN1525388A (en) 2003-02-28 2004-02-27 Hanzi processing equipment and method

Country Status (2)

Country Link
JP (1) JP2004265136A (en)
CN (1) CN1525388A (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7428516B2 (en) * 2005-06-23 2008-09-23 Microsoft Corporation Handwriting recognition using neural networks
JP5239419B2 (en) 2008-03-14 2013-07-17 オムロン株式会社 Character recognition program, character recognition electronic component, character recognition device, character recognition method, and data structure
CN102915215B (en) 2011-08-03 2015-05-27 精工爱普生株式会社 Control device and control method
JP5790267B2 (en) * 2011-08-03 2015-10-07 セイコーエプソン株式会社 Output control system and control method
JP5984375B2 (en) * 2011-12-15 2016-09-06 株式会社日立公共システム Simplified character / correct character conversion device and simplified character / correct character conversion method using the device
JP2013125450A (en) * 2011-12-15 2013-06-24 Hitachi Government & Public Corporation System Engineering Ltd Foreigner name traditional chinese character output system and foreigner name traditional chinese character output method
JP6297449B2 (en) * 2014-08-19 2018-03-20 アルパイン株式会社 Audio apparatus and computer program

Also Published As

Publication number Publication date
JP2004265136A (en) 2004-09-24

Similar Documents

Publication Publication Date Title
CN1296806C (en) Reduced keyboard disambiguating system
CN1024050C (en) Method and apparatus for encoding and recording Chinese characters
CN1269014C (en) Character input device
US20050027534A1 (en) Phonetic and stroke input methods of Chinese characters and phrases
CN1232226A (en) Sentence processing apparatus and method thereof
CN1777858A (en) Unambiguous text input method for touch screens and reduced keyboard systems
CN100342317C (en) Character inputting device and method
CN101079268A (en) System and method for sign language synthesis and display
CN1993692A (en) A character display system
CN1095560C (en) Kanji conversion result amending system
CN1591297A (en) Chinese character input method and apparatus
CN101038508A (en) GB phoneticize input method
CA2496872C (en) Phonetic and stroke input methods of chinese characters and phrases
CN1525388A (en) Hanzi processing equipment and method
CN1704879A (en) Method and apparatus for inputting Chinese characters and phrases
CN1556458A (en) Chinese whole sentence input method
CN1991743A (en) Method and device for voice input method
CN1106619C (en) Chinese input transition processing device and Chinese input transition processing method
CN1136496C (en) Simplified spelling-touching screen mouse chinese character input method
CN101046706A (en) Universal input method for different person computer and mobile phone
CN1053976C (en) Full and double phoneticizing combined type Chinese input method
CN101714141A (en) Handwriting recognition character searching and translating system and method
CN100339808C (en) U Code Chinese character inputting method
CN1177285C (en) Ultralarge Chinese character information treating device and method
CN1052200A (en) Pronunciation-form-meaning words encode series with compatibility and keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication