CN1143231C - Chinese data processor - Google Patents

Chinese data processor Download PDF

Info

Publication number
CN1143231C
CN1143231C CNB961059796A CN96105979A CN1143231C CN 1143231 C CN1143231 C CN 1143231C CN B961059796 A CNB961059796 A CN B961059796A CN 96105979 A CN96105979 A CN 96105979A CN 1143231 C CN1143231 C CN 1143231C
Authority
CN
China
Prior art keywords
code
data
pinyin
chinese
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB961059796A
Other languages
Chinese (zh)
Other versions
CN1140858A (en
Inventor
泉田智史
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Publication of CN1140858A publication Critical patent/CN1140858A/en
Application granted granted Critical
Publication of CN1143231C publication Critical patent/CN1143231C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

To enable batch processing for plural KANJIs (Chinese character) having the same pinyin by converting a KANJI code into a pinyin code by using a conversion table and performing information processing according to the pinyin code. A KANJI code-pinyin code converting means 2 has a KANJI code-pinyin code conversion table 1 wherein KANJI codes of Chinese and corresponding pinyin codes are arrayed corresponding to each other, and KANJI codes are converted into pinyin codes in sequence by using the conversion table 1. Then a processing means 3 perform the information processing on the basis of the obtained pinyin codes. In this case, a data rearranging means 2 as a processing means 3 converts information on KANJI codes stored in a main storage means 5 into information on a pinyin code series by using the KANJI- pinyin converting means 2 and performs rearrangement to the standards of pinyin on the basis of the converted information. Namely, the information in the main storage means 5 is rearranged in the large to small order of pinyin codes through the rearranging process.

Description

Chinese information, treating apparatus
Technical field
The present invention relates to the signal conditioning package of a kind of for example word processor etc., carry out data retrieval when particularly relating to a kind of input of carrying out Chinese article and editor and arrange the Chinese data processor of the processing of replacing etc.
Background technology
Originally, in the input of Chinese article and edit in the used Chinese data processor, be to use by " phonetic " of the pronunciation of letter representation Chinese characters mark to import, be transformed to Chinese character again, particularly in Japanese publication 1. JP-A-62-93744,2. JP-A-3-28964,3. disclosed scheme among the JP-A-6-208560, its formation is in the conversion input that can realize during to the Chinese character conversion from pronouncing indicia allowing that the pronunciation mark is ambiguous.
That is to say, in the formation of in the JP-A-62-93744 communique, being put down in writing, when importing according to the Chinese of phonetic, except that making phonetic and word corresponding phonetic conversion one to one character library with it, make phonetic and the word similar ambiguous conversion dictionary of phonetic one to one in addition, when phonetic-Chinese character conversion, when retrieval phonetic conversion character library does not find the alternated Chinese character of necessity with it, retrieve the ambiguous character library of phonetic again, show alternated Chinese character.
The scheme that the JP-A-3-28964 communique is put down in writing is according to phonetic and four tones of standard Chinese pronunciation input Chinese character, wherein make the phonetic and the four tones of standard Chinese pronunciation and Chinese character corresponding character library device one to one with it, not with phonetic of being imported and the corresponding alternated Chinese character of the four tones of standard Chinese pronunciation time, no matter the Chinese character of phonetic unanimity only retrieved in the four tones of standard Chinese pronunciation, and show as alternated Chinese character.
And the formation of being put down in writing in the JP-A-6-208560 communique is by word sound mark input Chinese character, wherein except that word sound mark is transformed to the Chinese character sequence corresponding with it, record and narrate the ambiguous character library of the mutual ambiguous relation of word sound mark in addition, even imported the word sound mark of the pronunciation that has accent during the input Chinese character, also can be transformed to correct Chinese.
The existing shortcoming of these schemes is to be difficult to difference pronunciation correctly because the user adopts with the Chinese character input method of phonetic, and the difference of pronunciation is felt ambiguous and can be obscured, so be difficult to come into operation.
On the other hand, 4. disclosing a kind of formation in the JP-A-4-156666 communique, is when some Chinese character of input, from the article of having imported, select the Chinese character of identical phonetic, append input four tones of standard Chinese pronunciation information again, come the Chinese character retrieval character library from phonetic and four tones of standard Chinese pronunciation information, and alternated Chinese character is shown.
The problem that this scheme will solve is owing to just show desirable Chinese character input Pinyin after, therefore, necessary multitap, thus the key operation number of times increased, can not show the purpose Chinese character rapidly.According to this scheme, from picture, select the unisonance Chinese character the shown Chinese character, import tone simultaneously, demonstrate the Chinese character group of candidate again, from shown Chinese character group, select desirable Chinese character then, so this scheme can promptly be imported Chinese character with simple key operation.
Really,, during to the Chinese character conversion, can allow the ambiguous of pronunciation mark, and adopt simple key operation just can import Chinese character rapidly with the scheme of 4. communique from pronouncing indicia with disclosed scheme in the above-mentioned communique 1.~3..That is to say, in Chinese character information treating device, various improvement can both provide practical application, but, comprise above-mentioned communique 1.~4. in interior original Chinese data processor, still obtained sufficient improvement hardly, yet had the multiple problem that will solve in actual use, the user wishes to improve its operability consumingly.
In a word, in general, Chinese data processor originally is to manage the Chinese written language data with encode Chinese characters for computer, even in the above-mentioned communique 1.~3., does not also disclose the technical conceive of managing the Chinese written language data of having stored with Pinyin code.
According to the data management based on the GB code of Chinese national standard defined, its one-level code can be managed by pinyin order, and still, the Chinese character of its secondary code just can not be managed by pinyin order.Therefore, in original Chinese data processor, when for example carrying out many Chinese characters word sequence arrangement replacement processing, the Chinese character word sequence that is arranged the result who replaces with kanji code order and is the one-level code that belongs to kanji code is by the series arrangement according to phonetic, and the word sequence that belongs to secondary code is by the series arrangement according to radicals by which characters are arranged in traditional Chinese dictionaries.That is: when arranging the Chinese character sequence of replacing one-level code Chinese character and the mixing of secondary code Chinese character, can not get the arrangement of pinyin order completely and replace the result, this is very inconvenient aspect data preparation.
In addition, in Chinese, different fonts, non-standard word, simplified/complex form of Chinese characters and numeral write (capitalization) although etc. be same implication usage, be to use the situation of different literals still to happen occasionally.For example " chaos ", " chaos " both pronunciations all are " hundun ", and implication is also identical.In addition, in the onomatopoeia word, because mainly be to stress its pronunciation, so exist the Chinese character that uses how much to understand the situation of some variation, Figure 52 has just represented such example.
; as previously mentioned; because original Chinese data processor is a lteral data of managing Chinese character with kanji code; so kanji code also is used as the retrieval key when retrieval, therefore, for example above-mentioned " chaos ", " chaos "; though both pronunciations are all identical with implication; but, must do quadratic search and handle because the kanji code difference still can not be retrieved simultaneously.In a word, Chinese data processor originally can not carry out retrieval process effectively for the Chinese article that has the different situation of the identical and employed Chinese character of implication usage.
On the other hand, the disclosed scheme of communique 4. is to select the Chinese character of identical phonetic from the article of having imported, and adds four tones of standard Chinese pronunciation information, shows the Chinese character of candidate then.But this scheme yet will turn back to phonetic to the lteral data of having imported for input characters, does not also disclose the technical conceive of managing the Chinese written language of having stored with phonetic in above-mentioned communique 4..
According to original Chinese data processor, the operator must know the correct method of joining together of wanting the word sequence imported, but, in Chinese characters, it is identical or similar to pronounce, and the similarly difficult again situation about differentiating of implication and usage has the place greatly simultaneously, therefore, the operator does not find out under the situation of the method for joining together at once, just must go to consult the dictionary to confirm.
And, in the scheme of 4. communique, there is same inconvenience, that is: in this scheme, the input of the four tones of standard Chinese pronunciation is indispensable, and the operator must know the correct phonetic and the four tones of standard Chinese pronunciation of target characters, in addition, another problem that the scheme of this communique exists is that the operator must find out the Chinese character (conversion unit) with identical phonetic with visual from the article of being imported, and has just become burden concerning the operator, and, the article of having imported can not be in this way than under the short situation.
Though must from the article of having imported, can solve by near the position of input target characters, import the Chinese character that becomes conversion unit method with the visual problem of finding out the Chinese character of conversion unit, but, also can not import under the sort of situation conversion unit Chinese character and directly with the pronunciation mark of phonetic input target characters, but this has just lost the meaning of this prior art.
In addition, with original Chinese data processor management " name ", also have problems during the address entry information in " address " and so on, as mentioned above, the kanji code specification of China is that the one-level code of GB code is to arrange by pinyin order, secondary code is to arrange by radical order, so, manage name or the certificate address information of being recorded and narrated with phonetic according to code system, promptly press the information of sequential arrangement of phonetic, want simultaneously to carry out under the situation by the retrieval of Pinyin code, with what represent with Chinese character is that the word sequence in name or address is different, just must import the pronunciation of its word sequence with phonetic.
Also having a kind of situation is that to manage simultaneously with American-European place name or name or with Hong Kong be that the alphabetic flag etc. of the Cantonese used, the area of representative reaches in English the name of mark or the situation in address more.But, in this case, with original Chinese letter treating apparatus, owing to adopt the data of kanji code management Chinese, and the data of managing English with the code of alphabetic literal sequence, so manage the Chinese article data and the English letter data are impossible unifiedly.
Summary of the invention
The purpose of this invention is to provide a kind of burden that neither increases the operator and can handle the Chinese data processor of using the Chinese canned data again effectively.
For achieving the above object, a kind of Chinese data processor of the present invention, be provided with the input media of input retrieval key, data storage device with the data of storing the kanji code form, it is characterized in that comprising: have the kanji code-Pinyin code map table of Pinyin code, and the data conversion of the kanji code form that is stored in described data storage device is arrived the Chinese character-phonetic converting means of Pinyin code form with this map table corresponding to the Chinese characters code; To press the apparatus for temporary storage of Pinyin code form storage by the retrieval key of above-mentioned input media input; From being transformed to by above-mentioned Chinese character-Pinyin code converting means among the data of Pinyin code form, extract the indexing unit of the code consistent with the retrieval key of described apparatus for temporary storage stored; With the display device that shows result for retrieval.
According to above-mentioned formation, the kanji code of Chinese character shown in Figure 1-phonetic converting means 2 usefulness Chinese and this code kanji code-Pinyin code map table one to one are transformed into Pinyin code one by one to kanji code; And treating apparatus 3 carries out information processing according to resulting Pinyin code.Like this, not with kanji code but manage the lteral data of the Chinese of for example having stored with regard to adopting with the form that phonetic carried out coding, as a result, treating apparatus just can be to handling by the data management of pinyin order and one of a plurality of Chinese character with identical phonetic.
For example use Chinese data processor, when the entry information of the address in management " name ", " address " and so on, though can come the order of one-level code according to phonetic managed with management according to the data of the GB code of common Chinese national Specification, but owing to can not manage the Chinese character of secondary code by the order of phonetic, so in the time will managing the address entry information with phonetic, different with the word sequence in name of representing with Chinese character or address, the pronunciation of its word sequence must be imported once more, so bother very much with phonetic.But, according to the formation of claim 1 record, because needn't be once more with the phonetic input, so can alleviate operator's burden.And encodedization manages because data are by the order of phonetic, so typing management in address just is very easy to carry out.
As shown in Figure 2, the treating apparatus 3 of Chinese data processor of the present invention also comprises data ordering alternative 4, above-mentioned Chinese character-phonetic the converting means 2 for the treatment of apparatus 3 usefulness is the data conversion of the kanji code form of data storage device 22 stored the Pinyin code form, and above-mentioned data ordering alternative 4 is replaced the data that are transformed to the Pinyin code form again according to the series arrangement of phonetic.Like this, by arranging to replace just to handle the data in the data storage device 22 are lined up by the size sequence of Pinyin code.
With original formation the Chinese information stored is implemented to arrange and replace when handling, can not replace with pinyin order completely to the Chinese character series arrangement that the Chinese character of the Chinese character of the one-level code in the GB code of stipulating in the Chinese national standard for example and secondary code mixes.According to above-mentioned formation, even Chinese data processor of the present invention in the Chinese character word sequence that the Chinese character of the Chinese character of for example one-level code and secondary code mixes, also can not be subjected to the influence of kanji code form and arrange by the order of phonetic.As a result, owing to removed the trouble of user from this input Pinyin, thus can reach the effect that can carry out data processing easily.
As shown in Figure 3, the treating apparatus 3 of Chinese data processor of the present invention also comprises indexing unit 6, this indexing unit 6 is provided with the apparatus for temporary storage 7 with the form memory scan key of Pinyin code, be the data conversion of the kanji code forms of being stored in the data storage device 22 form of Pinyin code with above-mentioned Chinese character-phonetic converting means 2 simultaneously, and from the information of conversion, extract the consistent data of retrieval key with apparatus for temporary storage 7 stored.That is to say that the information in the data storage device 22 is not with kanji code but with the Pinyin code retrieval and extracts.
Therefore, for example the phonetic as " chaos ", " chaos " is identical and word sequences that Chinese character is different once just can retrieve and extract.Consequently during the different Chinese article of Chinese character, just can effectively retrieve, thereby can improve the treatment effeciency of Chinese data processor handling so-called identical meanings usage.
As shown in Figure 4, Chinese data processor of the present invention be characterised in that treating apparatus 3 be provided with from the literal of input media input by the apparatus for temporary storage 10 of the form storage of kanji code, the Chinese character word sequence is transformed to the Chinese character sequence-pinyin sequence converting means 11 of Pinyin code sequence, is the Pinyin code sequence that unit transformation is the pinyin sequence-Chinese character sequence transformation device 12 of Chinese character word sequence with word or word sequence with above-mentioned kanji code-Pinyin code converting means.
This scheme is to have utilized in the Chinese often also similar characteristic (for example " horse " and " scolding ", the phonetic of " mother " all is " ma " in addition) of its pronunciation of the similar Chinese character of mark.That is to say, though the operator wants to import the word sequence that is made of certain Chinese character, but not to know at once under its correct situation of joining together, the operator is just being input as that word with the similar Chinese character of correct Chinese character, Chinese character sequence-pinyin sequence converting means 11 is the Chinese character sequence transformation of being imported the Pinyin code sequence just, and this Pinyin code sequence by pinyin sequence-Chinese character sequence transformation device 12 is transformed to the Chinese character word sequence, thereby the word sequence of correctly being joined together.
Like this, the operator just needn't be as original, check correct joining together one by one with dictionary etc., and, also needn't must import the four tones of standard Chinese pronunciation as original, by the operator with the visual Chinese character of from the article of having imported, finding out with identical phonetic, further, the inconvenient part that does not also exist the article imported can not import in short-term.Its result makes the article input become very simple, thereby the operability in the Chinese data processor is improved.
As shown in Figure 5, Chinese data processor of the present invention is characterised in that it is the pinyin sequence-alphabetical sequence converting means 15 of alphabetic literal code signing and the alphabetical treating apparatus 5 that carries out data processing according to the alphabetic literal code that is obtained by this pinyin sequence-alphabetical sequence converting means 15 that treating apparatus 3 also is provided with the data conversion of Pinyin code mark.
For example: when the entry information of the address in management " name ", " address " and so on, American-European place name or name are arranged in the data that should manage or be the alphabetic flag etc. of the Cantonese used, the area of representative and the name of mark or the situation that the address mixing exists in English more with Hong Kong.According to above-mentioned formation, because the character data of Chinese also can be handled with the character code of letter, so, the data of Chinese not only, the female mark of loigature, phonetic mark can both centralized managements, thereby the data-handling efficiency of Chinese data processor is improved.
Can make other purposes of the present invention, feature and advantage clearer according to following record, can understand benefit of the present invention with reference to the following explanation of accompanying drawing.
Description of drawings
Fig. 1 is the block scheme of an example of expression formation of the present invention.
Fig. 2 is the block scheme of other examples of expression formation of the present invention.
Fig. 3 is the block scheme of the other example of expression formation of the present invention.
Fig. 4 is the block scheme of the other example of expression formation of the present invention.
Fig. 5 is the block scheme of the other example of expression formation of the present invention.
Fig. 6 is the power block diagram that the summary of control square frame of the Chinese data processor of expression one embodiment of the present of invention constitutes.
Fig. 7 is the pie graph of kanji code-Pinyin code map table.
Fig. 8 is the arrangement key diagram of the kanji code (GB code) stipulated in the CNS.
Fig. 9 is the allocation table key diagram of Pinyin code.
Figure 10 is the key diagram of the staging area used of memory scan key.
The key diagram of the data structure of Figure 11 operating area that to be expression use the kanji code sequence transformation for the Pinyin code sequence.
Figure 12 is the key diagram that expression is stored in the data structure in the main storage means.
Figure 13 is that expression is used for the key diagram of the demonstration of the data that memory scan obtains with buffer structure.
Figure 14 is the key diagram that expression writes the inputoutput data form of independent variable buffer zone, rreturn value buffer zone.
Figure 15 is the pie graph of Pinyin code-letters shift (LTRS) table (letter-Pinyin code map table).
Figure 16 is the process flow diagram of the entire process process of expression address typing management system.
Figure 17 is the process flow diagram of the retrieval process process of expression address typing management system.
Figure 18 is that expression is the alphabetic literal sequence transformation process flow diagram of the processing procedure of Pinyin code.
Figure 19 is the process flow diagram that the kanji code of the data of being taken out is transformed to the processing procedure of Pinyin code.
Figure 20 is the process flow diagram that judges whether to comprise the processing procedure of retrieving key when representing a data conversion of taking out for Pinyin code.
Figure 21 is the process flow diagram that expression is transformed to kanji code the treatment step of Pinyin code.
Figure 22 is that the process flow diagram that whether comprises the processing procedure of retrieving key in " name " item is judged in expression.
The key diagram of the picture when Fig. 2 is the data retrieval of expression address typing management system.
Figure 24 is the process flow diagram of step of the result for retrieval display process of expression address typing management system.
Figure 25 is the process flow diagram of the step of appending the input processing of expression address typing management system.
The key diagram of the picture when Figure 26 is the data input of expression address typing management system.
The key diagram of the picture when Figure 27 is the data input of the original address typing management system of expression.
That Figure 28 represents is other embodiment of the present invention, is the block scheme of the formation of expression Chinese data processor.
The key diagram of the picture when Figure 29 is the data input of expression address typing management system.
Fig. 3 O is the process flow diagram of the retrieval process process of expression address typing management system.
To be expression be transformed to the phonetic of the Chinese character part of data word sequence with alphabetic flag, be the kanji code sequence transformation key diagram of the data structure of the used operating area of alphabetical word sequence Figure 31.
Figure 32 is that expression is the Pinyin code sequence transformation process flow diagram of the processing procedure of alphabetic literal sequence.
Figure 33 is the key diagram that expression writes the inputoutput data form of independent variable buffer zone, rreturn value buffer zone.
That Figure 34 represents is other embodiment of the present invention, is the block scheme of the formation of expression Chinese data processor.
Figure 35 is the process flow diagram of the result for retrieval display process process of expression address typing management system.
Figure 36 is the process flow diagram of the result for retrieval display process process of expression address typing management system.
Figure 37 is the key diagram of the picture of the grouping that shows of ecbatic.
Figure 38 is the key diagram of the picture of the grouping that shows of the original result of expression.
Figure 39 is the key diagram of formed sorted table T1 in the expression operating area.
Figure 40 is the key diagram of formed sorted table T2 in the expression operating area.
Figure 41 is the process flow diagram that the treatment step of sorted table T1 is made in expression.
Figure 42 is the process flow diagram that the treatment step of sorted table T2 is made in expression.
That Figure 43 represents is other embodiment of the present invention, is the block scheme of the formation of expression Chinese data processor.Figure 44 be expression arrange replace it before and arrange the key diagram of the data ordering after replacing it.
Figure 45 is Chinese character mark, the rank of its kanji code and the key diagram of phonetic mark of each data shown in Figure 44.
Figure 46 be original arrangement replace handle in, arrange the key diagram of replacing the data ordering after handling.
That Figure 47 represents is other embodiment of the present invention, is the block scheme of the formation of expression Chinese data processor.
The key diagram of Figure 48 2 kinds of Chinese written language sequences that to be expression retrieve with the retrieval key of a phonetic.
The key diagram of the data structure of Figure 49 operating area that to be expression use the kanji code sequence transformation for the Pinyin code sequence.
That Figure 50 represents is other embodiment of the present invention, is the block scheme of the formation of expression Chinese data processor.
Figure 51 is the key diagram of expression literal input picture.
Figure 52 is the key diagram of the object lesson of expression onomatopoeia word.
Embodiment
(embodiment 1)
According to Fig. 6 to Figure 27 one embodiment of the present of invention are described as follows, are illustrated as example with the situation of using Chinese data processor to carry out address typing management in the present embodiment.
As shown in Figure 6, Chinese data processor is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, central processing unit 24, operation storer 25, shows with Chinese characters font ROM26, display device 27 and letter-Pinyin code map table storer 28.
Input media 21 is made of keyboard and electronic pen device, and it is used for allowing the operator select the function of address typing management usefulness or import data such as new name and address, and other indication and data etc. are also from input here.
Data storage device 22 is made of the jumbo outer cryopreservation device of for example hard disk and so on, and in the present embodiment, the address logging data just is stored in here.As shown in figure 12, the structure of the address logging data of being stored in this data storage device 22 is that " name ", " phone ", " address ", " remarks " 4 information are managed as an event data, and on each event data the affix management number (001,002 ... n).In addition, in the data storage device 22, not only store the address data, also storing other information certainly.
Kanji code-Pinyin code map table storer 23 is being stored and is being used for kanji code is transformed to the kanji code-Pinyin code map table of Pinyin code.So-called kanji code is the GB code of for example Chinese character code specifications, has represented the arrangement of GB code among Fig. 8; So-called phonetic is the pronunciation of representing the Chinese characters mark with letter, in the present embodiment, as shown in Figure 9, the Pinyin code of pinyin syllable " a " is that " 001 ", " ai " are " 002 ", and from " a " to " zuo " distributes the Code Number from " 001 " to " 461 ".
What Fig. 7 was represented is kanji code-Pinyin code map table 50, and wherein, the candidate code that is fit to the Pinyin code of each kanji code according to circumstances is aligned to the 4th candidate code.In this kanji code-Pinyin code table 50, Pinyin code equals the no candidate code of 0 expression, having corresponding to a Chinese character under the situation of a plurality of phonetic candidate codes, kanji code-Pinyin code map table 50 in those candidate codes the phonetic of general usefulness as the 1st candidate code.This kanji code-Pinyin code map table 50 is no matter how the rank of Chinese Ideogram Coding System all makes Pinyin code for whole kanji codes, in addition, in the drawings, shows in the lump corresponding to the Chinese character and the phonetic of each kanji code of being put down in writing.
Central processing unit 24 carries out the control by the selected retrieval of above-mentioned input media 21 and input, Presentation Function, and this central processing unit 24 and above-mentioned kanji code-Pinyin code map table 50 constitutes kanji code of the present invention-Pinyin code converting means.And, central processing unit 24 relies on this kanji code-Pinyin code converting means with Pinyin code and without the information in the kanji code management data memory storage 22, detailed control content about this central processing unit 24 is described with reference to process flow diagram in the back, and indexing unit of the present invention is formed in this central processing unit 24 and operation is used in the memory storage 25.
Operation is to handle retrieval with memory storage 25, show, the temporary transient apparatus for temporary storage that uses during each function of input, adopt semiconductor memory, use in the memory storage 25 in this operation, be provided with the aforesaid staging area B1 (with reference to Figure 10) that the memory scan key is used, the operating area B2 (with reference to Figure 11) that the kanji code sequence transformation is used for the Pinyin code sequence, the demonstration that temporary result for retrieval is used data buffer area B3 (with reference to Figure 13), point to the indicator of data storage device 22, be provided with independent variable buffer zone 110 (with reference to Figure 14 (a)) in addition, with rreturn value buffer zone 111 (with reference to Figure 14 (b)), the temporary transient data of using are stored in wherein when input and output independent variable and rreturn value.
Show with Chinese characters font ROM26 be storage read as on display device 27 demonstration based on the Chinese character of kanji code and the Chinese character style of usefulness special-purpose storer.
Display device 27 is under the control of central processing unit 24 operator to be shown various message, shows the display device that result for retrieval is used.
As shown in figure 15, letter-Pinyin code the map table 52 of letter-Pinyin code map table storer 28 storage Pinyin codes and 1: 1 correspondence of alphabetic literal sequence, and, from Pinyin code during to letters shift (LTRS), this letter-Pinyin code map table 52 just becomes Pinyin code-letters shift (LTRS) table, uses to some extent in embodiment 2 about this point.
The action of the address management system in the above-mentioned Chinese data processor is described below, at first comes the molar behavior of illustrative system with reference to the process flow diagram of Figure 16.
In step (being designated hereinafter simply as S) 1, central processing unit 24 at first shows the picture (not shown this picture) of urging the operator to select some processing of " retrieval ", " appending input ", " end " on display device 27.In case the operator has carried out the selection of operating with input media 21, just carry out suitable retrieval process (S2) or append input processing (S3) or end process (S4) according to this selection.At this, under the situation of the retrieval process of carrying out S2,, carry out result for retrieval display process (S5) with that in order to show result for retrieval, and, turn back to S1 after be through with S3 and the S5.On the other hand, selected at S1 under the situation of " end ", implemented the end process of S4, processing procedure has been finished.
Next, illustrate that with each process flow diagram of Figure 17, Figure 24, Figure 25 the retrieval process among the above-mentioned S2, the result for retrieval display process among the S5, the input of appending among the S3 handle respectively.
At first the process flow diagram with reference to Figure 17 illustrates retrieval process.
At S11, constitute the Chinese character word sequence of retrieval key (retrieve data) by the phonetic input of alphabetic flag with input media 21 by the operator, the retrieval key of being imported is displayed in the display field on the 51b of picture shown in Figure 23 51, confirm to use for the operator, simultaneously, the retrieval key of input being stored in storage operation shown in Figure 10 uses in the buffer zone 101 with the retrieval key letter of the staging area B1 of the retrieval key in the memory storage 25.Here, the phonetic that constitutes the word sequence of retrieval key is alphabetic flag, in the picture 51a of Figure 23, the word sequence D that the word sequence C that the Chinese written language sequence A is equivalent to " name ", " address " that the Chinese written language sequence B is equivalent to Japanese, the Chinese of Japanese is equivalent to " remarks " of Japanese, Chinese is equivalent to " phone " of Japanese.
At S12, be the alphabetic literal sequence transformation of being imported the Pinyin code sequence, and be stored in alphabetical the using in the buffer zone 102 of retrieval key that storage operation shown in Figure 10 is used the staging area B1 of the retrieval key in the memory storage 25.The detailed step that the back should be moved with reference to the flow chart description of Figure 18.
At S13, the operator selects the project that will select with the form shown in the 51a in the picture shown in Figure 23 51, at S14, in order from data storage device 22, to read institute's canned data, be the content setting of the indicator of specifying sense information the 1st group of canned data.This indicator is set at operation with in the memory storage 25, is used to refer to from the beginning which part of number in the data of being stored in the given data memory storage 22.As previously mentioned, the structure of the data of being stored in the data storage device 22 as shown in Figure 12.
At S15, operation is carried out initial setting with the demonstration of the temporary result for retrieval in the memory storage 25 with data buffer B3, the structure of this demonstration usefulness buffer zone as shown in Figure 13, it stores " name ", " address ", " phone ", " remarks " these 4 information as an event data, and, show with the size of buffer zone B3 number of packages to change according to the data of being stored.
At S16, check on the position of the indicator indication in the data storage device 22 and whether storing data, if data are arranged, just it is read out, and make carbon copies in the metadata buffer zone 103 of the operating area B2 that kanji code sequence-Pinyin code sequence transformation shown in Figure 11 uses at S17.
At S18, each kanji code of the kanji code sequence of the data of institute's unloading on the metadata buffer zone 103 of operating area B2 is transformed to Pinyin code, making becomes the from the 1st to the 4th Pinyin code sequence, and is written to the 1st candidate buffer zone 104~the 4th candidate buffer zone 107.The detailed step of this action will be described with reference to the process flow diagram of Figure 19 in the back.
At S19, check on whether retrieval key phonetic in the staging area B1 shown in Figure 10 be included in the picture shown in Figure 23 51 in the 1st candidate buffer zone 104 shown in Figure 11~the 4th candidate buffer zone 107 with the Pinyin code sequence of buffer zone 102 the 51a in the item selected, if in not being included in, just turn back to S16.If be included in wherein, just carry out the processing of S20.The detailed step of this action will be described with reference to the process flow diagram of Figure 20 in the back.
At S20, the data supplementing of the metadata buffer zone 103 in the operating area B2 shown in Figure 11 is arrived demonstration shown in Figure 13 with after the buffer zone B3, turn back to S16.
After this, repeat the processing of S16~S20,, just judge that the retrieval to institute's canned data in the data memory storage 22 finishes, thereby finish retrieval process, then carry out the result for retrieval display process of the S5 of Figure 16 if do not have data at S16.
At this, before appending of result for retrieval display process that S5 is described and S3 imported treatment step, S12, the S18 in the process flow diagram of the above-mentioned Figure 17 of explanation, the processing of S19 earlier.
At first the process flow diagram with Figure 18 illustrates that S12's is the treatment step of Pinyin code sequence to the alphabetic literal sequence transformation.
At S21, retrieval key in staging area B1 shown in Figure 10 letter is included in the maximum sequence of number of words in letter shown in Figure 15-Pinyin code map table 52 with retrieval the alphabetic literal sequence of the beginning of buffer zone 101, and obtains the Pinyin code of correspondence.At S22, resulting Pinyin code is sent to the interior retrieval key phonetic of staging area B1 buffer zone 102.
At S23, judge whether the step that the alphabetical alphabetic literal sequence with buffer zone 101 of the retrieval key in the B1 of staging area all is transformed to Pinyin code finishes, and if all conversion finishes, treatment step just finishes, enter the S13 of Figure 17, if all conversion does not finish, then turn back to step S21, repeat the step from S21 to S23, the unclosed section processes of conversion intact after, enter the S13 of Figure 17.
Then, the process flow diagram with Figure 19 illustrates that the kanji code of the data that the handle of S18 takes out among Figure 17 is transformed to the treatment step of Pinyin code.
At S31, the 1st candidate buffer zone 104~the 4th candidate buffer zone 107 of the operation area B2 that operation shown in Figure 11 is used with the sequence of the kanji code in the memory storage 25-Pinyin code sequence transformation is initialized as the state of no datat.At S32, sensing is written in the kanji code sequence of conversion unit (being also referred to as metadata) of metadata buffer zone 103 and answers the indicator of a Chinese character of conversion to carry out initial setting, and indicator is set in the 1st word of metadata.At S33, whether the value of check indicator refers to if do not point to the end, just enter step S34 after the last kanji code of the kanji code sequence of conversion unit.
At S34, see whether the value of a word of indicator indication is 0, if be not 0, just think kanji code, enter S35, if 0, just think to enter S37 by no kanji code.At S35, using the map table 50 (with reference to Fig. 7) that Chinese character is transformed to Pinyin code is the metadata that indicator points to that the kanji code sequence transformation is a Pinyin code.The detailed step of this action is described in the back with reference to the process flow diagram of Figure 21.
At S36, the Pinyin code that obtains at S35 is deposited in the suitable position of the 1st candidate buffer zone 104~the 4th candidate buffer zone 107 of operation area B2, at S37, on indicator, add after 1, turn back to S33.Before intact the whole conversion of kanji code sequence, repeat S33~S37, at S33, if the value of judgement indicator refers to the end at the last kanji code of the kanji code sequence of conversion unit, just the kanji code sequence all is transformed to Pinyin code, with regard to end process, enter the S19 of Figure 17 then.
Then illustrate that at this process flow diagram the metadata that the indicator of above-mentioned S35 is pointed to is that the kanji code sequence transformation is the treatment step of Pinyin code sequence with Figure 21.Figure 14 (a) (b) represents operation respectively with the independent variable buffer zone 110 in the memory storage 25 and each data mode of rreturn value buffer zone 111, and the number of the candidate in independent variable buffer zone 110, the rreturn value buffer zone 111 separately all is 2 bytes with the 4th candidate code of regional 111d, Pinyin code with regional 111e with the 3rd candidate code of regional 111c, Pinyin code with the 2nd candidate code of regional 111b, Pinyin code with the 1st candidate code of regional 111a, Pinyin code.
At S41, independent variable 110 shown in Figure 14 (a) (promptly will be transformed to the kanji code of Pinyin code) is arranged in the register, at S42, calculate address to kanji code shown in Figure 7-Pinyin code map table 50 according to the value of set register, the formula of appropriate address that calculates kanji code-Pinyin code map table 50 is as follows:
The beginning address of corresponding address=kanji code-Pinyin code map table+
8 * [(upper 1 byte of kanji code-20H) * 94+ (the next 1 byte of kanji code-20H)]
At S43, initial setting rreturn value buffer zone 111, and the counter n zero clearing that the candidate counting number is used, at S44, judge whether to exist the 1st suitable candidate code by kanji code-Pinyin code map table 50 of Fig. 7, if there is no, just enter S56, if present, enter S45.At S45, counter n adds 1, obtains the 1st candidate Pinyin code at S46 from kanji code-Pinyin code map table 50, and it is sent to the regional 111b of the 1st candidate code of rreturn value buffer zone 111.
At S47, judge whether to have the 2nd suitable candidate code by kanji code-Pinyin code map table 50, if there is no, just enter S56, if present, enter S48.At S48, counter n adds 1, obtains the 2nd candidate Pinyin code at S49 from kanji code-Pinyin code map table 50, and it is sent to the regional 111c of the 2nd candidate code of rreturn value buffer zone 111.
At S50, judge whether to have the 3rd suitable candidate code by kanji code-Pinyin code map table 50, if there is no, just enter S56, if present, enter S51.At S51, counter n adds 1, obtains the 3rd candidate Pinyin code at S52 from kanji code-Pinyin code map table 50, and it is sent to the regional 111d of the 3rd candidate of rreturn value buffer zone 111.
At S53, judge whether to have the 4th suitable candidate code by kanji code-Pinyin code map table 50, if there is no, just enter S56, if present, enter S54.At S54, counter n adds 1, obtains the 4th candidate Pinyin code at S55 from kanji code-Pinyin code map table 50, and it is sent to the regional 111e of the 4th candidate code of rreturn value buffer zone 111.
At S56, the value of counter n is sent to the regional 111a of number of the candidate of rreturn value buffer zone 111, processing so far finishes.
Next, judge in the Pinyin code sequence of data of selected project of the S19 among Figure 17 whether comprise the treatment step of retrieving key with the process flow diagram explanation of Figure 20.
At S61, judge whether to have selected " name " to be searching object, if chosen, just enter S62, if not selected, then enter S64.At S62, carry out the processing from S70 to S80 shown in Figure 22 described later, check in the Pinyin code sequence of name whether comprise the Pinyin code sequence of retrieving key.At S63, judge whether comprise the retrieval key in " name " according to the checked result of S62, if in being included in, just enter the S20 of Figure 17, otherwise enter S64.
Equally, in carrying out the Pinyin code sequence in " address ", S64~S66 whether comprises the judgement of the Pinyin code sequence of retrieving key; In carrying out the Pinyin code sequence of " remarks ", S67-S69 whether comprises the judgement of the Pinyin code sequence of retrieving key.And, be judged as under the situation about comprising at S66, S69, also enter the S20 of Figure 17.
Illustrate in the name of judging above-mentioned S62 whether include the treatment step of retrieving key with Figure 22 here.And, also all identical therewith to the explanation of the address of S65, S68, remarks, for for simplicity, omitted explanation to them at this.In this is handled, used two indicators, they are made as indicator P1P2.
At S70, the literal number of the name data among the B2 of operating area is arranged among the register m2, at S71, indicator P2 is set in the 1st literal place of operating area B2.At S72, the crucial yardage of retrieval is arranged among the register m1, at S73, indicator P1 is set in the 1st Pinyin code place of retrieval key.
At S74, judge whether the value of register ml is 0, if register ml is not 0, just enter S75, the value of register m1 and the value of register m2 are compared, if the value of register m2 greater than the value of register m1, just enters S76.
At S76, whether the Pinyin code of judging indicator P1 indication is included in the Pinyin code corresponding to the literal of indicator P2 indication, under the situation in not being included in, enter S77, indicator P2 takes a step forward, and at S78 the value of register m2 is subtracted after 1, turns back to S74.Begin to retrieve the next literal of name data in this step.
On the other hand, under the Pinyin code of indicator P1 indication is included in corresponding to the situation in the Pinyin code of the literal of indicator P2 indication, enter S79, indicator P1P2 takes a step forward respectively, and at S80 the value of register m1m2 is subtracted respectively after 1, turn back to S74.Next Pinyin code in the Pinyin code sequence that this step begins to retrieve the next literal of name data and retrieve key.
After this, at S74, when the value of register m1 became 0, the Pinyin code of retrieval key just all was comprised in the word sequence of name, in case comprised the retrieval key, just enters S63, enters S20 from S63 again.
On the other hand,,, that is to say in remaining literal to include the Pinyin code of retrieving key if the value of register m2 becomes less than the value of register m1 at S75, so, just think not comprise the retrieval key, and enter S63, enter S64 again.Judge whether to include the retrieval key like this.
Below, the step of the result for retrieval display process of the S5 that implements the Figure 17 after the retrieval process is described with the process flow diagram of Figure 24.
At S81, the result for retrieval display part 51c initialization of picture shown in Figure 23 51; At S82, the position that should show indication is the value _ initialization of the indicator 1 of which row; 83, the counter M that initial setting is counted the number of packages of data presented on the picture; At S84, use the demonstration of memory storage 25 with taking out an event data the buffer zone B3 from operation shown in Figure 13; At S85, judge whether to have taken out data at S84, if taken out data, enter S86, the footline of the result for retrieval display part 51c of the value of check indicator 1 _ whether be picture 51.Be that footline just enters S87, the result for retrieval display part 51c of picture 51 is scrolled up delegation, if not footline, then enter S88, indicator 1 adds 1.
At S89, the data presentation of being taken out on the row of indicator 1 indication; At S90, counter M adds 1; At S91, check that whether counter M has reached the line number of result for retrieval display part 51c, if also do not reach, just turns back to S84, otherwise just enters S92.At S92, urge the operator to provide down the indication of the demonstration of one page, up to before one page 51d is assigned to input media 21 down, stop display process, after specifying, turn back to S83.And, repeat the processing from S84 to S92, at S85, when judging when S84 has taken out data, just think and taken out total data with buffer zone that display process finishes, and enters the S1 of Figure 17 from showing.
With such processing, shown in the result for retrieval display part 51c of the picture 51 of Figure 23, result for retrieval just is revealed.Selecting " address " on this picture 51 is project, and input " tianjnshi " is extracted the people's who stays in Tianjin name together and shown with telephone number for the retrieval key.
The treatment step that appends input of the S3 of Figure 17 is described with the process flow diagram of Figure 25 below.
At first, at S101, obtain total number of packages X of the data of being stored in the data storage device 22; Then, at S102, carry out the initialization of picture and the initial setting of operating area B2 shown in Figure 11; At S103, as shown in figure 26, show the management number that appends the input data on the management number column 54a on the picture 54 when appending input, the management number is that the total number of packages X that is stored in the data in the data storage device 22 adds 1.Figure 26 picture 54 on, Chinese written language sequence E is equivalent to " the management number " of Japanese, on picture 54, the management number be " 003 ".For the purpose of reference, Figure 27 has represented original picture when appending input.
At S104, cursor is presented in the name input field 54b of back of Chinese written language sequence A of " name " that being connected on the picture 54 be equivalent to Japanese, expression is to be in the state that can import name data; At S105, the urgency operator imports data or makes the function indication, and at this moment, the operator can carry out the input of " name " with input media 21, also can select functions such as what is called " registrations of data ", " selection of cuit ".
At S105, if selected " registrations of data ", just enter S106, begin to carry out the data registry reason; At S109, the data supplementing of being stored in the metadata buffer zone 103 of operation with the operating area B2 in the memory storage 25 is registered in the data storage device 22.After this,, total number of packages X of data is added 1, turn back to S102 then at S110.
On the other hand,,, just enter S107, begin to carry out the selection processing of cuit if selected " selection of cuit " at S105; At S111, cursor is moved along following order, that is: the continue remarks input field 54e → name input field 54b of the telephone number input field 54d → Chinese written language sequence C that continues of the address input field 54c → Chinese written language sequence D that continues of the name input field 54b → Chinese written language sequence B that continues of Chinese written language sequence A, select cuit, turn back to S105 then.
At S105, selection function not, and under the situation of S108 input, at S112, the position of the data presentation of being imported, when upgrading cursor position, on the suitable position of the metadata buffer zone 103 of the operation area B2 of data storage in operation usefulness memory storage 25 at the cursor of picture.
In case selected " end " at S105,, turned back to S1 then in the S113 end process.Append the address logging data like this.
As mentioned above, in the Chinese data processor of present embodiment, central processing unit 24 usefulness kanji code-Pinyin code map table 50 stored in the data storage device 22 the address logging data be transformed to Pinyin code, and, carry out the display process of retrieval process and result for retrieval according to this Pinyin code management data.
Use original Chinese data processor, when entry information such as the address in management " name ", " address " etc., with common data management based on the GB code of stipulating in the Chinese national standard, though can press the sequence management of phonetic to one-level code, but because can not be by the sequence management Chinese characters of level 2 of phonetic, so, managing with phonetic under the situation of address entry information, different with the word sequence in name that shows with Chinese character or address, must import the pronunciation of its word sequence with phonetic, therefore, bother very much.
To this, in the Chinese data processor of present embodiment, no matter which rank Chinese character is in, an input Pinyin just can be retrieved the Chinese character word sequence that extract when for example retrieving in the address logging data of data storage device 22 stored.But, because being encoded, phonetic changed, and carry out retrieval process with the phonetic of numeralization not and compare, because data are compressed states, so, the needed time of retrieval is shortened, thereby not only can improve operability, and, its processing power can be improved especially.
In addition, in the present embodiment, retrieve for the project of appointments such as " name ", " address ", and needn't retrieve unnecessary project simultaneously.
[embodiment 2]
Drawing and the Figure 28 to Figure 33 used according to the explanation that is used for previous embodiment 1 are described as follows other embodiment of the present invention.For the purpose of illustrative ease, for having the same symbol of parts marks of said function, and omit its explanation with the parts shown in the aforesaid embodiment.
Under the situation of the name of managing usefulness Chinese or certificate address information, exist and to want to manage simultaneously in English the name of mark or the situation in address, American-European place name or name are exactly the one example, and are that the alphabetic flag of the Cantonese used in the region of representative also is an example more with Hong Kong.In the Chinese data processor of present embodiment, its starting point is that the phonetic letter in English of Chinese comes mark, and its purpose is can both carry out the centralized management by the letter of name or certificate address information no matter be Chinese or English.
The difference of the Chinese data processor of present embodiment and aforesaid embodiment 1 is the retrieval process of data, and is identical about the display process of result for retrieval with previous embodiment 1.
As shown in figure 28, the Chinese data processor of present embodiment is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation memory storage 25, shows with Chinese character style ROM26, display device 27, central processing unit 30 and phonetic-letters shift (LTRS) table storer 31.Wherein input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation are same with Chinese character style ROM26, display device 27 with the Chinese data processor of embodiment 1 with memory storage 25, demonstration.But input media 21 can not only can also be imported " name " and address etc. by the word sequence with alphabetic flag by the Chinese character word sequence, and Figure 29 has represented the picture 55 in data when input of input media 21.
Central processing unit 30 carries out by the retrieval of above-mentioned input media 21 selections and the control of input function, central processing unit 24 at embodiment 1 is the information conversion in the data storage device 22 Pinyin code, and it is managed, implement retrieval process with the treatment step shown in the process flow diagram of Figure 17, and in the central processing unit 30 of present embodiment (back will be described in detail), be the information conversion in the data storage device 22 Pinyin code earlier, further be transformed to the character code of letter again, character code with letter carries out data management, and implements retrieval process.
Pinyin code-letters shift (LTRS) table storer 31 is used for being stored as the Pinyin code-letters shift (LTRS) table 52 that Pinyin code is transformed to the alphabetic literal sequence.As shown in figure 15, the formation of this Pinyin code-letters shift (LTRS) table 52 is alphabetic literal sequence and the corresponding one by one arrangement of each Pinyin code.
The same with aforesaid embodiment 1, operation is when handling each function of retrieval, demonstration, input with memory storage 25, by the memory storage of temporary transient usefulness, what adopted is semiconductor memory, it be provided be used for the memory scan key staging area B1 (with reference to Figure 10), be used for the demonstration of temporary transient search result storage with buffer zone B3 (with reference to Figure 13), point to the indicator of data storage device 22, also have independent variable buffer zone and rreturn value buffer zone in addition; Also be provided with the operating area B4 (with reference to Figure 31) that kanji code sequence-alphabetic literal sequence transformation of the operating area B2 (with reference to Figure 11) that kanji code sequence-Pinyin code sequence transformation of replacing previous embodiment 1 uses is used simultaneously.
The step of the retrieval process of the address management system in the Chinese data processor of present embodiment then is described with the process flow diagram of Figure 30.
At S111, constitute the word sequence of retrieval key by the phonetic input of alphabetic flag with input media 21 by the operator, the retrieval key of being imported is displayed in the display field on the 51b of picture shown in Figure 23 51, confirm to use for the operator, simultaneously, the retrieval key of input being stored in storage operation shown in Figure 10 uses in the buffer zone 101 with the retrieval key letter of the staging area B1 of the retrieval key in the memory storage 25.
At S112, the operator selects the project that will select with the form shown in the 51a in the picture shown in Figure 23 51, at S113, in order from data storage device 22, to read institute's canned data, be the content setting of the indicator of specifying sense information the 1st group of canned data.This indicator is set at operation with in the memory storage 25, is used to refer to from the beginning which part of number in the data of being stored in the given data memory storage 22.
At S114, operation is carried out initial setting with the demonstration of the temporary result for retrieval in the memory storage 25 with data buffer B3.At S115, check on the indicated position of indicator in the data storage device 22 whether storing data, if data are arranged, just it is read out, and make carbon copies in the metadata buffer zone 115 of operating area B4 shown in Figure 31 at S116.
At S117 and S118, each kanji code of the Chinese character word sequence of the data of institute's unloading on the metadata buffer zone 115 of operating area B4 is transformed to Pinyin code, on the other hand, further each Pinyin code is transformed to letter, making becomes the from the 1st to the 4th alphabetical sequence (alphabetic literal sequence), and is written to the 1st candidate buffer zone 116~the 4th candidate buffer zone 119.In these actions,, omitted because the processing of the S18 among S117 and the aforesaid embodiment is identical.On Figure 32 S118 has been represented its detailed steps, will have been described in the back this.
At S119, check on the 51a of the picture shown in Figure 23 51 in whether retrieval key letter in the staging area B1 shown in Figure 10 be included in operating area B4 shown in Figure 31 with the alphabetic literal sequence of buffer zone 101 the 1st candidate buffer zone 116~the 4th candidate buffer zone 119 in the item selected, if in not being included in, just turn back to S115.If be included in wherein, just carry out the processing of S120.At S120, the data supplementing of the metadata buffer zone 115 of Figure 31 after demonstration shown in Figure 13 is used on the buffer zone B3, is turned back to S115.
After this, repeat the processing of S115~S120,,, just judge that the retrieval to institute's canned data in the data memory storage 22 finishes, thereby stop retrieval process, then enter the S5 of Figure 16 if there are not data at S115.
Next, the process flow diagram with Figure 32 illustrates that be the treatment step of alphabetic literal sequence to the Pinyin code sequence transformation of the S118 among Figure 30.Figure 33 (a) (b) represents that respectively operation is with the independent variable buffer zone 110 in the memory storage and each data mode of rreturn value buffer zone 111.
At S121, rreturn value buffer zone 111 initialization shown in Figure 33 (b); At S122, calculate address to Pinyin code shown in Figure 15-alphabetic literal sequence transformation table 52 from independent variable; At S123, each zone of making carbon copies rreturn value buffer zone 111 with the word sequence (6 words) of letter record; At S124, obtain the number of words of alphabetic literal sequence, and the number of words that it is deposited in the word sequence in the rreturn value buffer zone 111 enters S119 then with among the regional 111a.
As mentioned above, in the Chinese data processor of present embodiment, central processing unit 30 usefulness kanji code-Pinyin code map table 50 is transformed to Pinyin code to the address logging data of being stored in the data storage device 22 from kanji code, then Pinyin code further is transformed to the word sequence of letter with Pinyin code-alphabetic literal sequence transformation table 52, character code according to letter comes management data again, and the line retrieval of going forward side by side is handled.
Therefore, use Chinese data processor, when logging data such as the address in management " name ", " address " etc., it in America and Europe's name or place name or with Hong Kong the alphabetic flag etc. of the Cantonese used, the region of representative and in English under the name or situation that the address mixes of mark more, owing to can manage with the character code of letter, so, can retrieve content simultaneously with alphabetic flag.As a result, the thing that the data of for example data markers of Chinese not only, and alphabetic flag, phonetic mark mix also can both a management, and what make address typing management becomes especially easy, and, can improve the operability in the Chinese data processor.
[embodiment 3]
Explanation according to previous embodiment is described as follows other embodiment of the present invention at used drawing and Figure 34 to Figure 42.For the purpose of illustrative ease, for having the same symbol of parts marks of said function, and omit its explanation with the parts shown in the aforesaid embodiment.
Under the situation in the name of managing Chinese with phonetic or address, there is the word that frequently appears at the word sequence beginning (to be equivalent to represent the phonetic of initial consonant, for example " z ", " c ", " s ") and (this is equivalent to represent the part of the phonetic of simple or compound vowel of a Chinese syllable can not appear at the word of word sequence beginning, for example " i ", " u ", " v ").Therefore, name or certificate address information are being categorized as under the situation of the group from alphabetical A to Z the group that will occur getting the many groups of information and almost not have information, inconvenience all when management and later retrieval according to the phonetic of the beginning of the word sequence of each information for example.
Yet, in the Chinese data processor of present embodiment, be not to come information is classified according to the phonetic that literal starts, be automatically to make the resulting canned data number equalization as much as possible of respectively organizing of classification when dividing into groups, and, how to carry out to such an extent that grouping all clearly is shown to the operator to information, be easy to manage.
According to the Chinese data processor of present embodiment, aspect retrieval process, the display process of its retrieve data is different from aforesaid embodiment 1.
As shown in figure 34, the Chinese data processor according to present embodiment is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation memory storage 25, shows with Chinese characters font ROM26, display device 27, letter-Pinyin code map table storer 28 and central processing unit 33.
Wherein input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation are identical with Chinese character style ROM26, display device 27, letter-Pinyin code map table storer 28 with the Chinese data processor of embodiment 1 with memory storage 25, demonstration.
Central processing unit 33 carries out by the retrieval of above-mentioned input media 21 selections and the control of input function, the central processing unit 24 of embodiment 1 be with the treatment step shown in the process flow diagram of Figure 24 implement S5 the result for retrieval display process, and the central processing unit 33 of present embodiment is to implement the result for retrieval display process according to the process flow diagram of Figure 35, Figure 36.
The same with aforesaid embodiment 1, operation is to handle retrieval with memory storage 25, show, the temporary transient apparatus for temporary storage that uses during each function of input, adopt semiconductor memory, use in the memory storage 25 in this operation, be provided with the aforesaid staging area B1 (with reference to Figure 10) that the memory scan key is used, the operating area B2 (with reference to Figure 11) that kanji code sequence-Pinyin code sequence is used, the demonstration that temporary result for retrieval is used data buffer area B3 (with reference to Figure 13), point to the indicator of data storage device 22, be provided with independent variable buffer zone and rreturn value buffer zone in addition, the buffer zone of promising making sorted table T1T2 described later also is set in addition.
Below, the step of the result for retrieval display process in the management system of above-mentioned address is described with Figure 35, Figure 36.
At S131, make sorted table T1 shown in Figure 39, classify according to the Pinyin code of the 1st Chinese character of " name " project with the data of being stored in the buffer zone B3 showing, and count, the treatment step when this sorted table T1 making is described with Figure 41 in the back.
At S1 32, make the such sorted table T2 of Figure 40 according to sorted table T1, obtain the Pinyin code of the 1st Chinese character of " name " project of the 1st event data in selected each group of operator and the number of packages of corresponding data, the treatment step of making this sorted table T2 will be described with Figure 42 in the back.
At S133,, shown in the picture 55 as shown in figure 37, demonstrate through each index organized of classification and the number of packages of data with reference to sorted table T2.In this picture 55, Chinese written language sequence F is equivalent to " classification " in the Japanese, and Chinese written language sequence G is equivalent to " number of packages " in the Japanese, and Chinese written language sequence H is equivalent to " total " in the Japanese.For as a reference, represented to show the picture of the number of packages of the index of each group under the situation of original mechanically classification and data on Figure 38.
At S134, urge the operator to select a group according to the picture of Figure 37, after the input group selection indication, enter S135, at S135, obtain the minimum value and the maximal value of value of Pinyin code of the 1st Chinese character of name of the data of selected group.
At S136, the result for retrieval display part 51c initialization of picture shown in Figure 23 51; At S137, which indicator 1 initialization the position that should show indication be; At S138, the counter M that the number of packages of the data shown on the picture 51 is counted is carried out initial setting.
At S139, from demonstration shown in Figure 13 taking-up one event data the buffer zone B3; At S140, if just enter S141 with having taken out data in the buffer zone from showing; At S141, with reference to the maximal value and the minimum value of the Pinyin code of obtaining at S135, judge whether the data of being taken out are to be within this scope, if do not correspond in this scope, just turn back to S139, if in this scope, just enter S142.
At S142, whether the value of check indicator 1 is the footline of the result for retrieval display part 51a of picture 51, if footline just enters S143, if not footline, just enters S144, and add 1 on indicator 1.
At S145, the data presentation of being taken out on the row of indicator 1 indication; At S146, counter M adds 1; At S147, check whether counter M has reached the line number of viewing area, if do not reach, turns back to S139, if reached, then enters S148.
At S148, urge the operator to provide showing the indication of one page down, up to stopping display process before the following one page 51d in input media 21 assigned picture 51, and after appointment, turn back to S138.And, do not take out data as if being judged as at S139, just stop display process, and enter the S1 among Figure 16 at S140
At this, the treatment step of the sorted table T1 (with reference to Figure 39) of the S131 in the construction drawing 35 is described with the process flow diagram of Figure 41.
At S151, making the buffer zone initialization that sorted table T1 uses; At S152, from the buffer zone B3 that shows usefulness, take out an event data; At S153, judge whether to have taken out data, if take out, just enter S154; At S154, the 1st word of " name " in the data of being taken out is transformed to Pinyin code, then at S155, the number of packages corresponding to the hurdle of the Pinyin code of the 1st literal in the sorted table is added after 1, turn back to S152 again.
Carry out the processing from S152 to S155 repeatedly, and taking-up is stored in the data that show with in the buffer zone B3 in order, the number of packages on the hurdle of the Pinyin code of the 1st literal of corresponding its " name " is added down, after this, be judged as at S153 and do not take out under the data conditions, just finish to take out data, enter S156 then from showing with buffer zone B3, after the indicator initialization of the data that should from show, take out indication, enter S132 with buffer zone B3.
Below, the treatment step of the sorted table T2 (with reference to Figure 40) of the S132 in the construction drawing 35 is described with the process flow diagram of Figure 42.
At S161, G is made as 10 the group number register, and the register g that sets of numbers is used is set at 1, and the register a that the phonetic number is used is decided to be 1, and the register s that the accumulative total number of packages is used is set at 0.Organizing several 10 is set to such an extent that be suitable for the size of the capacity of display of display device 27.
At S162, obtain total number of packages Y that retrieval shows the data of being stored in the buffer zone B3 of usefulness; At S163, the Pinyin code of the sets of numbers shown in the register g that the sets of numbers of sorted table T2 is used is set at Pinyin code with the value shown in the register a.
At S164, whether the value that judgement accumulative total number of packages is used register s little situation under, enters S165 greater than (Y/G) * g, and the number of packages of representing the Pinyin code that is worth of the register a that the Pinyin code of sorted table T1 is used is added in the accumulative total number of packages with on the register s, enters S166 then.And, Pinyin code register a is being added after 1, be judged to be the processing that repeats S164~S166 before being equal to or greater than at S164.
At S164, accumulative total is judged as when being equal to or greater than (Y/G) * g with the value of register s, enters step S167, and obtains the data number of packages of sets of numbers of value of the sets of numbers usefulness register g of sorted table T2.As follows in this used computing formula: S → T22 (g) (g=1)
S - Σ n - 1 T 22 ( n ) → T 22 ( g ) . . . ( g ≠ 1 )
At S168, sets of numbers adds after 1 with the value of register g, at S169, judge that whether sets of numbers becomes with the value of register g and organize number and equate, if unequal, just turns back to S163, the processing that repeats S163~S169 if equate, just enters S170 till equating.
At S170, use the computing formula identical to obtain after the data number of packages of sets of numbers of sorted table T2 with the sets of numbers of the value of register g with above-mentioned S167, enter the S133 of Figure 35.
As mentioned above, in the Chinese data processor of present embodiment, imported at input media 21 under the situation of retrieval key, central processing unit 35 for the search terms of the correspondence of a plurality of groups Chinese character word sequence of data storage device 22 stored with reference to conversion that kanji code-Pinyin code map table 50 carries out to Pinyin code, be included under the situation in this information with the identical data of retrieval key, with the group is that unit obtains corresponding Chinese character word sequence, obtaining then has several groups altogether, this is counted as dividend, predefined group of number is that divisor carries out division arithmetic with the capacity of display size that is suitable for display device 27, calculates the average group number of each group.With this average group number is that from the beginning benchmark is cut apart down the group of resulting word sequence in the retrieval by pinyin order, and, after the grouping,, which kind of goes with display device 27 outputs the information of dividing into groups being cut apart with.And the many groups word sequence that belongs to the group of being selected by the operator is outputed to display device 27.Resulting according to this output also can be presented in each most group each group.
In grouping, as required, the group number than the much smaller situation of 26 (alphabetical A is to the numbers of words of Z) under, the letter of the 1st word of the word sequence of each group is (when the 1st word is Chinese character, earlier this Chinese character being transformed to Pinyin code, adopting the 1st word that is transformed to the resulting letter of Pinyin code again) identical group also can cross over plural group and do not cut apart; Equally, as required, the letter of the 1st word that is stored in the word sequence of each group in the memory storage concentrates under the situation of specific beginning literal, also can make the 1st word group identical with the 2nd word cross over plural group, and not cut apart.
[embodiment 4]
According to the employed figure of the explanation of previous embodiment and Figure 43 to Figure 46 other embodiment of the present invention are described as follows, for convenience of explanation, have the identical symbol of part mark of identical function with the part shown in the previous embodiment, its explanation is omitted.
According to original Chinese data processor, when execution is replaced function side by side, for the order by kanji code is replaced, for example under the situation of GB code, the Chinese character word sequence that belongs to one-level code will be arranged by pinyin order, but the Chinese character word sequence that belongs to secondary code will be by the series arrangement of the radicals by which characters are arranged in traditional Chinese dictionaries back at the Chinese character word sequence of one-level code, therefore, arrange when replacing in the set of the Chinese character of the Chinese character of first-level Chinese characters code and secondary code being mended the word sequence mix, just can not obtain by the result of the series arrangement replacement of Pinyin code completely.
Yet, prepare to specify according to the Chinese data processor of present embodiment and arrange when replacing, in case be the Pinyin code sequence, just can obtain arranging the result of replacement to the data conversion that should change arrangement by pinyin order completely.
As shown in figure 43, the Chinese data processor according to present embodiment is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation memory storage 25, shows with Chinese characters font ROM26, display device 27 and central processing unit 35.Wherein, input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation are identical with Chinese character style ROM26, display device 27 with the Chinese data processor of embodiment 1 with memory storage 25, demonstration.
When carrying out data ordering replacement processing by input media 21 indications, as described later, central processing unit 35 just carries out the arrangement replacement of data to the information of data memory storage 22, that is to say that data ordering alternative of the present invention constitutes this central processing unit 35.
At this, in data storage device 22, four data 200a~200d shown in Figure 44 (a) as metadata 200, sequential storage according to 200a, 200b, 200c, 200d is got up, 200a~200d management number and word sequence data formation group separately stored, and these management do not repeat with number.
In the Chinese data processor of above-mentioned formation, central processing unit 35 is replaced the indication of handling, the following a series of control of beginning according to urging input media 21 implementation data to arrange.
At first, kanji code Pinyin code map table 50 (with reference to Fig. 7) with kanji code-Pinyin code map table storer 23 all is transformed to the Pinyin code sequence to each word of the content of metadata 200, wherein, obtain in a Chinese character under the situation of a plurality of Pinyin code candidate codes, what adopt usually is the 1st candidate code.
Represented the phonetic mark of each Chinese character of composition data 200a~200d and the rank in the kanji code among Figure 45, the figure acceptance of the bid by 1. be the Chinese character that belongs to one-level code, indicate 2. be the Chinese character that belongs to secondary code.
Then, each data 200a~200d of these Pinyin code sequences is lined up in an orderly manner by the Pinyin code ascending, then, metadata by with arrange the identical series arrangement of Pinyin code sequence replaced, thereby obtain the arrangement replacement data 201 shown in Figure 44 (b).At last, with showing this arrangement replacement data 201 is pressed Chinese character conversion font, and be presented on the display device with Chinese characters font ROM26.At this, also can replace the result to data ordering and be stored in the data storage device 22 with the form of data file.
Therefore, as original, when metadata 200 is arranged replacement by the order of kanji code separately, the data 200 that arrangement replacement data 200 such comprising shown in Figure 46 belong to the kanji code of secondary code just have been arranged on the back of the data that are made of the first-level Chinese characters code, arrangement replacement data 201 shown in Figure 44 (b) then can be replaced by pinyin order arrangement completely.
[embodiment 5]
Drawing and Figure 47 to Figure 49 used in the explanation according to previous embodiment are described as follows other embodiment of the present invention.For the purpose of illustrative ease, for having the same symbol of parts marks of said function, and omit its explanation with the parts shown in the aforesaid embodiment.
Original Chinese data processor in, be to be that benchmark is retrieved with the kanji code,, in Chinese character, although the Chinese-character writing (capitalization) of variant Chinese character, character in popular form, simplified/complex form of Chinese characters, numeral etc., implication is identical with usage, but use the situation of different literals also to happen occasionally, therefore, Chinese written language H for example shown in Figure 48 and Chinese written language J are although its phonetic all is " sanqianyuan ", and, implication is identical, but because corresponding Chinese character code difference just can not be retrieved both simultaneously.
Yet the Chinese data processor of present embodiment is when carrying out retrieval process, and the Chinese character of identical phonetic mark is even Chinese character word sequence difference also can be handled by primary retrieval and retrieve out.
As shown in figure 47, the Chinese data processor according to present embodiment is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation memory storage 25, shows with Chinese characters font ROM26, display device 27, letter-Pinyin code map table storer 28 and central processing unit 37.Wherein, input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation are identical with Chinese character style ROM26, display device 27, letter-Pinyin code map table storer 28 with the Chinese data processor of embodiment 1 with memory storage 25, demonstration.
When retrieval process is carried out in above-mentioned input media 21 indications, as described later, 22 canned datas of 37 pairs of data memory storages of central processing unit are retrieved, and that is to say, this central processing unit 37 is exactly the indexing unit of the present invention that is made of with memory storage operation.Owing among the embodiment 1 retrieval process is described in detail, has just omitted detailed description thereof here.
The same with aforesaid embodiment 1, operation is to handle retrieval with memory storage 25, show, the temporary transient memory storage that uses during each function of input, adopt semiconductor memory, use in the memory storage 25 in this operation, be provided with the staging area B1 (with reference to Figure 10) that the memory scan key is used, the demonstration that temporary result for retrieval is used data buffer area B3 (with reference to Figure 13), point to the indicator of data storage device 22, be provided with independent variable buffer zone and rreturn value buffer zone in addition, simultaneously, also be provided with the operating area B5 that kanji code sequence-Pinyin code sequence transformation shown in Figure 47 is used, the operating area B2 (with reference to Figure 11) that is used for replacing the kanji code sequence-Pinyin code sequence of previous embodiment 1 to use.
In addition, the Chinese written language sequence K and the Chinese written language sequence J that include Figure 48 here in the metadata of being stored in the tentation data memory storage 22.
In above-mentioned Chinese data processor, central processing unit 37 begins following a series of control according to the indication of implementing retrieval process from the urgency of input media 21.
At first, the message of urging the operator to import the retrieval key is presented on the display device 27; Wait for that always the operator is made of the phonetic of retrieval coding with the letter input input media, if import, just this alphabetical sequence is written to and is used for the retrieval key letter of staging area B1 (with reference to Figure 10) of memory scan key with in the buffer zone 101; After this, from the beginning retrieval key letter is transformed to Pinyin code seriatim with the data in the buffer zone 101, and is stored in retrieval key phonetic with in the buffer zone 102 with letter-phonetic map table 50.
Next, from the beginning read the data of a storage in data storage device 22, and write in the metadata buffer zone 121 shown in Figure 49, after this, with kanji code-Pinyin code map table 50 each word of Chinese character word sequence (kanji code sequence) that is write in the metadata buffer zone 121 all is transformed to Pinyin code, and is written to the 1st candidate buffer zone 122~the 4th candidate buffer zone 125.
Then, whether the retrieval key phonetic of inspection Figure 10 is included in the 1st candidate buffer zone 122~the 4th candidate buffer zone 125 with the pinyin sequence of buffer zone 102 stored, if be included in wherein, just the data in the metadata buffer zone 103 are transformed to Chinese character style with showing with Chinese characters font ROM26, and are presented on the display device 27.Last event data of the data of being stored in data storage device 22 all repeats this treatment step.At this moment, if on display device, do not show, also can remain on the form of data file in the data storage device.
Like this, in the present embodiment, indicated under the situation of carrying out retrieval process, central processing unit just is stored in operation to the retrieval key from input media 21 inputs with Pinyin code and uses in the buffer zone 102 with the retrieval key phonetic of the staging area B1 the memory storage 25, on the other hand, with institute's canned data in the data storage device 22 and kanji code-Pinyin code map table 50 are transformed to Pinyin code to kanji code, and whether the data of retrieving this Pinyin code are included in the retrieval key phonetic of staging area B1 with in the buffer zone 102.
Therefore, Chinese written language sequence K and the Chinese written language preface (with reference to Figure 48) in the metadata of retrieve data memory storage 22 stored simultaneously, thus can shorten the needed time of retrieval process.
And, in the present embodiment, be to import the retrieval key with input media 21, but also can set with other treating apparatus by the operator.
[embodiment 6]
Drawing and the Figure 28 to Figure 33 used according to the explanation that is used for previous embodiment are described as follows other embodiment of the present invention.For the purpose of illustrative ease, for having the same symbol of parts marks of said function, and omit its explanation with the parts shown in the aforesaid embodiment.
According to original pen type Chinese data processor, the operator must know and wants correctly joining together of the word sequence imported, but, in Chinese characters, identical or similar, the implication usage of pronouncing also similarly situation be of common occurrence, therefore, can not find out at once under its situation of joining together, just must go to consult the dictionary to confirm the operator.
Yet, the Chinese data processor of present embodiment has utilized often also similar characteristic of its pronunciation of the similar Chinese character of mark in the Chinese, even for example operator's knowledge is ambiguous, the four tones of standard Chinese pronunciation also are entirely ignorant of, this device temporarily is transformed to Pinyin code to the word sequence of input earlier, again it being carried out conversion from phonetic to the Chinese character word, is that unit carries out conversion by correct joining together with word or sentence then, thereby can the input characters sequence.
As shown in figure 50, the Chinese data processor according to present embodiment is provided with input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation memory storage 25, shows with Chinese characters font R0M26, display device 27, Pinyin code-kanji code sequence transformation character library 39 and central processing unit 38.Wherein, input media 21, data storage device 22, kanji code-Pinyin code map table storer 23, operation are identical with Chinese character style ROM26, display device 27 with the Chinese data processor of embodiment 1 with memory storage 25, demonstration.
Import under the situation that contains ambiguous word with above-mentioned input media 21, central processing unit 38 just carry out control one-level that picture shows from the Chinese character to the Pinyin code and also the control of conversion again from the Pinyin code sequence to the Chinese character word sequence, and implement the word sequence that contains this word become and be the correct processing of joining together.That is to say that this central processing unit carries out the processing of ambiguous word sequence converting means of the present invention.
Input method as wherein Chinese character word sequence, can consider to adopt handwriting input with electronic pen, by the input of radicals by which characters are arranged in traditional Chinese dictionaries and stroke, by input of phonetic or the like, in the present embodiment, as above-mentioned input media 21, be by the operator with electronic pen Chinese-character writing on display device 27, again under the control of central processing unit 38, discern its person's handwriting and export the code of Chinese character.In addition, shown in Figure 51, in this input media, first perception contacts identification key 55a and the transfer key 55b on the picture 55 that is presented at display device 27 with electronic pen, then signal is delivered to central processing unit 38.
The memory storage of temporary transient usefulness when operation is conversion process with memory storage 25, what adopted is semiconductor memory.The operating area of form shown in Figure 49 is stored in wherein really.
Disposition when the input of the central processing unit 38 in the above-mentioned Chinese data processor is described below.
At first, central processing unit 38 is presented at the picture shown in Figure 51 (a) on the display device 27, and wherein 55c is the zone that is used for showing transformation results, and 55d is the zone of going into Chinese character with electronic notebook.And, aforesaid 55a starts the identification key that the Chinese character of being charged to is transformed to the processing of kanji code, 55b is a transfer key, being used for starting the kanji code sequence transformation that is shown that the regional 55c shown in Figure 51 (b) is gone up as recognition result 55e is the Pinyin code sequence, be that unit transformation is a Chinese character with word or sentence more then, and this word be modified to the function of regular Chinese character.
The operator remembers that with electronic pen the people wants the Chinese character of importing on regional 55d, the implication " Egyptian (ai ji) " of input " エ ジ プ ト " is wanted in supposition here, the operator is not owing to remember its correct joining together, at first, Chinese character with the identical phonetic known to the pen handle is write into as this word, Figure 51 (a) writes to finish, and has charged to " suffering (ai) " with “ Very (ji) " state.
Shown in Figure 51 (b), when the operator uses electronics style of writing and regional 55a, under the control of central processing unit 38, at first finish the identification of the Chinese character of being charged to, recognition result 55e is presented on the regional 55c, the underscore of recognition result 55e represents that it is the recognition result of the literal charged to electronic pen again.
Shown in Figure 51 (c), when the operator uses electronics style of writing and regional 55b, central processing unit is at first made carbon copies the word sequence of underscore part in the metadata buffer zone 121 of operating area B5 shown in Figure 49, be the data conversion that is written in the metadata buffer zone 121 the Pinyin code sequence with kanji code-Pinyin code map table 50 then, more resulting Pinyin code sequence be stored in the 1st candidate buffer zone 122~the 4th candidate buffer zone 125.
In addition, in the present embodiment, central processing unit 37 is used as conversion candidate selecting arrangement, for avoiding increasing the candidate code number after the conversion of Chinese character word sequence, only using the 1st candidate buffer zone 122 wherein, is the 1st Pinyin code preface of waiting buffer zone 122 that unit transformation is the Chinese character word sequence with word or sentence, again in the Chinese character word sequence, obtain thus under the situation of a plurality of Chinese character word sequences, only adopt the 1st candidate code.
Because Pinyin code-kanji code sequence transformation dictionary 39 comes scrambling transformation candidate code by the high order of usage frequency, so, the 1st candidate code can just be obtained without the processing of complexity.The Chinese character word sequence that obtains like this is presented on the picture 55 shown in Figure 51 (c), this has just obtained the Chinese character of correctly joining together from the word of target again.Here, also can be stored in the form of the Chinese character of correctly joining together in the unshowned data storage device by data file.
Like this, when word sequence is imported, even for example operator's knowledge is ambiguous, the four tones of standard Chinese pronunciation also are entirely ignorant of, this device temporarily is transformed to Pinyin code to the word sequence of input earlier, again it is carried out conversion from phonetic to the Chinese character word, is that unit carries out conversion by correct joining together with word or sentence then, that is to say that the operator needn't consult correct joining together with dictionary one by one as original, thereby can come the input characters sequence by enough suitable target words.
In the present embodiment, each step all is only to adopt the 1st candidate code according to usage frequency as the candidate code, and constitute the conversion selecting arrangement by central processing unit 37, have under a plurality of situations at resulting Chinese character word sequence, also can be for conversion absent Chinese character sequence, they are presented on the picture, urge the operator to select one, again the word sequence of the selected candidate of correctly joining together is presented on the picture of display device 27.
Concrete embodiment of being done in detailed description of the invention or embodiment have explained record content of the present invention, but the present invention is not limited to this specific embodiment, in the scope that does not deviate from design of the present invention and the following claim of putting down in writing, can implement various variations.

Claims (13)

1. a Chinese data processor is provided with the input media of input retrieval key and stores the data storage device of the data of kanji code form, it is characterized in that comprising:
Have the kanji code-Pinyin code map table of Pinyin code, and the data conversion of the kanji code form that is stored in described data storage device is arrived the Chinese character-phonetic converting means of Pinyin code form with this map table corresponding to the Chinese characters code;
To press the apparatus for temporary storage of Pinyin code form storage by the retrieval key of above-mentioned input media input;
From being transformed to by above-mentioned Chinese character-Pinyin code converting means among the data of Pinyin code form, extract the indexing unit of the code consistent with the retrieval key of described apparatus for temporary storage stored; With
The display device that shows result for retrieval.
2. according to the Chinese data processor of claim 1, it is characterized in that, described indexing unit is provided with sorter, this sorter calculates the average data number of packages of each group according to predetermined group number, again according to the average data number of packages of being calculated the data of data storage device stored by group categories.
3. according to the Chinese data processor of claim 2, it is characterized in that, the input of described sorter takes out from data storage device, and be transformed to the data of Pinyin code form with Chinese character-phonetic converting means, according to above-mentioned average data number of packages from the beginning these data is divided into group by the order of phonetic again.
4. according to the Chinese data processor of claim 2, it is characterized in that, also be provided with the picture of the operator being pointed out relevant information of how to divide into groups.
5. according to the Chinese data processor of claim 1, it is characterized in that, having under the situation of the corresponding many Pinyin codes correspondence of at least one Pinyin code for a Chinese character, described kanji code-Pinyin code map table is additional priority ranking on these a plurality of Pinyin codes.
6. according to the Chinese data processor of claim 1, it is characterized in that described data storage device is when being a plurality of projects and storage to data qualification, the operator is with the project of described input media appointment as searching object; Described indexing unit is being retrieved in the data of being stored in the project by operator's appointment as searching object.
7. according to the Chinese data processor of claim 1, it is characterized in that also being provided with:
Is the data conversion of the Pinyin code mark by above-mentioned Chinese character-phonetic converting means conversion the pinyin sequence-alphabetical sequence converting means of alphabetic literal mark;
Above-mentioned apparatus for temporary storage storage replaces the retrieval key of above-mentioned Pinyin code form by the retrieval key of the alphabetic literal mark of above-mentioned input media input;
Above-mentioned indexing unit, replace being transformed to the data of above-mentioned Pinyin code form, from being transformed to by above-mentioned Chinese character-phonetic converting means and above-mentioned pinyin sequence-alphabetical sequence converting means among the data of alphabetic literal, extract the code consistent with the retrieval key of described apparatus for temporary storage stored.
8. Chinese data processor, the data storage device with data of the input media of the indication input that will impel the execution that scrambling transformation handles and storage kanji code form is characterized in that comprising:
Has the kanji code-Pinyin code map table that Pinyin code is corresponded respectively to the Chinese characters code, and according to the indication that impels the execution that scrambling transformation handles from described input media, use this map table, the Chinese character-phonetic converting means of the data conversion of the kanji code form that is stored in described data storage device to the Pinyin code form;
Accept input and be transformed to the data of Pinyin code form with described Chinese character-phonetic map table, and rearrange these data by the order of phonetic, and, change the data ordering alternative that is arranged as the order identical with the data of the Pinyin code form that has rearranged with the data of the kanji code form of described data storage device; With
The result's that display change is arranged display device.
9. Chinese data processor with the input media that is used to import Chinese character string is characterized in that described treating apparatus is provided with:
The apparatus for temporary storage of pressing the form storage of kanji code by the literal of described input media input;
Have the kanji code-Pinyin code map table that Pinyin code is corresponded respectively to the Chinese characters code, and literal Chinese character-phonetic arranged side by side that the word sequence that is stored in the kanji code form in the described apparatus for temporary storage is transformed to the Pinyin code form is become converting means with this map table;
Import described Chinese character-phonetic converting means conversion the Pinyin code sequence, this Pinyin code sequence transformation for being the pinyin sequence-Chinese character sequence transformation device of the Chinese character word sequence of unit with word or word sequence; With
Demonstration by described pinyin sequence-Chinese character sequence device institute conversion the display device of Chinese character sequence.
10. according to the Chinese data processor of claim 9, it is characterized in that, also be provided with described pinyin sequence-select when Chinese character sequence transformation device the obtains multiclass Chinese character word sequence conversion candidate selecting arrangement of one of them.
11. the Chinese data processor according to claim 10 is characterized in that, also is provided with the character library with reference to described pinyin sequence-Chinese character sequence transformation device; Simultaneously, under the situation of the corresponding a plurality of Chinese character word sequences of a pinyin sequence, described character library is additional priority ranking between these a plurality of Chinese character word sequences.
12. the Chinese data processor according to claim 9 is characterized in that, described input media is provided with the cause operator and carries out the person's handwriting of the electronic pen of handwriting input and identifying operation person handwriting input and export the recognition device of kanji code.
13. the Chinese data processor according to claim 9 is characterized in that, described input media is provided with by the operator and imports radicals by which characters are arranged in traditional Chinese dictionaries and stroke and according to the Chinese input unit of radicals by which characters are arranged in traditional Chinese dictionaries of being imported and stroke output kanji code.
CNB961059796A 1995-04-20 1996-03-22 Chinese data processor Expired - Fee Related CN1143231C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP95699/95 1995-04-20
JP09569995A JP3266755B2 (en) 1995-04-20 1995-04-20 Chinese information processing device
JP95699/1995 1995-04-20

Publications (2)

Publication Number Publication Date
CN1140858A CN1140858A (en) 1997-01-22
CN1143231C true CN1143231C (en) 2004-03-24

Family

ID=14144756

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB961059796A Expired - Fee Related CN1143231C (en) 1995-04-20 1996-03-22 Chinese data processor

Country Status (2)

Country Link
JP (1) JP3266755B2 (en)
CN (1) CN1143231C (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7260780B2 (en) * 2005-01-03 2007-08-21 Microsoft Corporation Method and apparatus for providing foreign language text display when encoding is not available
CN101246478B (en) * 2007-02-14 2010-08-25 高德软件有限公司 Information storage and retrieval method
CN117875267B (en) * 2024-03-11 2024-05-24 江西曼荼罗软件有限公司 Method and system for converting Chinese characters into pinyin

Also Published As

Publication number Publication date
CN1140858A (en) 1997-01-22
JPH08292941A (en) 1996-11-05
JP3266755B2 (en) 2002-03-18

Similar Documents

Publication Publication Date Title
CN1194319C (en) Method for retrieving, listing and sorting table-formatted data, and recording medium recorded retrieving, listing or sorting program
CN1171162C (en) Apparatus and method for retrieving charater string based on classification of character
CN1215433C (en) Online character identifying device, method and program and computer readable recording media
CN1120442C (en) File picture processing apparatus and method therefor
CN1174332C (en) Method and device for converting expressing mode
CN1158627C (en) Method and apparatus for character recognition
CN1109994C (en) Document processor and recording medium
CN1040276A (en) Simplified and complex character root Chinese character entering technique and keyboard thereof
CN1281191A (en) Information retrieval method and information retrieval device
CN1132564A (en) Method and appts. for data storage and retrieval
CN1014845B (en) Technique for creating and expanding element marks in a structured document
CN1137320A (en) Semantic object modeling system for creating relational database schemas
CN101034414A (en) Information processing device, method, and program
CN1032251A (en) Computer editing and composing system and method thereof
CN1225550A (en) Data processing device, data display system, data display method, and storage medium
CN1677399A (en) Hierarchical database management system, hierarchical database management method, and hierarchical database management program
CN1144004A (en) Data base system shared by plurality of client apparatuses, data updating method and application to character processor
CN1151558A (en) Information searching method and system
CN1300718C (en) Information display device and information display processing program
CN1143231C (en) Chinese data processor
CN1120438C (en) File information storing and searching device and its program recording medium
CN1449531A (en) Data compiling method
CN1261862C (en) Input prediction processing method, device and program and the program recording medium
CN1690956A (en) Programme projecting device and method
CN1754170A (en) Product design support system, product design support method, and program

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040324

Termination date: 20140322