CN1828494B - Chinese character input method for computer - Google Patents

Info

Publication number
CN1828494B
Authority
CN
China
Prior art keywords
parts
stroke
word
character
chinese
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2006100107837A
Other languages
Chinese (zh)
Other versions
CN1828494A (en)
Inventor
徐祖华
徐东岿
徐东明
徐东蓉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN2006100107837A
Publication of CN1828494A
Application granted
Publication of CN1828494B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention relates to the Yitong coded Chinese character input method for computers, characterized in that: (1) each Chinese character is encoded with an equal-length code built from all the shape components into which the character itself decomposes; (2) the components are selected according to the national stroke-order standard; (3) Chinese character coding research software is used to select the most suitable character-forming stroke structures as components; (4) the key positions of most components are determined by two specified pieces of information; (5) the key position of the identification code is determined by stroke information of the last component. The method is easy to learn and to type with, and produces few duplicate codes.

Description

Yitong coded Chinese character input method for computer character entry
Technical field
The present invention relates to the computer processing of Chinese-language information, and in particular to a coded Chinese character input method that combines computer character entry closely with knowledge of standard character writing.
Background art
The Wubi (five-stroke) character-form input method was the first way of entering Chinese characters into a computer quickly from an ordinary computer keyboard. In the more than 20 years since Wubi appeared, nearly a thousand shape-code Chinese character input methods have been developed in China. Wubi's coded character entry is hard to combine with literacy teaching, is hard to learn and easy to forget, and is difficult for ordinary people to master. The Erbi input method substantially reformed the Wubi coding scheme. It was the first coded Chinese character input method to pass evaluation by the Ministry of Education after Wubi appeared, and so far it remains the only one to have passed that evaluation and to be usable in primary and secondary school teaching. Four years of promotion in practice, however, clearly show that the Erbi coding scheme is also far from ideal.
The kinds and numbers of books on coded Chinese character input methods sold in bookstores make it plain that, among all the shape-code and sound-shape-code input methods promoted so far, Wubi still has the highest market share. Because of Wubi's great success in practical use, many developers have, besides producing a variety of typing tutors, also developed distinctive Wubi input software on the basis of Wubi. For example, the book "Everyone Learns Wubi", written by Chen Maosheng, Wang Tao, and another author, published by the University of Electronic Science and Technology press and on sale during the summer vacation of 2005, comes with a CD. Besides the "four most popular and most effective Wubi typing tutors" ("Easy Wubi Practice", "Wubi Typing Master", "Typing Pioneer", and "Wubi Quick Typing"), the CD also provides the four "most popular input method programs" ("Nianqing Wubi", "Wanneng Wubi", "Wubi Jiajia", and "Zhineng Wubi") and new material such as "Future Wubi Quick Lookup".
Chapter 2 of "Everyone Learns Wubi", "The development history of Wubi", shows that Wubi has so far passed through roughly three stages: the "Wangma Wubi 86 edition", the "Wangma Wubi 98 edition", and the "WB-18030 edition". According to that chapter, neither the 98 edition nor the WB-18030 edition changed the two basic practices adopted in the 86 edition: characters are encoded with only some of their shape components, and the key positions of the coding components must all be memorized by rote. It can be shown that these two basic practices are the fundamental root of the 86 edition's shortcomings, namely that its coded character entry is hard to combine well with literacy teaching and is hard to learn and easy to forget. The 98 edition and the WB-18030 edition therefore share the same shortcomings.
The coding scheme of "A Chinese character coding input method in which the coding input components are consistent with the shape components" (patent No. ZL97102717X) was obtained on the basis of a proof that "a necessary condition for a shape code to be combinable with literacy teaching is that, apart from the identification code, the input components of every character are identical to its shape components". Before the scheme was drawn up, the root causes of Wubi's shortcomings (coded entry that is hard to combine with literacy teaching, hard to learn, easy to forget, and difficult for ordinary people to master) could therefore be identified accurately. The scheme conforms to the actual structure of Chinese characters, and in soundness and rationality it is superior to promoted methods such as Wubi, which encode characters with only some of their shape components and require rote memorization of component key positions.
When this input method is studied by computer, the coding component library is a natural information base of character component structures, few technical difficulties arise, and the library and program files required are simple. In cooperation with lecturer Chen Chunjiang, a computer teacher at our school, a fairly full-featured "Chinese character coding development software" was developed according to the practical needs of the work required to obtain a high-quality shape-code input method. With it, all technical problems met in researching the shape-code and sound-shape-code input methods obtainable from this coding scheme can be solved quickly and accurately by computer. The main coding practices of the method are tied closely to actual character structure and actual usage, and the correctness of each main practice and the truthfulness of each statistic reflecting coding quality can be checked quickly and accurately with functions provided by the software. Research on coded input methods that can be combined with literacy teaching thus becomes low-cost, efficient, and of high quality, and the long-standing difficulties of shape-code research and of coding quality evaluation become markedly simpler.
Besides making it possible to develop a shape-code input method that overcomes the above shortcomings of Wubi while keeping Wubi's advantages of few duplicate codes, short code length, and fast character entry, the great convenience of researching coded input methods with this software also led the inventors to be the first to conceive a way of developing a Chinese computer dictionary that replaces radicals with components and applies fuzzy search to the retrieval of character and word information.
The "Chinese character coding development software" was exhibited at the Beijing International Exhibition of Inventions in September 1996, where it won a silver medal, and it received a computer software copyright registration certificate (registration No. 980126) on March 30, 1998. "A Chinese character coding input method in which the coding input components are consistent with the shape components" (patent No. ZL97102717.X) won a silver medal at the 13th National Exhibition of Inventions in September 2001. "A multifunctional Chinese computer dictionary", compiled on the basis of DOS operating system functions, which replaces radicals with components and applies fuzzy search to the retrieval of character and word information, was granted an invention patent certificate (patent No. ZL991151283) on December 4, 2002. The patent application for "A Yitong Chinese computer dictionary with lookup of character component structure information", compiled on the basis of Windows operating system functions, was published by the Intellectual Property Publishing House on March 23, 2005 (application No. 2004100407194).
When the patent application for "A Chinese character coding input method in which the coding input components are consistent with the shape components" was drafted, the "Modern Chinese general character stroke-order standard" had not yet been issued by the standardization committee of the State Language Commission. As a result, the way of determining the key position of the identification code, the way of adding identification codes, and the number of positioning components were not fully thought through. The stroke-order standard then in use did not always provide stroke information from which the identification code could be determined accurately, and somewhat too many positioning components were used, so that coded character entry could not be combined with literacy teaching as effectively as it should be. In the discussion below, the coded Chinese character input method of the present invention is abbreviated as the "Yitong input method".
Summary of the invention:
The object of the invention is to provide a shape-code Chinese character input method that combines well with knowledge of standard character writing, has few duplicate codes and a short code length, and allows fast character entry.
On the basis of a careful analysis of the strengths and weaknesses of Wubi and of their root causes, the invention is the first to arrive at a coding scheme that both encodes each character directly with all the shape components into which the character itself decomposes and encodes the large number of general components by the stroke information of each component. The resulting shape-code input method combines well with knowledge of standard character writing, has few duplicate codes and a short code length, and allows fast entry. The invention also provides a way of selecting components, closely tied to the structure of character forms, that makes this coding scheme realizable.
The invention is also the first to propose that, in computer entry of characters and words, word entry likewise use the short code where one exists and the full code where none exists. The word coding rules are obtained by revising the Wubi word coding rules in light of the characteristics of Chinese words and of the fact that the candidate window provided for input under the Windows operating system shows 10 characters or words at a time. Besides making the coding of two-character words and of words of five or more characters more rational, the word coding method enlarges the word code space, leaving users room to add a large number of self-defined words. The rule for assigning word short codes is based on the need for users to pick the desired word easily from the candidate window.
Users are provided with a character-entry tutor program developed according to the stroke-order standard formulated by the State Language Commission and the decomposition-based coding scheme of the Yitong input method. It contains detailed help prompts covering "character, short code, pinyin, frequency, coding components, component stroke order, component status, key positions, keystrokes, character stroke order". Guided by these prompts, users can master both the entry technique and knowledge of standard character writing more smoothly, so that the difficulty of learning coded entry and the difficulty of learning standard character writing are both reduced.
The invention is also the first to propose a way of developing a Chinese character coding resource lookup tool. Given any one item of a combined query made up of stroke information, component information, and pinyin information for a character in the operating system's character set, the tool finds all target characters in the character set that satisfy the query, together with the character-set code of each target character, its entry codes in the relevant coded input methods, and similar information. Users can thus enter any character in the operating system's character set more conveniently and quickly. When users create a character that is absent from the character set with the character creation tool provided by the operating system, they can also find the reference characters they need more quickly. The Chinese character input problem is therefore solved more completely than in the prior art.
The specific technical scheme of the invention is:
1. Every character is encoded with an equal-length code directly from all the shape components into which the character itself decomposes. Each shape component occupies exactly one key position. Where a character has too few shape components, the code is padded with identification codes, each of which also occupies exactly one key position.
2. Among all shape components that may take part in coding, two kinds cannot be selected strictly according to the stroke-order standard formulated by the State Language Commission: the fully enclosing components split out of fully enclosed characters and of characters containing a fully enclosing character-forming stroke structure, and the special semi-enclosing components split out of special semi-enclosed characters and of characters containing a special semi-enclosing character-forming stroke structure. All other shape components taking part in coding are selected in sequence according to that standard.
3. Under the general premise that a character-forming stroke structure containing crossing strokes is never split, the shape components taking part in coding are chosen, with fairly full-featured coded-input-method research software, as character-forming stroke structures that match the actual structure of characters, have a high or relatively high character-forming frequency, and disperse codes effectively.
4. Among all shape components, ten are assigned designated key positions. These ten are the positioning components, and the rest are general components. The key position of a general component is determined by the component's first-stroke shape and its stroke count.
5. The key position of an identification code is determined by specified information about the strokes of the last shape component.
When the key positions of general components are determined by first-stroke shape and stroke count, and 30 keys are used to encode the 6763 characters of GB2312-80 with four codes per character, each character is split into at most four components. A single-component character is coded with its own component code plus three identification codes. A two-component character is coded with its two component codes in order plus two identification codes. A three-component character is coded with its three component codes in order plus one identification code. A four-component character is coded with its four component codes in order. The component order of two-, three-, and four-component characters always follows the order in which the components are obtained by decomposition. Where identification codes must be added, they are always placed after the last component code. The three cases in which identification codes are added are as follows.
Case 1: When a single-component character is a one-stroke component, all three identification codes take the stroke shape and stroke number of that one-stroke component. When it is a two-stroke component, all three identification codes take the shape and stroke number of the second stroke of the character. When it is a three-stroke component, the first identification code takes the shape and stroke number of the second stroke, and the second and third identification codes both take the shape and stroke number of the third stroke. When it is a component of four or more strokes, the first, second, and third identification codes take the shape and stroke number of the second, third, and fourth strokes respectively.
Case 2: When the second component of a two-component character is a one-stroke component, both identification codes take the stroke shape and stroke number of that one-stroke component. When the second component is a two-stroke component, both identification codes take the shape and stroke number of its second stroke. When the second component has three or more strokes, the first identification code takes the shape and stroke number of its second stroke and the second identification code takes the shape and stroke number of its third stroke.
Case 3: When the third component of a three-component character is a one-stroke component, the identification code takes the stroke shape and stroke number of that one-stroke component. When the third component has two or more strokes, the identification code takes the shape and stroke number of its second stroke.
The 30 key positions used are the 26 English letter keys plus the semicolon, comma, period, and brace keys. The 15 keys of the left-hand zone and the 15 keys of the right-hand zone each form three rows and five columns. Counting from the boundary between the two zones, the column codes of the five columns in the left-hand zone are 1, 2, 3, 4, 5 from right to left, and the column codes of the five columns in the right-hand zone are 1, 2, 3, 4, 5 from left to right. Counting from the row next to the number keys, the row codes of the three rows in the left-hand zone are 3, 1, 5, and the row codes of the three rows in the right-hand zone are 4, 2, 6. A key with row code i and column code j has key code ij.
The stroke-order standard formulated by the State Language Commission assigns the following shape codes to the five basic strokes: horizontal 1, vertical 2, left-falling 3, dot 4, turning 5. In both zones, the five columns with column codes 1, 2, 3, 4, 5 serve in turn as the key positions of general components whose first-stroke shape is horizontal, vertical, left-falling, dot, or turning, and of identification codes whose stroke shape is horizontal, vertical, left-falling, dot, or turning. The five rows with row codes 1, 2, 3, 4, 5 serve in turn as the key positions of general components and identification codes with a stroke number of one, two, three, four, or five. The row with row code 6 serves as the key positions of general components and identification codes with a stroke number of six or more. The key assignment table of general components and identification codes needed to encode characters with 30 keys and four codes per character is:
Table 1: Key assignment of general components and identification codes when the key positions of general components are determined by first-stroke shape and stroke count
[Figure GSB00000056150400041: key assignment table, image not reproduced]
Notes: In the table, Heng 1, Heng 2, Heng 3, Heng 4, Heng 5 denote in turn the key positions of general components and identification codes whose stroke shape is horizontal and whose stroke number is one, two, three, four, five; Heng 6 denotes the key position of general components and identification codes whose stroke shape is horizontal and whose stroke number is six or more. Shu 1 through Shu 5 denote in turn the key positions for stroke shape vertical and stroke number one through five; Shu 6 denotes the key position for stroke shape vertical and stroke number six or more. Pie 1 through Pie 5 denote in turn the key positions for stroke shape left-falling and stroke number one through five; Pie 6 denotes the key position for stroke shape left-falling and stroke number six or more. Dian 1 through Dian 5 denote in turn the key positions for stroke shape dot and stroke number one through five; Dian 6 denotes the key position for stroke shape dot and stroke number six or more. Zhe 1 through Zhe 5 denote in turn the key positions for stroke shape turning and stroke number one through five; Zhe 6 denotes the key position for stroke shape turning and stroke number six or more.
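The layout rules and the identification-code padding cases described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patent's implementation. It assumes a standard QWERTY layout, and it assumes that the thirtieth key is `/` (the key that ends the bottom letter row), which the text names differently. The function names `key_for` and `id_stroke_indices` are hypothetical. The second function only reports which stroke of the last component each identification code is taken from; it does not cover the ten positioning components, whose keys are not listed in the text.

```python
# Stroke-shape codes from the stroke-order standard cited in the text.
SHAPES = {"horizontal": 1, "vertical": 2, "left-falling": 3, "dot": 4, "turning": 5}

# Assumption: standard QWERTY rows, keyed by the row codes given in the text
# (rows are listed starting from the row next to the number keys).
# Assumption: "/" stands in for the thirtieth key.
LEFT_ROWS = {3: "QWERT", 1: "ASDFG", 5: "ZXCVB"}    # left-hand zone
RIGHT_ROWS = {4: "YUIOP", 2: "HJKL;", 6: "NM,./"}   # right-hand zone


def key_for(shape: str, stroke_number: int) -> tuple[int, str]:
    """Return (key code ij, physical key) for a stroke shape and stroke number."""
    if stroke_number < 1:
        raise ValueError("stroke_number must be at least 1")
    row = min(stroke_number, 6)       # row code 6 covers six or more strokes
    col = SHAPES[shape]               # column code equals the shape code
    if row in LEFT_ROWS:
        # Left zone: column codes count right to left from the zone boundary.
        key = LEFT_ROWS[row][5 - col]
    else:
        # Right zone: column codes count left to right from the zone boundary.
        key = RIGHT_ROWS[row][col - 1]
    return row * 10 + col, key


def id_stroke_indices(component_count: int, last_component_strokes: int) -> list[int]:
    """1-based indices of the strokes of the last component that supply
    the identification codes, following the three padding cases."""
    if not 1 <= component_count <= 4:
        raise ValueError("a character has one to four components")
    if last_component_strokes < 1:
        raise ValueError("a component has at least one stroke")
    needed = 4 - component_count      # identification codes pad the code to four
    return [min(i + 2, last_component_strokes) for i in range(needed)]
```

Under these assumptions, `key_for("horizontal", 1)` returns `(11, "G")`, and `id_stroke_indices(1, 3)` returns `[2, 3, 3]`, matching the rule that a three-stroke single-component character takes its identification codes from the second, third, and third strokes.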
Both character entry and word entry use the short code where one exists and the full code where none exists. For word entry, each word is first encoded with five codes per word, and the code value actually typed is then obtained by taking short codes.
The word coding rules are: a two-character word uses the first code of its first character followed by the complete four codes of its second character; a three-character word uses the first codes of its first two characters in order followed by the first three codes of its third character; a four-character word uses the first codes of its first three characters in order followed by the first two codes of its fourth character; a five-character word uses the first code of each character in order; a word of more than five characters uses the first codes of its first five characters in order.
The ordering rules for taking short codes are: words with different code values are ordered by code value, smaller values first; words with the same code value are ordered by number of characters, fewer characters first; words with the same number of characters are ordered by first-character frequency, higher frequency first; words with the same first character are ordered by second-character frequency, higher frequency first. The number of words assigned to each code value when taking short codes depends on whether any character has that code value and on how many characters and words with the same code value the candidate window can show at once. Short codes are taken in the following order: first-level short codes, then second-level, then third-level, and finally fourth-level short codes. The words left over after the fourth-level short codes have been taken are the five-code words.
Characters and words with the same code value are shown in the candidate window with characters first and words after them. Words are shown in the following order: among words with different numbers of characters, those with fewer characters come first; among words with the same number of characters, those with a higher first-character frequency come first; among words with the same first character, those with a higher second-character frequency come first.
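The five-code word rule and the candidate ordering can be illustrated with a short Python sketch. It is only a reading of the rules as stated, with several assumptions: each character's full code is represented as a four-symbol string, the function names `word_code` and `order_candidates` are hypothetical, and the frequencies in any input are supplied by the caller rather than taken from the patent. The ordering key compares second-character frequency whenever first-character frequencies tie, which is a simplification of "words with the same first character".

```python
def word_code(char_codes: list[str]) -> str:
    """Build the five-code word code from the four-code full codes
    of the word's characters, in order."""
    n = len(char_codes)
    if n < 2:
        raise ValueError("a word has at least two characters")
    if any(len(code) != 4 for code in char_codes):
        raise ValueError("each character needs a four-code full code")
    if n == 2:
        # first code of character 1 + all four codes of character 2
        return char_codes[0][0] + char_codes[1]
    if n == 3:
        # first codes of characters 1 and 2 + first three codes of character 3
        return char_codes[0][0] + char_codes[1][0] + char_codes[2][:3]
    if n == 4:
        # first codes of characters 1 to 3 + first two codes of character 4
        return "".join(code[0] for code in char_codes[:3]) + char_codes[3][:2]
    # five or more characters: first code of each of the first five characters
    return "".join(code[0] for code in char_codes[:5])


def order_candidates(words: list[tuple[str, int, int]]) -> list[str]:
    """Order words that share a code value.

    Each entry is (word, first-character frequency, second-character frequency).
    Fewer characters first, then higher first-character frequency,
    then higher second-character frequency.
    """
    ranked = sorted(words, key=lambda w: (len(w[0]), -w[1], -w[2]))
    return [w[0] for w in ranked]
```

With placeholder full codes, `word_code(["ABCD", "EFGH"])` yields `"AEFGH"` and `word_code(["ABCD", "EFGH", "IJKL"])` yields `"AEIJK"`, so every word code has length five regardless of word length.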
The character-entry tutor program is based on the stroke-order standard formulated by the State Language Commission and on the knowledge that must be mastered in order to enter characters with the decomposition-based coding scheme of the invention. For each character in GB2312-80 it provides, in tabular form, help prompts covering all of "character, short code, pinyin, frequency, coding components, component stroke order, component status, key positions, keystrokes, character stroke order". For characters from which no fully enclosing component and no special semi-enclosing component is split out, every component can be selected in sequence strictly according to the stroke-order standard, and the character stroke order is given in two forms: the component-style stroke order and the component-style numbered stroke order. For characters from which a fully enclosing component or a special semi-enclosing component can be split out, splitting out that component breaks the stroke-order standard. For these two kinds of characters, at the places where the standard is broken, the stroke order is given in two forms: the component-insertion stroke order and the component-insertion numbered stroke order. The stroke orders represented in these two forms are consistent, respectively, with the follow-along stroke order and the numbered stroke order in the stroke-order standard formulated by the State Language Commission.
Users are provided with a tool that looks up coding information for the characters in the operating system's character set through a combined query that joins a stroke query, a component query, and a pinyin query. The stroke query covers two items for the character sought: its stroke count and its stroke string. The component query covers two items: the component string of the character sought and the search mode for the component string. The pinyin query covers one item: the pinyin string of the character sought. A character in the stroke string may be one of the five basic strokes or a wildcard standing for any one basic stroke. A character in the component string may be a specific component or a wildcard standing for a component. A character in the pinyin string may be a pinyin letter or a wildcard standing for a pinyin letter. The component string has three search modes: "match the beginning of the field", "match any part of the field", and "match the whole field".
When the combined query dialog is opened, the default values of the stroke count, stroke string, component string, and pinyin string are the corresponding wildcards, and the default search mode for the component string is "match the beginning of the field". The user enters, as prescribed, any one or more items of the above query information for the character sought and issues the execute command. The program in the Chinese character coding resource lookup tool then has the computer search all records of the tool's information base according to the query information entered, quickly finds all target characters that satisfy the query conditions, and shows the user, in the established list display, the information stored in the record of each target character: "character, stroke count, pinyin, components, Yitong code, character-set code". These six items are the column headings of the list display, and each heading is also a command button that sorts the displayed information of the matching target characters by the specified attribute of that column.
Compared with the prior art, the present invention has the following beneficial effects:
Three terms are explained first: special semi-enclosing component, special semi-enclosed Chinese character, and special semi-enclosed character-forming stroke structure.
Special semi-enclosing component: when a character is split, a semi-enclosing component that cannot be split out in strict accordance with the Chinese character stroke order standard issued by the State Language Work Committee is a special semi-enclosing component. For example, 式 ("formula") is split into 弋 工; the component 弋 cannot be split out in strict accordance with the stroke order standard, so 弋 is called the special semi-enclosing component of the special semi-enclosed character 式. Note that special semi-enclosing components can be split only out of special semi-enclosed characters and out of characters containing a special semi-enclosed character-forming stroke structure. For example, 鸢 ("kite") contains 弋, but 鸢 is a top-bottom character, and the 弋 in 鸢 can be split out according to the stroke order standard, so there it is not a special semi-enclosing component.
Special semi-enclosed Chinese character: following the way dictionaries take the radicals of 巫, 幽, 爽, 噩 and 豳 ("witch, deep and remote, refreshing, shocking, Bin"), the embodiment splits 巫 into 工 人 人, 幽 into 山 幺 幺, 爽 into 大 爻 爻, 噩 into 王 吅 吅, and 豳 into 山 豕 豕. These five characters, together with the semi-enclosed characters whose semi-enclosing component cannot be split out in strict accordance with the Chinese character stroke order standard issued by the State Language Work Committee, are collectively called special semi-enclosed Chinese characters. Besides the 7 components "匚, 弋, [Figure GSB00000056150400051], 戈, Wu, , [Figure GSB00000056150400053]", the special semi-enclosing components of special semi-enclosed characters used in the embodiment also include 工 in 巫, 山 in 幽 and 豳, 大 in 爽, and 王 in 噩. The embodiment involves 41 special semi-enclosed characters from which a special semi-enclosing component can be split out, with a cumulative usage frequency of 0.6636%. These 41 characters are: minister gangster craftsman is huge, and district's deficient plaque two formula of casket doctor's impossible box of circle of the hideing glucoside of making a mistake of rectifying is military or guard against army and become relative to defend the salty the eleventh of the twelve Earthly Branches of prestige to cut out to wear to cut to plant and carry the refreshing shocking Bin of good Wu You.
Special semi-enclosed character-forming stroke structure: a semi-enclosed character-forming stroke structure from which a special semi-enclosing component can be split out is here called a special semi-enclosed character-forming stroke structure. For example, in the embodiment 斌 ("refined") is split into 文 [Figure GSB00000056150400054] 止, and 武 ("force") is called the special semi-enclosed character-forming stroke structure of the character 斌. Likewise, 宦 ("a surname") is split so that 匚 is taken out, and 臣 ("minister") is called the special semi-enclosed character-forming stroke structure of the character 宦. The embodiment involves 71 characters that contain a special semi-enclosed character-forming stroke structure and from which a special semi-enclosing component can be split out, with a cumulative usage frequency of 0.2008%. These 71 characters are: Ou Ou beats up the evil official of Yan, a state in the Zhou Dynasty bowl that crouches and scratches a basket frame socket of the eye and vomit and macerate body and drive the pivot weir and pound the bow ballad of laying down and deceive toilet case used by women in ancient China and pull up and sip crash rugged satisfied old woman's coffin with a corpse in it Chinese torreya a surname that irritates greasy examination horizontal bar in the front of a carriage used as an armrest terbium of wiping away of promethium Po small suitcase a round bamboo basket of sinking in and murder the puzzled commandment of the refined tax of the nautilus suede tool territory preesed finish chirp of the sincere Wei in city of the thief Guo marmoset threshold Ji a fabulous creature, said to be like a turtle and blow poisonous sand in man's face augury bright tomahawk of maple that rivers bend and hide.
The present invention encodes each Chinese character with a fixed-length code formed directly from all the shape components split out of the character itself; characters with too few shape components are padded with identification codes. Among all shape components, only three kinds are not selected in strict accordance with the Chinese character stroke order standard issued by the State Language Work Committee: the fully enclosing components (口, [Figure GSB00000056150400057]) split out of fully enclosed characters and of characters containing a fully enclosed character-forming stroke structure; the special semi-enclosing components split out of special semi-enclosed characters; and the special semi-enclosing components split out of characters containing a special semi-enclosed character-forming stroke structure. All other components are selected successively in strict accordance with that standard, and each coding component occupies only one key position. Besides advantages such as differing little from handwriting, being easy to code and enter, and combining well with literacy teaching, the method also simplifies computer-aided research on coded input. Because the coding component library contains the information base of how components form characters, the library files and program files needed are simpler, and fuller-featured research software for coded input methods is easy to develop as the research requires. Technical difficulties met during such research that are hard or impossible to solve by manual means such as consulting dictionaries, checking cards and taking notes can all be solved quickly and accurately by computer, so that the research is low in cost, high in efficiency and high in quality. The specific practices for every task involved in obtaining a high-quality shape-code input method (selecting the coding components, arranging the key positions, selecting the identification codes and arranging their keys, formulating the coding rules, and selecting the positioning components and arranging their keys) are therefore free of blindness and subjective arbitrariness. The correctness of each practice, and the actual values of the statistics that reflect coding quality, can be checked quickly and accurately with functions the software provides, so that research on shape-code input methods and evaluation of coding quality, both previously difficult, become markedly simpler.
The Chinese character components that take part in coding are selected by an optimization method. Under the overall premise that no character-forming stroke structure containing crossing strokes is ever split, research software for coded input methods with fairly complete functions is used to select character-forming stroke structures that match the physical structure of Chinese characters, have a high or relatively high character-forming frequency, and disperse codes effectively. The components chosen in this way both match the actual structure of Chinese characters and are closely tied to their actual use. Except for a few that have a low character-forming frequency and would also produce many duplicate codes, most of the radicals used in existing dictionaries have been taken as shape components that take part in coding. Of the 726 coding components used in the present embodiment, measured against the radicals defined in the "Unified Radical Table for Chinese Characters (Draft)" and leaving aside radicals used for traditional characters, 25 radicals, namely "than, old, minister, tongue, look, neat, wheat, red, halogen, city, tortoise, green grass or young crops, non-, tooth, Mian, mound, face, perfume (or spice), sound, separate, height, Huang, ancient cooking vessel, broomcorn millet, drum", are not taken as components for coding the characters of GB2312-80. Of the 726 coding components, 390 are character components (components that are themselves characters), 306 are non-character components, and 30 are identification codes. When characters are coded with four keys per character, each character uses four coding components, so coding the 6763 characters consumes 6763 × 4 = 27052 coding components in total. Of these 27052, character components account for 14979 (55.37% of the total), non-character components for 3394 (12.55%), and identification codes for 8679 (32.10%); the consumption of character components is about 4.4 times that of non-character components. Among the 6763 characters, setting identification codes aside, 3986 consist entirely of character components, with a cumulative usage frequency of 63.5556%. Of these, 175 are single-component characters (cumulative usage frequency 13.50%), 1984 are two-component characters (38.95%), 1433 are three-component characters (9.60%), and 394 are four-component characters (1.51%). The characters that consist entirely of non-character components number 89, with a cumulative usage frequency of 3.62%. Of these, 65 are two-component characters (3.46%), 23 are three-component characters (0.14%), and 1 is a four-component character (0.02%). The number of characters consisting entirely of character components is more than 44.7 times the number consisting entirely of non-character components, and the ratio of their cumulative usage frequencies is 17.56 : 1.
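The consumption figures above can be re-derived with a few lines of arithmetic. The following is a minimal sketch that uses only the counts quoted in the text; the variable names are illustrative. Computed this way, the identification-code share comes out at about 32.08% rather than the stated 32.10%, which may reflect rounding or transcription in the source.

```python
# Counts quoted in the text (GB2312-80, four coding components per character).
CHARACTERS = 6763
KEYS_PER_CHARACTER = 4

usage = {
    "character components": 14979,
    "non-character components": 3394,
    "identification codes": 8679,
}

# Total coding components consumed: 6763 * 4.
total = CHARACTERS * KEYS_PER_CHARACTER
assert sum(usage.values()) == total

# Share of each kind of coding component, in percent, rounded to 2 places.
shares = {name: round(100 * n / total, 2) for name, n in usage.items()}

# How many times more character components are consumed than non-character ones.
ratio = usage["character components"] / usage["non-character components"]

for name, share in shares.items():
    print(f"{name}: {usage[name]} ({share}%)")
print(f"character / non-character consumption: {ratio:.1f}")
```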
Most of the common components used in the code can be memorized quickly. Splitting is constrained by four conditions, among them that stroke structures containing crossing strokes are not split, that the split matches the physical structure of the character, and that the number of components split from each character is limited, so splitting and coding involve few uncertain factors. Although this input method uses many shape components (about 700), they need not be memorized by rote; once the method has been learned, few GB2312-80 characters remain that cannot be entered even after several trial splits. The selection of components thus has the advantages that little memorization is needed to master the splitting technique, that splitting is easy to grasp, and that a wrong split is easy to correct.
The present embodiment uses 10 positioning components, arranged on the 7 adjacent key positions "A, S, D, F, G, H, J" at the left of the home row. The key positions of the general components, identification codes and positioning components are given in Table 2:
Table 2: Key positions of the general components, identification codes and positioning components used in the embodiment
Figure GSB00000056150400071
When characters are entered with the input method of the present invention, only the key positions of a few positioning components need to be memorized by rote; the key positions of all other general components and of the identification codes can be determined from the stroke information the component contains. When the key map is arranged so that the key position of a general component is determined by the shape of its first stroke and by its stroke count, seeing the first stroke of a general component tells the user the component's row code and which finger to strike with, and counting its strokes tells the user which key of which keystroke zone to strike. This markedly reduces the memorization needed to learn the key positions, makes it markedly easier to determine which key to strike, and markedly simplifies the reasoning involved in splitting and coding. In short, when Chinese character input is solved with the split-coding scheme of the present invention, the coding rules, the splitting of characters, the key positions of general components, the selection and key positions of positioning components, and the key positions of identification codes are all easy to learn and, once learned, essentially not forgotten.
When the above key positions of general components, identification codes and positioning components are used to code the 6763 characters of GB2312-80, and the duplicate codes in the short-code library are adjusted to 167 groups with a cumulative usage frequency reduced to 0.4339%, the short-code situation is as follows: first-level short-code characters, 30 characters, cumulative usage frequency 19.3012%; second-level short-code characters, 829 characters, cumulative usage frequency 45.3092%; third-level short-code characters, 4076 characters, cumulative usage frequency 28.4323%; full-code characters, 1828 characters, cumulative usage frequency 4.4939%. The 30 first-level short-code characters are: once not come very much be for a short time in the state in the electricity the people must see I main say house this learns with sub that people's energy. When characters are entered with the coding of the embodiment, 53 characters require 3 identification codes (cumulative usage frequency 0.3521%); 922 characters require 2 identification codes (4.6204%); 1764 characters require 1 identification code (18.8122%); 3168 characters have input components identical to their shape components (52.3169%); and 856 characters have fewer input components than shape components (21.4350%).
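As a rough illustration of what the short-code distribution implies, the sketch below estimates a frequency-weighted average code length. It assumes, purely for illustration, that a level-k short code costs k keystrokes and a full code costs four, and it ignores any confirming key; the text does not state these costs. The listed frequencies sum to 97.5366%, so the average is normalized by that sum.

```python
# (assumed keystrokes, number of characters, cumulative usage frequency in %)
# The character counts and frequencies are the ones quoted in the text;
# the keystroke cost per level is an assumption made for this sketch.
levels = [
    (1, 30, 19.3012),    # first-level short codes
    (2, 829, 45.3092),   # second-level short codes
    (3, 4076, 28.4323),  # third-level short codes
    (4, 1828, 4.4939),   # full codes
]

total_chars = sum(n for _, n, _ in levels)
total_freq = sum(f for _, _, f in levels)

# Frequency-weighted mean keystrokes per character, normalized by the
# frequencies actually listed (they do not add up to exactly 100%).
avg_keys = sum(k * f for k, _, f in levels) / total_freq

print(total_chars, round(total_freq, 4), round(avg_keys, 2))
```

Under these assumptions the four levels cover all 6763 characters, and the weighted average is a little over two keystrokes per character.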
The duplicate-code situation is as follows: 142 groups of two characters sharing one code, cumulative usage frequency 0.3304%; 24 groups of three characters sharing one code, cumulative usage frequency 0.1016%; 1 group of four characters sharing one code, cumulative usage frequency 0.0019%. Among the duplicate-code characters, first-level characters share a code with other first-level characters in the following 18 groups:
1 disobey far welcome, 2 step circuitous, 3 great slaves drive, 4 allusion quotations are lain prone, 5 Pu'er tea floods, 6 slip Lip rivers, 7 hold concurrently mediocre, 8 hold back quick, spill at 9 Pus, 10 blind locks fast, 11 be jealous of donkey, 12 to relax to speed, 13 barium sodium, 14 cyanogen limb nitriles, 15 angry terrified, 16 carp Yu, 17 pah caye, 18 bite. Of these, the 4 three-character groups "disobey far welcome, great slave drives, blind lock is fast, cyanogen limb nitrile" each consist of two first-level characters sharing a code with one second-level character. The cumulative usage frequency of these 18 groups of duplicate-code characters is 0.1374%.
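The duplicate-code totals can be cross-checked, and the notion of a duplicate-code group made concrete, with a short sketch. The `reported` figures are the ones quoted above; the `toy_table` entries and their codes are hypothetical placeholders, not codes from the embodiment.

```python
from collections import defaultdict


def duplicate_groups(code_table):
    """Group characters by code and keep only codes shared by 2+ characters."""
    by_code = defaultdict(list)
    for char, code in code_table.items():
        by_code[code].append(char)
    return {code: chars for code, chars in by_code.items() if len(chars) > 1}


# group size -> (number of groups, cumulative usage frequency in %), from the text
reported = {2: (142, 0.3304), 3: (24, 0.1016), 4: (1, 0.0019)}

group_count = sum(n for n, _ in reported.values())
chars_involved = sum(size * n for size, (n, _) in reported.items())
freq_total = sum(f for _, f in reported.values())

# Hypothetical code table: placeholder characters and made-up four-key codes.
toy_table = {"A": "asdf", "B": "asdf", "C": "ghjk",
             "D": "sdfg", "E": "sdfg", "F": "sdfg"}

print(group_count, chars_involved, round(freq_total, 4))
print(duplicate_groups(toy_table))
```

The three group sizes add up to the 167 groups and 0.4339% cumulative frequency stated for the adjusted short-code library.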
According to the article "Discussion on the standardization of the 'cognitive code'" by Zhou Xian, on page 63 of issue 1, 1996 of the magazine "Chinese Information", among the 3755 commonly used first-level characters the Wubi (five-stroke) method has 68 pairs of duplicate-code characters, Zhengma has 89 pairs, the "grey a word used in person's names" code has 105 pairs, the latest version of Du Bingchan's holographic code has 206 pairs, and the configuration code has 264 pairs. Entering characters with the input method of the present invention therefore still has the advantages of few duplicate codes and fast input. It should be noted that suitably increasing the number of positioning components used can reduce the number of duplicate-code characters further.
The word coding rules of the Wubi (five-stroke) input method and the Erbi (two-stroke) input method share two problems: the coding rule for two-character words is inconsistent with the coding rule for multi-character words, and short codes are used only for single-character input, not for word input. These problems make word input inconvenient. The entries of Chinese dictionaries show that two-character words are the most numerous, followed by four-character words; three-character words, five-character words and words of more than five characters are all fewer than two-character and four-character words. Based on this distribution of word lengths, and on the fact that the candidate window the Windows operating system provides for word input can display 10 words at a time, the Yitong input method adopts short codes for both character and word input: items that have a short code are entered with the short code, and items that have none are entered with the full code. The word coding rules are obtained by revising the word coding rules of the Wubi input method. The five-codes-per-word rule takes codes in a way similar to the character coding rules for one-component, two-component, three-component and four-component characters, which makes the coding rules consistent across two-character, three-character, four-character and five-character words and words of more than five characters, and so easy to learn and apply. Coding words with five codes per word and entering them by short code effectively enlarges the word code space and improves its utilization, increasing the number of words the coding can hold while ensuring that the speed of character and word input does not fall as the code length grows. The rule for taking word short codes both lets characters and words with the same code value be shown together in the candidate window and lets the candidates prompted by the Windows candidate window be arranged in good order, so that the user can easily obtain the desired word, which helps input speed.
One external reason why popularized shape-code and sound-shape-code input methods are hard to learn is that the help prompts of their learning software are too simple, so that users struggle to master the input technique from those prompts. For example, the four learning programs "Easy Learning Wubi, Wubi Typing Master, Typing Pioneer, Wubi Fast Typing" included on the CD "Everyone Learns Wubi" all show this problem, to varying degrees, in the help prompts they give for each character. The learning software of the Yitong input method starts from what a learner must know to enter each character successfully. For every character in GB2312-80 it provides, in list form and according to the Chinese character stroke order standard issued by the State Language Work Committee and the split-coding scheme of the Yitong input method, a help prompt covering all of "Chinese character, short code, pinyin, frequency, coding components, component stroke order, component situation, key positions, keystroke situation, Chinese character stroke order". For characters from which neither a fully enclosing component nor a special semi-enclosing component is split out, every component can be selected successively in strict accordance with the stroke order standard, and the character's stroke order is given in two forms, the component-type stroke order and the component-type numbered stroke order. For characters from which a fully enclosing component or a special semi-enclosing component can be split out, the stroke order standard is broken when that component is split out. For these two kinds of characters, at the point where the standard is broken, the stroke order is given in two forms, the component-insertion stroke order and the component-insertion numbered stroke order. The stroke orders expressed by the component-insertion stroke order and the component-insertion numbered stroke order agree, respectively, with the following-type stroke order and the numbered stroke order in the Chinese character stroke order standard issued by the State Language Work Committee.
For example, 迅 ("fast") is split into three components (a one-stroke bending component, 十, and 辶), and the selection of all three can follow the Chinese character stroke order standard strictly and successively. The help prompt the learning software gives for 迅 is:
Writing the three components in order, followed by the character 迅 itself, expresses the stroke order of 迅 as its component-type stroke order; writing "5+12+454" expresses the stroke order of 迅 as its component-type numbered stroke order.
As another example, 或 ("or") is split into 戈 口 一. The first component, 戈, is not selected according to the Chinese character stroke order standard: once its first stroke, the horizontal, has been taken, the stroke order standard is broken. The help prompt the learning software gives for 或 is:
When the stroke order of 或 is expressed as "一 [Figure GSB00000056150400095] 或", it is obtained by inserting the second component 口 and the third component 一 between the first stroke and the last three strokes of the first component 戈; "一 [Figure GSB00000056150400097] 或" is called the component-insertion stroke order of the character 或. When the stroke order of 或 is expressed as "1-251+1-534", it is obtained by inserting the numbered stroke order "251" of the second component 口 and the numbered stroke order "1" of the third component 一 between the first stroke number "1" and the last three stroke numbers "534" of the numbered stroke order "1534" of the first component 戈; "1-251+1-534" is called the component-insertion numbered stroke order of the character 或.
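The two numbered notations, "5+12+454" and "1-251+1-534", can be generated mechanically from the per-component stroke numbers. The following is a minimal sketch; the function names and the `split_at` parameter are illustrative and do not come from the patent.

```python
def component_numbered_order(parts):
    """Component-type numbered stroke order: components joined in writing order."""
    return "+".join(parts)


def insertion_numbered_order(outer, split_at, inner_parts):
    """Component-insertion numbered stroke order.

    The outer component is interrupted after `split_at` strokes, the inner
    components are written, then the rest of the outer component follows.
    """
    head, tail = outer[:split_at], outer[split_at:]
    return f"{head}-{'+'.join(inner_parts)}-{tail}"


def plain_strokes(notation):
    """Drop the separators to recover the character's full stroke sequence."""
    return notation.replace("+", "").replace("-", "")


# Three components written one after another (the "5+12+454" example).
print(component_numbered_order(["5", "12", "454"]))

# Outer component "1534" interrupted after its first stroke by "251" and "1".
print(insertion_numbered_order("1534", 1, ["251", "1"]))
```

Removing the separators from "1-251+1-534" yields the plain stroke sequence 12511534, which is how the insertion form stays consistent with the numbered stroke order of the whole character.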
Because the help prompt that the learning software of the present invention provides for each character to be entered covers everything that must be known to enter that character successfully, a learner needs only about 2 hours of instruction in the basics of character input and of practice in operating the learning software. After that, by following the help prompts, the learner can not only learn the Yitong input technique fairly smoothly but also, while acquiring the basics of that technique, largely master the standard stroke order of commonly used characters and components. Giving a detailed and accurate help prompt for every character being learned lowers the difficulty of learning, saves learning time, and builds confidence in mastering a shape-code input method.
At present, the way primary school textbooks teach standard character writing cannot teach students to write standard characters well. First, there are many Chinese characters with complex shapes, and the time teachers and students have for teaching and learning standard writing is extremely limited; second, checking and supervision are difficult. With the Yitong input method, the key positions of a large number of general components and of all identification codes are determined by the shape of a component's strokes and by its stroke count, so a student who writes correctly enters characters smoothly; this pushes students to learn the standard way of writing each character they enter. The help prompts of the learning software make it easy for students to learn the standard writing of each character, and also reduce the standard writing of the 6763 characters of GB2312-80 to mastering the standard stroke order of more than 400 simple components. Non-standard habits in a student's writing can be largely corrected while learning to enter characters, and whatever is not fully corrected can continue to be corrected during later input. Entering characters with the Yitong input method thus hands over to the computer the work of checking whether students write correctly, correcting non-standard writing, and prompting students to write standard characters of their own accord, which helps implement the national law on language and script. This is a highly useful advantage that popularized coded input methods do not have.
Chinese character encoding input methods promoted so far give the user no tool for looking up encoding information about the characters in the operating system's character library through a combined query that joins stroke information, component information, and pinyin information. This makes it harder for the user to complete input work. The Yitong input method is the first to provide such a tool: for the characters in the operating system's character library, it resolves encoding lookups through a combined query over the stroke information, component information, and pinyin information of the character to be input. The three kinds of query information in the tool can each be used alone, or any two or all three can be used together.
For example, when stroke information is used alone, suppose the submitted query is "stroke count is 6, the first stroke of the stroke string is 'Dian', the other strokes are arbitrary". After the query is executed, the program in the query tool quickly finds every character in the operating system's character library whose first stroke is "Dian" and whose stroke count is 6, together with the information stored for each target character in the information library ("character, stroke count, pinyin, components, Yitong code, character-library code"), and shows it to the user in the configured list format.
As another example, when component information is used alone, suppose the user enters the component "husband" (夫) in the input box for the components of the character being sought. When the selected search mode is "match the start of the field", the program quickly finds all target characters in the operating system's character library whose first component is "husband", with their related information, and shows them in the configured list format; the target characters that belong to GB2312-80 are the four characters "husband, rule, replace, the man-drawn carriage used in ancient times". When the selected search mode is "match any part of the field", the program quickly finds all target characters that contain the component "husband", with their related information, and shows them in the same way; the target characters that belong to GB2312-80 are the 15 characters "husband, rule, replace, man-drawn carriage used in ancient times, skin, hold up, mistake, cottonrose hibiscus, furan, Geld, fall, peep, drive away, dive, bran". When the selected search mode is "match the whole field", the only target character is "husband" itself.
Using two or all three kinds of query information together widens the range, and increases the amount, of query information available for retrieving character information from the character library. To retrieve the character "Gu" (古), for example, the user can use its stroke information, component information, or pinyin information separately, or any piece of sub-information that combines them, such as "first stroke is horizontal and second component is mouth", "first stroke is horizontal, second component is mouth, and the initial is g", "stroke count is 5, second component is mouth, and the initial is g", or "first component is ten and the initial is g".
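The combined query described above can be sketched as follows. This is only an illustration in Python, not the program of the patent: the record layout, the field names, the stroke-shape label "heng", and the component breakdowns in the sample data are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharRecord:
    # Hypothetical record layout for one entry of the information library.
    char: str
    stroke_count: int
    first_stroke: str   # stroke-shape label, e.g. "heng" (horizontal)
    components: tuple   # components in writing order
    pinyin: str


def component_matches(record, component, mode):
    """Apply one of the three search modes to the component field."""
    if mode == "start":   # first component equals the query
        return record.components[:1] == (component,)
    if mode == "any":     # query occurs anywhere among the components
        return component in record.components
    if mode == "whole":   # the character consists of exactly this component
        return record.components == (component,)
    raise ValueError(f"unknown match mode: {mode}")


def query(records, stroke_count=None, first_stroke=None,
          component=None, mode="any", initial=None):
    """Return the characters that satisfy every criterion that was given."""
    hits = []
    for r in records:
        if stroke_count is not None and r.stroke_count != stroke_count:
            continue
        if first_stroke is not None and r.first_stroke != first_stroke:
            continue
        if component is not None and not component_matches(r, component, mode):
            continue
        if initial is not None and not r.pinyin.startswith(initial):
            continue
        hits.append(r.char)
    return hits


# Tiny illustrative data set (assumed breakdowns, not the patent's tables).
SAMPLE = [
    CharRecord("夫", 4, "heng", ("夫",), "fu"),
    CharRecord("规", 8, "heng", ("夫", "见"), "gui"),
    CharRecord("扶", 7, "heng", ("扌", "夫"), "fu"),
    CharRecord("古", 5, "heng", ("十", "口"), "gu"),
]
```

With this sample, `query(SAMPLE, component="夫", mode="start")` returns the characters that begin with the component, while `mode="any"` also returns characters that merely contain it. Leaving a criterion as `None` drops it from the query, which is how the three kinds of information are used separately or together.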
With this query tool, whenever the user meets a character that he or she cannot input while typing on the computer, the user can take any piece of sub-information from the combined stroke, component, and pinyin information of that character and find all target characters in the operating system's character library that satisfy the query, together with the information stored for each of them in the information library ("character, stroke count, pinyin, components, Yitong code, character-library code"). If the character to be input belongs to the operating system's character library, it is among the target characters; the user can find its Yitong code or character-library code and output it with the corresponding input method or by copy and paste. If the character does not belong to the operating system's character library, it will not appear among the target characters, but by looking up the characters that contain specified components the user can quickly find good reference characters for building it, and can then produce the character with the character-creation function provided by the operating system. The work of inputting Chinese characters on a computer is thereby solved more completely than in the prior art.
The Yitong encoding input method can also be obtained by determining the key positions of general components from the first-stroke shape and the stroke count while encoding with 30 keys and five keys per character. In that case the key positions of the general components and identification codes are the same as in the aforementioned scheme that determines the key positions of general components from the first-stroke shape and the stroke count and encodes with 30 keys and four keys per character. The decomposition can either limit each character to at most four components or limit each character to at most five components. When encoding with 30 keys and five keys per character, whichever of the two decomposition schemes is used, the same principles are followed so that the choice of encoding components agrees with the actual structure of the characters and the decomposition is easy to learn and hard to get wrong: a character-forming stroke structure that contains crossing strokes is never split; components are chosen in accordance with the physical structure of the character; and for structures with a low character-forming frequency the choice that disperses the codes well is preferred. The coding rule is also the same in both cases: a one-component character is encoded with its own component code plus 4 identification codes; a two-component character with its two component codes in order plus 3 identification codes; a three-component character with its 3 component codes in order plus 2 identification codes; a four-component character with its 4 component codes in order plus 1 identification code. The only difference is that there are no five-component characters under the scheme that limits each character to at most four components, whereas under the scheme that allows up to five components there are five-component characters, and they are encoded with their own 5 component codes in order.
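The equal-length coding rule above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name `assemble_code` is hypothetical, keys are represented as single letters, and the identification codes are passed in ready-made because this passage does not describe how they are derived.

```python
def assemble_code(component_keys, id_keys, keys_per_char=5):
    """Build a fixed-length code: component keys in order, then as many
    identification keys as are needed to reach keys_per_char.

    component_keys: keys of the character's components, in writing order.
    id_keys: candidate identification keys (their derivation is not
             modelled here; it is an input to this sketch).
    """
    n = len(component_keys)
    if not 1 <= n <= keys_per_char:
        raise ValueError("a character needs 1..keys_per_char components")
    needed = keys_per_char - n
    if len(id_keys) < needed:
        raise ValueError("not enough identification keys supplied")
    return "".join(component_keys) + "".join(id_keys[:needed])
```

For a one-component character the call consumes four identification keys, for a four-component character one, and for a five-component character none, which mirrors the rule stated above. Passing `keys_per_char=4` gives the analogous padding for a four-key code; that parameter is a convenience of the sketch.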
" general knowledge " by the 5th page of introduction of " whole people learn the Five-stroke Method " book known, " since now country with 18030 character libraries as the pressure standard; regulation does not reach the computer product that GB18030 requires and is all forbidden to sell in China; thus at present for a good input method of Chinese character, must employing GB18030 character library.Five companies of king's sign indicating number have released the input method version of compatible GB18030 character library, i.e. king's sign indicating number WB-18030 ".The GB18030 character library is included Chinese character 27484 words.Data introduction according to another relevant " character library complete or collected works ", be the coding of further standard Chinese character, International standardization (ISO) tissue expands Chinese character, on August 22nd, 2000, the SuperCJK V10.2 that announces comprises 70205 words altogether, is the full content of up-to-date upright super large character library.
Encoding with 30 keys and five keys per character gives a coding space 30 times as large as encoding with 30 keys and four keys per character, whereas 70,205 characters is only about 10.4 times 6,763 characters. When characters are input with short codes, the characters with long codes are rare characters, and once the coding space is greatly enlarged common characters no longer collide with other common characters. It follows that encoding with 30 keys and five keys per character not only gives very few duplicate codes for the 21,003 characters of GBK or the 27,484 characters of GB18030, but also keeps the advantage of few duplicate codes when encoding the 70,205 characters of the new Founder large character library. The decomposition-based encoding scheme of the present invention therefore serves two groups. For the large number of ordinary users who generally need only the 6,763 characters of GB2312-80, it provides an encoding input scheme and a way to output any character contained in the operating system. For those who must be able to output any character in the large character library, it provides a decomposition-based encoding approach as well.
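The figures in this comparison can be checked with a short calculation. The sketch below assumes that every one of the 30 keys may appear in every code position, so the coding space is simply 30 raised to the code length; the variable names are chosen for the example.

```python
KEYS = 30

space_four_keys = KEYS ** 4   # codes available with four keys per character
space_five_keys = KEYS ** 5   # codes available with five keys per character

growth = space_five_keys // space_four_keys   # how much the space grows
char_ratio = 70205 / 6763                     # large library vs GB2312-80

print(space_four_keys, space_five_keys, growth, round(char_ratio, 1))
# prints: 810000 24300000 30 10.4
```

The coding space grows by a factor of 30 while the number of characters grows by a factor of about 10.4, so the average number of characters per available code falls when moving to five keys.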
The stroke information that the Chinese character stroke-order standard issued by the State Language Work Committee lets us obtain accurately consists only of the stroke count of a character and the shape and writing order of each stroke. Because the keys of a computer keyboard lie on a plane, two pieces of stroke information are needed to determine the key position of a component, except for single-stroke components. Only two classes of information for determining the key position of a component can be obtained accurately from the stroke-order standard: the first class is the shape of any one specified stroke of the component together with the stroke count of the component; the second class is the shapes of any two specified strokes of the component. Within the first class, the first-stroke shape and the stroke count are the simplest and easiest to determine, and every component has them. Within the second class, the shapes of the first and second strokes are the simplest and easiest to determine, and every component except the single-stroke components has them. Therefore, when the Yitong encoding input method is obtained with the decomposition-based encoding scheme of claim 1 of the present invention, only two kinds of stroke information are best suited to determining the key positions of general components: one is the first-stroke shape and the stroke count of the component; the other is the shapes of the first and second strokes of the component (single-stroke components are determined by their own single stroke).
When 30 keys are used and characters are encoded with four keys or five keys per character, with the key position of a single-stroke component determined by its own single stroke and the key position of a general component that is not a single stroke determined by the shapes of its first and second strokes, the key map of the general components and identification codes is:
Table 3: Key map of the general components and identification codes when the key position of a single-stroke component is determined by its own single stroke and the key position of a general component that is not a single stroke is determined by the shapes of its first and second strokes
When the key position of a general component is determined by the shapes of its first and second strokes, the shape of the first stroke determines the column code of the key (the keystroke finger) within the left-hand and right-hand keystroke zones, and the shape of the second stroke determines the row code of the key (which row of which keystroke zone). When the general component is a single-stroke component, its key is one of the five home-row keys of the right-hand keystroke zone.
When decomposition-based encoding input is solved with the scheme of claim 1 of the present invention, fairly full-featured research software for Chinese character encoding input methods can be developed according to the actual needs of the research, so that the research can proceed with markedly less difficulty and without blindness in any of its main steps. Once the stroke information for determining the key positions of general components has been chosen, the selection of the corresponding components, the formulation of the coding rules, the choice of the stroke information that the identification codes should use, the arrangement of the key positions of the general components and identification codes, and the selection and key arrangement of the positioning components can all be carried out according to the actual structure of the characters and the actual or best approach found in use. The resulting input method has few uncertain factors, and none in the main coding path. It avoids the "ten thousand codes galloping" problem and the "coding pollution" problem that arose when, as typified by the Five-stroke Method, decomposition-based encoding input was solved with only part of the shape components of the characters.
Embodiment
Embodiment (how the encoding is generated):
The selection of components, the selection of identification codes, the formulation of coding rules, and the key arrangement of the encoding components are the four elements that produce a shape-based Chinese character input method that uses identification codes. Only when all four problems are resolved can a high-quality shape-based input method be obtained. The coding rules, the way identification codes are taken, and the key arrangement of the encoding components used in the embodiment are specified in claim 1, so only the way the components used in the embodiment are obtained is discussed here. The basis for obtaining the components is considered first.
Complex Chinese characters are formed by combining simpler characters and character-forming stroke structures, and the physical structure (form) of a character is determined by the character-forming stroke structures it contains. Following the convention of primary school teaching, the physical structure of characters is divided into nine types: single-body, top-bottom, top-middle-bottom, left-right, left-middle-right, semi-enclosed, fully enclosed, 品-shaped (triangular), and special. According to the characteristics of each type, characters of the four types top-bottom, left-right, semi-enclosed, and fully enclosed are split into two parts; characters of the three types top-middle-bottom, left-middle-right, and 品-shaped are split into three parts; characters of the special type are split into two or more parts. After splitting, the structure type of each resulting part still belongs to the nine types above.
Take the top-bottom characters of the "Lv" (grass radical, 艹) section as an example. For characters such as "skill; Chinese mugwort; Macrophylla; Splendid achnatherum; Grassland; Reed, seedling, English", both parts obtained by splitting into top and bottom are of the single-body type. For characters such as "sweet smell; Hardship, sweet potato, tongue; Grass; Tea, not, cyanines; Chinaroot greenbrier; Taro, Mu, water spinach", the lower part is of the top-bottom type. For characters such as "water chestnut; Collection; Climing; The heart of a lotus seed; Lamb's-quarters; A kind of sedges", the lower part is of the top-middle-bottom type. For characters such as "flower, desert, the membrane inside the rush stalk; Model, eggplant, eat; Calabash, a small bundle of straw, etc. for silkworms to spin cocoons on; Thin, rattan, algae; Lian, Scorched; Luxuriant", the lower part is of the left-right type. For the two characters "common vetch, Heng", the lower part is of the left-middle-right type. For characters such as "Li, severe, careless; Tong; Lotus, Portugal, Qian; Fluffy; Sugarcane, Lin, fern; Tibetan, fringed pinks", the lower part is of the semi-enclosed type. For the three characters "mattress, fennel, bacterium", the lower part is of the fully enclosed type. For characters such as "Li, Taipa, Onion, Bud, stamens", the lower part is 品-shaped. For the four characters "Wu; Fei; E; Alpine rush or palm-bark rain cape", the lower part is of the special type.
As can be seen, the structure types of character forms give the basic patterns by which character-forming stroke structures build characters. This feature of character formation shows that if the encoding components of a shape-based input method are chosen according to the physical structure of the characters, the components need not be memorized by rote, and the decomposition is easy to learn, easy to remember, and hard to get wrong.
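The first-level splitting rule described above amounts to a lookup from structure type to number of parts. The following Python sketch shows that lookup; the English type labels, the function name `first_level_parts`, and the choice to treat a single-body character as one part and to return `None` for the special type (which the text only describes as "two or more parts") are assumptions of the example.

```python
# Number of parts produced by the first split, per structure type.
PARTS_BY_STRUCTURE = {
    "top-bottom": 2,
    "left-right": 2,
    "semi-enclosed": 2,
    "fully-enclosed": 2,
    "top-middle-bottom": 3,
    "left-middle-right": 3,
    "triangular": 3,      # the 品-shaped type
}


def first_level_parts(structure):
    """Return how many parts a character of this structure type splits into.

    Returns 1 for single-body characters (assumed: they are not split at
    this level) and None for the special type, whose part count is not
    fixed by the rule.
    """
    if structure == "single-body":
        return 1
    if structure == "special":
        return None
    return PARTS_BY_STRUCTURE[structure]
```

Because each part obtained this way again has one of the nine structure types, the same lookup can be applied to a part when a finer split is wanted.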
Based on the actual structure of characters described above, the encoding components of the input method that encodes GB2312-80 with 30 keys and four keys per character, and that determines the key positions of general components from the first-stroke shape and the stroke count, are generated in the following two steps:
One. Preliminary selection of the components
The preliminary selection of components strictly follows the actual physical structure of the characters. When GB2312-80 is encoded with 30 keys and four keys per character, each character must be limited to at most four components, so that every character can be encoded with an equal-length code directly from all of its own shape components. The principles of the preliminary selection are: character-forming stroke structures that are themselves characters take priority; structures used as dictionary radicals take priority; structures with a high character-forming frequency take priority; characters that contain the same character-forming stroke structure are split into the same components for that structure; and a structure that contains crossing strokes is never split.
The main specific practices are as follows. Top-bottom and left-right characters made up of two simple character-forming stroke structures are split into two components; for example, "skill, Chinese mugwort, Macrophylla, splendid achnatherum, grassland, reed, seedling, English" are each split into only two components, and "Lv" is their common component. Fully enclosed and semi-enclosed characters whose enclosed part is a simple structure are split into two components; for example, "return" is split into "mouth, mouth", "chimney" into "Sunset", "limit" into "power, Chuo", and "rush" into "a horse". Left-middle-right and top-middle-bottom characters made up of simple structures are split into three components; for example, "spot" is split into "Wang, Wen, king", "tree" into "wood, again, cun", "gram" into "ten, mouth", and "chapter" into "upright, day, ten". 品-shaped characters are always split into three components; for example, "stand tall and upright" is split into "straight, straight, straight" and "prosperous" into "Jin, Jin, Jin". Complex characters are split into 3 or 4 components according to their physical structure. The split should be easy to learn, and characters that contain the same character-forming stroke structure should be split into the same components.
For example, "warding off" (辟) is a left-right character. In GB2312-80 the characters that contain "warding off" are the 18 characters "wall; arm; keep away; split; example; show favour to, an ancient piece of jade, round, flat and with a hole in its centre, bark of a cork tree, brick, thumb, folds in a garment, grind or sharpen, thunderclap, out-of-the-way, Pi, break off, Pi, addiction". In these 18 characters, apart from the structure "warding off" (their largest common component), the remaining structures and the "suffering" (辛) inside "warding off" are all dictionary radicals taken as components. The structure [component image] occurs only in "warding off" and in characters that contain it. "Warding off" is therefore split into the two components "[component image], suffering", and the 18 characters that contain "warding off" are split into 3 components. The split is easy to carry out, and the character-forming information carried by the components is effective.
"Doubting" (疑) is a left-right character. In GB2312-80 the characters that share the structure [component image] with "doubting" are the three characters "coagulate, a word used in place names, study". "An ancient type of spoon, arrow, Bing, mountain, then" are all dictionary radicals taken as components. The structure [component image] occurs only in the three characters "doubt, coagulate, a word used in place names", and [component image] is taken as a component. "Doubting" is split into the three components "an ancient type of spoon, arrow, [component image]"; "coagulate" into the four components "Bing, an ancient type of spoon, arrow, [component image]"; "a word used in place names" into the four components "mountain, an ancient type of spoon, arrow, [component image]"; and "study" into the three components "an ancient type of spoon, arrow, then". The splits of the four characters "doubt, coagulate, a word used in place names, study" are consistent and easy to learn, and the character-forming information carried by the components is effective.
As a further example, the four characters "win, win, Luo, thin" are top-middle-bottom characters, and "win" is contained in "sea". "Die" and "mouth" are both character components. When these five characters are split, "die" and "mouth" are each taken as one component; "win, win, Luo, thin" are each split into three components and "sea" into four. Each of the four characters is split into "die, mouth, [component image]", and "sea" is split into "Rui, die, mouth, [component image]". The splits of the five characters "win, win, Luo, thin, sea" are easy to learn, and the character-forming information carried by the components is even more effective.
Two. Use Chinese character encoding research software to find the components that, when the components split out in step one are used to encode the characters, produce many duplicate-code characters with a high cumulative usage frequency, and derive measures that agree with the actual structure of the characters and reduce both the number of duplicate-code characters and their cumulative usage frequency. The embodiment mainly uses the following 16 measures:
1. Take the 10 components "Lv, Rui, Ren, mouth, soil, wood, day, say, by, several" as the positioning components for computer input. Their key positions are shown in Table 2 on page 7 (key map of the general components, identification codes, and positioning components used in the embodiment).
2. For characters that can be split in different ways according to their physical structure, under the conditions that a character-forming stroke structure containing crossing strokes is not split and that each character is split into at most four components, take the split that produces the fewest duplicate codes. For example, in GB2312-80 the 12 characters "endure superfluous stroll the proud ouch storehouse for grain, etc. of proud large fierce dog perverse chela a flat iron plate for making cakes huge legendary turtle" are all made up of "Ao" and dictionary radicals taken as components. Under the two conditions above, these 12 characters that contain "Ao" can be split in three different ways according to their physical structure: (1) "Ao" is never split; (2) "Ao" is always split into the two components "[component image], Fan"; (3) "Ao" is always split into the three components "[component image], ten thousand, Fan". Of the three, method (3) produces the fewest duplicate codes with the lowest cumulative usage frequency: only the two characters "a flat iron plate for making cakes" and "huge legendary turtle" share a code, and both are second-level characters. The embodiment uses method (3) for these 12 characters.
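The comparison of candidate splits in measure 2 can be sketched as a small selection routine. This Python sketch is only an illustration of the idea, not the research software referred to in the text: the function names, the code strings, and the frequency figures are hypothetical, and candidates are ranked first by the number of characters involved in duplicate codes and then by their cumulative usage frequency.

```python
from collections import defaultdict


def duplicate_cost(codes, freq):
    """Return (number of characters that share a code with another
    character, cumulative usage frequency of those characters).

    codes: dict mapping character -> code under one candidate split.
    freq:  dict mapping character -> usage frequency (assumed data).
    """
    groups = defaultdict(list)
    for ch, code in codes.items():
        groups[code].append(ch)
    clashing = [ch for group in groups.values() if len(group) > 1
                for ch in group]
    return len(clashing), sum(freq.get(ch, 0.0) for ch in clashing)


def pick_split(candidates, freq):
    """Pick the candidate split with the fewest duplicate-code characters,
    breaking ties by the lower cumulative usage frequency."""
    return min(candidates,
               key=lambda name: duplicate_cost(candidates[name], freq))


# Toy example with made-up codes and frequencies.
freq = {"x": 0.5, "y": 0.25, "z": 0.125}
candidates = {
    "no-split":    {"x": "aa", "y": "aa", "z": "aa"},
    "two-parts":   {"x": "ab", "y": "ab", "z": "ac"},
    "three-parts": {"x": "abc", "y": "abd", "z": "abd"},
}
```

In this toy data both "two-parts" and "three-parts" leave two characters sharing a code, but the clashing pair under "three-parts" is used less often, so `pick_split(candidates, freq)` selects "three-parts".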
3. Among the character components (components that are themselves characters), the 11 components "east, each, the last of the twelve Earthly Branches, card, can, year, stone, emerging, friendly, first, prop up" are always taken as a single component, whether or not they are combined with other components. Every other character component is not split when it forms a character together with other components, and is split when it stands alone as a character if it satisfies the conditions for splitting. For example, "nose" is a character component. When "nose" stands alone as a character it is split into the 3 components "from, field, Ji". When characters such as "cutting off the nose, have a stuffy nose, snore, Cha, blow, Bi" are split, "nose" is taken as one component: "cutting off the nose" is split into "nose, Dao", "have a stuffy nose" into "nose, nine", "snore" into "nose, dryness", "Cha" into "nose, wood, day, one", "blow" into "Rolling, nose", and "Bi" into "Rui, nose".
4. Among the characters whose first stroke is "one" (the horizontal stroke), the 25 characters "the third bad beans two do open thanks to two flat three dead kings mutually do not have five west and descend Asias just to the towering pig of Chu Ji" are character components whose first stroke "one" can be split off as a single-stroke component. When one of them stands alone as a character, its first stroke "one" is split off as a component; when it is combined with other components, it is not split. (Of these, the 4 characters "three", "flat", "extremely", and "beans" are split into three components when standing alone: "three" into "one, one, one", "flat" into "one, Ha, ten", "extremely" into "one, sunset, an ancient type of spoon", and "beans" into "one, mouth, [component image]". The other 21 characters are each split into two components when standing alone.) For the 9 characters "more draw the beautiful rain that goes out extend again the Qi of separating", the first stroke "one" is split off as a component in all cases, whether or not the character is combined with other components. (Of these, the 3 characters "extend", "draw", and "separate" are always split into three components: "extend" into "one, day, one", "draw" into "one, field, Qian", and "separate" into "one, mouth, [component image]". The other 6 characters are always split into two components, whether or not they are combined with other components.)
5. Among the characters whose last stroke is "one", the 12 characters "two worker scholars with start black order ware bird assistant officer urgently" are character components whose last stroke "one" can be split off as a single-stroke component. When one of them stands alone as a character, its last stroke "one" is split off as a component (each of the 12 characters is split into two components); when it is combined with other components, it is not split. The last stroke "one" of "and" is split off in the character "and" itself and in characters whose last component is "and" (for example "group, elder sister, suitable, friendship"). In characters such as "county, outstanding, help, Ju, hoe, few" the structure "and" is not split ("county" is split into "and, Si", "outstanding" into "and, Si, heart", "help" into "and, power", "Ju" into "and, Cui", "hoe" into "Jin, and, power", and "few" into "[unrendered component], and, branch"). The only other character whose last stroke is "one" and can be split off is "defending", which is split into "Jie, one".
6. The only characters whose first stroke is "Shu" and can be split off are the two characters "little" and "old". "Little" is a character component whose first stroke "Shu" can be split off: when it stands alone as a character, the first stroke "Shu" is split off as a component ("little" is split into "Shu, eight"); when "little" is combined with other components, it is not split. "Old" is split into "Shu, day".
7. Among the characters whose first stroke is "Pie", the 18 characters "eight youngsters, thousand torr river Fan ox hairs are died young to give birth to and lose standing grain in vain from blood system the ninth of the ten Heavenly Stems" are character components whose first stroke "Pie" can be split off as a single-stroke component. When one of them stands alone as a character, its first stroke "Pie" is split off as a component (of these, "river" is split into the three components "Pie, Shu, Shu", and the other 17 characters are each split into two components); when it is combined with other components, it is not split. For the two characters "lack" and "losing", the first stroke "Pie" is split off as a component in all cases, whether or not the character is combined with other components ("lack" is split into the two components "Pie, it", and "losing" into the two components "Pie, goes"). The first stroke "Pie" of "liter, coin, offspring" is also split off as a component ("liter" is split into "Pie, European-allies", "coin" into "Pie, towel", and "offspring" into "Pie, one"; GB2312-80 contains no characters that contain "liter, coin, offspring").
8. Among the characters whose first stroke is "Dian", the 7 characters "the good justice in wide family, side are main forever" are character components whose first stroke "Dian" can be split off as a single-stroke component. When one of them stands alone as a character, its first stroke "Dian" is split off (each is split into two components); when it is combined with other components, it is not split.
9. Among the characters whose last stroke is "Dian", the 27 characters "the one Jian pawl is shooted a retrievable arrow in the eight fork chi worm Fa Fuge dragons of a specified duration third constellations of asking dog sword spoon art to defend my penta outstanding jade make melon" are character components whose last stroke "Dian" can be split off. When one of them stands alone as a character, its last stroke "Dian" is split off; when it is combined with other components, it is not split. (Of these, the 3 characters "Zhao", "order", and "Gua" are split into three components: "Zhao" into "[unrendered component], Shu, Dian", "order" into "people, [unrendered component], Dian", and "Gua" into "[unrendered component], Si, Dian". The other 24 characters are each split into two components.) For the two characters "all" and "rabbit", the last stroke "Dian" is split off as a component in all cases, whether or not the character is combined with other components ("all" is split into "several, Dian", and "rabbit" into "exempt, Dian"). The last stroke "Dian" of the two characters "book" and "pang" can also be split off ("book" is split into "book, Dian", and "pang" into "mound, Dian").
10. Among the characters whose first stroke is "Ya", the 2 characters "practise" and "department" are character components whose first stroke "Ya" can be split off as a single-stroke component. When one of them stands alone as a character, its first stroke "Ya" is split off ("practise" is split into "Ya, Bing", and "department" into "Ya, one, mouth"); when it is combined with other components, it is not split. "Tricky" is always split into the two components "Ya, one", whether or not it is combined with other components. "With" is always split into the three components "Ya, Dian, people", whether or not it is combined with other components. "Buying" is always split into the two components "Ya, head", whether or not it is combined with other components. "Lices" is split into "second, Pie, worm", and "fast" is split into "second, ten, Chuo".
A 11. end Shi “ Ya " Chinese character in, " youngster, oneself, the sixth of the twelve Earthly Branches, die, Cannibals " 5 words are last Bi “ Ya " removable character formation component, be separately one one-tenth word time Mo Bi “ Ya " removable (all tearing two parts open), do not tear open during with other parts group word.The last Bi “ Ya of " Qiang " two words " also removable, " " tear open "
Figure GSB000000561504001515
Yin ", " Qiang " Chai “  Yin ".
5 words such as " 12. the end is the Xiao Zhu fork-like farm tools used in ancient China not " are the character formation components of last two removable " eight ", when being one one-tenth word separately, last two removable be that eight (does not tear open at the end
Figure GSB000000561504001516
Eight, tear open for a short time to Shu eight, Zhu tear ox eight open, fork-like farm tool used in ancient China tears rich eight open), do not tear open during with other parts group word; 6 words such as " red fruits find pleasure in bundle also " no matter whether with other parts group word, tearing two parts and last two without exception open, all to tear open be eight (red tearing open Eight, fruit is torn open Eight, tear open Eight, happy tearing open Eight, bundle is torn open Eight, also tear open Eight); " east, card " two words no matter whether with other parts group word, do not tear open without exception; Grasp and tear open Eight.
13. contain in the Chinese character of " directly ", " directly " in 13 words such as " the true Yunnan, top of standing tall and upright careful fill out angry mountain peak, town to be full of careful Zhen insane " do not tear (only being taken as parts) open, " directly " in 6 words such as " directly plant grow value put clay " tears " ten open
Figure GSB000000561504001524
" two parts (" and " and " directly " be the group word stroke structure of " do not tear open after having, tear two without male offspring open ").
14. In the 6 characters "Bao Bao pot is praised and preserved blankets, cloth for baby", "staying" is taken only as a whole component. To keep the splitting of "Chinese bush cherry" consistent with that of 6 characters such as "the Chang Dangshang skirt hall palms", the "staying" in "Chinese bush cherry" is split into two components; that is, "Chinese bush cherry" is split into " mouth wood".
15. Among semi-enclosed Chinese characters, only two semi-enclosing components are split: the semi-enclosing component "Yes" of the three characters "topic spoon Wei" is split into "day
Figure GSB000000561504001526
", and the semi-enclosing component "Yao" of "sticking up" is split into "
Figure GSB000000561504001527
Towering". The semi-enclosing components of all other semi-enclosed characters are not split (they are taken only as whole components).
16. Among Chinese characters that contain special semi-enclosing components, "defending" and "salty" are character-forming components. The special semi-enclosing stroke structure "salty" contained in 8 characters such as "senses cry out shake regret seal alkali and subtract didactic literary composition", and the special semi-enclosing stroke structure "defending" contained in 3 characters such as "slight thin bamboo strip Mie", can only be taken as whole components; the special semi-enclosing component "penta" cannot be split out of them.
Note: the character frequency counts are taken from Appendix 8, "Modern Chinese everyday character frequency statistics", of the "Modern Chinese knowledge dictionary" (chief editors Gao Gengsheng, Tan Dezi and Wang Liyan).
Coding components and brevity code lists for the Chinese characters in zones 16 to 27:
Figure GSB00000056150400171
Figure GSB00000056150400201
Figure GSB00000056150400211
Figure GSB00000056150400221
Figure GSB00000056150400231

Claims (4)

1. A Yitong coding input method for entering Chinese characters into a computer, characterized in that: (1) every Chinese character is coded with an equal-length code directly from all of the shape components into which the character itself is split; each shape component occupies only one input key position during coding; where there are too few shape components, the code is padded with identification codes, and each identification code also occupies only one input key position; (2) the shape components that take part in coding are, in general, chosen in sequence according to the Chinese character stroke order standard formulated by the State Language Work Committee; (3) under the overall premise that a character-forming stroke structure containing crossing strokes is never split, the shape components that take part in coding are chosen, with the help of reasonably full-featured research software for Chinese character coding input methods, as effective character-forming stroke structures that fit the physical structure of Chinese characters, have a high or relatively high character-forming frequency, and give well-dispersed codes; (4) among all shape components, ten components are assigned designated coding key positions; these ten are the positioning components and the rest are general components; the coding key position of a general component is determined by its first-stroke shape and its stroke count; (5) the key position of an identification code is determined by designated information about the strokes contained in the last shape component.

When the input key positions of general components are determined by first-stroke shape and stroke count, and 30 keys are used to code the 6763 Chinese characters of GB2312-80 with four codes per character, each character is split into at most four components. A single-component character is coded with its own component code plus three identification codes; a two-component character is coded with its two component codes in order plus two identification codes; a three-component character is coded with its three component codes in order plus one identification code; a four-component character is coded with its four component codes in order. The component order of two-component, three-component and four-component characters is always the order in which the components are obtained when the character is split. For characters that need identification codes, the identification codes are always placed after the last component code. The specific cases for the first three kinds of character are as follows.

(a) When a single-component character is a single-stroke component, all three identification codes take the stroke shape and stroke number of that single stroke. When it is a two-stroke component, all three identification codes take the stroke shape and stroke number of its second stroke. When it is a three-stroke component, the first identification code takes the stroke shape and stroke number of its second stroke, and the second and third identification codes both take those of its third stroke. When it is a component of four or more strokes, the first identification code takes the stroke shape and stroke number of its second stroke, the second identification code takes those of its third stroke, and the third identification code takes those of its fourth stroke.

(b) When the second component of a two-component character is a single-stroke component, both identification codes take the stroke shape and stroke number of that single stroke. When the second component is a two-stroke component, both identification codes take the stroke shape and stroke number of its second stroke. When the second component has three or more strokes, the first identification code takes the stroke shape and stroke number of its second stroke, and the second identification code takes those of its third stroke.

(c) When the third component of a three-component character is a single-stroke component, the identification code takes the stroke shape and stroke number of that single stroke. When the third component has two or more strokes, the identification code takes the stroke shape and stroke number of its second stroke.

The 30 input key positions used are the 26 English letter keys together with the semicolon, comma, period and brace keys. The 15 keys of the left-hand keystroke zone and the 15 keys of the right-hand keystroke zone each form three rows and five columns. Counting outward from the boundary between the two zones, the column codes of the five columns in the left-hand zone are 1, 2, 3, 4, 5 from right to left, and the column codes of the five columns in the right-hand zone are 1, 2, 3, 4, 5 from left to right. Counting from the row next to the number keys, the row codes of the three rows in the left-hand zone are 3, 1, 5 and the row codes of the three rows in the right-hand zone are 4, 2, 6. The key position code of the key with row code i and column code j is ij.

As stipulated in the Chinese character stroke order standard formulated by the State Language Work Committee, the stroke-shape codes of the five basic strokes are: horizontal 1, vertical 2, left-falling 3, dot 4, turning 5. In both keystroke zones, the five columns of keys with column codes 1, 2, 3, 4, 5 serve, in that order, as the input key positions of general components whose first-stroke shape is horizontal, vertical, left-falling, dot, turning, and of identification codes whose stroke shape is horizontal, vertical, left-falling, dot, turning. The five rows of keys with row codes 1, 2, 3, 4, 5 serve, in that order, as the key positions of general components and identification codes whose stroke number is one, two, three, four, five; the keys with row code 6 serve as the input key positions of general components and identification codes with six or more strokes. This gives the key position table of general components and identification codes used when 30 keys code Chinese characters with four codes per character:

Table 1: key position table of general components and identification codes when the key positions of general components are determined by first-stroke shape and stroke count
Figure FSB00000056150300021
In the table, the entries horizontal 1 to horizontal 5 are the input key positions of general components and identification codes whose stroke shape is horizontal and whose stroke number is one, two, three, four, five respectively, and horizontal 6 is the input key position for stroke shape horizontal and six or more strokes. Shu 1, Shu 2, Shu 3, Shu 4, Shu 5 are the input key positions for stroke shape vertical and stroke numbers one to five, and Shu 6 is the input key position for stroke shape vertical and six or more strokes. Pie 1, Pie 2, Pie 3, Pie 4, Pie 5 are the input key positions for stroke shape left-falling and stroke numbers one to five, and Pie 6 is the input key position for stroke shape left-falling and six or more strokes. Dian 1, Dian 2, Dian 3, Dian 4, Dian 5 are the input key positions for stroke shape dot and stroke numbers one to five, and Dian 6 is the input key position for stroke shape dot and six or more strokes. The entries turning 1 to turning 5 are the input key positions for stroke shape turning and stroke numbers one to five, and the entry
Figure FSB00000056150300032
is the input key position of general components and identification codes whose stroke shape is turning and whose stroke number is six or more.
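The key-position rule and the identification-code rule of claim 1 can be sketched in a few lines of Python. This is only an illustrative reading of the claim, not the patentee's software: the function names, the English stroke-shape labels and the tuple returned by `key_location` are assumptions introduced here. The sketch does not cover the ten positioning components, whose key positions appear only in the figures, and it returns stroke ordinals rather than final identification keys because the claim's "stroke number" for an identification code is not spelled out further.

```python
# Stroke-shape codes stated in claim 1: horizontal 1, vertical 2,
# left-falling 3, dot 4, turning 5. They also serve as column codes.
STROKE_SHAPE_CODE = {
    "horizontal": 1, "vertical": 2, "left-falling": 3, "dot": 4, "turning": 5,
}

# Row code -> (keystroke zone, row counted down from the number-key row).
# Left-hand rows carry row codes 3, 1, 5; right-hand rows carry 4, 2, 6.
ROW_LOCATION = {
    3: ("left", 1), 1: ("left", 2), 5: ("left", 3),
    4: ("right", 1), 2: ("right", 2), 6: ("right", 3),
}


def key_position_code(first_stroke_shape: str, stroke_count: int) -> str:
    """Key position code "ij" of a general component (hypothetical helper).

    i is the row code (stroke count, with 6 standing for six or more);
    j is the column code (first-stroke shape).
    """
    if stroke_count < 1:
        raise ValueError("stroke_count must be at least 1")
    row_code = min(stroke_count, 6)
    column_code = STROKE_SHAPE_CODE[first_stroke_shape]
    return f"{row_code}{column_code}"


def key_location(code: str) -> tuple:
    """Return (zone, row from the number row, column from the zone boundary)."""
    zone, row = ROW_LOCATION[int(code[0])]
    return zone, row, int(code[1])


def identification_stroke_ordinals(component_count: int,
                                   last_component_strokes: int) -> list:
    """1-based ordinals of the strokes of the last component that supply
    the identification codes, following cases (a) to (c) of claim 1."""
    if not 1 <= component_count <= 4:
        raise ValueError("a character is split into 1 to 4 components")
    if last_component_strokes < 1:
        raise ValueError("a component has at least one stroke")
    needed = 4 - component_count  # codes are padded to a length of four
    # Codes start at the 2nd stroke and advance one stroke at a time,
    # repeating the last stroke when the component runs out of strokes.
    return [min(k + 2, last_component_strokes) for k in range(needed)]
```

For instance, under these assumptions a general component of three strokes whose first stroke is a dot gets key position code "34", which `key_location` places in the left-hand zone, on the row next to the number keys, in the fourth column from the zone boundary.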
2. The coding input method for Chinese characters according to claim 1, characterized in that characters and words that have a brevity code are entered with the brevity code, and those without a brevity code are entered with the full code. The brevity codes for word input are obtained by first coding each word with five codes per word and then deriving the input code value by taking brevity codes. The word coding rules are: a two-character word uses the first code of its first character followed by all four codes of its second character; a three-character word uses the first code of each of its first two characters in order, followed by the first three codes of its third character; a four-character word uses the first code of each of its first three characters in order, followed by the first two codes of its fourth character; a five-character word uses the first code of each character in order; a word of more than five characters uses the first code of each of its first five characters in order. The ordering rules for taking brevity codes are: words with different code values are ordered by code value, smaller values first; words with the same code value are ordered by number of characters, fewer characters first; words with the same number of characters are ordered by first-character frequency, higher frequency first; words with the same first character are ordered by second-character frequency, higher frequency first. The number of words assigned to each code value when taking brevity codes is decided by whether a Chinese character already has that code value and by how many characters and words with the same code value the candidate window can display at once. Brevity codes are taken in the following order: first the first-level brevity codes, then the second-level, then the third-level, and finally the fourth-level; the words that remain after the fourth-level brevity codes have been taken are five-code words. Characters and words with the same code value are displayed in the candidate window with characters first and words after. Words are displayed in this order: among words of different lengths, those with fewer characters come first; among words of the same length, those with the higher first-character frequency come first; among words with the same first character, those with the higher second-character frequency come first.
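The five-code word rule and the ordering used when taking brevity codes can be illustrated with a short Python sketch. It assumes each character already has a four-code full code represented as a four-character string; the function names and the tuple layout `(code, length, first_freq, second_freq)` are hypothetical, and the sketch does not model how many words each brevity level admits.

```python
def word_code(char_codes: list) -> str:
    """Build the five-code word code from the four-code character codes.

    char_codes holds one four-symbol code string per character of the word.
    """
    n = len(char_codes)
    if n < 2:
        raise ValueError("a word has at least two characters")
    if any(len(code) != 4 for code in char_codes):
        raise ValueError("every character code must have four symbols")
    if n == 2:
        # first code of character 1 + all four codes of character 2
        parts = [char_codes[0][:1], char_codes[1]]
    elif n == 3:
        # first codes of characters 1-2 + first three codes of character 3
        parts = [char_codes[0][:1], char_codes[1][:1], char_codes[2][:3]]
    elif n == 4:
        # first codes of characters 1-3 + first two codes of character 4
        parts = [c[:1] for c in char_codes[:3]] + [char_codes[3][:2]]
    else:
        # five or more characters: first code of each of the first five
        parts = [c[:1] for c in char_codes[:5]]
    return "".join(parts)


def brevity_order_key(entry: tuple) -> tuple:
    """Sort key for taking brevity codes.

    entry = (code, length, first_freq, second_freq); smaller code first,
    then fewer characters, then higher first-character frequency,
    then higher second-character frequency.
    """
    code, length, first_freq, second_freq = entry
    return (code, length, -first_freq, -second_freq)
```

Passing `brevity_order_key` as the `key` argument of `sorted` orders candidate words so that the earliest entries for each code value are the ones that would receive the shortest brevity codes.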
3. The coding input method for Chinese characters according to claim 1, characterized in that input-learning software provides, in tabular form for every Chinese character in GB2312-80, a help prompt covering all of the following: "Chinese character, brevity code, pinyin, frequency, coding components, component stroke order, component details, input key positions, keystroke details, Chinese character stroke order". For Chinese characters from which no fully enclosing component and no special semi-enclosing component is split out, every component can be chosen strictly in sequence according to the Chinese character stroke order standard, and the character's stroke order is given in two ways: component-type stroke order and component-type numbered stroke order. For Chinese characters from which a fully enclosing component or a special semi-enclosing component can be split out, splitting out the fully enclosing component or the special semi-enclosing component breaks the Chinese character stroke order standard; for these two kinds of character, at the place where the standard is broken, the stroke order is given in two ways, component-insertion stroke order and component-insertion numbered stroke order. The stroke order representations obtained with the component-insertion stroke order and the component-insertion numbered stroke order are consistent, respectively, with the follow-type stroke order and the numbered stroke order in the Chinese character stroke order standard formulated by the State Language Work Committee.
4. The coding input method for Chinese characters according to claim 1, characterized in that, for the Chinese characters in the operating system's Chinese character library, the user is provided with a tool that solves the querying of Chinese character coding information by an integrated query combining a stroke information query, a component information query and a pinyin information query on the character. The stroke information query comprises two items, the stroke count and the stroke string of the character sought; the component information query comprises two items, the component string of the character sought and the search mode of the component string; the pinyin information query comprises one item, the pinyin string of the character sought. A character in the stroke string is either one of the five basic strokes or a wildcard standing for any basic stroke; a character in the component string is either a specific Chinese character component or a wildcard standing for a component; a character in the pinyin string is either a pinyin letter or a wildcard standing for a pinyin letter. The component string has three search modes: "match the beginning of the field", "match any part of the field" and "match the whole field". When the integrated query dialog is opened, the default values of the stroke count, stroke string, component string and pinyin string are the corresponding wildcards, and the default search mode of the component string is "match the beginning of the field". The user enters, as prescribed, any of the above items of query information for the character sought and issues the execute command; the program in the coding information query tool then has the computer quickly find, among all records of the tool's information base and according to the query information entered, all target characters that satisfy the query conditions, and displays to the user the six items stored in the record of each target character: "Chinese character, stroke count, pinyin, components, Yitong code, character library code". "Chinese character, stroke count, pinyin, components, Yitong code, character library code" are both the column headings of the displayed list and command buttons that sort the displayed information of the matching characters by the attribute of the corresponding column.
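The three search modes and the single-position wildcards described in claim 4 can be sketched as follows. The claim does not say which symbol is used as the wildcard, so the `?` character, the mode names `"start"`, `"any"` and `"whole"`, and the function names are assumptions made for illustration. The component strings used in any call are whatever the caller supplies; they are not taken from the patent's component tables.

```python
def matches(field: str, pattern: str, mode: str = "start",
            wildcard: str = "?") -> bool:
    """Match a query string against a stored field.

    Each pattern position is either a literal symbol (stroke code,
    component or pinyin letter) or the wildcard, which stands for
    exactly one symbol. mode selects "start" (beginning of the field,
    the default named in the claim), "any" (any part of the field)
    or "whole" (the entire field).
    """
    def match_at(offset: int) -> bool:
        if offset + len(pattern) > len(field):
            return False
        return all(p == wildcard or p == field[offset + i]
                   for i, p in enumerate(pattern))

    if mode == "start":
        return match_at(0)
    if mode == "whole":
        return len(field) == len(pattern) and match_at(0)
    if mode == "any":
        return any(match_at(o) for o in range(len(field) - len(pattern) + 1))
    raise ValueError(f"unknown search mode: {mode}")


def find_characters(records: dict, component_query: str,
                    mode: str = "start") -> list:
    """Return the characters whose component string satisfies the query.

    records maps each character to its component string (hypothetical
    layout; a real information base would also hold stroke count,
    pinyin and code columns).
    """
    return [char for char, components in records.items()
            if matches(components, component_query, mode)]
```

A fuller tool would apply `matches` to the stroke string and pinyin string as well and keep only the records that pass every test, then sort the result by whichever column the user selects.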
CN2006100107837A 2006-03-30 2006-03-30 Chinese character input method for computer Expired - Fee Related CN1828494B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2006100107837A CN1828494B (en) 2006-03-30 2006-03-30 Chinese character input method for computer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2006100107837A CN1828494B (en) 2006-03-30 2006-03-30 Chinese character input method for computer

Publications (2)

Publication Number Publication Date
CN1828494A CN1828494A (en) 2006-09-06
CN1828494B true CN1828494B (en) 2011-06-01

Family

ID=36946919

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2006100107837A Expired - Fee Related CN1828494B (en) 2006-03-30 2006-03-30 Chinese character input method for computer

Country Status (1)

Country Link
CN (1) CN1828494B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1193767A (en) * 1997-03-18 1998-09-23 徐祖华 Chinese character coding method with coding components corresponding with shaping components

Also Published As

Publication number Publication date
CN1828494A (en) 2006-09-06

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110601

Termination date: 20150330

EXPY Termination of patent right or utility model