CN111353047A - Method for generating artificial intelligence knowledge map based on Chinese character making method - Google Patents

Method for generating artificial intelligence knowledge map based on Chinese character making method Download PDF

Info

Publication number
CN111353047A
CN111353047A CN201911391218.3A CN201911391218A CN111353047A CN 111353047 A CN111353047 A CN 111353047A CN 201911391218 A CN201911391218 A CN 201911391218A CN 111353047 A CN111353047 A CN 111353047A
Authority
CN
China
Prior art keywords
chinese character
chinese
characters
character
section coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911391218.3A
Other languages
Chinese (zh)
Inventor
彭韡铭
邓耀臣
申明磊
乔明琼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changshu Xinmu Qibao Software Development Co ltd
Original Assignee
Changshu Xinmu Qibao Software Development Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changshu Xinmu Qibao Software Development Co ltd filed Critical Changshu Xinmu Qibao Software Development Co ltd
Priority to CN201911391218.3A priority Critical patent/CN111353047A/en
Publication of CN111353047A publication Critical patent/CN111353047A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a method for generating artificial intelligence knowledge map based on Chinese character making method, which comprises establishing stroke number of Chinese character, phonetic part of Chinese character, phonetic and polyphonic character of Chinese character, part of speech of Chinese character, variant character of Chinese character, false word of Chinese character, four corner number of Chinese character, and segment code of Chinese character making method by defining Chinese character range and set, and finally establishing basic dimension using stroke number of Chinese character, phonetic part of Chinese character, part of speech of Chinese character, variant character of Chinese character, false word of Chinese character, four corner number of Chinese character, the artificial intelligent knowledge map based on the Chinese character forming method is generated by taking the Chinese character forming method forming mode as the core dimension, through the mode, the method improves the accuracy and efficiency of natural language processing of the artificial intelligent deep learning engine for Chinese semantic and the comprehension capability of computer semantic, and can be used as a Chinese character learning auxiliary tool to more intuitively display the venation and connection of Chinese character development.

Description

Method for generating artificial intelligence knowledge map based on Chinese character making method
Technical Field
The invention relates to a knowledge map method, in particular to a method for generating an artificial intelligence knowledge map based on a Chinese character making method.
Background
1. Knowledge graph:
knowledge map (Knowledge Graph) is a series of different graphs displaying Knowledge development process and structure relationship in the book intelligence field, describing Knowledge resources and carriers thereof by using visualization technology, mining, analyzing, constructing, drawing and displaying Knowledge and mutual relation between Knowledge resources and Knowledge carriers.
The knowledge graph is a modern theory which achieves the aim of multi-discipline fusion by combining theories and methods of applying subjects such as mathematics, graphics, information visualization technology, information science and the like with methods such as metrology introduction analysis, co-occurrence analysis and the like and utilizing a visualized graph to vividly display the core structure, development history, frontier field and overall knowledge framework of the subjects. It can provide practical and valuable reference for subject research.
Specifically, the knowledge graph is a modern theory which achieves the purpose of multi-discipline fusion by combining theories and methods of applying subjects such as mathematics, graphics, information visualization technology, information science and the like with methods such as metrology introduction analysis, co-occurrence analysis and the like and utilizing a visualized graph to vividly display the core structure, development history, frontier field and overall knowledge framework of the subjects. The method displays the complex knowledge field through data mining, information processing, knowledge measurement and graph drawing, reveals the dynamic development rule of the knowledge field, and provides a practical and valuable reference for subject research.
2. Deep learning:
deep Learning (DL) is a new research direction in the field of Machine Learning (ML), and is an intrinsic rule and expression level of Learning sample data, and information obtained in the Learning process is very helpful for interpretation of data such as characters, images and sounds. The final aim of the method is to enable the machine to have the analysis and learning capability like a human, and to recognize data such as characters, images and sounds.
3. Data mining:
data mining refers to a nontrivial process of revealing implicit, previously unknown and potentially valuable information from a large amount of data in a database, and is a decision support process which is mainly based on artificial intelligence, machine learning, pattern recognition, statistics, databases, visualization technologies and the like, analyzes data of enterprises in a highly automated manner, makes inductive reasoning, excavates potential patterns from the data, helps decision makers to adjust market strategies, reduces risks and makes correct decisions.
Data mining is a technology for searching a rule from a large amount of data by analyzing each piece of data, and mainly comprises three steps of data preparation, rule searching and rule representation. The data preparation is to select required data from related data sources and integrate the data into a data set for data mining; the rule searching is to find out the rule contained in the data set by a certain method; the law representation is to represent the found laws as much as possible in a manner understandable to the user (e.g., visualization). The data mining task comprises association analysis, cluster analysis, classification analysis, anomaly analysis, specific group analysis, evolution analysis and the like.
4. And (3) natural language processing:
natural language processing, i.e., implementing man-machine natural language communication, or implementing natural language understanding and natural language generation, is very difficult. The root cause of the difficulty is the wide variety of ambiguities or ambiguities that exist widely across the various levels of natural language text and dialog. To resolve ambiguities, an extremely large amount of knowledge and reasoning is required. How to collect and sort out the knowledge more completely; how to find the proper forms and store them in the computer system; and how to effectively use them to disambiguate are very labor intensive and difficult tasks.
5. Chinese character making method
The Chinese character forming method includes pictograph, fingering, meeting meaning and pictophonetic sound. The ancient Chinese has a six-book method for word-making, and in addition to the four methods, the method also includes transfer and dummy borrowing.
Pictograph method
Some pictographs are not used to represent concrete objects directly, but are used to represent concepts related to the objects. For example: the big character originally has the shape of a person with a right front side, and the spread of hands and feet means the big character. This word-making method, which is generalized from nouns to adjectives, was developed later. Pictograms are the earliest in origin, but the least in number. Because there are many things in the universe, it is impossible to make a pictograph according to the characteristics of each thing, many abstract concepts cannot be drawn at all, and even if drawn, the method of drawing a cucurbit according to the drawing is too unwieldy. Later, in order to overcome the limitation of pictographs and meet the requirement of social development, another word-making method, namely a finger affair method, is invented.
Finger exercise
It is a word-making method that uses symbolic symbols or adds indicative symbols on the graph to express meaning. Such word-building methods can be broadly divided into two categories: one type is a simple symbol, i.e., a non-graphical symbol (graphical, i.e., pictographic). Another type is a symbol attached to a graphic.
The first step of Chinese character development from pictograph to ideograph is to make characters by finger affairs. However, this method cannot be applied to many things generally, and the dots, horizontal lines, etc. as symbols are mixed with other strokes and are not easy to distinguish, so that our ancestors have thought of a method in which several pictures are combined to represent a meaning, i.e., a method of meeting the meaning.
Method of meeting
That is, a new meaning of a new word-making method is represented by combining two or more words according to their meanings.
The Chinese characters are developed to this step, and the requirement of representation cannot be met. Because some do not solve the problem by drawing three or four pictures. The problem can be solved only by drawing five, six or seven, eight pictures, and unlike a character, the reverse image is drawn in a year. If the fact is true, writing a short article is more troublesome today, and writing a thick comic book is equivalent to compiling a thick comic book. Our ancestors have endless wisdom, and they think that the words are recorded and speeches are related to the voices, and then can the words be recorded or not recorded with the voices? Therefore, the pictophonetic characters are invented.
Shape and sound method
It is a word-making method by combining the side of the figure and the side of the sound. Because the pictophonetic characters are generated on the basis of pictographs, fingering and meeting. Although it belongs to ideographic characters, the Chinese character form constraint is broken through, and the relation between Chinese characters in squares and voice is communicated, which is a great development of Chinese characters from ideographic step to phonetic step and becomes the mainstream of Chinese character development. More than 80% of Chinese characters used today are pictophonetic characters. The method is very convenient for making words by using an iconophonetic method, and the words related to the golden wood, the water and fire, the birds, the beasts, the insects and the fishes, the sigh words and the pseudonymphs are almost all the pictophonetic words. The word-making method can continuously make new words today. For example, the recently discovered chemical elements "americium, curium, lawrencium" are named by the new-made pictophonetic characters. For simplifying traditional Chinese characters, the sound and shape methods such as "you", "bang", etc. are also preferred.
Transfer method
The past has many descriptions, and the more popular description is a word making method adopted to adapt to the development of the partials and the voice. If the original 'old' is used for representing an older meaning, and the 'k a o' sound is used for representing the old meaning due to different time and regions, then a 'exam' word which is the same as the 'old' word and has similar pronunciation and the same meaning is created. But the side of the "exam" word is "? ", the consonant side is"? "therefore, the" kao "word is also an pictophonetic word in terms of its word-making method.
Dummy borrowing method
It is a method of using homophone to express new meaning (the meaning of "false" is also used), for example, "Ru" is the name of water, and is used as pronouncing for second person. This kind of false borrowing phenomenon is mainly because there is no word and temporarily borrows a homophone word to replace at first, but later use all the time, borrow for a long time, magpie dove occupies. There is a meaning of the word itself, and it is only after another way is found that the word falls. For example, "then" means that the fire is already in the four points below, but after being borrowed by "then", the user only needs to add another fire beside the fire to turn into the word "fire" (form and sound).
6. Existing research tools and methods:
at present, in the industry, a method for generating an artificial intelligence knowledge graph based on a Chinese character making method is blank. The existing research tools and methods mainly take forms and card type arrangement as main points. The Chinese characters are used as basic units, efficient logic matrixes are not formed yet, the Chinese characters are still in a mutually split and isolated state, natural language processing, deep learning and data mining are carried out, and accuracy and efficiency are not high.
Meanwhile, a large number of Chinese character learners lack tools and methods capable of visually displaying the internal connection and venation of the Chinese characters, can only learn by relying on a large number of exercises, mainly take a large amount of time and energy by reading, transcribing, reciting and mersible writing, and have poor effects.
The invention builds segment codes for different character codes of Chinese characters based on a Chinese character making method, and the method comprises the following steps: chinese characters-stroke number-phonetic part-phonetic alphabet and polyphone-part of speech-variant character-common false character-four corner number-Chinese character making method-unique digital ID number of multi-section code. The method is characterized in that the number of strokes of a Chinese character, the phonetic parts of the Chinese phonetic alphabet, the part of speech of the Chinese character, variant characters of the Chinese character, common kana characters of the Chinese character and four corner numbers of the Chinese character are taken as basic dimensions, and the composition mode of a Chinese character making method is taken as a core dimension to be induced, analyzed and associated. Finally, an artificial intelligence knowledge map based on the Chinese character making method is generated.
The method comprises the steps of coding Chinese character characteristic sections, uniquely digitizing ID, and generating an artificial intelligence knowledge map based on a Chinese character word-making method, and establishing a logic array and the knowledge map at Chinese character information granularity. The natural language processing engine can generate a knowledge graph based on the method, so that Chinese can be understood more accurately and efficiently, and data mining and machine deep learning are performed. The logical chain of the dimension of the Chinese character making method is opened, and the comprehension capability of computer semantic calculation is improved.
And the Chinese character learner can intuitively and efficiently obtain the Chinese character data rule and vein by the method. Fast mastering Chinese characters, a great deal of Chinese characters, and becoming a high-level language learner. Meanwhile, the system can be spread and popularized at low cost through the network Internet, and promotes social equity and high-quality education guarantee of the people.
Disclosure of Invention
The invention mainly solves the technical problem of providing a method for generating an artificial intelligence knowledge map based on a Chinese character forming method, which establishes section codes for different feature codes of Chinese characters by taking the Chinese character forming method as a basis to form the following steps: chinese characters-stroke number-phonetic part-phonetic alphabet and polyphone-part of speech-variant character-common false character-four corner number-Chinese character making method-unique digital ID number of multi-section code. The method is characterized in that the number of strokes of a Chinese character, the phonetic parts of the Chinese phonetic alphabet, the part of speech of the Chinese character, variant characters of the Chinese character, common kana characters of the Chinese character and four corner numbers of the Chinese character are taken as basic dimensions, and the composition mode of a Chinese character making method is taken as a core dimension to be induced, analyzed and associated. Finally, an artificial intelligence knowledge map based on the Chinese character making method is generated.
The invention is realized by the following technical scheme.
The invention provides a method for generating an artificial intelligence knowledge map based on a Chinese character making method, which is characterized by comprising the following steps: the specific working steps comprise:
1) defining the range and set of Chinese characters;
2) carrying out section coding processing on the stroke number of the Chinese character, and determining the unique digital ID number of the stroke number section coding;
3) carrying out section coding processing on the Chinese pinyin part, and determining a unique digital ID number of the section coding of the Chinese pinyin part;
4) carrying out section coding processing on Chinese pinyin and polyphone of the Chinese characters, and determining the unique digital ID number of the section coding of the Chinese pinyin and the polyphone of the Chinese characters;
5) carrying out section coding processing on part of speech of the Chinese character, and determining a unique digital ID number of the section coding of the part of speech of the Chinese character;
6) carrying out section coding processing on the Chinese character variant characters, and determining the unique digital ID number of the section coding of the Chinese character variant characters;
7) carrying out section coding processing on the Chinese character Tong-Han, and determining the unique digital ID number of the section coding of the Chinese character Tong-Han;
8) carrying out section coding processing on the four-corner number of the Chinese character, and determining the unique digital ID number of the section coding of the four-corner number of the Chinese character;
9) carrying out section coding processing on the Chinese character word-making method, and determining the unique digital ID number of the section coding of the Chinese character word-making method;
10) establishing an artificial intelligence knowledge graph based on the ID number of the Chinese character word-building method by taking the ID number of the Chinese character stroke number, the ID number of the Chinese pinyin sound part, the part-of-speech ID number of the Chinese character, the ID number of the foreign-type character of the Chinese character, the ID number of the common-type character of the Chinese character and the ID number of the four-corner number of the Chinese character as basic dimensions and taking the ID number of the Chinese character word-building method as a core dimension;
11) and establishing the transverse connection of the knowledge graph through a knowledge graph editing tool.
Further, the Chinese character range and the set in the step 1) comprise all Chinese characters such as oracle, traditional Chinese characters, simplified Chinese characters and the like.
Further, the Chinese character stroke number segment codes in the step 2) are arranged in sequence according to the size of the Chinese character stroke number Arabic numerals.
Further, the part of speech of the Chinese characters in the step 5) is classified into: nouns, verbs, adjectives, adverbs, pronouns, prepositions, quantifiers, conjunctions, auxiliary words, numerators, sigers, and pseudonyms, and section coding is performed according to the part-of-speech classification of the modern Chinese.
Further, the Chinese characters of step 7) are arranged according to the sound parts and are subjected to segment coding.
Further, the Chinese character making method of the step 9) carries out section coding according to pictographs, fingering, meeting meanings, pictophonetics, commentary and pretending.
Further, the step 10) generates an artificial intelligence knowledge map based on the Chinese character making method, and can display a core structure and an integral knowledge framework and establish a logic network.
The invention has the beneficial effects that: the invention creates a logic array and a knowledge map by coding the Chinese character characteristic sections, uniquely digitizing ID and generating an artificial intelligence knowledge map based on a Chinese character word-making method. The natural language processing engine can generate a knowledge graph based on the method, so that Chinese can be understood more accurately and efficiently, and data mining and machine deep learning are performed. The logical chain of the dimension of the Chinese character making method is opened, and the comprehension capability of the computer semantic calculation can be improved. And the Chinese character learner can intuitively and efficiently obtain the Chinese character data rule and venation through the method, quickly master a large number of Chinese characters and become a high-level language learner. Meanwhile, the system can be spread and popularized at low cost through the network Internet, and promotes social equity and high-quality education guarantee of the people.
Drawings
FIG. 1 is a sequence table of the Chinese character stroke section coding of the present invention.
FIG. 2 is an example of "line" word stroke segment encoding of the present invention.
FIG. 3 is the sequence table of the Chinese character Pinyin part segment codes of the present invention.
FIG. 4 is an example of the "line" phonetic part knowledge map of the present invention.
FIG. 5 is an example of a knowledge graph of polyphone in the phonetic part of the "line" character of the present invention
FIG. 6 is a sequence table of part-of-speech section codes of Chinese characters in accordance with the present invention.
FIG. 7 is an example of a knowledge map of the multiple parts of speech of polyphone in the "line" character pronunciation part of the present invention.
FIG. 8 is an example of the knowledge map of the multiple-word variant characters of the polyphone part of the "line" character pronunciation part of the present invention.
FIG. 9 is an example of a knowledge map of the multi-pronouncing variant kanji of the "line" character pronunciation portion of the present invention.
FIG. 10 is a knowledge graph of four corner numbers of pronouncing characters and pronouncing characters of the line part of the Chinese character.
FIG. 11 is a block code table of the present invention.
FIG. 12 is a knowledge graph example of the four corner number word-making method for the polyphone characters of the phonetic part of the "line" character
FIG. 13 is a knowledge map of the unique digitized ID numbers of the "line" word multi-segment codes of the present invention.
FIG. 14 is the artificial intelligence knowledge map generated by the Chinese character line-making method of the present invention.
FIG. 15 is a pictographic method of creating an artificial intelligence knowledge map and building a connection in accordance with the present invention.
FIG. 16 is a diagram of the artificial word method of the present invention for generating an artificial intelligence knowledge graph and establishing connections.
FIG. 17 is an artificial intelligence knowledge graph generated and associated by the idea word-building method of the present invention.
FIG. 18 is a diagram of the method for forming words by shape and sound of the present invention to generate artificial intelligence knowledge map and establish relationship.
FIG. 19 is a method for transferring words to create an artificial intelligence knowledge map and establish relationships in accordance with the present invention.
FIG. 20 is a diagram of artificial intelligence knowledge graph generation and association establishment by the fake borrowing method of the present invention.
Detailed Description
The following detailed description of the preferred embodiments of the present invention is provided to enable those skilled in the art to more readily understand the advantages and features of the present invention, and to clearly and unequivocally define the scope of the present invention.
A method for generating artificial intelligence knowledge map based on Chinese character making method includes the following steps:
1) defining the range and set of Chinese characters;
the character set comprises all Chinese characters such as oracle-bone inscription, traditional Chinese characters, simplified Chinese characters and the like,
oracle is based on the book "Oracle dictionary" by Wangbing Benxing, published by Beijing Art Press;
the traditional Chinese characters are based on the book Wen Jie word and the book Kangxi dictionary;
simplified Chinese characters are based on the simplified Chinese character scheme published by State administration of the people's republic of China in 1956, and finally the simplified character general table is formulated. 2) Referring to fig. 1 and 2, the segment coding process is performed according to the stroke number of the Chinese character, and a unique digital ID number containing the stroke number segment coding is established;
in the process of establishing Chinese character coding, firstly, the established coding section adopts stroke number as the unique digital ID number and adopts stroke number sequencing. Meanwhile, since the Chinese character font changes greatly in ancient times and nowadays, especially, many of the constituent elements adopted by oracle and seal characters are annihilated in the evolution history and are inconsistent with the modern Chinese characters, the classification mode of the radicals is not adopted,
stroke number segment codes are arranged in sequence according to the stroke number Arabic numerals of the Chinese characters;
divide into one, two, three, up to thirty-three;
and uniformly classifying the Chinese characters with More strokes into More parts of More than one.
3) Referring to fig. 3 and 4, the section encoding process is performed according to the chinese pinyin part, and a unique digitized ID number containing the section encoding of the chinese pinyin part is established. The Chinese Pinyin is sorted according to 26 letters, wherein the Chinese has no initial parts of i, v and u, and the total number of the initial parts is 23.
4) Referring to fig. 5, segment coding processing is performed according to the chinese pinyin and polyphone of the chinese character, and a unique digital ID number including the segment coding of the chinese pinyin and polyphone of the chinese character is determined;
polyphone is a word with two or more pronunciations, different pronunciations have different meanings, different usage and different parts of speech. The pronunciation distinguishes the functions of part of speech and word meaning; according to different use conditions and different pronunciations, the pronunciations have different usage functions,
the Chinese phonetic alphabet and the multi-phonetic character section coding are that the Chinese character is distinguished to have a plurality of phonetic parts, each phonetic part has a plurality of pronunciations, then the section coding is analogized, and the number of the pronunciations is marked as several.
If there are multiple pronunciations in a certain part of a Chinese character, for simplicity, the mode is labeled "number/total number".
Example (c):
xing is an
① go around three people, and have the question of my teacher (statement & description)
Line h ng
② Yuanyang Shiqi-twelve Royal.
5) Referring to fig. 6 and 7, the segment coding process is performed according to the part of speech of the chinese character, and a unique digital ID number including the segment coding of the part of speech of the chinese character is established; according to the classification of parts of speech of modern Chinese, the classification is as follows: 1 noun 2 verb 3 adjective 4 adverb 5 pronoun 6 preposition 7 adverb 8 conjunctions 9 helpwords 10 pronouns 11 sigh words 12 analogies.
6) Referring to fig. 8, the section coding process is performed according to the variant Chinese characters, and a unique digital ID number including the section coding of the variant Chinese characters is determined; variant characters: the Chinese characters are a group of characters with the same pronunciation and meaning but different font style, because the Chinese characters are composed of ideograms, notes and marks, the angle of selection of the ideograms is different from people to people, the notes are different from the letters in the alphabetic writing, therefore, the phenomenon of one character with multiple shapes is all the same in the history of the Chinese characters,
the number of the character is 1, if no variant character exists, the mark is 1;
if there is variant word, arrange in order behind the number 1;
● if a Chinese character has multiple variant characters, it is labeled as "serial/total" mode for simplicity.
7) Referring to fig. 9, the section coding process is performed according to the kanji character, and a unique digital ID number including the section coding of the kanji character is determined;
if there is no kana, labeled 0;
if the number is a logical character, the numbers 1 are arranged in sequence;
if a Chinese character has multiple common false, for simplicity, it is labeled as "sequence/total" mode.
The examples of the pass-false characters arranged according to the sound parts are as follows:
universal kana example table
Figure BDA0002345011900000081
Figure BDA0002345011900000091
Figure BDA0002345011900000101
Figure BDA0002345011900000111
Figure BDA0002345011900000121
Figure BDA0002345011900000131
Figure BDA0002345011900000141
Figure BDA0002345011900000151
Figure BDA0002345011900000161
Figure BDA0002345011900000171
Figure BDA0002345011900000181
Figure BDA0002345011900000191
Figure BDA0002345011900000201
Figure BDA0002345011900000211
Figure BDA0002345011900000221
Figure BDA0002345011900000231
8) Referring to fig. 10, the sector coding process is performed according to the four-corner number of the chinese character, and a unique digital ID number including the sector coding of the four-corner number of the chinese character is determined;
the four-corner number character searching method is a common character searching method as the pinyin character searching method and the radical character searching method, the four-corner number can be used for inputting Chinese characters like strokes, the efficiency is higher than that of strokes, and code fetching is visual and convenient.
In the four-corner number character-looking up method, basic strokes of horizontal stroke, vertical stroke, left falling stroke and right falling stroke are reserved and are called single strokes, and meanwhile, some character-forming units called compound strokes are added, and are combinations of a plurality of basic strokes, such as , ten and the like which are intersected with each other, and the compound strokes are called fork; the 'eight' and 'person' formed by left falling and right falling are called 'eight', and the complex strokes are also common in Chinese characters. Ten numbers are defined in the four corner numbers by a single pen and a complex pen, and are respectively represented by ten numbers from 0 to 9. The four-corner number character-searching method is characterized by that according to the single stroke or complex stroke contained in Chinese character the Chinese character can be numbered, it does not depend on writing stroke order, but adopts the single stroke or complex stroke form of four corners of left upper corner, right upper corner, left lower corner and right lower corner of Chinese character, so that it has four codes, and one Chinese character can be represented by four numerals. The coding method can be directly used for small keyboard input or can be used for large keyboard input. And performing section coding processing according to the four-corner number of the Chinese character, and determining a unique digital ID number containing the section coding of the four-corner number of the Chinese character.
9) Referring to fig. 11 and 12, the section coding process is performed according to the chinese character construction method, and a unique digital ID number including the section coding of the chinese character construction method is established;
the Chinese character forming method includes pictograph, fingering, meeting, pictophonetic, commentary and deception;
building a section code: 01 pictograph-02 means-03 meeting-04 pictophonetic-05 notes-06 pretends.
10) Referring to fig. 13, an artificial intelligence knowledge graph based on the Chinese character word-making method is generated by taking the Chinese character stroke number ID number, the Chinese pinyin part ID number, the Chinese character part of speech ID number, the Chinese character variant character ID number, the Chinese character false-word ID number and the Chinese character four-corner number ID number as basic dimensions and taking the Chinese character word-making method ID number as a core dimension.
11) Referring to fig. 14, 15, 16, 17, 18, 19 and 20, an artificial intelligence knowledge graph based on the Chinese character forming method is generated, a core structure and an overall knowledge framework can be displayed, and a logic network is established.
Those skilled in the art will appreciate that, in addition to implementing the system and its various modules, devices, units provided by the present invention in pure computer readable program code, the system and its various devices provided by the present invention can be implemented with the same functionality in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like by entirely logically programming method steps. Therefore, the system and various devices thereof provided by the present invention can be regarded as a hardware component, and the devices included in the system and various devices thereof for realizing various functions can also be regarded as structures in the hardware component; means for performing the functions may also be regarded as structures within both software modules and hardware components for performing the methods.
What has been described above is merely a specific embodiment of the present invention. It will be apparent to those skilled in the art that various changes and modifications can be made without departing from the inventive concept thereof, and these changes and modifications can be made without departing from the spirit and scope of the invention.

Claims (7)

1. A method for generating artificial intelligence knowledge map based on Chinese character making method is characterized in that: the specific working steps comprise:
1) defining the range and set of Chinese characters;
2) carrying out section coding processing on the stroke number of the Chinese character, and determining the unique digital ID number of the stroke number section coding;
3) carrying out section coding processing on the Chinese pinyin part, and determining a unique digital ID number of the section coding of the Chinese pinyin part;
4) carrying out section coding processing on Chinese pinyin and polyphone of the Chinese characters, and determining the unique digital ID number of the section coding of the Chinese pinyin and the polyphone of the Chinese characters;
5) carrying out section coding processing on part of speech of the Chinese character, and determining a unique digital ID number of the section coding of the part of speech of the Chinese character;
6) carrying out section coding processing on the Chinese character variant characters, and determining the unique digital ID number of the section coding of the Chinese character variant characters;
7) carrying out section coding processing on the Chinese character Tong-Han, and determining the unique digital ID number of the section coding of the Chinese character Tong-Han;
8) carrying out section coding processing on the four-corner number of the Chinese character, and determining the unique digital ID number of the section coding of the four-corner number of the Chinese character;
9) carrying out section coding processing on the Chinese character word-making method, and determining the unique digital ID number of the section coding of the Chinese character word-making method;
10) finally, establishing an artificial intelligence knowledge graph based on the Chinese character word-making method by taking the number ID number of strokes of the Chinese character, the number ID of the phonetic part of the Chinese character, the part-of-speech ID number of the Chinese character, the number ID of variant characters of the Chinese character, the number ID of common-case characters of the Chinese character and the number ID of four corners of the Chinese character as basic dimensions and taking the number ID of the Chinese character word-making method as a core dimension;
11) and establishing the transverse connection of the knowledge graph through a knowledge graph editing tool.
2. The method for generating artificial intelligence knowledge graph based on Chinese character construction method according to claim 1, wherein the Chinese character range and set of step 1) includes all Chinese characters such as oracle, traditional Chinese characters and simplified Chinese characters.
3. The method for generating artificial intelligence knowledge map based on Chinese character making method according to claim 1, wherein the stroke number segment codes of the Chinese characters of step 2) are arranged in sequence according to the Arabic numeral size of the stroke number of the Chinese characters.
4. The method for generating artificial intelligence knowledge map based on Chinese character making method according to claim 1, wherein the part of speech of Chinese characters in step 5) is classified according to the part of speech of modern Chinese characters as follows: nouns, verbs, adjectives, adverbs, pronouns, prepositions, quantifiers, conjunctions, auxiliary words, numerators, sigers, and pseudonyms, and section coding is performed according to the part-of-speech classification of the modern Chinese.
5. The method for generating artificial intelligence knowledge graph based on Chinese character making method according to claim 1, wherein said Chinese characters of step 7) are segmented and coded according to the arrangement of sound parts.
6. The method for generating artificial intelligence knowledge graph based on Chinese character modeling method according to claim 1, wherein the Chinese character modeling method of step 9) performs section coding according to pictographs, fingering, meeting, pictophonetics, commentary, and fiction.
7. The method for generating an artificial intelligence knowledge graph based on the Chinese character forming method according to claim 1, wherein the artificial intelligence knowledge graph based on the Chinese character forming method generated in the step 10) can show a core structure and an overall knowledge framework and establish a logic network.
CN201911391218.3A 2019-12-30 2019-12-30 Method for generating artificial intelligence knowledge map based on Chinese character making method Pending CN111353047A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911391218.3A CN111353047A (en) 2019-12-30 2019-12-30 Method for generating artificial intelligence knowledge map based on Chinese character making method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911391218.3A CN111353047A (en) 2019-12-30 2019-12-30 Method for generating artificial intelligence knowledge map based on Chinese character making method

Publications (1)

Publication Number Publication Date
CN111353047A true CN111353047A (en) 2020-06-30

Family

ID=71195734

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911391218.3A Pending CN111353047A (en) 2019-12-30 2019-12-30 Method for generating artificial intelligence knowledge map based on Chinese character making method

Country Status (1)

Country Link
CN (1) CN111353047A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1286176A (en) * 1999-12-01 2001-03-07 罗宇翔 Method for editing index-type Chinese dictionary and manner for expressing pronunciation, shape and meaning of word
CN1409240A (en) * 2001-09-05 2003-04-09 王金根 New audio-video information system of attributing all things and words
CN101493999A (en) * 2009-02-18 2009-07-29 贾守治 Hanzi direct search tri-code word detection method
CN104462512A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Chinese information search method and device based on knowledge graph
CN109656385A (en) * 2018-12-28 2019-04-19 北京金山安全软件有限公司 Input prediction method and device based on knowledge graph and electronic equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1286176A (en) * 1999-12-01 2001-03-07 罗宇翔 Method for editing index-type Chinese dictionary and manner for expressing pronunciation, shape and meaning of word
CN1409240A (en) * 2001-09-05 2003-04-09 王金根 New audio-video information system of attributing all things and words
CN101493999A (en) * 2009-02-18 2009-07-29 贾守治 Hanzi direct search tri-code word detection method
CN104462512A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Chinese information search method and device based on knowledge graph
CN109656385A (en) * 2018-12-28 2019-04-19 北京金山安全软件有限公司 Input prediction method and device based on knowledge graph and electronic equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈涛: "《康熙字典》的古汉语知识图谱构建方法研究" *

Similar Documents

Publication Publication Date Title
Coulmas Writing systems: An introduction to their linguistic analysis
CN110489760A (en) Based on deep neural network text auto-collation and device
US5119296A (en) Method and apparatus for inputting radical-encoded chinese characters
CN105677822A (en) Enrollment automatic question-answering method and system based on conversation robot
Secord Science in Print: Essays on the History of Science and the Culture of Print
Cavalli-Sforza et al. Arabic readability research: current state and future directions
CN1027558C (en) Five-stroke and two-dimension encoding method and keyboard
CN105045410A (en) Method for correspondingly identifying formalized phonetic alphabets and Chinese characters
Youguang Modernization of the Chinese language
CN115859962B (en) Text readability evaluation method and system
CN111353047A (en) Method for generating artificial intelligence knowledge map based on Chinese character making method
CN103838393A (en) Chinese character structure digital literacy input method
Hsieh Hanzi, Concept and Computation: A preliminary survey of Chinese Characters as a Knowledge Resource in NLP
Gupta et al. Implementation of rule based algorithm for Sandhi-Vicheda of compound hindi words
Vachkova et al. School in Digital Age: How Big Data Help to Transform the Curriculum. Information 2021, 12, 33
CN110502128B (en) Chinese character multi-element input method and system
Arafah et al. Typos’ Effects on Web-Based Programming Code Output: A Computational Linguistics Study
Zhao Chinese character components, radicals and phonetics
Li et al. CLGC: A corpus for Chinese literary grace evaluation
KR20000072146A (en) The high speed interpretative method program of the English sentences with the color pens
He et al. The Establishment of Chinese Character Reference Framework for the English Monophthong
Fakhriniso METHODS OF TEACHING JAPANESE HIEROGLYPHS USING COMPUTER SIMULATION MODELS BASED ON THE TYPE OF READING
Li Constant Evaluation of L2 Students' English Witing Ability.
CN114115886A (en) Method for realizing pure Chinese programming and establishing pure Chinese operating system by using Chinese character positive and negative numerical code method
Lyu et al. Multimodal Discourse Analysis of English Teaching Materials in Colleges and Universities from the Perspective of Ecological Philosophical Perspective

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200630

RJ01 Rejection of invention patent application after publication