CN112699210B - Foreign word management system - Google Patents

Info

Publication number
CN112699210B
CN112699210B CN202010099421.XA CN202010099421A CN112699210B CN 112699210 B CN112699210 B CN 112699210B CN 202010099421 A CN202010099421 A CN 202010099421A CN 112699210 B CN112699210 B CN 112699210B
Authority
CN
China
Prior art keywords
word
external
foreign
information
foreign word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010099421.XA
Other languages
Chinese (zh)
Other versions
CN112699210A (en
Inventor
野岛伸一
关口晋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kazo Publishing Co ltd
Original Assignee
Kazo Publishing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kazo Publishing Co ltd
Publication of CN112699210A
Application granted
Publication of CN112699210B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/335 Filtering based on additional data, e.g. user or group profiles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A foreign word management system that enables foreign word information to be managed in a reasonably uniform or standardized manner even though a plurality of organizations each independently decide to adopt and register new foreign words in their computer systems on an ongoing basis. The foreign word management system includes: an external word management server having an external word main database, a similarity database, a customer information database, an external word main editing processing unit, and an external word information acquisition processing unit; and a plurality of organization computer systems, each having foreign word management software that, in conjunction with the character base management system held by that organization, performs foreign word master database collation, character pattern similarity determination, and new foreign word registration processing.

Description

Foreign word management system
Technical Field
The invention relates to a character base management system that manages, within a computer system, the character information of standard characters (commonly used kanji) and foreign words, that is, external characters (kanji outside common use that the computer cannot output directly and that are represented by substitute codes), appearing in Japanese text. In particular, it relates to a configuration capable of uniformly managing the character information of standard characters and foreign words across the computer systems of a plurality of organizations that use different character systems.
Background
In a computer system, standard character codes for processing characters (kanji, katakana, hiragana, letters, etc.) are defined. In addition to Japanese national standards such as JIS code and Shift JIS code, there are international standards such as EUC and Unicode. Standard character codes are adopted in many computer systems, which makes it easy to exchange character information between different computer systems.
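As a small illustration of the standard character codes mentioned above, the following Python sketch encodes one commonly used kanji under Shift JIS, EUC-JP, and UTF-8 (an encoding of Unicode). The sample character is an assumption chosen for this sketch and does not come from the patent.

```python
# Illustrative only: the sample character is an assumption for this sketch.
ch = "漢"  # U+6F22, a commonly used kanji

# Encode the same character under three standard character codes.
encodings = {name: ch.encode(name) for name in ("shift_jis", "euc_jp", "utf-8")}

for name, raw in encodings.items():
    print(f"{name:10s} {raw.hex()}")

# Decoding with the same standard code recovers the character, which is
# what makes exchange between systems straightforward.
round_trip = {name: raw.decode(name) for name, raw in encodings.items()}
```

Each system that agrees on one of these codes can decode the bytes back to the same character. Characters without such a code have no byte sequence that other systems would recognize, which is the problem the rest of the description addresses.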
Among the Chinese characters used in Japan, there are variant characters that have the same meaning or reading but different character shapes.
Fig. 8 shows examples of such variant characters. As shown, a commonly used Japanese kanji can have a plurality of variant characters (a group of variants sharing the same meaning and reading). Different character codes are assigned to different variant characters in Japanese text, but because the number of variants is enormous, it is difficult to assign standard character codes to all of them. Variant characters that have not been given a standard character code but are actually in use are registered in the various systems as foreign words (external characters that the computer cannot output directly and for which substitute codes are used). In each system, the character code of the commonly used kanji, the standard character codes of its variant characters, and the character codes of its foreign words are associated with one another and registered as the same variant character group.
In addition to the above, characters that have the same meaning and reading but markedly different shapes (for example, the character meaning "wild" and its alternative form), and characters whose shapes are markedly similar but whose meanings differ (for example, the character meaning wood chips or shavings and the character meaning "persimmon"), may also be associated with one another and registered as the same variant character group.
Thus, each organization has its own foreign word group in the computer system. For merging organizations and exchanging information between organizations, a configuration is proposed in which foreign word groups that exist independently of each other are associated with each other.
Patent document 1 discloses a technique for specifying an external word based on a dot pattern of characters recognized by OCR, patent document 2 discloses a technique for specifying an external word based on external word feature information stored in advance, and patent document 3 discloses a technique for assigning a main code to each character having a different font in advance and performing correspondence between different systems.
Japanese patent application laid-open No. 2011-128688 (patent document 1)
Japanese patent application laid-open No. 2006-179026 (patent document 2)
Japanese patent application laid-open No. 2010-009532 (patent document 3)
As described above, various configurations have been proposed for associating independently held external word groups with one another across the computer systems of different organizations. However, external words are, as a rule, newly adopted and registered at the individual discretion of each organization. A method that identifies an external word from its character shape cannot establish a common standard for how much similarity is required, and a method that establishes correspondence from information stored in advance cannot cope with newly created character shapes.
As a result, as shown in fig. 8, in the case where new foreign word groups are continuously registered every day in the computer systems of various organizations, it is impossible to always make the foreign word groups between these different systems correspond to each other or to assign codes having commonality.
Disclosure of Invention
The present invention has been made in view of the above circumstances, and provides an external word management system that enables external word information to be managed in a reasonably uniform or standardized manner even though a plurality of organizations each independently decide to adopt and register new external words in their computer systems on an ongoing basis.
In view of the above-described problems, the inventors of the present application conceived of providing an external word management server that holds, in a unified manner, the external word systems held by the computer systems of the organizations, that collects and accumulates information on external words newly created and adopted/registered in each organization as they arise, and that accepts external word collation requests from each organization, thereby enabling unified management of all the external word systems, and thus completed the present invention.
That is, the present invention provides an external word management system including the computer systems of a plurality of organizations and an external word management server. The external word management server includes: an external word main database that stores, for each external word, its font data, its identification code in the external word management server, and its identification code in the computer system of one or more of the organizations; a customer information database that stores information related to the plurality of organizations; an external word main editing processing unit that edits the registration contents of the external word main database; and an external word information acquisition processing unit that acquires information on an external word newly registered in the computer system of an organization and registers the information in the external word master database. The computer system of each organization includes external word management software that operates in conjunction with the text base management system held by that computer system. The external word management software includes: an external word master database collation unit that collates external words by referring to the external word master database of the external word management server; a similarity determination processing unit that determines the similarity of character patterns; and a new external word registration processing unit that transmits information on an external word newly registered in the character base management system to the external word management server.
In the external word management system according to the present invention, when the first external word having no information is input to the text base management system of the computer system of the organization, the external word master database collation unit refers to the external word master database of the external word management server to acquire information of the corresponding external word, and supplies the information to the text base management system.
Thus, the character base management system of each organization's computer system can register an external word that it does not itself hold as an external word associated with the information already registered in the present system, rather than registering it in isolation.
In the foreign word management system according to the present invention, when information cannot be acquired from a foreign word master database of the foreign word management server for the first foreign word, the similarity determination processing unit refers to the foreign word master database, acquires information of one or more foreign words similar to the character pattern of the first foreign word, and supplies the acquired information to the character base management system.
Thus, an external word whose character pattern differs from, but is reasonably similar to, that of an external word already registered in the system can be identified and treated as the same as the registered external word, which prevents external words with only slightly different character patterns from being newly registered in a disorderly manner.
In the foreign word management system according to the present invention, the similarity determination processing unit may be configured to cause a user to select which foreign word to use when information of two or more foreign words similar to the first foreign word is acquired with reference to the foreign word master database.
Thus, semi-automatic operation including human-based judgment in the similarity judgment of the foreign word can be realized.
In the external word management system according to the present invention, the external word master database of the external word management server stores grouping information of external words and information of recommendation degrees of the external words belonging to the corresponding groups, and when information cannot be acquired from the external word master database of the external word management server for the first external word, the similarity determination processing unit acquires information of one or more external words similar to the character pattern of the first external word, information of external words belonging to the same group as the one or more external words, and information of recommendation degrees thereof, with reference to the external word master database, and presents the acquired external words to a user according to a sequence of recommendation degrees, thereby allowing the user to select which external word to use.
This can promote the use of foreign words with high recommendation in the computer systems of the organizations.
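The presentation of candidates in order of recommendation degree can be sketched as follows. This is a minimal illustration under stated assumptions: the `GroupedWord` type, the group ids, the integer recommendation scale (larger means more recommended), and the sample codes are all hypothetical and are not specified in the patent.

```python
from typing import Dict, Iterable, List, NamedTuple


class GroupedWord(NamedTuple):
    code: str            # identification code in the master database
    group: str           # grouping information (hypothetical group id)
    recommendation: int  # assumed scale: larger means more recommended


def candidates_by_recommendation(
    similar_codes: Iterable[str], master: Dict[str, GroupedWord]
) -> List[GroupedWord]:
    """Expand similar hits to their whole groups and sort by recommendation."""
    groups = {master[c].group for c in similar_codes if c in master}
    members = [w for w in master.values() if w.group in groups]
    # Highest recommendation first; ties broken by code for a stable order.
    return sorted(members, key=lambda w: (-w.recommendation, w.code))


# Hypothetical sample data: two words in group g1, one in group g2.
master = {
    "S0001": GroupedWord("S0001", "g1", 1),
    "S0002": GroupedWord("S0002", "g1", 3),
    "S0003": GroupedWord("S0003", "g2", 5),
}
ranked = candidates_by_recommendation(["S0001"], master)
print([w.code for w in ranked])
```

The list would be shown to the user for selection. Its first element is the candidate with the highest recommendation degree, which is the one a fully automatic variant would adopt without asking.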
In the external word management system according to the present invention, the external word master database of the external word management server stores grouping information of external words and information of recommendation degrees of the external words belonging to the corresponding groups, and the similarity determination processing unit refers to the external word master database to acquire information of one or more external words similar to the first external word in a case where information cannot be acquired from the external word master database of the external word management server for the first external word, and information of recommendation degrees of the external words belonging to the same group as the one or more external words, and uses an external word having the highest recommendation degree among the acquired external words.
Thus, foreign words that conform to the policy of the system administrator can be put into use in the computer system of each organization.
In the foreign word management system according to the present invention, the foreign word management server includes a similarity database that holds information on the similarity of the fonts, and the similarity determination processing unit determines the similarity of the fonts by referring to the similarity database of the foreign word management server when acquiring information of one or more foreign words similar to the fonts of the first foreign word from the foreign word master database.
The similarity database is preferably constructed on the basis of known similar-character recognition techniques and updated as those techniques advance.
In the external word management system according to the present invention, when information cannot be acquired from the external word master database of the external word management server and information of external words having similar fonts cannot be acquired, the new external word registration processing unit acquires an identification code of the first external word given when the first external word is newly registered as an external word in the corresponding character base management system, and transmits the information of the newly registered external word to the external word management server.
Thus, the present system can collectively manage information on foreign words that are newly registered at any time in the computer systems of the organizations.
In the external word management system according to the present invention, the external word master database of the external word management server stores at least one of the corresponding positive word information, the radical information, and the meaning information of the external word.
In the foreign word management system according to the present invention, the foreign word management server includes a software configuration processing unit that configures and updates the foreign word management software for the computer systems of the organizations.
In the foreign word management system of the present invention, the foreign word management server has: an inter-system external word correspondence data generation processing unit that generates inter-system external word correspondence data for establishing correspondence between external word information held between computer systems of two or more different organizations, the inter-system external word correspondence data including at least a correspondence table of identification codes of external words in the computer systems between the two or more organizations.
Thus, even when an external word is newly registered in the computer system of each organization at any time, the information of the external word between the organizations can be associated with each other.
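The inter-system correspondence data described above can be sketched as a join over the identification codes that the master database keeps for each organization. This is only an illustration: the function name, the dictionary layout, the organization ids, and the sample codes are assumptions, not the patent's data format.

```python
from typing import Dict, List, Tuple

# Assumed layout: server code -> {organization id -> code in that organization}
MasterCodes = Dict[str, Dict[str, str]]


def build_correspondence_table(
    master: MasterCodes, org_a: str, org_b: str
) -> List[Tuple[str, str, str]]:
    """Return rows (server code, code in org_a, code in org_b).

    Only external words registered in both organizations appear in the table.
    """
    rows = []
    for server_code, customer_codes in sorted(master.items()):
        if org_a in customer_codes and org_b in customer_codes:
            rows.append((server_code, customer_codes[org_a], customer_codes[org_b]))
    return rows


# Hypothetical sample data.
master_codes = {
    "S0001": {"orgA": "F040", "orgB": "E812"},
    "S0002": {"orgA": "F041"},
    "S0003": {"orgB": "E813", "orgA": "F042"},
}
table = build_correspondence_table(master_codes, "orgA", "orgB")
for row in table:
    print(row)
```

Because the table is derived from the master database each time, an external word registered later by either organization appears in the next generated table without any manual mapping.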
The present invention also provides an external word management system including an external word management server for performing external word management in conjunction with a character base management system held by computer systems of a plurality of organizations, the external word management system including: an external word main database for storing the font data of the external word, the identification code in the external word management server, and the identification code in the computer system of more than one organization; a customer information database storing information related to a plurality of the organizations; an external word main editing processing unit for editing the registration contents of the external word main database; an external word master database collation unit for collating external words with reference to the external word master database by the computer system of the organization; a similarity determination processing unit for determining the similarity of characters; and an external word information acquisition processing unit that acquires information of an external word newly registered in a character base management system of the computer system of the organization and registers the information in the external word master database.
In the external word management system according to the present invention, when a first external word having no information is input to a character base management system of a computer system of the organization, the external word management server which has received the information of the first external word from the computer system acquires the information of the corresponding external word by referring to an external word master database of the external word management server through the external word master database collation unit, and supplies the acquired information to the computer system.
Thus, the character base management system of each organization's computer system can register an external word that it does not itself hold as an external word associated with the information already registered in the present system, rather than registering it in isolation.
In the foreign word management system according to the present invention, when information cannot be acquired from the foreign word master database for the first foreign word, the foreign word management server refers to the foreign word master database to acquire information of one or more foreign words similar to the first foreign word in the form of the first foreign word, and supplies the acquired information to the corresponding computer system.
Thus, an external word whose character pattern differs from, but is reasonably similar to, that of an external word already registered in the system can be identified and treated as the same as the registered external word, which prevents external words with only slightly different character patterns from being newly registered in a disorderly manner.
In the foreign word management system according to the present invention, the similarity determination processing unit may be configured to present information of two or more foreign words similar to the font of the first foreign word to the corresponding computer system so as to select which foreign word to use when the information of the two or more foreign words is acquired with reference to the foreign word master database.
Thus, semi-automatic operation including human-based judgment in the similarity judgment of the foreign word can be realized.
In the external word management system according to the present invention, the external word master database stores grouping information of external words and information of recommendation degrees of the external words belonging to the corresponding groups, and the similarity determination processing unit refers to the external word master database to acquire information of one or more external words similar to the first external word and information of recommendation degrees of the external words belonging to the same group as the one or more external words when information cannot be acquired from the external word master database for the first external word, and supplies the acquired information to the corresponding computer system.
This can promote the use of foreign words with high recommendation in the computer systems of the organizations.
In the external word management system according to the present invention, the external word master database stores grouping information of external words and information of recommendation degrees of the external words belonging to the corresponding groups, and when information cannot be acquired from the external word master database for the first external word, the similarity determination processing unit refers to the external word master database, acquires information of one or more external words similar to the first external word in the font type, and information of an external word belonging to the same group as the one or more external words and information of recommendation degrees thereof, and supplies an external word having the highest recommendation degree to the corresponding computer system from among the acquired external words.
Thus, foreign words that conform to the policy of the system administrator can be put into use in the computer system of each organization.
In the foreign word management system according to the present invention, the foreign word management server includes a similarity database that holds information on the similarity of the fonts, and the similarity determination processing unit determines the similarity of the fonts by referring to the similarity database of the foreign word management server when acquiring information of one or more foreign words similar to the fonts of the first foreign word from the foreign word master database.
The similarity database is preferably constructed on the basis of known similar-character recognition techniques and updated as those techniques advance.
In the external word management system according to the present invention, the external word information acquisition processing unit acquires the identification code of the first external word given when the external word is newly registered in the corresponding character base management system, and registers the information of the newly registered external word in the external word management server.
Thus, the present system can collectively manage information on foreign words that are newly registered at any time in the computer systems of the organizations.
In the external word management system according to the present invention, the external word master database of the external word management server stores at least one of the corresponding positive word information, the radical information, and the meaning information of the external word.
In the foreign word management system according to the present invention, the foreign word management server includes: an inter-system external word correspondence data generation processing unit that generates inter-system external word correspondence data for establishing correspondence between external word information held between computer systems of two or more different organizations, the inter-system external word correspondence data including at least a correspondence table of identification codes of external words in the computer systems between the two or more organizations.
Thus, even when an external word is newly registered in the computer system of each organization at any time, the information of the external word between the organizations can be associated with each other.
(Effects of the invention)
As described above, according to the foreign word management system of the present invention, information on foreign words newly created and adopted/registered in the various organizations can be collected and accumulated as they arise, and collation requests for foreign words can be accepted from each organization, so that all the foreign word systems can be managed in a unified manner.
Even when external words are newly registered at any time in the computer system of each organization, correspondence can be established for them centrally, including for external words newly created and adopted/registered in each organization.
Drawings
Fig. 1 is a diagram schematically showing the overall configuration of the foreign word management system of the present invention.
Fig. 2 is a diagram schematically showing an internal (system) configuration of the foreign word management server shown in fig. 1.
Fig. 3 is a diagram schematically showing an example of the data table configuration of the external word master database shown in fig. 2.
Fig. 4 is a diagram schematically showing an internal (system) configuration of the computer system of one of the organizations shown in fig. 1.
Fig. 5 is a flowchart showing a flow of foreign word collation/registration processing in the foreign word management system of the present invention.
Fig. 6 is a diagram showing an example of a data table configuration of inter-system external word correspondence data generated in the external word management system according to the present invention.
Fig. 7 is an example of a screen display when a user selects a candidate for registering an external word in the external word management system of the present invention.
Fig. 8 is a diagram for explaining variant characters of chinese characters used in japan.
Fig. 9 is a diagram for explaining the condition of a standard character group and an external character group registered in a computer system of a different organization.
Detailed Description
Hereinafter, preferred embodiments of the foreign word management system for implementing the present invention will be described in detail with reference to the accompanying drawings. Fig. 1 to 7 are diagrams illustrating embodiments of the present invention, and in these diagrams, the portions denoted by the same reference numerals represent the same objects, and the basic configuration and operation are the same.
System constitution
Fig. 1 is a diagram schematically showing the overall configuration of the foreign word management system of the present invention.
As shown in fig. 1, the foreign word management system of the present invention is configured such that a foreign word management server on the server side and the computer systems of a plurality of organizations on the client side are connected via a communication network.
Fig. 2 is a diagram schematically showing an internal (system) configuration of the foreign word management server shown in fig. 1.
In fig. 2, the foreign word management server has a database group including: an external word main database which stores the character form, identification code, corresponding positive word information, radical information, meaning information and the like of each external word; a similarity database which holds information related to the similarity of external word character patterns; and a customer information database which stores information on the customers who are users of the foreign word management system. Its software processing units include: an external word main editing processing unit that edits the registration contents of the external word main database; a software configuration processing unit that configures and updates the software used in the external word management system for each organization that is a user; an external word information acquisition processing unit that acquires information on newly registered external words from each organization that is a user; and an inter-system foreign word correspondence data generation processing unit that generates data for establishing correspondence between the foreign word information held by the systems of different organizations. The server is also provided with the means necessary for general processing, such as input/output means, communication means, and temporary storage means.
Fig. 3 is a diagram schematically showing an example of the data table configuration of the external word master database shown in fig. 2.
In fig. 3, the foreign word master database holds, for each foreign word, a foreign word code for recognition, font data, information of a corresponding positive word, information of radicals as constituent elements of the font, meaning information, and identification codes (customer unique codes 1,2, …) in a customer system using the foreign word.
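One record of the data table described above could be modeled as follows. This is a minimal sketch: the class name, field names, types, and sample values are assumptions made for illustration, since the patent lists the stored items but not their representation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ExternalWordRecord:
    """One row of the master database (all field names are hypothetical)."""

    external_word_code: str                    # identification code in the server
    font_data: bytes                           # font data of the external word
    positive_word: Optional[str] = None        # corresponding positive (standard) word
    radicals: List[str] = field(default_factory=list)  # constituent radicals
    meaning: Optional[str] = None              # meaning information
    # customer id -> identification code in that customer's system
    customer_codes: Dict[str, str] = field(default_factory=dict)

    def code_for(self, customer_id: str) -> Optional[str]:
        """Return the customer-specific code, or None if the customer has none."""
        return self.customer_codes.get(customer_id)


# Hypothetical sample values.
record = ExternalWordRecord(
    external_word_code="S0001",
    font_data=b"\x00\x01",
    meaning="edge",
    customer_codes={"customer1": "F040"},
)
print(record.code_for("customer1"), record.code_for("customer2"))
```

Keeping the customer-specific codes in a mapping rather than in fixed columns lets the record grow as more customer systems register the same external word.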
Fig. 4 is a diagram schematically showing an internal (system) configuration of the computer system of one of the organizations shown in fig. 1.
In fig. 4, the computer system of the organization has, as a conventional text base management system, a positive word database, an external word database, and an external word registration processing unit (only the components related to the present invention are shown here). The foreign word management software according to the present invention includes a foreign word master database collation unit, a similarity determination processing unit, and a new foreign word registration processing unit. The computer system is also provided with the means necessary for general processing, such as input/output means, communication means, and temporary storage means.
Details of foreign word management processing
The details of the foreign word management processing in the foreign word management system of the present invention configured as shown in fig. 1 to 4 will be described in detail below.
(1) Foreign word collation/registration process
When the computer system of an organization processes a new foreign word (one not held by its text base management system), the foreign word management software incorporated in the system performs collation/registration processing for the new foreign word.
Fig. 5 is a flowchart showing the flow of the foreign word collation/registration process.
In fig. 5, when font data of an external word (handwritten character data or the like) is input, the external word database of the organization's own system is first referred to and the input is collated against the registered external words. The collation compares the character patterns and determines, on the basis of a predetermined criterion, whether they match. Here, it is assumed that the input is not registered in the external word database of that system.
Therefore, the foreign word management software refers to the foreign word master database of the foreign word management server by the foreign word master database collation unit, and collates the input foreign word with the registered foreign word.
If a matching registration exists, the record of that foreign word is acquired. The foreign word registration processing unit of the text base management system receives the record and registers the font data of the input foreign word and the information of the foreign word in the foreign word database. At this time, the character code given to the input foreign word is notified to the foreign word management server and registered in the foreign word master database as the customer-specific code of that foreign word in the system.
If no matching registration exists, the foreign word master database is searched for registered foreign words whose glyphs are similar to the glyph of the input foreign word to at least a certain degree. The similarity determination processing unit of the foreign word management software uses the similarity database of the foreign word management server to determine, from among the foreign words obtained, the one most similar to the input glyph, and the font data of the input foreign word and the information of that foreign word are registered in the foreign word database. The similarity determination may be performed automatically by software, using the information accumulated in the similarity database and known similar-character recognition techniques. Alternatively, it may be performed semi-automatically, with the user selecting from several candidates. At this time, the character code given to the input foreign word is notified to the foreign word management server and registered in the foreign word master database as the customer-specific code of that foreign word in the system.
If no foreign word whose glyph is similar to that of the input foreign word to at least a certain degree is found in the foreign word master database, the input foreign word is registered as a new foreign word.
The foreign word registration processing unit of the text base management system receives input of information corresponding to the input foreign word, such as the positive word, the reading, and the meaning (preferably the same information items as the registration information of the foreign word master database shown in fig. 3), and registers the font data of the input foreign word and this information in the foreign word database.
The foreign word management software obtains information of a foreign word newly registered in the foreign word database (including a character code given to the foreign word) by the new foreign word registration processing unit, and transmits the information to the foreign word management server.
The foreign word information acquisition processing unit of the foreign word management server that has received these pieces of information performs registration processing as a new foreign word in the foreign word master database.
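The branches of the collation/registration flow described for fig. 5 can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: glyphs are modelled as plain strings, `difflib.SequenceMatcher` stands in for real glyph comparison and the similarity database, and the function names, the threshold value, and the `M0001`-style master codes are all hypothetical. The semi-automatic variant, in which the user picks from several candidates, is not modelled.

```python
from difflib import SequenceMatcher

# Assumed threshold; the text only says "similar to at least a certain degree".
SIMILARITY_THRESHOLD = 0.8


def glyph_similarity(a: str, b: str) -> float:
    """Stand-in for real glyph comparison: glyphs are modelled as plain strings."""
    return SequenceMatcher(None, a, b).ratio()


def collate_and_register(glyph, local_code, org, local_db, master_db):
    """Walk the branches described for fig. 5 and return the branch taken.

    local_db  maps local character code -> glyph (the organization's database).
    master_db maps master code -> {"glyph": str, "customer_codes": {org: code}}.
    """
    if glyph in local_db.values():
        return "already_local"

    local_db[local_code] = glyph  # every remaining branch registers locally

    # Branch 1: an identical glyph is already registered in the master database.
    for record in master_db.values():
        if record["glyph"] == glyph:
            record["customer_codes"][org] = local_code
            return "matched_master"

    # Branch 2: pick the most similar registered glyph above the threshold.
    best_code, best_score = None, 0.0
    for code, record in master_db.items():
        score = glyph_similarity(glyph, record["glyph"])
        if score > best_score:
            best_code, best_score = code, score
    if best_code is not None and best_score >= SIMILARITY_THRESHOLD:
        master_db[best_code]["customer_codes"][org] = local_code
        return "matched_similar"

    # Branch 3: nothing similar enough, so register a new master record.
    new_code = f"M{len(master_db) + 1:04d}"
    master_db[new_code] = {"glyph": glyph, "customer_codes": {org: local_code}}
    return "registered_new"


master = {"M0001": {"glyph": "abcdefgh", "customer_codes": {}}}
local_a = {}
print(collate_and_register("abcdefgh", "A-01", "company_a", local_a, master))
```

In every branch that registers something, the organization's own character code is written back to the master record, which is what later makes the inter-system correspondence data possible.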
With the collation and registration function described above, when a new foreign word is input in any of the organizations that use the foreign word management system of the present invention, a foreign word that is not held by that organization's text base management system but is held in the foreign word master database of the foreign word management server can be registered as a new foreign word in the text base management system, in association with its registration information. Further, a foreign word that is held neither in the organization's text base management system nor in the foreign word master database of the foreign word management server can be registered as a new foreign word in both.
That is, the foreign word management server can store information on all foreign words used across the users of the system, and the text base management system of each organization can always store foreign word information that is associated with the foreign word information stored in the foreign word management server.
When an organization starts using the system, it is preferable to perform the foreign word collation/registration process for all the foreign word information held by the text base management system of that organization. In this way, information on foreign words held by the text base management system but not by the foreign word management server can be absorbed, and the foreign word information held by the text base management system can be associated with the foreign word information held by the foreign word management server.
(2) Inter-system foreign word correspondence data generation processing
As described above, the text base management system of each organization always holds foreign word information that is associated with the foreign word information held by the foreign word management server. However, the foreign word information held may not be fully consistent among the text base management systems of different organizations. Therefore, unrecognized foreign words may occur when information is exchanged between the computer systems of these organizations.
To solve this problem, the foreign word management system of the present invention generates inter-system foreign word correspondence data by means of the inter-system foreign word correspondence data generation processing unit of the foreign word management server. The generation process is performed in response to a request from the foreign word management software of any organization, or in response to an instruction or operation by an administrator of the foreign word management server. It may also be performed when predetermined conditions set in advance are satisfied.
The predetermined condition is, for example, generation (update) at regular intervals, generation (update) when a new foreign word registration exists in the relevant organization system, or the like.
Fig. 6 is a diagram showing an example of the data table configuration of the inter-system foreign word correspondence data generated in the foreign word management system according to the present invention.
The inter-system foreign word correspondence data between company A and company B illustrated in fig. 6 includes, for each foreign word record, the foreign word code in the foreign word master database of the foreign word management server, the unique code in the company A system, the unique code in the company B system, and the like. Here, the foreign word code and/or the font data of the foreign word master database of the foreign word management server are not essential constituent elements.
The company A system and the company B system that receive the inter-system foreign word correspondence data generated in this way can associate the foreign words held by their own system one-to-one with the foreign words held by the counterpart system. Therefore, even if the two systems hold foreign words under different code schemes, data exchange and the like between them can be performed without any problem.
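The correspondence data of fig. 6 can be pictured as a join over the customer codes stored in the master database. The sketch below assumes a dictionary-based master database; the function names, organization identifiers, and code values are hypothetical and chosen only for illustration.

```python
def build_correspondence(master_db, org_a, org_b):
    """Return one row per master record that both organizations have registered."""
    rows = []
    for master_code in sorted(master_db):
        codes = master_db[master_code]["customer_codes"]
        if org_a in codes and org_b in codes:
            rows.append({"master": master_code, org_a: codes[org_a], org_b: codes[org_b]})
    return rows


def translate(rows, source_org, target_org, code):
    """Map a code from one organization's system to the counterpart's code."""
    lookup = {row[source_org]: row[target_org] for row in rows}
    return lookup.get(code)  # None when the counterpart does not hold the word


# Invented sample data: M0002 is registered only by company_a.
master = {
    "M0001": {"customer_codes": {"company_a": "F040", "company_b": "E7A1"}},
    "M0002": {"customer_codes": {"company_a": "F041"}},
    "M0003": {"customer_codes": {"company_a": "F042", "company_b": "E7B3"}},
}
table = build_correspondence(master, "company_a", "company_b")
print(translate(table, "company_a", "company_b", "F042"))
```

The master code column is carried along here for traceability, although the text notes that it is not an essential element of the correspondence data.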
Other system configuration examples
The foreign word management system of the present invention may be configured as a system other than the one described above.
For example, the functions of the foreign word management software in the computer system of each organization shown in fig. 4 may be provided on the foreign word management server side. In this case, the foreign word management server further includes the foreign word master database collation unit, the similarity determination processing unit, and the new foreign word registration processing unit shown in fig. 4. The computer systems of the organizations, on the other hand, only need the function of accessing the foreign word management server and exchanging the necessary information (a so-called ASP-type or cloud-type system configuration). In this case, the new foreign word registration processing unit of the foreign word management server needs some mechanism for promptly detecting that a new foreign word has been registered in the computer system of an organization and for acquiring its information. In addition, the software configuration processing unit is not required in the foreign word management server.
The foreign word management system according to the present invention is configured to be able to execute the foreign word management processing described above.
Yet another system configuration example
The similarity determination processing unit in the foreign word management system according to the present invention may also be configured differently from the configuration described above.
In this example, the foreign word master database of the foreign word management server shown in fig. 2 stores foreign words in groups. Foreign words can be grouped in various ways, as shown below, and one or more of these methods may be adopted.
(1) Groups corresponding to the same positive word
For example, "zhai", " ", "zipan" (japanese kanji in all above), …, etc. corresponding to the positive word " " are grouped.
(2) Group having different meanings but high similarity of fonts
For example, "kaki" (meaning of Japanese pronunciation "kake" in Japanese) and "kaki" (meaning of Japanese pronunciation "kaki") are grouped together.
(3) Groups of glyphs having low similarity but identical meaning or identical word composition
For example, "wild" and " " (japanese pronunciation is "periphery") are grouped.
The foreign word master database also stores a predetermined degree of recommendation for each foreign word included in the above-described foreign word group. The recommendation degree can be arbitrarily set by the system manager, and for example, the level of IPA or JIS can be used.
In the computer system of the organization shown in fig. 4, the foreign word management software uses the foreign word master database collation unit to refer to the foreign word master database of the foreign word management server and collates the input foreign word with the registered foreign words. When no registered foreign word with a matching glyph exists, the similarity determination processing unit refers to the foreign word master database as described above and acquires corresponding candidates from the registered foreign words, using glyph similarity as the clue.
In this case, the similarity determination processing unit may acquire candidates for the foreign word group. For example, when the input foreign word is similar to "zhai", all the registered foreign words belonging to the group corresponding to the positive word " " are acquired as candidates. At this time, information on the degree of recommendation of each of the registered foreign words in the group is also acquired together.
The similarity determination processing unit, having acquired a plurality of registered foreign word candidates corresponding to the input foreign word from the foreign word master database of the foreign word management server, presents the candidates to the user and prompts the user to select one. In this case, the candidates can be displayed in order of recommendation degree, which guides the user toward selecting a registered foreign word with a high recommendation degree. Fig. 7 shows an example of such a screen display. Alternatively, only candidates whose recommendation degree is equal to or higher than a predetermined value may be displayed, in order of recommendation degree. Alternatively, the candidate with the highest recommendation degree may be adopted automatically, without relying on user selection.
With this configuration, the computer system of each organization can promote the adoption of foreign words with a high recommendation degree. That is, each user organization can use the foreign word management system in line with policies such as the consolidation of text base information and international standardization, without incurring particular cost or labor.
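The three presentation options described here (ranked list, ranked list with a minimum recommendation degree, automatic adoption of the top candidate) can be sketched as below. The record layout, the group labels, the numeric recommendation scale, and the function name are assumptions made for illustration; the source leaves the recommendation scale to the system manager.

```python
def rank_candidates(similar_codes, records, min_recommendation=None):
    """Expand similar words to their whole groups and order by recommendation.

    records maps master code -> {"group": str, "recommendation": int}.
    """
    groups = {records[code]["group"] for code in similar_codes}
    candidates = [code for code, rec in records.items() if rec["group"] in groups]
    if min_recommendation is not None:
        candidates = [
            code for code in candidates
            if records[code]["recommendation"] >= min_recommendation
        ]
    # Highest recommendation first; ties broken by code for a stable display order.
    return sorted(candidates, key=lambda code: (-records[code]["recommendation"], code))


# Invented sample data: three words share group g1, one belongs to g2.
records = {
    "M0001": {"group": "g1", "recommendation": 2},
    "M0002": {"group": "g1", "recommendation": 5},
    "M0003": {"group": "g2", "recommendation": 4},
    "M0004": {"group": "g1", "recommendation": 3},
}
ranked = rank_candidates(["M0001"], records)          # list shown to the user
filtered = rank_candidates(["M0001"], records, 3)     # only well-recommended ones
auto_choice = ranked[0]                               # automatic adoption variant
```

Note that the input only resembled `M0001`, yet the whole group is offered, so a better-recommended member of the same group can be chosen instead.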
In the above, the foreign word management system of the present invention has been described in the specific embodiment using japanese kanji as an example, but the present invention is not limited thereto. Those skilled in the art should be able to apply various changes and modifications to the constitution and functions of the foreign word management server, the computer system of each organization, and the like in the above-described embodiments without departing from the spirit of the present invention.
[ INDUSTRIAL APPLICABILITY ]
As shown in fig. 1 to 7, the foreign word management system of the present invention can be realized by an OS, application programs, databases, a network system, and the like built on the hardware resources of a computer, including a CPU, memory, auxiliary storage device, display, and input device. Because information processing such as the management of foreign word registration information is concretely realized using these hardware resources, the foreign word management system constitutes a technical idea utilizing natural laws and can be used in the software industry.

Claims (21)

1. An external word management system comprising a plurality of computer systems of organizations and an external word management server, the external word management system characterized in that,
The foreign word management server has:
An external word main database for storing the font data of the external word, the identification code in the external word management server, and the identification code in the computer system of more than one organization;
a customer information database storing information related to a plurality of the organizations;
An external word main editing processing unit for editing the registration contents of the external word main database; and
An external word information acquisition processing unit that acquires information of an external word newly registered in the computer system of the organization and registers the information in the external word master database,
The computer system of the organization has external word management software which works in linkage with a text base management system held by the computer system and comprises: an external word master database collation unit for collating external words by referring to an external word master database of the external word management server; a similarity determination processing unit for determining the similarity of characters; and a new foreign word registration processing unit that transmits information of a foreign word newly registered in the character base management system to the foreign word management server.
2. The foreign word management system of claim 1,
When a first foreign word for which the text base management system of the organization's computer system holds no information is entered,
The foreign word master database collation unit refers to the foreign word master database of the foreign word management server, acquires the corresponding foreign word information, and provides the information to the text base management system.
3. The foreign word management system of claim 2,
In the case that the information cannot be obtained from the foreign word master database of the foreign word management server for the first foreign word,
The similarity determination processing unit refers to the foreign word master database, acquires information of one or more foreign words whose glyphs are similar to that of the first foreign word, and supplies the acquired information to the text base management system.
4. The foreign word management system of claim 3,
The similarity determination processing unit causes a user to select which external word to use when information of two or more external words similar to the first external word in the font style is acquired with reference to the external word master database.
5. The foreign word management system of claim 2,
The foreign word master database of the foreign word management server stores grouping information of the foreign words and information of recommendation degree of each foreign word belonging to the corresponding group,
For the first foreign word, in the case that information cannot be obtained from a foreign word master database of the foreign word management server,
The similarity determination processing unit refers to the external word master database, acquires information of one or more external words similar to the first external word, and information of external words belonging to the same group as the one or more external words and their recommendation degrees, presents the acquired external words to the user according to the sequence of recommendation degrees, and makes the user select which external word to use.
6. The foreign word management system of claim 2,
The foreign word master database of the foreign word management server stores grouping information of the foreign words and information of recommendation degree of each foreign word belonging to the corresponding group,
For the first foreign word, in the case that information cannot be obtained from a foreign word master database of the foreign word management server,
The similarity determination processing unit refers to the external word master database, acquires information of one or more external words similar to the first external word, and information of external words belonging to the same group as the one or more external words and their recommendation degrees, and uses an external word having the highest recommendation degree among the acquired external words.
7. The foreign word management system of any one of claims 3 to 6,
The foreign word management server has a similarity database holding information related to the similarity of glyphs,
The similarity determination processing unit refers to a similarity database of the external word management server to determine the similarity of the character pattern when acquiring information of one or more external words similar to the character pattern of the first external word from the external word master database.
8. The foreign word management system of any one of claims 2 to 6,
In the case where information cannot be obtained from the foreign word master database of the foreign word management server and information of a foreign word having a similar font cannot be obtained for the first foreign word,
The new foreign word registration processing unit acquires the identification code of the first foreign word assigned when the new foreign word is registered in the corresponding text base management system, and transmits the information of the newly registered foreign word to the foreign word management server.
9. The foreign word management system of any one of claims 1 to 6,
The foreign word main database of the foreign word management server stores at least one of corresponding positive word information, radical information and meaning information of the foreign word.
10. The foreign word management system of any one of claims 1 to 6,
The foreign word management server has a software configuration processing unit that configures and updates the foreign word management software for the computer systems of the organizations.
11. The foreign word management system of any one of claims 1 to 6,
The foreign word management server has: an inter-system external word correspondence data generation processing unit that generates inter-system external word correspondence data for establishing correspondence between external word information held between computer systems of two or more different organizations, the inter-system external word correspondence data including at least a correspondence table of identification codes of external words in the computer systems between the two or more organizations.
12. An external word management system including an external word management server for performing external word management in conjunction with a text base management system held by computer systems of a plurality of organizations, the external word management system characterized in that,
The foreign word management server has:
An external word main database for storing the font data of the external word, the identification code in the external word management server, and the identification code in the computer system of more than one organization;
a customer information database storing information related to a plurality of the organizations;
an external word main editing processing unit for editing the registration contents of the external word main database;
an external word master database collation unit for collating external words with reference to the external word master database by the computer system of the organization;
A similarity determination processing unit for determining the similarity of characters; and
And an external word information acquisition processing unit that acquires information of an external word newly registered in a character base management system of the computer system of the organization and registers the information in the external word master database.
13. The foreign word management system of claim 12,
When a first foreign word for which the text base management system of the organization's computer system holds no information is entered,
The foreign word management server, which has received the information of the first foreign word from the computer system, obtains the information of the corresponding foreign word by referring to the foreign word master database of the foreign word management server through the foreign word master database collation unit, and supplies the information to the computer system.
14. The foreign word management system of claim 13,
For the first foreign word, in the case that information cannot be obtained from the foreign word master database,
The foreign word management server obtains, by the similarity determination processing unit referring to the foreign word master database, information of one or more foreign words whose glyphs are similar to that of the first foreign word, and supplies the obtained information to the corresponding computer system.
15. The foreign word management system of claim 14,
The similarity determination processing unit presents information of two or more external words similar to the first external word to a corresponding computer system so as to select which external word to use when the information of the two or more external words is acquired with reference to the external word master database.
16. The foreign word management system of claim 13,
The foreign word master database stores grouping information of foreign words and information of recommendation degrees of the respective foreign words belonging to the corresponding groups,
For the first foreign word, in the case that information cannot be obtained from the foreign word master database,
The similarity determination processing unit refers to the external word master database, acquires information of one or more external words similar to the first external word, and information of an external word belonging to the same group as the one or more external words and recommendation degree thereof, and supplies the acquired information to the corresponding computer system.
17. The foreign word management system of claim 13,
The foreign word master database stores grouping information of foreign words and information of recommendation degrees of the respective foreign words belonging to the corresponding groups,
For the first foreign word, in the case that information cannot be obtained from the foreign word master database,
The similarity determination processing unit refers to the external word master database, acquires information of one or more external words similar to the first external word and information of external words belonging to the same group as the one or more external words and recommendation degrees thereof, and supplies an external word having the highest recommendation degree from among the acquired external words to the corresponding computer system.
18. The foreign word management system of any one of claims 14 to 17,
The foreign word management server has a similarity database holding information related to the similarity of glyphs,
The similarity determination processing unit refers to a similarity database of the external word management server to determine the similarity of the character pattern when acquiring information of one or more external words similar to the character pattern of the first external word from the external word master database.
19. The foreign word management system of any one of claims 13 to 17,
The foreign word information acquisition processing unit acquires the identification code of the first foreign word, which is given when the foreign word is newly registered in the corresponding text base management system, and registers the information of the newly registered foreign word, including that identification code, in the foreign word management server.
20. The foreign word management system of any one of claims 12 to 17,
The foreign word main database of the foreign word management server stores at least one of corresponding positive word information, radical information and meaning information of the foreign word.
21. The foreign word management system of any one of claims 12 to 17,
The foreign word management server has: an inter-system external word correspondence data generation processing unit that generates inter-system external word correspondence data for establishing correspondence between external word information held between computer systems of two or more different organizations, the inter-system external word correspondence data including at least a correspondence table of identification codes of external words in the computer systems between the two or more organizations.
CN202010099421.XA 2019-10-23 2020-02-18 Foreign word management system Active CN112699210B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2019-192783 2019-10-23
JP2019192783A JP6713657B1 (en) 2019-10-23 2019-10-23 Gaiji management system

Publications (2)

Publication Number Publication Date
CN112699210A CN112699210A (en) 2021-04-23
CN112699210B true CN112699210B (en) 2024-07-05

Family

ID=71103986

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010099421.XA Active CN112699210B (en) 2019-10-23 2020-02-18 Foreign word management system

Country Status (3)

Country Link
JP (1) JP6713657B1 (en)
CN (1) CN112699210B (en)
TW (1) TWI747172B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004334708A (en) * 2003-05-09 2004-11-25 Nec System Technologies Ltd External character managing system and method
JP2010165302A (en) * 2009-01-19 2010-07-29 National Printing Bureau System and method for retrieval of external character

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5151697A (en) * 1990-10-15 1992-09-29 Board Of Regents Of The University Of Washington Data structure management tagging system
US6389166B1 (en) * 1998-10-26 2002-05-14 Matsushita Electric Industrial Co., Ltd. On-line handwritten Chinese character recognition apparatus
US6603478B1 (en) * 2000-04-21 2003-08-05 Dynalab, Inc. System, method and a computer readable medium for improving character access
JP3602480B2 (en) * 2001-07-12 2004-12-15 株式会社リコー Font providing system, font switching system, character search system, font management server, client thereof, font providing method, font switching method, character code conversion method, character search method, and program thereof
CN1801050A (en) * 2005-01-06 2006-07-12 名伦通讯科技公司 System and method for inputting international literal character
TW200919210A (en) * 2007-07-18 2009-05-01 Steven Kays Adaptive electronic design
JP5326382B2 (en) * 2008-06-30 2013-10-30 富士通株式会社 Conversion management device
US20100231598A1 (en) * 2009-03-10 2010-09-16 Google Inc. Serving Font Glyphs
CN103186511B (en) * 2011-12-31 2017-03-08 北京大学 Chinese characters word-formation method and apparatus, the method for construction fontlib
CN104424196B (en) * 2013-08-20 2018-05-01 北大方正集团有限公司 The sorting and storing method and device of inlay, the method and device for creating supplement character library
TWI627540B (en) * 2014-01-06 2018-06-21 Academia Sinica A font cloud service system
CN105528345B (en) * 2014-09-28 2020-08-07 北大方正集团有限公司 Terminal, server and character complementing method
JP6542546B2 (en) * 2015-02-27 2019-07-10 株式会社日立システムズ Document data processing method and system
CN106294742B (en) * 2016-08-10 2019-05-14 中国科学技术大学 A kind of space launching site security reliability database construction method and analysis and assessment system

Also Published As

Publication number Publication date
JP6713657B1 (en) 2020-06-24
TW202117580A (en) 2021-05-01
JP2021068166A (en) 2021-04-30
CN112699210A (en) 2021-04-23
TWI747172B (en) 2021-11-21

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant