CN113282298B - Chinese character code conversion method, device, equipment, medium and product - Google Patents


Info

Publication number
CN113282298B
Authority
CN
China
Prior art keywords
target
stream data
byte stream
mode
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110654817.0A
Other languages
Chinese (zh)
Other versions
CN113282298A (en)
Inventor
居浩宇
王珂
赵晓飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Agricultural Bank of China
Original Assignee
Agricultural Bank of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agricultural Bank of China
Priority to CN202110654817.0A
Publication of CN113282298A
Application granted
Publication of CN113282298B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/40 Transformation of program code
    • G06F8/41 Compilation
    • G06F8/44 Encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The embodiments of the invention provide a Chinese character code conversion method, device, equipment, medium and product. The method comprises: receiving target byte stream data sent by a platform server; determining the number of bytes in the target byte stream data and the transcoding mode; if the target byte stream data comprises two bytes, determining, in a preset storage area, the target sub-mapping area table matching the target byte stream data according to the two-digit hexadecimal code of the first byte and the transcoding mode; determining, in the target sub-mapping area table, the target code of the transcoded target byte stream data according to the four-digit hexadecimal code corresponding to the target byte stream data; and converting the target byte stream data into the target code. Because the target code can be located quickly in the target sub-mapping area table from the four-digit code corresponding to the target byte stream data, the method achieves more efficient transcoding.

Description

Chinese character code conversion method, device, equipment, medium and product
Technical Field
The embodiment of the invention relates to the technical field of data processing, in particular to a Chinese character code conversion method, a device, equipment, a medium and a product.
Background
With the continuous development of technology, computer software developed on open-source and closed-source platforms has made daily life and work more convenient. In financial institutions, for example, a system architecture that combines a closed platform with an open platform is generally adopted for both security and convenience. Open platforms typically use several mainstream encodings, such as Unicode, GB18030 (Information technology: Chinese coded character set) and GBK (Chinese Internal Code Specification), while closed platforms mainly use EBCDIC (Extended Binary Coded Decimal Interchange Code) single-character encoding. Therefore, when transaction messages from the front-end transaction systems of a financial system (developed on the open platform) are exchanged with the back-end host that runs software developed on the closed platform, code system conversion is required.
The current approach to code system conversion, especially for Chinese characters, mainly targets Unicode and EBCDIC encoding. A mapping table between Unicode codes and EBCDIC codes is established first, and the whole table is then searched to find the corresponding converted code, which implements the transcoding function. However, the number of Chinese characters that must be converted in practice is huge, so transcoding by searching the entire mapping table is inefficient.
Disclosure of Invention
The invention provides a Chinese character code conversion method, a device, equipment, a medium and a product, which are used for solving the problem of low efficiency of the traditional Chinese character code conversion.
The first aspect of the embodiment of the invention provides a Chinese character code conversion method, which comprises the following steps:
receiving target byte stream data sent by a platform server;
determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of the EBCDIC mode and the Unicode mode in the transcoding mode is the original encoding mode and the other is the target encoding mode;
if the target byte stream data comprises two bytes, determining, in a preset storage area, a target sub-mapping area table matching the target byte stream data according to the two-digit hexadecimal code of the first byte and the transcoding mode, wherein a plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte;
determining, in the target sub-mapping area table, the target code of the transcoded target byte stream data according to the four-digit hexadecimal code corresponding to the target byte stream data;
converting the target byte stream data into the target code.
Optionally, before determining the target sub-mapping area table matching the target byte stream data in the preset storage area according to the two-digit code of the first byte and the transcoding mode, the method further includes:
acquiring the preset range bytes corresponding to each transcoding mode;
constructing the total mapping area table corresponding to the original encoding mode, with entries arranged in the order of the four-digit codes of the two-byte values within the preset range bytes;
dividing the total mapping area table according to the preset coding division range corresponding to the first byte to form the sub-mapping area tables corresponding to the respective transcoding mode.
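The construction and division steps above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the sample code pairs, the `UNMAPPED` placeholder and the function names are assumptions made for the example, and a dense list ordered by source code is only one possible layout for the total mapping area table.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical sample pairs (source code -> target code). A real table would be
# loaded from the full EBCDIC/Unicode mapping data, which the text does not list.
SAMPLE_PAIRS: Dict[int, int] = {
    0x4141: 0x3000,
    0x4142: 0x3001,
    0x6D00: 0x4E00,
    0x6D02: 0x4E8C,
}

UNMAPPED: Optional[int] = None  # assumed placeholder for codes with no target


def build_total_table(pairs: Dict[int, int], first: int, last: int) -> List[Optional[int]]:
    """Build a dense table ordered by source code: entry i maps code first + i."""
    return [pairs.get(code, UNMAPPED) for code in range(first, last + 1)]


def split_table(
    total: List[Optional[int]], first: int, ranges: List[Tuple[int, int]]
) -> List[Tuple[int, int, List[Optional[int]]]]:
    """Cut the total table into sub-tables; each range is (start, end), inclusive."""
    subs = []
    for start, end in ranges:
        subs.append((start, end, total[start - first : end - first + 1]))
    return subs


total = build_total_table(SAMPLE_PAIRS, 0x4141, 0x6DFF)
subs = split_table(total, 0x4141, [(0x4141, 0x6CFF), (0x6D00, 0x6DFF)])
```

Each sub-table keeps its own start code, so a later lookup only needs the start code and the position within that one sub-table.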
Optionally, in the method as described above, the transcoding mode is EBCDIC mode to Unicode mode, and the preset range bytes are all the Chinese character bytes supported by EBCDIC encoding;
or the transcoding mode is Unicode mode to EBCDIC mode, and the preset range bytes are the Chinese character bytes corresponding to all the Chinese character bytes supported by EBCDIC encoding.
Optionally, in the method as described above, determining the target sub-mapping area table matching the target byte stream data in the preset storage area according to the two-digit code of the first byte and the transcoding mode includes:
determining, according to the transcoding mode, a plurality of candidate sub-mapping area tables corresponding to the transcoding mode in the preset storage area;
judging whether the two-digit code of the first byte is within the preset coding division range of the first byte of a candidate sub-mapping area table;
if the two-digit code is within the preset coding division range of the first byte of that candidate sub-mapping area table, determining that candidate sub-mapping area table as the target sub-mapping area table.
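A minimal sketch of this selection step is shown below, assuming each candidate sub-table is registered with inclusive lower and upper bounds for the first byte. The direction labels, the bounds and the table names are hypothetical values chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical registry: transcoding mode -> [(low first byte, high first byte, table name)].
CANDIDATES: Dict[str, List[Tuple[int, int, str]]] = {
    "EBCDIC->Unicode": [(0x41, 0x6C, "sub0"), (0x6D, 0x80, "sub1")],
    "Unicode->EBCDIC": [(0x4E, 0x6F, "subA"), (0x70, 0x9F, "subB")],
}


def find_target_sub_table(mode: str, first_byte: int) -> Optional[str]:
    """Return the candidate whose first-byte range contains first_byte, if any."""
    for low, high, table in CANDIDATES[mode]:
        if low <= first_byte <= high:
            return table
    return None  # no sub-table covers this first byte
```

Because the transcoding mode narrows the candidates first, the range check only runs over the handful of sub-tables for that direction.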
Optionally, in the method as described above, determining the target code of the transcoded target byte stream data in the target sub-mapping area table according to the four-digit code corresponding to the target byte stream data includes:
determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-digit code corresponding to the target byte stream data and the two-byte initial code of the target sub-mapping area table;
determining the target code from the target sub-mapping area table according to the arrangement position.
Optionally, in the method as described above, determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-digit code corresponding to the target byte stream data and the two-byte initial code of the target sub-mapping area table includes:
determining the difference between the four-digit code corresponding to the target byte stream data and the four-digit code of the initial code as the offset of the target byte stream data in the target sub-mapping area table;
determining the arrangement position of the target byte stream data in the target sub-mapping area table according to the four-digit code of the initial code and the offset.
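The offset calculation above amounts to one subtraction followed by a direct index. The sketch below assumes the sub-table is stored as a list ordered by source code and that its initial code is known; the sample entries and the function name are illustrative only.

```python
from typing import List


def lookup_by_offset(data: bytes, initial_code: int, entries: List[int]) -> int:
    """Locate the target code by the offset of the input from the initial code."""
    code = int.from_bytes(data, "big")  # the four-digit hexadecimal code
    offset = code - initial_code        # difference from the table's initial code
    if not 0 <= offset < len(entries):
        raise KeyError(f"code {code:04X} is outside this sub-table")
    return entries[offset]


# Hypothetical sub-table whose initial code is 6D00.
SUB_ENTRIES = [0x4E00, 0x4E01, 0x4E03]
result = lookup_by_offset(bytes([0x6D, 0x02]), 0x6D00, SUB_ENTRIES)
```

No comparison against individual table rows is needed, which is where the speed-up over a full-table search comes from.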
Optionally, in the method as described above, if the target byte stream data includes four bytes and the original encoding mode is the Unicode mode, the method further includes:
performing a hash calculation on the code of each byte of the target byte stream data to generate a hash value corresponding to each byte;
generating the EBCDIC code corresponding to each byte according to the hash value and a preset splicing value.
Optionally, in the method as described above, if the target byte stream data includes four bytes and the original encoding mode is the EBCDIC mode, the method further includes:
removing the preset splicing value from the code of each byte in the target byte stream data to generate an intermediate value on which the inverse hash operation is to be performed;
performing the inverse hash operation on the intermediate value to generate the Unicode code corresponding to each byte.
A second aspect of the embodiments of the present invention provides a Chinese character code conversion device, including:
a receiving module, configured to receive target byte stream data sent by a platform server;
a byte determining module, configured to determine the number of bytes in the target byte stream data and a transcoding mode, wherein one of the EBCDIC mode and the Unicode mode in the transcoding mode is the original encoding mode and the other is the target encoding mode;
a mapping table determining module, configured to determine, if the target byte stream data comprises two bytes, a target sub-mapping area table matching the target byte stream data in a preset storage area according to the two-digit hexadecimal code of the first byte and the transcoding mode, wherein a plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte;
a code determining module, configured to determine, in the target sub-mapping area table, the target code of the transcoded target byte stream data according to the four-digit hexadecimal code corresponding to the target byte stream data;
a conversion module, configured to convert the target byte stream data into the target code.
Optionally, the device as described above further includes:
a construction module, configured to acquire the preset range bytes corresponding to each transcoding mode; construct the total mapping area table corresponding to the original encoding mode, with entries arranged in the order of the four-digit codes of the two-byte values within the preset range bytes; and divide the total mapping area table according to the preset coding division range corresponding to the first byte to form the sub-mapping area tables corresponding to the respective transcoding mode.
Optionally, in the device as described above, the transcoding mode is EBCDIC mode to Unicode mode, and the preset range bytes are all the Chinese character bytes supported by EBCDIC encoding;
or the transcoding mode is Unicode mode to EBCDIC mode, and the preset range bytes are the Chinese character bytes corresponding to all the Chinese character bytes supported by EBCDIC encoding.
Optionally, in the device as described above, when determining the target sub-mapping area table matching the target byte stream data in the preset storage area according to the two-digit code of the first byte and the transcoding mode, the mapping table determining module is specifically configured to:
determine, according to the transcoding mode, a plurality of candidate sub-mapping area tables corresponding to the transcoding mode in the preset storage area; judge whether the two-digit code of the first byte is within the preset coding division range of the first byte of a candidate sub-mapping area table; and, if it is within that range, determine that candidate sub-mapping area table as the target sub-mapping area table.
Optionally, in the device as described above, the code determining module is specifically configured to:
determine the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-digit code corresponding to the target byte stream data and the two-byte initial code of the target sub-mapping area table; and determine the target code from the target sub-mapping area table according to the arrangement position.
Optionally, in the device as described above, when determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-digit code corresponding to the target byte stream data and the two-byte initial code of the target sub-mapping area table, the code determining module is specifically configured to:
determine the difference between the four-digit code corresponding to the target byte stream data and the four-digit code of the initial code as the offset of the target byte stream data in the target sub-mapping area table; and determine the arrangement position of the target byte stream data in the target sub-mapping area table according to the four-digit code of the initial code and the offset.
Optionally, in the device as described above, if the target byte stream data includes four bytes and the original encoding mode is the Unicode mode, the device further includes:
an EBCDIC transcoding module, configured to perform a hash calculation on the code of each byte of the target byte stream data to generate a hash value corresponding to each byte, and to generate the EBCDIC code corresponding to each byte according to the hash value and a preset splicing value.
Optionally, in the device as described above, if the target byte stream data includes four bytes and the original encoding mode is the EBCDIC mode, the device further includes:
a Unicode transcoding module, configured to remove the preset splicing value from the code of each byte in the target byte stream data to generate an intermediate value on which the inverse hash operation is to be performed, and to perform the inverse hash operation on the intermediate value to generate the Unicode code corresponding to each byte.
A third aspect of the embodiments of the present invention provides an electronic device, including a memory and a processor;
the memory is used to store instructions executable by the processor;
wherein the processor is configured to perform the Chinese character code conversion method of any one of the first aspect.
A fourth aspect of the embodiments of the present invention provides a computer-readable storage medium storing computer-executable instructions which, when executed by a processor, implement the Chinese character code conversion method of any one of the first aspect.
A fifth aspect of the embodiments of the present invention provides a computer program product comprising a computer program which, when executed by a processor, implements the Chinese character code conversion method of any one of the first aspect.
The embodiments of the invention provide a Chinese character code conversion method, device, equipment, medium and product. The method comprises: receiving target byte stream data sent by a platform server; determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of the EBCDIC mode and the Unicode mode in the transcoding mode is the original encoding mode and the other is the target encoding mode; if the target byte stream data comprises two bytes, determining, in a preset storage area, a target sub-mapping area table matching the target byte stream data according to the two-digit hexadecimal code of the first byte and the transcoding mode, wherein a plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte; determining, in the target sub-mapping area table, the target code of the transcoded target byte stream data according to the four-digit hexadecimal code corresponding to the target byte stream data; and converting the target byte stream data into the target code. In this method, after the target byte stream data sent by the platform server is received, the transcoding mode is determined, along with whether the target byte stream data consists of two bytes. Different transcoding modes correspond to different sets of sub-mapping area tables. Once the data is determined to be two bytes, it is matched against the plurality of sub-mapping area tables to find the target sub-mapping area table, and the target code is then determined in that table according to the four-digit code corresponding to the target byte stream data, which completes the transcoding. Because each sub-mapping area table is formed by dividing the total mapping area table according to the preset coding range corresponding to the first byte, it contains only part of the codes in the total mapping area table, so the target code can be located quickly from the four-digit code and transcoding becomes more efficient.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
FIG. 1 is a diagram of a scenario in which the Chinese character code conversion method of the embodiments of the present invention may be implemented;
FIG. 2 is a flowchart of a Chinese character code conversion method according to a first embodiment of the present invention;
FIG. 3 is a flowchart of a Chinese character code conversion method according to a second embodiment of the present invention;
FIG. 4 is a schematic diagram of the sub-mapping areas in the Chinese character code conversion method according to the second embodiment of the present invention;
FIG. 5 is a schematic structural diagram of a Chinese character code conversion device according to a third embodiment of the present invention;
FIG. 6 is a schematic structural diagram of a Chinese character code conversion device according to a fourth embodiment of the present invention;
FIG. 7 is a schematic structural diagram of an electronic device according to a fifth embodiment of the present invention.
Specific embodiments of the present invention have been shown by way of the above drawings and will be described in more detail below. The drawings and the written description are not intended to limit the scope of the inventive concepts in any way, but rather to illustrate the inventive concepts to those skilled in the art by reference to the specific embodiments.
Detailed Description
Reference will now be made in detail to exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the invention. Rather, they are merely examples of apparatus and methods consistent with aspects of the invention as detailed in the accompanying claims.
The technical scheme of the invention is described in detail below by specific examples. The following embodiments may be combined with each other, and the same or similar concepts or processes may not be described in detail in some embodiments. Embodiments of the present invention will be described below with reference to the accompanying drawings.
For a clear understanding of the technical solutions of the present application, the prior art solutions are described in detail first. Code system conversion for Chinese characters must handle a huge number of characters; GB18030-2000, for example, defines 27533 Chinese characters. As Chinese character encoding has developed, rarely used characters have also had to be added, and these number about 5000, so the mapping table used for code conversion contains even more codes. When converting between encodings that support Chinese characters, for example between EBCDIC and Unicode, the mapping table of Unicode codes and EBCDIC codes is generally scanned in full: the corresponding converted code is searched for entry by entry, and the conversion is finally performed with the code that is found. Because the mapping table contains a large number of codes and a full-table scan is performed for every input code, the current transcoding approach is inefficient.
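The cost of the full-table scan described above can be illustrated with a small sketch. The table contents are synthetic (a hypothetical run of 20000 consecutive code pairs), and the function simply counts how many rows are compared before a match is found.

```python
from typing import List, Optional, Tuple


def full_scan(table: List[Tuple[int, int]], code: int) -> Tuple[Optional[int], int]:
    """Search row by row; return the target code and the number of comparisons."""
    comparisons = 0
    for src, dst in table:
        comparisons += 1
        if src == code:
            return dst, comparisons
    return None, comparisons


# Synthetic mapping table: 20000 hypothetical (source, target) pairs.
table = [(0x4141 + i, 0x4E00 + i) for i in range(20000)]

# Looking up the last entry touches every row of the table.
target, comparisons = full_scan(table, table[-1][0])
```

In the worst case the number of comparisons equals the table size, and that cost is paid again for every input character.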
To address the low efficiency of Chinese character code conversion in the prior art, the inventor found in research that the total mapping area table of Unicode codes and EBCDIC codes can be divided into a plurality of sub-mapping area tables according to the preset coding range corresponding to the first byte. During transcoding, the target byte stream data then only needs to be matched with the corresponding sub-mapping area table, and the converted target code is determined within that sub-mapping area table, which makes transcoding more efficient. Specifically, after the target byte stream data sent by the platform server is received, the transcoding mode is determined, along with whether the target byte stream data consists of two bytes. Different transcoding modes correspond to different sets of sub-mapping area tables. After the target byte stream data is determined to include two bytes, it is matched against the plurality of sub-mapping area tables to find the target sub-mapping area table, and the target code is determined in that table according to the four-digit hexadecimal code corresponding to the target byte stream data, which completes the transcoding. Because each sub-mapping area table is formed by dividing the total mapping area table according to the preset coding range corresponding to the first byte, it contains only part of the codes in the total mapping area table, so the target code can be located quickly from the four-digit code and transcoding becomes more efficient.
The inventor proposes the technical scheme of the application based on the creative discovery.
The application scenario of the Chinese character code conversion method provided by the embodiments of the invention is described below. As shown in FIG. 1, reference numeral 1 denotes an electronic device and reference numeral 2 denotes a platform server. The network architecture of the application scenario comprises the electronic device 1 and the platform server 2, which originate from different types of data sources. When the platform server 2 needs to interact with the electronic device 1, it sends the target byte stream data required for the interaction to the electronic device 1. The electronic device 1 determines the number of bytes in the target byte stream data and the transcoding mode, wherein one of the EBCDIC mode and the Unicode mode in the transcoding mode is the original encoding mode and the other is the target encoding mode. Assuming the target byte stream data sent by the platform server is in the EBCDIC mode, it needs to be converted from the EBCDIC mode to the Unicode mode. If the electronic device 1 determines that the target byte stream data includes two bytes, it determines the target sub-mapping area table matching the target byte stream data in a preset storage area according to the two-digit hexadecimal code of the first byte and the transcoding mode. A plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte. The electronic device 1 then determines, in the target sub-mapping area table, the target code of the transcoded target byte stream data, that is, the code in the Unicode mode, according to the four-digit hexadecimal code corresponding to the target byte stream data. Finally, the target byte stream data is converted into the target code. If the target byte stream data sent by the platform server 2 is Unicode-encoded, it is transcoded in the same way as described above for EBCDIC-encoded data.
Embodiments of the present invention will now be described with reference to the accompanying drawings.
FIG. 2 is a flowchart of a Chinese character code conversion method according to a first embodiment of the present invention. As shown in FIG. 2, in this embodiment the method is executed by a Chinese character code conversion device, which may be integrated in an electronic device. The Chinese character code conversion method provided by this embodiment comprises the following steps:
Step S101, receiving target byte stream data sent by a platform server.
In this embodiment, the platform server may be based on an open-source or a closed-source platform, and the target byte stream data it sends may be in an encoding format used on open platforms, such as Unicode, or in an encoding format used on closed platforms, such as EBCDIC.
Step S102, determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of an EBCDIC mode and a Unicode mode in the transcoding mode is an original encoding mode, and the other is a target encoding mode.
In this embodiment, the transcoding mode is either conversion from EBCDIC encoding to Unicode encoding or conversion from Unicode encoding to EBCDIC encoding. The target byte stream data may contain one byte, two bytes or four bytes: one byte corresponds to digits, English letters, symbols and the like; two bytes correspond to Chinese characters and the like; and four bytes correspond to special or custom characters.
Determining the transcoding mode provides the basis for the subsequent conversion in either of the two directions.
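The byte-count check in step S102 can be sketched as a simple dispatch. The category labels below are hypothetical names chosen for the example; the text only states which kinds of characters each byte count corresponds to.

```python
def classify_by_length(data: bytes) -> str:
    """Classify target byte stream data by the number of bytes it contains."""
    n = len(data)
    if n == 1:
        return "single-byte"   # digits, English letters, symbols
    if n == 2:
        return "double-byte"   # Chinese characters, handled via sub-mapping tables
    if n == 4:
        return "four-byte"     # special or custom characters
    raise ValueError(f"unsupported length: {n} bytes")
```

Only the two-byte branch goes on to the sub-mapping area table lookup described in steps S103 and S104.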
Step S103, if it is determined that the target byte stream data includes two bytes, determining a target sub-mapping area table matching the target byte stream data in a preset storage area according to the two-digit hexadecimal code of the first byte and the transcoding mode, wherein a plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte.
In this embodiment, codes in both the EBCDIC mode and the Unicode mode are written in hexadecimal, which uses the digits 0-9 and the letters A, B, C, D, E and F. Two bytes correspond to a four-digit hexadecimal code, so the four-digit code of two bytes ranges over 0000-FFFF, and the first byte corresponds to the first two digits. The preset storage area may be a region in the memory of the electronic device.
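To make the relationship between bytes and hexadecimal digits concrete, the sketch below converts a two-byte value into its four-digit hexadecimal code (the "four-bit code" of the text) and extracts the two digits of the first byte. The function names and the sample value are assumptions for illustration.

```python
def four_digit_code(data: bytes) -> str:
    """Return the four-digit hexadecimal code of a two-byte value."""
    if len(data) != 2:
        raise ValueError("expected exactly two bytes")
    return data.hex().upper()


def first_byte_code(data: bytes) -> str:
    """Return the two hexadecimal digits that belong to the first byte."""
    return four_digit_code(data)[:2]


sample = bytes([0x6D, 0x02])
code = four_digit_code(sample)    # four hexadecimal digits for two bytes
prefix = first_byte_code(sample)  # first two digits, i.e. the first byte
```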
Each sub-mapping area table is formed by dividing the total mapping area table according to the preset coding range corresponding to the first byte. For example, if the total mapping area table covers the code mappings in the range 4141-FDFE, it can be divided according to the first two digits of the code into, say, 5 areas: 4141-6CFD, 6D00-80FD, 8100-A1FD, A200-CCFD and CD00-FDFE. These 5 areas are 5 sub-mapping area tables.
In this embodiment, the sub-mapping area tables may be set according to actual requirements: the total mapping area table may be divided evenly into a plurality of sub-mapping area tables, or it may be divided unevenly, which is not limited in this embodiment.
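The five example areas can be turned into a small lookup. The text describes checking whether the code falls within each area's range; the sketch below swaps in a binary search over the sorted start codes (Python's `bisect` module) as one possible way to do that check, which is an assumption of this example rather than something the text specifies.

```python
import bisect
from typing import Optional

# The five example areas from the text, as inclusive (start, end) code ranges.
REGIONS = [
    (0x4141, 0x6CFD),
    (0x6D00, 0x80FD),
    (0x8100, 0xA1FD),
    (0xA200, 0xCCFD),
    (0xCD00, 0xFDFE),
]
STARTS = [start for start, _ in REGIONS]


def region_index(code: int) -> Optional[int]:
    """Return the index of the area containing code, or None if no area covers it."""
    i = bisect.bisect_right(STARTS, code) - 1
    if i >= 0 and code <= REGIONS[i][1]:
        return i
    return None
```

Codes that fall between two areas, such as 80FE in this example, are reported as uncovered rather than being assigned to a neighbouring area.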
Step S104, determining, in the target sub-mapping area table, the target code of the transcoded target byte stream data according to the four-digit hexadecimal code corresponding to the target byte stream data.
In this embodiment, the mappings in the target sub-mapping area table are arranged in code order. For example, if the coding range of the table is 4141-6FFF, the first entry is the mapping for code 4141, the second entry is the mapping for code 4142, and the last entry is the mapping for code 6FFF. The target code of the transcoded target byte stream data can therefore be determined in the table from the four-digit code corresponding to the target byte stream data, for example by comparing that code with the initial code of the table, or in other ways, which is not limited in this embodiment.
Step S105, converting the target byte stream data into target encoding.
In this embodiment, after the target byte stream data is converted into the target code, the information network interaction with the platform server can be realized.
The embodiment of the invention provides a Chinese character code conversion method, which comprises the following steps: receiving target byte stream data sent by the platform server; determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of the EBCDIC mode and the Unicode mode in the transcoding mode is the original encoding mode and the other is the target encoding mode; if it is determined that the target byte stream data comprises two bytes, determining, in a preset storage area, a target sub-mapping area table matched with the target byte stream data according to the two-bit code of the first byte and the transcoding mode, wherein the preset storage area stores a plurality of sub-mapping area tables corresponding to the transcoding mode, and each sub-mapping area table is formed by dividing the total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte; determining, in the target mapping area table, the target code of the transcoded target byte stream data according to the four-bit code corresponding to the target byte stream data; and converting the target byte stream data into the target code. According to this method, after the target byte stream data sent by the platform server is received, the transcoding mode is determined, together with whether the target byte stream data consists of two bytes. Different transcoding modes correspond to different sub-mapping area tables. Once these are determined, the target byte stream data is matched against the plurality of sub-mapping area tables to find the target sub-mapping area table, and the target code is then determined in that table according to the four-bit code corresponding to the target byte stream data, thereby realizing transcoding.
Because each sub-mapping area table is formed by dividing the total mapping area table according to the preset coding range corresponding to the first byte, it contains only part of the codes in the total mapping area table. The target code can therefore be located quickly in the target mapping area table according to the four-bit code corresponding to the target byte stream data, and more efficient transcoding is realized.
Fig. 3 is a flow chart of a method for transcoding chinese characters according to a second embodiment of the present invention, as shown in fig. 3, in which each step is further refined based on the method for transcoding chinese characters according to the previous embodiment of the present invention. The method for converting Chinese character codes provided by the embodiment comprises the following steps.
In step S201, target byte stream data sent by the platform server is received.
In this embodiment, the implementation of step S201 is similar to that of step S101 in the previous embodiment of the present invention, and will not be described in detail here.
Optionally, in this embodiment, before determining, in the preset storage area, the target sub-mapping area table that matches the target byte stream data according to the two-bit encoding and transcoding manner of the first byte, the method further includes:
Acquiring the preset range bytes corresponding to each transcoding mode.
Constructing a total mapping area table corresponding to the original coding mode according to the arrangement order of the four-bit codes corresponding to the two-byte values in the preset range bytes.
Dividing the total mapping area table according to a preset code division range corresponding to the first byte to form the sub-mapping area tables corresponding to that transcoding mode.
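The three construction steps above can be sketched as follows. This is only an illustration under stated assumptions: the table is represented as a list of (source code, target code) pairs, the function names are hypothetical, and the target codes in the toy data are made-up values rather than real EBCDIC-to-Unicode mappings.

```python
def build_total_table(pairs):
    """Arrange (source_code, target_code) pairs in order of source code."""
    return sorted(pairs, key=lambda pair: pair[0])


def split_by_first_byte(total_table, first_byte_ranges):
    """Divide an ordered total table into one ordered sub-table per range."""
    sub_tables = [[] for _ in first_byte_ranges]
    for source, target in total_table:
        first_byte = source >> 8  # first two hexadecimal digits of the code
        for index, (low, high) in enumerate(first_byte_ranges):
            if low <= first_byte <= high:
                sub_tables[index].append((source, target))
                break
    return sub_tables


# Toy data: the target codes are placeholders, not real mappings.
toy_pairs = [(0x6D00, 0x4E8C), (0x4142, 0x4E01), (0x4141, 0x4E00)]
toy_total = build_total_table(toy_pairs)
toy_subs = split_by_first_byte(toy_total, [(0x41, 0x6C), (0x6D, 0x80)])
```

Since the total table is sorted before it is divided, each sub-table keeps its entries in coding order.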
In this embodiment, different encoding modes support different ranges of Chinese character bytes. The EBCDIC mode basically supports the 27533 Chinese characters in GB18030, while the Unicode mode supports those 27533 Chinese characters and also some other uncommon characters. Therefore, a total mapping area table generated from the Chinese character byte coding range supported by both modes enables the conversion of Chinese character codes between the Unicode mode and the EBCDIC mode. The byte encodings supported by the Unicode mode are shown in Table 1.
TABLE 1 coding Range Table supported by Unicode mode
Optionally, in this embodiment, the transcoding mode is EBCDIC mode to Unicode mode, and the preset range bytes are all the Chinese character bytes supported by EBCDIC coding.
Or the transcoding mode is Unicode mode to EBCDIC mode, and the preset range bytes are the Unicode Chinese character bytes corresponding to all the Chinese character bytes supported by EBCDIC coding.
In this embodiment, the preset range bytes of the EBCDIC mode need to match the preset range bytes of the Unicode mode.
Step S202, determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of an EBCDIC mode and a Unicode mode in the transcoding mode is an original encoding mode, and the other is a target encoding mode.
In this embodiment, the implementation of step S202 is similar to that of step S102 in the previous embodiment of the present invention, and will not be described in detail here.
Step S203, if it is determined that the target byte stream data includes two bytes, determining a plurality of candidate sub-area mapping tables corresponding to the transcoding mode in the preset storage area according to the transcoding mode.
In this embodiment, the candidate sub-region mapping table is a sub-region mapping table that may be matched with the target byte stream data.
Optionally, in this embodiment, if the target byte stream data includes four bytes and the original encoding mode is Unicode mode, the method further includes:
Carrying out hash calculation on the code of each byte of the target byte stream data to generate a hash value corresponding to each byte.
Generating the EBCDIC code corresponding to each byte according to the hash value and a preset splicing value.
In this embodiment, a custom region may be reserved in advance in the EBCDIC character encoding table, for example the region beginning with F1. Chinese characters that the Unicode mode supports but the EBCDIC mode does not originally support can be assigned codes in this custom region. Each such code corresponds to a Unicode code: a hash calculation is performed on the Unicode code, and the EBCDIC code in the custom region is generated from the hash value and the preset splicing value, thereby realizing code conversion between the Unicode code and the EBCDIC code.
Optionally, in this embodiment, if the target byte stream data includes four bytes, the original encoding mode is an EBCDIC mode, and the method further includes:
Removing the preset splicing value from the code of each byte in the target byte stream data to generate an intermediate value on which the hash inverse operation is to be performed.
Carrying out the hash inverse operation on the intermediate value to generate the Unicode code corresponding to each byte.
In this embodiment, the conversion from the EBCDIC mode to the Unicode mode mirrors the conversion from the Unicode mode to the EBCDIC mode: the preset splicing value is first removed from the code of each byte in the target byte stream data, and the hash inverse operation is then performed to obtain the corresponding Unicode code.
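The text does not specify the hash function or the form of the splicing value, so the following is only a hypothetical sketch of how a reversible per-byte hash and a spliced prefix could fit together. Everything in it is an assumption: the affine mapping modulo 256 stands in for the unspecified hash, the prefix byte F1 is chosen only because the text mentions a region beginning with F1, and prefixing each hashed byte is one possible reading of the splicing step.

```python
# All constants below are hypothetical; the patent does not define them.
SPLICE_PREFIX = 0xF1      # assumed splicing value (custom region prefix)
MULTIPLIER = 167          # odd, therefore invertible modulo 256
INCREMENT = 13
INVERSE_MULTIPLIER = pow(MULTIPLIER, -1, 256)  # requires Python 3.8+


def hash_byte(value: int) -> int:
    """Stand-in reversible hash: an affine map on a single byte."""
    return (value * MULTIPLIER + INCREMENT) % 256


def unhash_byte(value: int) -> int:
    """Inverse of hash_byte."""
    return ((value - INCREMENT) * INVERSE_MULTIPLIER) % 256


def unicode_to_custom_ebcdic(data: bytes) -> bytes:
    """Hash each byte, then splice the prefix in front of each hash value."""
    out = bytearray()
    for value in data:
        out.append(SPLICE_PREFIX)
        out.append(hash_byte(value))
    return bytes(out)


def custom_ebcdic_to_unicode(data: bytes) -> bytes:
    """Remove the spliced prefix, then apply the inverse hash to each byte."""
    if len(data) % 2 or any(b != SPLICE_PREFIX for b in data[0::2]):
        raise ValueError("data is not in the assumed custom-region form")
    return bytes(unhash_byte(value) for value in data[1::2])


sample = bytes([0x00, 0x02, 0xA7, 0x00])  # arbitrary four-byte input
round_trip = custom_ebcdic_to_unicode(unicode_to_custom_ebcdic(sample))
```

The only property the sketch is meant to show is that the forward and inverse operations undo each other; any real implementation would need a hash whose inverse is well defined in the same way.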
In step S204, it is determined whether the two-bit code of the first byte is within the preset code division range of the first byte of the candidate sub-map area table. If yes, step S205 is executed, and if no, step S209 is executed.
In this embodiment, the EBCDIC coding region is shown in fig. 4, in which the vertical axis gives the coding range of the first byte, 41-FE, and the horizontal axis gives the coding range of the second byte, 41-FD. Assuming that the target byte stream data is EBCDIC encoded, the candidate sub-mapping area tables are Area1-Area5 in fig. 4, and the Reserved Area is an area reserved for custom settings in EBCDIC encoding. The content in the parentheses following each Area in the figure indicates the range of codes involved and the number of codes.
In step S205, the candidate sub-map area table is determined as the target sub-map area table.
In this embodiment, if the two-bit code of the first byte is within the preset code division range of the first byte of a candidate sub-mapping area table, this indicates that the code corresponding to the target byte stream data is contained in that candidate sub-mapping area table.
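Steps S204 and S205 can be sketched as a range check on the first byte. The candidate list below reuses the first-byte ranges of the five example regions given earlier in the description; the table names and the function `select_sub_table` are hypothetical.

```python
from typing import Optional

# (lowest first byte, highest first byte, table name); names are placeholders.
CANDIDATES = [
    (0x41, 0x6C, "sub_table_1"),
    (0x6D, 0x80, "sub_table_2"),
    (0x81, 0xA1, "sub_table_3"),
    (0xA2, 0xCC, "sub_table_4"),
    (0xCD, 0xFD, "sub_table_5"),
]


def select_sub_table(data: bytes, candidates=CANDIDATES) -> Optional[str]:
    """Return the candidate whose first-byte range contains data[0]."""
    first_byte = data[0]
    for low, high, name in candidates:
        if low <= first_byte <= high:
            return name   # this candidate becomes the target table
    return None           # no candidate table covers this first byte


selected = select_sub_table(bytes([0x6D, 0x10]))
```

Returning `None` corresponds to the case in which no candidate table covers the first byte, so no lookup is attempted for that data.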
Step S206, determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-mapping area table.
In this embodiment, the mapping relationships of the codes in the target mapping area table are arranged in coding order. Assuming that the initial code of the target mapping area table is 4444 and the code of the target byte stream data is 4450, the difference is hexadecimal 000C, that is, 12, so the code of the target byte stream data occupies the 13th position in the target sub-mapping area table.
Optionally, in this embodiment, determining an arrangement position of the target code of the target byte stream data in the target sub-map area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-map area table includes:
Determining the difference between the four-bit code corresponding to the target byte stream data and the four-bit code of the initial code as the deviation value of the target byte stream data in the target sub-mapping area table.
Determining the arrangement position of the target byte stream data in the target sub-mapping area table according to the four-bit code of the initial code and the deviation value.
In this embodiment, since the mapping relationships of the codes in the target mapping area table are arranged according to the coding order, the arrangement position of the target byte stream data in the target sub-mapping area table can be determined according to the difference between the four-bit code corresponding to the target byte stream data and the four-bit code of the initial code, so that the arrangement position of the target byte stream data in the target sub-mapping area table can be determined more quickly, and a foundation is provided for the subsequent higher-efficiency transcoding.
Step S207, determining target codes from the target sub-region mapping table according to the arrangement positions.
In this embodiment, the target code may be determined from the target sub-region mapping table according to the arrangement position, so as to implement subsequent transcoding.
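Steps S206 and S207 amount to an index calculation followed by a direct read. The sketch below assumes that the sub-table is a list holding one target code for every consecutive source code starting at the initial code; the function names and the target values in the toy table are made up for illustration.

```python
def offset_in_sub_table(code: int, start_code: int) -> int:
    """Deviation value: distance of the code from the table's initial code."""
    offset = code - start_code
    if offset < 0:
        raise ValueError("code lies before the initial code of the table")
    return offset


def lookup_target(sub_table, start_code: int, code: int) -> int:
    """Read the target code stored at the computed zero-based position."""
    return sub_table[offset_in_sub_table(code, start_code)]


# Toy sub-table with 16 placeholder target codes, initial code 4444.
toy_sub_table = list(range(0x5000, 0x5010))
offset = offset_in_sub_table(0x4450, 0x4444)   # zero-based deviation value
target = lookup_target(toy_sub_table, 0x4444, 0x4450)
```

With initial code 4444 and code 4450 the deviation value is 12, which is the 13th entry when counting from one, in line with the example in the description. No search over the table is needed, which is where the speed-up comes from.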
Step S208, the target byte stream data is converted into target encoding.
In this embodiment, after determining the target encoding, the target byte stream data may be converted into the target encoding.
In step S209, the next byte stream data sent by the platform server is received.
In this embodiment, if the code of the first byte of the target byte stream data is not within the preset code division range of the first byte of any candidate sub-map area table, it represents that the target byte stream data cannot find the corresponding target code through the sub-map area table.
According to the Chinese character code conversion method provided by the embodiment of the invention, the arrangement position of the target byte stream data in the target sub-mapping area table is determined through the difference value between the four-bit code corresponding to the target byte stream data and the four-bit code of the initial code, so that the arrangement position of the target byte stream data in the target sub-mapping area table can be determined more quickly, and higher-efficiency transcoding is realized.
Fig. 5 is a schematic structural diagram of a chinese character code conversion device according to a third embodiment of the present invention, as shown in fig. 5, in this embodiment, the chinese character code conversion device 300 includes:
a receiving module 301, configured to receive target byte stream data sent by the platform server.
The byte determining module 302 is configured to determine the number of bytes in the target byte stream data and a transcoding mode, where one of the EBCDIC mode and the Unicode mode is an original encoding mode and the other is a target encoding mode.
The mapping table determining module 303 is configured to determine, if it is determined that the target byte stream data includes two bytes, a target sub-mapping area table that matches the target byte stream data in a preset storage area according to a two-bit encoding and transcoding manner of a first byte, where a plurality of sub-mapping area tables corresponding to the transcoding manner are stored in the preset storage area, and each sub-mapping area table is formed by dividing a total mapping area table corresponding to the transcoding manner according to a preset encoding range corresponding to the first byte.
The code determining module 304 is configured to determine a target code after the target byte stream data is transcoded in the target mapping area table according to the four-bit code corresponding to the target byte stream data.
A conversion module 305 for converting the target byte stream data into target encoding.
The chinese character code conversion device provided in this embodiment may implement the technical scheme of the method embodiment shown in fig. 2, and its implementation principle and technical effects are similar to those of the method embodiment shown in fig. 2, and are not described in detail herein.
Meanwhile, fig. 6 is a schematic structural diagram of a chinese character encoding and converting apparatus according to a fourth embodiment of the present invention, and as shown in fig. 6, the chinese character encoding and converting apparatus 400 is further refined based on the chinese character encoding and converting apparatus according to the previous embodiment.
Optionally, in this embodiment, the chinese character code conversion device 400 further includes:
The construction module 401 is configured to obtain a preset range byte corresponding to each transcoding mode. And constructing a total mapping area table corresponding to the original coding mode according to the arrangement sequence of four-bit codes corresponding to the two bytes in the preset range. Dividing the total mapping region table according to a preset coding dividing range corresponding to the first byte to form each sub mapping region table corresponding to the corresponding conversion mode.
Optionally, in this embodiment, the transcoding mode is EBCDIC mode to Unicode mode, and the preset range bytes are all the Chinese character bytes supported by EBCDIC coding.
Or the transcoding mode is Unicode mode to EBCDIC mode, and the preset range bytes are the Unicode Chinese character bytes corresponding to all the Chinese character bytes supported by EBCDIC coding.
Optionally, in this embodiment, when the mapping table determining module 303 determines, in the preset storage area, the target sub-mapping area table that matches the target byte stream data according to the two-bit encoding and transcoding manner of the first byte, the mapping table determining module is specifically configured to:
Determining a plurality of candidate sub-region mapping tables corresponding to the transcoding mode in the preset storage area according to the transcoding mode. Judging whether the two-bit code of the first byte is within the preset code division range of the first byte of a certain candidate sub-mapping area table. If so, determining that candidate sub-mapping area table as the target sub-mapping area table.
Optionally, in this embodiment, the code determining module 304 is specifically configured to:
Determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-bit code corresponding to the target byte stream data and the two-byte initial code of the target sub-mapping area table. Determining the target code from the target sub-region mapping table according to the arrangement position.
Optionally, in this embodiment, when determining the arrangement position of the target code of the target byte stream data in the target sub-map area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-map area table, the code determining module 304 is specifically configured to:
And determining the difference value between the four-bit code corresponding to the target byte stream data and the four-bit code of the initial code as the deviation value of the target byte stream data in the target sub-mapping area table. And determining the arrangement position of the target byte stream data in the target sub-mapping area table according to the four-bit code of the initial code and the deviation value.
Optionally, in this embodiment, if the target byte stream data includes four bytes and the original encoding mode is Unicode mode, the chinese character transcoding device 400 further includes:
The EBCDIC transcoding module 402 is configured to perform hash computation on the encoding of each byte of the target byte stream data, so as to generate a hash value corresponding to each byte. And generating EBCDIC codes corresponding to the bytes according to the hash value and the preset splicing value.
Optionally, in this embodiment, if the target byte stream data includes four bytes, the original encoding mode is EBCDIC mode, and the chinese character transcoding device 400 further includes:
The Unicode transcoding module 403 is configured to remove a preset splicing value in the encoding of each byte in the target byte stream data, so as to generate an intermediate value to be subjected to hash inverse operation. Hash inverse operation is carried out on the intermediate value to generate Unicode codes corresponding to the bytes.
The chinese character code conversion device provided in this embodiment may implement the technical scheme of the method embodiment shown in fig. 2 to 4, and its implementation principle and technical effects are similar to those of the method embodiment shown in fig. 2 to 4, and are not described in detail herein.
According to embodiments of the present invention, the present invention also provides an electronic device, a computer-readable storage medium, and a computer program product.
Fig. 7 is a schematic structural diagram of an electronic device according to a fifth embodiment of the present invention. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processors, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the inventions described and/or claimed herein.
As shown in fig. 7, the electronic device includes: a processor 501, and a memory 502. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions executing within the electronic device.
Memory 502 is a non-transitory computer readable storage medium provided by the present invention. The memory stores instructions executable by the at least one processor to cause the at least one processor to execute the Chinese character code conversion method provided by the invention. The non-transitory computer readable storage medium of the present invention stores computer instructions for causing a computer to execute the chinese character transcoding method provided by the present invention.
The memory 502 is used as a non-transitory computer readable storage medium for storing non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules (e.g., the receiving module 301, the byte determining module 302, the mapping table determining module 303, the encoding determining module 304, and the converting module 305 shown in fig. 5) corresponding to the chinese character encoding conversion method according to the embodiment of the present invention. The processor 501 executes various functional applications of the server and data processing by running non-transitory software programs, instructions, and modules stored in the memory 502, that is, implements the chinese character transcoding method in the above-described method embodiment.
Meanwhile, the present embodiment also provides a computer program product which, when executed by a processor of an electronic device, enables the electronic device to perform the Chinese character transcoding method of the first and second embodiments.
Other implementations of the examples of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This invention is intended to cover any variations, uses, or adaptations of embodiments of the invention following, in general, the principles of the embodiments of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the embodiments of the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the embodiments being indicated by the following claims.
It is to be understood that the embodiments of the invention are not limited to the precise arrangements and instrumentalities shown in the drawings, which have been described above, and that various modifications and changes may be made without departing from the scope thereof. The scope of embodiments of the invention is limited only by the appended claims.

Claims (8)

1. A Chinese character code conversion method is characterized by comprising the following steps:
receiving target byte stream data sent by a platform server;
Determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of an EBCDIC mode and a Unicode mode in the transcoding mode is an original encoding mode, and the other is a target encoding mode;
if the target byte stream data comprises two bytes, determining a target sub-mapping area table matched with the target byte stream data in a preset storage area according to a two-bit coding and transcoding mode of a first byte, wherein a plurality of sub-mapping area tables corresponding to the transcoding mode are stored in the preset storage area, and each sub-mapping area table is formed by dividing a total mapping area table corresponding to the transcoding mode according to a preset coding range corresponding to the first byte;
determining a target code after the target byte stream data is transcoded in a target mapping area table according to the four-bit code corresponding to the target byte stream data;
Converting the target byte stream data into the target code;
before the target sub-mapping area table matched with the target byte stream data is determined in the preset storage area according to the two-bit coding and transcoding mode of the first byte, the method further comprises the following steps:
acquiring a preset range byte corresponding to each transcoding mode;
Constructing a total mapping area table corresponding to the original coding mode according to the arrangement sequence of four-bit codes corresponding to two bytes in the bytes in a preset range;
Dividing the total mapping region table according to a preset coding dividing range corresponding to the first byte to form each sub mapping region table corresponding to the corresponding conversion mode;
The determining the target sub-mapping area table matched with the target byte stream data in a preset storage area according to the two-bit coding and transcoding mode of the first byte comprises the following steps:
determining a plurality of candidate sub-region mapping tables corresponding to the transcoding mode in a preset storage area according to the transcoding mode;
Judging whether the two-bit code of the first byte is in a preset code division range of the first byte of a certain candidate sub-mapping area table or not;
If the candidate sub-mapping area table is determined to be within the preset coding division range of the first byte of the candidate sub-mapping area table, determining the candidate sub-mapping area table as a target sub-mapping area table;
the determining the target code after the target byte stream data is transcoded in the target mapping area table according to the four-bit code corresponding to the target byte stream data comprises the following steps:
determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-mapping area table;
determining target codes from a target sub-region mapping table according to the arrangement positions;
the determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-mapping area table comprises the following steps:
Determining a difference value between a four-bit code corresponding to target byte stream data and a four-bit code of the initial code as a deviation value of the target byte stream data in a target sub-mapping area table;
And determining the arrangement position of the target byte stream data in a target sub-mapping area table according to the four-bit code of the initial code and the deviation value.
2. The method according to claim 1, wherein the transcoding mode is EBCDIC mode to Unicode mode, and the preset range bytes are all Chinese character bytes supported by EBCDIC coding;
Or the transcoding mode is a Unicode mode to an EBCDIC mode, and the bytes in the preset range are Chinese character bytes corresponding to all Chinese character bytes supported by the EBCDIC coding.
3. The method according to claim 1 or 2, wherein if the target byte stream data comprises four bytes and the original encoding mode is Unicode mode, the method further comprises:
Performing hash calculation on the codes of each byte of the target byte stream data to generate hash values corresponding to the bytes;
And generating EBCDIC codes corresponding to the bytes according to the hash value and the preset splicing value.
4. The method of claim 3, wherein if the target byte stream data comprises four bytes, the original encoding mode is EBCDIC mode, the method further comprising:
Removing the preset splicing value in the code of each byte in the target byte stream data to generate an intermediate value to be subjected to hash inverse operation;
And carrying out hash inverse operation on the intermediate value to generate Unicode codes corresponding to the bytes.
5. A chinese character code conversion device, comprising:
The receiving module is used for receiving the target byte stream data sent by the platform server;
the byte determining module is used for determining the number of bytes in the target byte stream data and a transcoding mode, wherein one of an EBCDIC mode and a Unicode mode in the transcoding mode is an original encoding mode, and the other is a target encoding mode;
The mapping table determining module is configured to determine, if it is determined that the target byte stream data includes two bytes, a target sub-mapping region table that matches the target byte stream data in a preset storage area according to a two-bit encoding and transcoding manner of a first byte, where a plurality of sub-mapping region tables corresponding to the transcoding manner are stored in the preset storage area, and each sub-mapping region table is formed by dividing a total mapping region table corresponding to the transcoding manner according to a preset encoding range corresponding to the first byte;
The code determining module is used for determining a target code after the target byte stream data is transcoded in a target mapping area table according to the four-bit code corresponding to the target byte stream data;
A conversion module for converting the target byte stream data into the target code;
The apparatus further comprises:
The construction module is used for acquiring preset range bytes corresponding to each transcoding mode; constructing a total mapping area table corresponding to the original coding mode according to the arrangement sequence of four-bit codes corresponding to two bytes in the bytes in a preset range; dividing the total mapping region table according to a preset coding dividing range corresponding to the first byte to form each sub mapping region table corresponding to the corresponding conversion mode;
The mapping table determining module is specifically configured to, when determining, in a preset storage area according to the two-bit encoding and transcoding manner of the first byte, a target sub-mapping area table that matches the target byte stream data:
Determining a plurality of candidate sub-region mapping tables corresponding to the transcoding mode in a preset storage area according to the transcoding mode; judging whether the two-bit code of the first byte is in a preset code division range of the first byte of a certain candidate sub-mapping area table or not; if the candidate sub-mapping area table is determined to be within the preset coding division range of the first byte of the candidate sub-mapping area table, determining the candidate sub-mapping area table as a target sub-mapping area table;
The code determining module is specifically configured to:
Determining the arrangement position of the target code of the target byte stream data in the target sub-mapping area table according to the four-bit code corresponding to the target byte stream data and the initial code of two bytes of the target sub-mapping area table; determining target codes from a target sub-region mapping table according to the arrangement positions;
The code determining module is specifically configured to, when determining an arrangement position of a target code of the target byte stream data in the target sub-mapping area table according to a four-bit code corresponding to the target byte stream data and a start code of two bytes of the target sub-mapping area table:
Determining a difference value between a four-bit code corresponding to target byte stream data and a four-bit code of the initial code as a deviation value of the target byte stream data in a target sub-mapping area table; and determining the arrangement position of the target byte stream data in a target sub-mapping area table according to the four-bit code of the initial code and the deviation value.
6. An electronic device, comprising: a memory and a processor;
the memory being configured to store instructions executable by the processor;
wherein the processor is configured to execute the Chinese character transcoding method of any one of claims 1 to 4.
7. A computer readable storage medium having stored therein computer executable instructions which, when executed by a processor, implement the Chinese character transcoding method of any one of claims 1 to 4.
8. A computer program product comprising a computer program which, when executed by a processor, implements the Chinese character transcoding method of any one of claims 1 to 4.
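The table construction and lookup recited in claim 5 can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes that the "four-bit code" of two bytes is their value read as four hexadecimal digits, and that a sub-mapping area table stores target codes contiguously from its start code. The names `SubTable`, `split_total_table`, `find_sub_table`, `convert`, the mode label "demo", and all table values are hypothetical placeholders, not real code-page data.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class SubTable:
    """Hypothetical sub-mapping area table for one first-byte range."""
    first_lo: int        # lowest first byte covered by this table
    first_hi: int        # highest first byte covered by this table
    start: int           # two-byte start code of the table
    targets: List[int]   # targets[i] is the target code for source code start + i


def split_total_table(range_start: int, total: List[int],
                      first_byte_ranges: List[Tuple[int, int]]) -> List[SubTable]:
    """Divide a total table (ordered by source code) by first-byte ranges."""
    range_end = range_start + len(total) - 1
    subs = []
    for lo, hi in first_byte_ranges:
        start = max(lo << 8, range_start)
        end = min((hi << 8) | 0xFF, range_end)
        if start <= end:
            piece = total[start - range_start:end - range_start + 1]
            subs.append(SubTable(lo, hi, start, piece))
    return subs


def find_sub_table(tables: Dict[str, List[SubTable]], mode: str,
                   first_byte: int) -> Optional[SubTable]:
    """Pick the candidate table whose first-byte range contains first_byte."""
    for table in tables.get(mode, []):
        if table.first_lo <= first_byte <= table.first_hi:
            return table
    return None


def convert(tables: Dict[str, List[SubTable]], mode: str,
            data: bytes) -> Optional[int]:
    """Look up the target code for the first two bytes of data."""
    if len(data) < 2:
        return None
    table = find_sub_table(tables, mode, data[0])
    if table is None:
        return None
    code = int.from_bytes(data[:2], "big")
    offset = code - table.start          # the deviation value of claim 5
    if 0 <= offset < len(table.targets):
        return table.targets[offset]
    return None


# Placeholder data: 0x200 made-up target codes for source codes from 0xB0A1.
TOTAL = [0x4E00 + i for i in range(0x200)]
TABLES = {"demo": split_total_table(0xB0A1, TOTAL, [(0xB0, 0xB0), (0xB1, 0xB1)])}
```

In this sketch a lookup costs one scan over the few candidate tables of a transcoding mode plus one subtraction and one index operation, so no search over the whole mapping is needed. For example, `convert(TABLES, "demo", b"\xb1\x02")` selects the table for first byte 0xB1 and reads the entry at deviation value 2.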
CN202110654817.0A 2021-06-11 2021-06-11 Chinese character code conversion method, device, equipment, medium and product Active CN113282298B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110654817.0A CN113282298B (en) 2021-06-11 2021-06-11 Chinese character code conversion method, device, equipment, medium and product

Publications (2)

Publication Number Publication Date
CN113282298A CN113282298A (en) 2021-08-20
CN113282298B CN113282298B (en) 2024-06-07

Family

ID=77284373

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110654817.0A Active CN113282298B (en) 2021-06-11 2021-06-11 Chinese character code conversion method, device, equipment, medium and product

Country Status (1)

Country Link
CN (1) CN113282298B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101553779A (en) * 2006-10-17 2009-10-07 三星Sds株式会社 Migration apparatus which convert application program of mainframe system into application program of open system and method for thereof
CN103914436A (en) * 2013-01-09 2014-07-09 国际商业机器公司 Code conversion method, computer device for providing small target encoding space and program product
CN106484661A (en) * 2015-08-25 2017-03-08 北京郑码世纪信息技术有限公司 A kind of method of EBCDIC coding extension
CN112036121A (en) * 2020-08-31 2020-12-04 浪潮商用机器有限公司 Simplified Chinese character and traditional Chinese character conversion method and related device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251667B2 (en) * 2002-03-21 2007-07-31 International Business Machines Corporation Unicode input method editor
CA2426496A1 (en) * 2003-04-24 2004-10-24 Ibm Canada Limited - Ibm Canada Limitee Processing fixed-format data in a unicode environment
CN100489847C (en) * 2005-07-18 2009-05-20 王宏源 Method for inquiring four syllable character at network and its system
US9048854B2 (en) * 2013-03-12 2015-06-02 Bmc Software Inc. Unicode character conversion using one of two conversion services
CN111428441B (en) * 2020-03-18 2024-01-19 中国电子科技集团公司第二十八研究所 Information system cross-platform application oriented Chinese character code conversion method and equipment

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
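Chinese character encoding of IBM-series mainframes and PCs and its code conversion; 苗诚 (Miao Cheng); 中国金融电脑 (China Financial Computer); 2006-03-25 (03); pp. 42-43 *
Design and implementation of a "compressed cipher type" communication encoding for the Chinese national standard character code; 葛友昌 (Ge Youchang); 计算机工程 (Computer Engineering); 1988-12-20 (06); pp. 62-67 *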

Also Published As

Publication number Publication date
CN113282298A (en) 2021-08-20

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant