TWI634439B - Matching system for to-be-translated documents - Google Patents

Matching system for to-be-translated documents Download PDF

Info

Publication number
TWI634439B
TWI634439B TW106135683A TW106135683A TWI634439B TW I634439 B TWI634439 B TW I634439B TW 106135683 A TW106135683 A TW 106135683A TW 106135683 A TW106135683 A TW 106135683A TW I634439 B TWI634439 B TW I634439B
Authority
TW
Taiwan
Prior art keywords
language
translation
database
category
file
Prior art date
Application number
TW106135683A
Other languages
Chinese (zh)
Other versions
TW201917604A (en
Inventor
陳智偉
吳雋偉
Original Assignee
統一數位翻譯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 統一數位翻譯股份有限公司 filed Critical 統一數位翻譯股份有限公司
Priority to TW106135683A priority Critical patent/TWI634439B/en
Application granted granted Critical
Publication of TWI634439B publication Critical patent/TWI634439B/en
Publication of TW201917604A publication Critical patent/TW201917604A/en

Links

Landscapes

  • Machine Translation (AREA)

Abstract

本發明之一種翻譯文件的媒合系統包括至少一電子裝置及一後端伺服器,電子裝置包括一翻譯文件,且電子裝置以網路連線的方式連接至該後端伺服器。後端伺服器包括一語言辨識單元、一類別辨識單元、一指派人員資料庫、一翻譯師資料庫、一媒合單元、一原文資料庫及一比對單元。語言辨識單元依據翻譯文件的內容產生一語言分析結果。類別辨識單元包括一領域資料庫,類別辨識單元將翻譯文件與領域資料庫內的資料進行比對,以產生一類別分析結果。指派人員資料庫包括多個不同的指派人員的派案紀錄,派案紀錄用以統計該指派人員擅長分案的語言及領域類別。翻譯師資料庫包括多個不同的翻譯師的翻譯紀錄,翻譯紀錄具有所對應的翻譯師已翻譯過的語言及領域類別。媒合單元依據語言分析結果及類別分析結果媒合並統計同樣語言及同樣領域類別的派案紀錄及翻譯紀錄。 A mediation system for translating files includes at least one electronic device and a backend server. The electronic device includes a translation file, and the electronic device is connected to the backend server in a network connection manner. The backend server includes a language recognition unit, a category identification unit, an assigned personnel database, a translator database, a media unit, a source database, and a comparison unit. The language recognition unit generates a language analysis result based on the content of the translated file. The category identification unit includes a domain database, and the category identification unit compares the translation file with the data in the domain database to generate a category analysis result. The assigned personnel database includes the dispatch records of a number of different assignees, and the dispatch records are used to count the language and domain categories in which the assignee is good at dividing the case. The translator's database consists of translation records of a number of different translators with the translated language and domain categories that the corresponding translator has translated. The media unit combines the linguistic analysis results and the category analysis results to collect statistical records and translation records of the same language and the same field category.

Description

翻譯文件的媒合系統 Translation system

本發明提供一翻譯文件的媒合系統,特別是一種具有語言辨識單元、類別辨識單元及比對單元的媒合系統。 The invention provides a mediation system for translating files, in particular a mediation system with a language recognition unit, a category identification unit and a comparison unit.

現今,翻譯接案最初的作業處理流程通常需要仰賴人工判斷的作業程序,「業務人員」先分配給適合的翻譯文件「指派人員」,「指派人員」再指派翻譯工作給適合的「翻譯師」。 Nowadays, the initial job processing process of the translation case usually needs to rely on the manual judgment of the operation program. The "business personnel" is first assigned to the appropriate translation file "assignment personnel", and the "designation personnel" then assigns the translation work to the appropriate "translator". .

詳細來說:承接翻譯案件的「業務人員」及「指派人員」必需要先對翻譯文件進行初步的判斷:判斷文件的語言別及專業領域類別。判斷的依據為客戶提供的資訊及判斷人員的學識、經驗。若客戶提供的資訊有錯誤或判斷人員的學識、經驗不足,就有可能判斷錯誤。判斷錯誤會造成指派的「翻譯師」不適當。詳細來說,每位「翻譯師」都有其真正擅長翻譯的文件類型,若「翻譯師」硬接不擅長的文件類型時,除了會需要花更多的時間、心力進行翻譯外,翻譯的品質也容易出問題。此外,就算文件判斷正確,能否指派文件給最適合的「翻譯師」,也會因「指派人員」處理這類文件的經驗而受影響,若由經驗較不足的「指派人員」來處理,產生問題的機會也會比較高。 In detail: the "business personnel" and "designated personnel" who undertake translation cases must first make a preliminary judgment on the translation documents: determine the language of the documents and the categories of professional fields. The basis for judgment is the information provided by the customer and the knowledge and experience of the judge. If the information provided by the customer is wrong or the knowledge of the judge is insufficient, it is possible to judge the error. Judging errors can make the assigned "translator" inappropriate. In detail, each "translator" has a file type that he is really good at translating. If the "translator" hard-wires a file type that is not good at it, it will take more time and effort to translate. Quality is also prone to problems. In addition, even if the documents are judged correctly, the ability to assign documents to the most suitable "translators" will be affected by the experience of "designating staff" in handling such documents. If it is handled by an "designated person" with insufficient experience, The chances of causing problems will also be higher.

另外,當有大量或有時間壓力的翻譯文件需要進行判斷時,處理人員也不容易在短時間內消化並正確判斷所有的文件。仰賴人工判斷的作業程序,除了 容易判斷錯誤外,指派翻譯工作的效率也不會太好,影響到翻譯案件的處理進度。 In addition, when there are a large number of or time-stressed translation files that need to be judged, it is not easy for the processing personnel to digest and correctly judge all the documents in a short time. Relying on manual judgment of the operating procedures, except In addition to making it easy to judge mistakes, the efficiency of assigning translation work will not be too good, affecting the progress of the translation case.

因此,如何建立一個自動化的翻譯文件媒合系統,來幫助不同類型的翻譯文件迅速且正確的找到適合之「指派人員」及「翻譯師」,便是本領域具有通常知識者值得去思量地。 Therefore, how to establish an automated translation document matching system to help different types of translation files quickly and correctly find suitable "designers" and "translators" is a common knowledge in the field worth considering.

本發明之目的在於提供一翻譯文件的媒合系統,該媒合系統能針對客戶送翻譯文件,進行「語言別」、「類別」辨識及「近似文件」比對,進而提供適合的「指派人員」、「翻譯師」及「近似文件」之推薦,提高內部處理流程的效率及正確性,降低成本,減少人為因素造成的損失。 The object of the present invention is to provide a mediation system for translating documents, which can send translation files to customers, perform "language", "category" identification and "approximate document" comparison, thereby providing suitable "designers" Recommendations for "Translators" and "Approximate Documents" to improve the efficiency and correctness of internal processes, reduce costs and reduce losses caused by human factors.

本發明之一種翻譯文件的媒合系統包括至少一電子裝置及一後端伺服器,電子裝置包括一翻譯文件,且電子裝置以網路連線的方式連接至該後端伺服器。此外,後端伺服器包括一翻譯師資料庫、一語言辨識單元、一類別辨識單元及一媒合單元。其中,翻譯師資料庫包括多個不同的翻譯師的翻譯紀錄,翻譯紀錄具有所對應的翻譯師已翻譯過的語言及領域類別。語言辨識單元依據翻譯文件的內容產生一語言分析結果,而類別辨識單元包括一領域資料庫,類別辨識單元將翻譯文件與領域資料庫內的資料進行比對,以產生一類別分析結果。媒合單元依據該語言分析結果及類別分析結果媒合並統計同樣語言及同樣領域類別的翻譯紀錄,提供翻譯師推薦。 A mediation system for translating files includes at least one electronic device and a backend server. The electronic device includes a translation file, and the electronic device is connected to the backend server in a network connection manner. In addition, the backend server includes a translator database, a language recognition unit, a category identification unit, and a mediation unit. Among them, the translator database includes translation records of a number of different translators, and the translation records have the translated language and domain categories that the corresponding translator has translated. The language recognition unit generates a language analysis result according to the content of the translation file, and the category identification unit includes a domain database, and the category identification unit compares the translation file with the data in the domain database to generate a category analysis result. Based on the language analysis results and the category analysis results, the media unit combines the translation records of the same language and the same field category to provide translators' recommendations.

在上所述之翻譯文件的媒合系統,其中後端伺服器還包括一指派人員資料庫,該指派人員資料庫包括多個不同的指派人員的派案紀錄,派案紀錄用以統計該指派人員擅長分案的語言及領域類別。在上所述之翻譯文件的媒合系統,其中媒合單元依據該語言分析結果及該類別分析結果媒合並統計相同語言及相同領域類別的派案紀錄。提供指派人員推薦。 The mediation system for translating files as described above, wherein the backend server further includes an assigned personnel database, the dispatcher database includes a plurality of dispatch records of different assignees, and the dispatch records are used to count the assignments. The staff is good at dividing the language and domain categories. In the above-mentioned mediation system for translating documents, the mediation unit combines the statistics of the language analysis result and the analysis result of the category to collect the record records of the same language and the same domain category. Provide a designated person recommendation.

在上所述之翻譯文件的媒合系統還包括一原文資料庫及一比對單元,原文資料庫內儲存多個原文資料,每一個原文資料具有一相對應的翻譯師資料及一相對應的指派人員資料。比對單元將翻譯文件與每一個原文資料進行比對,依近似程度,提供相似文件推薦。 The mediation system for translating the file further includes an original database and a comparison unit. The original database stores a plurality of original materials, each of which has a corresponding translator data and a corresponding Assign personnel information. The comparison unit compares the translated document with each original document and provides similar document recommendation according to the degree of approximation.

在上所述之翻譯文件的媒合系統,其中該電子裝置還包括一顯示介面,且該語言分析結果及該類別分析結果被傳送至該顯示介面。 The mediation system for translating a file as described above, wherein the electronic device further comprises a display interface, and the language analysis result and the category analysis result are transmitted to the display interface.

在上所述之翻譯文件的媒合系統,其中顯示介面包括一學習反饋機制,學習反饋機制依據一使用者傳送一反饋訊息至類別辨識單元,再將一學習資訊匯入該領域資料庫。 In the mediation system for translating the file, the display interface includes a learning feedback mechanism. The learning feedback mechanism transmits a feedback message to the category identification unit according to a user, and then the learning information is imported into the domain database.

為讓上述特徵和優點能更明顯易懂,下文特舉較佳實施例,並配合所附圖式,作詳細說明如下。 In order to make the above features and advantages more apparent, the preferred embodiments are described below, and in conjunction with the accompanying drawings.

1、2‧‧‧翻譯文件的媒合系統 1, 2‧‧‧Mediating system for translation documents

10‧‧‧後端伺服器 10‧‧‧Backend server

101、201‧‧‧翻譯師資料庫 101, 201‧‧‧Translator Database

101D‧‧‧翻譯紀錄 101D‧‧‧ translation record

102‧‧‧語言辨識單元 102‧‧‧Language Identification Unit

102A‧‧‧語言分析結果 102A‧‧‧Language analysis results

103‧‧‧類別辨識單元 103‧‧‧Category Identification Unit

103B‧‧‧領域資料庫 103B‧‧‧ Domain Database

103A‧‧‧類別分析結果 103A‧‧‧ category analysis results

104‧‧‧媒合單元 104‧‧‧Media Unit

201D‧‧‧翻譯紀錄 201D‧‧‧ translation record

305‧‧‧指派人員資料庫 305‧‧‧ Designated personnel database

305D‧‧‧派案紀錄 305D‧‧‧ record

403‧‧‧比對單元 403‧‧‧ comparison unit

405‧‧‧原文資料庫 405‧‧‧ original database

405A‧‧‧原文資料 405A‧‧‧ original information

405B‧‧‧翻譯師資料 405B‧‧‧Translator Information

405C‧‧‧指派人員資料 405C‧‧‧Designation of personnel information

12‧‧‧電子裝置 12‧‧‧Electronic devices

120‧‧‧翻譯文件 120‧‧‧Translated documents

121‧‧‧顯示介面 121‧‧‧Display interface

121R‧‧‧學習反饋機制 121R‧‧‧ learning feedback mechanism

1212‧‧‧反饋訊息 1212‧‧‧Feedback

圖1所繪示為本實施例之翻譯文件的媒合系統1的表格示意圖。 FIG. 1 is a schematic diagram showing a table of the mediation system 1 for translating files according to the embodiment.

圖2所繪示為顯示介面121的示意圖。 FIG. 2 is a schematic diagram of the display interface 121.

圖3所繪示為另一實施例之翻譯文件的媒合系統2的表格示意圖。 FIG. 3 is a table diagram showing a mediation system 2 for translating files according to another embodiment.

請參閱圖1,圖1所繪示為本實施例之翻譯文件的媒合系統1的表格示意圖。翻譯文件的媒合系統1包括至少一電子裝置12及一後端伺服器10。其中,電子裝置12包括一翻譯文件120及一顯示介面121,電子裝置12例如為桌上型電腦、筆記型電腦、平板電腦及智慧型手機,而翻譯文件120便是新客戶的翻譯案件,也就是尚未分案給「指派人員」及「翻譯師」的文件。此外,電子裝置12是以網路連線的方式連接至後端伺服器10,後端伺服器10包括一翻譯師資料庫101、一語言辨識單元102、類別辨識單元103及一媒合單元104。其中,翻譯師資料庫101包括多個不同的翻譯紀錄101D,翻譯紀錄101D是記錄著所對應的翻譯師已經翻 譯過的語言及領域類別。詳細來說,每一個翻譯紀錄101D是對應到一個翻譯師(不同的翻譯紀錄101D分別對應到翻譯師A、翻譯師B及翻譯師C),而每一個翻譯師都有擅長翻譯的“語言”及“領域類別”。其中,常見『語言』,例如:中文、英文、日文、韓文...等,『領域類別』指的是翻譯文件120所屬的專業領域,例如:電子、機械、電腦、化學、電機、通信、物理...等。此外,語言辨識單元102是利用各語言的特徵來分析翻譯文件120的內容,以決定翻譯文件120所屬的“語言”。舉例來說,若翻譯文件120的文字編碼、使用的標點符號、句構…等特徵屬於繁體中文時,則語言辨識單元102會判定這個翻譯文件120所屬的『語言』為“繁體中文”。語言分析結果102A會被傳送至顯示介面121。例如圖2中所顯示的繁體中文的比例佔整個文件的95.9%、英文的比例佔整個文件的4.1%。這樣一來,該使用者也能透過顯示介面121了解翻譯文件120的語言組成。 Please refer to FIG. 1. FIG. 1 is a schematic diagram of a table of the mediation system 1 for translating files according to the embodiment. The mediation system 1 for translating files includes at least one electronic device 12 and a backend server 10. The electronic device 12 includes a translation file 120 and a display interface 121. The electronic device 12 is, for example, a desktop computer, a notebook computer, a tablet computer, and a smart phone. The translated file 120 is a translation case of a new customer. It is a document that has not been divided into "designated personnel" and "translators". In addition, the electronic device 12 is connected to the backend server 10 by means of a network connection. The backend server 10 includes a translator database 101, a language recognition unit 102, a category identification unit 103, and a mediation unit 104. . The translator database 101 includes a plurality of different translation records 101D, and the translation record 101D records that the corresponding translator has turned over. Translated language and domain categories. In detail, each translation record 101D corresponds to a translator (different translation records 101D correspond to translator A, translator B and translator C respectively), and each translator has a "language" that is good at translation. And "domain categories." Among them, common "language", such as: Chinese, English, Japanese, Korean, etc., "domain category" refers to the professional field to which the translation file 120 belongs, such as: electronics, machinery, computers, chemistry, motors, communications, Physics...etc. Further, the language recognizing unit 102 analyzes the content of the translated file 120 using the features of each language to determine the "language" to which the translated file 120 belongs. For example, if the character encoding, the punctuation, the sentence structure, and the like of the translation file 120 belong to the traditional Chinese, the language recognition unit 102 determines that the "language" to which the translation file 120 belongs is "traditional Chinese". The language analysis result 102A is transmitted to the display interface 121. For example, the proportion of traditional Chinese shown in Figure 2 accounts for 95.9% of the entire file, and the proportion of English accounts for 4.1% of the entire file. In this way, the user can also understand the language composition of the translated file 120 through the display interface 121.

另外,類別辨識單元103包括一領域資料庫103B,領域資料庫103B內會儲存屬於各種不同專業領域的範本資料集,這些範本資料集的來源主要是維基百科,下載原始資料後,先經分類,再經文字探勘技術的分析整理,完成不同專業領域的範本資料集。類別辨識單元103將翻譯文件120與領域資料庫103B內的資料進行比對分析,以產生一類別分析結果103A。類別分析結果103A會依翻譯文件120與各領域相似的比例顯示,例如圖2所顯示的通信/網路領域所佔的比例為39.8%、運輸所佔領域的比例為20.9%、電腦/資訊科技領域所佔的比例為20.2%、電機領域所佔的比例為19.1%。類別分析結果103A同樣會被傳送至顯示介面121,這樣一來,使用者也能透過顯示介面121了解翻譯文件120的主要專業領域類別。 In addition, the category identification unit 103 includes a domain database 103B. The domain database 103B stores template data sets belonging to various professional fields. The source of the template data sets is mainly Wikipedia. After downloading the original data, the classification is performed. After the analysis and collation of the text exploration technology, the model data sets of different professional fields are completed. The category identifying unit 103 compares the translated file 120 with the data in the domain database 103B to generate a category analysis result 103A. The category analysis result 103A will be displayed in a similar proportion to the translated domain 120, for example, the proportion of the communication/network field shown in Figure 2 is 39.8%, the proportion of the transportation field is 20.9%, and the computer/information technology The proportion of the field is 20.2%, and the proportion of the motor field is 19.1%. The category analysis result 103A is also transmitted to the display interface 121, so that the user can also understand the main area of interest of the translated file 120 through the display interface 121.

另外,翻譯文件的媒合系統1的後端伺服器10更包括一指派人員資料庫305。其中,指派人員資料庫305包括多個不同的指派人員的派案紀錄305D,派案紀錄305D是用以統計指派人員擅長分案的屬性。詳細來說,每一個派案紀錄 305D是對應到一個指派人員(不同的派案紀錄305D分別對應到指派人員D、指派人員E及指派人員F),該屬性也是類指翻譯文件120的『語言』及『類別』。因此,媒合單元依據語言分析結果102A及類別分析結果103A媒合並統計相同語言及相同領域類別的派案紀錄305D。這樣一來,不同的翻譯文件120便能快速找到適合的「指派人員」進行分案作業。 In addition, the backend server 10 of the mediation system 1 that translates files further includes an assigned personnel database 305. The assigned personnel database 305 includes a plurality of dispatching records 305D of different assigning personnel, and the dispatching record 305D is used to count the attributes of the assigning personnel to be good at dividing the case. In detail, each case record 305D corresponds to an assigned person (different dispatch records 305D correspond to assigning personnel D, assigning personnel E, and assigning personnel F respectively), and the attribute is also referred to as "language" and "category" of the translated file 120. Therefore, the merging unit combines the linguistic analysis result 102A and the category analysis result 103A to collect the genre record 305D of the same language and the same domain category. In this way, different translation files 120 can quickly find a suitable "designator" to perform the divisional work.

之後,媒合單元104依據該翻譯文件的『語言』及領域『類別』媒合並統計同樣語言及同樣領域類別的翻譯紀錄101D。因此,經由媒合而來的翻譯紀錄101D便能找出推薦的翻譯師(翻譯紀錄101D所對應的翻譯師)。如此一來,各種類型的翻譯文件120都能快速找到適合的「翻譯師」進行翻譯作業,以提高內部處理流程的效率及正確性,降低人為因素的損失。 Thereafter, the matching unit 104 combines the translation records 101D of the same language and the same domain category according to the "language" and the field "category" of the translated document. Therefore, the translated translator 101D can find the recommended translator (translator corresponding to the translation record 101D). In this way, various types of translation files 120 can quickly find a suitable "translator" to translate, to improve the efficiency and correctness of the internal processing process, and reduce the loss of human factors.

請參閱圖2,顯示介面121還具有一學習反饋機制121R,學習反饋機制121R依據一使用者傳送一反饋訊息1212至類別辨識單元103,再將一學習資訊匯入領域資料庫103B。其中,反饋訊息1212例如為「正確」、「錯誤」或「不確定」。如此一來,學習反饋機制121R便能幫助類別辨識單元103進行自我學習,以提高之後辨識翻譯文件120所屬專業領域類別的正確率。 Referring to FIG. 2, the display interface 121 further has a learning feedback mechanism 121R. The learning feedback mechanism 121R transmits a learning message 1212 to the category identifying unit 103 according to a user, and then imports a learning information into the domain database 103B. The feedback message 1212 is, for example, "correct", "error" or "unsure". In this way, the learning feedback mechanism 121R can help the category recognizing unit 103 perform self-learning to improve the correct rate of the professional domain category to which the translation file 120 belongs later.

請參閱圖3,圖3所繪示為又一實施例之翻譯文件的媒合系統2的表格示意圖。翻譯文件的媒合系統2是經由翻譯文件的媒合系統1所演化而來,翻譯文件的媒合系統2更包括一原文資料庫405及一比對單元403。其中,原文資料庫405內儲存多個原文資料405A,且每一個原文資料405A具有一相對應的翻譯師資料405B及一相對應的指派人員資料405C。並且,比對單元403是用於比對每一個原文資料405A與翻譯文件120的近似程度,找出近似翻譯文件120的一筆或多筆原文資料405A,並可同時得知所對應的翻譯師資料405B及指派人員資料405C。找到的一筆或多筆近似原文資料405A,可供「業務人員」及「指派人員」利用:直接使用加上潤稿完成翻譯或提供給被指派到該翻譯文件的翻譯師參考或找近 似原文資料405A的原翻譯師405B來完成翻譯。找出近似的原文資料405A除了能夠幫助找到適合的翻譯師人選之外,也能減少需翻譯的字數,加快翻譯師作業的時間,節省翻譯成本。 Please refer to FIG. 3. FIG. 3 is a schematic diagram of a table of the mediation system 2 for translating files according to still another embodiment. The mediation system 2 for translating files is evolved through the mediation system 1 for translating files. The mediation system 2 for translating files further includes an original document database 405 and a comparison unit 403. The original database 405 stores a plurality of original materials 405A, and each of the original materials 405A has a corresponding translator information 405B and a corresponding assigned personnel data 405C. Moreover, the comparison unit 403 is configured to compare the degree of approximation of each original document 405A with the translation file 120, find one or more original text materials 405A of the approximate translation file 120, and simultaneously obtain the corresponding translator data. 405B and assign personnel data 405C. One or more similar original texts found 405A, which can be used by "business people" and "designated personnel": direct use plus the draft to complete the translation or provide to the translator assigned to the translation document for reference or near The original translator 405B, like the original material 405A, completes the translation. Finding the approximate source material 405A can help reduce the number of words to be translated, speed up the time for translators, and save translation costs, in addition to helping to find suitable translator candidates.

雖然本發明已以較佳實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明之精神和範圍內,當可作些許之更動與潤飾,因此本發明之保護範圍當視後附之申請專利範圍所界定者為準。 Although the present invention has been disclosed in the above preferred embodiments, it is not intended to limit the invention, and any one of ordinary skill in the art can make some modifications and refinements without departing from the spirit and scope of the invention. Therefore, the scope of the invention is defined by the scope of the appended claims.

Claims (5)

一種翻譯文件的媒合系統,包括:至少一電子裝置,包括一翻譯文件;及一後端伺服器,該電子裝置以網路連線的方式連接至該後端伺服器,該後端伺服器,包括:一翻譯師資料庫,包括多個不同的翻譯師的翻譯紀錄,該翻譯紀錄具有所對應的翻譯師已翻譯過的語言及領域類別;一語言辨識單元,該語言辨識單元依據該翻譯文件的內容產生一語言分析結果;一類別辨識單元,包括一領域資料庫,該類別辨識單元將翻譯文件與領域資料庫內的資料進行比對,以產生一類別分析結果;一媒合單元,該媒合單元依據該語言分析結果及該類別分析結果媒合並統計同樣語言及同樣領域類別的該翻譯紀錄;其中該後端伺服器還包括一指派人員資料庫,該指派人員資料庫包括多個不同的指派人員的派案紀錄,該派案紀錄用以統計該指派人員擅長分案的語言及領域類別。 A mediation system for translating files, comprising: at least one electronic device, including a translation file; and a backend server connected to the backend server by a network connection, the backend server , comprising: a database of translators, including translation records of a plurality of different translators, the translation records having corresponding language and domain categories translated by the translator; a language recognition unit, the language recognition unit is based on the translation The content of the file generates a language analysis result; a category identification unit includes a domain database, and the category identification unit compares the translation file with the data in the domain database to generate a category analysis result; The media unit combines the translation result of the language and the analysis result of the same language and the same domain category according to the language analysis result; wherein the backend server further includes an assigned personnel database, and the designated personnel database includes multiple The record of the assignment of different assignees, which is used to count the language and field in which the assignee is good at dividing the case. do not. 如申請專利範圍第1項所述之翻譯文件的媒合系統,其中該媒合單元依據該語言分析結果及該類別分析結果媒合並統計相同語言及相同領域類別的該派案紀錄。 For example, the mediation system for translating documents according to item 1 of the patent application, wherein the media unit combines the results of the language analysis and the analysis results of the category to count the records of the same language and the same field category. 如申請專利範圍第1項所述之翻譯文件的媒合系統,還包括:一原文資料庫,該原文資料庫內儲存多個原文資料,每一個原文資料具有一相對應的翻譯師資料及一相對應的指派人員資料;一比對單元,該比對單元比對每一個原文資料與該翻譯文件的近似程度。 The mediation system for translating documents as described in claim 1 further includes: a source database storing a plurality of source materials, each of the source materials having a corresponding translator profile and a Corresponding assigned personnel data; a comparison unit that compares the approximate degree of each original material with the translated file. 如申請專利範圍第1項所述之翻譯文件的媒合系統,其中該電子裝置還包括一顯示介面,且該語言分析結果及該類別分析結果被傳送至該顯示介面。 The mediation system of the translation file of claim 1, wherein the electronic device further comprises a display interface, and the language analysis result and the category analysis result are transmitted to the display interface. 如申請專利範圍第4項所述之翻譯文件的媒合系統,其中該顯示介面包括一學習反饋機制,該學習反饋機制依據一使用者傳送一反饋訊息至類別辨識單元,再將一學習資訊匯入該領域資料庫。 The mediation system of the translation document described in claim 4, wherein the display interface comprises a learning feedback mechanism, and the learning feedback mechanism transmits a feedback message to the category identification unit according to a user, and then learns a learning information. Into the field database.
TW106135683A 2017-10-18 2017-10-18 Matching system for to-be-translated documents TWI634439B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW106135683A TWI634439B (en) 2017-10-18 2017-10-18 Matching system for to-be-translated documents

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW106135683A TWI634439B (en) 2017-10-18 2017-10-18 Matching system for to-be-translated documents

Publications (2)

Publication Number Publication Date
TWI634439B true TWI634439B (en) 2018-09-01
TW201917604A TW201917604A (en) 2019-05-01

Family

ID=64453036

Family Applications (1)

Application Number Title Priority Date Filing Date
TW106135683A TWI634439B (en) 2017-10-18 2017-10-18 Matching system for to-be-translated documents

Country Status (1)

Country Link
TW (1) TWI634439B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140019113A1 (en) * 2012-07-13 2014-01-16 Microsoft Corporation Phrase-based dictionary extraction and translation quality evaluation
CN105138521A (en) * 2015-08-27 2015-12-09 武汉传神信息技术有限公司 General translator recommendation method for risk project in translation industry
TW201629796A (en) * 2015-01-16 2016-08-16 Gurunavi Inc Information multilingual conversion system
TWM543395U (en) * 2017-03-24 2017-06-11 shi-cheng Zhuang Translation assistance system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140019113A1 (en) * 2012-07-13 2014-01-16 Microsoft Corporation Phrase-based dictionary extraction and translation quality evaluation
TW201629796A (en) * 2015-01-16 2016-08-16 Gurunavi Inc Information multilingual conversion system
CN105138521A (en) * 2015-08-27 2015-12-09 武汉传神信息技术有限公司 General translator recommendation method for risk project in translation industry
TWM543395U (en) * 2017-03-24 2017-06-11 shi-cheng Zhuang Translation assistance system

Also Published As

Publication number Publication date
TW201917604A (en) 2019-05-01

Similar Documents

Publication Publication Date Title
US20190197119A1 (en) Language-agnostic understanding
US10891430B2 (en) Semi-automated methods for translating structured document content to chat-based interaction
US11914968B2 (en) Official document processing method, device, computer equipment and storage medium
US20160371246A1 (en) System and method of template creation for a data extraction tool
WO2021151270A1 (en) Method and apparatus for extracting structured data from image, and device and storage medium
US20190171873A1 (en) Cognitive document image digitization
US11727213B2 (en) Automatic conversation bot generation using input form
CN111512315A (en) Block-wise extraction of document metadata
US10496751B2 (en) Avoiding sentiment model overfitting in a machine language model
US20200184584A1 (en) Systems and methods for legal clause matching and explanation
TWI743623B (en) Artificial intelligence-based business intelligence system and its analysis method
CN111263943B (en) Semantic normalization in document digitization
US20200057799A1 (en) System and method for generating a proposal based on a request for proposal (rfp)
TW202134931A (en) A method and system for automatically generating a data collection module
WO2019035280A1 (en) Intellectual property support device, intellectual property support method, and intellectual property support program
CN112364145A (en) Work order processing method and device, electronic equipment and storage medium
US11423219B2 (en) Generation and population of new application document utilizing historical application documents
US11386263B2 (en) Automatic generation of form application
CN110990527A (en) Automatic question answering method and device, storage medium and electronic equipment
US20190197103A1 (en) Asynchronous speech act detection in text-based messages
EP3738053A1 (en) Smart search for annotations and inking
TWI634439B (en) Matching system for to-be-translated documents
CN105721586A (en) Information intelligent distribution device, method and system
US20210165835A1 (en) Computer driven question identification and understanding within a commerical tender document for automated bid processing for rapid bid submission and win rate enhancement
US20220405503A1 (en) Machine learning-based document splitting and labeling in an electronic document system