TWI762103B

TWI762103B - Method and system for machine reading comprehension

Info

Publication number: TWI762103B
Application number: TW109145608A
Authority: TW
Inventors: 吳宣緯
Original assignee: 財團法人工業技術研究院
Priority date: 2020-12-23
Filing date: 2020-12-23
Publication date: 2022-04-21
Also published as: TW202226000A; CN114741484A

Abstract

A method for machine reading comprehension comprises obtaining a question text and an article text associated with the question text, generating a first knowledge text corresponding to the question text and a second knowledge text corresponding to the article text according to a knowledge set, encoding the question and the article text to generate an initial target text code, encoding the first knowledge text and the second knowledge text to generate a knowledge text code, executing a fusion calculation of the initial target text code and the knowledge text code to introduce part of knowledge in the knowledge set into the initial target text code to generate a strengthened target text code, and obtaining an answer corresponding to the question text based on the strengthened target text code and outputting the answer.

Description

機器閱讀理解方法及系統Machine reading comprehension method and system

本發明係關於一種自然語言處理方法。 The present invention relates to a natural language processing method.

機器閱讀理解(Machine Reading Comprehension，MRC)為一種讓電腦閱讀文章並解答相關問題的技術。近年來，各行各業之文字資料大量產生，若以傳統的人工處理方式，例如羅列FAQ，將面臨處理速度慢、開銷巨大、問答對無法全面覆蓋等問題，甚至成為企業發展的瓶頸，因此對於機器閱讀理解的需求逐步提升。 Machine Reading Comprehension (MRC) is a technology that allows computers to read texts and answer related questions. In recent years, a large number of written data from all walks of life have been generated. If traditional manual processing methods are used, such as listing FAQs, there will be problems such as slow processing speed, huge overhead, and inability to comprehensively cover questions and answers, and even become a bottleneck for enterprise development. Therefore, for The demand for machine reading comprehension is gradually increasing.

然而，一般來說，為了文章的簡潔及文學之美，作者在撰寫文章時往往會省略人們既有的常識，另外，專業文章(例如醫學論文)之作者在撰寫時，亦常假設讀者有相關之背景知識，便不於文章中納入過多背景知識的介紹。因此，若以此類文章作為訓練資料或是目標查找答案之資料時，機器閱讀理解系統所取得之答案的精準度將相當地低。 However, in general, for the sake of brevity and literary beauty, authors tend to omit existing common sense when writing articles. In addition, authors of professional articles (such as medical papers) often assume that readers have relevant knowledge when writing articles. Background knowledge, so it is not appropriate to include too much background knowledge in the article. Therefore, if such articles are used as training data or target data for finding answers, the accuracy of the answers obtained by the machine reading comprehension system will be quite low.

鑒於上述，本發明提供一種機器閱讀理解方法及系統。 In view of the above, the present invention provides a machine reading comprehension method and system.

依據本發明一實施例的機器閱讀理解方法，包含取得問題文本及關聯於問題文本的文章文本，依據知識集產生對應於問題文本的第一知識文本及對應於文章文本的第二知識文本，編碼問題文本及文章文本以產生原始目標文本編碼，編碼第一知識文本及第二知識文本以產生知識文本編碼，對原始目標文本編碼及知識文本編碼執行融合運算以將知識集中的部分知識導入原始目標文本編碼而產生強化目標文本編碼，以及基於強化目標文本編碼取得對應於問題文本的答案，並輸出答案。 A machine reading comprehension method according to an embodiment of the present invention includes obtaining a question text and an article text associated with the question text, generating a first knowledge text corresponding to the question text and a second knowledge text corresponding to the article text according to a knowledge set, and encoding Question text and article text to produce Generate the original target text code, encode the first knowledge text and the second knowledge text to generate the knowledge text code, perform a fusion operation on the original target text code and the knowledge text code to import part of the knowledge in the knowledge set into the original target text code to generate the reinforcement target Text encoding, and obtaining an answer corresponding to the question text based on the reinforcement target text encoding, and outputting the answer.

依據本發明一實施例的機器閱讀理解系統，包含輸入輸出介面、知識文本產生器、語意編碼器、編碼融合器及答案擷取器，其中知識文本產生器連接於輸入輸出介面，語意編碼器連接於輸入輸出介面及知識文本產生器，編碼融合器連接於語意編碼器，且答案擷取器連接於編碼融合器。輸入輸出介面用於取得問題文本及關聯於問題文本的文章文本。知識文本產生器用於依據知識集取得對應於問題文本的第一知識文本及對應於文章文本的第二知識文本。語意編碼器用於編碼問題文本及文章文本以產生原始目標文本編碼，以及編碼第一知識文本及第二知識文本以產生知識文本編碼。編碼融合器用於對原始目標文本編碼及知識文本編碼執行融合運算，以將知識集中的部分知識導入原始目標文本編碼而產生強化目標文本編碼。答案擷取器用於基於強化目標文本編碼，取得對應於問題文本的答案，並輸出答案。 A machine reading comprehension system according to an embodiment of the present invention includes an input and output interface, a knowledge text generator, a semantic encoder, an encoding fusion device and an answer extractor, wherein the knowledge text generator is connected to the input and output interface, and the semantic encoder is connected to In the input-output interface and the knowledge text generator, the encoding fusion device is connected to the semantic encoder, and the answer extractor is connected to the encoding fusion device. The input-output interface is used to obtain the question text and the article text associated with the question text. The knowledge text generator is used for obtaining the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set. The semantic encoder is used to encode the question text and the article text to generate the original target text encoding, and to encode the first knowledge text and the second knowledge text to generate the knowledge text encoding. The code fusion unit is used to perform fusion operation on the original target text code and the knowledge text code, so as to import part of the knowledge in the knowledge set into the original target text code to generate the enhanced target text code. The answer extractor is used to obtain the answer corresponding to the question text based on the enhanced target text encoding, and output the answer.

藉由上述架構，本案所揭示的機器閱讀理解方法及系統，可以執行特殊的編碼運算及融合運算，以在分析問題及文章的過程中導入外部知識，藉此避免文章內容精簡而難以從中取得正確答案的問題，進而提升預測答案之精準度。 With the above structure, the machine reading comprehension method and system disclosed in this case can perform special coding operations and fusion operations to import external knowledge in the process of analyzing problems and articles, thereby avoiding the conciseness of article content and making it difficult to obtain correctness from it. answer questions, thereby improving the accuracy of the predicted answer.

以上之關於本揭露內容之說明及以下之實施方式之說明係用以示範與解釋本發明之精神與原理，並且提供本發明之專利申請範圍更進一步之解釋。 The above description of the present disclosure and the following description of the embodiments are used to demonstrate and explain the spirit and principle of the present invention, and provide further explanation of the scope of the patent application of the present invention.

1:機器閱讀理解系統 1: Machine Reading Comprehension System

11:輸入輸出介面 11: Input and output interface

12:知識文本產生器 12: Knowledge Text Generator

13:語意編碼器 13: Semantic encoder

14:編碼融合器 14: Coding Fusion

15:答案擷取器 15: Answer Picker

21:非結構化知識資料庫 21: Unstructured Knowledge Repository

22:結構化知識資料庫 22: Structured Knowledge Repository

x₁~x₄:單字 x ₁ ~x ₄ : single word

a₁~a₄:初始向量 a ₁ ~a ₄ : initial vector

b₁~b₄、b₁’~b₄’:編碼向量 b ₁ ~b ₄ , b ₁ '~b ₄ ': encoding vector

aq₁~aq₄、bq₁~bq₄:查詢向量 aq ₁ ~aq ₄ , bq ₁ ~bq ₄ : query vector

ak₁~ak₄、bk₁’~bk₄’:鍵向量 ak ₁ ~ak ₄ , bk ₁ '~bk ₄ ': key vector

av₁~av₄、bv₁’~bv₄’:值向量 av ₁ ~av ₄ , bv ₁ '~bv ₄ ': vector of values

α_1,1~α_1,4、β_1,1'~β_1,4':初始權重 α _1,1 ~α _1,4 , β _{1,1 '} ~β _{1,4 '} : initial weight

~

、

~

:歸一化權重

~

,

~

: normalized weight

m₁~m₄:融合向量 m ₁ ~m ₄ : fusion vector

c₁:加權和向量 c ₁ : weighted sum vector

S1~S7:步驟 S1~S7: Steps

S21~S25:步驟 S21~S25: Steps

S61~S62:步驟 S61~S62: Steps

S8~S11:步驟 S8~S11: Steps

圖1係依據本發明一實施例所繪示的機器閱讀理解系統及外部知識資料庫的功能方塊圖。 FIG. 1 is a functional block diagram of a machine reading comprehension system and an external knowledge database according to an embodiment of the present invention.

圖2係依據本發明一實施例所繪示的機器閱讀理解方法的流程圖。 FIG. 2 is a flowchart of a machine reading comprehension method according to an embodiment of the present invention.

圖3係依據本發明一實施例所繪示的機器閱讀理解方法中之產生知識文本的流程圖。 FIG. 3 is a flowchart of generating knowledge text in a machine reading comprehension method according to an embodiment of the present invention.

圖4A~圖4C係依據本發明一實施例所繪示的機器閱讀理解方法中之編碼作業的運算示意圖。 4A to 4C are schematic diagrams of operations of encoding operations in a machine reading comprehension method according to an embodiment of the present invention.

圖5A~圖5C係依據本發明一實施例所繪示的機器閱讀理解方法中之融合運算的運算示意圖。 5A to 5C are schematic diagrams of operations of fusion operations in a machine reading comprehension method according to an embodiment of the present invention.

圖6A及圖6B係依據本發明一實施例所繪示的機器閱讀理解方法中之答案擷取作業的流程圖。 FIG. 6A and FIG. 6B are flowcharts of an answer retrieval operation in a machine reading comprehension method according to an embodiment of the present invention.

圖7係依據本發明一實施例所繪示的機器閱讀理解方法中之最佳化使用參數的流程圖。 FIG. 7 is a flow chart of optimizing parameters used in a machine reading comprehension method according to an embodiment of the present invention.

圖8A係現有機器閱讀理解系統與本發明一實施例的機器閱讀理解系統以第一種資料進行訓練而得的實驗數據比較圖。 FIG. 8A is a comparison diagram of experimental data obtained by training the existing machine reading comprehension system and the machine reading comprehension system of an embodiment of the present invention with the first data.

圖8B係現有機器閱讀理解系統與本發明一實施例的機器閱讀理解系統以第二種資料進行訓練而得的實驗數據比較圖。 FIG. 8B is a comparison diagram of experimental data obtained by training the existing machine reading comprehension system and the machine reading comprehension system of an embodiment of the present invention with the second data.

以下在實施方式中詳細敘述本發明之詳細特徵以及優點，其內容足以使任何熟習相關技藝者了解本發明之技術內容並據以實施，且根據本說明書所揭露之內容、申請專利範圍及圖式，任何熟習相關技藝者可輕易地理解本發明相關之目的及優點。以下之實施例係進一步詳細說明本發明之觀點，但非以任何觀點限制本發明之範疇。 The detailed features and advantages of the present invention are described in detail below in the embodiments, and the content is sufficient to enable any person skilled in the relevant art to understand the technical content of the present invention and implement it accordingly, and according to the content disclosed in this specification, the scope of the patent application and the drawings , any person skilled in the related art can easily understand the related objects and advantages of the present invention. The following examples further illustrate the viewpoints of the present invention in detail, but do not limit the scope of the present invention in any viewpoint.

請參考圖1，圖1係依據本發明一實施例所繪示的機器閱讀理解系統及外部知識資料庫的功能方塊圖。如圖1所示，機器閱讀理解系統1包含輸入輸出介面11、知識文本產生器12、語意編碼器13、編碼融合器14及答案擷取器15，其中知識文本產生器12連接於輸入輸出介面11且可以連接於系統外的非結構化知識資料庫21或/及結構化知識資料庫22，語意編碼器13連接於輸入輸出介面11及知識文本產生器12，編碼融合器14連接於語意編碼器13，且答案擷取器15連接於編碼融合器14及輸入輸出介面11。 Please refer to FIG. 1 , which is a functional block diagram of a machine reading comprehension system and an external knowledge database according to an embodiment of the present invention. As shown in FIG. 1 , the machine reading comprehension system 1 includes an input and output interface 11 , a knowledge text generator 12 , a semantic encoder 13 , an encoding fusion device 14 and an answer extractor 15 , wherein the knowledge text generator 12 is connected to the input and output interface. 11 and can be connected to the unstructured knowledge database 21 or/and the structured knowledge database 22 outside the system, the semantic encoder 13 is connected to the input and output interface 11 and the knowledge text generator 12, and the encoding fusion device 14 is connected to the semantic encoding and the answer extractor 15 is connected to the code fusion unit 14 and the input/output interface 11 .

輸入輸出介面11用以取得問題文本及關聯於問題文本的文章文本，且可用於輸出系統之其他裝置所判定之對應於問題文本的答案。其中，問題文本及文章文本可以為文字檔案，問題文本指示欲尋求答案之問題，而文章文本則指示答案可能的出處。舉例來說，在智能客服應用上，產品說明文件或活動規則可以作為文章文本，產品使用方法或活動優惠內容之詢問可以作為問題文本。舉另個例子來說，在智慧醫療應用上，病歷或醫學論文可以作為文章文本，病因或療法之詢問可以作為問題文本。上述僅為舉例，並非意圖限制本發明。 The input-output interface 11 is used to obtain the question text and the article text related to the question text, and can be used to output the answer corresponding to the question text determined by other devices of the system. The question text and the article text may be text files, the question text indicates the question for which the answer is to be sought, and the article text indicates the possible source of the answer. For example, in a smart customer service application, product description documents or activity rules can be used as article text, and questions about product usage or activity offers can be used as question text. For another example, in smart medical applications, medical records or medical papers can be used as article texts, and questions about causes or treatments can be used as question texts. The above are only examples and are not intended to limit the present invention.

輸入輸出介面11可以包含鍵盤、滑鼠或觸控螢幕等輸入設備，以供使用者輸入或選擇問題文本及文章文本，亦可包含螢幕等輸出設備，以輸出答案擷取器15所產生之答案。或者，輸入輸出介面11可以為有線或無線連接埠，用以連接系統外部裝置(例如手機、平板、個人電腦等)以接收問題文本及文章文本或選擇特定問題文本及文章文本的指令，且可將答案擷取器15所產生之答案傳送至系統外部裝置。再或者，輸入輸出介面11除了上述輸入輸出設備或連接埠之外，可以更包含處理模組。輸入輸出介面11可以藉由輸入設備或連接埠接收問題文本或選擇特定問題文本的指令，再藉由處理模組從系統外部或內部資料庫查找關聯於問題文本的文章文本。進一步來說，處理模組可以依據問題文本中的關鍵字或是問題文本帶有的標籤判斷問題文本的類型或所屬之事件，並查找相同類型或所屬事件的文章文本。 The input-output interface 11 may include input devices such as a keyboard, a mouse, or a touch screen for the user to input or select question text and article text, and may also include an output device such as a screen for outputting answers. The answer generated by the case extractor 15. Alternatively, the input/output interface 11 may be a wired or wireless port for connecting to external devices of the system (such as mobile phones, tablets, personal computers, etc.) to receive question texts and article texts or instructions for selecting specific question texts and article texts, and may The answer generated by the answer extractor 15 is transmitted to the external device of the system. Alternatively, the input/output interface 11 may further include a processing module in addition to the above-mentioned input/output devices or connection ports. The input and output interface 11 can receive question text or an instruction to select a specific question text through an input device or a connection port, and then search for the article text related to the question text from the external or internal database of the system through the processing module. Further, the processing module can judge the type of the question text or the event to which it belongs according to the keywords in the question text or the tags attached to the question text, and search for article texts of the same type or the event.

知識文本產生器12、語意編碼器13、編碼融合器14、答案擷取器15及前述輸入輸出介面11可能具有之處理模組可以由同個處理器或多個處理器來實施，其中所謂處理器例如為中央處理器(Central Processing Unit，CPU)、微控制器、可程式化邏輯控制器(Programmable Logic Controller，PLC)等。 The knowledge text generator 12 , the semantic encoder 13 , the code fusion unit 14 , the answer extractor 15 and the possible processing modules of the aforementioned I/O interface 11 may be implemented by the same processor or multiple processors, wherein the so-called processing The device is, for example, a central processing unit (Central Processing Unit, CPU), a microcontroller, a programmable logic controller (Programmable Logic Controller, PLC), and the like.

知識文本產生器12用於從輸入輸出介面11接收問題文本及文章文本，且依據一知識集，產生對應於問題文本的第一知識文本及對應於文章文本的第二知識文本。知識集可以由非結構化知識資料庫21及結構化知識資料庫22中的一或二者提供。非結構化知識資料庫21及結構化知識資料庫22可以為網路上公開的資料庫或業者內部的資料庫。非結構化知識資料庫21儲存多個非結構化知識，其中所述多個非結構化知識可以分別為多個特定詞的文字描述。舉例來說，非結構化知識庫21可以包含維基百科、辭典等。結構化知識資料庫22儲存多個結構化知識，其中所述多個結構化知識可以分別為多個特定詞於其他詞的關係，例如以「實體-實體關係-實體」之三元組的形式來表示，且多個三元組可以組成知識圖譜。也就是說，結構化知識資料庫22可以包含多種領域的知識圖譜。另外，知識文本產生器12可以透過輸入輸出介面11輸出知識集的至少一部分。進一步來說，知識文本產生器12可以透過輸入輸出介面11輸出非結構化知識庫21或/及結構化知識資料庫22所儲存之知識資料，也可以輸出知識文本產生器12所產生的知識文本，以供使用者查看或調整。有關知識文本產生器12依據上述知識集產生知識文本之進一步實施方式將於後描述。 The knowledge text generator 12 is configured to receive the question text and the article text from the input-output interface 11, and generate a first knowledge text corresponding to the question text and a second knowledge text corresponding to the article text according to a knowledge set. The knowledge set may be provided by one or both of the unstructured knowledge repository 21 and the structured knowledge repository 22 . The unstructured knowledge database 21 and the structured knowledge database 22 may be databases published on the Internet or databases within the industry. The unstructured knowledge database 21 stores a plurality of unstructured knowledge, wherein the plurality of unstructured knowledge can be text descriptions of a plurality of specific words respectively. For example, the unstructured knowledge base 21 may include Wikipedia, thesaurus, and the like. The structured knowledge database 22 stores a plurality of structured knowledges, wherein the plurality of structured knowledges can be the relationship between a plurality of specific words and other words, for example, in the form of a triplet of "entity-entity relationship-entity" to represent, and multiple triples can be Form a knowledge graph. That is, the structured knowledge repository 22 may contain knowledge graphs of various domains. In addition, the knowledge text generator 12 can output at least a part of the knowledge set through the input and output interface 11 . Further, the knowledge text generator 12 can output the knowledge data stored in the unstructured knowledge base 21 or/and the structured knowledge base 22 through the input-output interface 11 , and can also output the knowledge text generated by the knowledge text generator 12 , for users to view or adjust. Further implementation of the knowledge text generator 12 to generate the knowledge text according to the above-mentioned knowledge set will be described later.

語意編碼器13用於從輸入輸出介面11接收問題文本及文章文本，編碼問題文本及文章文本以產生原始目標文本編碼，從知識文本產生器12接收其所產生之第一知識文本及第二知識文本，並編碼第一知識文本及第二知識文本以產生知識文本編碼。語意編碼器13可以多種方式進行編碼作業，包含無涉及上下文關係(non-contextualized)的編碼方式以及涉及上下文關係(contextualized)的編碼方式，其中進一步的實施方式將於後描述。 The semantic encoder 13 is used to receive the question text and the article text from the input-output interface 11 , encode the question text and the article text to generate the original target text code, and receive the first knowledge text and the second knowledge generated by the knowledge text generator 12 text, and encode the first knowledge text and the second knowledge text to generate a knowledge text code. The semantic encoder 13 can perform encoding operations in various ways, including non-contextualized encoding and contextualized encoding, further implementations of which will be described later.

編碼融合器14用於對語意編碼器13產生之原始目標文本編碼及知識文本編碼執行融合運算，以將知識集中的部分知識導入原始目標文本編碼而產生強化目標文本編碼。答案擷取器15用於基於強化目標文本編碼，取得對應於問題文本的答案，並透過輸入輸出介面輸出答案，其中所述輸出介面例如為螢幕等輸出設備，或者為有線或無線連接埠用以連接並將答案傳送至系統外部裝置(例如手機、平板、個人電腦等)。有關編碼融合器14所執行之融合運算及答案擷取器15所執行之答案擷取作業的進一步實施方式將於後描述。 The code fusion unit 14 is used for performing a fusion operation on the original target text code and the knowledge text code generated by the semantic encoder 13, so as to import part of the knowledge in the knowledge set into the original target text code to generate the enhanced target text code. The answer extractor 15 is used for obtaining an answer corresponding to the question text based on the enhanced target text encoding, and outputting the answer through an input-output interface, wherein the output interface is, for example, an output device such as a screen, or a wired or wireless connection port for Connect and transmit answers to devices external to the system (eg mobile phones, tablets, PCs, etc.). Further implementation of the fusion operation performed by the code fuser 14 and the answer extraction operation performed by the answer extractor 15 will be described later.

請參考圖1及圖2，其中圖2係依據本發明一實施例所繪示的機器閱讀理解方法的流程圖。圖2所示的機器閱讀理解方法適用於圖1所示的機器閱讀理解系統1，但不限於此。如圖2所示，機器閱讀理解方法包含步驟S1：取得問題文本及關聯於問題文本的文章文本；步驟S2：依據知識集，產生對應於問題文本的第一知識文本及對應於文章文本的第二知識文本；步驟S3：編碼問題文本及文章文本以產生原始目標文本編碼；步驟S4：編碼第一知識文本及第二知識文本以產生知識文本編碼；步驟S5：對原始目標文本編碼及知識文本編碼執行融合運算，以將知識集中的部分知識導入原始目標文本編碼而產生強化目標文本編碼；步驟S6：基於強化目標文本編碼，取得對應於問題文本的答案；步驟S7：輸出答案。以下示例性地以圖1所示的機器閱讀理解系統1包含的裝置來進一步說明圖2所示之資源配置方法的各種實施方式。 Please refer to FIG. 1 and FIG. 2 , wherein FIG. 2 is a flowchart of a machine reading comprehension method according to an embodiment of the present invention. The machine reading comprehension method shown in FIG. 2 is applicable to the machine reading comprehension system 1 shown in FIG. 1 , but is not limited thereto. As shown in Figure 2, the machine reading comprehension method includes step S1: obtaining the question The question text and the article text associated with the question text; Step S2: According to the knowledge set, generate a first knowledge text corresponding to the question text and a second knowledge text corresponding to the article text; Step S3: Encode the question text and the article text to generate The original target text encoding; Step S4: encoding the first knowledge text and the second knowledge text to generate the knowledge text encoding; Step S5: performing a fusion operation on the original target text encoding and the knowledge text encoding to import part of the knowledge in the knowledge set into the original target text encoding to generate an enhanced target text encoding; step S6 : obtaining an answer corresponding to the question text based on the enhanced target text encoding; step S7 : outputting the answer. Various implementations of the resource allocation method shown in FIG. 2 are further described below by exemplifying the devices included in the machine reading comprehension system 1 shown in FIG. 1 .

於步驟S1中，輸入輸出介面11可以取得問題文本及關聯於問題文本的文章文本。進一步來說，輸入輸出介面11可以直接接收問題文本及文章文本之檔案，或接收選擇特定問題文本及文章文本的指令，或者可以接收問題文本或選擇特定問題文本的指令，再從系統外部或內部資料庫查找關聯於問題文本的文章文本。其中，查找關聯於問題文本的文章文本的方式可以為：依據問題文本中的關鍵字或是問題文本帶有的標籤判斷問題文本的類型或所屬之事件，再查找相同類型或所屬事件的文章文本。舉例來說，當輸入輸出介面11判斷問題文本為醫學類時，便查找醫學類的文章文本；當輸入輸出介面11判斷問題文本指示周年慶活動之相關問題時，便查找周年慶活動之相關文章。上述僅為舉例，並非意圖限制本發明。 In step S1, the input/output interface 11 can obtain the question text and the article text related to the question text. Further, the I/O interface 11 can directly receive files of question texts and article texts, or receive instructions to select specific question texts and article texts, or can receive question texts or instructions to select specific question texts, and then access the files from outside or inside the system. The database looks for article text associated with the question text. Among them, the method of finding the article text related to the question text may be: according to the keywords in the question text or the label of the question text to determine the type of the question text or the event to which it belongs, and then search for the article text of the same type or the event to which it belongs. . For example, when the I/O interface 11 judges that the question text is medical, it searches for the medical article text; when the I/O interface 11 judges that the question text indicates a problem related to an anniversary event, it searches for an article related to the anniversary event. . The above are only examples and are not intended to limit the present invention.

於步驟S2中，知識文本產生器12可以依據知識集，產生對應於問題文本的第一知識文本及對應於文章文本的第二知識文本，也就是說，知識文本產生器12可以分別將問題文本及文章文本作為待處理文本來處理，以分別產生對應之知識文本。知識集包含非結構化知識資料庫21及結構化知識資料庫22中的一或二者所儲存的知識，也就是說，知識文本產生器12可以從非結構化知識資料庫21或/及結構化知識資料庫22查找用於產生第一知識文本及第二知識文本的材料。 In step S2, the knowledge text generator 12 can generate the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set, that is to say, the knowledge text generator 12 can generate the question text respectively. and article texts are processed as pending texts to generate corresponding knowledge texts respectively. The knowledge set includes the unstructured knowledge database 21 and the structured knowledge database 22 One or both of the stored knowledge, that is, the knowledge text generator 12 can search from the unstructured knowledge database 21 or/and the structured knowledge database 22 for generating the first knowledge text and the second knowledge text s material.

進一步來說明產生知識文本之流程，請參考圖1及圖3，圖3係依據本發明一實施例所繪示的機器閱讀理解方法中之產生知識文本的流程圖。如圖3所示，產生知識文本之流程可以包含步驟S21：將待處理文本分割為多個詞；步驟S22：依據所述多個詞查找知識集，以取得至少一相關知識；步驟S23：判斷相關知識的數量為一或大於一；當相關知識的數量為一時，執行步驟S24：以此相關知識產生目標知識文本；而當相關知識的數量大於一時，執行步驟S25：依據所述多個詞的排列順序及預設範本，組合相關知識以產生目標知識文本。其中，以問題文本作為待處理文本而產生之目標知識文本即為第一知識文本，而以文章文本作為待處理文本而產生之目標知識文本即為第二知識文本。 To further describe the process of generating knowledge text, please refer to FIG. 1 and FIG. 3 . FIG. 3 is a flowchart of generating knowledge text in a machine reading comprehension method according to an embodiment of the present invention. As shown in FIG. 3, the process of generating knowledge text may include step S21: dividing the text to be processed into a plurality of words; step S22: searching a knowledge set according to the plurality of words to obtain at least one relevant knowledge; step S23: judging The number of related knowledge is one or more than one; when the number of related knowledge is one, execute step S24: generate the target knowledge text based on the related knowledge; and when the number of related knowledge is greater than one, execute step S25: according to the multiple words The order of arrangement and the preset template, and the related knowledge is combined to generate the target knowledge text. The target knowledge text generated by taking the question text as the text to be processed is the first knowledge text, and the target knowledge text generated by using the article text as the text to be processed is the second knowledge text.

於步驟S21中，知識文本產生器12可以藉由自然語言分析技術將待處理文本分割為多個詞。於步驟S22中，知識文本產生器12可以將分割出的詞各作為關鍵字以從知識集中查找與關鍵字相關的知識，也就是從非結構化知識資料庫21或/及結構化知識資料庫22查找與關鍵字相關的知識。特別來說，待處理文本所包含之關鍵字的數量與查找到之相關知識的數量不一定對應，一關鍵字可能對應於零個、一個或多個相關知識。也就是說，知識文本產生器12所取得之相關知識可能為零個、一個或多個。當相關知識的數量為零時，知識文本產生器12便停止運作或/及輸出錯誤訊號；當相關知識的數量為一或大於一時，知識文本產生器12之運作則如下所述。 In step S21 , the knowledge text generator 12 can divide the text to be processed into a plurality of words by natural language analysis technology. In step S22, the knowledge text generator 12 can use the segmented words as keywords to search for knowledge related to the keywords from the knowledge set, that is, from the unstructured knowledge database 21 or/and the structured knowledge database. 22 Find knowledge related to keywords. In particular, the number of keywords contained in the text to be processed does not necessarily correspond to the number of related knowledge found, and a keyword may correspond to zero, one or more related knowledge. That is, the relevant knowledge acquired by the knowledge text generator 12 may be zero, one or more. When the number of related knowledge is zero, the knowledge text generator 12 stops operating or/and outputs an error signal; when the number of related knowledge is one or more, the operation of the knowledge text generator 12 is as follows.

於步驟S23~S25中，當相關知識的數量為一時，知識文本產生器12便依據此相關知識產生目標知識文本；而當相關知識的數量大於一時，知識文本產生器12會依據分割出來的詞的排列順序及一預設範本(第一預設範本)，組合這些相關知識以產生目標知識文本。舉例來說，第一預設範本指示串接所有相關知識之文字描述，且每兩相關知識間以分隔符號(例如句號)隔開，其中串接的順序同於詞的排列順序，但不限於此。於另一實施例中，知識文本產生器12可以再將串接完的文字描述透過文本摘要系統產生精簡版本的知識文本以作為目標知識文本。另外，當知識文本產生器12所取得之相關知識的數量大於一預設處理上限時，知識文本產生器12可以依據待處理文本的類型或所屬活動(例如基於文本帶有的標籤)或依據相關知識來源的可信度(例如期刊論文優先於網路文章)來篩選相關知識，以留下不大於預設處理上限之數量的相關知識。 In steps S23-S25, when the quantity of the relevant knowledge is one, the knowledge text generator 12 generates the target knowledge text according to the relevant knowledge; and when the quantity of the relevant knowledge is greater than one, the knowledge text generator 12 generates the target knowledge text according to the segmented word. The order of arrangement and a preset template (the first preset template) are combined to generate the target knowledge text. For example, the first default template indicates that the text descriptions of all related knowledge are concatenated, and each two related knowledges are separated by a separator (such as a period), wherein the sequence of concatenation is the same as the order of words, but not limited to this. In another embodiment, the knowledge text generator 12 may generate a condensed version of the knowledge text through the text summarization system from the concatenated text description as the target knowledge text. In addition, when the amount of related knowledge acquired by the knowledge text generator 12 is greater than a preset processing upper limit, the knowledge text generator 12 can determine the type of the text to be processed or the activity it belongs to (for example, based on the tags attached to the text) or the related knowledge. The credibility of the knowledge source (eg, journal articles over Internet articles) is used to screen relevant knowledge to leave no more than a predetermined processing limit of relevant knowledge.

如前所述，知識文本產生器12依據關鍵字所取得之相關知識可能來自於非結構化知識資料庫21或/及結構化知識資料庫22，也就是說，相關知識可能包含非結構化知識或/及結構化知識。對於屬於非結構化知識的相關知識，其形式本身便為文字描述，因此知識文本產生器12可以直接以相關知識來產生目標知識文本。對於屬於結構化知識的相關知識，知識文本產生器12在產生目標知識文本之前，會先依據另一預設範本(第二預設範本)將相關知識之形式轉換為文字描述。以形式為「實體(A)-實體關係(B)-實體(C)」之三元組的非結構化知識為例，第二預設範本可以設定為「A的B為C」，但不限於此。 As mentioned above, the relevant knowledge obtained by the knowledge text generator 12 according to the keywords may come from the unstructured knowledge database 21 or/and the structured knowledge database 22, that is, the relevant knowledge may include unstructured knowledge or/and structured knowledge. For the relevant knowledge belonging to unstructured knowledge, the form itself is a text description, so the knowledge text generator 12 can directly generate the target knowledge text based on the relevant knowledge. For the related knowledge belonging to the structured knowledge, before generating the target knowledge text, the knowledge text generator 12 will first convert the form of the related knowledge into a text description according to another preset template (the second preset template). Taking unstructured knowledge in the form of a triplet of "entity (A)-entity relation (B)-entity (C)" as an example, the second default template can be set as "B of A is C", but not limited to this.

下述舉三個以問題文本作為待處理文本之例子，分別為相關知識皆屬非結構化知識的例子、相關知識皆屬結構化知識的例子以及相關知識兼具非結構化知識及結構化知識的例子。這些例子僅為示例性的說明，非意圖限制本發明。 The following three examples of using the question text as the text to be processed are the examples in which the relevant knowledge is all unstructured knowledge, the example in which the relevant knowledge is all structured knowledge, and the relevant knowledge is both. Examples of unstructured and structured knowledge. These examples are illustrative only and are not intended to limit the invention.

於第一個例子中，問題文本為「原告想要維護什麼權利」，且知識文本產生器12從知識集中取得關鍵字「原告」的文字描述及關鍵字「權利」的文字描述，則知識文本產生器12可以產生第一知識文本「(原告的文字描述)。(權利的文字描述)」。於第二個例子中，問題文本為「坐月子可以洗澡嗎」，且知識文本產生器12從知識集中取得關鍵字「坐月子」的三元組「坐月子-概念-產後護理」及關鍵字「洗澡」的三元組「洗澡-功效-清除污垢」，則知識文本產生器12可以先將兩三元組分別轉換為文字描述「坐月子的概念為產後護理」及「洗澡的功效為清除污垢」，再將兩個文字描述依關鍵字在問題文本中的順序串接以產生目標知識文本。於第三個例子中，問題文本為「婚生子女的出生年月日為何」，且知識文本產生器12從知識集中取得關鍵字「婚生子女」的文字描述及關鍵字「年月日」的三元組，則知識文本產生器12會先將「年月日」的三元組依前述方式轉換為文字描述，再依關鍵字在問題文本中的順序串接文字描述。上述例子僅為示例性的說明，非意圖限制本發明。 In the first example, the question text is "what right does the plaintiff want to defend", and the knowledge text generator 12 obtains the text description of the keyword "plaintiff" and the text description of the keyword "right" from the knowledge set, then the knowledge text The generator 12 may generate the first knowledge text "(text description of the plaintiff). (text description of the right)". In the second example, the question text is "Can confinement children take a bath", and the knowledge text generator 12 obtains the triplet "confinement-concept-postpartum care" of the keyword "confinement" from the knowledge set and the triplet “bathing-efficacy-removing dirt” of the keyword “bathing”, the knowledge text generator 12 can first convert the two or three-tuples into text descriptions “the concept of confinement is postpartum care” and “bathing” The effect is to remove dirt”, and then the two text descriptions are concatenated in the order of the keywords in the question text to generate the target knowledge text. In the third example, the question text is "what is the date of birth of the child born in wedlock", and the knowledge text generator 12 obtains the text description of the keyword "children born in wedlock" and the keyword "date of birth" from the knowledge set The knowledge text generator 12 first converts the triples of "year, month, and day" into text descriptions in the aforementioned manner, and then concatenates the text descriptions according to the sequence of keywords in the question text. The above examples are illustrative only and are not intended to limit the present invention.

如上所述，機器閱讀理解系統1可以藉由知識文本產生器12將結構化知識轉換為文字描述，以整合非結構化及結構化的知識，藉此，後續分析文章以產出答案之運算相較於直接分析結構化資料以產出答案之運算可以具有較低的運算複雜度。 As mentioned above, the machine reading comprehension system 1 can use the knowledge text generator 12 to convert structured knowledge into text descriptions, so as to integrate unstructured and structured knowledge, thereby, the subsequent analysis of the article to generate an answer to the operation phase The operation may have lower computational complexity than directly analyzing the structured data to produce an answer.

接著說明圖2之步驟S3及S4。於此要特別說明的是，圖2示例性地繪示步驟S4執行於步驟S3之後，然而於其他實施例中，步驟S4可以執行於步驟S3之前，也可以與步驟S3同時執行。於步驟S3及S4中，語意編碼器13可以編碼問題文本及文章文本以產生原始目標文本編碼，並編碼第一知識文本及第二知識文本以產生知識文本編碼。也就是說，於步驟S3中，語意編碼器13將問題文本與文章文本之組合作為編碼運算的執行對象，而於步驟S4中，將第一知識文本與第二知識文本之組合作為編碼運算的執行對象，其中，所謂組合可以係將兩文本直接串連而形成，或是串連後於文本串首端、尾端及兩文本之間加入分割符(例如於首端加入[CLS]，於尾端加入[SEP]且於兩文本之間加入[SEP])而形成，但不限於此。 Next, steps S3 and S4 in FIG. 2 will be described. It should be noted here that, FIG. 2 exemplarily shows that step S4 is performed after step S3, but in other embodiments, step S4 may be performed before step S3, or may be performed simultaneously with step S3. In steps S3 and S4, the semantic encoder 13 can encode The question text and the article text are used to generate the original target text code, and the first knowledge text and the second knowledge text are encoded to generate the knowledge text code. That is to say, in step S3, the semantic encoder 13 uses the combination of the question text and the article text as the execution object of the encoding operation, and in step S4, the combination of the first knowledge text and the second knowledge text is used as the encoding operation. Execution object, in which the so-called combination can be formed by directly concatenating two texts, or adding separators at the beginning and end of the text string and between the two texts after the concatenation (for example, adding [CLS] at the beginning, in It is formed by adding [SEP] at the end and adding [SEP]) between two texts, but not limited to this.

語意編碼器13可以藉由無涉及上下文關係的編碼方式或涉及上下文關係的方式來執行編碼運算以產生原始目標文本編碼或知識文本編碼。特別來說，產生原始目標文本編碼及知識文本編碼的方式可以利用相同或不同之編碼方式。無涉及上下文關係的編碼方式可以包含：將執行對象分割為多個單字，取得各單字分別對應的初始向量以及組合初始向量以產生原始目標文本編碼或知識文本編碼。以英文作為執行對象為例，語意編碼器13可直接依據執行對象中的空格來分割詞，或可藉由WordPiece演算法來分割成子詞(subword)，例如將playing分割為play及##ing；另以中文作為執行對象為例，語意編碼器13可以將執行對象分割為多個字元，或可藉由自然語言分析技術將執行對象分割為多個詞。上述僅為舉例，本發明不以此為限。 The semantic encoder 13 may perform an encoding operation to generate the original target text encoding or the knowledge text encoding by a context-free encoding method or a context-involving encoding method. In particular, the manner of generating the original target text encoding and the knowledge text encoding may utilize the same or different encoding methods. The encoding method that does not involve the context relationship may include: dividing the execution object into multiple words, obtaining initial vectors corresponding to each word, and combining the initial vectors to generate the original target text code or knowledge text code. Taking English as the execution object as an example, the semantic encoder 13 can directly divide words according to the spaces in the execution object, or can be divided into subwords (subwords) by using the WordPiece algorithm, such as dividing playing into play and ##ing; Taking Chinese as the execution object as an example, the semantic encoder 13 can divide the execution object into multiple characters, or can divide the execution object into multiple words by natural language analysis technology. The above are only examples, and the present invention is not limited thereto.

所述初始向量可以僅為單字向量(Token Embedding)，或者包含同一維度的單字向量、分段向量(Segment Embedding)及位置向量(Position Embedding)，例如為三向量之和。單字向量表示對應的單字在向量空間中的代表向量，而取得單字向量的方式可以利用Word2Vec模型或GloVe模型來實施。分段向量表示對應的單字係屬於執行對象中的第一文本或第二文本。以問題文本與文章文本之組合作為執行對象為例，所述第一文本表示問題文本，且對應之分段向量為編碼代號為0的向量，所述第二文本表示文章文本，且對應之分段向量為編碼代號為1的向量。位置向量則表示對應的單字在所有單字中的位置。原始目標文本編碼或知識文本編碼可以為初始向量所組成的向量矩陣。 The initial vector may only be a single-word vector (Token Embedding), or may include a single-word vector, a segment vector (Segment Embedding) and a position vector (Position Embedding) of the same dimension, such as the sum of three vectors. The single-word vector represents the representative vector of the corresponding single-word in the vector space, and the way to obtain the single-word vector can be implemented by using the Word2Vec model or the GloVe model. The segment vector indicates that the corresponding word belongs to the first text in the execution object. this or a second text. Taking the combination of the question text and the article text as the execution object as an example, the first text represents the question text, and the corresponding segment vector is a vector with an encoding code of 0, the second text represents the article text, and the corresponding The segment vector is a vector with encoding code 1. The position vector represents the position of the corresponding word in all words. The original target text encoding or knowledge text encoding can be a vector matrix composed of initial vectors.

涉及上下文關係的編碼方式則可以包含：將執行對象分割為多個單字；取得這些單字分別對應的多個初始向量；對這些初始向量執行上下文編碼以產生多個編碼向量；以及組合這些編碼向量以產生原始目標文本編碼或知識文本編碼。所述初始向量如前所述，可以僅為單字向量，或者包含同一維度的單字向量、分段向量及位置向量，例如為三向量之和。其中單字向量、分段向量及位置向量之意義如前所述於此便不再贅述。 The encoding method involving the context relationship may include: dividing the execution object into multiple words; obtaining multiple initial vectors corresponding to these words; performing context encoding on these initial vectors to generate multiple encoding vectors; and combining these encoding vectors to Generates original target text encodings or knowledge text encodings. As mentioned above, the initial vector may only be a single-word vector, or may include a single-word vector, a segment vector, and a position vector of the same dimension, for example, the sum of three vectors. The meanings of the single word vector, the segment vector and the position vector are described above and will not be repeated here.

進一步來說明一種上下文編碼方式，請參考圖1及圖4A~圖4C，圖4A~圖4C係依據本發明一實施例所繪示的機器閱讀理解方法中之編碼作業的運算示意圖。於圖4A中，語意編碼器13將執行對象分割為單字x₁~x₄，並依前述方式取得單字x₁~x₄分別對應的初始向量a₁~a₄，語意編碼器13接著分別對初始向量a₁~a₄執行上下文編碼，以產生多個編碼向量b₁~b₄。其中，對初始向量a₁~a₄所執行之上下文編碼可以平行執行，也可以依特定順序執行。圖4B及圖4C示例性地繪示對初始向量a₁執行上下文編碼以取得編碼向量b₁，其他初始向量a₂~a₄亦以相同運算來取得編碼向量b₂~b₄，因此無另外繪示。另外要特別說明的是，圖4A~圖4C所繪示之單字數量僅為舉例，本發明不限於此。 To further illustrate a context encoding method, please refer to FIG. 1 and FIGS. 4A to 4C . FIGS. 4A to 4C are schematic operation diagrams of encoding operations in a machine reading comprehension method according to an embodiment of the present invention. In FIG. 4A , the semantic encoder 13 divides the execution object into single words x ₁ ˜x ₄ , and obtains the initial vectors a ₁ ˜a ₄ corresponding to the single words x ₁ ˜x ₄ in the aforementioned manner, and the semantic encoder 13 then respectively The initial vectors a ₁ to a ₄ perform context encoding to generate a plurality of encoded vectors b ₁ to b ₄ . The context encoding performed on the initial vectors a ₁ to a ₄ may be performed in parallel, or may be performed in a specific order. FIG. 4B and FIG. 4C exemplarily show that the context encoding is performed on the initial vector a ₁ to obtain the encoding vector b ₁ , and the other initial vectors a ₂ -a ₄ also use the same operation to obtain the encoding vectors b ₂ -b ₄ , so there is no additional drawing. In addition, it should be noted that the number of words shown in FIGS. 4A to 4C is only an example, and the present invention is not limited thereto.

如圖4B所示，語意編碼器13可以依據初始向量a₁~a₄產生對應的多個查詢向量aq₁~aq₄、多個鍵向量ak₁~ak₄及多個值向量av₁~av₄。進一步來說，表示查詢向量aq₁~aq₄、鍵向量ak₁~ak₄及值向量av₁~av₄的數學式可以表示如下：aq_i=W_aqa_i As shown in FIG. 4B , the semantic encoder 13 can generate corresponding multiple query vectors aq ₁ ˜aq ₄ , multiple key vectors ak ₁ ˜ak ₄ , and multiple value vectors av ₁ ˜av according to the initial vectors a ₁ ˜a ₄ . ₄ . Further, the mathematical expression representing the query vector aq ₁ ~aq ₄ , the key vector ak ₁ ~ak ₄ and the value vector av ₁ ~av ₄ can be expressed as follows: aq _i =W _aq a _i

ak_i=W_aka_i ak _i =W _ak a _i

av_i=W_ava_i av _i =W _av a _i

其中，W_aq、W_ak及W_av為隨機給定之權重矩陣，可藉由分析機器閱讀理解系統1多次的執行表現來決定最佳值，進一步的最佳化流程將於後描述。 Among them, W _aq , W _ak and W _av are randomly given weight matrices, and the optimal value can be determined by analyzing the execution performance of the machine reading comprehension system 1 for many times. The further optimization process will be described later.

接著，語意編碼器13將查詢向量aq₁分別與鍵向量ak₁~ak₄執行內積運算，以得到多個初始權重α_1,1~α_1,4。或者，在進行內積運算之後可以更除以查詢向量aq₁及鍵向量ak₁~ak₄所屬之維度，以得多個初始權重α_1,1~α_1,4，以數學式表示可以為：

Next, the semantic encoder 13 performs an inner product operation on the query vector aq ₁ and the key vectors ak ₁ ˜ak ₄ respectively, so as to obtain a plurality of initial weights α _1,1 ˜α _1,4 . Or, after the inner product operation is performed, it can be divided by the dimension to which the query vector aq ₁ and the key vector ak ₁ ~ak ₄ belong to obtain a plurality of initial weights α _1,1 ~α _1,4 , which can be expressed in mathematical expressions as :

其中d為查詢向量aq₁及鍵向量ak₁~ak₄所屬之維度。 d is the dimension to which the query vector aq ₁ and the key vector ak ₁ ~ak ₄ belong.

語意編碼器13再分別對初始權重α_1,1~α_1,4執行歸一化運算，以得到多個歸一化權重

~

。其中，所述歸一化運算可以利用Softmax函式來執行，以Softmax函式運算而得之歸一化權重

~

以數學式可以表示如下，然本發明之歸一化運算亦可藉由其他使權重之總和為1之函數來執行，不以下列數學式為限：

The semantic encoder 13 then performs a normalization operation on the initial weights α _1,1 to α _1,4 respectively to obtain a plurality of normalized weights

~

. Wherein, the normalization operation can be performed by using the Softmax function, and the normalized weight is obtained by the Softmax function operation

~

The mathematical formula can be expressed as follows, but the normalization operation of the present invention can also be performed by other functions that make the sum of the weights equal to 1, and is not limited to the following mathematical formula:

接著如圖4C所示，語意編碼器13將歸一化權重

~

與值向量av₁~av₄進行加權和運算，以得到加權和向量，而此加權和向量便作為編碼向量b₁，以數學式表示可以為：

Next, as shown in Figure 4C, the semantic encoder 13 normalizes the weights

~

Perform a weighted sum operation with the value vectors av ₁ ~av ₄ to obtain a weighted sum vector, and this weighted sum vector is used as a coding vector b ₁ , which can be expressed mathematically as:

編碼向量b₂~b₄亦由語意編碼器13以上述運算方式產生。於另一實施例中，上述利用查詢向量aq₁~aq₄、鍵向量ak₁~ak₄及值向量av₁~av₄進行之運算可以重複執行多次。也就是說，圖4A中的上下文編碼程序方塊可以為多層，語意編碼器13將初始向量a₁~a₄作為第一層的輸入，其輸出(加權和向量)作為下一層的輸入，以此類推，最後一層輸出的運算結果便作為編碼向量b₁~b₄，其中，每次上下文編碼程序中用於產生查詢向量、鍵向量及值向量之權重矩陣會有所不同。藉此，可以提升機器閱讀理解系統1對於文本的理解程度。當編碼運算的執行對象為問題文本與文章文本之組合時，編碼向量b₁~b₄所組合之矩陣為原始目標文本編碼，而當執行對象為第一知識文本與第二知識文本之組合時，編碼向量b₁~b₄所組合之矩陣為知識文本編碼。 The coding vectors b ₂ ˜b ₄ are also generated by the semantic encoder 13 in the above-mentioned operation manner. In another embodiment, the above-mentioned operations using the query vectors aq ₁ to aq ₄ , the key vectors ak ₁ to ak ₄ and the value vectors av ₁ to av ₄ can be repeated for multiple times. That is to say, the context encoding program block in FIG. 4A can be multi-layered, and the semantic encoder 13 uses the initial vectors a ₁ to a ₄ as the input of the first layer, and the output (weighted sum vector) as the input of the next layer, so that By analogy, the operation results output by the last layer are used as encoding vectors b ₁ ~b ₄ , wherein the weight matrices used to generate the query vector, key vector and value vector in each context encoding program will be different. In this way, the comprehension level of the machine reading comprehension system 1 for the text can be improved. When the execution object of the encoding operation is the combination of the question text and the article text, the matrix combined by the encoding vectors b ₁ to b ₄ is the original target text encoding, and when the execution object is the combination of the first knowledge text and the second knowledge text , the matrix composed of encoding vectors b ₁ ~ b ₄ is the knowledge text encoding.

除了圖4A~圖4C所示之上下文編碼方式，語意編碼器13亦可執行其他種上下文編碼器的編碼方式，例如BERT、RoBERTa、XLNet、ALBERT、採用長短期記憶模型(Long Short-Term Memory，LSTM)的ELMo等編碼器之編碼方式。 In addition to the context encoding methods shown in FIGS. 4A to 4C , the semantic encoder 13 can also implement other encoding methods of context encoders, such as BERT, RoBERTa, XLNet, ALBERT, using a long short-term memory model (Long Short-Term Memory, The encoding method of encoders such as ELMo of LSTM).

經上述語意編碼器13執行編碼作業以產生原始目標文本編碼及知識文本編碼之後，編碼融合器14可以對原始目標文本編碼及知識文本編碼執行融合運算，以將知識集中的部分知識導入原始目標文本編碼而產生強化目標文本編碼，即圖2所示之步驟S5。進一步來說，請參考圖1及圖5A~圖5C，其中圖5A~圖5C係依據本發明一實施例所繪示的機器閱讀理解方法中之融合運算的運算示意圖。於圖5A中，編碼向量b₁~b₄表示原始目標文本編碼所包含之編碼向量，編碼向量b₁’~b₄’則表示知識文本編碼所包含之編碼向量。編碼融合器14可以對原始目標文本編碼之編碼向量b₁~b₄及知識文本編碼之編碼向量b₁’~b₄’執行融合運算，以產生融合向量m₁~m₄。其中，用於產生融合向量m₁~m₄所執行之融合運算可以平行執行，也可以依特定順序執行。圖5B及圖5C示例性地繪示對編碼向量b₁與編碼向量b₁’~b₄’執行融合運算以取得融合向量m₁，而其他編碼向量b₂~b₄亦各可與編碼向量b₁’~b₄’進行相同運算來取得融合向量m₂~m₄，因此無另外繪示。另外要特別說明的是，圖5A~圖5C僅示例性地繪示編碼向量之數量，且原始目標文本編碼所包含之編碼向量的數量與知識文本編碼所包含之編碼向量的數量實際上不需一致。 After the above-mentioned semantic encoder 13 performs the encoding operation to generate the original target text code and the knowledge text code, the code fusion unit 14 can perform a fusion operation on the original target text code and the knowledge text code, so as to import part of the knowledge in the knowledge set into the original target text. Encoding to generate enhanced target text encoding, that is, step S5 shown in FIG. 2 . Further, please refer to FIG. 1 and FIG. 5A to FIG. 5C , wherein FIG. 5A to FIG. 5C are schematic operation diagrams of the fusion operation in the machine reading comprehension method according to an embodiment of the present invention. In FIG. 5A , the encoding vectors b ₁ to b ₄ represent the encoding vectors included in the original target text encoding, and the encoding vectors b ₁ ' to b ₄ ' represent the encoding vectors included in the knowledge text encoding. The code fusion unit 14 may perform a fusion operation on the code vectors b _{1 ˜b 4 of the original target text code and the code vectors b 1} _{′˜b 4} _′ _of the knowledge text code to generate fusion vectors m ₁ ˜m ₄ . The fusion operations performed for generating fusion vectors m ₁ to m ₄ may be performed in parallel, or may be performed in a specific order. FIG. 5B and FIG. 5C exemplarily show that a fusion operation is performed on the encoding vector b ₁ and the encoding vectors b ₁ ′~b ₄ ′ to obtain the fusion vector m ₁ , and the other encoding vectors b ₂ -b ₄ can also be combined with the encoding vectors b ₁ '~b ₄ ' perform the same operation to obtain the fusion vectors m ₂ ~m ₄ , so there is no additional illustration. In addition, it should be noted that FIG. 5A to FIG. 5C only illustrate the number of encoding vectors, and the number of encoding vectors included in the original target text encoding and the number of encoding vectors included in the knowledge text encoding are not actually required. Consistent.

如圖5B所示，編碼融合器14可以依據原始目標文本編碼之編碼向量b₁~b₄，產生對應的多個查詢向量bq₁~bq₄，並依據知識文本編碼之編碼向量b₁’~b₄’，產生對應的多個鍵向量bk₁’~bk₄’及多個值向量bv₁’~bv₄’。進一步來說，表示查詢向量bq₁~bq₄、鍵向量bk₁’~bk₄’及值向量bv₁’~bv₄’的數學式可以表示如下：bq_i=W_bqb_i As shown in FIG. 5B , the code fusion unit 14 can generate a plurality of corresponding query vectors bq ₁ ˜bq ₄ according to the coding vectors b _{1 ˜b 4 coded by the original target text, and generate the corresponding query vectors b 1 ˜b 4 according to the coding vectors b 1} _′ ～b ₄ coded by the knowledge text b ₄ ', generating corresponding multiple key vectors bk ₁ '~bk ₄ ' and multiple value vectors bv ₁ '~bv ₄ '. Further, the mathematical expressions representing the query vector bq ₁ ~bq ₄ , the key vector bk ₁ '~bk ₄ ' and the value vector bv ₁ '~bv ₄ ' can be expressed as follows: bq _i =W _bq b _i

bk_i’=W_bkb_i’ bk _i '=W _bk b _i '

bv_i’=W_bvb_i’ bv _i '=W _bv b _i '

其中，W_bq、W_bk及W_bv為隨機給定之權重矩陣，可藉由分析機器閱讀理解系統1多次的執行表現來決定最佳值，進一步的最佳化流程將於後描述。 Among them, W _bq , W _bk and W _bv are randomly given weight matrices, and the optimal values can be determined by analyzing the execution performance of the machine reading comprehension system 1 for many times. Further optimization procedures will be described later.

接著，編碼融合器14將查詢向量bq₁分別與鍵向量bk₁’~bk₄’執行內積運算，以得到多個初始權重β_1,1'~β_1,4'。或者，在進行內積運算之後可以更除以查詢向量bq₁及鍵向量bk₁’~bk₄’所屬之維度，以得多個初始權重β_1,1'~β_1,4'，以數學式表示可以為：

Next, the code fusion unit 14 performs an inner product operation on the query vector bq ₁ and the key vector bk ₁ ' to bk ₄ ' respectively, so as to obtain a plurality of initial weights β _{1,1 '} to β _{1,4 '} . Or, after performing the inner product operation, it can be divided by the dimension to which the query vector bq ₁ and the key vector bk ₁ '~bk ₄ ' belong to obtain a plurality of initial weights β _{1,1 '} ~β _{1,4 '} , which can be calculated mathematically The formula can be expressed as:

其中d為查詢向量bq₁及鍵向量bk₁’~bk₄’所屬之維度。上述運算可視為原始目標文本編碼的編碼向量b₁分別與知識文本編碼的編碼向量b₁’~b₄’之間的相似度判斷。特別來說，編碼融合器14亦可以其他具有相似度判斷功能之函數來實施原始目標文本編碼與知識文本編碼之間的相似度判斷。 where d is the dimension to which the query vector bq ₁ and the key vector bk ₁ '~bk ₄ ' belong. The above operations can be regarded as the similarity judgment between the encoding vector b _{1 encoded by the original target text and the encoding vector b 1} _' ~b ₄ ' encoded by the knowledge text respectively. In particular, the code fusion unit 14 can also implement the similarity judgment between the original target text code and the knowledge text code by other functions having a similarity judgment function.

編碼融合器14再對初始權重β_1,1'~β_1,4'執行歸一化運算，以得到多個歸一化權重

~

~

The encoding fusion unit 14 then performs a normalization operation on the initial weights β _{1,1 ′} ~ β _{1,4 ′} to obtain a plurality of normalized weights

~

接著如圖5C所示，編碼融合器14將歸一化權重

~

與值向量bv₁’~bv₄’進行加權和運算，以得到加權和向量c₁，以數學式表示可以為：

Next, as shown in FIG. 5C, the code fuser 14 normalizes the weights

~

Perform a weighted sum operation with the value vector bv ₁ '~bv ₄ ' to obtain a weighted sum vector c ₁ , which can be expressed mathematically as:

編碼融合器14可以再將加權和向量c₁與對應的編碼向量b₁相加，並將相加結果作為融合向量m₁。或者，編碼融合器14可以將加權和向量c₁與對應的編碼向量b₁作串接，並將串接的結果作為融合向量m₁(變成兩倍維度，假設原本加權和向量c₁與編碼向量b₁各為d維，則串接兩者而產生之融合向量m₁為2d維)。融合向量m₂~m₄亦由編碼融合器14以上述運算方式產生。編碼融合器14可以組合融合向量m₁~m₄以形成矩陣，並將此矩陣作為強化目標文本編碼。 The code fusion unit 14 may then add the weighted sum vector c ₁ to the corresponding code vector b ₁ , and use the addition result as a fusion vector m ₁ . Alternatively, the code fusion unit 14 can concatenate the weighted sum vector c ₁ and the corresponding code vector b ₁ , and use the concatenated result as a fusion vector m ₁ (to double the dimension, assuming that the original weighted sum vector c ₁ and the code Each vector b ₁ is d-dimensional, then the fusion vector m ₁ produced by concatenating the two is 2d-dimensional). The fusion vectors m ₂ ˜m ₄ are also generated by the encoding fusion unit 14 in the above-mentioned operation manner. The encoding fuser 14 can combine the fusing vectors m ₁ to m ₄ to form a matrix, and encode the matrix as the reinforcement target text.

經上述編碼融合器14執行融合運算以將知識導入目標文本編碼而產生強化目標文本編碼之後，答案擷取器15可以基於強化目標文本編碼取得對應於問題文本的答案，並透過輸入輸出介面11輸出此答案，即圖2所示之步驟S6及步驟S7。進一步來說，答案擷取器15可以從強化目標文本編碼中擷取出對應於問題文本的答案。請參考圖1、圖6A及圖6B，其中圖6A及圖6B分別係依據本發明兩實施例所繪示的機器閱讀理解方法中之答案擷取作業的流程圖。 After the above-mentioned code fusion unit 14 performs a fusion operation to import knowledge into the target text code to generate the enhanced target text code, the answer extractor 15 can obtain the answer corresponding to the question text based on the enhanced target text code, and output it through the input-output interface 11 This answer is step S6 and step S7 shown in FIG. 2 . Further, the answer extractor 15 can extract the answer corresponding to the question text from the enhanced target text encoding. Please refer to FIG. 1 , FIG. 6A and FIG. 6B , wherein FIG. 6A and FIG. 6B are flowcharts of the answer retrieval operation in the machine reading comprehension method according to two embodiments of the present invention, respectively.

如圖6A所示，答案擷取器15所執行之答案擷取作業可以包含步驟S61：將強化目標文本編碼中對應於文章文本的部分編碼與起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；步驟S62：將所述部分編碼與結束分類向量執行矩陣運算及歸一化運算，以取得多個結束機率；步驟S63：依據起始機率中具有最大數值者，決定答案於所述部分編碼中的起始位置；以及步驟S64：依據結束機率中具有最大數值者，決定答案於所述部分編碼中的結束位置。 As shown in FIG. 6A , the answer extraction operation performed by the answer extractor 15 may include step S61 : performing matrix operation and normalization operation on the partial code corresponding to the article text in the enhanced target text code and the initial classification vector, to obtain a plurality of start probabilities; Step S62 : perform matrix operation and normalization operation on the partial encoding and the end classification vector to obtain a plurality of end probabilities; Step S63 : determine according to the one with the largest value among the start probabilities The starting position of the answer in the partial encoding; and step S64 : determining the ending position of the answer in the partial encoding according to the one with the largest value among the ending probabilities.

於步驟S61及步驟S62中，答案擷取器15將強化目標文本編碼中對應於文章文本的部分編碼分別與起始分類向量及結束分類向量進行矩陣運算(特別是內積運算)及歸一化運算而取得多個起始機率及多個結束機率。進一步來說，所述部分編碼係由編碼融合器14所取得之多個融合向量中對應於屬於文章文本之初始向量的融合向量所組成的向量矩陣。更進一步來說，各融合向量所對應之問題文本及文章文本，在問題文本及文章文本輸入時便具有指示符(例如是0/1遮罩)以表示其位置是屬於文章或是問題。步驟S61所述之運算可以表示如下列數學式：

In step S61 and step S62, the answer extractor 15 performs matrix operation (especially inner product operation) and normalization on the part of the code corresponding to the article text in the enhanced target text code, the start classification vector and the end classification vector, respectively. operation to obtain a plurality of start probabilities and a plurality of end probabilities. Further, the partial encoding is a vector matrix formed by the fusion vector corresponding to the initial vector belonging to the text of the article among the plurality of fusion vectors obtained by the encoding fusion unit 14 . Furthermore, the question text and article text corresponding to each fusion vector have an indicator (eg, a 0/1 mask) when the question text and article text are input to indicate whether the position belongs to the article or the question. The operation described in step S61 can be expressed as the following mathematical formula:

其中，

表示起始機率向量中的第i個起始機率，其中起始機率向量包含多個起始機率且每一起始機率表示所述部分編碼中所對應之融合向量作為答案起始位置的機率，S表示起始分類向量，T _i則表示所述部分編碼中的第i個融合向量。同理，步驟S62可以上列數學式表示，其中

替換為

以表示結束機率向量中的第i個結束機率，其包含多個結束機率且每一結束機率表示所述部分編碼中所對應之融合向量作為答案結束位置的機率，且S替換為E以表示結束分類向量。起始分類向量及結束分類向量為隨機給定之向量，可藉由分析機器閱讀理解系統1多次的執行表現來決定最佳向量，進一步的最佳化流程將於後描述。 in,

represents the ith starting probability in the starting probability vector, wherein the starting probability vector includes multiple starting probabilities and each starting probability represents the probability that the corresponding fusion vector in the partial code is used as the starting position of the answer, S represents the initial classification vector, and T _i represents the ith fusion vector in the partial encoding. Similarly, step S62 can be represented by the above mathematical formula, where

replace with

Denote the ith ending probability in the ending probability vector, which includes multiple ending probabilities and each ending probability represents the probability that the fusion vector corresponding to the partial encoding is used as the ending position of the answer, and S is replaced by E to indicate the ending categorical vector. The start classification vector and the end classification vector are randomly given vectors, and the optimal vector can be determined by analyzing the execution performance of the machine reading comprehension system 1 for many times. The further optimization process will be described later.

於步驟S63及S64中，答案擷取器15可以決定起始機率中具有最大數值者所對應的融合向量為答案的起始位置(即起始索引)，且決定結束機率中具有最大數值者所對應的融合向量為答案的結束位置(即結束索引)。舉例來說，若起始機率向量中的多個起始機率之數值依序為0.02、0.90、0.05、0.01及0.02，則答案擷取器15決定答案的起始位置對應於目標文本編碼中對應於文章文本的第二個融合向量。答案的結束位置之決定同理於起始位置之決定，於此不另舉例說明。 In steps S63 and S64, the answer extractor 15 can determine the fusion vector corresponding to the one with the largest value among the starting probabilities as the starting position (ie, the starting index) of the answer, and determine the one with the largest value among the ending probabilities. The corresponding fusion vector is the end position (ie end index) of the answer. For example, if the values of the multiple starting probabilities in the starting probability vector are 0.02, 0.90, 0.05, 0.01 and 0.02 in sequence, the answer extractor 15 determines that the starting position of the answer corresponds to the corresponding starting position in the target text encoding The second fusion vector for the article text. The determination of the ending position of the answer is the same as the determination of the starting position, and no further explanation is given here.

於此要特別說明的是，步驟S63執行於步驟S61之後，步驟S64執行於步驟S62之後，然而，本發明不限制步驟S61與步驟S62之間的先後關係，不限制步驟S61與步驟S64之間的先後關係，不限制步驟S62與步驟S63之間的先後關係，亦不限制步驟S63與步驟S64之間的先後關係。 It should be noted here that step S63 is executed after step S61, and step S64 is executed after step S62. However, the present invention does not limit the sequence relationship between step S61 and step S62. The sequence relationship between step S61 and step S64 is limited, the sequence relationship between step S62 and step S63 is not limited, and the sequence relationship between step S63 and step S64 is not limited.

答案擷取器15亦可以另一實施方式執行答案擷取作業。如圖6B所示，答案擷取作業可以包含步驟S61’：將強化目標文本編碼中對應於文章文本的部分編碼與起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；步驟S62’：將所述部分編碼與結束分類向量執行矩陣運算及歸一化運算，以取得多個結束機率；步驟S63’：依照起始機率之數值由大至小的順序，選擇前數個以作為多個候選起始機率；步驟S64’：依照結束機率之數值由大至小的順序，選擇前數個以作為多個候選結束機率；步驟S65’：將候選起始機率與候選結束機率進行配對以產生多個候選配對，其中每一候選配對中的候選起始機率所對應的位置先於候選結束機率所對應的位置；步驟S66’：計算每一候選配對中的候選起始機率與候選結束機率之和值或乘積；步驟S67’：依據候選配對中具有最大和值者或最大乘積者中的候選起始機率及候選結束機率，決定該答案於部分編碼中的起始位置及結束位置。 The answer extractor 15 can also perform the answer extraction operation in another embodiment. As shown in FIG. 6B , the answer retrieval operation may include step S61 ′: performing matrix operation and normalization operation on the partial code corresponding to the article text in the enhanced target text code and the initial classification vector, so as to obtain a plurality of initial probabilities ; Step S62 ': perform matrix operation and normalization operation with the described partial code and the end classification vector, to obtain a plurality of end probabilities; Step S63 ': according to the numerical value of the initial probability from large to small, select the first number as multiple candidate start probabilities; Step S64': select the first few as multiple candidate end probabilities according to the numerical value of the end probabilities in descending order; Step S65': compare the candidate start probabilities with the candidate end probabilities The probability is paired to generate a plurality of candidate pairs, wherein the position corresponding to the candidate start probability in each candidate pair is earlier than the position corresponding to the candidate end probability; Step S66 ′: Calculate the candidate start probability in each candidate pair and the sum or product of the candidate end probability; step S67 ′: according to the candidate start probability and the candidate end probability of the candidate pair with the largest sum or the largest product, determine the start position of the answer in the partial encoding and end position.

步驟S61’及S62’的進一步執行內容分別同於圖6A之步驟S61及S62，於此不再贅述。於步驟S63’及步驟S64’中，答案擷取器15先分別選擇前幾大的起始機率作為候選起始機率並選擇前幾大的結束機率作為候選結束機率，其中所選擇之候選起始/結束機率的數量例如為5，但不限於此。於步驟S65’中，答案擷取器15可以將候選起始機率與候選結束機率兩兩配對，並篩除掉候選起始機率所對應的位置位於候選結束機率所對應的位置之後的配對，以產生多個候選配對。換句話說，每一候選配對中的候選起始機率所對應的位置皆會先於候選結束機率所對應的位置。於步驟S66’及步驟S67’中，答案擷取器15將每一候選配對中的候選起始機率之值與候選結束機率之值相加或相乘，並決定具有最大和值或乘積的候選配對中的候選起始機率所對應的融合向量為答案的起始位置，且此後選配對中的候選結束機率所對應的融合向量為答案的結束位置。 The further execution contents of steps S61' and S62' are respectively the same as those of steps S61 and S62 in FIG. 6A , and will not be repeated here. In step S63 ′ and step S64 ′, the answer extractor 15 first selects the top starting probabilities as candidate starting probabilities and selects the top ending probabilities as candidate ending probabilities, wherein the selected candidate starting probabilities are selected. The number of /end probabilities is, for example, 5, but not limited thereto. In step S65', the answer extractor 15 may pair the candidate start probability with the candidate end probability, and filter out the pairing whose position corresponding to the candidate start probability is located after the position corresponding to the candidate end probability to Generate multiple candidate pairs. In other words, the position corresponding to the candidate start probability in each candidate pair is prior to the position corresponding to the candidate end probability. In step S66' and step S67', the answer extractor 15 The value of the candidate starting probability in the pairing and the value of the candidate ending probability are added or multiplied, and the fusion vector corresponding to the candidate starting probability in the candidate pairing with the largest sum or product is determined as the starting position of the answer, And then the fusion vector corresponding to the candidate end probability in the matching pair is the end position of the answer.

藉由圖6B所示之答案擷取作業的實施方式，答案擷取器15可以避免起始位置大於結束位置(即起始位置在結束位置之後)的狀況，進而提升答案的精準度。於此要特別說明的是，步驟S63’執行於步驟S61’之後且步驟S64’執行於步驟S62’之後，然而，本發明不限制步驟S61’與步驟S62’之間的先後關係，不限制步驟S61’與步驟S64’之間的先後關係，不限制步驟S62’與步驟S63’之間的先後關係，亦不限制步驟S63’與步驟S64’之間的先後關係。 With the implementation of the answer extraction operation shown in FIG. 6B , the answer extractor 15 can avoid the situation where the start position is greater than the end position (ie, the start position is after the end position), thereby improving the accuracy of the answer. It should be noted here that step S63' is executed after step S61' and step S64' is executed after step S62'. However, the present invention does not limit the sequence relationship between step S61' and step S62', and does not limit the steps The sequence relationship between S61' and step S64' does not limit the sequence relationship between step S62' and step S63', nor does it limit the sequence relationship between step S63' and step S64'.

另外，如前所述，語意編碼器13所執行之編碼作業的使用參數(例如權重矩陣W_aq、W_ak及W_av)、編碼融合器14所執行之融合運算的使用參數(權重矩陣W_bq、W_bk及W_bv)及答案擷取器15所執行之答案擷取作業的使用參數(起始分類向量及結束分類向量)可以藉由最佳化使用參數的流程以設定為最佳化之使用參數。特別來說，圖2所示之機器閱讀理解方法的步驟S2~S6可以為已經訓練過程訓練之機器閱讀理解系統1執行答案預測的過程，也可以為機器閱讀理解系統1的訓練過程中的一環，其中，所述訓練過程包含最佳化使用參數的流程。 In addition, as mentioned above, the usage parameters of the encoding operation performed by the semantic encoder 13 (such as the weight matrices W _aq , W _ak and W _av ), the usage parameters of the fusion operation performed by the encoding fuser 14 (the weight matrix W _bq , W _bk and W _bv ) and the use parameters (start classification vector and end classification vector) of the answer extraction operation performed by the answer extractor 15 can be set to be optimized through the process of optimizing the use parameters Use parameters. In particular, steps S2 to S6 of the machine reading comprehension method shown in FIG. 2 may be the process of performing answer prediction for the machine reading comprehension system 1 trained by the training process, or may be a part of the training process of the machine reading comprehension system 1 , wherein the training process includes a process of optimizing the parameters used.

請參考圖1、圖2及圖7，圖7係依據本發明一實施例所繪示的機器閱讀理解方法中之最佳化使用參數的流程圖。如圖7所示，最佳化使用參數的流程可以包含步驟S8：對多個第一訓練資料執行第一編碼作業、第二編碼作業、融合運算及答案擷取作業，以產生多個第一訓練答案，且依據所述多個第一訓練答案及損失函數計算第一損失值；步驟S9：依據第一損失值對第一編碼作業、第二編碼作業、融合運算及答案擷取作業之多個使用參數中的一或多者執行調整；步驟S10：於調整後，對多個第二訓練資料執行第一編碼作業、第二編碼作業、融合運算及答案擷取作業，以產生多個第二訓練答案，且依據所述多個第二訓練答案及損失函數計算第二損失值；步驟S11：依據第二損失值對第一編碼作業、第二編碼作業、融合運算及答案擷取作業之多個使用參數中的一或多者執行調整。其中，每個第一/第二訓練資料包含一問題文本及一文章文本，第一編碼作業包含前述實施例中的編碼問題文本及文章文本以產生原始目標文本編碼之步驟，第二編碼作業包含依據知識集產生第一知識文本及第二知識文本之步驟以及編碼第一知識文本及第二知識文本以產生知識文本編碼之步驟。也就是說，圖7之步驟S8可以包含對多個第一訓練資料中的每一個執行圖2之步驟S2~S6，圖7之步驟S10可以包含對多個第二訓練資料的每一個執行圖2之步驟S2~S6。 Please refer to FIG. 1 , FIG. 2 , and FIG. 7 . FIG. 7 is a flowchart of optimizing parameters used in a machine reading comprehension method according to an embodiment of the present invention. As shown in FIG. 7 , the process of optimizing the use parameters may include step S8 : performing a first encoding operation, a second encoding operation, a fusion operation and an answer retrieval operation on a plurality of first training data to generate a plurality of first training data training answers, and calculates a first loss value according to the plurality of first training answers and the loss function; step S9: according to the first loss value, the first coding operation, the second One or more of the multiple usage parameters of the encoding operation, the fusion operation and the answer retrieval operation are adjusted; Step S10 : after the adjustment, perform the first encoding operation, the second encoding operation, the fusion operation on the plurality of second training data Operations and answer retrieval operations are performed to generate a plurality of second training answers, and a second loss value is calculated according to the plurality of second training answers and the loss function; Step S11 : According to the second loss value, the first encoding operation, the second Adjustment is performed on one or more of the multiple usage parameters of the two encoding operations, fusion operations, and answer retrieval operations. Wherein, each of the first/second training data includes a question text and an article text, the first encoding operation includes the step of encoding the question text and article text in the foregoing embodiment to generate the original target text encoding, and the second encoding operation includes The step of generating the first knowledge text and the second knowledge text according to the knowledge set and the step of encoding the first knowledge text and the second knowledge text to generate the knowledge text encoding. That is, step S8 of FIG. 7 may include performing steps S2 to S6 of FIG. 2 for each of the plurality of first training materials, and step S10 of FIG. 7 may include performing the steps S2 to S6 of FIG. 2 for each of the plurality of second training materials. Steps S2 to S6 of 2.

步驟S8~S11可以由設置於機器閱讀理解系統1之外部或內部的處理裝置來執行。此處理裝置包含中央處理器(Central Processing Unit，CPU)、微控制器、可程式化邏輯控制器(Programmable Logic Controller，PLC)或其他處理器，連接於語意編碼器13、編碼融合器14及答案擷取器15，可以控制這些裝置以當前使用參數對多個第一訓練資料執行運作以產生多個第一訓練答案，依據所述多個第一訓練答案及損失函數產生第一損失值，且依據第一損失值對這些裝置之多個使用參數中的一或多者進行調整，控制這些裝置在使用參數調整後對多個第二訓練資料再次執行運作以產生多個第二訓練答案，並依據第二訓練答案及損失函數計算第二損失值，再依據第二損失值對所述多個使用參數中的一或多者進行調整，其中用以計算第一/第二損失值的損失函數可以表示如下列數學式：

Steps S8 to S11 may be performed by a processing device provided outside or inside the machine reading comprehension system 1 . The processing device includes a central processing unit (CPU), a microcontroller, a programmable logic controller (PLC) or other processors, and is connected to the semantic encoder 13, the code fusion unit 14 and the answer The extractor 15 can control these devices to perform operations on a plurality of first training data with currently used parameters to generate a plurality of first training answers, and generate a first loss value according to the plurality of first training answers and the loss function, and adjusting one or more of a plurality of usage parameters of the devices according to the first loss value, controlling the devices to perform operations on a plurality of second training data again after the usage parameters are adjusted to generate a plurality of second training answers, and Calculate a second loss value according to the second training answer and the loss function, and then adjust one or more of the plurality of usage parameters according to the second loss value, wherein the loss function used to calculate the first/second loss value It can be expressed as the following mathematical formula:

其中，

為表示正解起始位置之向量，

表示答案擷取器15所計算而得之起始機率向量，

為表示正解結束位置之向量，

表示答案擷取器15所計算而得之結束機率向量，N則表示用以產生訓練答案的訓練資料的數量。 in,

is the vector representing the starting position of the positive solution,

represents the initial probability vector calculated by the answer extractor 15,

is the vector representing the end position of the positive solution,

represents the ending probability vector calculated by the answer extractor 15, and N represents the number of training data used to generate the training answer.

在步驟S11後，處理裝置可以再對其他多個訓練資料執行步驟S10以計算出損失值，並以此損失值再次執行步驟S11，反覆執行多次。也就是說，處理裝置可以執行多次訓練，每次訓練計算而得之損失值可以作為下一次訓練前之使用參數調整的依據。進一步來說，處理裝置可以一批量(batch size)的訓練資料(第一訓練資料)及當前的使用參數來計算答案(第一訓練答案)，並依據答案計算損失值(第一損失值)，接著依據此損失值調整使用參數，再以另一批量的訓練資料(第二訓練資料)及調整後的使用參數來計算答案(第二訓練答案)及對應的損失值(第二損失值)，接著再以此損失值調整使用參數，以又一批量的訓練資料及調整後的使用參數來計算答案及對應的損失值，以此類推。舉例來說，假設訓練資料的總數為2560，每批32筆，則上述使用參數之調整及調整後的答案及損失值計算需執行80次才訓練完一輪(one epoch)。在訓練完一輪後，處理裝置更可以將所有訓練資料進行洗牌(shuffle)，再執行下一輪訓練。特別來說，要執行幾輪為超參數之設定，其數值可以藉由將訓練資料集中的部分資料保留下來作為驗證集(Validation set)並針對此驗證集的表現(例如損失值、EM或F1分數)來選擇。 After step S11 , the processing apparatus may perform step S10 on other multiple training data to calculate the loss value, and perform step S11 again with the loss value, and perform the steps repeatedly. That is to say, the processing device can perform multiple training sessions, and the loss value calculated for each training session can be used as a basis for adjusting the used parameters before the next training session. Further, the processing device can calculate the answer (the first training answer) with a batch size of training data (the first training data) and the current usage parameters, and calculate the loss value (the first loss value) according to the answer, Then, the usage parameters are adjusted according to the loss value, and another batch of training data (second training data) and the adjusted usage parameters are used to calculate the answer (second training answer) and the corresponding loss value (second loss value), Then, the usage parameters are adjusted with this loss value, and the answer and the corresponding loss value are calculated with another batch of training data and the adjusted usage parameters, and so on. For example, assuming that the total number of training data is 2560, and each batch is 32, the above-mentioned adjustment of the used parameters and the calculation of the adjusted answer and loss value need to be performed 80 times before one epoch of training is completed. After one round of training, the processing device may further shuffle all the training data, and then execute the next round of training. In particular, several rounds of hyperparameter setting are performed, the values of which can be determined by keeping part of the training data set Left as a Validation set and selected for performance on this validation set (e.g. loss value, EM or F1 score).

理論上，隨著訓練次數增加，使用參數會更符合於訓練資料，然而當使用參數太過符合(over fit)訓練資料時，對於新資料(待預測資料)之預測準確度反而可能下降，因此如上述將訓練資料集中的部分資料保留下來作為驗證集，並對驗證集進行預測以取得對應的預測表現，可以據以決定適當的訓練輪數。舉例來說，在一輪訓練完成後，處理裝置可以判斷驗證集的表現相較於上一輪訓練對應之驗證集表現是否更佳(例如損失值更低或EM/F1分數更高)。若本輪對應之驗證集表現相較於上一輪對應之驗證集表現更佳，則繼續下一輪訓練；而若更差或變化不大，便停止訓練。經上述訓練流程後之使用參數便可作為最佳化使用參數。 Theoretically, as the number of training increases, the parameters used will be more in line with the training data. However, when the parameters used are overfit to the training data, the prediction accuracy for new data (data to be predicted) may decrease. Therefore, As mentioned above, part of the data in the training data set is retained as the validation set, and the validation set is predicted to obtain the corresponding prediction performance, which can be used to determine the appropriate number of training rounds. For example, after a round of training is completed, the processing device may determine whether the performance of the validation set is better than that of the validation set corresponding to the previous round of training (eg, a lower loss value or a higher EM/F1 score). If the performance of the validation set corresponding to the current round is better than the performance of the validation set corresponding to the previous round, continue the next round of training; if it is worse or has little change, stop training. The usage parameters after the above training process can be used as optimized usage parameters.

所述用來進行訓練之問題文本及文章文本的來源可以為目標標記資料集，即系統預計訓練來執行預測的資料集，而用以產生知識文本的知識集的來源為目標標記資料集所對應(例如屬於同類型)的知識資料庫。於另一實施例中，在以目標標記資料集訓練之前，機器閱讀理解方法可以先以外部標記資料集及對應(例如屬於同類型)的知識資料庫來進行訓練，即將外部標記資料集作為問題文本及文章文本的來源，且將對應於外部標記資料集之知識資料庫作為知識集來源，以初次決定最佳化使用參數。舉例來說，假設標記資料集包含DRCD、CMRC 2018及CAIL 2019，則當目標資料集為DRCD時，可以先以CMRC 2018及CAIL 2019中的一或兩者作為訓練資料集而初次決定最佳化使用參數，再以DRCD作為訓練資料集而再次決定最佳化使用參數。藉由上述多次最佳化使用參數的過程，可以避免目標標記資料集之標記不完全而致使訓練結果不理想的問題。 The source of the question text and article text used for training can be the target labeled data set, that is, the data set that the system expects to train to perform prediction, and the source of the knowledge set used to generate the knowledge text is the target labeled data set. (e.g. of the same type) knowledge repositories. In another embodiment, before training with the target labeled data set, the machine reading comprehension method can be trained with an external labeled data set and a corresponding (for example, of the same type) knowledge database, that is, the external labeled data set is used as a question. The source of the text and article text, and the knowledge database corresponding to the external tag data set is used as the knowledge set source to initially determine the optimal use parameters. For example, assuming that the labeled data set includes DRCD, CMRC 2018 and CAIL 2019, when the target data set is DRCD, one or both of CMRC 2018 and CAIL 2019 can be used as the training data set for the initial determination of optimization Use parameters, and then use DRCD as a training data set to determine the optimal use parameters again. Using the above multiple optimizations The parameter process can avoid the problem that the labeling of the target labeling data set is incomplete and the training result is not ideal.

請參考圖8A及8B，圖8A及8B分別係現有機器閱讀理解方法及系統(multi-Bert)與本發明上述之機器閱讀理解方法及系統以兩種資料集進行訓練而得的實驗數據比較圖。於圖8A的實驗中，本案之機器閱讀理解方法及系統與現有機器閱讀理解方法及系統以法律領域之CAIL 2019資料集作為訓練資料來源，且本案之機器閱讀理解方法及系統更以OpenBase知識庫(非結構化知識)及HowNet知識庫(結構化知識)作為知識集來源。於圖8B的實驗中，本案之機器閱讀理解方法及系統與現有機器閱讀理解方法及系統以百科領域之DRCD資料集作為訓練資料來源，且本案之機器閱讀理解方法及系統更以HowNet知識庫作為知識集來源。 Please refer to FIGS. 8A and 8B . FIGS. 8A and 8B are comparison diagrams of experimental data obtained by training the existing machine reading comprehension method and system (multi-Bert) and the above-mentioned machine reading comprehension method and system of the present invention with two data sets, respectively. . In the experiment in Figure 8A, the machine reading comprehension method and system in this case and the existing machine reading comprehension method and system use the CAIL 2019 data set in the legal field as the training data source, and the machine reading comprehension method and system in this case use the OpenBase knowledge base. (unstructured knowledge) and HowNet knowledge base (structured knowledge) as knowledge set sources. In the experiment of Fig. 8B, the machine reading comprehension method and system of this case and the existing machine reading comprehension method and system use the DRCD data set in the field of encyclopedia as the training data source, and the machine reading comprehension method and system of this case use the HowNet knowledge base as the source of training data. Knowledge set sources.

圖8A及圖8B所示之實驗數據EM(Exact Match)表示預測答案與標準答案一致的比率(單位：%)，F1則是計算預測答案與標準答案單詞化後的準確分數。進一步來說，F1可以下列數學式表示：

The experimental data EM (Exact Match) shown in FIG. 8A and FIG. 8B represents the ratio (unit: %) between the predicted answer and the standard answer, and F1 is the exact score after calculating the predicted answer and the standard answer. Further, F1 can be expressed by the following mathematical formula:

其中，precision表示預測答案有多大比例的單詞在標準答案中出現，recall則表示標準答案中有多大比例的單詞在預測答案中出現。 Among them, precision indicates what proportion of the words in the predicted answer appear in the standard answer, and recall indicates what proportion of the words in the standard answer appear in the predicted answer.

如圖8A及圖8B所示，本案的機器閱讀理解方法及系統相較於現有機器閱讀理解方法及系統皆具有較高的EM及F1，即具有較高的答案預測精準度。本案的機器閱讀理解方法及系統在訓練資料量少時具有相當大的效能，表示在系統訓練初期能夠輔助標記人員加快資料標記的進行，甚至在僅1k之訓練資料的狀態下，可達到EM之值可達人為判斷水準80%，即可替代人為作業且維持相當的精準度。另外，F1分數亦可接近人類水平(F1分數：92)。 As shown in FIG. 8A and FIG. 8B , the machine reading comprehension method and system of the present application have higher EM and F1 than existing machine reading comprehension methods and systems, that is, higher answer prediction accuracy. The machine reading comprehension method and system in this case have considerable performance when the amount of training data is small, which means that in the early stage of system training, it can assist the labeler to speed up the data labeling, even in the case of only 1k training resources. In the state of raw materials, the value of EM can reach 80% of the human judgment level, which can replace manual work and maintain a considerable accuracy. In addition, the F1 score can also be close to human level (F1 score: 92).

雖然本發明以前述之實施例揭露如上，然其並非用以限定本發明。在不脫離本發明之精神和範圍內，所為之更動與潤飾，均屬本發明之專利保護範圍。關於本發明所界定之保護範圍請參考所附之申請專利範圍。 Although the present invention is disclosed in the foregoing embodiments, it is not intended to limit the present invention. Changes and modifications made without departing from the spirit and scope of the present invention belong to the scope of patent protection of the present invention. For the protection scope defined by the present invention, please refer to the attached patent application scope.

S1~S7:步驟 S1~S7: Steps

Claims

一種機器閱讀理解方法，包含由一機器閱讀理解系統執行：取得一問題文本及關聯於該問題文本的一文章文本；依據一知識集，產生對應於該問題文本的一第一知識文本及對應於該文章文本的一第二知識文本；編碼該問題文本及該文章文本以產生一原始目標文本編碼；編碼該第一知識文本及該第二知識文本以產生一知識文本編碼；對該原始目標文本編碼及該知識文本編碼執行融合運算，以將該知識集中的部分知識導入該原始目標文本編碼而產生一強化目標文本編碼；以及基於該強化目標文本編碼，取得對應於該問題文本的一答案，並輸出該答案。 A machine reading comprehension method, comprising executing by a machine reading comprehension system: obtaining a question text and an article text associated with the question text; generating a first knowledge text corresponding to the question text and corresponding to the question text according to a knowledge set A second knowledge text of the article text; encoding the question text and the article text to generate an original target text code; encoding the first knowledge text and the second knowledge text to generate a knowledge text code; the original target text performing a fusion operation on the code and the knowledge text code to import part of the knowledge in the knowledge set into the original target text code to generate an enhanced target text code; and obtaining an answer corresponding to the question text based on the enhanced target text code, and output that answer.

如請求項1所述之機器閱讀理解方法，其中依據該知識集，產生對應於該問題文本的該第一知識文本及對應於該文章文本的該第二知識文本包含：分別將該問題文本及該文章文本作為一待處理文本，執行：將該待處理文本分割為多個詞；依據該些詞查找該知識集，以取得至少一相關知識；當該至少一相關知識的數量為一時，依據該相關知識產生一目標知識文本；以及當該至少一相關知識的數量大於一時，依據該些詞的排列順序及一預設範本，組合該些相關知識以產生該目標知識文本；其中對應於該問題文本的該目標知識文本為該第一知識文本，且對應於該文章文本的該目標知識文本為該第二知識文本。 The machine reading comprehension method according to claim 1, wherein generating the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set comprises: the question text and The article text is regarded as a text to be processed, and the execution is performed: dividing the text to be processed into a plurality of words; searching the knowledge set according to the words to obtain at least one related knowledge; when the number of the at least one related knowledge is one, according to The related knowledge generates a target knowledge text; and when the number of the at least one related knowledge is greater than one, combining the related knowledge to generate the target knowledge text according to the arrangement order of the words and a preset template; The target knowledge text corresponding to the question text is the first knowledge text, and the target knowledge text corresponding to the article text is the second knowledge text.

如請求項2所述之機器閱讀理解方法，其中依據該知識集，產生對應於該問題文本的該第一知識文本及對應於該文章文本的該第二知識文本更包含：若該至少一相關知識屬於結構化知識，則在產生該目標知識文本之前，依據另一預設範本將該至少一相關知識之形式轉換為文字描述。 The machine reading comprehension method according to claim 2, wherein generating the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set further comprises: if the at least one relevant If the knowledge belongs to structured knowledge, before generating the target knowledge text, the form of the at least one related knowledge is converted into a text description according to another preset template.

如請求項1所述之機器閱讀理解方法，其中對該原始目標文本編碼及該知識文本編碼執行融合運算，以將該知識集中的該部分知識導入該原始目標文本編碼而產生該強化目標文本編碼包含：依據該原始目標文本編碼，產生多個查詢向量；依據該知識文本編碼，產生多個鍵向量及多個值向量；對每一該些查詢向量，執行：將該查詢向量分別與該些鍵向量執行內積運算，以得到多個初始權重；分別對該些初始權重執行歸一化運算，以得到多個歸一化權重；以及將該些歸一化向量與該些值向量執行加權和運算，以得到一加權和向量；以及依據每一該些查詢向量對應的該加權和向量，產生該強化目標文本編碼。 The machine reading comprehension method of claim 1, wherein a fusion operation is performed on the original target text code and the knowledge text code to import the partial knowledge in the knowledge set into the original target text code to generate the enhanced target text code Including: generating a plurality of query vectors according to the original target text encoding; generating a plurality of key vectors and a plurality of value vectors according to the knowledge text encoding; for each of the query vectors, executing: the query vector and the Perform an inner product operation on the key vectors to obtain a plurality of initial weights; respectively perform a normalization operation on the initial weights to obtain a plurality of normalized weights; and perform weighting on the normalized vectors and the value vectors performing a sum operation to obtain a weighted sum vector; and generating the enhanced target text code according to the weighted sum vector corresponding to each of the query vectors.

如請求項4所述之機器閱讀理解方法，其中該原始目標文本編碼包含多個編碼向量，分別對應於該些查詢向量，且依據每一該些查詢向量對應的該加權和向量，產生該強化目標文本編碼包含：將每一該些查詢向量對應的該加權和向量與對應的該編碼向量相加或串接，以得到多個融合向量；以及組合該些融合向量以產生該強化目標文本編碼。 The machine reading comprehension method of claim 4, wherein the original target text code includes a plurality of code vectors, respectively corresponding to the query vectors, and the enhancement is generated according to the weighted sum vector corresponding to each of the query vectors The target text encoding includes: adding or concatenating the weighted sum vector corresponding to each of the query vectors and the corresponding encoding vector to obtain a plurality of fusion vectors; and combining the fusion vectors to generate the enhanced target text encoding .

如請求項1所述之機器閱讀理解方法，其中編碼該問題文本及該文章文本包含：將該問題文本與該文章文本之組合作為一編碼運算的一執行對象，編碼該第一知識文本及該第二知識文本包含：將該第一知識文本與該第二知識文本之組合作為該編碼運算的該執行對象，且該編碼運算包含：將該執行對象分割為多個單字；取得該些單字分別對應的多個初始向量；以及組合該些初始向量以產生該原始目標文本編碼或該知識文本編碼。 The machine reading comprehension method according to claim 1, wherein encoding the question text and the article text comprises: taking the combination of the question text and the article text as an execution object of an encoding operation, encoding the first knowledge text and the article text The second knowledge text includes: the combination of the first knowledge text and the second knowledge text is used as the execution object of the encoding operation, and the encoding operation includes: dividing the execution object into a plurality of single words; a plurality of corresponding initial vectors; and combining the initial vectors to generate the original target text code or the knowledge text code.

如請求項1所述之機器閱讀理解方法，其中編碼該問題文本及該文章文本包含：將該問題文本與該文章文本之組合作為一編碼運算的一執行對象，編碼該第一知識文本及該第二知識文本包含：將該第一知識文本與該第二知識文本之組合作為該編碼運算的該執行對象，且該編碼運算包含：將該執行對象分割為多個單字；取得該些單字分別對應的多個初始向量；依據該些初始向量，產生對應的多個查詢向量、多個鍵向量及多個值向量；對每一該些查詢向量，執行：將該查詢向量分別與該些鍵向量執行內積運算，以得到多個初始權重；分別對該些初始權重執行歸一化運算，以得到多個歸一化權重；以及將該些歸一化權重與該些值向量執行加權和運算，以得到一加權和向量；以及依據每一該些查詢向量對應的該加權和向量，產生該些編碼向量；以及組合該些編碼向量以產生該原始目標文本編碼或該知識文本編碼。 The machine reading comprehension method according to claim 1, wherein encoding the question text and the article text comprises: taking the combination of the question text and the article text as an execution object of an encoding operation, encoding the first knowledge text and the article text The second knowledge text includes: the combination of the first knowledge text and the second knowledge text is used as the execution object of the encoding operation, and the encoding operation includes: dividing the execution object into a plurality of single words; Corresponding multiple initial vectors; According to these initial vectors, generate corresponding multiple query vectors, multiple key vectors and multiple value vectors; for each of these query vectors, execute: respectively perform an inner product operation on the query vector and the key vectors to obtain obtaining a plurality of initial weights; performing a normalization operation on the initial weights respectively to obtain a plurality of normalization weights; and performing a weighted sum operation on the normalized weights and the value vectors to obtain a weighted sum and generating the encoding vectors according to the weighted sum vector corresponding to each of the query vectors; and combining the encoding vectors to generate the original target text encoding or the knowledge text encoding.

如請求項1所述之機器閱讀理解方法，其中基於該強化目標文本編碼，取得對應於該問題文本的該***含：將該強化目標文本編碼中對應於該文章文本的部分編碼與一起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；將該部分編碼與一結束分類向量執行該矩陣運算及該歸一化運算，以取得多個結束機率；依據該些起始機率中具有最大數值者，決定該答案於該部分編碼中的起始位置；以及依據該些結束機率中具有最大數值者，決定該答案於該部分編碼中的結束位置。 The machine reading comprehension method according to claim 1, wherein, based on the enhanced target text encoding, obtaining the answer corresponding to the question text comprises: combining a partial encoding of the enhanced target text encoding corresponding to the article text with an initial classification performing a matrix operation and a normalization operation on the vector to obtain a plurality of starting probabilities; performing the matrix operation and the normalizing operation on the partial code and an end classification vector to obtain a plurality of ending probabilities; according to the starting probabilities The one with the largest value among the probabilities determines the starting position of the answer in the partial code; and the one with the largest value among the ending probabilities determines the ending position of the answer in the partial code.

如請求項1所述之機器閱讀理解方法，其中基於該強化目標文本編碼，取得對應於該問題文本的該***含：將該強化目標文本編碼中對應於該文章文本的部分編碼與一起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；將該部分編碼與一結束分類向量執行該矩陣運算及該歸一化運算，以取得多個結束機率；依照該些起始機率之數值由大至小的順序，選擇前數個以作為多個候選起始機率；依照該些結束機率之數值由大至小的順序，選擇前數個以作為多個候選結束機率；將該些候選起始機率與該些候選結束機率進行配對以產生多個候選配對，其中每一該些候選配對中的該候選起始機率所對應的位置先於該候選結束機率所對應的位置；計算每一該些候選配對中的該候選起始機率與該候選結束機率之和值或乘積；以及依據該些候選配對中具有最大和值者或最大乘積者中的該候選起始機率及該候選結束機率，決定該答案於該部分編碼中的起始位置及結束位置。 The machine reading comprehension method according to claim 1, wherein, based on the enhanced target text encoding, obtaining the answer corresponding to the question text comprises: combining a partial encoding of the enhanced target text encoding corresponding to the article text with an initial classification performing a matrix operation and a normalization operation on the vector to obtain a plurality of starting probabilities; performing the matrix operation and the normalizing operation on the partial code and an end classification vector to obtain a plurality of ending probabilities; according to the starting probabilities In descending order of probability values, select the first few as multiple candidate starting probabilities; according to the descending order of these ending probabilities, select the first few as multiple candidate ending probabilities; The candidate start probabilities are paired with the candidate end probabilities to generate a plurality of candidate pairs, wherein the position corresponding to the candidate start probability in each of the candidate pairs precedes the position corresponding to the candidate end probability; calculating the sum or product of the candidate starting probability and the candidate ending probability in each of the candidate pairs; and according to the candidate starting probability and the The candidate ending probability determines the starting position and ending position of the answer in the partial code.

如請求項1所述之機器閱讀理解方法，更包含由該機器閱讀理解系統執行：對多個第一訓練資料執行一第一編碼作業、一第二編碼作業、該融合運算及一答案擷取作業，以產生多個第一訓練答案，且依據該些第一訓練答案及一損失函數計算一第一損失值；依據該第一損失值，對該第一編碼作業、該第二編碼作業、該融合運算及該答案擷取作業之多個使用參數中的一或多者執行調整；於該調整後，對多個第二訓練資料執行該第一編碼作業、該第二編碼作業、該融合運算及該答案擷取作業，以產生多個第二訓練答案，並依據該些第二訓練答案及該損失函數計算一第二損失值；以及依據該第二損失值，對該些使用參數中的一或多者執行調整；其中該第一編碼作業包含編碼該問題文本及該文章文本，該第二編碼作業包含產生及編碼該第一知識文本及該第二知識文本，且該答案擷取作業包含取得對應於該問題文本的該答案。 The machine reading comprehension method according to claim 1, further comprising: executing, by the machine reading comprehension system: a first encoding operation, a second encoding operation, the fusion operation and an answer retrieval on a plurality of first training data an operation to generate a plurality of first training answers, and calculate a first loss value according to the first training answers and a loss function; According to the first loss value, adjust one or more of the multiple usage parameters of the first encoding operation, the second encoding operation, the fusion operation, and the answer retrieval operation; Execute the first encoding operation, the second encoding operation, the fusion operation, and the answer retrieval operation with the second training data to generate a plurality of second training answers, and calculate according to the second training answers and the loss function a second loss value; and adjusting one or more of the usage parameters according to the second loss value; wherein the first encoding operation includes encoding the question text and the article text, and the second encoding operation includes The first knowledge text and the second knowledge text are generated and encoded, and the answer retrieval operation includes obtaining the answer corresponding to the question text.

一種機器閱讀理解系統，包含：一輸入輸出介面，用於取得一問題文本及關聯於該問題文本的一文章文本；一知識文本產生器，連接於該輸入輸出介面，且用於依據一知識集取得對應於該問題文本的一第一知識文本及對應於該文章文本的一第二知識文本；一語意編碼器，連接於該輸入輸出介面及該知識文本產生器，且用於編碼該問題文本及該文章文本以產生一原始目標文本編碼，以及編碼該第一知識文本及該第二知識文本以產生一知識文本編碼；一編碼融合器，連接於該語意編碼器，且用於對該原始目標文本編碼及該知識文本編碼執行融合運算，以將該知識集中的部分知識導入該原始目標文本編碼而產生一強化目標文本編碼；一答案擷取器，連接於該編碼融合器及該輸入輸出介面，且用於基於該強化目標文本編碼，取得對應於該問題文本的一答案，並透過該輸入輸出介面輸出該答案。 A machine reading comprehension system, comprising: an input and output interface for obtaining a question text and an article text related to the question text; a knowledge text generator, connected to the input and output interface, and used for according to a knowledge set obtaining a first knowledge text corresponding to the question text and a second knowledge text corresponding to the article text; a semantic encoder connected to the input-output interface and the knowledge text generator, and used for encoding the question text and the article text to generate an original target text code, and encode the first knowledge text and the second knowledge text to generate a knowledge text code; an encoding fusion unit, connected to the semantic encoder, and used for the original The target text code and the knowledge text code perform a fusion operation to import part of the knowledge in the knowledge set into the original target text code to generate an enhanced target text code; An answer extractor is connected to the code fusion unit and the input-output interface, and is used for obtaining an answer corresponding to the question text based on the enhanced target text code, and outputting the answer through the input-output interface.

如請求項11所述之機器閱讀理解系統，其中該知識文本產生器所執行之依據該知識集，產生對應於該問題文本的該第一知識文本及對應於該文章文本的該第二知識文本包含：分別將該問題文本及該文章文本作為一待處理文本，執行：將該待處理文本分割為多個詞；依據該些詞查找該知識集，以取得至少一相關知識；當該至少一相關知識的數量為一時，依據該相關知識產生一目標知識文本；以及當該至少一相關知識的數量大於一時，依據該些詞的排列順序及一預設範本，組合該些相關知識以產生該目標知識文本；其中對應於該問題文本的該目標知識文本為該第一知識文本，且對應於該文章文本的該目標知識文本為該第二知識文本。 The machine reading comprehension system of claim 11, wherein the knowledge text generator is executed to generate the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set Including: respectively taking the question text and the article text as a text to be processed, and executing: dividing the text to be processed into a plurality of words; searching the knowledge set according to the words to obtain at least one relevant knowledge; when the at least one When the number of related knowledge is one, generate a target knowledge text according to the related knowledge; and when the number of the at least one related knowledge is greater than one, combine the related knowledge to generate the related knowledge according to the arrangement order of the words and a preset template target knowledge text; wherein the target knowledge text corresponding to the question text is the first knowledge text, and the target knowledge text corresponding to the article text is the second knowledge text.

如請求項12所述之機器閱讀理解系統，其中該知識文本產生器所執行之依據該知識集，產生對應於該問題文本的該第一知識文本及對應於該文章文本的該第二知識文本更包含：若該至少一相關知識屬於結構化知識，則在產生該目標知識文本之前，依據另一預設範本將該至少一相關知識之形式轉換為文字描述。 The machine reading comprehension system of claim 12, wherein the knowledge text generator executes to generate the first knowledge text corresponding to the question text and the second knowledge text corresponding to the article text according to the knowledge set It further includes: if the at least one related knowledge is structured knowledge, before generating the target knowledge text, converting the form of the at least one related knowledge into a text description according to another preset template.

如請求項11所述之機器閱讀理解系統，其中該編碼融合器所執行之對該原始目標文本編碼及該知識文本編碼執行融合運算，以將該知識集中的該部分知識導入該原始目標文本編碼而產生該強化目標文本編碼包含：依據該原始目標文本編碼，產生多個查詢向量；依據該知識文本編碼，產生多個鍵向量及多個值向量；對每一該些查詢向量，執行：將該查詢向量分別與該些鍵向量執行內積運算，以得到多個初始權重；分別對該些初始權重執行歸一化運算，以得到多個歸一化權重；以及將該些歸一化向量與該些值向量執行加權和運算，以得到一加權和向量；以及依據每一該些查詢向量對應的該加權和向量，產生該強化目標文本編碼。 The machine reading comprehension system of claim 11, wherein the code fusion unit performs a fusion operation on the original target text code and the knowledge text code to combine The part of the knowledge in the knowledge set is imported into the original target text code to generate the enhanced target text code including: generating a plurality of query vectors according to the original target text code; generating a plurality of key vectors and a plurality of values according to the knowledge text code vector; for each of these query vectors, execute: perform an inner product operation on the query vector and these key vectors respectively to obtain a plurality of initial weights; respectively perform a normalization operation on these initial weights to obtain a plurality of initial weights normalizing weights; and performing a weighted sum operation on the normalized vectors and the value vectors to obtain a weighted sum vector; and generating the enhanced target text according to the weighted sum vector corresponding to each of the query vectors coding.

如請求項14所述之機器閱讀理解系統，其中該原始目標文本編碼包含多個編碼向量，分別對應於該些查詢向量，且該編碼融合器所執行之依據每一該些查詢向量對應的該加權和向量，產生該強化目標文本編碼包含：將每一該些查詢向量對應的該加權和向量與對應的該編碼向量相加或串接，以得到多個融合向量；以及組合該些融合向量以產生該強化目標文本編碼。 The machine reading comprehension system of claim 14, wherein the original target text encoding comprises a plurality of encoding vectors, respectively corresponding to the query vectors, and the encoding fusion is executed according to the corresponding query vector. A weighted sum vector, generating the enhanced target text encoding includes: adding or concatenating the weighted sum vector corresponding to each of the query vectors and the corresponding encoding vector to obtain a plurality of fusion vectors; and combining the fusion vectors to generate the enhanced target text encoding.

如請求項11所述之機器閱讀理解系統，其中該語意編碼器所執行之編碼該問題文本與該文章文本係將該問題文本與該文章文本之組合作為一編碼運算的一執行對象，該語意編碼器所執行之編碼該第一知識文本及該第二知識文本係將該第一知識文本與該第二知識文本之組合作為該編碼運算的該執行對象，且該編碼運算包含：將該執行對象分割為多個單字；取得該些單字分別對應的多個初始向量；以及組合該些初始向量以產生該原始目標文本編碼或該知識文本編碼。 The machine reading comprehension system of claim 11, wherein the encoding of the question text and the article text performed by the semantic encoder is a combination of the question text and the article text as an execution object of an encoding operation, the semantic The encoding performed by the encoder is the first known The knowledge text and the second knowledge text are the combination of the first knowledge text and the second knowledge text as the execution object of the encoding operation, and the encoding operation includes: dividing the execution object into a plurality of single words; obtaining the and combining the initial vectors to generate the original target text code or the knowledge text code.

如請求項11所述之機器閱讀理解系統，其中該語意編碼器所執行之編碼該問題文本與該文章文本係將該問題文本與該文章文本之組合作為一編碼運算的一執行對象，該語意編碼器所執行之編碼該第一知識文本及該第二知識文本係將該第一知識文本與該第二知識文本之組合作為該編碼運算的該執行對象，且該編碼運算包含：將該執行對象分割為多個單字；取得該些單字分別對應的多個初始向量；依據該些初始向量，產生對應的多個查詢向量、多個鍵向量及多個值向量；對每一該些查詢向量，執行：將該查詢向量分別與該些鍵向量執行內積運算，以得到多個初始權重；分別對該些初始權重執行歸一化運算，以得到多個歸一化權重；以及將該些歸一化權重與該些值向量執行加權和運算，以得到一加權和向量；以及依據每一該些查詢向量對應的該加權和向量，產生該些編碼向量；以及組合該些編碼向量以產生該原始目標文本編碼或該知識文本編碼。 The machine reading comprehension system of claim 11, wherein the encoding of the question text and the article text performed by the semantic encoder is a combination of the question text and the article text as an execution object of an encoding operation, the semantic The encoding of the first knowledge text and the second knowledge text performed by the encoder takes the combination of the first knowledge text and the second knowledge text as the execution object of the encoding operation, and the encoding operation includes: executing the The object is divided into a plurality of words; a plurality of initial vectors corresponding to the words are obtained; according to the initial vectors, a plurality of corresponding query vectors, a plurality of key vectors and a plurality of value vectors are generated; for each of the query vectors , execute: respectively perform inner product operation on the query vector and these key vectors to obtain a plurality of initial weights; respectively perform a normalization operation on these initial weights to obtain a plurality of normalized weights; performing a weighted sum operation on the normalized weights with the value vectors to obtain a weighted sum vector; and generating the encoding vectors according to the weighted sum vector corresponding to each of the query vectors; and combining the encoding vectors to generate the original target text encoding or the knowledge text encoding.

如請求項11所述之機器閱讀理解系統，其中該答案擷取器所執行之基於該強化目標文本編碼，取得對應於該問題文本的該***含：將該強化目標文本編碼中對應於該文章文本的部分編碼與一起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；將該部分編碼與一結束分類向量執行該矩陣運算及該歸一化運算，以取得多個結束機率；依據該些起始機率中具有最大數值者，決定該答案於該部分編碼中的起始位置；以及依據該些結束機率中具有最大數值者，決定該答案於該部分編碼中的結束位置。 The machine reading comprehension system of claim 11, wherein obtaining the answer corresponding to the question text based on the enhanced target text encoding executed by the answer extractor comprises: corresponding to the article in the enhanced target text encoding A matrix operation and a normalization operation are performed on the partial code of the text and an initial classification vector to obtain a plurality of initial probabilities; the matrix operation and the normalization operation are performed on the partial code and an end classification vector to obtain a plurality of End probability; determine the starting position of the answer in the partial code according to the one with the largest value among the starting probabilities; and determine the ending of the answer in the partial code according to the one with the largest value among the ending probabilities Location.

如請求項11所述之機器閱讀理解系統，其中該答案擷取器所執行之基於該強化目標文本編碼，取得對應於該問題文本的該***含：將該強化目標文本編碼中對應於該文章文本的部分編碼與一起始分類向量執行矩陣運算及歸一化運算，以取得多個起始機率；將該部分編碼與一結束分類向量執行該矩陣運算及該歸一化運算，以取得多個結束機率；依照該些起始機率之數值由大至小的順序，選擇前數個以作為多個候選起始機率；依照該些結束機率之數值由大至小的順序，選擇前數個以作為多個候選結束機率；將該些候選起始機率與該些候選結束機率進行配對以產生多個候選配對，其中每一該些候選配對中的該候選起始機率所對應的位置先於該候選結束機率所對應的位置；計算每一該些候選配對中的該候選起始機率與該候選結束機率之和值或乘積；以及依據該些候選配對中具有最大和值者或最大乘積者中的該候選起始機率及該候選結束機率，決定該答案於該部分編碼中的起始位置及結束位置。 The machine reading comprehension system of claim 11, wherein obtaining the answer corresponding to the question text based on the enhanced target text encoding executed by the answer extractor comprises: corresponding to the article in the enhanced target text encoding A matrix operation and a normalization operation are performed on the partial code of the text and an initial classification vector to obtain a plurality of initial probabilities; the matrix operation and the normalization operation are performed on the partial code and an end classification vector to obtain a plurality of probability of ending; According to the numerical value of these starting probabilities in descending order, select the first few as multiple candidate starting probabilities; according to the descending order of the ending probabilities, select the first few as multiple candidate starting probabilities candidate end probabilities; pairing the candidate start probabilities with the candidate end probabilities to generate a plurality of candidate pairs, wherein the position corresponding to the candidate start probability in each of the candidate pairs precedes the candidate end probability corresponding position; calculating the sum or product of the candidate start probability and the candidate end probability in each of the candidate pairs; and according to the candidate of the candidate pairs with the largest sum or the largest product The start probability and the candidate end probability determine the start position and end position of the answer in the partial code.

如請求項11所述之機器閱讀理解系統，更包含一處理裝置，該處理裝置連接於該語意編碼器、該編碼融合器及該答案擷取器，且用於控制該語意編碼器、該編碼融合器及該答案擷取器對多個第一訓練資料執行運作以產多個生第一訓練答案，依據該些第一訓練答案及一損失函數計算一第一損失值，依據該第一損失值對該語意編碼器、該編碼融合器及該答案擷取器之多個使用參數中的一或多者執行調整，於該調整後控制該語意編碼器、該編碼融合器及該答案擷取器對多個第二訓練資料執行運作以產生多個第二訓練答案並依據該些第二訓練答案及該損失函數計算一第二損失值，且依據該第二損失值調整該些使用參數中的一或多者。 The machine reading comprehension system of claim 11, further comprising a processing device, the processing device is connected to the semantic encoder, the code fusion device and the answer extractor, and is used for controlling the semantic encoder, the encoding The fusion unit and the answer extractor operate on a plurality of first training data to generate a plurality of first training answers, calculate a first loss value according to the first training answers and a loss function, and calculate a first loss value according to the first loss The value performs adjustment of one or more of the plurality of usage parameters of the semantic encoder, the encoding fuser, and the answer extractor, and after the adjustment controls the semantic encoder, the encoding fusion device, and the answer extraction The processor operates on a plurality of second training data to generate a plurality of second training answers, calculates a second loss value according to the second training answers and the loss function, and adjusts the use parameters according to the second loss value one or more of.

如請求項11所述之機器閱讀理解系統，其中該輸入輸出介面更用於輸出該知識集的至少一部分。 The machine reading comprehension system of claim 11, wherein the input-output interface is further configured to output at least a portion of the knowledge set.