WO2022127425A1 - Question assistance method, apparatus and system - Google Patents

Question assistance method, apparatus and system Download PDF

Info

Publication number
WO2022127425A1
WO2022127425A1 PCT/CN2021/128829 CN2021128829W WO2022127425A1 WO 2022127425 A1 WO2022127425 A1 WO 2022127425A1 CN 2021128829 W CN2021128829 W CN 2021128829W WO 2022127425 A1 WO2022127425 A1 WO 2022127425A1
Authority
WO
WIPO (PCT)
Prior art keywords
solving
topic
display
content
question
Prior art date
Application number
PCT/CN2021/128829
Other languages
French (fr)
Chinese (zh)
Inventor
何涛
罗欢
陈明权
Original Assignee
杭州大拿科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州大拿科技股份有限公司 filed Critical 杭州大拿科技股份有限公司
Publication of WO2022127425A1 publication Critical patent/WO2022127425A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Definitions

  • the present disclosure relates to the technical field of artificial intelligence, and in particular, to a topic assistance method, device and system.
  • the purpose of the present disclosure is to provide a topic assistance method and a topic assistance system.
  • a problem assistance method including: acquiring a problem image, and identifying the content of the problem according to the image; generating a problem-solving answer and a problem-solving process according to the problem content, wherein the problem-solving process includes solving The problem-solving step or the problem-solving process includes the problem-solving step and the step description of at least part of the problem-solving step; the problem-solving process is set in the corresponding display level; and the problem-solving process is displayed according to at least one of the default display rules and the received display instructions The corresponding display level.
  • a topic assistance device comprising a memory, a processor, and instructions stored on the memory, and when the instructions are executed by the processor, the above is achieved The steps of the topic-assisted method described.
  • a topic assistance system includes a user terminal and a server, wherein: the user terminal is configured to acquire a topic image; At least one of them is configured to identify the question content according to the question image; the server is configured to generate a question-solving answer and a problem-solving process according to the question content, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a problem-solving process.
  • Step descriptions of at least part of the problem solving steps at least one of the user terminal and the server is further configured to set the problem solving process in a corresponding display level; and the user terminal is configured to display according to a default at least one of the rules and the received display instruction to display the corresponding display level.
  • a non-transitory computer-readable storage medium is provided, and instructions are stored on the non-transitory computer-readable storage medium, and when the instructions are executed by the processor, the above-mentioned implementation is achieved The steps of the topic-assisted method described.
  • a computer program product comprising instructions which, when executed by the processor, implement the steps of the topic assistance method as described above.
  • FIG. 1 schematically shows a flow chart of a topic assistance method according to an exemplary embodiment of the present disclosure.
  • FIG. 2 is a schematic flowchart of step S100 in the topic assistance method of FIG. 1 .
  • FIG. 3 is a schematic flowchart of at least part of the process of step S200 in the topic assistance method of FIG. 1 .
  • FIG. 4 is a diagram schematically illustrating a display hierarchy structure according to a specific embodiment of the present disclosure.
  • FIG. 5 is a schematic flowchart of step S300 in the topic assistance method of FIG. 1 .
  • FIG. 6 is a block diagram schematically illustrating a topic assisting apparatus according to an exemplary embodiment of the present disclosure.
  • FIG. 7 is a block diagram schematically illustrating a topic assistance system according to an exemplary embodiment of the present disclosure.
  • the present disclosure provides a topic assistance method that can be used, for example, in learning and teaching.
  • the corresponding problem-solving process can be set and displayed according to the display level, so as to help the user to select the display mode of the problem-solving process according to their own needs, so as to realize flexible interaction and improve the learning and teaching effect. .
  • the topic assistance method may include:
  • step S100 a subject image is acquired, and the subject content is identified according to the subject image.
  • step S100 may include:
  • step S110 an image acquisition device is used to acquire the subject image.
  • the image acquisition device can take pictures or videos of the subject that needs to be assisted, so as to acquire the subject image.
  • the image acquisition device may include a camera, an imaging module, an image processing module, etc., and may also include a communication module for receiving or downloading images.
  • Title images can include any form of visual presentation, such as photographs or videos.
  • identifying the subject content according to the subject image may also include:
  • step S120 the area identification model deployed in the user terminal is used to determine the topic area where the topic content in the topic image is located, and the number of topics in the topic area is determined.
  • the area identification model may include a first neural network model, the input of the first neural network model is the subject image, and the output is the subject area where the subject content in the subject image is located.
  • the first neural network model can be pre-trained by any known method by using a large number of training samples and according to the above-mentioned input and output. For example, it can be obtained by training through the following process: establishing a training set of subject image samples, wherein each subject image sample includes at least one subject. Perform labeling processing on each title image sample to mark the position of the title area where at least one title in each title image sample is located; and train the first neural network through the labelled title image sample training set to obtain Obtain the first neural network model.
  • the first neural network may be any known neural network, such as a deep residual network, a recurrent neural network, or the like.
  • Training the first neural network may further include: testing the output accuracy of the trained first neural network model based on the test set of subject image samples; if the output accuracy is less than a predetermined first threshold, adding subject image samples The number of subject image samples in the training set, and each subject image sample in the increased subject image samples has undergone the above labeling processing; and through the subject image sample training set after the number of subject image samples has been increased, the first neural network to train. Then, the output accuracy of the retrained first neural network model is tested again based on the subject image sample test set until the output accuracy of the first neural network model meets the requirements, that is, is not less than a predetermined first threshold.
  • one or more subject image samples in the subject image sample training set can be placed in the subject image sample test set, or one or more subject image samples in the subject image sample test set can be placed in the subject image sample test set as required. to the subject image sample training set.
  • the area identification model can further acquire the number of topics contained in the determined topic area for further processing.
  • identifying the subject content according to the subject image may also include:
  • Step S131 when the number of questions is equal to one, adopt the first character recognition model deployed in the user terminal to identify the title content in the title area, and upload the identified title content to the server;
  • Step S132 when the number of questions is greater than one, upload at least the question area in the title image to the server, and use the second character recognition model deployed on the server to identify the title content.
  • the first character recognition model or the second character recognition model may include a second neural network model
  • the input of the second neural network model is the subject area in the subject image (for example, the subject area cut out from the complete subject image)
  • the output is the characters in the title area, that is, the title content.
  • the characters referred to herein include characters (including text characters, graphic characters, letters, numbers, symbols, etc.), pictures, and the like.
  • the second neural network model can be pre-trained by any known method using a large number of training samples, according to the above-mentioned input and output. For example, it can be obtained by training through the following process: establishing a training set of subject image samples, where each subject image sample is a subject area. Perform labeling processing on each title image sample to mark title characters in the title area in each title image sample; and train the second neural network through the labelled title image sample training set to obtain a second neural network model.
  • the second neural network can be any known neural network.
  • the training of the second neural network may further include verifying the output accuracy of the second neural network model with a test set of subject image samples. If the accuracy does not meet the requirements You can increase the number of samples in the training set of subject image samples and re-train.
  • the area recognition model and the relatively simple first character recognition model for single-question recognition are deployed in the local user terminal, while the relatively complex second character recognition model for multi-question recognition is used for
  • the model is deployed in the server, which helps to increase the recognition speed as much as possible and improve the recognition efficiency on the premise of avoiding occupying the limited resources of the user terminal.
  • both the region recognition model and the character recognition model may be deployed in the server. That is to say, regardless of whether the number of questions in the topic area is one or more, the acquired topic image will be uploaded from the user terminal to the server, and then recognized by the area recognition model and character recognition model in the server. deal with.
  • topic assistance method can also include:
  • Step S200 generating a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or a problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step.
  • the problem-solving steps in the problem-solving process represent the specific processing from the question content to the problem-solving answer, and the step description may include an explanation of why such a problem-solving step is adopted, and an explanation of the mathematical tools used in the problem-solving step. explain and so on.
  • the problem-solving steps are usually indispensable, and the step descriptions can be given according to the complexity of the problem-solving steps. For example, for very simple problem-solving steps, the relevant step descriptions may not be generated.
  • the problem-solving process can be expressed by words, or can also be expressed by graphics (for example, including the function curves involved in the content of the problem, etc.).
  • generating the answer according to the content of the question may include:
  • Step S211 determining a problem-solving model deployed on the server according to the problem type determined by the problem content
  • Step S212 using the problem-solving model to generate a problem-solving answer.
  • the question types may include calculation questions, application questions, fill-in-the-blank questions, multiple-choice questions, operation questions, and the like.
  • a third neural network model can be used to determine the topic type according to the topic content, that is, the input of the third neural network model is the topic content, and the output is the topic type.
  • the third neural network model can be obtained by pre-training the third neural network by any known method according to the above input and output using a large number of training samples.
  • the third neural network may be any known neural network, such as a deep convolutional neural network or the like.
  • the problem-solving model may specifically include a computational model for computational problems, a natural language processing model and/or a vector model for applied problems, and the like.
  • a calculation model deployed on the server can be used to generate the answer to the question.
  • a natural language processing model, a vector model, etc. deployed on the server can be used to generate the answer to the question.
  • Natural language processing models and vector models can be used separately for problem solving, or in some cases can be combined to improve problem solving speed or accuracy.
  • the natural language processing model can analyze and extract key data according to the language of the application question, and then calculate the answer to the question based on the data.
  • using a vector model to generate a problem-solving answer may specifically include: performing feature extraction on the content of the question to generate a feature vector; retrieving a standard vector matching the feature vector from a preset question bank; The vector answer corresponding to the vector produces the solution answer.
  • the feature vector can be a two-dimensional feature vector
  • the two-dimensional feature vector can be a feature map (feature map), which can be generated by any method known in the art.
  • the image area is processed to extract.
  • the first two-dimensional feature vector is generated for the text in the application question
  • the second two-dimensional feature vector is generated for the picture in the application question; and the first two-dimensional feature vector and the second two-dimensional feature vector are spliced to obtain two dimensional feature vector.
  • the fourth neural network model can be used to achieve feature extraction, that is, the input of the fourth neural network model is the topic content (including text and pictures), and the output is the feature vector corresponding to the topic content.
  • the fourth neural network model can be obtained by pre-training the fourth neural network by any known method according to the above-mentioned input and output by using a large number of training samples.
  • the fourth neural network may be any known neural network, such as a deep convolutional neural network or the like.
  • the preset question bank can include multiple groups, and each group can include one or more vectors. These vectors are all feature vectors generated by performing feature extraction on the questions of the known applied questions (for example, the questions in the test question bank of the pre-collected applied questions). Any two vectors from the same group have the same length, and any two vectors from different groups have different lengths.
  • Searching for the standard vector from the preset question bank may include: firstly, according to the length of the feature vector, finding a group matching the length of the feature vector in the preset question bank; and then searching the group whose length matches to find the standard vector. In this way, the standard vector matching the feature vector can be searched more quickly.
  • each group may have a respective index that matches (eg, is equal to) the length of each vector in the group, and finding a group matching the length of the feature vector in the preset question bank includes: according to The length of the feature vector indexes into the matched group.
  • the solution answer to the applied question can be generated according to the preset vector answer associated with the standard vector.
  • the vector answer may also come from a pre-collected test question bank of application questions.
  • the test question bank includes question content and reference answers corresponding to the question content.
  • the vector answer can be deformed according to the difference between the content of the question and the closest standard vector to obtain the answer to the question.
  • generating the problem solving process according to the content of the question may include generating the solving process according to the question type determined by the content of the question.
  • the corresponding rules can be obtained from the preset rule library according to the formal characteristics of the content of the question (such as the number of unknowns, the highest power, the position and the calculation symbol, etc.). And the problem-solving process is generated according to the corresponding rules.
  • Obtaining a problem-solving rule for calculating the sum of fractions in the preset rule base may include three steps in sequence: unifying the denominators of the fractions in the formula, combining the denominators and calculating the sum of the numerators, and reducing the fractions. A rule including these three steps can then generate a literal problem-solving process as shown in Figure 4.
  • a graphical problem solving process can also be generated. For example, if the title of the identified calculation question is calculation the value of x in . Then, the graphical problem-solving rules for calculating a linear equation in one variable can be obtained in the preset rule base, and the specific rules are drawn separately. and The function image of , and find the x-coordinate of the intersection point as the answer to the problem, thereby generating a graphical problem-solving process.
  • topic assistance method can also include:
  • Step S300 setting the problem solving process in the corresponding display level.
  • each problem-solving step or step description in the same display level usually corresponds to the serial operations performed in order to achieve a specific operation purpose in the problem-solving process, while the problem-solving steps or step descriptions in different display levels There can be contained or contained relationships between them.
  • problem solving steps or step descriptions denoted by the same numbering and indentation are in the same display level. That is, for example "1. Unify the denominators of fractions in the formula", "2. Combine the denominators and calculate the sum of the numerators" and "3. Reduced fractions" in one display level, while for example "(1) Find the denominator LCM of 2, 3, 6" and "(2) make all denominators in the equation equal to LCM" in another display level, and so on.
  • setting the problem solving process in the corresponding display level may include:
  • Step S310 determining a hierarchical rule according to the topic type determined by the topic content.
  • Step S320 according to the hierarchical rules, the problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels.
  • Multiple hierarchical rules corresponding to multiple topic types can be preset to set corresponding display levels for different topic characteristics, wherein the specific number of display levels is related to the corresponding hierarchical rules, which is not limited here.
  • existing layering rules can be updated or new layering rules can be added to optimize the display of the problem solving process.
  • a new question type is determined according to the content of the question, a new hierarchical rule corresponding to the new question type can be obtained;
  • the problem solving steps and step descriptions in the problem process are respectively set in the corresponding display levels.
  • the display level can be set according to the problem-solving steps, the sub-steps of the problem-solving steps, and the corresponding step descriptions.
  • the main steps in the problem solving steps may be set in the first display level, and the step descriptions corresponding to at least part of the main steps may be set in the second display level.
  • the main step refers to the most important step to obtain the answer to the problem according to the content of the question.
  • the main step ignores some specific details of the problem-solving process, so as to better show the thinking framework of the problem-solving and improve the learning and teaching effect.
  • the user can flexibly control the display of the problem-solving steps and step descriptions as required.
  • a user when a user is proofreading his own answer, he can control to display only the problem-solving steps to save proofreading time; when the user needs a prompt during the answering process, he can control to display at least part of the step description instead of specific problem-solving steps. ; During the process of viewing the problem solving steps, users can control to display only the step descriptions of the problem solving steps that they are not clear about, so as to improve the efficiency of learning and teaching.
  • the problem solving steps and the corresponding step descriptions may also be set in the same display level to be displayed.
  • Steps can be divided into more sub-steps.
  • Different levels of substeps can be set in different display levels.
  • a first sub-step included in at least one main step is arranged in a third display level
  • a second sub-step included in at least one first sub-step is arranged in a fifth display level
  • the corresponding step descriptions of at least part of the first sub-step and at least part of the second sub-step may be set in the fourth display level and the sixth display level, respectively. It can be understood that, if the second sub-step can be further subdivided, then more display levels can be set.
  • the sub-steps and corresponding step descriptions may also be set in the same display level to be displayed.
  • topic assistance method can also include:
  • Step S400 displaying a corresponding display level according to at least one of a default display rule and a received display instruction.
  • the problem solving process can be automatically displayed according to the default display rules without manual intervention by the user, so as to simplify the operation of the user.
  • the default display rules may include, for example, displaying the main steps in the same display level in sequence according to preset time intervals, so as to give the user a certain time to think, thereby improving the learning and teaching effect.
  • the main steps may be displayed sequentially every two minutes.
  • the default display rule may also include, for example, displaying all main steps in the same display level after a preset time interval, for example, after acquiring the subject image, displaying all main steps at an interval of five minutes.
  • other default display rules can also be set to display the main step or the sub-steps of a certain main step, and so on.
  • the corresponding display level may be displayed according to the received display instruction to facilitate user interaction.
  • the display instruction may be given by the user triggering a corresponding button on the user terminal, or performing an action in the touch area, or the like.
  • the problem solving process may be displayed only when triggered by the user. After the user obtains the answer to the problem by viewing the display device of the user terminal, he first thinks about the problem solving steps by himself, and then triggers the display device to display the problem solving process when the user needs to view the problem solving steps.
  • the touch area on the display device may be provided with icons such as "solution answer”, “problem solving step”, “step explanation” or arrows pointing in various directions, or the touch area may sense that the user is on the display device. actions performed above. The user can control the display of the problem-solving process by clicking on a corresponding icon, or by swiping, double-clicking, clicking, zooming, and other actions in the touch area.
  • the answers that have not yet been displayed can be displayed, or the answers that have been displayed can be highlighted; by clicking on "solving steps” or by touching a preset area in the area Display the corresponding problem solving step that is not yet displayed, or highlight a problem solving step that has been displayed; get the step by step description by clicking "step explanation” or double-clicking the corresponding problem solving step; and by clicking down arrows to display or highlight the next solution step in the same display level, click the up arrow to display or highlight the previous solution step in the same display level, click the right arrow to display or highlight another solution step A sub-step of that problem solving step in one display level, showing or highlighting a problem solving step that includes this problem solving step in another display level by clicking the left arrow, etc.
  • buttons, etc. may also be set on the display device, so as to obtain the display instruction.
  • the topic assistance device may include a memory 610 , a processor 620 , and an instruction stored in the memory 610 .
  • the instruction is executed by the processor 620 , the above-mentioned implementation is realized. Steps of the topic-assisted method described.
  • the processor 620 can perform various actions and processes according to the instructions stored in the memory 610 .
  • the processor 620 may be an integrated circuit chip with signal processing capability.
  • the aforementioned processors may be general purpose processors, digital signal processors (DSPs), application specific integrated circuits (ASICs), off-the-shelf programmable gate arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSPs digital signal processors
  • ASICs application specific integrated circuits
  • FPGAs off-the-shelf programmable gate arrays
  • Various methods, steps and logic block diagrams disclosed in the embodiments of the present disclosure can be implemented or executed.
  • the general-purpose processor may be a microprocessor, or the processor may be any conventional processor, etc., and may be an X86 architecture or an ARM architecture, or the like.
  • the memory 610 stores executable instructions that are executed by the processor 620 to perform the object recognition method described above.
  • Memory 610 may be volatile memory or nonvolatile memory, or may include both volatile and nonvolatile memory.
  • the nonvolatile memory may be read only memory (ROM), programmable read only memory (PROM), erasable programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), or flash memory.
  • Volatile memory may be random access memory (RAM), which acts as an external cache.
  • RAM Random Access Memory
  • SRAM static random access memory
  • DRAM dynamic random access memory
  • SDRAM synchronous dynamic random access memory
  • DDRSDRAM double data rate synchronous dynamic Random Access Memory
  • ESDRAM Enhanced Synchronous Dynamic Random Access Memory
  • SLDRAM Synchronous Link Dynamic Random Access Memory
  • DR RAM Direct Memory Bus Random Access Memory
  • the topic assistance system may include a user terminal 710 and a server 720 .
  • the user terminal 710 may be configured to obtain the subject image.
  • the image capturing device 711 may be included in the user terminal 710 .
  • At least one of the user terminal 710 and the server 720 may be configured to identify topic content from the topic image.
  • the user terminal 710 can deploy a region recognition module 712 and a first character recognition module 713 for single-question recognition, and the server can deploy a second character recognition module for multi-question recognition Character recognition module 721.
  • the server 720 may be configured to generate a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process may include a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step.
  • the problem-solving process may include a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step.
  • a problem solving model 722 and a preset question bank 723 matching the problem solving model 722 may be included in the server 720 .
  • the server 720 may also include a preset rule base 724 for generating problem solving procedures.
  • At least one of the user terminal 710 and the server 720 may also be configured to set the problem solving process in the corresponding display level.
  • the user terminal 710 may also be configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
  • a display device 714 for displaying may be included in the user terminal 710 .
  • the user terminal 710 may also render the answer to the problem or the problem solving process before displaying it, so as to beautify the display effect.
  • the subject image is transmitted to the area recognition model 712 to identify the subject area, and then the first character recognition model 713 or the second character recognition model 721 identifies the subject content in the subject area. According to the identified content of the question, the answer to the question and the procedure of solving the question can be generated, and the answer and procedure of solving the question are transmitted from the server 720 back to the user terminal 710 to be displayed by the display device 714 .
  • the present disclosure also proposes a non-transitory computer-readable storage medium, on which instructions are stored, and when the instructions are executed, the steps in the topic assistance method described above can be implemented.
  • non-transitory computer-readable storage media in embodiments of the present disclosure may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. It should be noted that computer-readable storage media described herein are intended to include, but not be limited to, these and any other suitable types of memory.
  • the present disclosure also proposes a computer program product that can include instructions that, when executed by a processor, can implement the steps of the topic assistance method as described above.
  • the instructions may be any set of instructions to be executed directly by one or more processors, such as machine code, or any set of instructions to be executed indirectly, such as scripts.
  • the terms "instructions,” “applications,” “processes,” “steps,” and “programs” are used interchangeably herein. Instructions may be stored in object code format for direct processing by one or more processors, or in any other computer language, including scripts or collections of self-contained source code modules that are interpreted on demand or compiled ahead of time.
  • the instructions may include instructions that cause, for example, one or more processors to function as the various neural networks herein. The functions, methods, and routines of the instructions are explained in more detail elsewhere in this document.
  • embodiments of the present disclosure may also include the following examples:
  • a topic assistance method comprising:
  • the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step;
  • the corresponding display level is displayed according to at least one of a default display rule and a received display instruction.
  • acquiring a topic image, and identifying the topic content according to the topic image includes:
  • the first character recognition model deployed in the user terminal is used to identify the topic content in the topic area, and the identified topic content is uploaded to the server;
  • generating the answer to the question according to the content of the question includes:
  • a problem-solving model is used to generate a problem-solving answer.
  • the problem solving model includes a computational model for the problem whose problem type is a computational problem, and a natural language processing model and/or a vector model for the problem whose problem type is an applied problem.
  • using a vector model to generate the answer to the question includes:
  • the problem-solving answer is generated.
  • the problem solving process according to the content of the problem includes:
  • a problem solving process is generated.
  • the problem solving process according to the content of the problem includes:
  • the corresponding rules are obtained from the preset rule base, and the problem-solving process is generated according to the corresponding rules.
  • setting the problem solving process in the corresponding display level includes:
  • the problem-solving steps in the problem-solving process are respectively set in the corresponding display levels according to the hierarchical rules.
  • the hierarchical rules are further configured to place step descriptions corresponding to at least part of the main steps in the second display level.
  • the problem assistance method according to 9, the hierarchical rule is further configured to set the problem solving answer in the first display level.
  • the hierarchical rule is further configured to arrange the first sub-step included in the at least one main step in a third display level.
  • the hierarchical rule is further configured to arrange step descriptions corresponding to at least part of the first sub-steps in a fourth display level.
  • the hierarchical rules are further configured to place step descriptions corresponding to at least part of the second sub-steps in a sixth display level.
  • setting the problem solving process in the corresponding display level further includes:
  • the problem-solving steps in the problem-solving process or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels.
  • the topic assistance method of 1, the display instructions are configured to implement at least one of the following actions:
  • a problem aid device comprising a memory, a processor, and instructions stored on the memory, which, when executed by the processor, implement the method described in any one of 1 to 17. Steps of the topic-assisted method described.
  • a topic assistance system comprising a user terminal and a server, wherein:
  • the user terminal is configured to obtain the subject image
  • At least one of the user terminal and the server is configured to identify the topic content according to the topic image
  • the server is configured to generate a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step;
  • At least one of the user terminal and the server is further configured to set the problem solving process in a corresponding display level
  • the user terminal is configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
  • an image acquisition device configured to acquire a subject image
  • a display device configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
  • a non-transitory computer-readable storage medium having instructions stored thereon that, when executed by the processor, implement the method described in any of 1 to 17. Steps of the topic-assisted method described.
  • a computer program product comprising instructions which, when executed by the processor, implement the steps of the topic assistance method according to any one of 1 to 17.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logic for implementing the specified logic Executable instructions for the function.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the various example embodiments of the present disclosure may be implemented in hardware or special purpose circuits, software, firmware, logic, or any combination thereof. Certain aspects may be implemented in hardware, while other aspects may be implemented in firmware or software that may be executed by a controller, microprocessor or other computing device. While aspects of the embodiments of the present disclosure are illustrated or described as block diagrams, flowcharts, or using some other graphical representation, it is to be understood that the blocks, apparatus, systems, techniques, or methods described herein may be taken as non-limiting Examples are implemented in hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controllers or other computing devices, or some combination thereof.
  • the word "exemplary” means “serving as an example, instance, or illustration” rather than as a “model” to be exactly reproduced. Any implementation illustratively described herein is not necessarily to be construed as preferred or advantageous over other implementations. Furthermore, the present disclosure is not to be bound by any expressed or implied theory presented in the preceding technical field, background, brief summary or detailed description.
  • the word “substantially” is meant to encompass any minor variation due to design or manufacturing imperfections, tolerances of devices or elements, environmental influences, and/or other factors.
  • the word “substantially” also allows for differences from a perfect or ideal situation due to parasitics, noise, and other practical considerations that may exist in an actual implementation.
  • connection means that one element/node/feature is electrically, mechanically, logically or otherwise directly connected to another element/node/feature (or direct communication).
  • coupled means that one element/node/feature can be mechanically, electrically, logically or otherwise linked, directly or indirectly, with another element/node/feature to allow interaction, even though the two features may not be directly connected. That is, “coupled” is intended to encompass both direct and indirect connections of elements or other features, including connections utilizing one or more intervening elements.
  • first,” “second,” and the like may also be used herein for reference purposes only, and are thus not intended to be limiting.
  • the terms “first,” “second,” and other such numerical terms referring to structures or elements do not imply a sequence or order unless the context clearly dictates otherwise.
  • providing is used broadly to encompass all ways of obtaining an object, thus “providing something” includes, but is not limited to, “purchasing,” “preparing/manufacturing,” “arranging/arranging,” “installing/ Assembly”, and/or “Order” objects, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure relates to a question assistance method, apparatus, and system, the question assistance method comprising: acquiring a question image, and identifying question content according to the question image; generating an answer to the question and a question solving process according to the question content, wherein the question solving process comprises question solving steps, or the question solving process comprises question solving steps and descriptions of at least some of the question solving steps; setting the question solving process in a corresponding display hierarchy; and displaying the corresponding display hierarchy according to at least one of a default display rule and a received display instruction.

Description

题目辅助方法、装置和***Method, device and system for topic assistance 技术领域technical field
本公开涉及人工智能技术领域,尤其涉及一种题目辅助方法、装置和***。The present disclosure relates to the technical field of artificial intelligence, and in particular, to a topic assistance method, device and system.
背景技术Background technique
近年来,人工智能技术已经广泛应用于日常的学习和教学中。例如,可以利用各种电子设备来拍摄、识别题目并进行解题,从而获得与题目对应的解题答案和解题过程,以辅助学习和教学。In recent years, artificial intelligence technology has been widely used in daily learning and teaching. For example, various electronic devices can be used to photograph, identify and solve problems, so as to obtain problem-solving answers and problem-solving processes corresponding to the problems, so as to assist learning and teaching.
因此,存在对新技术的需求。Therefore, there is a need for new technologies.
发明内容SUMMARY OF THE INVENTION
本公开的目的是提供一种题目辅助方法和题目辅助***。The purpose of the present disclosure is to provide a topic assistance method and a topic assistance system.
根据本公开的第一方面,提供了一种题目辅助方法,包括:获取题目影像,并根据题目影像识别出题目内容;根据题目内容产生解题答案和解题过程,其中,解题过程包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明;将解题过程设置在相应的显示层级中;以及根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。According to a first aspect of the present disclosure, a problem assistance method is provided, including: acquiring a problem image, and identifying the content of the problem according to the image; generating a problem-solving answer and a problem-solving process according to the problem content, wherein the problem-solving process includes solving The problem-solving step or the problem-solving process includes the problem-solving step and the step description of at least part of the problem-solving step; the problem-solving process is set in the corresponding display level; and the problem-solving process is displayed according to at least one of the default display rules and the received display instructions The corresponding display level.
根据本公开的第二方面,提供了一种题目辅助装置,所述题目辅助装置包括存储器、处理器以及存储在所述存储器上的指令,当所述指令被所述处理器执行时,实现如上所述的题目辅助方法的步骤。According to a second aspect of the present disclosure, there is provided a topic assistance device, the topic assistance device comprising a memory, a processor, and instructions stored on the memory, and when the instructions are executed by the processor, the above is achieved The steps of the topic-assisted method described.
根据本公开的第三方面,提供了一种题目辅助***,所述题目辅助***包括用户终端以及服务器,其中:所述用户终端被配置为获取题目影像;所述用户终端和所述服务器中的至少一者被配置为根据题目影像识别出题目内容;所述服务器被配置为根据题目内容产生解题答案和解题过程,其中,解题过程包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明;所述用户终端和所述服务器中的至少一者还被配置为将解题过程设置在相应的显示层级中;以及所述用户终端被配置为根据默认的显示规则和接 收到的显示指令中的至少一个来显示对应的显示层级。According to a third aspect of the present disclosure, a topic assistance system is provided, the topic assistance system includes a user terminal and a server, wherein: the user terminal is configured to acquire a topic image; At least one of them is configured to identify the question content according to the question image; the server is configured to generate a question-solving answer and a problem-solving process according to the question content, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a problem-solving process. Step descriptions of at least part of the problem solving steps; at least one of the user terminal and the server is further configured to set the problem solving process in a corresponding display level; and the user terminal is configured to display according to a default at least one of the rules and the received display instruction to display the corresponding display level.
根据本公开的第四方面,提供了一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质上存储有指令,当所述指令被所述处理器执行时,实现如上所述的题目辅助方法的步骤。According to a fourth aspect of the present disclosure, a non-transitory computer-readable storage medium is provided, and instructions are stored on the non-transitory computer-readable storage medium, and when the instructions are executed by the processor, the above-mentioned implementation is achieved The steps of the topic-assisted method described.
根据本公开的第五方面,提供了一种计算机程序产品,所述计算机程序产品包括指令,当所述指令被所述处理器执行时,实现如上所述的题目辅助方法的步骤。According to a fifth aspect of the present disclosure, there is provided a computer program product comprising instructions which, when executed by the processor, implement the steps of the topic assistance method as described above.
通过以下参照附图对本公开的示例性实施例的详细描述,本公开的其它特征及其优点将会变得清楚。Other features of the present disclosure and advantages thereof will become apparent from the following detailed description of exemplary embodiments of the present disclosure with reference to the accompanying drawings.
附图说明Description of drawings
构成说明书的一部分的附图描述了本公开的实施例,并且连同说明书一起用于解释本公开的原理。The accompanying drawings, which form a part of the specification, illustrate embodiments of the present disclosure and together with the description serve to explain the principles of the present disclosure.
参照附图,根据下面的详细描述,可以更加清楚地理解本公开,其中:The present disclosure may be more clearly understood from the following detailed description with reference to the accompanying drawings, wherein:
图1示意性地示出了根据本公开的一示例性实施例的题目辅助方法的流程示意图。FIG. 1 schematically shows a flow chart of a topic assistance method according to an exemplary embodiment of the present disclosure.
图2是图1的题目辅助方法中步骤S100的流程示意图。FIG. 2 is a schematic flowchart of step S100 in the topic assistance method of FIG. 1 .
图3是图1的题目辅助方法中步骤S200的至少部分流程的流程示意图。FIG. 3 is a schematic flowchart of at least part of the process of step S200 in the topic assistance method of FIG. 1 .
图4是示意性地示出根据本公开的一具体实施例的显示层级结构图。FIG. 4 is a diagram schematically illustrating a display hierarchy structure according to a specific embodiment of the present disclosure.
图5是图1的题目辅助方法中步骤S300的流程示意图。FIG. 5 is a schematic flowchart of step S300 in the topic assistance method of FIG. 1 .
图6是示意性地示出了根据本公开的一示例性实施例的题目辅助装置的框图。FIG. 6 is a block diagram schematically illustrating a topic assisting apparatus according to an exemplary embodiment of the present disclosure.
图7是示意性地示出了根据本公开的一示例性实施例的题目辅助***的框图。FIG. 7 is a block diagram schematically illustrating a topic assistance system according to an exemplary embodiment of the present disclosure.
注意,在以下说明的实施方式中,有时在不同的附图之间共同使用同一附图标记来表示相同部分或具有相同功能的部分,而省略其重复说明。在本说明书中,使用相似的标号和字母表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步讨论。Note that, in the embodiments described below, the same reference numerals are used in common between different drawings to denote the same parts or parts having the same function, and repeated descriptions thereof may be omitted. In this specification, like numerals and letters are used to refer to like items, so once an item is defined in one figure, it need not be discussed further in subsequent figures.
具体实施方式Detailed ways
以下将参照附图来详细描述本公开的各种示例性实施例。应注意到:除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本公开的范围。在下面描述中,为了更好地解释本公开,阐述了许多细节,然而可以理解的是,在没有这些细节的情况下也可以实践本公开。Various exemplary embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings. It should be noted that the relative arrangement of the components and steps, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise. In the following description, in order to better explain the present disclosure, numerous details are set forth, however it will be understood that the present disclosure may be practiced without these details.
以下对至少一个示例性实施例的描述实际上仅仅是说明性的,决不作为对本公开及其应用或使用的任何限制。在这里示出和讨论的所有示例中,任何具体值应被解释为仅仅是示例性的,而不是作为限制。The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application or uses in any way. In all examples shown and discussed herein, any specific value should be construed as illustrative only and not as limiting.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为说明书的一部分。Techniques, methods, and apparatus known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods, and apparatus should be considered part of the specification.
本公开提供了一种题目辅助方法,可以用于例如学习和教学。在本公开的实施例中,可以按照显示层级来设置并显示相应的解题过程,从而有助于用户根据自身需求选择解题过程的显示方式,以实现灵活的交互,并改善学习和教学效果。The present disclosure provides a topic assistance method that can be used, for example, in learning and teaching. In the embodiment of the present disclosure, the corresponding problem-solving process can be set and displayed according to the display level, so as to help the user to select the display mode of the problem-solving process according to their own needs, so as to realize flexible interaction and improve the learning and teaching effect. .
如图1所示,在本公开的一示例性实施例中,题目辅助方法可以包括:As shown in FIG. 1, in an exemplary embodiment of the present disclosure, the topic assistance method may include:
步骤S100,获取题目影像,并根据题目影像识别出题目内容。In step S100, a subject image is acquired, and the subject content is identified according to the subject image.
具体而言,如图2所示,步骤S100可以包括:Specifically, as shown in FIG. 2 , step S100 may include:
步骤S110,采用影像获取装置获取题目影像。In step S110, an image acquisition device is used to acquire the subject image.
其中,影像获取装置可以对需要进行辅助的题目进行拍照或摄像,从而获取题目影像。该影像获取装置可以包括摄像头、成像模块、以及图像处理模块等,还可以包括用于接收或下载影像的通信模块等,影像获取装置可以是独立设置的,也可以被包括在诸如智能电话、平板电脑等用户终端中。题目影像可以包括任何形式的视觉呈现,例如照片或视频等。Wherein, the image acquisition device can take pictures or videos of the subject that needs to be assisted, so as to acquire the subject image. The image acquisition device may include a camera, an imaging module, an image processing module, etc., and may also include a communication module for receiving or downloading images. computer and other user terminals. Title images can include any form of visual presentation, such as photographs or videos.
如图2所示,根据题目影像识别出题目内容还可以包括:As shown in Figure 2, identifying the subject content according to the subject image may also include:
步骤S120,采用部署在用户终端的区域识别模型来确定题目影像中的题目内容所在的题目区域,并确定题目区域中的题目数目。In step S120, the area identification model deployed in the user terminal is used to determine the topic area where the topic content in the topic image is located, and the number of topics in the topic area is determined.
其中,区域识别模型可以包括第一神经网络模型,该第一神经网络模型 的输入为题目影像,输出为题目影像中的题目内容所在的题目区域。Wherein, the area identification model may include a first neural network model, the input of the first neural network model is the subject image, and the output is the subject area where the subject content in the subject image is located.
第一神经网络模型可以使用大量的训练样本,按照上述的输入输出,通过任何已知的方法预先训练得到。例如,可以通过如下过程训练得到:建立一个题目影像样本训练集,其中的每个题目影像样本均包括至少一个题目。对每个题目影像样本进行标注处理,以标注出每个题目影像样本中的至少一个题目所在的题目区域的位置;以及通过经过标注处理的题目影像样本训练集对第一神经网络进行训练,以得到第一神经网络模型。第一神经网络可以是任何已知的神经网络,例如深度残差网络、递归神经网络等。The first neural network model can be pre-trained by any known method by using a large number of training samples and according to the above-mentioned input and output. For example, it can be obtained by training through the following process: establishing a training set of subject image samples, wherein each subject image sample includes at least one subject. Perform labeling processing on each title image sample to mark the position of the title area where at least one title in each title image sample is located; and train the first neural network through the labelled title image sample training set to obtain Obtain the first neural network model. The first neural network may be any known neural network, such as a deep residual network, a recurrent neural network, or the like.
对第一神经网络进行训练还可以包括:基于题目影像样本测试集,对经过训练的第一神经网络模型的输出准确率进行测试;若输出准确率小于预定的第一阈值,则增加题目影像样本训练集中的题目影像样本的数量,所增加的题目影像样本中的每个题目影像样本均经过上述标注处理;以及通过增加了题目影像样本数量之后的题目影像样本训练集,重新对第一神经网络进行训练。然后基于题目影像样本测试集对重新训练过的第一神经网络模型的输出准确率再次进行测试,直到第一神经网络模型的输出准确率满足要求即不小于预定的第一阈值为止。本领域技术人员应理解,可以根据需要,将题目影像样本训练集中的一个或多个题目影像样本放到题目影像样本测试集中,也可以将题目影像样本测试集中的一个或多个题目影像样本放到题目影像样本训练集中。Training the first neural network may further include: testing the output accuracy of the trained first neural network model based on the test set of subject image samples; if the output accuracy is less than a predetermined first threshold, adding subject image samples The number of subject image samples in the training set, and each subject image sample in the increased subject image samples has undergone the above labeling processing; and through the subject image sample training set after the number of subject image samples has been increased, the first neural network to train. Then, the output accuracy of the retrained first neural network model is tested again based on the subject image sample test set until the output accuracy of the first neural network model meets the requirements, that is, is not less than a predetermined first threshold. Those skilled in the art should understand that one or more subject image samples in the subject image sample training set can be placed in the subject image sample test set, or one or more subject image samples in the subject image sample test set can be placed in the subject image sample test set as required. to the subject image sample training set.
此外,区域识别模型可以根据所确定的题目区域进一步获取其中所包含的题目数目,以待进一步处理。In addition, the area identification model can further acquire the number of topics contained in the determined topic area for further processing.
如图2所示,根据题目影像识别出题目内容还可以包括:As shown in Figure 2, identifying the subject content according to the subject image may also include:
步骤S131,当题目数目等于一时,采用部署在用户终端的第一字符识别模型来识别题目区域中的题目内容,并将所识别的题目内容上传至服务器;Step S131, when the number of questions is equal to one, adopt the first character recognition model deployed in the user terminal to identify the title content in the title area, and upload the identified title content to the server;
步骤S132,当题目数目大于一时,将题目影像中的至少题目区域上传至服务器中,并采用部署在服务器的第二字符识别模型来识别题目内容。Step S132, when the number of questions is greater than one, upload at least the question area in the title image to the server, and use the second character recognition model deployed on the server to identify the title content.
其中,第一字符识别模型或第二字符识别模型可以包括第二神经网络模型,该第二神经网络模型的输入为题目影像中的题目区域(例如,从完整的题目影像中切割出来的题目区域),输出为题目区域中的字符,即题目内容。 应当理解,本文中所称的字符,包括文字(包括文本文字、图形文字、字母、数字、符号等)以及图片等。Wherein, the first character recognition model or the second character recognition model may include a second neural network model, and the input of the second neural network model is the subject area in the subject image (for example, the subject area cut out from the complete subject image) ), the output is the characters in the title area, that is, the title content. It should be understood that the characters referred to herein include characters (including text characters, graphic characters, letters, numbers, symbols, etc.), pictures, and the like.
类似地,第二神经网络模型可以使用大量的训练样本,按照上述的输入输出,通过任何已知的方法预先训练得到。例如,可以通过如下过程训练得到:建立一个题目影像样本训练集,其中的每个题目影像样本为一个题目区域。对每个题目影像样本进行标注处理,以标注出每个题目影像样本中的题目区域中的题目字符;以及通过经过标注处理的题目影像样本训练集对第二神经网络进行训练,以得到第二神经网络模型。第二神经网络可以是任何已知的神经网络。此外,与上文对第一神经网络的描述相类似,对第二神经网络进行训练还可以包括用题目影像样本测试集来验证第二神经网络模型的输出准确率,如果准确率不满足要求时可以增大题目影像样本训练集中样本的数量并重新进行训练。Similarly, the second neural network model can be pre-trained by any known method using a large number of training samples, according to the above-mentioned input and output. For example, it can be obtained by training through the following process: establishing a training set of subject image samples, where each subject image sample is a subject area. Perform labeling processing on each title image sample to mark title characters in the title area in each title image sample; and train the second neural network through the labelled title image sample training set to obtain a second neural network model. The second neural network can be any known neural network. In addition, similar to the description of the first neural network above, the training of the second neural network may further include verifying the output accuracy of the second neural network model with a test set of subject image samples. If the accuracy does not meet the requirements You can increase the number of samples in the training set of subject image samples and re-train.
在上述实施例中,将区域识别模型和用于单题识别的、相对简单的第一字符识别模型部署在本地的用户终端中,而将用于多题识别的、相对复杂的第二字符识别模型部署在服务器中,有助于在避免占据用户终端的有限的资源的前提下,尽可能地提高识别速度,改善识别效率。In the above embodiment, the area recognition model and the relatively simple first character recognition model for single-question recognition are deployed in the local user terminal, while the relatively complex second character recognition model for multi-question recognition is used for The model is deployed in the server, which helps to increase the recognition speed as much as possible and improve the recognition efficiency on the premise of avoiding occupying the limited resources of the user terminal.
当然,在其它一些实施例中,也可以将区域识别模型和字符识别模型均部署在服务器中。也就是说,无论题目区域中的题目数目是一个还是多个,都将所获取的题目影像从用户终端上传到服务器中,再由服务器中的区域识别模型和字符识别模型进行识别,以待进一步处理。Of course, in some other embodiments, both the region recognition model and the character recognition model may be deployed in the server. That is to say, regardless of whether the number of questions in the topic area is one or more, the acquired topic image will be uploaded from the user terminal to the server, and then recognized by the area recognition model and character recognition model in the server. deal with.
返回图1,题目辅助方法还可以包括:Returning to Figure 1, the topic assistance method can also include:
步骤S200,根据题目内容产生解题答案和解题过程,其中,解题过程包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明。Step S200, generating a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or a problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step.
解题过程中的解题步骤表示从题目内容推导到解题答案的具体处理,而步骤说明可以包括对为什么采用这样的解题步骤的原因的说明、对解题步骤中所采用的数学工具的解释等等。在解题过程中,解题步骤通常是不可或缺的,而步骤说明可以根据解题步骤的复杂程度给出,例如,对于很简单的解题步骤,可以不产生相关的步骤说明。解题过程可以由文字来表述,或者也可以由图形(例如,包括题目内容中所涉及的函数曲线图等)来表示。The problem-solving steps in the problem-solving process represent the specific processing from the question content to the problem-solving answer, and the step description may include an explanation of why such a problem-solving step is adopted, and an explanation of the mathematical tools used in the problem-solving step. explain and so on. In the problem-solving process, the problem-solving steps are usually indispensable, and the step descriptions can be given according to the complexity of the problem-solving steps. For example, for very simple problem-solving steps, the relevant step descriptions may not be generated. The problem-solving process can be expressed by words, or can also be expressed by graphics (for example, including the function curves involved in the content of the problem, etc.).
如图3所示,根据题目内容产生解题答案可以包括:As shown in Figure 3, generating the answer according to the content of the question may include:
步骤S211,根据由题目内容所确定的题目类型,确定部署在服务器的解题模型;以及Step S211, determining a problem-solving model deployed on the server according to the problem type determined by the problem content; and
步骤S212,采用解题模型来产生解题答案。Step S212, using the problem-solving model to generate a problem-solving answer.
其中,题目类型可以包括计算题、应用题、填空题、选择题、操作题等。可以采用第三神经网络模型来根据题目内容确定题目类型,也就是说,该第三神经网络模型的输入为题目内容,输出为题目类型。第三神经网络模型可以使用大量的训练样本,按照上述的输入输出,通过任何已知的方法对第三神经网络进行预先训练而得到。第三神经网络可以是任何已知的神经网络,例如深度卷积神经网络等。The question types may include calculation questions, application questions, fill-in-the-blank questions, multiple-choice questions, operation questions, and the like. A third neural network model can be used to determine the topic type according to the topic content, that is, the input of the third neural network model is the topic content, and the output is the topic type. The third neural network model can be obtained by pre-training the third neural network by any known method according to the above input and output using a large number of training samples. The third neural network may be any known neural network, such as a deep convolutional neural network or the like.
解题模型具体可以包括用于计算题的计算模型,用于应用题的自然语言处理模型和/或向量模型等等。The problem-solving model may specifically include a computational model for computational problems, a natural language processing model and/or a vector model for applied problems, and the like.
当根据题目内容确定题目类型为计算题时,可以采用部署在服务器的计算模型来产生解题答案。When the type of the question is determined to be a calculation question according to the content of the question, a calculation model deployed on the server can be used to generate the answer to the question.
当根据题目内容确定题目类型为应用题时,可以采用部署在服务器的自然语言处理模型、向量模型等来产生解题答案。自然语言处理模型和向量模型可以分别被单独用于解题,在一些情况下也可以被结合用于解题,以改善解题速度或准确性。When it is determined that the question type is an application question according to the content of the question, a natural language processing model, a vector model, etc. deployed on the server can be used to generate the answer to the question. Natural language processing models and vector models can be used separately for problem solving, or in some cases can be combined to improve problem solving speed or accuracy.
在一些实施例中,自然语言处理模型可以根据应用题的语言来分析并提取出其中关键的数据,然后基于这些数据计算出解题答案。In some embodiments, the natural language processing model can analyze and extract key data according to the language of the application question, and then calculate the answer to the question based on the data.
在一些实施例中,采用向量模型来产生解题答案具体可以包括:对题目内容进行特征提取以生成特征向量;从预设题库中检索与特征向量匹配的标准向量;以及根据预先设置的与标准向量对应的向量答案,产生解题答案。In some embodiments, using a vector model to generate a problem-solving answer may specifically include: performing feature extraction on the content of the question to generate a feature vector; retrieving a standard vector matching the feature vector from a preset question bank; The vector answer corresponding to the vector produces the solution answer.
其中,特征向量可以是二维特征向量,该二维特征向量可以是特征图(feature map),其可以用本领域已知的任何方法来生成,例如可以利用深度卷积神经网络对应用题所在的影像区域进行处理来提取。其中,针对应用题中的文字生成第一二维特征向量,并针对应用题中的图片生成第二二维特征向量;以及将第一二维特征向量和第二二维特征向量拼接以得到二维特征向量。Wherein, the feature vector can be a two-dimensional feature vector, and the two-dimensional feature vector can be a feature map (feature map), which can be generated by any method known in the art. The image area is processed to extract. Wherein, the first two-dimensional feature vector is generated for the text in the application question, and the second two-dimensional feature vector is generated for the picture in the application question; and the first two-dimensional feature vector and the second two-dimensional feature vector are spliced to obtain two dimensional feature vector.
可以采用第四神经网络模型来实现特征提取,也就是说,该第四神经网络模型的输入为题目内容(包括文字和图片),输出为题目内容所对应的特征向量。第四神经网络模型可以使用大量的训练样本,按照上述的输入输出,通过任何已知的方法对第四神经网络进行预先训练而得到。第四神经网络可以是任何已知的神经网络,例如深度卷积神经网络等。The fourth neural network model can be used to achieve feature extraction, that is, the input of the fourth neural network model is the topic content (including text and pictures), and the output is the feature vector corresponding to the topic content. The fourth neural network model can be obtained by pre-training the fourth neural network by any known method according to the above-mentioned input and output by using a large number of training samples. The fourth neural network may be any known neural network, such as a deep convolutional neural network or the like.
预设题库可以包括多个组,每个组可以包括一个或多个向量。这些向量都是对已知的应用题的题目(例如,预先搜集的应用题的试题库中的题目)进行特征提取而生成的特征向量。来自同一组的任意两个向量具有相同的长度,来自不同组的任意两个向量具有不同的长度。The preset question bank can include multiple groups, and each group can include one or more vectors. These vectors are all feature vectors generated by performing feature extraction on the questions of the known applied questions (for example, the questions in the test question bank of the pre-collected applied questions). Any two vectors from the same group have the same length, and any two vectors from different groups have different lengths.
从预设题库中搜索标准向量可以包括:先根据特征向量的长度,在预设题库中找到与特征向量的长度匹配的组;然后在这个长度匹配的组中进行搜索,以找到标准向量。如此,能够更快速地搜索到与特征向量相匹配的标准向量。在一些实施例中,每个组可以具有各自的索引,该索引与该组中的各个向量的长度相匹配(例如相等),在预设题库中找到与特征向量的长度匹配的组包括:根据特征向量的长度索引到匹配的组。Searching for the standard vector from the preset question bank may include: firstly, according to the length of the feature vector, finding a group matching the length of the feature vector in the preset question bank; and then searching the group whose length matches to find the standard vector. In this way, the standard vector matching the feature vector can be searched more quickly. In some embodiments, each group may have a respective index that matches (eg, is equal to) the length of each vector in the group, and finding a group matching the length of the feature vector in the preset question bank includes: according to The length of the feature vector indexes into the matched group.
然后,可以根据预先设置的与标准向量相关联的向量答案,生成应用题的解题答案。其中,向量答案也可以来自于预先搜集的应用题的试题库,例如,该试题库中包括题目内容和与题目内容对应的参***。以向量答案作为母板,可以根据题目内容与该最相近的标准向量之间的差异,来对向量答案进行变形以得到解题答案。Then, the solution answer to the applied question can be generated according to the preset vector answer associated with the standard vector. The vector answer may also come from a pre-collected test question bank of application questions. For example, the test question bank includes question content and reference answers corresponding to the question content. Using the vector answer as the master board, the vector answer can be deformed according to the difference between the content of the question and the closest standard vector to obtain the answer to the question.
类似地,根据题目内容产生解题过程可以包括根据由题目内容所确定的题目类型,产生解题过程。Similarly, generating the problem solving process according to the content of the question may include generating the solving process according to the question type determined by the content of the question.
例如,当根据题目内容确定题目类型为计算题时,可以根据题目内容的形式特征(例如未知数的个数、最高幂次、位置和计算符号等),从预设规则库中获取对应的规则,并根据对应的规则产生解题过程。For example, when it is determined that the question type is a calculation question according to the content of the question, the corresponding rules can be obtained from the preset rule library according to the formal characteristics of the content of the question (such as the number of unknowns, the highest power, the position and the calculation symbol, etc.). And the problem-solving process is generated according to the corresponding rules.
例如,如果识别到的计算题的题目为计算
Figure PCTCN2021128829-appb-000001
则确定该题目的形式特征为计算分数的和。在预设规则库中获取计算分数的和的解题规则,例如可以为依次包括:统一算式中各分数的分母、合并分母并计算分子的和以 及约化分数共三个步骤。然后包括这三个步骤的规则可以生成如图4所示的文字化的解题过程。
For example, if the title of the identified calculation question is calculation
Figure PCTCN2021128829-appb-000001
Then determine the formal feature of the question as the sum of the calculated scores. Obtaining a problem-solving rule for calculating the sum of fractions in the preset rule base, for example, may include three steps in sequence: unifying the denominators of the fractions in the formula, combining the denominators and calculating the sum of the numerators, and reducing the fractions. A rule including these three steps can then generate a literal problem-solving process as shown in Figure 4.
当然,在其它示例中,也可以生成图形化的解题过程。例如,如果识别到的计算题的题目为计算
Figure PCTCN2021128829-appb-000002
中x的值。那么,可以在预设规则库中获取计算一元一次方程的图形化的解题规则,具体为分别绘制
Figure PCTCN2021128829-appb-000003
Figure PCTCN2021128829-appb-000004
的函数图像,并找出交点所在位置的x坐标作为解题答案,从而生成图形化的解题过程。
Of course, in other examples, a graphical problem solving process can also be generated. For example, if the title of the identified calculation question is calculation
Figure PCTCN2021128829-appb-000002
the value of x in . Then, the graphical problem-solving rules for calculating a linear equation in one variable can be obtained in the preset rule base, and the specific rules are drawn separately.
Figure PCTCN2021128829-appb-000003
and
Figure PCTCN2021128829-appb-000004
The function image of , and find the x-coordinate of the intersection point as the answer to the problem, thereby generating a graphical problem-solving process.
返回图1,题目辅助方法还可以包括:Returning to Figure 1, the topic assistance method can also include:
步骤S300,将解题过程设置在相应的显示层级中。Step S300, setting the problem solving process in the corresponding display level.
其中,同一个显示层级中的各解题步骤或者步骤说明通常对应于解题过程中为了实现某个具体的运算目的而串行进行的操作,而不同的显示层级中的解题步骤或步骤说明之间可以具有包含或被包含的关系。Among them, each problem-solving step or step description in the same display level usually corresponds to the serial operations performed in order to achieve a specific operation purpose in the problem-solving process, while the problem-solving steps or step descriptions in different display levels There can be contained or contained relationships between them.
例如,在图4中,用同种标号和缩进来表示的解题步骤或步骤说明处在同一个显示层级中。也就是说,例如“1.统一算式中各分数的分母”、“2.合并分母并计算分子的和”以及“3.约化分数”在一个显示层级中,而例如“(1)寻找分母2,3,6的最小公倍数”和“(2)使算式中的所有分母等于最小公倍数”在另一个显示层级中,依此类推。For example, in Figure 4, problem solving steps or step descriptions denoted by the same numbering and indentation are in the same display level. That is, for example "1. Unify the denominators of fractions in the formula", "2. Combine the denominators and calculate the sum of the numerators" and "3. Reduced fractions" in one display level, while for example "(1) Find the denominator LCM of 2, 3, 6" and "(2) make all denominators in the equation equal to LCM" in another display level, and so on.
在图4所示的示例中,根据在一个显示层级中的步骤说明“1.统一算式中各分数的分母”、“2.合并分母并计算分子的和”以及“3.约化分数”依次执行串行的操作,可以从题目内容“计算
Figure PCTCN2021128829-appb-000005
”得出解题答案“1”。而在另一个显示层级中,“(1)寻找分母2,3,6的最小公倍数”和“(2)使算式中的所有分母等于最小公倍数”为如何实现“统一算式中各分数的分母”的具体分步骤的步骤说明。
In the example shown in FIG. 4 , “1. Denominator of each fraction in unified formula”, “2. Combine denominators and calculate the sum of numerators” and “3. Reduced fractions” are explained in order according to the steps in one display hierarchy Perform serial operations, which can be calculated from the title content "
Figure PCTCN2021128829-appb-000005
" to get the answer "1". In another display level, how about "(1) Find the least common multiple of denominators 2, 3, 6" and "(2) Make all denominators in the formula equal to the least common multiple" Step-by-step instructions for implementing "the denominator of each fraction in a unified formula".
在一些实施例中,如图5所示,将解题过程设置在相应的显示层级中可以包括:In some embodiments, as shown in FIG. 5 , setting the problem solving process in the corresponding display level may include:
步骤S310,根据由题目内容所确定的题目类型,确定分层规则;以及Step S310, determining a hierarchical rule according to the topic type determined by the topic content; and
步骤S320,按照分层规则将解题过程中的解题步骤、或者解题过程中的 解题步骤和步骤说明,分别设置在相应的显示层级中。Step S320, according to the hierarchical rules, the problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels.
可以预先设置分别对应于多种题目类型的多种分层规则,以针对不同的题目特点设置相应的显示层级,其中,显示层级的具体数目与相应的分层规则有关,在此不作限制。Multiple hierarchical rules corresponding to multiple topic types can be preset to set corresponding display levels for different topic characteristics, wherein the specific number of display levels is related to the corresponding hierarchical rules, which is not limited here.
此外,还可以更新已有的分层规则或添加新的分层规则,以优化解题过程的显示方式。具体而言,当根据题目内容确定出新的题目类型时,可以获取与新的题目类型对应的新的分层规则;以及按照新的分层规则将解题过程中的解题步骤、或者解题过程中的解题步骤和步骤说明,分别设置在相应的显示层级中。In addition, existing layering rules can be updated or new layering rules can be added to optimize the display of the problem solving process. Specifically, when a new question type is determined according to the content of the question, a new hierarchical rule corresponding to the new question type can be obtained; The problem solving steps and step descriptions in the problem process are respectively set in the corresponding display levels.
在一具体的分层规则中,可以与解题过程相对应,按照其中的解题步骤、解题步骤的分步骤以及相应的步骤说明,来设置显示层级。In a specific hierarchical rule, corresponding to the problem-solving process, the display level can be set according to the problem-solving steps, the sub-steps of the problem-solving steps, and the corresponding step descriptions.
例如,可以将解题步骤中的主步骤设置在第一显示层级中,将与至少部分主步骤对应的步骤说明设置在第二显示层级中。其中,主步骤是指根据题目内容获得解题答案的最主要的步骤,主步骤忽略了一些解题过程的具体细节,从而更好地显示出解题的思路框架,以改善学习和教学效果。此外,通过将解题步骤和对应的步骤说明分别设置在不同的显示层级中,用户可以根据需要灵活地控制解题步骤和步骤说明的显示。例如,用户在校对自己的解答时,可以控制只显示解题步骤,以节约校对时间;用户在解答过程中需要提示的情况下,可以控制只显示至少部分步骤说明而不显示具体的解题步骤;用户也可以在查看解题步骤的过程中,控制只显示自己不清楚的解题步骤的步骤说明,以提高学习和教学效率。For example, the main steps in the problem solving steps may be set in the first display level, and the step descriptions corresponding to at least part of the main steps may be set in the second display level. Among them, the main step refers to the most important step to obtain the answer to the problem according to the content of the question. The main step ignores some specific details of the problem-solving process, so as to better show the thinking framework of the problem-solving and improve the learning and teaching effect. In addition, by setting the problem-solving steps and corresponding step descriptions in different display levels, the user can flexibly control the display of the problem-solving steps and step descriptions as required. For example, when a user is proofreading his own answer, he can control to display only the problem-solving steps to save proofreading time; when the user needs a prompt during the answering process, he can control to display at least part of the step description instead of specific problem-solving steps. ; During the process of viewing the problem solving steps, users can control to display only the step descriptions of the problem solving steps that they are not clear about, so as to improve the efficiency of learning and teaching.
当然,在其它一些实施例中,也可以将解题步骤和对应的步骤说明设置在同一个显示层级中以待显示。Of course, in some other embodiments, the problem solving steps and the corresponding step descriptions may also be set in the same display level to be displayed.
对于解题步骤中的一些较为复杂的主步骤而言,其可能还具有多个第一分步骤,类似地,对于一些较为复杂的第一分步骤而言,其还可以进一步具有多个第二分步骤,并可以依此类推将解题步骤分成更多个层次的分步骤。For some more complicated main steps in the problem solving steps, it may also have multiple first sub-steps, and similarly, for some more complicated first sub-steps, it may further have multiple second sub-steps Steps can be divided into more sub-steps.
可以将不同层次的分步骤设置在不同的显示层级中。例如,将至少一个主步骤所包含的第一分步骤设置在第三显示层级中,将至少一个第一分步骤所包含的第二分步骤设置在第五显示层级中,等等。相应地,可以将至少部 分第一分步骤、至少部分第二分步骤的对应的步骤说明分别设置在第四显示层级和第六显示层级中。可以理解的是,如果第二分步骤还可以被进一步细分,那么,可以设置更多的显示层级。Different levels of substeps can be set in different display levels. For example, a first sub-step included in at least one main step is arranged in a third display level, a second sub-step included in at least one first sub-step is arranged in a fifth display level, and so on. Correspondingly, the corresponding step descriptions of at least part of the first sub-step and at least part of the second sub-step may be set in the fourth display level and the sixth display level, respectively. It can be understood that, if the second sub-step can be further subdivided, then more display levels can be set.
当然,在其它一些实施例中,也可以将分步骤和对应的步骤说明设置在同一个显示层级中以待显示。Of course, in some other embodiments, the sub-steps and corresponding step descriptions may also be set in the same display level to be displayed.
返回图1,题目辅助方法还可以包括:Returning to Figure 1, the topic assistance method can also include:
步骤S400,根据默认的显示规则和接收到的显示指令中的至少一个来显示相应的显示层级。Step S400, displaying a corresponding display level according to at least one of a default display rule and a received display instruction.
在一些实施例中,可以根据默认的显示规则自动地显示解题过程,而无需用户进行手动干预,以简化用户的操作。默认的显示规则例如可以包括按照预设的时间间隔依次显示处于同一个显示层级中的主步骤,以给予用户一定的思考时间,从而改善学习和教学效果。例如,可以每隔两分钟依次显示主步骤。默认的显示规则也可以例如包括在预设时间间隔后显示处于同一个显示层级中的所有的主步骤,例如在获取到题目影像之后,间隔五分钟才显示所有的主步骤。根据需要,也可以设置其它默认的显示规则来显示主步骤或者某个主步骤的分步骤等等。In some embodiments, the problem solving process can be automatically displayed according to the default display rules without manual intervention by the user, so as to simplify the operation of the user. The default display rules may include, for example, displaying the main steps in the same display level in sequence according to preset time intervals, so as to give the user a certain time to think, thereby improving the learning and teaching effect. For example, the main steps may be displayed sequentially every two minutes. The default display rule may also include, for example, displaying all main steps in the same display level after a preset time interval, for example, after acquiring the subject image, displaying all main steps at an interval of five minutes. As required, other default display rules can also be set to display the main step or the sub-steps of a certain main step, and so on.
在另一些实施例中,可以根据接收到的显示指令来显示对应的显示层级,以方便用户交互。显示指令可以由用户触发用户终端上的相应的按钮、或者在触摸区域中进行动作等方式给出。In other embodiments, the corresponding display level may be displayed according to the received display instruction to facilitate user interaction. The display instruction may be given by the user triggering a corresponding button on the user terminal, or performing an action in the touch area, or the like.
例如,出于对学习/教学效果的考虑,解题过程可以在用户触发时才被显示。用户可以通过查看用户终端的显示装置获得解题答案之后,先自己思考解题步骤,在用户需要查看解题步骤时,再触发显示装置显示这些解题过程。For example, for the consideration of learning/teaching effect, the problem solving process may be displayed only when triggered by the user. After the user obtains the answer to the problem by viewing the display device of the user terminal, he first thinks about the problem solving steps by himself, and then triggers the display device to display the problem solving process when the user needs to view the problem solving steps.
在一具体示例中,显示装置上的触摸区域中可以设置有“解题答案”、“解题步骤”、“步骤解释”或者指向各个方向的箭头等图标,或者触摸区域可以感应到用户在其上进行的动作。用户可以通过点击相应的图标、或者在触摸区域中进行轻扫、双击、单击、缩放等动作来实现对解题过程的显示的控制。例如,通过点击“解题答案”可以使尚未被显示出来的解题答案显示出,或者突出显示已经被显示出的解题答案;通过点击“解题步骤”或者触摸区域中的预设区域来显示出尚未显示的相应的解题步骤,或者突出显示已经显示 出的某个解题步骤;通过点击“步骤解释”或者双击相应的解题步骤来获取该步骤的步骤说明;以及通过点击向下的箭头显示或突出显示同一个显示层级中的下一个解题步骤,通过点击向上的箭头显示或突出显示同一个显示层级中的上一个解题步骤,通过点击向右的箭头显示或突出显示另一个显示层级中的该解题步骤的分步骤,通过点击向左的箭头显示或突出显示另一个显示层级中的包括该解题步骤的解题步骤等等。In a specific example, the touch area on the display device may be provided with icons such as "solution answer", "problem solving step", "step explanation" or arrows pointing in various directions, or the touch area may sense that the user is on the display device. actions performed above. The user can control the display of the problem-solving process by clicking on a corresponding icon, or by swiping, double-clicking, clicking, zooming, and other actions in the touch area. For example, by clicking on "solving answers", the answers that have not yet been displayed can be displayed, or the answers that have been displayed can be highlighted; by clicking on "solving steps" or by touching a preset area in the area Display the corresponding problem solving step that is not yet displayed, or highlight a problem solving step that has been displayed; get the step by step description by clicking "step explanation" or double-clicking the corresponding problem solving step; and by clicking down arrows to display or highlight the next solution step in the same display level, click the up arrow to display or highlight the previous solution step in the same display level, click the right arrow to display or highlight another solution step A sub-step of that problem solving step in one display level, showing or highlighting a problem solving step that includes this problem solving step in another display level by clicking the left arrow, etc.
可以理解的是,也可以在显示装置上设置其它的图形界面或按钮等,从而获取显示指令。It can be understood that, other graphical interfaces or buttons, etc. may also be set on the display device, so as to obtain the display instruction.
本公开还提出了一种题目辅助装置,如图6所示,该题目辅助装置可以包括存储器610、处理器620以及存储在存储器610上的指令,当指令被处理器620执行时,实现如上所述的题目辅助方法的步骤。The present disclosure also proposes a topic assistance device. As shown in FIG. 6 , the topic assistance device may include a memory 610 , a processor 620 , and an instruction stored in the memory 610 . When the instruction is executed by the processor 620 , the above-mentioned implementation is realized. Steps of the topic-assisted method described.
其中,处理器620可以根据存储在存储器610中的指令执行各种动作和处理。具体地,处理器620可以是一种集成电路芯片,具有信号的处理能力。上述处理器可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现成可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本公开实施例中公开的各种方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等,可以是X86架构或者是ARM架构等。Among them, the processor 620 can perform various actions and processes according to the instructions stored in the memory 610 . Specifically, the processor 620 may be an integrated circuit chip with signal processing capability. The aforementioned processors may be general purpose processors, digital signal processors (DSPs), application specific integrated circuits (ASICs), off-the-shelf programmable gate arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. Various methods, steps and logic block diagrams disclosed in the embodiments of the present disclosure can be implemented or executed. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor, etc., and may be an X86 architecture or an ARM architecture, or the like.
存储器610存储有可执行指令,该指令在被处理器620执行上文所述的对象识别方法。存储器610可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。非易失性存储器可以是只读存储器(ROM)、可编程只读存储器(PROM)、可擦除可编程只读存储器(EPROM)、电可擦除可编程只读存储器(EEPROM)或闪存。易失性存储器可以是随机存取存储器(RAM),其用作外部高速缓存。通过示例性但不是限制性说明,许多形式的RAM可用,例如静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、同步动态随机存取存储器(SDRAM)、双倍数据速率同步动态随机存取存储器(DDRSDRAM)、增强型同步动态随机存取存储器(ESDRAM)、同步连接动态随机存取存储器(SLDRAM)和直接内存总线随机存取存储器 (DR RAM)。应注意,本文描述的方法的存储器旨在包括但不限于这些和任意其它适合类型的存储器。The memory 610 stores executable instructions that are executed by the processor 620 to perform the object recognition method described above. Memory 610 may be volatile memory or nonvolatile memory, or may include both volatile and nonvolatile memory. The nonvolatile memory may be read only memory (ROM), programmable read only memory (PROM), erasable programmable read only memory (EPROM), electrically erasable programmable read only memory (EEPROM), or flash memory. Volatile memory may be random access memory (RAM), which acts as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous dynamic Random Access Memory (DDRSDRAM), Enhanced Synchronous Dynamic Random Access Memory (ESDRAM), Synchronous Link Dynamic Random Access Memory (SLDRAM), and Direct Memory Bus Random Access Memory (DR RAM). It should be noted that the memory of the methods described herein is intended to include, but not be limited to, these and any other suitable types of memory.
本公开还提出了一种题目辅助***。如图7所示,该题目辅助***可以包括用户终端710以及服务器720。The present disclosure also proposes a topic assistance system. As shown in FIG. 7 , the topic assistance system may include a user terminal 710 and a server 720 .
其中,用户终端710可以被配置为获取题目影像。例如,影像获取装置711可以被包括在用户终端710中。Among them, the user terminal 710 may be configured to obtain the subject image. For example, the image capturing device 711 may be included in the user terminal 710 .
用户终端710和服务器720中的至少一者可以被配置为根据题目影像识别出题目内容。例如,在图7所示的题目辅助***中,用户终端710中可以部署区域识别模块712和用于单题识别的第一字符识别模块713,而服务器中可以部署用于多题识别的第二字符识别模块721。At least one of the user terminal 710 and the server 720 may be configured to identify topic content from the topic image. For example, in the question assistance system shown in FIG. 7 , the user terminal 710 can deploy a region recognition module 712 and a first character recognition module 713 for single-question recognition, and the server can deploy a second character recognition module for multi-question recognition Character recognition module 721.
服务器720可以被配置为根据题目内容产生解题答案和解题过程,其中,解题过程可以包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明。例如,解题模型722和与解题模型722相匹配的预设题库723可以被包括在服务器720中。服务器720还可以包括用于产生解题过程的预设规则库724。The server 720 may be configured to generate a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process may include a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step. For example, a problem solving model 722 and a preset question bank 723 matching the problem solving model 722 may be included in the server 720 . The server 720 may also include a preset rule base 724 for generating problem solving procedures.
用户终端710和服务器720中的至少一者还可以被配置为将解题过程设置在相应的显示层级中。At least one of the user terminal 710 and the server 720 may also be configured to set the problem solving process in the corresponding display level.
用户终端710也可以被配置为根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。用于显示的显示装置714可以被包括在用户终端710中。用户终端710还可以对解题答案或解题过程进行相应的渲染后再显示,以美化显示效果。The user terminal 710 may also be configured to display a corresponding display level according to at least one of a default display rule and a received display instruction. A display device 714 for displaying may be included in the user terminal 710 . The user terminal 710 may also render the answer to the problem or the problem solving process before displaying it, so as to beautify the display effect.
当影像获取装置711获取的题目影像后,题目影像被传输给区域识别模型712以识别题目区域,然后,由第一字符识别模型713或第二字符识别模型721来识别题目区域中的题目内容。根据所识别的题目内容,可以产生解题答案和解题过程,解题答案和解题过程被从服务器720传输回用户终端710,以待显示装置714进行显示。After the image acquisition device 711 acquires the subject image, the subject image is transmitted to the area recognition model 712 to identify the subject area, and then the first character recognition model 713 or the second character recognition model 721 identifies the subject content in the subject area. According to the identified content of the question, the answer to the question and the procedure of solving the question can be generated, and the answer and procedure of solving the question are transmitted from the server 720 back to the user terminal 710 to be displayed by the display device 714 .
本公开还提出了一种非暂态计算机可读存储介质,非暂态计算机可读存 储介质上存储有指令,当指令被执行时,可以实现上文所描述的题目辅助方法中的步骤。The present disclosure also proposes a non-transitory computer-readable storage medium, on which instructions are stored, and when the instructions are executed, the steps in the topic assistance method described above can be implemented.
类似地,本公开实施例中的非暂态计算机可读存储介质可以是易失性存储器或非易失性存储器,或可包括易失性和非易失性存储器两者。应注意,本文描述的计算机可读存储介质旨在包括但不限于这些和任意其它适合类型的存储器。Similarly, non-transitory computer-readable storage media in embodiments of the present disclosure may be volatile memory or non-volatile memory, or may include both volatile and non-volatile memory. It should be noted that computer-readable storage media described herein are intended to include, but not be limited to, these and any other suitable types of memory.
本公开也提出了一种计算机程序产品,该计算机程序产品可以包括指令,当指令被处理器执行时,可以实现如上所述的题目辅助方法的步骤。The present disclosure also proposes a computer program product that can include instructions that, when executed by a processor, can implement the steps of the topic assistance method as described above.
指令可以是将由一个或多个处理器直接地执行的任何指令集,诸如机器代码,或者间接地执行的任何指令集,诸如脚本。本文中的术语“指令”、“应用”、“过程”、“步骤”和“程序”在本文中可以互换使用。指令可以存储为目标代码格式以便由一个或多个处理器直接处理,或者存储为任何其他计算机语言,包括按需解释或提前编译的独立源代码模块的脚本或集合。指令可以包括引起诸如一个或多个处理器来充当本文中的各神经网络的指令。本文其他部分更加详细地解释了指令的功能、方法和例程。The instructions may be any set of instructions to be executed directly by one or more processors, such as machine code, or any set of instructions to be executed indirectly, such as scripts. The terms "instructions," "applications," "processes," "steps," and "programs" are used interchangeably herein. Instructions may be stored in object code format for direct processing by one or more processors, or in any other computer language, including scripts or collections of self-contained source code modules that are interpreted on demand or compiled ahead of time. The instructions may include instructions that cause, for example, one or more processors to function as the various neural networks herein. The functions, methods, and routines of the instructions are explained in more detail elsewhere in this document.
另外,本公开的实施方式还可以包括以下示例:In addition, the embodiments of the present disclosure may also include the following examples:
1.一种题目辅助方法,所述题目辅助方法包括:1. A topic assistance method, the topic assistance method comprising:
获取题目影像,并根据题目影像识别出题目内容;Obtain the subject image, and identify the subject content according to the subject image;
根据题目内容产生解题答案和解题过程,其中,解题过程包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明;Generating a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step;
将解题过程设置在相应的显示层级中;以及Set the problem solving process in the corresponding display level; and
根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。The corresponding display level is displayed according to at least one of a default display rule and a received display instruction.
2.根据1所述的题目辅助方法,获取题目影像,并根据题目影像识别出题目内容包括:2. According to the topic assistance method described in 1, acquiring a topic image, and identifying the topic content according to the topic image includes:
采用影像获取装置获取题目影像;Use an image acquisition device to acquire the subject image;
采用部署在用户终端的区域识别模型来确定题目影像中的题目内容所在 的题目区域,并确定题目区域中的题目数目;以及using an area identification model deployed in the user terminal to determine the topic area where the topic content in the topic image is located, and to determine the number of topics in the topic area; and
当题目数目等于一时,采用部署在用户终端的第一字符识别模型来识别题目区域中的题目内容,并将所识别的题目内容上传至服务器;When the number of questions is equal to one, the first character recognition model deployed in the user terminal is used to identify the topic content in the topic area, and the identified topic content is uploaded to the server;
当题目数目大于一时,将题目影像中的至少题目区域上传至服务器中,并采用部署在服务器的第二字符识别模型来识别题目内容。When the number of questions is greater than one, upload at least the question area in the question image to the server, and use the second character recognition model deployed on the server to identify the question content.
3.根据1所述的题目辅助方法,根据题目内容产生解题***括:3. According to the question assistance method described in 1, generating the answer to the question according to the content of the question includes:
根据由题目内容所确定的题目类型,确定部署在服务器的解题模型;以及Determine the problem solving model deployed on the server according to the problem type determined by the content of the problem; and
采用解题模型来产生解题答案。A problem-solving model is used to generate a problem-solving answer.
4.根据3所述的题目辅助方法,解题模型包括用于题目类型为计算题的题目的计算模型,以及用于题目类型为应用题的题目的自然语言处理模型和/或向量模型。4. The problem assistance method according to 3, wherein the problem solving model includes a computational model for the problem whose problem type is a computational problem, and a natural language processing model and/or a vector model for the problem whose problem type is an applied problem.
5.根据4所述的题目辅助方法,当题目类型为应用题时,采用向量模型来产生解题***括:5. According to the question assistance method described in 4, when the question type is an applied question, using a vector model to generate the answer to the question includes:
当题目类型为应用题时,对题目内容进行特征提取以生成特征向量;When the question type is an applied question, feature extraction is performed on the content of the question to generate a feature vector;
从预设题库中检索与特征向量匹配的标准向量;以及Retrieve a standard vector matching the feature vector from a preset question bank; and
根据预先设置的与标准向量对应的向量答案,产生解题答案。According to the preset vector answer corresponding to the standard vector, the problem-solving answer is generated.
6.根据1所述的题目辅助方法,根据题目内容产生解题过程包括:6. According to the problem assistance method described in 1, the problem solving process according to the content of the problem includes:
根据由题目内容所确定的题目类型,产生解题过程。According to the question type determined by the content of the question, a problem solving process is generated.
7.根据6所述的题目辅助方法,当题目类型为计算题时,根据题目内容产生解题过程包括:7. According to the problem assistance method described in 6, when the problem type is a calculation problem, the problem solving process according to the content of the problem includes:
根据题目内容的形式特征,从预设规则库中获取对应的规则,并根据对应的规则产生解题过程。According to the formal characteristics of the content of the question, the corresponding rules are obtained from the preset rule base, and the problem-solving process is generated according to the corresponding rules.
8.根据1所述的题目辅助方法,将解题过程设置在相应的显示层级中包括:8. According to the problem assistance method described in 1, setting the problem solving process in the corresponding display level includes:
根据由题目内容所确定的题目类型,确定分层规则;以及Determining stratification rules based on the type of question determined by the content of the question; and
按照分层规则将解题过程中的解题步骤、或者解题过程中的解题步骤和步骤说明,分别设置在相应的显示层级中。The problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels according to the hierarchical rules.
9.根据8所述的题目辅助方法,分层规则被配置为将解题步骤中的主步 骤设置在第一显示层级中。9. The problem assistance method according to 8, wherein the hierarchical rules are configured to place the main steps of the problem solving steps in the first display hierarchy.
10.根据9所述的题目辅助方法,分层规则还被配置为将与至少部分主步骤对应的步骤说明设置在第二显示层级中。10. The topic assistance method according to 9, the hierarchical rules are further configured to place step descriptions corresponding to at least part of the main steps in the second display level.
11.根据9所述的题目辅助方法,分层规则还被配置为将解题答案设置在第一显示层级中。11. The problem assistance method according to 9, the hierarchical rule is further configured to set the problem solving answer in the first display level.
12.根据9所述的题目辅助方法,分层规则还被配置为将至少一个主步骤所包含的第一分步骤设置在第三显示层级中。12. The topic assistance method according to 9, the hierarchical rule is further configured to arrange the first sub-step included in the at least one main step in a third display level.
13.根据12所述的题目辅助方法,分层规则还被配置为将与至少部分第一分步骤对应的步骤说明设置在第四显示层级中。13. The topic assistance method according to 12, the hierarchical rule is further configured to arrange step descriptions corresponding to at least part of the first sub-steps in a fourth display level.
14.根据12所述的题目辅助方法,分层规则还被配置为将至少一个第一分步骤所包含的第二分步骤设置在第五显示层级中。14. The topic assistance method according to 12, wherein the hierarchical rule is further configured to arrange the second sub-step included in the at least one first sub-step in a fifth display level.
15.根据14所述的题目辅助方法,分层规则还被配置为将与至少部分第二分步骤对应的步骤说明设置在第六显示层级中。15. The topic assistance method according to 14, the hierarchical rules are further configured to place step descriptions corresponding to at least part of the second sub-steps in a sixth display level.
16.根据1所述的题目辅助方法,将解题过程设置在相应的显示层级中还包括:16. According to the problem assistance method described in 1, setting the problem solving process in the corresponding display level further includes:
当根据题目内容确定出新的题目类型时,获取与新的题目类型对应的新的分层规则;以及When a new topic type is determined according to the topic content, acquiring a new hierarchical rule corresponding to the new topic type; and
按照新的分层规则将解题过程中的解题步骤、或者解题过程中的解题步骤和步骤说明,分别设置在相应的显示层级中。According to the new hierarchical rules, the problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels.
17.根据1所述的题目辅助方法,显示指令被配置为实现以下动作中的至少一者:17. The topic assistance method of 1, the display instructions are configured to implement at least one of the following actions:
显示或突出显示解题答案;display or highlight the answer to the problem;
在当前的显示层级中显示或突出显示当前的解题步骤;Display or highlight the current problem solving step in the current display level;
在当前的显示层级中显示或突出显示不同于当前的解题步骤的其它解题步骤;display or highlight other problem solving steps than the current problem solving step in the current display level;
在不同于当前的显示层级的另一个显示层级中显示或突出显示当前的解题步骤的步骤说明;以及display or highlight step-by-step instructions for the current problem-solving step in a display level different from the current display level; and
在不同于当前的显示层级的另一个显示层级中显示或突出显示不同于当前的解题步骤的其它解题步骤。Display or highlight other problem solving steps than the current problem solving step in another display level than the current one.
18.一种题目辅助装置,所述题目辅助装置包括存储器、处理器以及存储在所述存储器上的指令,当所述指令被所述处理器执行时,实现根据1至17中任一项所述的题目辅助方法的步骤。18. A problem aid device comprising a memory, a processor, and instructions stored on the memory, which, when executed by the processor, implement the method described in any one of 1 to 17. Steps of the topic-assisted method described.
19.一种题目辅助***,所述题目辅助***包括用户终端以及服务器,其中:19. A topic assistance system, the topic assistance system comprising a user terminal and a server, wherein:
所述用户终端被配置为获取题目影像;the user terminal is configured to obtain the subject image;
所述用户终端和所述服务器中的至少一者被配置为根据题目影像识别出题目内容;At least one of the user terminal and the server is configured to identify the topic content according to the topic image;
所述服务器被配置为根据题目内容产生解题答案和解题过程,其中,解题过程包括解题步骤或者解题过程包括解题步骤和至少部分解题步骤的步骤说明;The server is configured to generate a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step;
所述用户终端和所述服务器中的至少一者还被配置为将解题过程设置在相应的显示层级中;以及At least one of the user terminal and the server is further configured to set the problem solving process in a corresponding display level; and
所述用户终端被配置为根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。The user terminal is configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
20.根据19所述的题目辅助***,所述用户终端包括:20. The topic assistance system according to 19, wherein the user terminal comprises:
影像获取装置,所述影像获取装置被配置为获取题目影像;以及an image acquisition device configured to acquire a subject image; and
显示装置,所述显示装置被配置为根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。A display device configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
21.一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质上存储有指令,当所述指令被所述处理器执行时,实现根据1至17中任一项所述的题目辅助方法的步骤。21. A non-transitory computer-readable storage medium having instructions stored thereon that, when executed by the processor, implement the method described in any of 1 to 17. Steps of the topic-assisted method described.
22.一种计算机程序产品,所述计算机程序产品包括指令,当所述指令被所述处理器执行时,实现根据1至17中任一项所述的题目辅助方法的步骤。22. A computer program product comprising instructions which, when executed by the processor, implement the steps of the topic assistance method according to any one of 1 to 17.
需要说明的是,附图中的流程图和框图,图示了按照本公开各种实施例的***、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,所述模块、程序段、或代码的一部分包含一个或多个用于实现规定的 逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的***来实现,或者可以用专用硬件与计算机指令的组合来实现。It should be noted that the flowcharts and block diagrams in the accompanying drawings illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logic for implementing the specified logic Executable instructions for the function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
一般而言,本公开的各种示例实施例可以在硬件或专用电路、软件、固件、逻辑,或其任何组合中实施。某些方面可以在硬件中实施,而其他方面可以在可以由控制器、微处理器或其他计算设备执行的固件或软件中实施。当本公开的实施例的各方面被图示或描述为框图、流程图或使用某些其他图形表示时,将理解此处描述的方框、装置、***、技术或方法可以作为非限制性的示例在硬件、软件、固件、专用电路或逻辑、通用硬件或控制器或其他计算设备,或其某些组合中实施。In general, the various example embodiments of the present disclosure may be implemented in hardware or special purpose circuits, software, firmware, logic, or any combination thereof. Certain aspects may be implemented in hardware, while other aspects may be implemented in firmware or software that may be executed by a controller, microprocessor or other computing device. While aspects of the embodiments of the present disclosure are illustrated or described as block diagrams, flowcharts, or using some other graphical representation, it is to be understood that the blocks, apparatus, systems, techniques, or methods described herein may be taken as non-limiting Examples are implemented in hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controllers or other computing devices, or some combination thereof.
在说明书及权利要求中的词语“前”、“后”、“顶”、“底”、“之上”、“之下”等,如果存在的话,用于描述性的目的而并不一定用于描述不变的相对位置。应当理解,这样使用的词语在适当的情况下是可互换的,使得在此所描述的本公开的实施例,例如,能够在与在此所示出的或另外描述的那些取向不同的其他取向上操作。The words "front," "rear," "top," "bottom," "over," "under," etc. in the specification and claims, if present, are used for descriptive purposes and not necessarily to describe an invariant relative position. It is to be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the disclosure described herein are, for example, capable of other orientations than those illustrated or otherwise described herein. Orientation to operate.
如在此所使用的,词语“示例性的”意指“用作示例、实例或说明”,而不是作为将被精确复制的“模型”。在此示例性描述的任意实现方式并不一定要被解释为比其它实现方式优选的或有利的。而且,本公开不受在上述技术领域、背景技术、发明内容或具体实施方式中所给出的任何所表述的或所暗示的理论所限定。As used herein, the word "exemplary" means "serving as an example, instance, or illustration" rather than as a "model" to be exactly reproduced. Any implementation illustratively described herein is not necessarily to be construed as preferred or advantageous over other implementations. Furthermore, the present disclosure is not to be bound by any expressed or implied theory presented in the preceding technical field, background, brief summary or detailed description.
如在此所使用的,词语“基本上”意指包含由设计或制造的缺陷、器件或元件的容差、环境影响和/或其它因素所致的任意微小的变化。词语“基本上”还允许由寄生效应、噪声以及可能存在于实际的实现方式中的其它实际考虑因素所致的与完美的或理想的情形之间的差异。As used herein, the word "substantially" is meant to encompass any minor variation due to design or manufacturing imperfections, tolerances of devices or elements, environmental influences, and/or other factors. The word "substantially" also allows for differences from a perfect or ideal situation due to parasitics, noise, and other practical considerations that may exist in an actual implementation.
另外,前面的描述可能提及了被“连接”或“耦接”在一起的元件或节点或特征。如在此所使用的,除非另外明确说明,“连接”意指一个元件/节点 /特征与另一种元件/节点/特征在电学上、机械上、逻辑上或以其它方式直接地连接(或者直接通信)。类似地,除非另外明确说明,“耦接”意指一个元件/节点/特征可以与另一元件/节点/特征以直接的或间接的方式在机械上、电学上、逻辑上或以其它方式连结以允许相互作用,即使这两个特征可能并没有直接连接也是如此。也就是说,“耦接”意图包含元件或其它特征的直接连结和间接连结,包括利用一个或多个中间元件的连接。Additionally, the preceding description may refer to elements or nodes or features being "connected" or "coupled" together. As used herein, unless expressly stated otherwise, "connected" means that one element/node/feature is electrically, mechanically, logically or otherwise directly connected to another element/node/feature (or direct communication). Similarly, unless expressly stated otherwise, "coupled" means that one element/node/feature can be mechanically, electrically, logically or otherwise linked, directly or indirectly, with another element/node/feature to allow interaction, even though the two features may not be directly connected. That is, "coupled" is intended to encompass both direct and indirect connections of elements or other features, including connections utilizing one or more intervening elements.
另外,仅仅为了参考的目的,还可以在本文中使用“第一”、“第二”等类似术语,并且因而并非意图限定。例如,除非上下文明确指出,否则涉及结构或元件的词语“第一”、“第二”和其它此类数字词语并没有暗示顺序或次序。Also, terms like "first," "second," and the like may also be used herein for reference purposes only, and are thus not intended to be limiting. For example, the terms "first," "second," and other such numerical terms referring to structures or elements do not imply a sequence or order unless the context clearly dictates otherwise.
还应理解,“包括/包含”一词在本文中使用时,说明存在所指出的特征、整体、步骤、操作、单元和/或组件,但是并不排除存在或增加一个或多个其它特征、整体、步骤、操作、单元和/或组件以及/或者它们的组合。It should also be understood that the term "comprising/comprising" when used herein indicates the presence of the indicated feature, integer, step, operation, unit and/or component, but does not preclude the presence or addition of one or more other features, Entities, steps, operations, units and/or components and/or combinations thereof.
在本公开中,术语“提供”从广义上用于涵盖获得对象的所有方式,因此“提供某对象”包括但不限于“购买”、“制备/制造”、“布置/设置”、“安装/装配”、和/或“订购”对象等。In this disclosure, the term "providing" is used broadly to encompass all ways of obtaining an object, thus "providing something" includes, but is not limited to, "purchasing," "preparing/manufacturing," "arranging/arranging," "installing/ Assembly", and/or "Order" objects, etc.
虽然已经通过示例对本公开的一些特定实施例进行了详细说明,但是本领域的技术人员应该理解,以上示例仅是为了进行说明,而不是为了限制本公开的范围。在此公开的各实施例可以任意组合,而不脱离本公开的精神和范围。本领域的技术人员还应理解,可以对实施例进行多种修改而不脱离本公开的范围和精神。本公开的范围由所附权利要求来限定。While some specific embodiments of the present disclosure have been described in detail by way of examples, those skilled in the art will appreciate that the above examples are provided for illustration only, and are not intended to limit the scope of the present disclosure. The various embodiments disclosed herein may be combined arbitrarily without departing from the spirit and scope of the present disclosure. It will also be understood by those skilled in the art that various modifications may be made to the embodiments without departing from the scope and spirit of the present disclosure. The scope of the present disclosure is defined by the appended claims.

Claims (22)

  1. 一种题目辅助方法,其特征在于,所述题目辅助方法包括:A topic assistance method, characterized in that the topic assistance method comprises:
    获取题目影像,并根据所述题目影像识别出题目内容;Obtaining the subject image, and identifying the subject content according to the subject image;
    根据所述题目内容产生解题答案和解题过程,其中,所述解题过程包括解题步骤或者所述解题过程包括解题步骤和至少部分解题步骤的步骤说明;generating a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step;
    将所述解题过程设置在相应的显示层级中;以及arranging the problem solving process in a corresponding display level; and
    根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。The corresponding display level is displayed according to at least one of a default display rule and a received display instruction.
  2. 根据权利要求1所述的题目辅助方法,其特征在于,获取题目影像,并根据所述题目影像识别出题目内容包括:The topic assistance method according to claim 1, wherein acquiring a topic image and identifying the topic content according to the topic image comprises:
    采用影像获取装置获取题目影像;Use an image acquisition device to acquire the subject image;
    采用部署在用户终端的区域识别模型来确定所述题目影像中的题目内容所在的题目区域,并确定所述题目区域中的题目数目;以及Determine the topic area where the topic content in the topic image is located by using an area identification model deployed in the user terminal, and determine the number of topics in the topic area; and
    当所述题目数目等于一时,采用部署在所述用户终端的第一字符识别模型来识别所述题目区域中的题目内容,并将所识别的题目内容上传至服务器;When the number of questions is equal to one, adopting the first character recognition model deployed in the user terminal to identify the title content in the title area, and uploading the identified title content to the server;
    当所述题目数目大于一时,将所述题目影像中的至少题目区域上传至所述服务器中,并采用部署在所述服务器的第二字符识别模型来识别所述题目内容。When the number of questions is greater than one, uploading at least the title area in the title image to the server, and using the second character recognition model deployed on the server to identify the title content.
  3. 根据权利要求1所述的题目辅助方法,其特征在于,根据所述题目内容产生解题***括:The problem assistance method according to claim 1, wherein generating a problem-solving answer according to the content of the problem comprises:
    根据由所述题目内容所确定的题目类型,确定部署在服务器的解题模型;以及determining a problem-solving model deployed on the server according to the problem type determined by the problem content; and
    采用所述解题模型来产生所述解题答案。The problem-solving model is employed to generate the problem-solving answer.
  4. 根据权利要求3所述的题目辅助方法,其特征在于,所述解题模型包括用于题目类型为计算题的题目的计算模型,以及用于题目类型为应用题的题目的自然语言处理模型和/或向量模型。The problem assistance method according to claim 3, wherein the problem solving model includes a calculation model for the problem whose problem type is a computational problem, a natural language processing model for the problem whose problem type is an applied problem, and / or vector models.
  5. 根据权利要求4所述的题目辅助方法,其特征在于,当所述题目类型为应用题时,采用向量模型来产生解题***括:The problem assistance method according to claim 4, wherein when the problem type is an applied problem, using a vector model to generate a problem-solving answer comprises:
    当所述题目类型为应用题时,对所述题目内容进行特征提取以生成特征向量;When the question type is an applied question, feature extraction is performed on the content of the question to generate a feature vector;
    从预设题库中检索与所述特征向量匹配的标准向量;以及retrieving a standard vector matching the feature vector from a preset question bank; and
    根据预先设置的与所述标准向量对应的向量答案,产生所述解题答案。The problem-solving answer is generated according to a preset vector answer corresponding to the standard vector.
  6. 根据权利要求1所述的题目辅助方法,其特征在于,根据所述题目内容产生解题过程包括:The problem assistance method according to claim 1, wherein generating a problem solving process according to the content of the problem comprises:
    根据由所述题目内容所确定的题目类型,产生解题过程。A problem solving process is generated according to the problem type determined by the problem content.
  7. 根据权利要求6所述的题目辅助方法,其特征在于,当所述题目类型为计算题时,根据所述题目内容产生解题过程包括:The problem assistance method according to claim 6, wherein when the problem type is a calculation problem, generating a problem solving process according to the problem content comprises:
    根据所述题目内容的形式特征,从预设规则库中获取对应的规则,并根据对应的所述规则产生解题过程。According to the formal characteristics of the content of the question, the corresponding rules are obtained from the preset rule base, and the problem solving process is generated according to the corresponding rules.
  8. 根据权利要求1所述的题目辅助方法,其特征在于,将所述解题过程设置在相应的显示层级中包括:The problem assistance method according to claim 1, wherein setting the problem solving process in a corresponding display level comprises:
    根据由所述题目内容所确定的题目类型,确定分层规则;以及determining a tiering rule based on the topic type determined by the topic content; and
    按照所述分层规则将所述解题过程中的解题步骤、或者所述解题过程中的解题步骤和步骤说明,分别设置在相应的显示层级中。The problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in corresponding display levels according to the hierarchical rules.
  9. 根据权利要求8所述的题目辅助方法,其特征在于,所述分层规则被配置为将所述解题步骤中的主步骤设置在第一显示层级中。The problem assistance method of claim 8, wherein the hierarchical rule is configured to set the main step of the problem solving steps in a first display level.
  10. 根据权利要求9所述的题目辅助方法,其特征在于,所述分层规则还被配置为将与至少部分主步骤对应的步骤说明设置在第二显示层级中。9. The topic assistance method of claim 9, wherein the hierarchical rules are further configured to place step descriptions corresponding to at least part of the main steps in the second display level.
  11. 根据权利要求9所述的题目辅助方法,其特征在于,所述分层规则还被配置为将解题答案设置在所述第一显示层级中。The problem assistance method of claim 9, wherein the hierarchical rule is further configured to set the problem solving answer in the first display level.
  12. 根据权利要求9所述的题目辅助方法,其特征在于,所述分层规则还被配置为将至少一个主步骤所包含的第一分步骤设置在第三显示层级中。The topic assistance method of claim 9, wherein the hierarchical rule is further configured to set the first sub-step included in the at least one main step in a third display level.
  13. 根据权利要求12所述的题目辅助方法,其特征在于,所述分层规则还被配置为将与至少部分第一分步骤对应的步骤说明设置在第四显示层级中。13. The topic assistance method of claim 12, wherein the hierarchical rule is further configured to arrange step descriptions corresponding to at least part of the first sub-steps in a fourth display level.
  14. 根据权利要求12所述的题目辅助方法,其特征在于,所述分层规则还被配置为将至少一个第一分步骤所包含的第二分步骤设置在第五显示层级 中。The topic assistance method of claim 12, wherein the hierarchical rule is further configured to set the second sub-step included in the at least one first sub-step in a fifth display level.
  15. 根据权利要求14所述的题目辅助方法,其特征在于,所述分层规则还被配置为将与至少部分第二分步骤对应的步骤说明设置在第六显示层级中。15. The topic assistance method of claim 14, wherein the hierarchical rules are further configured to arrange step descriptions corresponding to at least part of the second sub-steps in a sixth display level.
  16. 根据权利要求1所述的题目辅助方法,其特征在于,将所述解题过程设置在相应的显示层级中还包括:The problem assistance method according to claim 1, wherein setting the problem solving process in a corresponding display level further comprises:
    当根据所述题目内容确定出新的题目类型时,获取与新的题目类型对应的新的分层规则;以及When a new topic type is determined according to the topic content, acquiring a new hierarchical rule corresponding to the new topic type; and
    按照所述新的分层规则将所述解题过程中的解题步骤、或者所述解题过程中的解题步骤和步骤说明,分别设置在所述相应的显示层级中。The problem-solving steps in the problem-solving process, or the problem-solving steps and step descriptions in the problem-solving process, are respectively set in the corresponding display levels according to the new hierarchical rule.
  17. 根据权利要求1所述的题目辅助方法,其特征在于,所述显示指令被配置为实现以下动作中的至少一者:The topic assistance method of claim 1, wherein the display instruction is configured to implement at least one of the following actions:
    显示或突出显示解题答案;display or highlight the answer to the problem;
    在当前的显示层级中显示或突出显示当前的解题步骤;Display or highlight the current problem solving step in the current display level;
    在当前的显示层级中显示或突出显示不同于当前的解题步骤的其它解题步骤;display or highlight other problem solving steps than the current problem solving step in the current display level;
    在不同于当前的显示层级的另一个显示层级中显示或突出显示当前的解题步骤的步骤说明;以及display or highlight step-by-step instructions for the current problem-solving step in a display level different from the current display level; and
    在不同于当前的显示层级的另一个显示层级中显示或突出显示不同于当前的解题步骤的其它解题步骤。Display or highlight other problem solving steps than the current problem solving step in another display level than the current one.
  18. 一种题目辅助装置,其特征在于,所述题目辅助装置包括存储器、处理器以及存储在所述存储器上的指令,当所述指令被所述处理器执行时,实现根据权利要求1至17中任一项所述的题目辅助方法的步骤。A problem assisting device, characterized in that the problem assisting device comprises a memory, a processor, and instructions stored on the memory, and when the instructions are executed by the processor, the problems according to claims 1 to 17 are implemented. The steps of any one of the topic assistance methods.
  19. 一种题目辅助***,其特征在于,所述题目辅助***包括用户终端以及服务器,其中:A topic assistance system, characterized in that the topic assistance system includes a user terminal and a server, wherein:
    所述用户终端被配置为获取题目影像;the user terminal is configured to obtain the subject image;
    所述用户终端和所述服务器中的至少一者被配置为根据所述题目影像识别出题目内容;At least one of the user terminal and the server is configured to identify topic content according to the topic image;
    所述服务器被配置为根据所述题目内容产生解题答案和解题过程,其中, 所述解题过程包括解题步骤或者所述解题过程包括解题步骤和至少部分解题步骤的步骤说明;The server is configured to generate a problem-solving answer and a problem-solving process according to the content of the problem, wherein the problem-solving process includes a problem-solving step or the problem-solving process includes a problem-solving step and a step description of at least part of the problem-solving step ;
    所述用户终端和所述服务器中的至少一者还被配置为将所述解题过程设置在相应的显示层级中;以及At least one of the user terminal and the server is further configured to set the problem solving process in a corresponding display level; and
    所述用户终端被配置为根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。The user terminal is configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
  20. 根据权利要求19所述的题目辅助***,其特征在于,所述用户终端包括:The topic assistance system according to claim 19, wherein the user terminal comprises:
    影像获取装置,所述影像获取装置被配置为获取所述题目影像;以及an image acquisition device configured to acquire the subject image; and
    显示装置,所述显示装置被配置为根据默认的显示规则和接收到的显示指令中的至少一个来显示对应的显示层级。A display device configured to display a corresponding display level according to at least one of a default display rule and a received display instruction.
  21. 一种非暂态计算机可读存储介质,其特征在于,所述非暂态计算机可读存储介质上存储有指令,当所述指令被所述处理器执行时,实现根据权利要求1至17中任一项所述的题目辅助方法的步骤。A non-transitory computer-readable storage medium, characterized in that the non-transitory computer-readable storage medium stores instructions, and when the instructions are executed by the processor, the instructions according to claims 1 to 17 are implemented. The steps of any one of the topic assistance methods.
  22. 一种计算机程序产品,其特征在于,所述计算机程序产品包括指令,当所述指令被所述处理器执行时,实现根据权利要求1至17中任一项所述的题目辅助方法的步骤。A computer program product, characterized in that the computer program product comprises instructions which, when executed by the processor, implement the steps of the topic assistance method according to any one of claims 1 to 17.
PCT/CN2021/128829 2020-12-16 2021-11-04 Question assistance method, apparatus and system WO2022127425A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011489433.XA CN112488052B (en) 2020-12-16 2020-12-16 Question assisting method, device and system
CN202011489433.X 2020-12-16

Publications (1)

Publication Number Publication Date
WO2022127425A1 true WO2022127425A1 (en) 2022-06-23

Family

ID=74917126

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/128829 WO2022127425A1 (en) 2020-12-16 2021-11-04 Question assistance method, apparatus and system

Country Status (2)

Country Link
CN (1) CN112488052B (en)
WO (1) WO2022127425A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112488052B (en) * 2020-12-16 2023-05-26 杭州大拿科技股份有限公司 Question assisting method, device and system
CN113257063A (en) * 2021-06-08 2021-08-13 北京字节跳动网络技术有限公司 Interaction method and terminal equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169644A (en) * 2010-02-26 2011-08-31 卢君毅 Examination system with answering and reminding functions
US20140093847A1 (en) * 2012-09-28 2014-04-03 The Trustees of the Stevens Institute of Technolory Automated teaching system using declarative problem solving approach
CN107492275A (en) * 2017-09-07 2017-12-19 李卫东 Assisted learning method and device
CN109815955A (en) * 2019-03-04 2019-05-28 杭州大拿科技股份有限公司 Topic householder method and system
CN109859555A (en) * 2019-03-29 2019-06-07 上海乂学教育科技有限公司 It is suitble to Mathematics Discipline methods of exhibiting and the computer system step by step of adaptive learning
CN109948473A (en) * 2019-03-04 2019-06-28 中国计量大学 A kind of method neural network based promoting student's applied problem solution topic ability
CN111369403A (en) * 2020-02-27 2020-07-03 北京字节跳动网络技术有限公司 Problem solving demonstration method and device
CN111475627A (en) * 2019-01-23 2020-07-31 北京字节跳动网络技术有限公司 Method and device for checking solution deduction questions, electronic equipment and storage medium
CN112488052A (en) * 2020-12-16 2021-03-12 杭州大拿科技股份有限公司 Topic auxiliary method, device and system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07134540A (en) * 1993-11-10 1995-05-23 Hitachi Ltd Computerized education assisting system
KR101568841B1 (en) * 2015-01-19 2015-11-13 주식회사 아이카이스트 Method and system for process output of calculation in accordance with a equation recognition
CN111951133B (en) * 2020-07-31 2024-03-22 北京伴我学教育科技有限公司 Method, device and storage medium for assisting in solving questions

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169644A (en) * 2010-02-26 2011-08-31 卢君毅 Examination system with answering and reminding functions
US20140093847A1 (en) * 2012-09-28 2014-04-03 The Trustees of the Stevens Institute of Technolory Automated teaching system using declarative problem solving approach
CN107492275A (en) * 2017-09-07 2017-12-19 李卫东 Assisted learning method and device
CN111475627A (en) * 2019-01-23 2020-07-31 北京字节跳动网络技术有限公司 Method and device for checking solution deduction questions, electronic equipment and storage medium
CN109815955A (en) * 2019-03-04 2019-05-28 杭州大拿科技股份有限公司 Topic householder method and system
CN109948473A (en) * 2019-03-04 2019-06-28 中国计量大学 A kind of method neural network based promoting student's applied problem solution topic ability
CN109859555A (en) * 2019-03-29 2019-06-07 上海乂学教育科技有限公司 It is suitble to Mathematics Discipline methods of exhibiting and the computer system step by step of adaptive learning
CN111369403A (en) * 2020-02-27 2020-07-03 北京字节跳动网络技术有限公司 Problem solving demonstration method and device
CN112488052A (en) * 2020-12-16 2021-03-12 杭州大拿科技股份有限公司 Topic auxiliary method, device and system

Also Published As

Publication number Publication date
CN112488052B (en) 2023-05-26
CN112488052A (en) 2021-03-12

Similar Documents

Publication Publication Date Title
WO2020177531A1 (en) Question assistance method and system
CN111191078B (en) Video information processing method and device based on video information processing model
CN110446063B (en) Video cover generation method and device and electronic equipment
WO2022127425A1 (en) Question assistance method, apparatus and system
CN114375435A (en) Enhancing tangible content on a physical activity surface
CN108334627B (en) Method and device for searching new media content and computer equipment
DE102016013487A1 (en) Semantic vector space with natural language
CN110188775B (en) Image content description automatic generation method based on joint neural network model
CN108052687B (en) Education information search system based on Internet
CN111475627B (en) Method and device for checking solution deduction questions, electronic equipment and storage medium
Bansal et al. Visual question answering on image sets
CN110750624A (en) Information output method and device
CN112712069A (en) Question judging method and device, electronic equipment and storage medium
CN112069349A (en) Method for automatically filling in answer, electronic device and readable storage medium
CN113127682A (en) Topic presentation method, system, electronic device, and computer-readable storage medium
Ahmed et al. Arabic sign language intelligent translator
CN117911730A (en) Method, apparatus and computer program product for processing topics
CN117077679B (en) Named entity recognition method and device
WO2023024898A1 (en) Problem assistance method, problem assistance apparatus and problem assistance system
CN110580294B (en) Entity fusion method, device, equipment and storage medium
CN116743692A (en) Historical message folding method and system
US9940320B2 (en) Plugin tool for collecting user generated document segmentation feedback
CN112883218A (en) Image-text combined representation searching method, system, server and storage medium
Mueller et al. Can computers master the art of communication?: A focus on visual analytics
CN113805977B (en) Test evidence obtaining method, model training method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21905354

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21905354

Country of ref document: EP

Kind code of ref document: A1