WO2019128103A1 - Information input method, device, terminal, and computer readable storage medium - Google Patents

Information input method, device, terminal, and computer readable storage medium

Info

Publication number
WO2019128103A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
prompt
voice input
terminal
information
Prior art date
Application number
PCT/CN2018/089393
Other languages
French (fr)
Chinese (zh)
Inventor
王凯宁
田璐瑜
Original Assignee
重庆小雨点小额贷款有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 重庆小雨点小额贷款有限公司
Publication of WO2019128103A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G06F40/174 Form filling; Merging
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G06F40/177 Editing, e.g. inserting or deleting of tables; using ruled lines
    • G06F40/18 Editing, e.g. inserting or deleting of tables; using ruled lines of spreadsheets
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Definitions

  • the present invention relates to the field of computers, and in particular, to an information entry method, apparatus, terminal, and computer readable storage medium.
  • mobile terminals have become increasingly intelligent. It is increasingly common to fill in application forms of all kinds, or to pay household bills, through a mobile terminal.
  • for example, a user can fill in resume information on a mobile terminal to apply for a position at an organization; as another example, when opening a card or handling other business at a bank, a user can fill in the application form on a mobile terminal, which both saves paper and shortens the processing time.
  • when filling in a resume or a banking application form on a mobile terminal, the user must first touch the text box in the corresponding information field, after which the terminal pops up a virtual keyboard and the user selects an input method to enter the content.
  • with this text-based way of filling in forms on a mobile terminal, if the content to be entered contains characters as well as English letters and digits, the user has to keep switching input methods. The operation is cumbersome and makes information entry inefficient, especially for users who are not proficient with the terminal's virtual keyboard.
  • the embodiments of the present invention provide an information input method, device, terminal, and computer readable storage medium that allow information to be entered by voice, improving information input efficiency.
  • a first aspect of the embodiments of the present invention provides an information input method, including:
  • in the voice input mode, prompt voices for form item names are output in a preset order on the voice input interface;
  • the first form is generated according to the form item name of the output prompt voice and the corresponding extracted key information, and the first form is displayed.
  • the information entry method further includes:
  • checking whether the pause voice input function needs to be enabled includes:
  • the information entry method further includes:
  • the current information entry mode is switched from the voice entry mode to the text entry mode.
  • the information entry method further includes:
  • the information input method further includes:
  • a second aspect of the embodiments of the present invention provides an information input apparatus, including:
  • the output unit is configured to output, in the voice input mode, the prompt voice of the form item name in a preset order in the voice input interface;
  • An extracting unit configured to detect a response voice input by the user based on the prompt voice, and extract key information matching the prompt voice from the response voice;
  • a generating unit configured to: when detecting that the output prompt voice is the prompt voice of the last form item name in the preset order, generate the first form according to the form item name of the prompt voice and the corresponding extracted key information;
  • a display unit for displaying the first form.
  • the information input device further includes:
  • the detecting unit is specifically configured to: detect whether the response voice input by the user is detected within a preset time after outputting the prompt voice; if not, determine that the pause voice input function needs to be enabled.
  • the detecting unit is specifically configured to: identify whether the response voice input by the user includes key information that matches the prompt voice; if not, determine that the pause voice input function needs to be enabled.
  • the information input device further includes: a switching unit, configured to switch the current information input mode from the voice input mode to the text entry mode if receiving the instruction to switch the input mode.
  • the information input device further includes:
  • the generating unit is further configured to generate a second form according to the name of the form item that has output the prompt voice, the key information corresponding to the extracted, and the name of the form item remaining to output the prompt voice;
  • the display unit is further configured to display the second form in the text entry interface.
  • an embodiment of the present invention provides a terminal including a processor, an input device, an output device, and a memory that are connected to one another, where the memory is configured to store a computer program that supports the terminal in executing the foregoing method, the computer program includes program instructions, and the processor is configured to invoke the program instructions to perform the method of the first aspect above.
  • an embodiment of the present invention provides a computer readable storage medium storing a computer program, where the computer program includes program instructions that, when executed by a processor, cause the processor to perform the method of the first aspect.
  • in the embodiments, in the voice input mode, prompt voices for form item names are output in a preset order on the voice input interface, and key information matching the prompt voice is extracted from the detected response voice input by the user. When it is detected that the prompt voice just output is the prompt voice for the last form item name in the preset order, a first form is generated from the form item names whose prompt voices have been output and the correspondingly extracted key information, and the first form is displayed on the voice input interface. Information can thus be entered by voice, improving information input efficiency.
  • FIG. 2 is a schematic flowchart of another information input method according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of another information input interface according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of an information input apparatus according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • displaying the voice input interface reminds the user of the selected information input mode and indicates that voice input is about to begin.
  • the voice input interface can also display the text corresponding to the prompt voice that the terminal outputs for a form item name in the voice input mode, and the text corresponding to the response voice input by the user.
  • for example, if the prompt voice output by the terminal for a form item name is "How old are you this year?", the text "How old are you this year?" can be displayed in the voice input interface. If the user answers "My age is 25", then 25 can be displayed in the voice input interface.
  • the above only lists some of the contents that may be displayed in the voice input interface, and does not limit the specific content displayed in the voice input interface.
  • the preset order of the prompt voices for form item names, stored locally or on the server, may be set by the terminal according to the importance of each form item name in the given form type, or according to the associations between form item names. For example, if the form type selected by the user is a loan application, the user's real identity information and financial capacity are more important, so the terminal can preset the order of the prompt voices for the form item names in the loan application form accordingly.
  • the terminal may switch from the voice input interface to the text entry interface and display the first form in the text entry interface, so that the user can modify the key information extracted by the terminal in the voice input mode.
  • the information input method shown in FIG. 2 may include:
  • the voice input interface shown in FIG. 3 may include: a voice bar for outputting a voice prompt, a pause voice input function button, and an information input mode switching button.
  • the voice bar may flash while the terminal outputs a voice prompt, which both tells the user that a prompt voice is currently being output and gives the user a pleasant visual effect;
  • the function button for suspending the voice input can be used.
  • the information input mode switch button can be used to switch the current voice input mode to the text entry mode.
  • the terminal outputs the prompt voice of the form item name in a preset order on the voice input interface.
  • the terminal may pre-store, locally or on a server, the form item names of different forms and the prompt voice corresponding to each form item name.
  • the terminal retrieves the stored form item name and its corresponding prompt voice from local storage or from the server, and outputs the prompt voice.
  • the form item names of different forms and their corresponding prompt voices, stored by the terminal locally or on the server, may be as follows: for example, for a loan application form, a correspondence between the form item names and the prompt voices may be assumed.
  • the terminal detects, based on the prompt voice, the response voice input by the user. If a response voice is detected, the terminal extracts from it the key information that matches the prompt voice. If no response voice is detected, for example because the user is talking with someone else and did not hear the prompt voice, or is away from the terminal handling other matters, the step of extracting key information from the response voice may be skipped.
  • if no response voice is detected, the terminal outputs the prompt voice a second time once a preset time has elapsed after the first output; if a response voice is still not detected, the terminal outputs the prompt voice a third time after the same preset time. If a response voice is still not detected, the terminal may continue to repeat these steps, skip this prompt voice and go on to output the next one, or handle the situation in some other way.
  • the terminal stops the voice input operation; when the pause voice input function is turned off, that is, when voice input resumes, the terminal needs to re-output the prompt voice that had already been output before the pause voice input function was enabled, so the prompt voice is output twice and the terminal wastes power.
  • when the terminal receives a modification instruction for a target form item in the first form, the key information corresponding to the target form item is replaced with text information input by the user. Information can thus be entered by voice and the voice-entered information can then be corrected, improving both the efficiency and the accuracy of information entry.
  • the detecting unit 505 is specifically configured to: identify whether the response voice input by the user includes key information that matches the prompt voice; if not, determine that the pause voice input function needs to be enabled.
  • the output unit 501 outputs the prompt voices for form item names in the preset order on the voice input interface, and the extracting unit 502 extracts, from the detected response voice input by the user, the key information that matches the prompt voice. When it is detected that the prompt voice just output is the prompt voice for the last form item name in the preset order, the generating unit 503 generates a first form from the form item names whose prompt voices have been output and the correspondingly extracted key information, and the display unit 504 displays the first form, so that information can be entered by voice and information input efficiency is improved.
  • processor 601 is configured to invoke the program instructions and further execute:
  • the processor 601 is configured to invoke the program instructions and execute:
  • the second form is generated according to the name of the form item that has output the prompt voice, the key information corresponding to the extraction, and the name of the form item remaining to output the prompt voice; the second form is displayed in the text entry interface.
  • the processor 601 is configured to invoke the program instruction and further execute:
  • the memory 604 can include read only memory and random access memory and provides instructions and data to the processor 601.
  • a portion of the memory 604 may also include a non-volatile random access memory.
  • the memory 604 can also store information of the device type.
  • the processor 601, the input device 602, and the output device 603 described in the embodiments of the present invention may perform the implementation manners described in the embodiment of the information input method provided in FIG. 1 and FIG.
  • the implementation manner of the information input device is described, and details are not described herein again.
  • the first form is generated according to the form item name of the prompt voice and the corresponding extracted key information, and the first form is displayed.
  • the program instruction is specifically implemented when executed by the processor:
  • program instructions are also implemented when executed by the processor:
  • the current information entry mode is switched from the voice entry mode to the text entry mode.
  • the program instructions are further executed when executed by the processor:
  • the second form is generated according to the name of the form item that has output the prompt voice, the key information corresponding to the extraction, and the name of the form item remaining to output the prompt voice; the second form is displayed in the text entry interface.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Embodiments of the present invention provide an information input method, a device, a terminal, and a computer readable storage medium. The method comprises: outputting, under a voice input mode, prompt voice of form item names on a voice input interface according to a preset sequence; detecting response voice input by a user on the basis of the prompt voice, and extracting key information matching the prompt voice from the response voice; and when it is detected that the output prompt voice is the prompt voice of the last form item name in the preset sequence, generating a first form according to the form item name of the output prompt voice and the corresponding extracted key information, and displaying the first form. By adoption of the embodiments of the present invention, information can be input by voice, so that the information input efficiency is improved.

Description

Information input method, device, terminal, and computer readable storage medium
Technical Field
The present invention relates to the field of computers, and in particular to an information input method, apparatus, terminal, and computer readable storage medium.
Background
With the rapid development of the information age, mobile terminals are becoming increasingly intelligent. It is increasingly common to fill in application forms of all kinds, or to pay household bills, through a mobile terminal. For example, a user can fill in resume information on a mobile terminal to apply for a position at an organization; as another example, when opening a card or handling other business at a bank, a user can fill in the application form on a mobile terminal, which both saves paper and shortens the processing time.
When filling in a resume or a banking application form on a mobile terminal as described above, the user must first touch the text box in the corresponding information field, after which the terminal pops up a virtual keyboard and the user selects an input method to enter the content. With this text-based way of filling in forms, if the content to be entered contains characters as well as English letters and digits, the user has to keep switching input methods. The operation is cumbersome and makes information entry inefficient, especially for users who are not proficient with the terminal's virtual keyboard.
Summary of the Invention
Embodiments of the present invention provide an information input method, apparatus, terminal, and computer readable storage medium that allow information to be entered by voice, improving information input efficiency.
A first aspect of the embodiments of the present invention provides an information input method, including:
in a voice input mode, outputting prompt voices for form item names in a preset order on a voice input interface;
detecting, based on the prompt voice, a response voice input by the user, and extracting from the response voice key information that matches the prompt voice;
when it is detected that the prompt voice just output is the prompt voice for the last form item name in the preset order, generating a first form from the form item names whose prompt voices have been output and the correspondingly extracted key information, and displaying the first form.
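The three steps above can be sketched as a simple loop. This is a minimal illustration under stated assumptions, not the implementation described in the application: `run_voice_entry`, `play_and_listen`, and `extract` are hypothetical names, and speech output and speech recognition are assumed to be supplied by the caller.

```python
from typing import Callable, Dict, List, Tuple


def run_voice_entry(
    prompts: List[Tuple[str, str]],
    play_and_listen: Callable[[str], str],
    extract: Callable[[str, str], str],
) -> Dict[str, str]:
    """Ask each prompt in the preset order and build the first form.

    prompts: (form item name, prompt text) pairs in the preset order.
    play_and_listen: hypothetical hook that outputs the prompt voice and
        returns the recognized text of the user's response voice.
    extract: hypothetical hook mapping (item name, response) to key information.
    """
    first_form: Dict[str, str] = {}
    for item_name, prompt_text in prompts:
        response = play_and_listen(prompt_text)
        first_form[item_name] = extract(item_name, response)
    # The loop ends once the prompt for the last form item has been output,
    # which is the point at which the first form is generated.
    return first_form


# Usage with canned sample responses standing in for real speech recognition.
canned = {
    "What is your name?": "Li Lei",
    "How old are you this year?": "25",
}
form = run_voice_entry(
    [("Name", "What is your name?"), ("Age", "How old are you this year?")],
    play_and_listen=canned.__getitem__,
    extract=lambda item_name, response: response.strip(),
)
print(form)
```

Passing the speech hooks in as callables keeps the ordering logic testable without audio hardware.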
Optionally, the information input method further includes:
detecting whether a pause voice input function needs to be enabled; if so, enabling the pause voice input function.
Optionally, detecting whether the pause voice input function needs to be enabled includes:
detecting whether a response voice input by the user is detected within a preset time after the prompt voice is output; if not, determining that the pause voice input function needs to be enabled.
Optionally, detecting whether the pause voice input function needs to be enabled includes:
identifying whether the response voice input by the user includes key information that matches the prompt voice; if not, determining that the pause voice input function needs to be enabled.
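The two optional pause conditions above can be illustrated together. The sketch below is an assumption-laden example: the text presents the two checks as separate options, and combining them in one function, the use of a `queue.Queue` to deliver recognized responses, and the names `wait_for_response` and `should_pause` are all hypothetical.

```python
import queue
from typing import Optional


def wait_for_response(responses: "queue.Queue[str]", preset_seconds: float) -> Optional[str]:
    """Return the recognized response, or None if none arrives within the preset time."""
    try:
        return responses.get(timeout=preset_seconds)
    except queue.Empty:
        return None


def should_pause(response: Optional[str], key_info: Optional[str]) -> bool:
    """Decide whether the pause voice input function needs to be enabled."""
    if response is None:
        # Condition 1: no response voice within the preset time.
        return True
    # Condition 2: a response was heard but contains no matching key information.
    return key_info is None


# Usage: an empty queue simulates a user who does not answer in time.
pending: "queue.Queue[str]" = queue.Queue()
silent = wait_for_response(pending, preset_seconds=0.01)
print(should_pause(silent, None))
```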
Optionally, the information input method further includes:
if an instruction to switch the input mode is received, switching the current information input mode from the voice input mode to the text entry mode.
Optionally, after the current information input mode is switched from the voice input mode to the text entry mode, the information input method further includes:
generating a second form from the form item names whose prompt voices have been output, the correspondingly extracted key information, and the remaining form item names whose prompt voices are yet to be output; and displaying the second form in a text entry interface.
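One way to picture the second form is as the full list of form item names in the preset order, with the values already extracted by voice filled in and the remaining items left blank for text entry. The sketch below assumes this representation; `build_second_form` and the use of an empty string for unfilled items are hypothetical choices.

```python
from typing import Dict, List


def build_second_form(preset_order: List[str], extracted: Dict[str, str]) -> Dict[str, str]:
    """Merge the voice-entered values with the remaining, still-empty form items."""
    # Items not yet prompted (or prompted without a usable answer) stay blank.
    return {name: extracted.get(name, "") for name in preset_order}


# Usage: the user switched to text entry after answering only the first prompt.
second_form = build_second_form(["Name", "Age", "Phone"], {"Name": "Li Lei"})
print(second_form)
```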
Optionally, after the first form is generated from the form item names corresponding to the output prompt voices and the extracted key information, the information input method further includes:
when a modification instruction for a target form item in the first form is received, obtaining text information input by the user;
replacing the key information corresponding to the target form item with the text information.
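The modification step amounts to overwriting one entry of the first form with typed text. A minimal sketch, assuming the form is held as a mapping from form item name to key information; `apply_modification` is a hypothetical helper name.

```python
from typing import Dict


def apply_modification(form: Dict[str, str], target_item: str, text: str) -> Dict[str, str]:
    """Return a copy of the form with the target item's key information replaced."""
    if target_item not in form:
        raise KeyError(f"unknown form item: {target_item}")
    updated = dict(form)  # leave the original form untouched
    updated[target_item] = text
    return updated


# Usage: correct a misrecognized age by typing the right value.
first_form = {"Name": "Li Lei", "Age": "35"}
corrected = apply_modification(first_form, "Age", "25")
print(corrected)
```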
A second aspect of the embodiments of the present invention provides an information input apparatus, including:
an output unit, configured to output, in a voice input mode, prompt voices for form item names in a preset order on a voice input interface;
an extracting unit, configured to detect, based on the prompt voice, a response voice input by the user, and to extract from the response voice key information that matches the prompt voice;
a generating unit, configured to generate, when it is detected that the prompt voice just output is the prompt voice for the last form item name in the preset order, a first form from the form item names whose prompt voices have been output and the correspondingly extracted key information;
a display unit, configured to display the first form.
Optionally, the information input apparatus further includes:
a detecting unit, configured to detect whether the pause voice input function needs to be enabled and, if so, to enable the pause voice input function.
Optionally, the detecting unit is specifically configured to: detect whether a response voice input by the user is detected within a preset time after the prompt voice is output; if not, determine that the pause voice input function needs to be enabled.
Optionally, the detecting unit is specifically configured to: identify whether the response voice input by the user includes key information that matches the prompt voice; if not, determine that the pause voice input function needs to be enabled.
Optionally, the information input apparatus further includes: a switching unit, configured to switch the current information input mode from the voice input mode to the text entry mode if an instruction to switch the input mode is received.
Optionally, the information input apparatus further includes:
the generating unit, further configured to generate a second form from the form item names whose prompt voices have been output, the correspondingly extracted key information, and the remaining form item names whose prompt voices are yet to be output;
the display unit, further configured to display the second form in the text entry interface.
In a third aspect, an embodiment of the present invention provides a terminal including a processor, an input device, an output device, and a memory that are connected to one another, where the memory is configured to store a computer program that supports the terminal in executing the foregoing method, the computer program includes program instructions, and the processor is configured to invoke the program instructions to perform the method of the first aspect.
In a fourth aspect, an embodiment of the present invention provides a computer readable storage medium storing a computer program, where the computer program includes program instructions that, when executed by a processor, cause the processor to perform the method of the first aspect.
In the embodiments of the present invention, in the voice input mode, prompt voices for form item names are output in a preset order on the voice input interface, and key information matching the prompt voice is extracted from the detected response voice input by the user. When it is detected that the prompt voice just output is the prompt voice for the last form item name in the preset order, a first form is generated from the form item names whose prompt voices have been output and the correspondingly extracted key information, and the first form is displayed on the voice input interface. Information can thus be entered by voice, improving information input efficiency.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the accompanying drawings used in the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a schematic flowchart of an information input method according to an embodiment of the present invention;
FIG. 2 is a schematic flowchart of another information input method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of an information input interface according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of another information input interface according to an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of an information input apparatus according to an embodiment of the present invention;
FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
FIG. 1 is a schematic flowchart of an information input method according to an embodiment of the present invention. The information input method shown in FIG. 1 may include the following steps:
101. In the voice input mode, the terminal outputs prompt voices for form item names in a preset order on the voice input interface.
The terminal may be a portable mobile terminal such as a mobile phone or tablet computer, or a non-portable terminal such as a notebook computer. Optionally, the terminal may have at least one microphone and at least one speaker, built in or externally connected. The at least one speaker is used to output the prompt voices for form item names on the voice input interface; the at least one microphone is used to capture the response voice input by the user. Optionally, the terminal supports two information input modes, a voice input mode and a text entry mode. Starting the voice input mode may mean that the terminal starts the voice input mode when information entry begins, or that the terminal switches from the text entry mode to the voice input mode during information entry.
Specifically, when the terminal starts the voice input mode, it displays the voice input interface, which reminds the user of the selected information input mode and indicates that voice input is about to begin. The voice input interface may also display the text corresponding to the prompt voice of a form item name output by the terminal in the voice input mode, and the text corresponding to the response voice input by the user. For example, if the prompt voice output by the terminal in the voice input mode is "How old are you this year?", the voice input interface may display the corresponding text, namely "How old are you this year?". If the user answers "My age is 25", the voice input interface may display 25. These are only examples of what the voice input interface may display; the specific content displayed is not limited.
Optionally, if starting the voice input mode means starting it when information input begins, that is, when the terminal enters the voice input mode for the first time, the terminal outputs an operation prompt for the voice input mode so that the user can quickly learn how to input information in this mode. The operation prompt may be displayed as text on the voice input interface or played as voice. Optionally, when the operation prompt is output on the voice input interface, a "Do not prompt next time" option may be displayed on the page at the same time. Once the user is familiar with the voice input mode, the user may select the "Do not prompt next time" option when starting the voice input mode; the next time the user starts voice input, the terminal enters the voice input mode directly, which saves the user time.
Optionally, upon detecting that the voice input mode has been started, the terminal outputs prompt voices for form item names in a preset order on the voice input interface. Inputting information through the terminal may mean that the terminal selects, from the various forms stored locally or on a server, a form that meets the current user's needs, or that the terminal generates a form locally according to the user's needs. A form item name is the name of a piece of information that the user needs to fill in on the form, such as name, age, or contact information. The prompt voice of a form item name is a voice that prompts the user for the information to be input, such as a name, age, home address, or other information. For example, assume that the terminal has stored in advance, locally or on a server, a loan application form, a repayment application form, a deposit application form, and so on. If the terminal determines that the form type selected by the user is the loan application form, it selects the loan application form from local storage or the server; if the form type selected by the user is the repayment application form, it selects the repayment application form from local storage or the server.
Optionally, the prompt voice for each form item name in each form may be stored in advance by the terminal, locally or on a server, and the terminal may preset the output order of the prompt voices. Alternatively, the prompt voices may be obtained by the terminal through real-time conversion. When the terminal detects an instruction to start the voice input mode, that is, when it detects that the user has selected the voice input mode, the terminal either retrieves the prompt voices of the form item names stored on the server or locally and outputs them through its speaker or an external device, or converts each form item name into the corresponding prompt voice in real time and then outputs it through its speaker or an external device. Optionally, the preset order of the prompt voices stored locally or on the server may be determined by the importance of the form items for each form type, or by the relevance between form item names. For example, if the form type selected by the user is a loan application, the user's real identity information and financial capacity are relatively important, so the terminal may preset the order of the prompt voices in the loan application form as: name, ID number, occupation, whether there is a bad credit record, age, and so on. If the form type selected by the user is a deposit application, the user's real information and contact details are relatively important, so the terminal may preset the order of the prompt voices in the deposit application form as: name, telephone number, home address, and so on. These are only some possible implementations of this embodiment; the specific implementation is not limited.
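The stored form types, their form item names, and the preset prompt order described above can be sketched as a simple lookup table. This is a minimal illustration, not the patented implementation: the names `FormItem`, `FORM_TEMPLATES`, and `prompts_in_preset_order`, the dictionary keys, and the exact prompt wording are all assumptions introduced here; only the item orders follow the examples in the text.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class FormItem:
    name: str    # form item name, e.g. "name"
    prompt: str  # text of the prompt voice for this item (hypothetical wording)


# Hypothetical templates; list order is the preset output order of the prompts.
FORM_TEMPLATES: Dict[str, List[FormItem]] = {
    "loan_application": [
        FormItem("name", "What is your name?"),
        FormItem("id_number", "What is your ID number?"),
        FormItem("occupation", "What is your occupation?"),
        FormItem("bad_credit_record", "Have you ever had a bad credit record?"),
        FormItem("age", "How old are you this year?"),
    ],
    "deposit_application": [
        FormItem("name", "What is your name?"),
        FormItem("telephone", "What is your telephone number?"),
        FormItem("home_address", "What is your home address?"),
    ],
}


def prompts_in_preset_order(form_type: str) -> List[str]:
    """Return the prompt texts for the selected form type in preset order."""
    return [item.prompt for item in FORM_TEMPLATES[form_type]]
```

Keeping the order inside the template means that selecting a form type also selects its prompt order, which matches the idea that different form types rank their items differently.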
102. The terminal detects, based on the prompt voice, the response voice input by the user, and extracts from the response voice the key information that matches the prompt voice.
In other words, after the terminal starts the voice input mode, it may output the prompt voice of a form item name through a built-in or external speaker, and the user may respond to the prompt voice by inputting a response voice through the terminal's built-in or external microphone. The terminal may detect whether a response voice is received within a preset time; if so, it may extract from the response voice the key information that matches the prompt voice. For example, the terminal outputs the prompt voice "What is your name?" through the built-in speaker, and the user inputs the response voice "My name is Zhang XX" through the terminal's microphone; the terminal extracts from "My name is Zhang XX" the information that matches the prompt voice, namely "Zhang XX". Optionally, the terminal may display the text corresponding to the prompt voice on the voice input interface, and may also display the key information extracted from the response voice as text on the voice input interface. This helps the user notice incorrectly entered information during input and remember which items are wrong, which shortens the time needed to correct them later.
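One way to picture the extraction step is a rule-based matcher applied to the transcribed response. The sketch below assumes that speech recognition has already produced an English transcript, and the `EXTRACTORS` table and `extract_key_info` function are hypothetical names; the text does not specify how matching is actually performed.

```python
import re
from typing import Optional

# Hypothetical per-item patterns applied to the transcribed response voice.
EXTRACTORS = {
    "name": re.compile(r"(?:my name is|i am called|i am)\s+(.+)", re.IGNORECASE),
    "age": re.compile(r"(\d{1,3})"),
}


def extract_key_info(item_name: str, transcript: str) -> Optional[str]:
    """Return the key information matching the prompted item, or None."""
    pattern = EXTRACTORS.get(item_name)
    if pattern is None:
        return None  # no rule is known for this form item
    match = pattern.search(transcript.strip())
    if match is None:
        return None  # the response contains no matching key information
    return match.group(1).strip(" .")
```

With these rules, the response "My name is Zhang XX" to the name prompt yields "Zhang XX", and "My age is 25" to the age prompt yields "25", mirroring the examples in the text.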
In this embodiment of the present invention, the user may respond by simply speaking the response voice within a preset time after the terminal outputs the prompt voice, much like a conversation between two people. No other operation is required, which is convenient for the user. In existing voice input solutions, the user must press and hold a preset position on the virtual keyboard or tap a preset area of the terminal while speaking, which is inconvenient.
103. When detecting that the output prompt voice is the prompt voice of the last form item name in the preset order, the terminal generates a first form from the form item names whose prompt voices have been output and the correspondingly extracted key information, and displays the first form.
That is, if the currently output prompt voice is the prompt voice of the last form item name in the preset order, meaning that information input has ended, the terminal generates the first form from the form item names whose prompt voices have been output and the correspondingly extracted key information. If the currently output prompt voice is not that of the last form item name in the preset order, the terminal may continue to output the prompt voice of the next form item name in the preset order.
For example, assume that the form item names whose prompt voices have been output on the voice input interface, and the correspondingly extracted key information, are: name - Zhang XX; age - 26; contact number - 1564567890. Assume that the terminal currently outputs the prompt voice "Have you taken out any other loans before?" on the voice input interface and detects the user's response voice "I have not taken out a loan before"; the extracted key information is then "no". The terminal further detects whether this prompt voice is the prompt voice of the last form item name in the preset order. If it is, the terminal generates the first form from: name - Zhang XX; age - 26; contact number - 1564567890; prior loan record - no. The first form is a form in which all form items to be filled in have been filled in. Here, "all form items to be filled in" may mean all form items in the form, or only the preset required form items. For example, a loan application form may contain required items such as name, age, occupation, and address, and optional items such as education level and personality. In that case, when storing form item names and their prompt voices in advance, the terminal may store only the information for the required items and not for the optional items. In other words, in this case, saying that all items to be filled in on the first form have been entered means that all required items of the form have been filled in.
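Steps 101 to 103 amount to a loop over the preset order that ends by building the first form once the last prompt has been handled. The following is a minimal sketch under stated assumptions: `run_voice_input` is a hypothetical name, and the `ask` callable stands in for the speaker, microphone, recognition, and key-information extraction, none of which are modeled here.

```python
from typing import Callable, Dict, List, Optional, Tuple


def run_voice_input(
    items: List[Tuple[str, str]],
    ask: Callable[[str], Optional[str]],
) -> Dict[str, Optional[str]]:
    """Output each prompt in preset order and return the first form.

    items: (form item name, prompt text) pairs in the preset order.
    ask:   outputs one prompt and returns the extracted key information,
           or None if nothing could be extracted.
    """
    collected: Dict[str, Optional[str]] = {}
    for index, (item_name, prompt) in enumerate(items):
        collected[item_name] = ask(prompt)
        is_last = index == len(items) - 1
        if is_last:
            # The last prompt in the preset order has been output:
            # generate the first form from names and key information.
            return dict(collected)
    return collected  # reached only when items is empty
```

If only required form items are stored in `items`, the returned mapping covers exactly the required items, which corresponds to the case discussed above where optional items are not prompted.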
Optionally, after generating the first form from the form item names whose prompt voices have been output and the correspondingly extracted key information, the terminal may switch from the voice input interface to the text input interface and display the first form there, so that the user can modify the key information extracted in the voice input mode.
In this embodiment, in the voice input mode, prompt voices of form item names are output in a preset order on the voice input interface, and the key information matching each prompt voice is extracted from the detected response voice input by the user. When the output prompt voice is detected to be that of the last form item name in the preset order, the first form is generated from the form item names whose prompt voices have been output and the correspondingly extracted key information, and is displayed on the voice input interface. Information can thus be input by voice, which improves input efficiency.
FIG. 2 is a schematic flowchart of another information input method according to an embodiment of the present invention. The information input method shown in FIG. 2 may include:
201. When the voice input mode is started, the terminal displays the voice input interface.
Optionally, upon receiving an instruction to start the voice input mode, the terminal starts the voice input mode and displays the voice input interface. Optionally, when information is to be input through the terminal, the user may first be presented with an information input mode selection interface (see FIG. 3). When the terminal receives a selection instruction input by the user in a preset area of this selection interface, it starts the corresponding information input mode and then enters the input interface for that mode. For example, before information input, the terminal may display the information input mode selection interface (see FIG. 3). If the user selects the voice input mode, the terminal receives the selection instruction, starts the voice input mode, and switches from the selection interface to the voice input interface. The voice input interface shown in FIG. 3 may include a voice bar for the output prompt voice, a pause-voice-input button, and an input mode switching button. The voice bar may flash while the terminal outputs a prompt voice, which both informs the user that a prompt voice is being output and provides a pleasing visual effect. The pause-voice-input button may be used to pause or resume voice input. The input mode switching button may be used to switch from the current voice input mode to the text input mode.
The ways of starting the voice input mode and displaying the voice input interface described above are only some possibilities listed in this embodiment; the specific implementation is not limited.
202. The terminal outputs prompt voices for form item names in a preset order on the voice input interface.
Optionally, the terminal may store in advance, locally or on a server, the form item names of different forms and the prompt voices corresponding to them. When the voice input mode is started and the voice input interface is displayed, the terminal retrieves the stored form item names and corresponding prompt voices from local storage or the server and outputs the prompt voices. For example, for a loan application form, the correspondence between form item names and prompt voices may be: name corresponds to "What is your name?"; age corresponds to "How old are you this year?"; contact number corresponds to "What is your mobile phone number?". The terminal may set a preset order for these form item names and their prompt voices, for example name, contact number, age, or alternatively name, age, contact number, and so on.
203. The terminal detects, based on the prompt voice, the response voice input by the user, and extracts from the response voice the key information that matches the prompt voice.
Optionally, the terminal detects, based on the prompt voice, the response voice input by the user. If a response voice is detected, the terminal extracts from it the key information matching the prompt voice. If no response voice is detected, for example because the user is talking to someone else and did not hear the prompt voice, or is away from the terminal dealing with something else, the step of extracting key information may be skipped. When no response voice is detected after a prompt voice is output, the terminal outputs the prompt voice a second time after a preset time has elapsed. If still no response voice is detected, the terminal outputs the prompt voice a third time after the same preset time. If there is still no response voice, the terminal may continue repeating these steps, skip this prompt voice and output the next one, or handle the situation in another way.
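The repeat-then-skip behavior can be sketched as a bounded retry loop. The function name `prompt_with_retries`, the `play` and `listen` callables, and the default timeout of 5 seconds are assumptions for illustration; the text only speaks of "a preset time" and of outputting the prompt up to a third time before choosing what to do next.

```python
from typing import Callable, Optional


def prompt_with_retries(
    prompt: str,
    play: Callable[[str], None],
    listen: Callable[[float], Optional[str]],
    timeout_s: float = 5.0,   # hypothetical value for the preset time
    max_attempts: int = 3,    # first output plus two repeats
) -> Optional[str]:
    """Output the prompt, wait for a response, and repeat on silence.

    listen(timeout_s) is assumed to block for up to timeout_s seconds and
    return the response, or None if nothing was heard in that time.
    """
    for _ in range(max_attempts):
        play(prompt)
        response = listen(timeout_s)
        if response is not None:
            return response
    # Still no response: the caller may skip to the next prompt,
    # keep retrying, or pause voice input.
    return None
```

Returning `None` instead of raising leaves the follow-up policy to the caller, since the text keeps that choice open.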
Most existing voice input methods require the user to input continuously for a period of time. If the user has to attend to something urgent midway, or is unsure of some information and needs to confirm it by other means, the terminal may pick up other people's voices. For example, if a family member, friend, or colleague speaks to the user during voice input, the terminal captures that person's voice, which lowers input accuracy. In this embodiment, the terminal may support a pause-voice-input function, which makes voice input more convenient and flexible.
Optionally, after outputting the prompt voice, the terminal may detect whether the pause-voice-input function needs to be enabled. If so, the terminal enables the function and may skip the step of detecting the response voice and extracting the key information that matches the prompt voice, which saves power. If not, the terminal performs that step. That is, on receiving an instruction to enable the pause-voice-input function, the terminal may pause voice input and need not detect response voices. For example, the terminal may provide a pause-voice-input button on the voice input interface; if the user taps the button, that is, the terminal receives a pause instruction, the terminal enables the pause-voice-input function. Alternatively, the terminal enables the function when it captures a voice from the user indicating that voice input should be paused.
Optionally, the step of detecting whether the pause-voice-input function needs to be enabled may be performed before or after either of steps 202 and 203.
Assume that the step of detecting whether the pause-voice-input function needs to be enabled is performed before step 202, that is, before outputting the prompt voice the terminal checks whether the pause-voice-input function is enabled. If it is enabled, the terminal neither outputs the prompt voice nor detects a response voice. If it is not enabled, the terminal outputs the prompt voice and may then detect the response voice input by the user. This ensures that the terminal outputs a prompt voice only while voice input is actually in progress, avoiding unnecessary power consumption. By contrast, if the terminal finds only after outputting the prompt voice that the pause-voice-input function has been enabled, it stops voice input; when the pause is lifted and voice input resumes, the terminal must output again the prompt voice it had already output before the pause. The same prompt voice is thus output twice, which wastes power.
Optionally, detecting whether the pause-voice-input function needs to be enabled includes: detecting whether a response voice input by the user is detected within a preset time after the prompt voice is output; and if not, determining that the pause-voice-input function needs to be enabled. That is, if the terminal detects no response voice within the preset time after outputting the prompt voice, it may conclude that the user has something urgent to deal with or that some other situation has arisen, and determine that the pause-voice-input function needs to be enabled. If the terminal detects a response voice within the preset time, the pause-voice-input function does not need to be enabled, and the terminal may perform the step of extracting from the response voice the key information that matches the prompt voice.
Optionally, detecting whether the pause-voice-input function needs to be enabled includes: identifying whether the response voice input by the user contains key information that matches the prompt voice; and if not, determining that the pause-voice-input function needs to be enabled. That is, if the terminal detects a response voice within the preset time after outputting the prompt voice but finds that it contains no key information matching the prompt voice, the terminal may conclude that it is not convenient for the user to continue and determine that the pause-voice-input function needs to be enabled. If the terminal detects a response voice within the preset time and identifies in it key information that matches the prompt voice, it may perform the step of extracting that key information from the response voice.
Optionally, the terminal may also detect whether the pause-voice-input function needs to be enabled by checking whether the screen is off. In other words, if the terminal detects that the screen is off, it may determine that the pause-voice-input function needs to be enabled; if the terminal detects that the screen is on, it may determine that the function does not need to be enabled.
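The three optional pause checks (no response within the preset time, a response without matching key information, and the screen being off) can be combined into a single predicate. Combining them is an assumption made for this sketch; the text presents each check as an independent option, and `should_pause` and its parameters are hypothetical names.

```python
from typing import Optional


def should_pause(
    response: Optional[str],
    key_info: Optional[str],
    screen_on: bool,
) -> bool:
    """Decide whether the pause-voice-input function needs to be enabled.

    response:  the response voice heard within the preset time, or None.
    key_info:  key information extracted from the response, or None.
    screen_on: whether the terminal's screen is currently on.
    """
    if not screen_on:
        return True   # screen is off
    if response is None:
        return True   # nothing heard within the preset time
    if key_info is None:
        return True   # response does not match the prompt voice
    return False
```

A terminal that implements only one of the checks would simply pass neutral values for the others, for example `screen_on=True` when the screen state is not consulted.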
204. When detecting that the output prompt voice is the prompt voice of the last form item name in the preset order, the terminal generates a first form from the form item names whose prompt voices have been output and the correspondingly extracted key information, and displays the first form.
Possible implementations of step 204 have been described in detail in the first embodiment and are not repeated here.
Optionally, after extracting the key information that matches the output prompt voice and before detecting whether the output prompt voice is the prompt voice of the last form item name in the preset order, the terminal may detect whether the pause-voice-input function needs to be enabled. If so, the terminal need not detect whether the output prompt voice is that of the last form item name, which saves power; if not, the terminal may proceed to step 204.
205. On receiving a modification instruction for a target form item in the first form, the terminal obtains text information input by the user.
206. The terminal replaces the key information corresponding to the target form item with the text information.
Optionally, after the first form is generated from the form item names corresponding to the output prompt voices and the extracted key information, the method further includes: on receiving a modification instruction for a target form item in the first form, obtaining text information input by the user; and replacing the key information corresponding to the target form item with the text information.
That is, after the terminal detects that the output prompt voice is that of the last form item name in the preset order and generates and displays the first form, if it receives a modification instruction for a target form item in the first form, for example a touch instruction from the user on the target form item, it may pop up a virtual keyboard, receive the text information input by the user through the virtual keyboard, and replace the key information corresponding to the target form item with that text information. Alternatively, the terminal may modify the target form item in the first form in another feasible way. In the first form, form item names correspond one-to-one to the extracted key information. For example, the form item name "name" corresponds to the key information "Zhang XX", and the form item name "gender" corresponds to the key information "male" or "female".
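Because form item names map one-to-one to key information, steps 205 and 206 reduce to replacing one value in a mapping. The sketch below assumes the first form is held as a dictionary; `replace_form_item` is a hypothetical name, and the touch handling and virtual keyboard are outside its scope.

```python
from typing import Dict


def replace_form_item(form: Dict[str, str], target: str, text: str) -> Dict[str, str]:
    """Return a copy of the form with the target item's key information replaced."""
    if target not in form:
        raise KeyError(f"unknown form item: {target}")
    updated = dict(form)    # leave the original first form untouched
    updated[target] = text  # text typed by the user replaces the voice result
    return updated
```

For instance, if voice input produced an age of "26" and the user types "27" for that item, the returned form carries "27" while every other item is unchanged.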
Optionally, if an instruction to switch the entry mode is received, the current information entry mode is switched from the voice input mode to the text entry mode. After the switch, the method further includes: generating a second form according to the form item names whose prompt voices have been output, the corresponding extracted key information, and the remaining form item names whose prompt voices are still to be output; and displaying the second form in the text entry interface. That is, if the terminal receives an instruction to switch the entry mode while entering information in the voice input mode, it switches from the voice input mode to the text entry mode and, at the same time, switches from the voice input interface to the text entry interface corresponding to the text entry mode. After the switch, the key information already extracted in the voice input mode is displayed as text after the corresponding form item names in the second form in the text entry interface (see FIG. 4). In FIG. 4, the name, age, and occupation are key information already extracted in the voice input mode.
In addition, the second form displayed in the text entry interface also shows the remaining form item names for which no key information has been extracted, such as monthly income, telephone, home address, and residence address. The second form differs from the first form: in the second form, some of the form items to be filled in have been completed and others have not (see FIG. 4), and the remaining items must be filled in as text (see FIG. 4). Optionally, the instruction to switch the entry mode may be input by the user through a mode-switch button in the voice input interface, or through the microphone of the terminal.
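A minimal sketch of how such a second form could be assembled is shown below. It assumes the preset order is a list of form item names and the extracted key information is a mapping; the function name `build_second_form` and the sample values "30" and "teacher" are hypothetical and only illustrate the layout of FIG. 4.

```python
def build_second_form(preset_order, extracted):
    """Build the partially filled form shown after switching to text entry.

    Items already answered by voice keep their extracted key information;
    the remaining items appear with an empty value to be typed in.
    """
    return [(name, extracted.get(name, "")) for name in preset_order]


PRESET_ORDER = [
    "Name", "Age", "Occupation", "Monthly income",
    "Telephone", "Home address", "Residence address",
]

# Hypothetical key information extracted before the mode switch.
extracted = {"Name": "Zhang Moumou", "Age": "30", "Occupation": "teacher"}

second_form = build_second_form(PRESET_ORDER, extracted)

# Items the user still has to fill in through the text entry interface.
pending = [name for name, value in second_form if not value]
```

Keeping the form as an ordered list of pairs preserves the preset order when the text entry interface renders it.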
In this embodiment of the present invention, when the terminal starts the voice input mode, it outputs prompt voices for the form item names in a preset order in the voice input interface, detects the response voice input by the user based on each prompt voice, and extracts from the response voice the key information matching the prompt voice. When the terminal detects that the prompt voice output is the one for the last form item name in the preset order, it generates a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information, and displays the first form. In this embodiment, if the terminal receives a modification instruction for a target form item in the first form, it replaces the key information corresponding to the target form item with text information input by the user. Information can thus be entered by voice, and the voice-entered information can be modified, which improves the efficiency and accuracy of information entry.
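The overall flow summarized above can be sketched as a simple loop. This is an illustration under stated assumptions, not the patented implementation: speech output, speech recognition, and key-information extraction are passed in as the hypothetical callables `speak`, `listen`, and `extract`, and `toy_extract` is a deliberately naive stand-in for real extraction.

```python
from typing import Callable, Dict, List, Optional


def run_voice_entry(
    preset_order: List[str],
    speak: Callable[[str], None],
    listen: Callable[[], str],
    extract: Callable[[str, str], Optional[str]],
) -> Dict[str, str]:
    """Prompt for each form item in the preset order and collect key information."""
    first_form: Dict[str, str] = {}
    for item_name in preset_order:
        speak(item_name)                 # output the prompt voice for this item
        response = listen()              # recognized text of the response voice
        key_info = extract(item_name, response)
        if key_info is not None:
            first_form[item_name] = key_info
    # The loop ends after the prompt for the last item in the preset order,
    # which is the point at which the first form is generated.
    return first_form


def toy_extract(item_name: str, response: str) -> Optional[str]:
    """Naive stand-in: keep whatever follows the last ' is ' in the response."""
    value = response.split(" is ")[-1].strip()
    return value or None


# Stubbed speech I/O so the sketch runs without audio hardware.
answers = iter(["My name is Zhang Moumou", "male"])
spoken: List[str] = []
form = run_voice_entry(["Name", "Gender"], spoken.append, lambda: next(answers), toy_extract)
```

Injecting the speech functions keeps the control flow testable; a real terminal would substitute its own text-to-speech and speech-recognition components.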
FIG. 5 is a schematic structural diagram of an information input apparatus according to an embodiment of the present invention. The information input apparatus shown in FIG. 5 may include an output unit 501, an extracting unit 502, a generating unit 503, and a display unit 504:
The output unit 501 is configured to output, in the voice input mode, prompt voices for the form item names in a preset order in the voice input interface;
The extracting unit 502 is configured to detect, based on the prompt voice, a response voice input by the user, and to extract from the response voice the key information matching the prompt voice;
The generating unit 503 is configured to generate, when it is detected that the prompt voice output is the one for the last form item name in the preset order, a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information;
The display unit 504 is configured to display the first form.
Optionally, the information input apparatus further includes a detecting unit 505, configured to detect whether the pause-voice-input function needs to be enabled and, if so, to enable the pause-voice-input function.
Optionally, the detecting unit 505 is specifically configured to: detect whether a response voice input by the user is detected within a preset time after the prompt voice is output; and if not, determine that the pause-voice-input function needs to be enabled.
Optionally, the detecting unit 505 is specifically configured to: identify whether the response voice input by the user includes key information matching the prompt voice; and if not, determine that the pause-voice-input function needs to be enabled.
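The two optional checks performed by the detecting unit 505 can be sketched as a single predicate. Combining them with a logical OR is an assumption of this sketch, since the source presents them as alternative implementations; the function name `should_pause` and the convention that `None` means "nothing detected" are likewise hypothetical.

```python
from typing import Optional


def should_pause(response: Optional[str], key_info: Optional[str]) -> bool:
    """Decide whether the pause-voice-input function needs to be enabled.

    `response` is None when no response voice was detected within the
    preset time after the prompt voice; `key_info` is None when the
    response contains no key information matching the prompt voice.
    """
    if response is None:   # check 1: no response within the preset time
        return True
    if key_info is None:   # check 2: response lacks matching key information
        return True
    return False


timed_out = should_pause(None, None)
no_match = should_pause("um, let me think", None)
answered = should_pause("My name is Zhang Moumou", "Zhang Moumou")
```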
Optionally, the information input apparatus further includes a switching unit 506, configured to switch the current information entry mode from the voice input mode to the text entry mode if an instruction to switch the entry mode is received.
Optionally, in the information input apparatus:
The generating unit 503 is further configured to generate a second form according to the form item names whose prompt voices have been output, the corresponding extracted key information, and the remaining form item names whose prompt voices are still to be output;
The display unit 504 is further configured to display the second form in the text entry interface.
In this embodiment, in the voice input mode, the output unit 501 outputs prompt voices for the form item names in a preset order in the voice input interface, and the extracting unit 502 extracts, from the detected response voice input by the user, the key information matching the prompt voice. When the generating unit 503 detects that the prompt voice output is the one for the last form item name in the preset order, it generates a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information, and the display unit 504 displays the first form. Information can thus be entered by voice, which improves the efficiency of information entry.
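One way to picture the division of work among units 501 to 504 is a small class with one method per unit. This is only a structural sketch under assumptions: the class name `InformationInputDevice`, its fields, and the trivial extraction (trimming the recognized text) are hypothetical, and real units would wrap speech output, speech recognition, and a display.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class InformationInputDevice:
    preset_order: List[str]
    prompted: List[str] = field(default_factory=list)
    extracted: Dict[str, str] = field(default_factory=dict)
    displayed: Optional[Dict[str, str]] = None

    def output_unit(self) -> Optional[str]:
        """Unit 501: return the next form item name to prompt, in preset order."""
        if len(self.prompted) == len(self.preset_order):
            return None
        name = self.preset_order[len(self.prompted)]
        self.prompted.append(name)
        return name

    def extracting_unit(self, item_name: str, response: str) -> None:
        """Unit 502: store key information (here simply the trimmed response)."""
        if response.strip():
            self.extracted[item_name] = response.strip()

    def generating_unit(self) -> Optional[Dict[str, str]]:
        """Unit 503: build the first form once the last prompt has been output."""
        if not self.prompted or self.prompted[-1] != self.preset_order[-1]:
            return None
        return {name: self.extracted.get(name, "") for name in self.prompted}

    def display_unit(self, form: Dict[str, str]) -> None:
        """Unit 504: stand-in for rendering; records the form that was shown."""
        self.displayed = form


device = InformationInputDevice(["Name", "Gender"])
for answer in ["Zhang Moumou", "male"]:
    prompt = device.output_unit()
    device.extracting_unit(prompt, answer)
first_form = device.generating_unit()
device.display_unit(first_form)
```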
It can be understood that the functions of the functional units of the information input apparatus of this embodiment may be implemented according to the methods in the foregoing method embodiments. For the specific implementation process, refer to the related descriptions of the foregoing method embodiments; details are not repeated here.
FIG. 6 is a schematic block diagram of a terminal according to an embodiment of the present invention. As shown in the figure, the terminal in this embodiment may include one or more processors 601, one or more input devices 602, one or more output devices 603, and a memory 604. The processor 601, the input device 602, the output device 603, and the memory 604 are connected through a bus 605. The memory 604 is configured to store a computer program that includes program instructions, and the processor 601 is configured to execute the program instructions stored in the memory 604. The processor 601 is configured to invoke the program instructions to perform:
In the voice input mode, outputting prompt voices for the form item names in a preset order in the voice input interface;
Detecting, based on the prompt voice, a response voice input by the user, and extracting from the response voice the key information matching the prompt voice;
When it is detected that the prompt voice output is the one for the last form item name in the preset order, generating a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information, and displaying the first form.
Optionally, the processor 601 is configured to invoke the program instructions to further perform:
Detecting whether the pause-voice-input function needs to be enabled; and if so, enabling the pause-voice-input function.
Optionally, to detect whether the pause-voice-input function needs to be enabled, the processor 601 is configured to invoke the program instructions to specifically perform:
Detecting whether a response voice input by the user is detected within a preset time after the prompt voice is output; and if not, determining that the pause-voice-input function needs to be enabled.
Optionally, to detect whether the pause-voice-input function needs to be enabled, the processor 601 is configured to invoke the program instructions to specifically perform:
Identifying whether the response voice input by the user includes key information matching the prompt voice; and if not, determining that the pause-voice-input function needs to be enabled.
Optionally, the processor 601 is configured to invoke the program instructions to further perform:
If an instruction to switch the entry mode is received, switching the current information entry mode from the voice input mode to the text entry mode.
Optionally, after the current information entry mode is switched from the voice input mode to the text entry mode, the processor 601 is configured to invoke the program instructions to further perform:
Generating a second form according to the form item names whose prompt voices have been output, the corresponding extracted key information, and the remaining form item names whose prompt voices are still to be output; and displaying the second form in the text entry interface.
Optionally, after the first form is generated according to the form item names corresponding to the prompt voices already output and the extracted key information, the processor 601 is configured to invoke the program instructions to further perform:
When a modification instruction for a target form item in the first form is received, obtaining text information input by the user;
Replacing the key information corresponding to the target form item with the text information.
It should be understood that, in the embodiments of the present invention, the processor 601 may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, or the like. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
The input device 602 may include a touch panel, a fingerprint sensor (for collecting the user's fingerprint information and fingerprint direction information), a microphone, and the like. The output device 603 may include a display (such as an LCD), a speaker, and the like.
The memory 604 may include a read-only memory and a random access memory, and provides instructions and data to the processor 601. A portion of the memory 604 may also include a non-volatile random access memory. For example, the memory 604 may also store device type information.
In a specific implementation, the processor 601, the input device 602, and the output device 603 described in this embodiment of the present invention may perform the implementations described in the information input method embodiments provided in FIG. 1 and FIG. 2, and may also perform the implementation of the information input apparatus described in FIG. 5; details are not repeated here.
An embodiment of the present invention provides a computer-readable storage medium. The computer-readable storage medium stores a computer program that includes program instructions, and the program instructions, when executed by a processor, implement:
In the voice input mode, outputting prompt voices for the form item names in a preset order in the voice input interface;
Detecting, based on the prompt voice, a response voice input by the user, and extracting from the response voice the key information matching the prompt voice;
When it is detected that the prompt voice output is the one for the last form item name in the preset order, generating a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information, and displaying the first form.
Optionally, the program instructions, when executed by the processor, further implement:
Detecting whether the pause-voice-input function needs to be enabled; and if so, enabling the pause-voice-input function.
Optionally, to detect whether the pause-voice-input function needs to be enabled, the program instructions, when executed by the processor, specifically implement:
Detecting whether a response voice input by the user is detected within a preset time after the prompt voice is output; and if not, determining that the pause-voice-input function needs to be enabled.
Optionally, to detect whether the pause-voice-input function needs to be enabled, the program instructions, when executed by the processor, specifically implement:
Identifying whether the response voice input by the user includes key information matching the prompt voice; and if not, determining that the pause-voice-input function needs to be enabled.
Optionally, the program instructions, when executed by the processor, further implement:
If an instruction to switch the entry mode is received, switching the current information entry mode from the voice input mode to the text entry mode.
Optionally, after the current information entry mode is switched from the voice input mode to the text entry mode, the program instructions, when executed by the processor, further implement:
Generating a second form according to the form item names whose prompt voices have been output, the corresponding extracted key information, and the remaining form item names whose prompt voices are still to be output; and displaying the second form in the text entry interface.
Optionally, after the first form is generated according to the form item names corresponding to the prompt voices already output and the extracted key information, the program instructions, when executed by the processor, further implement:
When a modification instruction for a target form item in the first form is received, obtaining text information input by the user;
Replacing the key information corresponding to the target form item with the text information.
It can be understood that the functions of A and B in this embodiment may be implemented according to the methods in the foregoing method embodiments. For the specific implementation process, refer to the related descriptions of the foregoing method embodiments; details are not repeated here.
Those of ordinary skill in the art will understand that all or part of the processes of the foregoing method embodiments can be implemented by a computer program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, may include the processes of the foregoing method embodiments. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The above discloses only a preferred embodiment of the present invention and certainly cannot be used to limit the scope of rights of the present invention. Those of ordinary skill in the art will understand that implementations of all or part of the processes of the foregoing embodiment, and equivalent changes made in accordance with the claims of the present invention, still fall within the scope covered by the invention.

Claims (10)

  1. An information input method, comprising:
    in a voice input mode, outputting prompt voices for form item names in a preset order in a voice input interface;
    detecting, based on the prompt voice, a response voice input by a user, and extracting from the response voice key information matching the prompt voice;
    when it is detected that the prompt voice output is the prompt voice for the last form item name in the preset order, generating a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information, and displaying the first form.
  2. The method according to claim 1, wherein the method further comprises:
    detecting whether a pause-voice-input function needs to be enabled;
    if so, enabling the pause-voice-input function.
  3. The method according to claim 2, wherein the detecting whether a pause-voice-input function needs to be enabled comprises:
    detecting whether the response voice input by the user is detected within a preset time after the prompt voice is output;
    if not, determining that the pause-voice-input function needs to be enabled.
  4. The method according to claim 2, wherein the detecting whether a pause-voice-input function needs to be enabled comprises:
    identifying whether the response voice input by the user includes the key information matching the prompt voice;
    if not, determining that the pause-voice-input function needs to be enabled.
  5. The method according to any one of claims 1 to 4, wherein the method further comprises:
    if an instruction to switch the entry mode is received, switching the current information entry mode from the voice input mode to a text entry mode.
  6. The method according to claim 5, wherein after the switching the current information entry mode from the voice input mode to the text entry mode, the method further comprises:
    generating a second form according to the form item names whose prompt voices have been output, the corresponding extracted key information, and the remaining form item names whose prompt voices are still to be output;
    displaying the second form in a text entry interface.
  7. The method according to claim 1, wherein after the generating the first form according to the form item names corresponding to the prompt voices already output and the extracted key information, the method further comprises:
    when a modification instruction for a target form item in the first form is received, obtaining text information input by the user;
    replacing the key information corresponding to the target form item with the text information.
  8. An information input apparatus, comprising:
    an output unit, configured to output, in a voice input mode, prompt voices for form item names in a preset order in a voice input interface;
    an acquiring unit, configured to detect, based on the prompt voice, a response voice input by a user, and to extract from the response voice key information matching the prompt voice;
    a generating unit, configured to generate, when it is detected that the prompt voice output is the prompt voice for the last form item name in the preset order, a first form according to the form item names whose prompt voices have been output and the corresponding extracted key information;
    a display unit, configured to display the first form.
  9. A terminal, comprising a processor, an input device, an output device, and a memory, wherein the processor, the input device, the output device, and the memory are connected to each other, the memory is configured to store a computer program, the computer program comprises program instructions, and the processor is configured to invoke the program instructions to perform the method according to any one of claims 1 to 7.
  10. A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, the computer program comprises program instructions, and the program instructions, when executed by a processor, cause the processor to perform the method according to any one of claims 1 to 7.
PCT/CN2018/089393 2017-12-29 2018-05-31 Information input method, device, terminal, and computer readable storage medium WO2019128103A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711474053.7A CN108287815A (en) 2017-12-29 2017-12-29 Information input method, device, terminal and computer readable storage medium
CN201711474053.7 2017-12-29

Publications (1)

Publication Number Publication Date
WO2019128103A1 true WO2019128103A1 (en) 2019-07-04

Family

ID=62832656

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/089393 WO2019128103A1 (en) 2017-12-29 2018-05-31 Information input method, device, terminal, and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN108287815A (en)
WO (1) WO2019128103A1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109495636A (en) * 2018-10-23 2019-03-19 慈中华 Information interacting method and device
CN109300473A (en) * 2018-10-23 2019-02-01 慈中华 Voice messaging acquisition methods and device
CN109599102A (en) * 2018-10-24 2019-04-09 慈中华 Identify the method and device of channels and collaterals state
CN109509044A (en) * 2018-11-13 2019-03-22 四川长虹电器股份有限公司 A kind of intelligent tickets system, method and billing information recording device
CN110060674B (en) * 2019-03-15 2022-02-01 重庆小雨点小额贷款有限公司 Table management method, device, terminal and storage medium
CN110660395B (en) * 2019-08-26 2022-04-29 天津开心生活科技有限公司 Safety report generation method and device based on voice recognition
CN111640417A (en) * 2020-05-13 2020-09-08 广州国音智能科技有限公司 Information input method, device, equipment and computer readable storage medium
CN111967235B (en) * 2020-08-31 2023-06-27 深圳赛安特技术服务有限公司 Form processing method, form processing device, computer equipment and storage medium
CN112214997A (en) * 2020-10-09 2021-01-12 深圳壹账通智能科技有限公司 Voice information recording method and device, electronic equipment and storage medium
CN112883696A (en) * 2021-02-03 2021-06-01 维沃移动通信有限公司 Form filling method, form sharing method, device, equipment and storage medium
CN113299289A (en) * 2021-03-30 2021-08-24 阿里巴巴新加坡控股有限公司 Information input method and device and electronic equipment
CN113299290A (en) * 2021-04-06 2021-08-24 维沃移动通信有限公司 Method and device for speech recognition, electronic equipment and readable storage medium
CN113221990B (en) * 2021-04-30 2024-02-23 平安科技(深圳)有限公司 Information input method and device and related equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6195665B1 (en) * 1996-03-05 2001-02-27 Tomorrow's Software, L.L.C. Digital electrical computer apparatus, and methods for making and using the same, for template building, loading, and viewing
CN107357772A (en) * 2017-07-04 2017-11-17 贵州小爱机器人科技有限公司 List filling method, device and computer equipment
CN107430859A (en) * 2015-04-08 2017-12-01 谷歌公司 Input is mapped to form fields

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2002238961A1 (en) * 2001-03-22 2002-10-08 Canon Kabushiki Kaisha Information processing apparatus and method, and program
CN107168551A (en) * 2017-06-13 2017-09-15 重庆小雨点小额贷款有限公司 The input method that a kind of list is filled in
CN107479797A (en) * 2017-09-27 2017-12-15 深圳天珑无线科技有限公司 A kind of method and device for inputting telephone number

Also Published As

Publication number Publication date
CN108287815A (en) 2018-07-17

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18896696

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18896696

Country of ref document: EP

Kind code of ref document: A1
