WO2021057908A1 - 即时译文显示方法、装置、移动终端和计算机存储介质 - Google Patents

即时译文显示方法、装置、移动终端和计算机存储介质 Download PDF

Info

Publication number
WO2021057908A1
WO2021057908A1 PCT/CN2020/117783 CN2020117783W WO2021057908A1 WO 2021057908 A1 WO2021057908 A1 WO 2021057908A1 CN 2020117783 W CN2020117783 W CN 2020117783W WO 2021057908 A1 WO2021057908 A1 WO 2021057908A1
Authority
WO
WIPO (PCT)
Prior art keywords
translation
display
state
time
real
Prior art date
Application number
PCT/CN2020/117783
Other languages
English (en)
French (fr)
Inventor
陆志豪
董乐麒
Original Assignee
深圳市万普拉斯科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市万普拉斯科技有限公司 filed Critical 深圳市万普拉斯科技有限公司
Publication of WO2021057908A1 publication Critical patent/WO2021057908A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4858End-user interface for client configuration for modifying screen layout parameters, e.g. fonts, size of the windows
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles

Definitions

  • the present invention relates to the technical field of electronic translation, and in particular to a method, device, mobile terminal and computer storage medium for displaying instant translation.
  • subtitles are usually used to display the translation result.
  • real-time translation will display the translated translation of a single complete sentence, and real-time translation display cannot be provided. And because there are relatively long sentences, the translation will be delayed, which affects the user experience.
  • the present invention provides a real-time translation display method, device, mobile terminal, and computer storage medium to display the translation in the translation state and the translation in the completed state, provide real-time translation display, and reduce the translation delay, Improve user experience.
  • a real-time translation display method includes: obtaining real-time translation; displaying the translation in a translation state and a translation completion state in different display styles.
  • the present invention also provides a real-time translation display device, including: real-time translation acquisition module, used to acquire real-time translation; real-time translation display module, used to display the translation in the translation state and the translation completed state in different display styles.
  • the present invention also provides a mobile terminal, including a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to make the mobile terminal execute the instant translation display method.
  • the present invention also provides a computer storage medium, which stores a computer program that, when executed by a processor, implements the instant translation display method.
  • FIG. 1 is a flowchart of a real-time translation display method according to Embodiment 1 of the present invention
  • Embodiment 2 is a flowchart of a real-time translation display method provided by Embodiment 2 of the present invention.
  • FIG. 3 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 3 of the present invention.
  • Fig. 5 is a detailed logic flow chart of displaying real-time translation provided by embodiment 4 of the present invention.
  • FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
  • Fig. 1 is a flowchart of a real-time translation display method provided by Embodiment 1 of the present invention. The method includes the following steps.
  • Step S11 Obtain real-time translation.
  • the mobile terminal when it translates real-time video or audio, it can recognize the source voice in the video or audio through the voice recognition function, thereby converting the source voice into the original language text, or the translation in other languages, such as English
  • the video can be converted into English text or Chinese translation through voice recognition.
  • the instant translation may be a translation of a sentence, or a translation of a word, or a part of the translation of a sentence, that is, a part of the translation result of the sentence. That is, when the sentence to be translated is relatively long, a part of the translation of the sentence in the translation can be displayed first to improve the user experience.
  • a part of the translation of the source voice preset time translation can be obtained, where the preset time length can be 2 seconds or 3 seconds, etc., there is no limitation here.
  • Step S12 Display the translations in the translation in progress state and the translation completed state in different display styles.
  • a part of the translation of the sentence in the translation can be obtained for display, that is, the translation in the translation state.
  • the translation in the translation state will continue to be translated as the sentence is translated. Changes will be made and gradually complete and correct.
  • the translated translation will be displayed in the completed state.
  • the translation before the translation is completed will continue to change over time, so the subtitles that show the translation are also constantly changing.
  • the finished translation is displayed in different display styles, so that the user can clearly understand and distinguish the subtitles in the translation and the translated subtitles, thereby improving the user experience.
  • the instant translation of the translation result can be obtained every two seconds, and the current subtitles can be replaced every two seconds.
  • the real-time translation in the translation state can be displayed in Song Ti subtitles.
  • the subtitles displayed after completion can be displayed in italics subtitles.
  • the differences between the different display styles include at least one of font differences, font style differences, text color differences, text transparency differences, icon usage differences, background color differences, and text description differences.
  • Fig. 2 is a flowchart of a real-time translation display method according to Embodiment 2 of the present invention. The method includes the following steps.
  • Step S21 Obtain real-time translation.
  • Step S22 According to the status flag of the real-time translation, it is judged whether the real-time translation is in the translation state or the translation completed state.
  • the mobile terminal may be provided with a status mark on the instant translation output by the video or audio translation terminal, and the status mark indicates that the instant translation is the result of the translation state or the translation completion state, so that the mobile terminal Subsequent display is performed through the pre-set subtitle display style, which distinguishes the real-time translation in the status of translation and the status of translation completed.
  • Step S23 Display the translations in the translation in progress state and the translation completed state in different display styles.
  • Fig. 3 is a detailed logic flow chart for displaying real-time translation provided by Embodiment 3 of the present invention, including the following steps.
  • Step S31 If the real-time translation is in the translation state, it is judged whether the translation of the current subtitle is in the translation state or the translation completion state.
  • different display time logic is also adopted in the process of displaying subtitles, so as to make the replacement and display process of subtitles more smooth and to ensure translation.
  • the real-time translation in the completed state can be displayed correctly.
  • the real-time translation to be displayed is in the state of translation, it is necessary to determine whether the currently displayed subtitle is the translation in the completed state of translation or the translation in the state of translation before the display.
  • the judging process can be based on the status mark of the current subtitle translation, and an algorithm or application can be set in the mobile terminal to judge.
  • Step S32 If the translation of the current subtitle is in a translation state, replace and display the current subtitle as the instant translation.
  • the translation of the current subtitle is in the translation state, it means that the translation to be displayed is the latest translation of the source voice corresponding to the current subtitle. Therefore, the current subtitle can be replaced and displayed with the instant translation to update the current source voice.
  • the instant translation can be directly displayed.
  • Step S33 If the translation of the current subtitle is in the translation completed state, it is determined whether the current subtitle display reaches the preset rule duration.
  • the translation of the current subtitle is in the translation complete state, it means that the translation to be displayed is the translation of the new source voice.
  • the translation of the new source voice is displayed, it is necessary to determine whether the current subtitles are displayed as expected.
  • Set the length of the rule to ensure that users will not miss the completed translation, thereby improving user experience.
  • an algorithm or an application program can be set in the mobile terminal to determine whether the subtitles in the completed state of translation reach the preset rule duration.
  • the display duration of the preset rule can be set when the subtitles in the translated state are displayed.
  • Step S34 If the current subtitle display in the translation completed state reaches the preset rule duration, subtitles are replaced and displayed as the real-time translation.
  • the display time of the subtitles in the completed state of the translation reaches the preset rule duration
  • the current subtitles are replaced with the instant translation, so that the user can read the instant translation of the new source voice.
  • the above-mentioned preset rules include: the longer the speech duration of the source speech of the instant translation is, the longer the display time of the translation in the corresponding translation completed state is.
  • FIG. 4 it is a flowchart for setting the display time of the translation in the translation completed state provided in the third embodiment, including the following steps.
  • Step S41 If the voice duration of the source voice is less than the first preset time value, display the corresponding translated translation in the completed state for the first preset duration.
  • the first preset time value may be, for example, 6 seconds
  • the first preset time length may be 2 seconds, that is, when the source speech duration is less than 6 seconds, the final translated translation will be
  • the subtitles are displayed for 2 seconds, and when they are displayed, they are displayed in the preset display style of the translation completed state to distinguish the translation in the process of translation.
  • Step S42 If the voice duration is greater than or equal to the first preset time value and less than or equal to the second preset time value, display the corresponding completed translation for the second preset duration and the second preset duration is the same as the voice duration.
  • the duration is directly proportional.
  • the first preset time value may be, for example, 6 seconds
  • the second preset time value may be 12 seconds
  • the second preset time length may be one-third of the source voice duration, that is, the first 2.
  • the preset duration is 2 to 4 seconds.
  • the subtitle display time of the translated translation is 3 seconds, so that the user can complete the reading of the translation.
  • Step S43 If the speech duration is greater than the second preset time value, display the corresponding completed translation for a third preset duration, where the second preset duration is greater than the first preset duration and less than the first preset duration.
  • the third preset duration is greater than the first preset duration and less than the first preset duration.
  • the second preset time value may be 12 seconds
  • the third preset time length is 4 seconds, that is, the display time of the translated subtitles after the translation is completed is 4 seconds at the longest, so as not to affect the next source voice translation The display improves the user experience.
  • Fig. 5 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 4 of the present invention, including the following steps.
  • Step S51 If the real-time translation is in the translation state, it is judged whether the translation of the current subtitle is in the translation state or the translation completion state.
  • Step S52 If the translation of the current subtitle is in a translation state, replace and display the current subtitle as the real-time translation.
  • Step S53 If the translation of the current subtitle is in a translation completed state, it is judged whether the current subtitle display reaches the preset rule duration.
  • Step S54 If the current subtitle display in the translation completed state reaches the preset rule duration, the subtitle is replaced and displayed as the real-time translation.
  • Step S55 If the real-time translation is in the translation completion state, the real-time translation is displayed in direct subtitles, or the current subtitles are replaced and displayed as the real-time translation.
  • the real-time translation can be directly displayed in subtitles when there is no subtitle currently displayed. If there are currently subtitles displayed, the self-cleaning replaces the current subtitles to be displayed as the instant translation, so as to avoid delays in displaying the translation in the completed translation state, thereby improving the user experience.
  • FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
  • the instant translation display device 600 includes the following modules.
  • the instant translation acquisition module 610 is used to acquire the instant translation.
  • the instant translation display module 620 is used to display the translations in the translation in progress state and the translation completed state in different display styles.
  • the present invention also provides a mobile terminal, which may include a smart phone, a tablet computer, a vehicle-mounted computer, a smart wearable device, and the like.
  • the mobile terminal includes a memory and a processor, and the memory can be used to store a computer program.
  • the processor runs the computer program to enable the mobile terminal to execute the above method or the functions of each module in the above instant translation display device.
  • the memory may include a storage program area and a storage data area.
  • the storage program area may store an operating system, an application program required by at least one function (such as a sound playback function, an image playback function, etc.), etc.; Use the created data (such as audio data, phone book, etc.) and so on.
  • the memory may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
  • This embodiment also provides a computer storage medium for storing the computer program used in the above-mentioned mobile terminal.
  • an embodiment of the present invention also provides a computer program product, the computer program product includes a computer program stored on a non-transitory computer-readable storage medium, the computer program includes program instructions, when the program instructions are When executed by a computer, the computer is caused to execute the method in any of the foregoing method embodiments.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the module, program segment, or part of the code contains one or more functions for realizing the specified logic function.
  • Executable instructions may also occur in a different order from the order marked in the drawings.
  • each block in the structure diagram and/or flowchart, and the combination of the blocks in the structure diagram and/or flowchart can be used as a dedicated hardware-based system that performs specified functions or actions. , Or can be realized by a combination of dedicated hardware and computer instructions.
  • the functional modules or units in the various embodiments of the present invention may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
  • the function is implemented in the form of a software function module and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the technical solution of the present invention essentially or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a smart phone, a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present invention.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • User Interface Of Digital Computer (AREA)
  • Machine Translation (AREA)

Abstract

本发明提供一种即时译文显示方法、装置、移动终端和计算机存储介质,该即时译文显示方法包括:获取即时译文;对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。

Description

即时译文显示方法、装置、移动终端和计算机存储介质
交叉引用
本发明要求在2019年09月29日提交中国专利局、申请号为201910936815.3、发明名称为“即时译文显示方法、装置、移动终端和计算机存储介质”的中国专利申请的优先权,该申请的全部内容通过引用结合在本发明中。
技术领域
本发明涉及电子翻译技术领域,具体而言,涉及一种即时译文显示方法、装置、移动终端和计算机存储介质。
背景技术
对实时的视频或音频进行即时翻译后,通常都会使用字幕显示翻译的结果。在一些情形下的即时翻译中,会显示单个完整句子已经翻译完成的译文,不能提供实时的译文显示。并且由于存在比较长的句子时,译文就会有延迟,从而影响用户的体验。
发明内容
鉴于上述问题,本发明提供了一种即时译文显示方法、装置、移动终端和计算机存储介质,以显示翻译中状态的译文,以及翻译完成状态的译文,提供实时译文显示,并降低译文的延迟,提高用户的体验。
为了实现上述目的,本发明采用如下的技术方案:
一种即时译文显示方法,包括:获取即时译文;对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
本发明还提供一种即时译文显示装置,包括:即时译文获取模块,用于获取即时译文;即时译文显示模块,用于对处于翻译中状态和翻译完成状态 的译文以不同的显示样式进行显示。
本发明还提供一种移动终端,包括存储器以及处理器,所述存储器用于存储计算机程序,所述处理器运行所述计算机程序以使所述移动终端执行所述的即时译文显示方法。
本发明还提供一种计算机存储介质,其存储有计算机程序,所述计算机程序在被处理器执行时实施所述的即时译文显示方法。
附图说明
为了更清楚地说明本发明的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本发明的某些实施例,因此不应被看作是对本发明保护范围的限定。在各个附图中,类似的构成部分采用类似的编号。
图1是本发明实施例1提供的一种即时译文显示方法的流程图;
图2是本发明实施例2提供的一种即时译文显示方法的流程图;
图3是本发明实施例3提供的显示即时译文的详细逻辑流程图;
图4是本发明实施例3提供的翻译完成状态的译文显示时间的设定流程图;
图5是本发明实施例4提供的显示即时译文的详细逻辑流程图;
图6是本发明实施例5提供的一种即时译文显示装置的结构示意图。
具体实施方式
下面将结合本发明实施例中附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。
通常在此处附图中描述和示出的本发明实施例的组件可以以各种不同的配置来布置和设计。因此,以下对在附图中提供的本发明的实施例的详细描 述并非旨在限制要求保护的本发明的范围,而是仅仅表示本发明的选定实施例。基于本发明的实施例,本领域技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都属于本发明保护的范围。
在下文中,可在本发明的各种实施例中使用的术语“包括”、“具有”及其同源词仅意在表示特定特征、数字、步骤、操作、元件、组件或前述项的组合,并且不应被理解为首先排除一个或更多个其它特征、数字、步骤、操作、元件、组件或前述项的组合的存在或增加一个或更多个特征、数字、步骤、操作、元件、组件或前述项的组合的可能性。
此外,术语“第一”、“第二”、“第三”等仅用于区分描述,而不能理解为指示或暗示相对重要性。
除非另有限定,否则在这里使用的所有术语(包括技术术语和科学术语)具有与本发明的各种实施例所属领域普通技术人员通常理解的含义相同的含义。所述术语(诸如在一般使用的词典中限定的术语)将被解释为具有与在相关技术领域中的语境含义相同的含义并且将不被解释为具有理想化的含义或过于正式的含义,除非在本发明的各种实施例中被清楚地限定。
实施例1
图1是本发明实施例1提供的一种即时译文显示方法的流程图,该方法包括如下步骤。
步骤S11:获取即时译文。
本发明实施例中,移动终端对实时的视频或音频进行翻译时,可通过语音识别功能识别视频或音频中的源语音,从而将源语音转换为原语言文字,或者其他语言的译文,例如英语视频可以通过语音识别转化为英语文字,或者中文译文等。其中,该即时译文可以为一个句子的译文,或者一个单词的译文,也可以是一个句子的一部分译文,也即为该句子的一部分翻译结果。也即在需要翻译的句子比较长时,可以将翻译中句子的一部分译文先进行显示,以提高用户的体验度,例如可以获取源语音预设时长翻译的部分译文, 其中该预设时长可以为2秒或3秒等,这里不做限定。
步骤S12:对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
本发明实施例中,在一个源语音长句进行翻译时,可以获取翻译中句子的一部分译文进行显示,也即为翻译中状态的译文,该翻译中状态的译文随着句子的继续翻译,会产生变化,并逐渐完整以及正确,最终将获得翻译完成状态的译文进行显示。其中,由于不同语言间存在语法、语序以及词义上的差异,因此在翻译完成状态前的译文是会随着时间不断更替的,因此显示译文的字幕也在不断更替,对处于翻译中状态和翻译完成状态的译文以不同显示样式进行显示,使用户清晰了解并区分翻译中的字幕以及翻译完成的字幕,从而提高用户体验度。例如在一个10秒音频长句进行翻译时,可以每两秒获取一次翻译结果的即时译文,每两秒替换当前字幕进行显示,而在翻译中状态的即时译文可以使用宋体字幕进行显示,在翻译完成后进行显示的字幕可以使用楷体字幕显示。
本发明实施例中,所述不同的显示样式之间的差异包括字体差异、字体样式差异、文字颜色差异、文字透明度差异、图标使用差异、背景颜色差异和文字说明差异中的至少一种。
实施例2
图2是本发明实施例2提供的一种即时译文显示方法的流程图,该方法包括如下步骤。
步骤S21:获取即时译文。
此步骤与上述步骤S11一致,在此不再赘述。
步骤S22:根据所述即时译文的状态标记,判断所述即时译文是处于翻译中状态还是处于翻译完成状态。
本发明实施例中,移动终端中在视频或音频翻译端输出的即时译文上可 以设置有状态标记,通过状态标记表明该即时译文是翻译中状态的结果或者是翻译完成状态的结果,以便移动终端后续通过预先设定的字幕显示样式来进行显示,区分开翻译中状态与翻译完成状态的即时译文。
步骤S23:对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
此步骤与上述步骤S12一致,在此不再赘述。
实施例3
图3是本发明实施例3提供的显示即时译文的详细逻辑流程图,包括如下步骤。
步骤S31:若所述即时译文处于翻译中状态,判断当前字幕的译文是处于翻译中状态还是处于翻译完成状态。
本发明实施例中,对于翻译中状态的即时译文与翻译完成状态的即时译文在显示为字幕的过程中,也采用不一样的显示时间逻辑,以使字幕的更换显示过程更加流畅,以及保证翻译完成状态的即时译文能够正确显示。其中,若即将要进行显示的即时译文是翻译中的状态时,在显示之前需要先判断当前显示的字幕是翻译完成状态的译文,还是翻译中状态的译文。判断过程可以基于当前字幕译文的状态标记,在移动终端中设置算法或应用程序来判断。
步骤S32:若当前字幕的译文处于翻译中状态,则将所述当前字幕替换显示为所述即时译文。
本发明实施例中,若当前字幕的译文处于翻译中状态时,则说明即将显示的译文是当前字幕相应源语音的最新译文,因此可以将当前字幕替换显示为该即时译文,以更新当前源语音句子的最新翻译结果,从而提高用户体验度。
本发明实施例中,若当前没有译文进行字幕显示时,则可直接显示该即 时译文。
步骤S33:若当前字幕的译文处于翻译完成状态,则判断当前字幕显示是否达到预设规则的时长。
本发明实施例中,若当前字幕的译文处于翻译完成状态时,则说明即将显示的译文是新的源语音的译文,在显示新的源语音的译文时,需要判断当前字幕的显示是否达到预设规则的时长,以保证用户不会错过翻译完成的译文,从而提高用户体验度。其中,可以在移动终端中设置有算法或应用程序,用与判断翻译完成状态字幕是否达到预设规则的时长。或者还可以在该翻译完成状态的字幕进行显示时就设置好预设规则的显示时长。
步骤S34:若处于翻译完成状态的当前字幕显示达到预设规则的时长时,字幕替换显示为所述即时译文。
本发明实施例中,在翻译完成状态的字幕显示时间达到预设规则的时长时,则将当前字幕替换为该即时译文,以便用户阅读新的源语音的即时译文。经过上述采用不一样的显示时间逻辑来显示翻译中状态的即时译文,以及翻译完成状态的即时译文,使翻译中内容和翻译完成内容的现实逻辑和显示时长形成差异化,从而使用户的阅读体验更好。
本发明实施例中,上述预设规则包括:即时翻译的源语音的语音时长越长,则相应的翻译完成状态的译文显示时间越长。具体地,如图4所示,是本实施例3提供的翻译完成状态的译文显示时间的设定流程图,包括如下步骤。
步骤S41:若源语音的语音时长小于第一预设时间值,将相应的翻译完成状态的译文显示第一预设时长。
本发明实施例中,该第一预设时间值例如可以为6秒,该第一预设时长可以为2秒,也即源语音的时长小于6秒时,则最终翻译完成的译文则会以字幕的形式显示2秒,并且显示的时候是以翻译完成状态的预设显示式样进行显示,以区分翻译中的译文。
步骤S42:若所述语音时长大于等于第一预设时间值且小于等于第二预设时间值,将相应的完成状态的译文的显示第二预设时长并且第二预设时长与所述语音时长成正比。
本发明实施例中,该第一预设时间值例如可以为6秒,第二预设时间值可以为12秒,而第二预设时长可以为源语音时长的三分之一,也即第二预设时长为2~4秒,例如源语音时长为9秒时,翻译完成的译文的字幕显示时间为3秒,以便用户可以完成对译文的阅读。
步骤S43:若所述语音时长大于第二预设时间值,将相应的完成状态的译文的显示第三预设时长,所述第二预设时长大于所述第一预设时长且小于所述第三预设时长。
本发明实施例中,该第二预设时间值可以为12秒,第三预设时长为4秒,也即翻译完成的译文字幕显示时间最长为4秒,从而不影响下个源语音译文的显示,提高用户的体验。
实施例4
图5是本发明实施例4提供的显示即时译文的详细逻辑流程图,包括如下步骤。
步骤S51:若所述即时译文处于翻译中状态,判断当前字幕的译文是处于翻译中状态还是处于翻译完成状态。
此步骤与上述步骤S31一致,在此不再赘述。
步骤S52:若当前字幕的译文处于翻译中状态,则将所述当前字幕替换显示为所述即时译文。
此步骤与上述步骤S32一致,在此不再赘述。
步骤S53:若当前字幕的译文处于翻译完成状态,则判断当前字幕显示是否达到预设规则的时长。
此步骤与上述步骤S33一致,在此不再赘述。
步骤S54:若处于翻译完成状态的当前字幕显示达到预设规则的时长时,字幕替换显示为所述即时译文。
此步骤与上述步骤S34一致,在此不再赘述。
步骤S55:若所述即时译文处于翻译完成状态,直接字幕显示所述即时译文,或当前字幕替换显示为所述即时译文。
本发明实施例中,若获得的即时译文是出于翻译完成状态的,在当前无字幕显示时可以直接字幕显示该即时译文。如果当前有字幕显示则自洁替换当前字幕显示为该即时译文,以避免翻译完成状态的译文显示延误,从而提高用户的体验。
实施例5
图6是本发明实施例5提供的一种即时译文显示装置的结构示意图。
该即时译文显示装置600包括以下模块。
即时译文获取模块610,用于获取即时译文。
即时译文显示模块620,用于对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
本发明实施例中,上述各个模块更加详细的功能描述可以参考前述实施例中相应部分的内容,在此不再赘述。
此外,本发明还提供了一种移动终端,该移动终端可以包括智能电话、平板电脑、车载电脑、智能穿戴设备等。该移动终端包括存储器和处理器,存储器可用于存储计算机程序,处理器通过运行所述计算机程序,从而使移动终端执行上述方法或者上述即时译文显示装置中的各个模块的功能。
存储器可包括存储程序区和存储数据区,其中,存储程序区可存储操作***、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据移动终端的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器可以包括高速随机存取存储器,还可以包括非 易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。
本实施例还提供了一种计算机存储介质,用于储存上述移动终端中使用的计算机程序。
此外,本发明实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行上述任意方法实施例中的方法。
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,也可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,附图中的流程图和结构图显示了根据本发明的多个实施例的装置、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在作为替换的实现方式中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,结构图和/或流程图中的每个方框、以及结构图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的***来实现,或者可以用专用硬件与计算机指令的组合来实现。
另外,在本发明各个实施例中的各功能模块或单元可以集成在一起形成一个独立的部分,也可以是各个模块单独存在,也可以两个或更多个模块集成形成一个独立的部分。
所述功能如果以软件功能模块的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可 以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是智能手机、个人计算机、服务器、或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应所述以权利要求的保护范围为准。

Claims (10)

  1. 一种即时译文显示方法,其中,包括:
    获取即时译文;
    对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
  2. 根据权利要求1所述的即时译文显示方法,其中,还包括:
    根据所述即时译文的状态标记,判断所述即时译文是处于翻译中状态还是处于翻译完成状态。
  3. 根据权利要求1所述的即时译文显示方法,其中,所述“对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示”包括:
    若所述即时译文处于翻译中状态,判断当前字幕的译文是处于翻译中状态还是处于翻译完成状态;
    若当前字幕的译文处于翻译中状态,则将所述当前字幕替换显示为所述即时译文;
    若当前字幕的译文处于翻译完成状态,则判断当前字幕显示是否达到预设规则的时长;
    若处于翻译完成状态的当前字幕显示达到预设规则的时长时,字幕替换显示为所述即时译文。
  4. 根据权利要求3所述的即时译文显示方法,其中,所述预设规则包括:
    即时翻译的源语音的语音时长越长,则相应的翻译完成状态的译文显示时间越长。
  5. 根据权利要求4所述的即时译文显示方法,其中,所述“即时翻译的源语音的语音时长越长,则相应的翻译完成状态的译文显示时间越长”包括:
    若源语音的语音时长小于第一预设时间值,将相应的翻译完成状态的译文显示第一预设时长;
    若所述语音时长大于等于第一预设时间值且小于等于第二预设时间值,将相应的完成状态的译文的显示第二预设时长并且第二预设时长与所述语音 时长成正比;
    若所述语音时长大于第二预设时间值,将相应的完成状态的译文的显示第三预设时长,所述第二预设时长大于所述第一预设时长且小于所述第三预设时长。
  6. 根据权利要求3所述的即时译文显示方法,其中,所述“对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示”还包括:
    若所述即时译文处于翻译完成状态,直接字幕显示所述即时译文,或当前字幕替换显示为所述即时译文。
  7. 根据权利要求1至6任一项所述的即时译文显示方法,其中,所述不同的显示样式之间的差异包括字体差异、字体样式差异、文字颜色差异、文字透明度差异、图标使用差异、背景颜色差异和文字说明差异中的至少一种。
  8. 一种即时译文显示装置,其中,包括:
    即时译文获取模块,用于获取即时译文;
    即时译文显示模块,用于对处于翻译中状态和翻译完成状态的译文以不同的显示样式进行显示。
  9. 一种移动终端,其中,包括存储器以及处理器,所述存储器用于存储计算机程序,所述处理器运行所述计算机程序以使所述移动终端执行根据权利要求1至7中任一项所述的即时译文显示方法。
  10. 一种计算机存储介质,其中,其存储有计算机程序,所述计算机程序在被处理器执行时实施权利要求1至7中任一项所述的即时译文显示方法。
PCT/CN2020/117783 2019-09-29 2020-09-25 即时译文显示方法、装置、移动终端和计算机存储介质 WO2021057908A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910936815.3 2019-09-29
CN201910936815.3A CN112584252B (zh) 2019-09-29 2019-09-29 即时译文显示方法、装置、移动终端和计算机存储介质

Publications (1)

Publication Number Publication Date
WO2021057908A1 true WO2021057908A1 (zh) 2021-04-01

Family

ID=75111373

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/117783 WO2021057908A1 (zh) 2019-09-29 2020-09-25 即时译文显示方法、装置、移动终端和计算机存储介质

Country Status (2)

Country Link
CN (1) CN112584252B (zh)
WO (1) WO2021057908A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 视频处理方法、视频处理装置和计算机可读存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1356674A (zh) * 2000-12-05 2002-07-03 唐蓉 易学易懂外文发音读物
US20030176995A1 (en) * 2002-03-14 2003-09-18 Oki Electric Industry Co., Ltd. Translation mediate system, translation mediate server and translation mediate method
CN105761201A (zh) * 2016-02-02 2016-07-13 山东大学 一种翻译图片中文字的方法
CN107066455A (zh) * 2017-03-30 2017-08-18 唐亮 一种多语言智能预处理实时统计机器翻译***
CN107632980A (zh) * 2017-08-03 2018-01-26 北京搜狗科技发展有限公司 语音翻译方法和装置、用于语音翻译的装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108182183B (zh) * 2017-12-27 2021-09-17 北京百度网讯科技有限公司 图片文字翻译方法、应用及计算机设备
CN108985201A (zh) * 2018-06-29 2018-12-11 网易有道信息技术(北京)有限公司 图像处理方法、介质、装置和计算设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1356674A (zh) * 2000-12-05 2002-07-03 唐蓉 易学易懂外文发音读物
US20030176995A1 (en) * 2002-03-14 2003-09-18 Oki Electric Industry Co., Ltd. Translation mediate system, translation mediate server and translation mediate method
CN105761201A (zh) * 2016-02-02 2016-07-13 山东大学 一种翻译图片中文字的方法
CN107066455A (zh) * 2017-03-30 2017-08-18 唐亮 一种多语言智能预处理实时统计机器翻译***
CN107632980A (zh) * 2017-08-03 2018-01-26 北京搜狗科技发展有限公司 语音翻译方法和装置、用于语音翻译的装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 视频处理方法、视频处理装置和计算机可读存储介质
CN114143592B (zh) * 2021-11-30 2023-10-27 抖音视界有限公司 视频处理方法、视频处理装置和计算机可读存储介质

Also Published As

Publication number Publication date
CN112584252A (zh) 2021-03-30
CN112584252B (zh) 2022-02-22

Similar Documents

Publication Publication Date Title
US11176141B2 (en) Preserving emotion of user input
US20210160582A1 (en) Method and system of displaying subtitles, computing device, and readable storage medium
CN107239547B (zh) 用于语音点歌的语音纠错方法、终端及存储介质
KR102081229B1 (ko) 텍스트 입력에 따른 실시간 이미지 출력 장치 및 방법
CN111885416B (zh) 一种音视频的修正方法、装置、介质及计算设备
EP3080713A1 (en) Using statistical language models to improve text input
CN111898388A (zh) 视频字幕翻译编辑方法、装置、电子设备及存储介质
CN111885313A (zh) 一种音视频的修正方法、装置、介质及计算设备
US10896624B2 (en) System and methods for transforming language into interactive elements
JP6155821B2 (ja) 情報処理装置、情報処理方法、及びプログラム
WO2021057908A1 (zh) 即时译文显示方法、装置、移动终端和计算机存储介质
US20240169972A1 (en) Synchronization method and apparatus for audio and text, device, and medium
CN113870396B (zh) 一种口型动画生成方法、装置、计算机设备及存储介质
WO2019007408A1 (zh) 一种显示方法、装置、可穿戴设备及计算机可读存储介质
CN102955770A (zh) 一种拼音自动识别方法及***
CN109714248B (zh) 一种数据处理方法及装置
CN110780749B (zh) 一种字符串纠错方法和装置
CN104427263A (zh) 一种显示字幕的方法和多媒体播放装置
CN112114770A (zh) 基于语音交互的界面引导方法、装置及设备
CN115862631A (zh) 一种字幕生成方法、装置、电子设备和存储介质
KR20180028434A (ko) 단어 관리 방법 및 장치
JP2019062332A (ja) 表示態様決定装置、表示装置、表示態様決定方法及びプログラム
CN108108350B (zh) 名词识别方法及装置
EP3055859B1 (en) Identifying a contact
JP5832815B2 (ja) 字幕情報を用いた検索結果提供方法およびシステム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20868138

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 29-08-22)

122 Ep: pct application non-entry in european phase

Ref document number: 20868138

Country of ref document: EP

Kind code of ref document: A1