WO2021057908A1 - Method and device for instant translation, mobile terminal, and computer storage medium

Method and device for instant translation, mobile terminal, and computer storage medium

Info

Publication number
WO2021057908A1
Authority
WO
WIPO (PCT)
Prior art keywords
translation
display
state
time
real
Prior art date
Application number
PCT/CN2020/117783
Other languages
English (en)
Chinese (zh)
Inventor
陆志豪
董乐麒
Original Assignee
深圳市万普拉斯科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市万普拉斯科技有限公司
Publication of WO2021057908A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4858 End-user interface for client configuration for modifying screen layout parameters, e.g. fonts, size of the windows
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/451 Execution arrangements for user interfaces
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/488 Data services, e.g. news ticker
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/488 Data services, e.g. news ticker
    • H04N21/4884 Data services, e.g. news ticker for displaying subtitles

Definitions

  • the present invention relates to the technical field of electronic translation, and in particular to a method, device, mobile terminal and computer storage medium for displaying instant translation.
  • subtitles are usually used to display the translation result.
  • existing real-time translation displays a translation only after a single complete sentence has been translated, so a truly real-time translation display cannot be provided. Because some sentences are relatively long, the translation is delayed, which affects the user experience.
  • the present invention provides a real-time translation display method, device, mobile terminal, and computer storage medium that display translations in both the translation-in-progress state and the translation-completed state, thereby providing real-time translation display, reducing translation delay, and improving the user experience.
  • a real-time translation display method includes: obtaining a real-time translation; and displaying translations in the translation-in-progress state and the translation-completed state in different display styles.
  • the present invention also provides a real-time translation display device, including: a real-time translation acquisition module, used to acquire a real-time translation; and a real-time translation display module, used to display translations in the translation-in-progress state and the translation-completed state in different display styles.
  • the present invention also provides a mobile terminal, including a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to make the mobile terminal execute the instant translation display method.
  • the present invention also provides a computer storage medium, which stores a computer program that, when executed by a processor, implements the instant translation display method.
  • FIG. 1 is a flowchart of a real-time translation display method provided by Embodiment 1 of the present invention.
  • FIG. 2 is a flowchart of a real-time translation display method provided by Embodiment 2 of the present invention.
  • FIG. 3 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 3 of the present invention.
  • FIG. 4 is a flowchart for setting the display time of the translation in the translation-completed state provided by Embodiment 3 of the present invention.
  • FIG. 5 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 4 of the present invention.
  • FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
  • Fig. 1 is a flowchart of a real-time translation display method provided by Embodiment 1 of the present invention. The method includes the following steps.
  • Step S11: Obtain a real-time translation.
  • when the mobile terminal translates real-time video or audio, it can recognize the source voice in the video or audio through a voice recognition function, thereby converting the source voice into text in the original language or into a translation in another language. For example, an English video can be converted into English text or a Chinese translation through voice recognition.
  • the instant translation may be a translation of a sentence, a translation of a word, or part of the translation of a sentence, that is, part of the translation result of the sentence. When the sentence to be translated is relatively long, part of the translation of the sentence being translated can therefore be displayed first to improve the user experience.
  • a partial translation covering a preset time length of the source voice can be obtained, where the preset time length may be 2 seconds, 3 seconds, etc.; there is no limitation here.
  • Step S12: Display translations in the translation-in-progress state and the translation-completed state in different display styles.
  • part of the translation of a sentence that is still being translated can be obtained for display; this is the translation in the translation-in-progress state.
  • as translation of the sentence proceeds, the translation in the translation-in-progress state keeps changing and gradually becomes complete and correct.
  • once translation of the sentence is finished, the final translation is displayed as the translation in the translation-completed state.
  • because the translation keeps changing over time until translation is completed, the subtitles that show it also keep changing.
  • the in-progress translation and the finished translation are therefore displayed in different display styles, so that the user can clearly distinguish subtitles that are still being translated from subtitles whose translation is complete, thereby improving the user experience.
  • for example, the instant translation of the translation result can be obtained every two seconds, and the current subtitle replaced every two seconds.
  • the real-time translation in the translation-in-progress state can be displayed as subtitles in the Song typeface (Song Ti).
  • the subtitles displayed after translation is completed can be displayed in italics.
  • the differences between the different display styles include at least one of font differences, font style differences, text color differences, text transparency differences, icon usage differences, background color differences, and text description differences.
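  • as an illustration only, the mapping from translation state to display style described above might be sketched as follows. The names `TranslationState`, `SubtitleStyle`, `STYLES`, and `style_for` are hypothetical and do not come from the application, and the font name and opacity values are assumptions chosen to show two of the listed differences (font style and text transparency).

```python
from dataclasses import dataclass
from enum import Enum


class TranslationState(Enum):
    IN_PROGRESS = "in_progress"  # translation may still change
    COMPLETED = "completed"      # final translation of the sentence


@dataclass(frozen=True)
class SubtitleStyle:
    font_family: str
    italic: bool
    opacity: float  # 1.0 means fully opaque


# Hypothetical style table: upright Song typeface while translating,
# italics once completed. The opacity values are assumptions.
STYLES = {
    TranslationState.IN_PROGRESS: SubtitleStyle("SimSun", italic=False, opacity=0.7),
    TranslationState.COMPLETED: SubtitleStyle("SimSun", italic=True, opacity=1.0),
}


def style_for(state: TranslationState) -> SubtitleStyle:
    """Return the display style used for a translation in the given state."""
    return STYLES[state]
```

  • any renderer could consume such a table; the only property the method relies on is that the two states map to visibly different styles.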
  • Fig. 2 is a flowchart of a real-time translation display method according to Embodiment 2 of the present invention. The method includes the following steps.
  • Step S21: Obtain a real-time translation.
  • Step S22: According to the status mark of the real-time translation, determine whether the real-time translation is in the translation-in-progress state or the translation-completed state.
  • the instant translation output by the video or audio translation terminal may carry a status mark indicating whether the instant translation is a result in the translation-in-progress state or in the translation-completed state, so that the mobile terminal can subsequently display it in the preset subtitle display style and thereby distinguish real-time translations in the translation-in-progress state from those in the translation-completed state.
  • Step S23: Display translations in the translation-in-progress state and the translation-completed state in different display styles.
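  • a minimal sketch of Step S22, assuming the translation terminal delivers each instant translation as a JSON message. The application does not specify a message format, so the field names `text` and `final` and the function `parse_translation` are hypothetical.

```python
import json
from enum import Enum
from typing import Tuple


class TranslationState(Enum):
    IN_PROGRESS = "in_progress"
    COMPLETED = "completed"


def parse_translation(message: str) -> Tuple[str, TranslationState]:
    """Read the translated text and its status mark from one message.

    Assumed (hypothetical) format: {"text": "...", "final": true | false}.
    """
    payload = json.loads(message)
    if payload["final"]:
        state = TranslationState.COMPLETED
    else:
        state = TranslationState.IN_PROGRESS
    return payload["text"], state
```

  • the returned state is what the display step would use to choose the subtitle style.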
  • Fig. 3 is a detailed logic flow chart for displaying real-time translation provided by Embodiment 3 of the present invention, including the following steps.
  • Step S31: If the real-time translation is in the translation-in-progress state, determine whether the translation of the current subtitle is in the translation-in-progress state or the translation-completed state.
  • different display-timing logic is also adopted when displaying subtitles, so that subtitle replacement and display are smoother and the real-time translation in the translation-completed state is displayed correctly.
  • when the real-time translation to be displayed is in the translation-in-progress state, it is necessary to determine, before displaying it, whether the currently displayed subtitle is a translation in the translation-completed state or in the translation-in-progress state.
  • the determination can be based on the status mark of the translation of the current subtitle, and an algorithm or application can be set in the mobile terminal to make it.
  • Step S32: If the translation of the current subtitle is in the translation-in-progress state, replace the current subtitle with the instant translation and display it.
  • if the translation of the current subtitle is in the translation-in-progress state, the translation to be displayed is the latest translation of the source voice corresponding to the current subtitle. Therefore, the current subtitle can be replaced with the instant translation to update the translation of the current source voice.
  • the instant translation can be directly displayed.
  • Step S33: If the translation of the current subtitle is in the translation-completed state, determine whether the current subtitle has been displayed for the preset rule duration.
  • if the translation of the current subtitle is in the translation-completed state, the translation to be displayed is the translation of a new source voice.
  • before the translation of the new source voice is displayed, it is necessary to determine whether the current subtitle has been displayed for the preset rule duration, to ensure that users do not miss the completed translation, thereby improving the user experience.
  • an algorithm or an application program can be set in the mobile terminal to determine whether the subtitle in the translation-completed state has reached the preset rule duration.
  • the display duration under the preset rule can be set when the subtitle in the translation-completed state is displayed.
  • Step S34: If the current subtitle in the translation-completed state has been displayed for the preset rule duration, replace the subtitle with the real-time translation and display it.
  • when the display time of the subtitle in the translation-completed state reaches the preset rule duration, the current subtitle is replaced with the instant translation, so that the user can read the instant translation of the new source voice.
  • the above-mentioned preset rules include: the longer the speech duration of the source speech of the instant translation is, the longer the display time of the translation in the corresponding translation completed state is.
  • FIG. 4 is a flowchart for setting the display time of the translation in the translation-completed state provided in Embodiment 3, including the following steps.
  • Step S41: If the voice duration of the source voice is less than the first preset time value, display the corresponding translation in the translation-completed state for the first preset duration.
  • the first preset time value may be, for example, 6 seconds and the first preset duration 2 seconds; that is, when the source voice lasts less than 6 seconds, the subtitle of the final completed translation is displayed for 2 seconds, in the preset display style of the translation-completed state so as to distinguish it from a translation in progress.
  • Step S42: If the voice duration is greater than or equal to the first preset time value and less than or equal to the second preset time value, display the corresponding completed translation for the second preset duration, where the second preset duration is directly proportional to the voice duration.
  • the first preset time value may be, for example, 6 seconds and the second preset time value 12 seconds, and the second preset duration may be one-third of the source voice duration; that is, the second preset duration is 2 to 4 seconds. For example, when the source voice lasts 9 seconds, the subtitle of the completed translation is displayed for 3 seconds, so that the user can finish reading the translation.
  • Step S43: If the voice duration is greater than the second preset time value, display the corresponding completed translation for a third preset duration, where the second preset duration is greater than the first preset duration and less than the third preset duration.
  • the second preset time value may be, for example, 12 seconds and the third preset duration 4 seconds; that is, the subtitle of a completed translation is displayed for at most 4 seconds, so as not to affect the display of the translation of the next source voice, which improves the user experience.
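  • the duration rule of Steps S41 to S43 can be sketched as a small function using the example values given above (6 and 12 seconds as the preset time values, 2 and 4 seconds as the first and third preset durations, and one-third of the voice duration in between). The function and parameter names are hypothetical; the application only says these values "may be" used.

```python
def completed_display_seconds(
    speech_seconds: float,
    first_time_value: float = 6.0,    # example first preset time value
    second_time_value: float = 12.0,  # example second preset time value
    first_duration: float = 2.0,      # example first preset duration
    third_duration: float = 4.0,      # example third preset duration
    divisor: float = 3.0,             # second duration = speech duration / 3
) -> float:
    """Return how long a completed translation stays on screen, in seconds."""
    if speech_seconds < first_time_value:       # Step S41
        return first_duration
    if speech_seconds <= second_time_value:     # Step S42: proportional
        return speech_seconds / divisor
    return third_duration                       # Step S43: capped
```

  • with these example values the rule is continuous at the boundaries: a 6-second utterance yields 2 seconds and a 12-second utterance yields 4 seconds, so the display time never jumps between cases.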
  • Fig. 5 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 4 of the present invention, including the following steps.
  • Step S51: If the real-time translation is in the translation-in-progress state, determine whether the translation of the current subtitle is in the translation-in-progress state or the translation-completed state.
  • Step S52: If the translation of the current subtitle is in the translation-in-progress state, replace the current subtitle with the real-time translation and display it.
  • Step S53: If the translation of the current subtitle is in the translation-completed state, determine whether the current subtitle has been displayed for the preset rule duration.
  • Step S54: If the current subtitle in the translation-completed state has been displayed for the preset rule duration, replace the subtitle with the real-time translation and display it.
  • Step S55: If the real-time translation is in the translation-completed state, display the real-time translation directly as a subtitle, or replace the current subtitle with the real-time translation and display it.
  • when no subtitle is currently displayed, the real-time translation can be displayed directly as a subtitle. If a subtitle is currently displayed, it is replaced directly with the instant translation, so as to avoid delay in displaying the translation in the translation-completed state, thereby improving the user experience.
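  • the decision logic of Steps S51 to S55 might be sketched as below. This is an illustration under stated assumptions, not the applicant's implementation: the names `State`, `Subtitle`, and `next_subtitle` are hypothetical, time is passed in explicitly as seconds, and keeping the current subtitle on screen when the preset rule duration has not yet elapsed is an assumption, since the text does not say what happens in that case.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class State(Enum):
    IN_PROGRESS = "in_progress"
    COMPLETED = "completed"


@dataclass(frozen=True)
class Subtitle:
    text: str
    state: State
    shown_at: float      # time (seconds) the subtitle went on screen
    hold_seconds: float  # preset rule duration; only used when COMPLETED


def next_subtitle(current: Optional[Subtitle], text: str, state: State,
                  now: float, hold_seconds: float = 2.0) -> Subtitle:
    """Return the subtitle to show after an instant translation arrives."""
    incoming = Subtitle(text, state, now, hold_seconds)
    if current is None:
        return incoming                       # nothing on screen: show directly
    if state is State.COMPLETED:
        return incoming                       # S55: completed replaces at once
    if current.state is State.IN_PROGRESS:
        return incoming                       # S52: update the same sentence
    if now - current.shown_at >= current.hold_seconds:
        return incoming                       # S53/S54: hold time has elapsed
    return current                            # assumption: keep completed subtitle
```

  • for example, a completed subtitle shown at t = 1.0 with a 2-second hold is kept when an in-progress translation arrives at t = 2.0, and is replaced when one arrives at t = 3.0.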
  • FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
  • the instant translation display device 600 includes the following modules.
  • the instant translation acquisition module 610 is used to acquire the instant translation.
  • the instant translation display module 620 is used to display the translations in the translation in progress state and the translation completed state in different display styles.
  • the present invention also provides a mobile terminal, which may include a smart phone, a tablet computer, a vehicle-mounted computer, a smart wearable device, and the like.
  • the mobile terminal includes a memory and a processor, and the memory can be used to store a computer program.
  • the processor runs the computer program to enable the mobile terminal to execute the above method or the functions of each module in the above instant translation display device.
  • the memory may include a storage program area and a storage data area.
  • the storage program area may store an operating system, an application program required by at least one function (such as a sound playback function, an image playback function, etc.), and the like; the storage data area may store data created according to the use of the mobile terminal (such as audio data, a phone book, etc.), and the like.
  • the memory may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
  • This embodiment also provides a computer storage medium for storing the computer program used in the above-mentioned mobile terminal.
  • an embodiment of the present invention also provides a computer program product. The computer program product includes a computer program stored on a non-transitory computer-readable storage medium, and the computer program includes program instructions that, when executed by a computer, cause the computer to execute the method in any of the foregoing method embodiments.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the module, program segment, or part of the code contains one or more executable instructions for realizing the specified logic function.
  • in alternative implementations, the functions marked in the blocks may also occur in a different order from the order marked in the drawings.
  • each block in the structure diagram and/or flowchart, and each combination of blocks in the structure diagram and/or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or actions, or by a combination of dedicated hardware and computer instructions.
  • the functional modules or units in the various embodiments of the present invention may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
  • if the function is implemented in the form of a software function module and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the technical solution of the present invention, in essence, or the part that contributes to the prior art, or a part of the technical solution, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a smart phone, a personal computer, a server, a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention.
  • the aforementioned storage media include: USB flash drives, removable hard disks, read-only memory (ROM), random access memory (RAM), magnetic disks, optical disks, and other media that can store program code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • User Interface Of Digital Computer (AREA)
  • Machine Translation (AREA)

Abstract

The present invention relates to an instant translation method and device, a mobile terminal, and a computer storage medium. The instant translation display method comprises the steps of: acquiring an instant translation; and displaying, in different display styles, translations in a translation-in-progress state and in a translation-completed state.
PCT/CN2020/117783 2019-09-29 2020-09-25 Method and device for instant translation, mobile terminal, and computer storage medium WO2021057908A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910936815.3A CN112584252B (zh) 2019-09-29 2019-09-29 Instant translation display method, device, mobile terminal, and computer storage medium
CN201910936815.3 2019-09-29

Publications (1)

Publication Number Publication Date
WO2021057908A1 (fr) 2021-04-01

Family

ID=75111373

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/117783 WO2021057908A1 (fr) 2019-09-29 2020-09-25 Method and device for instant translation, mobile terminal, and computer storage medium

Country Status (2)

Country Link
CN (1) CN112584252B (fr)
WO (1) WO2021057908A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 Video processing method, video processing device, and computer-readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
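CN1356674A (zh) * 2000-12-05 2002-07-03 唐蓉 Easy-to-learn, easy-to-understand foreign-language pronunciation reading material
US20030176995A1 (en) * 2002-03-14 2003-09-18 Oki Electric Industry Co., Ltd. Translation mediate system, translation mediate server and translation mediate method
CN105761201A (zh) * 2016-02-02 2016-07-13 山东大学 Method for translating text in pictures
CN107066455A (zh) * 2017-03-30 2017-08-18 唐亮 Multilingual intelligent-preprocessing real-time statistical machine translation ***
CN107632980A (zh) * 2017-08-03 2018-01-26 北京搜狗科技发展有限公司 Speech translation method and device, and device for speech translation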

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108182183B (zh) * 2017-12-27 2021-09-17 北京百度网讯科技有限公司 Picture text translation method, application, and computer device
CN108985201A (zh) * 2018-06-29 2018-12-11 网易有道信息技术(北京)有限公司 Image processing method, medium, device, and computing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
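CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 Video processing method, video processing device, and computer-readable storage medium
CN114143592B (zh) * 2021-11-30 2023-10-27 抖音视界有限公司 Video processing method, video processing device, and computer-readable storage medium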

Also Published As

Publication number Publication date
CN112584252A (zh) 2021-03-30
CN112584252B (zh) 2022-02-22

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20868138

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 29-08-22)

122 Ep: pct application non-entry in european phase

Ref document number: 20868138

Country of ref document: EP

Kind code of ref document: A1