WO2021057908A1 - Procédé et dispositif de traduction instantanée, terminal mobile, et support de stockage informatique - Google Patents
Procédé et dispositif de traduction instantanée, terminal mobile, et support de stockage informatique Download PDFInfo
- Publication number
- WO2021057908A1 WO2021057908A1 PCT/CN2020/117783 CN2020117783W WO2021057908A1 WO 2021057908 A1 WO2021057908 A1 WO 2021057908A1 CN 2020117783 W CN2020117783 W CN 2020117783W WO 2021057908 A1 WO2021057908 A1 WO 2021057908A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- translation
- display
- state
- time
- real
- Prior art date
Links
- 238000013519 translation Methods 0.000 title claims abstract description 241
- 238000000034 method Methods 0.000 title claims abstract description 35
- 230000014616 translation Effects 0.000 claims abstract description 239
- 238000004590 computer program Methods 0.000 claims description 14
- 230000006870 function Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 6
- 238000004140 cleaning Methods 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/485—End-user interface for client configuration
- H04N21/4858—End-user interface for client configuration for modifying screen layout parameters, e.g. fonts, size of the windows
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/451—Execution arrangements for user interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/485—End-user interface for client configuration
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
Definitions
- the present invention relates to the technical field of electronic translation, and in particular to a method, device, mobile terminal and computer storage medium for displaying instant translation.
- subtitles are usually used to display the translation result.
- real-time translation will display the translated translation of a single complete sentence, and real-time translation display cannot be provided. And because there are relatively long sentences, the translation will be delayed, which affects the user experience.
- the present invention provides a real-time translation display method, device, mobile terminal, and computer storage medium to display the translation in the translation state and the translation in the completed state, provide real-time translation display, and reduce the translation delay, Improve user experience.
- a real-time translation display method includes: obtaining real-time translation; displaying the translation in a translation state and a translation completion state in different display styles.
- the present invention also provides a real-time translation display device, including: real-time translation acquisition module, used to acquire real-time translation; real-time translation display module, used to display the translation in the translation state and the translation completed state in different display styles.
- the present invention also provides a mobile terminal, including a memory and a processor, the memory is used to store a computer program, and the processor runs the computer program to make the mobile terminal execute the instant translation display method.
- the present invention also provides a computer storage medium, which stores a computer program that, when executed by a processor, implements the instant translation display method.
- FIG. 1 is a flowchart of a real-time translation display method according to Embodiment 1 of the present invention
- Embodiment 2 is a flowchart of a real-time translation display method provided by Embodiment 2 of the present invention.
- FIG. 3 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 3 of the present invention.
- Fig. 5 is a detailed logic flow chart of displaying real-time translation provided by embodiment 4 of the present invention.
- FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
- Fig. 1 is a flowchart of a real-time translation display method provided by Embodiment 1 of the present invention. The method includes the following steps.
- Step S11 Obtain real-time translation.
- the mobile terminal when it translates real-time video or audio, it can recognize the source voice in the video or audio through the voice recognition function, thereby converting the source voice into the original language text, or the translation in other languages, such as English
- the video can be converted into English text or Chinese translation through voice recognition.
- the instant translation may be a translation of a sentence, or a translation of a word, or a part of the translation of a sentence, that is, a part of the translation result of the sentence. That is, when the sentence to be translated is relatively long, a part of the translation of the sentence in the translation can be displayed first to improve the user experience.
- a part of the translation of the source voice preset time translation can be obtained, where the preset time length can be 2 seconds or 3 seconds, etc., there is no limitation here.
- Step S12 Display the translations in the translation in progress state and the translation completed state in different display styles.
- a part of the translation of the sentence in the translation can be obtained for display, that is, the translation in the translation state.
- the translation in the translation state will continue to be translated as the sentence is translated. Changes will be made and gradually complete and correct.
- the translated translation will be displayed in the completed state.
- the translation before the translation is completed will continue to change over time, so the subtitles that show the translation are also constantly changing.
- the finished translation is displayed in different display styles, so that the user can clearly understand and distinguish the subtitles in the translation and the translated subtitles, thereby improving the user experience.
- the instant translation of the translation result can be obtained every two seconds, and the current subtitles can be replaced every two seconds.
- the real-time translation in the translation state can be displayed in Song Ti subtitles.
- the subtitles displayed after completion can be displayed in italics subtitles.
- the differences between the different display styles include at least one of font differences, font style differences, text color differences, text transparency differences, icon usage differences, background color differences, and text description differences.
- Fig. 2 is a flowchart of a real-time translation display method according to Embodiment 2 of the present invention. The method includes the following steps.
- Step S21 Obtain real-time translation.
- Step S22 According to the status flag of the real-time translation, it is judged whether the real-time translation is in the translation state or the translation completed state.
- the mobile terminal may be provided with a status mark on the instant translation output by the video or audio translation terminal, and the status mark indicates that the instant translation is the result of the translation state or the translation completion state, so that the mobile terminal Subsequent display is performed through the pre-set subtitle display style, which distinguishes the real-time translation in the status of translation and the status of translation completed.
- Step S23 Display the translations in the translation in progress state and the translation completed state in different display styles.
- Fig. 3 is a detailed logic flow chart for displaying real-time translation provided by Embodiment 3 of the present invention, including the following steps.
- Step S31 If the real-time translation is in the translation state, it is judged whether the translation of the current subtitle is in the translation state or the translation completion state.
- different display time logic is also adopted in the process of displaying subtitles, so as to make the replacement and display process of subtitles more smooth and to ensure translation.
- the real-time translation in the completed state can be displayed correctly.
- the real-time translation to be displayed is in the state of translation, it is necessary to determine whether the currently displayed subtitle is the translation in the completed state of translation or the translation in the state of translation before the display.
- the judging process can be based on the status mark of the current subtitle translation, and an algorithm or application can be set in the mobile terminal to judge.
- Step S32 If the translation of the current subtitle is in a translation state, replace and display the current subtitle as the instant translation.
- the translation of the current subtitle is in the translation state, it means that the translation to be displayed is the latest translation of the source voice corresponding to the current subtitle. Therefore, the current subtitle can be replaced and displayed with the instant translation to update the current source voice.
- the instant translation can be directly displayed.
- Step S33 If the translation of the current subtitle is in the translation completed state, it is determined whether the current subtitle display reaches the preset rule duration.
- the translation of the current subtitle is in the translation complete state, it means that the translation to be displayed is the translation of the new source voice.
- the translation of the new source voice is displayed, it is necessary to determine whether the current subtitles are displayed as expected.
- Set the length of the rule to ensure that users will not miss the completed translation, thereby improving user experience.
- an algorithm or an application program can be set in the mobile terminal to determine whether the subtitles in the completed state of translation reach the preset rule duration.
- the display duration of the preset rule can be set when the subtitles in the translated state are displayed.
- Step S34 If the current subtitle display in the translation completed state reaches the preset rule duration, subtitles are replaced and displayed as the real-time translation.
- the display time of the subtitles in the completed state of the translation reaches the preset rule duration
- the current subtitles are replaced with the instant translation, so that the user can read the instant translation of the new source voice.
- the above-mentioned preset rules include: the longer the speech duration of the source speech of the instant translation is, the longer the display time of the translation in the corresponding translation completed state is.
- FIG. 4 it is a flowchart for setting the display time of the translation in the translation completed state provided in the third embodiment, including the following steps.
- Step S41 If the voice duration of the source voice is less than the first preset time value, display the corresponding translated translation in the completed state for the first preset duration.
- the first preset time value may be, for example, 6 seconds
- the first preset time length may be 2 seconds, that is, when the source speech duration is less than 6 seconds, the final translated translation will be
- the subtitles are displayed for 2 seconds, and when they are displayed, they are displayed in the preset display style of the translation completed state to distinguish the translation in the process of translation.
- Step S42 If the voice duration is greater than or equal to the first preset time value and less than or equal to the second preset time value, display the corresponding completed translation for the second preset duration and the second preset duration is the same as the voice duration.
- the duration is directly proportional.
- the first preset time value may be, for example, 6 seconds
- the second preset time value may be 12 seconds
- the second preset time length may be one-third of the source voice duration, that is, the first 2.
- the preset duration is 2 to 4 seconds.
- the subtitle display time of the translated translation is 3 seconds, so that the user can complete the reading of the translation.
- Step S43 If the speech duration is greater than the second preset time value, display the corresponding completed translation for a third preset duration, where the second preset duration is greater than the first preset duration and less than the first preset duration.
- the third preset duration is greater than the first preset duration and less than the first preset duration.
- the second preset time value may be 12 seconds
- the third preset time length is 4 seconds, that is, the display time of the translated subtitles after the translation is completed is 4 seconds at the longest, so as not to affect the next source voice translation The display improves the user experience.
- Fig. 5 is a detailed logic flow chart of displaying real-time translation provided by Embodiment 4 of the present invention, including the following steps.
- Step S51 If the real-time translation is in the translation state, it is judged whether the translation of the current subtitle is in the translation state or the translation completion state.
- Step S52 If the translation of the current subtitle is in a translation state, replace and display the current subtitle as the real-time translation.
- Step S53 If the translation of the current subtitle is in a translation completed state, it is judged whether the current subtitle display reaches the preset rule duration.
- Step S54 If the current subtitle display in the translation completed state reaches the preset rule duration, the subtitle is replaced and displayed as the real-time translation.
- Step S55 If the real-time translation is in the translation completion state, the real-time translation is displayed in direct subtitles, or the current subtitles are replaced and displayed as the real-time translation.
- the real-time translation can be directly displayed in subtitles when there is no subtitle currently displayed. If there are currently subtitles displayed, the self-cleaning replaces the current subtitles to be displayed as the instant translation, so as to avoid delays in displaying the translation in the completed translation state, thereby improving the user experience.
- FIG. 6 is a schematic structural diagram of a real-time translation display device according to Embodiment 5 of the present invention.
- the instant translation display device 600 includes the following modules.
- the instant translation acquisition module 610 is used to acquire the instant translation.
- the instant translation display module 620 is used to display the translations in the translation in progress state and the translation completed state in different display styles.
- the present invention also provides a mobile terminal, which may include a smart phone, a tablet computer, a vehicle-mounted computer, a smart wearable device, and the like.
- the mobile terminal includes a memory and a processor, and the memory can be used to store a computer program.
- the processor runs the computer program to enable the mobile terminal to execute the above method or the functions of each module in the above instant translation display device.
- the memory may include a storage program area and a storage data area.
- the storage program area may store an operating system, an application program required by at least one function (such as a sound playback function, an image playback function, etc.), etc.; Use the created data (such as audio data, phone book, etc.) and so on.
- the memory may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
- This embodiment also provides a computer storage medium for storing the computer program used in the above-mentioned mobile terminal.
- an embodiment of the present invention also provides a computer program product, the computer program product includes a computer program stored on a non-transitory computer-readable storage medium, the computer program includes program instructions, when the program instructions are When executed by a computer, the computer is caused to execute the method in any of the foregoing method embodiments.
- each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the module, program segment, or part of the code contains one or more functions for realizing the specified logic function.
- Executable instructions may also occur in a different order from the order marked in the drawings.
- each block in the structure diagram and/or flowchart, and the combination of the blocks in the structure diagram and/or flowchart can be used as a dedicated hardware-based system that performs specified functions or actions. , Or can be realized by a combination of dedicated hardware and computer instructions.
- the functional modules or units in the various embodiments of the present invention may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
- the function is implemented in the form of a software function module and sold or used as an independent product, it can be stored in a computer readable storage medium.
- the technical solution of the present invention essentially or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a smart phone, a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present invention.
- the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes. .
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Acoustics & Sound (AREA)
- User Interface Of Digital Computer (AREA)
- Machine Translation (AREA)
Abstract
La présente invention concerne un procédé et un dispositif de traduction instantanée, un terminal mobile et un support d'informations informatique. Le procédé d'affichage de traduction instantanée comprend les étapes consistant à : acquérir une traduction instantanée ; et afficher, dans différents styles d'affichage, des traductions dans un état de traduction en cours et dans un état de translation terminée.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910936815.3A CN112584252B (zh) | 2019-09-29 | 2019-09-29 | 即时译文显示方法、装置、移动终端和计算机存储介质 |
CN201910936815.3 | 2019-09-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021057908A1 true WO2021057908A1 (fr) | 2021-04-01 |
Family
ID=75111373
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/117783 WO2021057908A1 (fr) | 2019-09-29 | 2020-09-25 | Procédé et dispositif de traduction instantanée, terminal mobile, et support de stockage informatique |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112584252B (fr) |
WO (1) | WO2021057908A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114143592A (zh) * | 2021-11-30 | 2022-03-04 | 北京字节跳动网络技术有限公司 | 视频处理方法、视频处理装置和计算机可读存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1356674A (zh) * | 2000-12-05 | 2002-07-03 | 唐蓉 | 易学易懂外文发音读物 |
US20030176995A1 (en) * | 2002-03-14 | 2003-09-18 | Oki Electric Industry Co., Ltd. | Translation mediate system, translation mediate server and translation mediate method |
CN105761201A (zh) * | 2016-02-02 | 2016-07-13 | 山东大学 | 一种翻译图片中文字的方法 |
CN107066455A (zh) * | 2017-03-30 | 2017-08-18 | 唐亮 | 一种多语言智能预处理实时统计机器翻译*** |
CN107632980A (zh) * | 2017-08-03 | 2018-01-26 | 北京搜狗科技发展有限公司 | 语音翻译方法和装置、用于语音翻译的装置 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108182183B (zh) * | 2017-12-27 | 2021-09-17 | 北京百度网讯科技有限公司 | 图片文字翻译方法、应用及计算机设备 |
CN108985201A (zh) * | 2018-06-29 | 2018-12-11 | 网易有道信息技术(北京)有限公司 | 图像处理方法、介质、装置和计算设备 |
-
2019
- 2019-09-29 CN CN201910936815.3A patent/CN112584252B/zh active Active
-
2020
- 2020-09-25 WO PCT/CN2020/117783 patent/WO2021057908A1/fr active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1356674A (zh) * | 2000-12-05 | 2002-07-03 | 唐蓉 | 易学易懂外文发音读物 |
US20030176995A1 (en) * | 2002-03-14 | 2003-09-18 | Oki Electric Industry Co., Ltd. | Translation mediate system, translation mediate server and translation mediate method |
CN105761201A (zh) * | 2016-02-02 | 2016-07-13 | 山东大学 | 一种翻译图片中文字的方法 |
CN107066455A (zh) * | 2017-03-30 | 2017-08-18 | 唐亮 | 一种多语言智能预处理实时统计机器翻译*** |
CN107632980A (zh) * | 2017-08-03 | 2018-01-26 | 北京搜狗科技发展有限公司 | 语音翻译方法和装置、用于语音翻译的装置 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114143592A (zh) * | 2021-11-30 | 2022-03-04 | 北京字节跳动网络技术有限公司 | 视频处理方法、视频处理装置和计算机可读存储介质 |
CN114143592B (zh) * | 2021-11-30 | 2023-10-27 | 抖音视界有限公司 | 视频处理方法、视频处理装置和计算机可读存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN112584252A (zh) | 2021-03-30 |
CN112584252B (zh) | 2022-02-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11176141B2 (en) | Preserving emotion of user input | |
US20210160582A1 (en) | Method and system of displaying subtitles, computing device, and readable storage medium | |
CN107239547B (zh) | 用于语音点歌的语音纠错方法、终端及存储介质 | |
KR102081229B1 (ko) | 텍스트 입력에 따른 실시간 이미지 출력 장치 및 방법 | |
CN111885416B (zh) | 一种音视频的修正方法、装置、介质及计算设备 | |
WO2015089409A1 (fr) | Utilisation de modèles de langues statistiques pour améliorer une entrée de texte | |
CN111898388A (zh) | 视频字幕翻译编辑方法、装置、电子设备及存储介质 | |
CN111885313A (zh) | 一种音视频的修正方法、装置、介质及计算设备 | |
US10896624B2 (en) | System and methods for transforming language into interactive elements | |
JP6155821B2 (ja) | 情報処理装置、情報処理方法、及びプログラム | |
WO2021057908A1 (fr) | Procédé et dispositif de traduction instantanée, terminal mobile, et support de stockage informatique | |
US20240169972A1 (en) | Synchronization method and apparatus for audio and text, device, and medium | |
CN113870396B (zh) | 一种口型动画生成方法、装置、计算机设备及存储介质 | |
WO2019007408A1 (fr) | Procédé d'affichage, dispositif, appareil pouvant être porté, et support d'informations lisible par ordinateur | |
CN102955770A (zh) | 一种拼音自动识别方法及*** | |
CN109714248B (zh) | 一种数据处理方法及装置 | |
CN110780749B (zh) | 一种字符串纠错方法和装置 | |
CN104427263A (zh) | 一种显示字幕的方法和多媒体播放装置 | |
CN112114770A (zh) | 基于语音交互的界面引导方法、装置及设备 | |
KR20180028434A (ko) | 단어 관리 방법 및 장치 | |
JP2019062332A (ja) | 表示態様決定装置、表示装置、表示態様決定方法及びプログラム | |
CN108108350B (zh) | 名词识别方法及装置 | |
TW201830227A (zh) | 顯示文本的字串的方法、可穿戴裝置及非暫態電腦可讀媒體 | |
EP3055859B1 (fr) | Identification d'un contact | |
JP5832815B2 (ja) | 字幕情報を用いた検索結果提供方法およびシステム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20868138 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 29-08-22) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20868138 Country of ref document: EP Kind code of ref document: A1 |