CN108366182B - Calibration method and device for synchronous broadcast of text voice and computer storage medium - Google Patents

Calibration method and device for synchronous broadcast of text voice and computer storage medium Download PDF

Info

Publication number
CN108366182B
CN108366182B CN201810150541.0A CN201810150541A CN108366182B CN 108366182 B CN108366182 B CN 108366182B CN 201810150541 A CN201810150541 A CN 201810150541A CN 108366182 B CN108366182 B CN 108366182B
Authority
CN
China
Prior art keywords
line
voice
broadcast
display unit
broadcasting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810150541.0A
Other languages
Chinese (zh)
Other versions
CN108366182A (en
Inventor
武赢
杨珣
李月
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BOE Technology Group Co Ltd
Original Assignee
BOE Technology Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BOE Technology Group Co Ltd filed Critical BOE Technology Group Co Ltd
Priority to CN201810150541.0A priority Critical patent/CN108366182B/en
Publication of CN108366182A publication Critical patent/CN108366182A/en
Application granted granted Critical
Publication of CN108366182B publication Critical patent/CN108366182B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278Subtitling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/04Synchronising
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Controls And Circuits For Display Device (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention provides a calibration method and a device for synchronous broadcast of text voice and a computer storage medium, wherein the calibration method comprises the following steps: the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line; the character display unit determines the current scrolling display line number, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.

Description

Calibration method and device for synchronous broadcast of text voice and computer storage medium
Technical Field
The invention relates to the technical field of intelligent systems, in particular to a calibration method and device for synchronous text voice broadcast and a computer storage medium.
Background
In the prior art, the text and voice synchronous broadcasting solution mostly adopts a method of presetting a timestamp, such as dynamic rolling of movie subtitles and lyrics. The method is characterized in that the broadcast content and the broadcast time are known.
However, for the random voice question and answer, the situation that such broadcast contents and broadcast duration are synchronously played by text and voice, the method of presetting the timestamp in the prior art is not applicable, and under the situation, a better text and voice synchronous broadcast mode cannot be presented for the user.
In summary, in the prior art, when the text and the voice are broadcast synchronously, the user experience is poor.
Disclosure of Invention
In view of the above, the present invention provides a calibration method and apparatus for synchronous broadcast of text and voice, and a computer storage medium, so as to increase user experience.
In order to achieve the purpose, the invention provides the following technical scheme:
a calibration method for synchronous broadcast of text and voice comprises the following steps:
the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;
the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit;
the text display unit receives the broadcasting line number sent by the voice broadcasting unit, wherein the broadcasting line number is the line number sent when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line;
and the character display unit determines the display line number currently scrolled to, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.
Preferably, the text display unit receives the voice recognition character string sent by the voice broadcast unit, and determines the characters displayed on each line, including:
the character display unit receives the voice recognition character string sent by the voice broadcasting unit, and determines characters displayed in each line according to the width and the height of the character display unit.
Preferably, the text display unit determines the number of display lines currently scrolled to, including:
and the character display unit determines the display line number scrolled to currently display according to the initial line number value, the height pixel of the character and the preset scrolling speed.
Preferably, the comparing the number of display lines with the number of broadcast lines to determine an error value includes:
and taking the difference between the display line number and the broadcast line number as an error value.
Preferably, when the error value exceeds a preset threshold, adjusting the scrolling speed of the displayed character of each line comprises:
when the difference value between the display line number and the broadcast line number is larger than zero, the scrolling speed of the character is reduced;
and when the difference value between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the character.
A calibration method for synchronous broadcast of text and voice comprises the following steps:
the voice broadcasting unit sends the acquired voice recognition character string to a character display unit;
the voice broadcasting unit receives the line feed information sent by the character display unit, and sends broadcasting lines to the character display unit when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line.
Preferably, after the voice broadcast unit sends the speech recognition character string that obtains for the text display element, the voice broadcast unit receives before the line feed information that the text display element sent, still includes:
and the voice broadcasting unit starts to broadcast the acquired voice recognition character string after preset time.
A calibration device for synchronous broadcast of text and voice comprises:
the character rolling module is used for receiving the voice recognition character string sent by the voice broadcasting unit, determining the characters displayed in each line and automatically rolling the characters displayed in each line at a preset rolling speed;
the line feed information determining module is used for obtaining line feed information corresponding to each line according to the characters displayed by each line and sending the line feed information to the voice broadcasting unit;
the information receiving module is used for receiving the broadcasting line number sent by the voice broadcasting unit, wherein the broadcasting line number is the line number sent when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line;
and the synchronous calibration module is used for determining the number of display lines currently scrolled to, comparing the number of display lines with the number of broadcast lines, determining an error value, and adjusting the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.
A calibration device for synchronous broadcast of text and voice comprises:
the character string sending module is used for sending the acquired voice recognition character string to the character display unit;
and the line feed back information determining module is used for receiving the line feed information sent by the character display unit, and sending the broadcasting line number to the character display unit when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line.
A computer storage medium stores a program corresponding to the calibration method.
Compared with the prior art, the scheme of the invention has the following beneficial effects:
the calibration method for synchronous broadcast of the text voice provided by the embodiment of the invention comprises the following steps: the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line; the character display unit determines the current scrolling display line number, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value. According to the calibration method, the display line number is compared with the broadcast line number to determine an error value, and when the error value exceeds a preset threshold value, the rolling speed of the characters displayed on each line is adjusted, so that the accumulated time delay of drift errors caused by long-time playing can be calibrated, and further the user experience is increased.
Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
The foregoing and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
fig. 1 is a schematic diagram of a one-time complete synchronous text-to-speech broadcast process according to an embodiment of the present invention;
fig. 2 is a flowchart of a calibration method for synchronous broadcast of text and voice according to an embodiment of the present invention;
fig. 3 is a flowchart of another calibration method for synchronous broadcast of text and voice according to an embodiment of the present invention;
fig. 4 is a block diagram of a calibration apparatus for synchronous broadcast of text and voice according to an embodiment of the present invention;
fig. 5 is a block diagram of another calibration apparatus for synchronous broadcast of text and voice according to an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative only and should not be construed as limiting the invention.
As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or wirelessly coupled. As used herein, the term "and/or" includes all or any element and all combinations of one or more of the associated listed items.
It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
Firstly, error sources in the process of synchronous broadcast of text and voice are introduced.
In an Android system, there are a text display APK (Android Package) and a voice broadcast APK, and a one-time complete text and voice synchronous broadcast process is shown in fig. 1.
As shown in fig. 1, after the voice broadcast APK obtains the voice recognition String (String) returned from the cloud, it sends the String to the text display APK at time T0, the text display APK receives the String at time T1, the text display APK obtains the text total characters, the display line number, and the percentage of each line of characters in the total characters according to the current device width and height, and according to the display parameters, and sends the line change array at time T3, where the line change array includes the percentage of each line of characters in the total characters.
As shown in fig. 1, at the time of T4, the voice broadcast APK receives the line feed array, compares with the broadcast progress provided by the broadcast function of the voice broadcast APK itself, and when the broadcast progress reaches the value in the text display APK line feed array, sends the broadcast line number to the text display APK at the time of T5, and the text display APK receives the broadcast line number at the time of T6 and completes line feed display at the time of T7.
Can see from above process, send String from voice broadcast APK, show to text display APK, send the broadcast line number from voice broadcast APK, show that the APK actually accomplishes the line feed and shows to the text, there are a lot of delays in the centre, very big influence the accuracy and the user experience that the synchronous broadcast of characters pronunciation.
The inventor of the application finds that the factors influencing the accuracy of the synchronous broadcasting process of the text voice mainly have two points: firstly, broadcasting delay (such as function call delay of the voice broadcasting APK and the character display APK, function delay of the character display APK and function delay of the voice broadcasting APK) existing before the voice broadcasting APK and the character display APK leave a factory; secondly, in the practical application process, drift errors caused by long-time playing are accumulated and delayed.
The specific embodiment of the invention solves the problems that the broadcasting delay exists before the delivery of the voice broadcasting APK and the text display APK as follows:
because voice broadcast APK and word display APK are in same system, so voice broadcast APK and word display APK share a clock, have following several main delays in this process:
propagation instruction latency: t isds=T1-T0=T4-T3=T6-T5
And (3) displaying instruction delay: t isDspAc=T3-T1
Line feed instruction delay: t isLineAc=T7-T6
The above-mentioned instruction delays, although slightly different, may be approximately equal, but the overall difference is not large.
The following describes how to measure the above delay:
in the embodiment of the invention, an average measurement averaging method can be adopted, multiple times of measurement are carried out in advance before software versions of the voice broadcast APK and the text display APK are delivered from a factory, and an average value is obtained for multiple measurement values. That is, in the measurement process, a timestamp (obtained by a system time function) is added in the transmitted instruction, so that the absolute time of transmission of each instruction can be obtained, and further, each delay can be obtained.
After the broadcasting delay existing before delivery is calibrated, the embodiment of the invention mainly introduces how to calibrate the accumulated drift error delay caused by long-time broadcasting so as to increase the user experience.
The technical scheme of the embodiment of the invention is described below by combining the accompanying drawings.
As shown in fig. 2, a calibration method for synchronous broadcast of text and voice is provided in the embodiment of the present invention, and includes:
s201, a character display unit receives a voice recognition character string sent by a voice broadcasting unit, determines characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;
s202, the character display unit obtains line feed information corresponding to each line according to characters displayed by each line, and sends the line feed information to the voice broadcasting unit;
s203, the text display unit receives the broadcasting line number sent by the voice broadcasting unit, wherein the broadcasting line number is the line number sent when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line;
s204, the character display unit determines the number of display lines which are scrolled currently, compares the number of display lines with the number of broadcast lines, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.
The calibration method for synchronous broadcast of the text voice in the embodiment of the invention comprises the following steps: the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line; the character display unit determines the current scrolling display line number, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value. According to the calibration method, the display line number is compared with the broadcast line number to determine an error value, and when the error value exceeds a preset threshold value, the rolling speed of the characters displayed on each line is adjusted, so that the accumulated time delay of drift errors caused by long-time playing can be calibrated, and further the user experience is increased.
Preferably, the step S201 includes that the text display unit receives the voice recognition character string sent by the voice broadcast unit, and determines the characters displayed in each line, including: the character display unit receives the voice recognition character string sent by the voice broadcasting unit, and determines characters displayed in each line according to the width and the height of the character display unit.
Specifically, it is assumed that the speech recognition character string received by the character display unit includes 100 characters, the character display unit calculates the characters that can be displayed in each line in real time according to the width and height of the character display unit, and it is assumed that 30 characters are displayed at most in each line (different character display units are different), so that a total of four lines can be displayed by 100 characters.
Regarding the line feed information in step S202, specifically, the line feed information in the embodiment of the present invention is taken as an example of a line feed array, and the embodiment of the previous paragraph is continued, the text display unit may obtain, according to the characters displayed by each line, a line feed array linefeed [3] ═ 30%, 60%, and 90% }, in the embodiment of the present invention, each value in the line feed array linefeed [3] is represented by a percentage, 30% in the line feed array linefeed [3] represents that the first line may display 30 characters, 60% represents that the first line and the second line may display 60 characters in total, 90% represents that the first line, the second line, and the third line may display 90 characters in total, and since the last line is not changed, the line feed array linefeed [3] includes only the percentage value of the previous three lines that need line feed.
Preferably, the step S204 of determining the currently scrolled display line number by the text display unit includes: and the character display unit determines the currently scrolled display line number according to the initial line number value, the height pixel of the character and the preset scrolling speed.
Specifically, in the specific embodiment of the present invention, the text display unit includes a variable LineNum, where the variable LineNum records the currently scrolled display line number, the initial value is 1, when scrolling to the second line, the value of LineNum is added with 1, and the scrolled line number can be determined according to the height pixel of the text and the preset scrolling speed; such as: the height pixel of the character is 20 pixels, the preset scrolling speed is 4 px/time, and thus a row is scrolled through five times of instructions, and the unit of the scrolling speed is pixel/time.
Preferably, the comparing the display line count with the broadcast line count in step S204 to determine an error value includes: and (4) making a difference between the display line number and the broadcast line number, and taking the difference value as an error value.
Specifically, if the difference value after the difference between the display line number and the broadcast line number is greater than zero, the speed of the actual line feed of the character display unit is greater than the broadcast speed of the voice broadcast unit; and if the difference value after the difference between the display line number and the broadcasting line number is less than zero, the speed of the actual line feed of the character display unit is less than the broadcasting speed of the voice broadcasting unit. Along with the extension of broadcasting time, the difference between the actual line feed speed of the text display unit and the broadcasting speed of the voice broadcasting unit is larger and larger, so that the text display and the voice broadcasting are asynchronous.
In the embodiment of the present invention, it is assumed that the broadcasting speed of the voice broadcasting unit is a fixed and unchangeable value, and in order to enable the text display and the voice broadcasting to be performed synchronously, the text display unit needs to adjust the scrolling speed of each line of displayed characters.
Preferably, when the error value exceeds the preset threshold in step S204, adjusting the scrolling speed of the displayed characters in each line includes: when the difference value between the display line number and the broadcast line number is larger than zero, the scrolling speed of the characters displayed on each line is reduced; and when the difference value between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the characters displayed on each line. In this way, even if the initial display scroll speed is set too fast or too slow, the scroll speed of the characters displayed in each line is finally converged and stabilized to one cell after a period of adjustment, and through such adjustment, the character display and the voice playing can be performed in synchronization.
Based on the same inventive concept, a calibration method for synchronous broadcast of text and voice is further provided in the embodiments of the present invention, as shown in fig. 3, the method includes:
s301, the voice broadcasting unit sends the acquired voice recognition character string to a character display unit;
s302, the voice broadcasting unit receives line feed information sent by the character display unit, and when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line, the broadcasting line feed is sent to the character display unit.
Preferably, after the voice broadcasting unit sends the acquired voice recognition character string to the text display unit and before the voice broadcasting unit receives the line feed information sent by the text display unit in the embodiment of the present invention, the method further includes: and the voice broadcasting unit starts to broadcast the acquired voice recognition character string after the preset time.
Specifically, as shown in fig. 1, the preset time in the embodiment of the present invention is T2-T0, so that the broadcast is started after the preset time, and the command delay error can be compensated well.
The following describes a calibration method for synchronous broadcast of text and voice according to an embodiment of the present invention in detail with reference to an embodiment.
In the embodiment of the invention, the basic unit for synchronous broadcast of the characters and the voice is line synchronization.
Assume that the voice broadcast speed is one character per second, and the voice broadcast speed is a fixed and unchanging value.
The specific embodiment of the invention takes a random voice question-answer as an example, and a questioner asks questions: "please introduce XXX? ". The voice broadcasting unit in the embodiment of the present invention obtains the voice recognition character string corresponding to the problem, and assumes that the voice recognition character string includes 100 characters, string is 100 (including punctuation marks), and the voice broadcasting unit sends string to the text display unit at time value T0, and the voice broadcasting unit starts broadcasting at time T2-T0, as shown in fig. 1.
As shown in fig. 1, after receiving string, the text display unit starts to automatically scroll the subtitles at a suitable random speed, where the unit of the random speed is pixel/time; meanwhile, the character display unit calculates the characters which can be displayed in each line in real time according to the width and the height of the character display unit, and assumes that each line displays at most 30 characters (different character display units have different conditions), so that 100 characters can display four lines in total.
As shown in fig. 1, the line feed information in the embodiment of the present invention is exemplified by a line feed array, the text display unit obtains, according to characters displayed in each line, a line feed array linefeed [3] = { 30%, 60%, 90% }, where 30% of the line feed array represents a line feed signal corresponding to a first line, 60% represents line feed information corresponding to a second line, 90% represents line feed information corresponding to a third line, and the last line is not line feed, and the text display unit sends the line feed array linefeed [3] to the voice broadcast unit at a time T3.
In addition, in the specific embodiment of the present invention, the text display unit includes a variable LineNum, where the variable LineNum records the currently scrolled display line number, the initial value is 1, when scrolling to the second line, the value of LineNum is added by 1, and the scrolled display line number can be determined according to the line number initial value, the height pixel of the text, and the preset scrolling speed; such as: the height pixel of the text is 20 pixels, and the preset scrolling speed is 4 px/time, so that one line is scrolled through five commands.
As shown in fig. 1, at a time value of T4, the voice broadcasting unit receives the line feed array linereception [3] sent by the text display unit, and the voice broadcasting unit can obtain its own broadcasting progress in real time during broadcasting, and when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line (that is, reaches each value in the line feed array linereception [3 ]), sends the broadcasting line number to the text display unit.
For example: recording the broadcasting line number as LineCMd, wherein the broadcasting line number LineCMd does not actually execute line changing, the broadcasting progress of the voice broadcasting unit is currently 25%, and when the broadcasting reaches 30%, the broadcasting line number is sent (LineCMd is 2, which indicates that the voice broadcasting unit has been broadcasted to the second line at the moment); when the broadcast reaches 60%, the broadcast line number is sent again (LineCmd ═ 3, which indicates that the voice broadcast unit has been played to the third line at this time); when the broadcast reaches 90%, the broadcast line number is sent again (LineCmd ═ 4, indicating that the voice broadcast unit has been played to the fourth line at this time).
As shown in fig. 1, the text display unit receives the number of broadcast lines at time T6, and completes the actual line feed operation at time T7. Specifically, in the embodiment of the present invention, after receiving the broadcast line count sent by the voice broadcast unit each time, the text display unit compares the received broadcast line count (the line count of line feed obtained by the voice broadcast unit theory) with the actual display line count determined by the text display unit, and determines an error value, for example: and (3) making a difference between the broadcasting line number and the display line number, determining the difference value as an error value, and adjusting the scrolling speed of the characters displayed in each line when the error value exceeds a preset threshold value.
When the difference value between the display line number and the broadcast line number is larger than zero in specific implementation, the scrolling speed of the characters displayed in each line is reduced; and when the difference value between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the characters displayed on each line.
Therefore, after a certain time, the character display and the voice broadcast are synchronized, and the scrolling speed of the characters displayed on each line of the character display unit is stabilized to a reasonable interval; the embodiment of the invention solves the problem of accumulated broadcasting errors caused by long-time broadcasting of voice broadcasting by adjusting the scrolling speed of the characters displayed on each line, thereby achieving the effect of real-time synchronization of the character voice.
Based on the same inventive concept, a calibration device for synchronous broadcast of text and voice is further provided in the embodiments of the present invention, as shown in fig. 4, including:
the character scrolling module 41 is configured to receive the voice recognition character string sent by the voice broadcasting unit, determine characters displayed in each line, and automatically scroll the characters displayed in each line at a preset scrolling speed;
the line feed information determining module 42 is configured to obtain line feed information corresponding to each line according to the characters displayed in each line, and send the line feed information to the voice broadcasting unit;
the information receiving module 43 is configured to receive the broadcast line number sent by the voice broadcast unit, where the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line;
and the synchronous calibration module 44 is configured to determine a currently scrolled display line number, compare the display line number with the broadcast line number, determine an error value, and adjust a scrolling speed of the displayed characters in each line when the error value exceeds a preset threshold value.
Based on the same inventive concept, an embodiment of the present invention further provides a calibration apparatus for synchronous broadcast of text and voice, as shown in fig. 5, including:
a character string sending module 51, configured to send the obtained voice recognition character string to a text display unit;
and the line feed feedback information determining module 52 is configured to receive the line feed information sent by the text display unit, and send the broadcast line number to the text display unit when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line.
Based on the same inventive concept, the specific embodiment of the present invention further provides a computer storage medium, where a program corresponding to the calibration method for synchronous broadcast of text and voice is stored in the computer storage medium.
Those skilled in the art will appreciate that the present invention includes apparatus directed to performing one or more of the operations described in the present application. These devices may be specially designed and manufactured for the required purposes, or they may comprise known devices in general-purpose computers. These devices have stored therein computer programs that are selectively activated or reconfigured. Such a computer program may be stored in a device (e.g., computer) readable medium, including, but not limited to, any type of disk including floppy disks, hard disks, optical disks, CD-ROMs, and magnetic-optical disks, ROMs (Read-Only memories), RAMs (Random Access memories), EPROMs (Erasable programmable Read-Only memories), EEPROMs (Electrically Erasable programmable Read-Only memories), flash memories, magnetic cards, or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a bus. That is, a readable medium includes any medium that stores or transmits information in a form readable by a device (e.g., a computer).
To sum up, the calibration method for synchronous text-to-speech broadcast according to the embodiment of the present invention includes: the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed; the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit; the text display unit receives the broadcast line number sent by the voice broadcast unit, wherein the broadcast line number is the line number sent when the actual broadcast progress of the voice broadcast unit reaches the line feed information corresponding to each line; the character display unit determines the current scrolling display line number, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value. According to the calibration method, the display line number is compared with the broadcast line number to determine an error value, and when the error value exceeds a preset threshold value, the rolling speed of the characters displayed on each line is adjusted, so that the accumulated time delay of drift errors caused by long-time playing can be calibrated, and further the user experience is increased.
The foregoing is only a partial embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (10)

1. A calibration method for synchronous broadcast of text and voice is characterized by comprising the following steps:
the character display unit receives the voice recognition character string sent by the voice broadcasting unit, determines the characters displayed in each line, and automatically scrolls the characters displayed in each line at a preset scrolling speed;
the character display unit obtains line feed information corresponding to each line according to the characters displayed by each line, and sends the line feed information to the voice broadcasting unit;
the text display unit receives the broadcasting line number sent by the voice broadcasting unit, wherein the broadcasting line number is the line number sent when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line;
and the character display unit determines the display line number currently scrolled to, compares the display line number with the broadcast line number, determines an error value, and adjusts the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.
2. The calibration method according to claim 1, wherein the text display unit receives the voice recognition character string sent by the voice broadcasting unit, and determines the characters displayed in each row, including:
the character display unit receives the voice recognition character string sent by the voice broadcasting unit, and determines characters displayed in each line according to the width and the height of the character display unit.
3. The calibration method of claim 1, wherein the determining, by the text display unit, the number of display lines currently scrolled to comprises:
and the character display unit determines the display line number scrolled to currently display according to the initial line number value, the height pixel of the character and the preset scrolling speed.
4. The calibration method of claim 1, wherein comparing the number of display lines to the number of broadcast lines to determine an error value comprises:
and taking the difference between the display line number and the broadcast line number as an error value.
5. The calibration method according to claim 4, wherein adjusting the scrolling speed of the displayed characters for each line when the error value exceeds a preset threshold comprises:
when the difference value between the display line number and the broadcast line number is larger than zero, the scrolling speed of the character is reduced;
and when the difference value between the display line number and the broadcast line number is less than zero, increasing the scrolling speed of the character.
6. A calibration method for synchronous broadcast of text and voice is characterized by comprising the following steps:
the voice broadcasting unit sends the acquired voice recognition character string to a character display unit;
the voice broadcasting unit receives the line feed information sent by the character display unit, and sends broadcasting lines to the character display unit when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line.
7. The calibration method according to claim 6, wherein after the voice broadcasting unit sends the acquired voice recognition character string to a text display unit, and before the voice broadcasting unit receives line feed information sent by the text display unit, the method further comprises:
and the voice broadcasting unit starts to broadcast the acquired voice recognition character string after preset time.
8. The utility model provides a calibration device that synchronous report of characters pronunciation which characterized in that includes:
the character rolling module is used for receiving the voice recognition character string sent by the voice broadcasting unit, determining the characters displayed in each line and automatically rolling the characters displayed in each line at a preset rolling speed;
the line feed information determining module is used for obtaining line feed information corresponding to each line according to the characters displayed by each line and sending the line feed information to the voice broadcasting unit;
the information receiving module is used for receiving the broadcasting line number sent by the voice broadcasting unit, wherein the broadcasting line number is the line number sent when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line;
and the synchronous calibration module is used for determining the number of display lines currently scrolled to, comparing the number of display lines with the number of broadcast lines, determining an error value, and adjusting the scrolling speed of the characters displayed on each line when the error value exceeds a preset threshold value.
9. The utility model provides a calibration device that synchronous report of characters pronunciation which characterized in that includes:
the character string sending module is used for sending the acquired voice recognition character string to the character display unit;
and the line feed back information determining module is used for receiving the line feed information sent by the character display unit, and sending the broadcasting line number to the character display unit when the actual broadcasting progress of the voice broadcasting unit reaches the line feed information corresponding to each line.
10. A computer storage medium storing a program corresponding to the calibration method according to any one of claims 1 to 5; and/or a program corresponding to the calibration method according to any one of claims 6 to 7 is stored.
CN201810150541.0A 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium Active CN108366182B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810150541.0A CN108366182B (en) 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810150541.0A CN108366182B (en) 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium

Publications (2)

Publication Number Publication Date
CN108366182A CN108366182A (en) 2018-08-03
CN108366182B true CN108366182B (en) 2020-07-07

Family

ID=63002416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810150541.0A Active CN108366182B (en) 2018-02-13 2018-02-13 Calibration method and device for synchronous broadcast of text voice and computer storage medium

Country Status (1)

Country Link
CN (1) CN108366182B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109413479A (en) * 2018-09-28 2019-03-01 四川长虹电器股份有限公司 The method that smart television voice Interaction Interface content of text is completely shown
CN110267080A (en) * 2019-04-30 2019-09-20 北京云迹科技有限公司 Data processing method and device for robot casting content synchronization
CN110232194B (en) * 2019-06-17 2024-04-09 安徽听见科技有限公司 Translation display method, device, equipment and readable storage medium
CN110910918B (en) * 2019-10-11 2021-03-23 广州优谷信息技术有限公司 Subtitle rolling control method, system and device
CN110991455B (en) * 2020-02-11 2023-05-05 上海肇观电子科技有限公司 Image text broadcasting method and equipment, electronic circuit and storage medium thereof
US11776286B2 (en) 2020-02-11 2023-10-03 NextVPU (Shanghai) Co., Ltd. Image text broadcasting
CN111917937A (en) * 2020-06-23 2020-11-10 广州优谷信息技术有限公司 Prompter display method, device, prompter system, operation terminal and storage medium
CN112423094A (en) * 2020-10-30 2021-02-26 广州佰锐网络科技有限公司 Double-recording service broadcasting method and device and storage medium
CN112632120A (en) * 2020-12-08 2021-04-09 深圳前海微众银行股份有限公司 Information processing method, device, equipment and computer storage medium
CN112887779B (en) * 2021-01-20 2022-11-18 杭州小众圈科技有限公司 Method, system and device for automatically rolling subtitles based on voice rhythm

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6130722A (en) * 1996-05-03 2000-10-10 Samsung Electronics Co., Ltd Closed-caption broadcasting and displaying method and apparatus suitable for syllable characters
CN102148937A (en) * 2010-02-05 2011-08-10 新奥特(北京)视频技术有限公司 Realization method and realization device for tail screen staying in scrolling design
CN102663988A (en) * 2012-04-28 2012-09-12 广东威创视讯科技股份有限公司 Method, device and system for broadcasting subtitles
US8826354B2 (en) * 2010-12-01 2014-09-02 At&T Intellectual Property I, L.P. Method and system for testing closed caption content of video assets

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010026609A (en) * 1999-09-07 2001-04-06 나현민 Audio play media device for combining automatic caption information extracting and storing function with character displaying function
WO2002080143A1 (en) * 2001-03-29 2002-10-10 Koninklijke Philips Electronics N.V. Synchronise an audio cursor and a text cursor during editing
JP2004266577A (en) * 2003-02-28 2004-09-24 Casio Comput Co Ltd Media synchronously reproducing apparatus, server and media synchronous reproduction processing program, and server processing program
JP3945778B2 (en) * 2004-03-12 2007-07-18 インターナショナル・ビジネス・マシーンズ・コーポレーション Setting device, program, recording medium, and setting method
CN101500127A (en) * 2008-01-28 2009-08-05 德信智能手机技术(北京)有限公司 Method for synchronously displaying subtitle in video telephone call
CN101382937B (en) * 2008-07-01 2011-03-30 深圳先进技术研究院 Multimedia resource processing method based on speech recognition and on-line teaching system thereof
JP2010136067A (en) * 2008-12-04 2010-06-17 Sony Corp Data processing device, data processing method, and program
CN101808202B (en) * 2009-02-18 2013-09-04 联想(北京)有限公司 Method, system and computer for realizing sound-and-caption synchronization in video file
KR101709053B1 (en) * 2010-04-01 2017-02-22 티제이커뮤니케이션 주식회사 Caption data structure and caption player for synchronizing syllables between a sound source and caption data
US8564721B1 (en) * 2012-08-28 2013-10-22 Matthew Berry Timeline alignment and coordination for closed-caption text using speech recognition transcripts
CN104038804B (en) * 2013-03-05 2017-09-29 三星电子(中国)研发中心 Captioning synchronization apparatus and method based on speech recognition
CN103297711B (en) * 2013-06-19 2016-11-02 青海汉拉信息科技股份有限公司 Chinese marks Chinese language subtitles recorded broadcast equipment the most in real time
CN103561217A (en) * 2013-10-14 2014-02-05 深圳创维数字技术股份有限公司 Method and terminal for generating captions
CN106033678A (en) * 2015-03-18 2016-10-19 珠海金山办公软件有限公司 Playing content display method and apparatus thereof
CN105791713A (en) * 2016-03-21 2016-07-20 安徽声讯信息技术有限公司 Intelligent device for playing voices and captions synchronously
CN106385548A (en) * 2016-09-05 2017-02-08 努比亚技术有限公司 Mobile terminal and method for generating video captions
CN106340291A (en) * 2016-09-27 2017-01-18 广东小天才科技有限公司 Bilingual subtitle production method and system
CN106340294A (en) * 2016-09-29 2017-01-18 安徽声讯信息技术有限公司 Synchronous translation-based news live streaming subtitle on-line production system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6130722A (en) * 1996-05-03 2000-10-10 Samsung Electronics Co., Ltd Closed-caption broadcasting and displaying method and apparatus suitable for syllable characters
CN102148937A (en) * 2010-02-05 2011-08-10 新奥特(北京)视频技术有限公司 Realization method and realization device for tail screen staying in scrolling design
US8826354B2 (en) * 2010-12-01 2014-09-02 At&T Intellectual Property I, L.P. Method and system for testing closed caption content of video assets
CN102663988A (en) * 2012-04-28 2012-09-12 广东威创视讯科技股份有限公司 Method, device and system for broadcasting subtitles

Also Published As

Publication number Publication date
CN108366182A (en) 2018-08-03

Similar Documents

Publication Publication Date Title
CN108366182B (en) Calibration method and device for synchronous broadcast of text voice and computer storage medium
CN110418170B (en) Detection method and device, storage medium and electronic device
CN105100876B (en) A kind of playback method and device of Streaming Media
US8644085B2 (en) Duty cycle distortion correction
CN105681874A (en) Network video online playing method and device
CN106604065B (en) A kind of system and method realized multi-screen and play video with screen
US11595731B2 (en) Implementation method and system of real-time subtitle in live broadcast and device
CN112153082B (en) Method and device for smoothly displaying real-time streaming video picture in android system
CN111954050B (en) Method and system for synchronizing videos among multiple devices
KR102566550B1 (en) Method of display playback synchronization of digital contents in multiple connected devices and apparatus using the same
CN103648011A (en) Audio and video synchronization device and method based on HLS protocol
CN103428584A (en) Method and device for keeping synchronization of audio and video on multimedia playing platform
CN108156500B (en) Multimedia data time correction method, computer device and computer readable storage medium
US8913190B2 (en) Method and apparatus for regenerating a pixel clock signal
CN108174267A (en) The sending device of interactive information, method and computer readable storage medium in live streaming
CN114679665A (en) Bluetooth headset audio and video synchronization method and device, electronic equipment and storage medium
BR112022002268A2 (en) Frame rate conversion
GB2604504A (en) Cloud-based media synchronization system for generating a synchronization interface and performing media synchronization
TW201933880A (en) Method, apparatus, and system for displaying live broadcast messages
US20190342659A1 (en) Correcting for latency of an audio chain
CN112866755B (en) Video playing method and device, electronic equipment and storage medium
CN104333778A (en) Dynamic playing buffering processing method and electronic device
US9262998B2 (en) Display system and data transmission method thereof
CN115484483A (en) Streaming media audio and video synchronization method, terminal equipment, control device and medium
CN104661114A (en) Network multimedia resource consumption control method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant