CN108259986B - Multi-channel audio playing method and device, electronic equipment and storage medium - Google Patents

Multi-channel audio playing method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN108259986B
CN108259986B CN201810044616.7A CN201810044616A CN108259986B CN 108259986 B CN108259986 B CN 108259986B CN 201810044616 A CN201810044616 A CN 201810044616A CN 108259986 B CN108259986 B CN 108259986B
Authority
CN
China
Prior art keywords
audio
language
channel
data
stream data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810044616.7A
Other languages
Chinese (zh)
Other versions
CN108259986A (en
Inventor
麦土银
曾路明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vidaa Netherlands International Holdings BV
Original Assignee
Hisense Electronic Technology Shenzhen Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hisense Electronic Technology Shenzhen Co ltd filed Critical Hisense Electronic Technology Shenzhen Co ltd
Priority to CN201810044616.7A priority Critical patent/CN108259986B/en
Publication of CN108259986A publication Critical patent/CN108259986A/en
Application granted granted Critical
Publication of CN108259986B publication Critical patent/CN108259986B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4852End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4856End-user interface for client configuration for language selection, e.g. for the menu or subtitles

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Stereophonic System (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a multi-channel audio playing method and device, electronic equipment and a computer readable storage medium, wherein the scheme comprises the following steps: according to an audio playing request, acquiring audio stream data indicated by the audio playing request; analyzing the audio stream data, and determining audio languages corresponding to a plurality of sound channels in the audio stream data; judging whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information; and if so, controlling to output the audio data of the sound channel with the same audio language indicated by the language configuration information, and closing the output of other sound channels. The technical scheme provided by the invention can automatically select one sound channel from the plurality of sound channels to play without manual operation, thereby solving the problem of stress, not increasing the operation of a user, and solving the defect of complicated operation of the existing audio playing.

Description

Multi-channel audio playing method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of audio processing technologies, and in particular, to a method and an apparatus for playing multi-channel audio, an electronic device, and a computer-readable storage medium.
Background
Digital television services have been widely spread worldwide, and television multi-language coexistence is a very important function, so that a common audio/video stream usually has multiple channels of audio, each channel of audio corresponds to one language, and each channel of audio is distinguished by an independent PID (Packet Identifier).
At present, the television service has gradually changed to the all-digital television, and the audio language in the three major standards (DVB, ISDB, ATSC) of the digital television allows the Dual Mono (Dual channel) situation. Dual Mono means that the audio data of the left and right channels are different and share one PID. When the left and right channels transmit different audio data and the channel output of the television is set to Stereo (Stereo), two sounds are heard at the same time, and the sounds are stressed. The prior art provides for a user to select which channel to play by setting a language menu switch, but prior to selection, two sounds will be played immediately if the default channel output is set to stereo. Or when the common audio and video program and the Dual Mono program are switched, the sound channel output can not be correctly and automatically switched, and two sounds can be played.
The phenomenon of stress is avoided by manually selecting one sound channel for output by the user, the user operation is required, the steps of audio and video playing operation are added, and the playing operation is relatively complex.
Disclosure of Invention
In order to solve the problem that in the related art, a user needs to rely on user operation to increase the steps of audio and video playing operation to avoid the phenomenon of stress by manually selecting a sound channel for output, so that the playing operation is relatively complicated, the invention provides a multi-channel audio playing method.
In one aspect, the present invention provides a method for playing multi-channel audio, where the method includes:
according to an audio playing request, acquiring audio stream data indicated by the audio playing request;
analyzing the audio stream data, and determining audio languages corresponding to a plurality of sound channels in the audio stream data;
judging whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information;
and if so, controlling to output the audio data of the sound channel with the same audio language indicated by the language configuration information, and closing the output of other sound channels.
In another aspect, the present invention provides a multi-channel audio playing apparatus, including:
the file acquisition module is used for acquiring audio stream data indicated by the audio playing request according to the audio playing request;
the language obtaining module is used for analyzing the audio stream data and determining audio languages corresponding to a plurality of sound channels in the audio stream data;
the language matching module is used for judging whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information;
and the audio output module is used for controlling the output of the audio data of the channel which is the same as the audio language indicated by the language configuration information when the audio language corresponding to at least one channel is the same as the audio language indicated by the language configuration information, and closing the output of other channels.
Further, the present invention provides an electronic device, including:
a processor;
a memory for storing processor-executable instructions;
wherein the processor is configured to perform the above multi-channel audio playing method.
In addition, the present invention also provides a computer-readable storage medium storing a computer program, which is executable by a processor to perform the above-mentioned multi-channel audio playing method.
The technical scheme provided by the embodiment of the invention can have the following beneficial effects:
according to the technical scheme provided by the invention, the audio data of the sound channel which is the same as the audio language indicated by the language configuration information is controlled to be output according to the audio languages corresponding to the sound channels in the audio stream data, and the output of other sound channels is closed, so that one sound channel can be automatically selected from the sound channels to be played without manual operation, the problem of accent is solved, the operation of a user is not increased, and the defect of complicated operation of the existing audio playing is overcome.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
FIG. 1 is a schematic illustration of an implementation environment in accordance with the present invention;
FIG. 2 is a block diagram illustrating an apparatus in accordance with an exemplary embodiment;
FIG. 3 is a flow chart illustrating a method of multi-channel audio playback in accordance with an exemplary embodiment;
FIG. 4 is a detailed flowchart of step 310 of the corresponding embodiment of FIG. 3;
FIG. 5 is a detailed flowchart of step 330 of the corresponding embodiment of FIG. 3;
FIG. 6 is a detailed flowchart illustrating a multi-channel audio playing method according to another exemplary embodiment
FIG. 7 is a block diagram illustrating an apparatus for playing multi-channel audio according to an exemplary embodiment;
FIG. 8 is a detailed block diagram of a file acquisition module of the corresponding embodiment of FIG. 7;
fig. 9 is a detailed block diagram of the language acquisition module of the corresponding embodiment of fig. 7.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.
FIG. 1 is a schematic diagram illustrating an implementation environment to which the present invention relates, according to an exemplary embodiment. The implementation environment to which the present invention relates includes a smart-player device 110. The smart playing device 110 may play the audio stream data by using the multi-channel audio playing method provided by the present invention. The audio stream data may be stored in a local database of the smart-player device 110.
The implementation environment will also include a data source that provides data, i.e., audio stream data, as desired. In particular, in this implementation environment, the data source may be server 130. The smart playing device 110 may obtain the audio stream data from the server 130, and then play the obtained audio stream data by using the multi-channel audio playing method provided by the present invention.
It should be noted that the multi-channel audio playing method of the present invention is not limited to deploying corresponding processing logic in the smart playing device 110, and may also be deployed in other machines. For example, processing logic of an audio playing method of multiple channels is deployed in a terminal device having data processing capability, and the like.
Fig. 2 is a block diagram illustrating a smart-player device 110 according to an example embodiment. The smart playing device 110 may be, for example, a smart television set-top box, a smart stereo, and the like.
Referring to fig. 2, the smart player device 110 may have a relatively large difference due to different configurations or performances, and may include one or more Central Processing Units (CPUs) 222 (e.g., one or more processors) and a memory 232, one or more storage media 230 (e.g., one or more mass storage devices) storing applications 242 or data 244. Memory 232 and storage medium 230 may be, among other things, transient or persistent storage. The program stored in the storage medium 230 may include one or more modules (not shown), each of which may include a series of instruction operations for the smart play device 110. Still further, the central processor 222 may be configured to communicate with the storage medium 230 to execute a series of instruction operations in the storage medium 230 on the smart player device 110. Smart-play device 110 may also include one or more power supplies 226, one or more wired or wireless network interfaces 250, one or more input-output interfaces 258, and/or one or more operating systems 241, such as Android, Linux, Windows Server, Mac OS XTM, UnixTM, FreeBSDTM, and so forth. The steps performed by the smart play device 110 in the embodiments shown in fig. 3, 4 and 5 described below may be based on the structure of the smart play device 110 shown in fig. 2.
It will be understood by those skilled in the art that all or part of the steps in implementing the embodiments described below may be implemented by hardware, or may be implemented by program instructions, and that the program may be stored in a computer-readable storage medium, such as a read-only memory, a magnetic or optical disk, and the like.
Fig. 3 is a flow chart illustrating a method of multi-channel audio playback according to an exemplary embodiment. The multi-channel audio playing method is applicable to a range and an execution subject, for example, the method is used for the smart playing device 110 of the implementation environment shown in fig. 1. As shown in fig. 3, the multi-channel audio playing method may be performed by the smart playing device 110, and may include the following steps.
In step 310, according to an audio playing request, obtaining audio stream data indicated by the audio playing request;
in this case, the audio stream data may contain a plurality of audio data (at least two audio data), which are independent of each other, but share a PID (packet identifier). Therefore, when the file corresponding to the PID is called, multiple audio data are called at the same time. The audio stream data may be a common Dual Mono audio file or may contain more channels. For example, if a certain tv program contains chinese dialogue audio and english dialogue audio, the chinese dialogue audio and the english dialogue audio are used as two audio data in the audio stream data. For ease of understanding, the present invention is described with the audio stream data being a Dual Mono audio file, and more channels are implemented with reference to Dual channels.
It should be explained that when multiple audio channels are included in the normal audio stream data, each audio channel has an independent PID. However, for a special Dual Mono audio file, the left and right channels transmit different audio data, and the two channels of audio data share one PID. Therefore, when the audio file corresponding to the PID is requested to be called, the user can hear two sounds at the same time due to the fact that the left and right channels transmit different audio data, and the problem of audio accent exists.
For television programs of the ISDB (japanese television standard) and HATSC (european television standard) standards, there are two-channel cases of audio languages, i.e., the left and right channels transmit audio data of different languages. For example, the left channel carries english conversation content and the right channel carries chinese conversation content. The english conversation contents and the chinese conversation contents share one PID. Therefore, when the audio file corresponding to the PID is called, English conversation content and Chinese conversation content are output at the same time, and the problem of accent occurs. The prior art outputs audio data of one channel by a user manually selecting it. Based on the above defects, the present invention provides a multi-channel audio playing method, which solves the problem of accent when playing audio stream data.
Taking playing a television program as an example, a user triggers selection of the television program, and the smart playing device 110 receives a video playing request and an audio playing request at the same time. The video playing request and the audio playing request carry the television program information selected by the user. Then, the smart playing device 110 downloads the TS stream file corresponding to the television program from the server 130, and parses the video stream data and the audio stream data from the TS stream file. The audio stream data may be a multi-channel audio file, and in one embodiment, the audio stream data is a DualMono audio file, the left and right channels transmit audio data in different languages, and the audio data of the left and right channels share a PID.
In an exemplary embodiment, as shown in fig. 4, the step 310 specifically includes:
in step 311, an audio play request is received;
assuming that the audio playing device 110 is a Digital Television (DTV), the user triggers the selection of a tv program, and the digital tv receives a certain tv program and sends a playing instruction through a user-side remote controller. As required, the audio playing device 110 receives the audio playing request and also receives the video playing request sent by the user-side remote controller.
In step 312, according to the file identification information carried by the audio playing request, audio stream data corresponding to the file identification information is obtained.
The file identification information may be a PID, the audio playing request may carry the PID, and the digital television analyzes audio stream data corresponding to the PID from a TS stream file downloaded by the server 130 by setting a corresponding PID filter, or obtains the audio stream data corresponding to the PID from an audio/video stream stored in the digital television.
In step 330, parsing the audio stream data and determining audio languages corresponding to a plurality of channels in the audio stream data;
the analyzing of the audio stream data refers to decoding and restoring the obtained audio stream data to obtain information such as the attribute of the audio stream data carried in the audio stream data, for example, the number of channels of the audio stream data. The audio language refers to a language adopted by audio data transmitted by a sound channel, such as chinese, english, japanese, korean, and the like.
Specifically, the audio stream data may be dual channels, and the left and right channels transmit audio data of different languages, for example, the left and right channels respectively include chinese audio and english audio, so that by parsing the audio stream data, audio languages corresponding to the left channel and the right channel respectively in the audio stream data are obtained from a parsing result, for example, the left channel is chinese audio and the right channel is english audio.
In an exemplary embodiment, as shown in fig. 5, the step 330 specifically includes:
in step 331, parsing the audio stream data to obtain an audio component type and an audio encoding mode;
specifically, the smart playing device 110 may be a digital television, and a code stream SI (Service Information)/PSI (Program Specific Information) table decoding module disposed in the smart playing device 110 itself may analyze various table data Information of the code stream, such as video, audio, and picture table data. In the ISDB standard, it is possible to determine the type of Audio Component by parsing Audio Component Descriptor (Audio Component Descriptor) data in the SI/PSI table, the Audio Component Descriptor data including the Component _ type.
The digital television decoding module disposed in the smart playing device 110 may decode digital television code stream data, and parse a data header of a PES (Packetized Elementary Streams) related thereto. The PES header includes an Audio coding mode (Audio coding mode).
In step 332, if the audio component type and/or the audio encoding mode indicate that the audio stream data is audio data in which multiple channels transmit different audio languages, the audio language corresponding to each of the multiple channels is obtained from the parsing result of the audio stream data.
Specifically, if the audio component type in the audio component descriptor indicates that the audio stream data respectively transmits audio data of different languages for a plurality of channels, and the audio data of the plurality of channels share one PID, it indicates that an accent occurs when playing the audio stream data, and the audio languages respectively corresponding to the plurality of channels are obtained from the audio component descriptor.
In order to improve the accuracy of determining multiple channels, if the audio coding mode in the PES header indicates that the audio stream data is audio data in multiple channels and respectively transmits audio data in different languages, and the audio data in the multiple channels share one PID, it indicates that an accent occurs when playing the audio stream data, and the audio languages corresponding to the multiple channels are obtained from the audio component descriptor.
That is, if any one of the audio component type and the audio encoding method indicates that the audio stream data is a plurality of channels and transmits audio data in different languages, the multiple channels share one PID, and the audio data in one channel needs to be selected according to the language configuration information for output. Conversely, if both the audio component type and the audio coding mode indicate that the multiple channels do not share a PID, the default output channel may be set to stereo.
Wherein, the step 332 specifically includes:
and if the audio component type and/or the audio coding mode carry identification codes used for marking the audio stream data as a plurality of sound channels to respectively transmit audio data in different languages, obtaining audio languages respectively corresponding to the plurality of sound channels from the analysis result of the audio stream data.
For example, the identification code for tagging two channels to respectively transmit audio data in different languages may be 0x02, taking audio stream data as a two-channel audio file. If the component _ type is equal to 0x02, it can be considered that the obtained audio stream data is dual channels, the left and right channels respectively transmit audio data of different languages, and the two channels of audio data share one PID, and the audio languages respectively corresponding to the dual channels are obtained from the parsing result (e.g. audio component descriptor) of the dual-channel audio file.
If component _ type is not equal to 0x02, it is further possible to determine whether Audio coding mode is equal to 0. Wherein "0" is used to mark the two-channel audio file as the identification code for the two channels to respectively transmit the audio data of different languages in the audio coding mode. As long as any one of component _ type 0x02 and Audio coding mode 0 is satisfied, the acquired Audio stream data can be considered as one PID shared by two channels.
In step 350, determining whether there is at least one channel corresponding to the same audio language as the audio language indicated by the language configuration information;
the language configuration information is generated in advance by the smart player device 110 according to the language preference of the user, and the language configuration information is used for indicating the language preferred by the user. For example, the smart player device 110 may determine the language preference of the user according to the audio language of the television program watched by the user history, and then generate the language configuration information. The language configuration information may be stored in a local database of the smart play device 110, or may be obtained by the smart play device 110 from the server 130.
Specifically, the language configuration information indicates a language preferred by the user, and the smart playing device 110 determines whether an audio language identical to the language preferred by the user exists according to the audio languages respectively corresponding to the plurality of channels. In special cases, when there is no audio language identical to the language preferred by the user, audio data of one audio language can be randomly selected for output.
In an exemplary embodiment, before the step 350, the following steps are further included: and reading language configuration information which is generated in advance according to language selection triggered by a user.
It should be noted that, the user may set a language preferred by the user in advance, and the smart playing device 110 generates the language configuration information and stores the language configuration information in the local database according to the selection of the preferred language triggered by the user in advance. Therefore, when a proper audio language needs to be selected for a user, the language configuration information is read from the database, and whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information or not is judged according to the audio languages corresponding to the sound channels.
In an exemplary embodiment, the step 350 specifically includes: and comparing the language identification code carried by the language configuration information with the language identification codes of the audio languages corresponding to the multiple sound channels respectively, and judging whether the language identification code of the audio language corresponding to at least one sound channel is matched with the language identification code carried by the language configuration information.
Specifically, the language identification codes of the Audio languages corresponding to the multiple sound channels can be obtained by analyzing the Audio Component Descriptor data in the SI/PSI table. When the audio component Descriptor is a binaural, the language identification code of the left channel audio language is ISO _639_ language _ code, and the language identification code of the right channel audio language is ISO _639_ language _ code _2, which can be obtained from the audio component Descriptor data. Assuming that the language identification code carried by the language configuration information is ISO _639_ language _ code _2, the language identification code is compared with the language identification codes of the left and right channels to obtain the language identification code consistent with the language identification code of the right channel, so that the channel where the audio language matched with the language preferred by the user is located can be screened out. Thereafter, the corresponding output may be set to the right channel, and the output of the left channel may be turned off.
In step 370, if yes, the control unit outputs the audio data of the channel in the same audio language indicated by the language configuration information and turns off the output of the other channels.
Taking the example that the audio stream data is a two-channel audio file, the left channel and the right channel respectively transmit audio data of different languages, and assuming that the channel which is the same as the audio language indicated by the language configuration information is the left channel, the audio data of the left channel is controlled to be output, and the output of the audio data of the right channel is closed. Similarly, for multi-channel, the control unit outputs audio data of the same channel as the audio language indicated by the language configuration information, and turns off the output of audio data of other channels.
In the prior art, when an audio file with a plurality of channels is played, the plurality of channels are stressed, so that the user needs to manually select and output audio data of one channel, and the operation is complicated. According to the technical scheme provided by the above exemplary embodiment of the present invention, according to the audio languages corresponding to the multiple channels in the audio stream data, the audio data of the channel which is the same as the audio language indicated by the language configuration information is controlled to be output, and the output of other channels is closed, so that one channel can be automatically selected from the multiple channels to be played without manual operation, thereby not only solving the problem of accent, but also not increasing the operation of a user, and solving the defect of complicated operation of the current audio playing.
Fig. 6 is a detailed flowchart of a multi-channel audio playing method according to an exemplary embodiment of the present invention. In the exemplary embodiment, for example, the two-channel audio is played, and the smart playing device 110 may be a smart television. As shown in fig. 6, the smart player device 110 receives a request for playing a tv program (S601), and turns on a PID filter according to the PID of the requested program to obtain an audio file with a specified PID (S602). The code stream SI/PSI decoding table module analyzes the interface data (S603), the digital television decoding module analyzes the PES data (S604), and saves the PES decoded data (S605). The code stream SI/PSI demapping module parses the associated Audio Component Descriptor data (S606), and if it is determined that Component _ type in the Audio Component Descriptor data is 0x02(S607), it confirms that the current Audio file is a binaural (S610). On the contrary, the component _ type is not equal to 0x02, and the bitstream information in the PES decoded data saved after parsing is further acquired (S608). If the Audio coding mode in the bitstream information is judged to be 0(S609), it is confirmed that the current Audio file is a binaural (S610). If the Audio coding mode is not equal to 0, the channel output is set to stereo (S616).
After confirming that the current Audio file is a binaural (S610), the favorite language set by the user is read (S611), and it is determined whether the favorite language is the same as the language ISO _639_ language _ code corresponding to the left channel in the Audio Component Descriptor data (S612), and if so, the channel setting is output as a left channel (S614). On the contrary, if it is different from the language of the left channel, it is judged whether the preferred language is the same as the language ISO _639_ language _ code _2 corresponding to the right channel in the AudioComponent Descriptor data (S613), and if so, the channel setting output is the right channel (S615). If the language is also different from the right channel, the channel setting is output as a left channel (S614). After the setting of the output channels is completed, the television sound is output (S617). According to the scheme provided by the invention, a user can automatically select one sound channel to output without manually selecting the sound channel, so that the phenomenon of accent is avoided.
The following is an embodiment of the apparatus of the present invention, which can be used to execute an embodiment of a multi-channel audio playing method executed by the aforementioned smart playing device 110 of the present invention. For details that are not disclosed in the embodiments of the apparatus of the present invention, please refer to the embodiments of the multi-channel audio playing method of the present invention.
Fig. 7 is a block diagram illustrating an apparatus for playing multi-channel audio, which may be used in the smart player device 110 of the implementation environment shown in fig. 1, to perform all or part of the steps of the multi-channel audio playing method shown in any one of fig. 3 to 6 according to an exemplary embodiment. As shown in fig. 7, the method and apparatus include, but are not limited to: a file acquisition module 710, a language acquisition module 730, a language matching module 750, and an audio output module 770.
The file obtaining module 710 is configured to obtain, according to an audio playing request, audio stream data indicated by the audio playing request;
a language obtaining module 730, configured to analyze the audio stream data, and determine audio languages corresponding to multiple channels in the audio stream data;
the language matching module 750 is configured to determine whether an audio language corresponding to at least one channel is the same as the audio language indicated by the language configuration information;
and the audio output module 770 is configured to, when there is an audio language corresponding to at least one channel that is the same as the audio language indicated by the language configuration information, control to output the audio data of the channel that is the same as the audio language indicated by the language configuration information, and turn off the output of other channels.
The implementation process of the functions and actions of each module in the above device is specifically described in the implementation process of the corresponding step in the above multi-channel audio playing method, and is not described herein again.
The file acquisition module 710 may be, for example, one of the physical structures of the wired or wireless network interface 250 of fig. 2.
The language obtaining module 730, the language matching module 750 and the audio output module 770 can also be functional modules for executing corresponding steps in the above-mentioned multi-channel audio playing method. It is understood that these modules may be implemented in hardware, software, or a combination of both. When implemented in hardware, these modules may be implemented as one or more hardware modules, such as one or more application specific integrated circuits. When implemented in software, the modules may be implemented as one or more computer programs executing on one or more processors, such as programs stored in memory 232 for execution by central processor 222 of FIG. 2.
In an exemplary embodiment, as shown in fig. 8, the file obtaining module 710 includes:
a request receiving unit 711 for receiving an audio play request;
the file obtaining unit 712 is configured to obtain, according to the file identification information carried in the audio playing request, audio stream data corresponding to the file identification information.
Further, as shown in fig. 9, the language obtaining module 730 includes:
a file parsing unit 731, configured to parse the audio stream data to obtain an audio component type and an audio encoding mode;
a language obtaining unit 732, configured to obtain, from a parsing result of the audio stream data, audio languages corresponding to multiple channels when the audio component type and/or the audio coding mode indicate that the audio stream data is audio data in which multiple channels transmit different languages, respectively.
The language obtaining unit 732 includes:
and the language obtaining subunit is used for obtaining the audio languages corresponding to the multiple sound channels from the analysis result of the audio stream data when the audio component type and/or the audio coding mode carry the identification codes used for marking the audio stream data as the multiple sound channels to transmit the audio data in different languages respectively.
Further, the multi-channel audio playing device provided by the invention further comprises:
and the information reading module is used for reading the language configuration information, and the language configuration information is generated according to the language selection triggered by the user in advance.
Further, the language matching module 750 specifically includes:
the language comparison unit is used for comparing the language identification code carried by the language configuration information with the language identification codes of the audio languages corresponding to the multiple sound channels respectively; and judging whether the language identification code of the audio language corresponding to at least one sound channel is matched with the language identification code carried by the language configuration information.
Optionally, the present invention further provides an electronic device, which can be used in the smart playing device 110 of the implementation environment shown in fig. 1 to execute all or part of the steps of the multi-channel audio playing method shown in any one of fig. 3 to 6. The electronic device includes:
a processor;
a memory for storing processor-executable instructions;
wherein the processor is configured to perform the multi-channel audio playing method according to the above exemplary embodiment.
The specific manner in which the processor of the electronic device performs operations in this embodiment has been described in detail in the embodiment of the multi-channel audio playing method, and will not be elaborated here.
In an exemplary embodiment, a storage medium is also provided that is a computer-readable storage medium, such as may be transitory and non-transitory computer-readable storage media, including instructions. The storage medium stores a computer program that can be executed by the central processing unit 222 of the smart play device 110 to implement the multi-channel audio playing method.
It will be understood that the invention is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the invention is limited only by the appended claims.

Claims (7)

1. A method for multi-channel audio playback, the method comprising:
receiving an audio playing request, and acquiring audio stream data corresponding to file identification information according to the file identification information carried by the audio playing request; the audio stream data comprises audio data of at least two sound channels, and the audio playing request comprises file identification information of the audio data;
analyzing the audio stream data by a table-decoding module based on code stream service information or program specific information to obtain an audio component type corresponding to an audio component descriptor, and decoding the audio stream data to obtain an audio coding mode in a packed original stream;
if the audio component type and/or the audio coding mode indicate that the audio stream data are audio data with a plurality of sound channels respectively transmitting different audio languages, obtaining the audio languages respectively corresponding to the sound channels from the analysis result of the audio stream data;
judging whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information or not based on the file identification information;
if so, controlling to output the audio data of the sound channel with the same audio language indicated by the language configuration information, and closing the output of other sound channels;
and if the audio language corresponding to the sound channel is not the same as the audio language indicated by the language configuration information, randomly selecting audio data of one audio language for output.
2. The method of claim 1, wherein if the audio component type and/or audio coding mode indicates that the audio stream data is a plurality of channels and transmits audio data in different audio languages, obtaining audio languages corresponding to the plurality of channels from the parsing result of the audio stream data comprises:
and if the audio component type and/or the audio coding mode carry identification codes used for marking the audio stream data as a plurality of sound channels to respectively transmit audio data of different languages, obtaining the audio languages respectively corresponding to the sound channels from the analysis result of the audio stream data.
3. The method of claim 1, wherein before determining whether the audio language corresponding to the at least one channel is the same as the audio language indicated by the language configuration information, the method further comprises:
and reading language configuration information which is generated in advance according to language selection triggered by a user.
4. The method of claim 1, wherein the determining whether the audio language corresponding to the at least one channel is the same as the audio language indicated by the language configuration information comprises:
comparing the language identification code carried by the language configuration information with the language identification codes of the audio languages respectively corresponding to the plurality of sound channels;
and judging whether the language identification code of the audio language corresponding to at least one sound channel is matched with the language identification code carried by the language configuration information.
5. An apparatus for multi-channel audio playback, the apparatus comprising:
the file acquisition module is used for receiving an audio playing request and acquiring audio stream data corresponding to file identification information according to the file identification information carried by the audio playing request; the audio stream data comprises audio data of at least two sound channels, and the audio playing request comprises file identification information of the audio data;
the file analysis unit is used for analyzing the audio stream data by a table-solving module based on the code stream service information or the program specific information to obtain an audio component type corresponding to an audio component descriptor, and decoding the audio stream data to obtain an audio coding mode in a packed original stream;
a language obtaining unit, configured to obtain, from an analysis result of the audio stream data, audio languages corresponding to multiple channels if the audio component type and/or the audio coding mode indicate that the audio stream data is audio data in which multiple channels transmit different audio languages respectively;
the language matching module is used for judging whether the audio language corresponding to at least one sound channel is the same as the audio language indicated by the language configuration information or not based on the file identification information;
the audio output module is used for controlling the output of the audio data of the channel which is the same as the audio language indicated by the language configuration information and closing the output of other channels when the audio language corresponding to at least one channel is the same as the audio language indicated by the language configuration information; and if the audio language corresponding to the sound channel is not the same as the audio language indicated by the language configuration information, randomly selecting audio data of one audio language for output.
6. An electronic device, characterized in that the electronic device comprises:
a processor;
a memory for storing processor-executable instructions;
wherein the processor is configured to perform the multi-channel audio playback method of any one of claims 1-4.
7. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, which is executable by a processor to perform the multi-channel audio playback method of any one of claims 1 to 4.
CN201810044616.7A 2018-01-17 2018-01-17 Multi-channel audio playing method and device, electronic equipment and storage medium Active CN108259986B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810044616.7A CN108259986B (en) 2018-01-17 2018-01-17 Multi-channel audio playing method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810044616.7A CN108259986B (en) 2018-01-17 2018-01-17 Multi-channel audio playing method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN108259986A CN108259986A (en) 2018-07-06
CN108259986B true CN108259986B (en) 2021-06-01

Family

ID=62726678

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810044616.7A Active CN108259986B (en) 2018-01-17 2018-01-17 Multi-channel audio playing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN108259986B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109451417B (en) * 2018-11-29 2024-03-15 广州艾美网络科技有限公司 Multichannel audio processing method and system
CN113905119B (en) * 2020-06-22 2024-06-04 阿里巴巴集团控股有限公司 Terminal bracket, control method thereof, audio processing method, audio processing system, electronic device and computer readable storage medium
CN112040388A (en) * 2020-09-03 2020-12-04 北京华宇轩罡教育科技有限公司 Method, device and equipment for playing audio of left and right sound channels

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1540983A (en) * 2003-03-25 2004-10-27 株式会社东芝 Informationj recording medium, information recording/playback method and information recording/playback appts.
CN101044573A (en) * 2004-08-19 2007-09-26 索尼株式会社 Reproduction device, reproduction method, reproduction program, recording medium, and data structure
CN101136232A (en) * 2007-10-15 2008-03-05 殷亮 Double subtitling double track data media and player having parent language of foreign languages
CN102917241A (en) * 2011-08-04 2013-02-06 瀚宇彩晶股份有限公司 Automatic detection method of vocal tract language name and television using same
CN105898617A (en) * 2016-04-28 2016-08-24 广东欧珀移动通信有限公司 Method and device for matching with audio track of video play
CN106464953A (en) * 2014-04-15 2017-02-22 克里斯·T·阿纳斯塔斯 Binaural audio systems and methods

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100206786B1 (en) * 1996-06-22 1999-07-01 구자홍 Multi-audio processing device for a dvd player
CN2899335Y (en) * 2006-02-27 2007-05-09 邱瓊谊 TV set with built-in independent infrared radio transmitter
CN1937664B (en) * 2006-09-30 2010-11-10 华为技术有限公司 System and method for realizing multi-language conference
CN101741430A (en) * 2009-12-14 2010-06-16 中兴通讯股份有限公司 Method for switching audio channels and media play system
CN103947188A (en) * 2012-10-30 2014-07-23 深圳市多尼卡电子技术有限公司 Method and device for editing and playing audio-video file, and broadcasting system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1540983A (en) * 2003-03-25 2004-10-27 株式会社东芝 Informationj recording medium, information recording/playback method and information recording/playback appts.
CN101044573A (en) * 2004-08-19 2007-09-26 索尼株式会社 Reproduction device, reproduction method, reproduction program, recording medium, and data structure
CN101136232A (en) * 2007-10-15 2008-03-05 殷亮 Double subtitling double track data media and player having parent language of foreign languages
CN102917241A (en) * 2011-08-04 2013-02-06 瀚宇彩晶股份有限公司 Automatic detection method of vocal tract language name and television using same
CN106464953A (en) * 2014-04-15 2017-02-22 克里斯·T·阿纳斯塔斯 Binaural audio systems and methods
CN105898617A (en) * 2016-04-28 2016-08-24 广东欧珀移动通信有限公司 Method and device for matching with audio track of video play

Also Published As

Publication number Publication date
CN108259986A (en) 2018-07-06

Similar Documents

Publication Publication Date Title
US11381886B2 (en) Data processor and transport of user control data to audio decoders and renderers
JP4634412B2 (en) Apparatus and method for performing codec upgrade
US20200228848A1 (en) Audiovisual content item data streams
CN108259986B (en) Multi-channel audio playing method and device, electronic equipment and storage medium
US9883213B2 (en) Embedding encoded audio into transport stream for perfect splicing
KR102020689B1 (en) Reception device, reception method, program, and information processing system
CN103703787A (en) Information processing device, information processing method, program and application information table transmission device
KR20090024885A (en) Method for software updateing in display device
US20170311032A1 (en) Content Identifier Remapping for Fast Channel Change
CN113228166B (en) Command control device, control method, and nonvolatile storage medium
KR100914808B1 (en) Channel-tuning method in a DTV receiver
US12035018B2 (en) Data processor and transport of user control data to audio decoders and renderers
CN109688460B (en) Consonant output method for digital television picture, digital television and storage medium
US20230007321A1 (en) Methods and devices for providing personalized audio to a user
CN117812368A (en) Display equipment and program playing method
JP2023183745A (en) Broadcasting system, receiver, reception method, and program
KR20190093386A (en) Apparatus for providing service of electronic service guide in digital broadcast based on voice recognition and method for the same
CN114731459A (en) Method and apparatus for personalizing audio content
CN111601157A (en) Audio output method and display device
KR20080066197A (en) Method for automatically setting up an output language in a digital broadcasting reproducing apparatus

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20221018

Address after: 83 Intekte Street, Devon, Netherlands

Patentee after: VIDAA (Netherlands) International Holdings Ltd.

Address before: Room 601, building C, Tsinghua Ziguang science and Technology Park, No. 13, Langshan Road, North District, high tech Zone, Shenzhen, Guangdong 518000

Patentee before: HISENSE ELECTRONIC TECHNOLOGY (SHENZHEN) Co.,Ltd.

TR01 Transfer of patent right