WO2022089098A1 - 音高调节方法、装置及计算机存储介质 - Google Patents
音高调节方法、装置及计算机存储介质 Download PDFInfo
- Publication number
- WO2022089098A1 WO2022089098A1 PCT/CN2021/119571 CN2021119571W WO2022089098A1 WO 2022089098 A1 WO2022089098 A1 WO 2022089098A1 CN 2021119571 W CN2021119571 W CN 2021119571W WO 2022089098 A1 WO2022089098 A1 WO 2022089098A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- pitch
- melody
- target
- file
- fundamental frequency
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000009466 transformation Effects 0.000 claims description 16
- 238000004364 calculation method Methods 0.000 claims description 5
- 238000006243 chemical reaction Methods 0.000 claims description 5
- 230000006870 function Effects 0.000 description 11
- 238000004891 communication Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000000630 rising effect Effects 0.000 description 3
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000012905 input function Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000010079 rubber tapping Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010897 surface acoustic wave method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/18—Selecting circuits
- G10H1/20—Selecting circuits for transposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/36—Accompaniment arrangements
- G10H1/361—Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
- G10H1/366—Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems with means for modifying or correcting the external signal, e.g. pitch correction, reverberation, changing a singer's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0008—Associated control or indicating means
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/02—Means for controlling the tone frequencies, e.g. attack or decay; Means for producing special musical effects, e.g. vibratos or glissandos
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/36—Accompaniment arrangements
- G10H1/361—Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/005—Musical accompaniment, i.e. complete instrumental rhythm synthesis added to a performed melody, e.g. as output by drum machines
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G10H2210/066—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for pitch analysis as part of wider processing for musical purposes, e.g. transcription, musical performance evaluation; Pitch recognition, e.g. in polyphonic sounds; Estimation or use of missing fundamental
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/325—Musical pitch modification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/325—Musical pitch modification
- G10H2210/331—Note pitch correction, i.e. modifying a note pitch or replacing it by the closest one in a given scale
Definitions
- the embodiments of the present application relate to the field of data processing, and in particular, to a pitch adjustment method, device, and computer storage medium.
- the current smart terminal music software can provide users with singing recording services, that is, the music software plays the accompaniment of the song, the user sings under the accompaniment, and the music software records the user's singing voice, and then mixes the user's singing voice with the accompaniment of the song
- the final synthesized work includes the user's singing voice and the accompaniment of the song.
- Embodiments of the present application provide a pitch adjustment method, device, and computer storage medium, which are used to automatically adjust the accompaniment of a target song, so that the user's singing voice matches the accompaniment in pitch.
- a first aspect of the embodiments of the present application provides a pitch adjustment method, including:
- the alternative melody files are used to identify the pitch value of the note in the melody of the target song, and the pitch values identified by each of the alternative melody files are different;
- the target fundamental frequency point includes the fundamental frequency The fundamental frequency point corresponding in time to the note of the candidate melody file in the sequence;
- the candidate melody file with the smallest sum is determined as the target melody file, and the pitch of the accompaniment file of the target song is adjusted according to the pitch value difference between the target melody file and the original melody file of the target song.
- a second aspect of the embodiments of the present application provides a pitch adjustment device, including:
- the first acquisition unit is used to acquire a plurality of candidate melody files, the candidate melody files are used to identify the pitch values of notes in the melody of the target song, and the pitch values identified by each of the candidate melody files are different ;
- the second acquisition unit is used to acquire the fundamental frequency sequence of the singing voice of the user singing the target song
- the conversion unit is used to convert the frequency value of the target fundamental frequency point of the fundamental frequency sequence into a pitch value according to a preset algorithm, and the target fundamental frequency point includes the frequency value of the fundamental frequency sequence and the candidate melody file.
- the fundamental frequency point corresponding to the note in time;
- the calculation unit is used to calculate the pitch value difference between each of the alternative melody files and the fundamental frequency sequence at each corresponding time point, and to count all the pitch values of each of the alternative melody files respectively. the sum of the high value differences;
- the pitch adjustment unit is used to determine the candidate melody file with the smallest sum as the target melody file, and adjust the target song according to the pitch value difference between the target melody file and the original melody file of the target song the pitch of the accompaniment file.
- a third aspect of the embodiments of the present application provides a pitch adjustment device, including:
- processor memory, bus, input and output devices
- the processor is connected to the memory and the input and output device;
- the bus is respectively connected to the processor, the memory and the input and output devices;
- the processor is used to obtain a plurality of candidate melody files, and the alternative melody files are used to identify the pitch values of the notes in the melody of the target song, and the pitch values identified by each of the alternative melody files are different, and obtain
- the fundamental frequency sequence of the singing voice of the user singing the target song, and the frequency value of the target fundamental frequency point of the fundamental frequency sequence is converted into a pitch value according to a preset algorithm, and the target fundamental frequency point includes the fundamental frequency sequence
- the target fundamental frequency point includes the fundamental frequency sequence
- calculate the pitch value difference between each of the alternative melody file and the fundamental frequency sequence at each corresponding time point calculate the pitch value difference between each of the alternative melody file and the fundamental frequency sequence at each corresponding time point , and count the sum of all pitch value differences of each described candidate melody file respectively, determine the candidate melody file with the smallest sum as the target melody file, and according to the target melody file and the target song
- the pitch value difference of the original melody file adjusts the pitch of the accompaniment file of the target song.
- a fourth aspect of the embodiments of the present application provides a computer storage medium, where an instruction is stored in the computer storage medium, and when the instruction is executed on a computer, the computer executes the method of the foregoing first aspect.
- the embodiments of the present application have the following advantages:
- the fundamental frequency sequence of the user's singing is obtained, the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point is calculated, and the The sum of all pitch value differences, determine the candidate melody file with the smallest sum as the target melody file, and adjust the pitch of the accompaniment file of the target song according to the pitch value difference between the target melody file and the original melody file of the target song , since the pitch identified by the target melody file has the highest matching degree with the pitch of the user's singing voice, the accompaniment after the pitch adjustment can match the pitch of the user's singing voice, and the resulting mixed works can obtain good listening feel.
- 1 is a schematic flowchart of a pitch adjustment method in an embodiment of the application
- FIG. 2 is another schematic flowchart of the pitch adjustment method in the embodiment of the present application.
- FIG. 3 is a schematic structural diagram of a pitch adjustment device in an embodiment of the present application.
- FIG. 4 is another structural schematic diagram of the pitch adjustment device in the embodiment of the application.
- FIG. 5 is another schematic structural diagram of the pitch adjustment device in the embodiment of the present application.
- Embodiments of the present application provide a pitch adjustment method, device, and computer storage medium, which are used to automatically adjust the accompaniment of a target song, so that the user's singing voice matches the accompaniment in pitch.
- an embodiment of the pitch adjustment method in the embodiment of the present application includes:
- the method of this embodiment may be applied to a pitch adjustment apparatus, and the apparatus may be a computer device capable of performing data processing tasks, such as a terminal and a server.
- the generating device is a terminal, it can be a smart phone, a tablet computer, a laptop computer, a desktop computer, a self-service terminal, etc.; when it is a server, it can be an independent physical server, or a plurality of physical servers.
- Server clusters or distributed systems can also be cloud servers that provide basic cloud computing services such as cloud databases, cloud computing, and big data and artificial intelligence platforms.
- the pitch of the accompaniment of the target song is adjusted according to the pitch of the user's singing voice, so that the pitch of the accompaniment matches the pitch of the user's singing voice, and the mixed work of the user's singing voice and the accompaniment has a better sense of hearing.
- a plurality of candidate melody files are used as references to determine the degree of adjustment of the pitch of the accompaniment. Therefore, when adjusting the pitch of the accompaniment, a plurality of candidate melody files are obtained, wherein each candidate melody file is used to identify the pitch value of the note in the melody of the target song, and each candidate melody file identifies The pitch values are different.
- the pitch value of the melody of the target song identified by the candidate melody file may be a pitch value of 0-108 or 0-88.
- the pitch value identified by candidate melody file 1 is 0, the pitch value identified by candidate melody file 2 is 1, and so on.
- the pitch adjustment device obtains the audio data of the user's singing voice, and extracts the fundamental frequency of the singing voice to obtain a fundamental frequency sequence, which includes a plurality of fundamental frequency points.
- the commonly used fundamental frequency extraction algorithms include autocorrelation algorithm, parallel processing method, cepstral method and simplified inverse filtering method.
- the fundamental frequency of the singing voice is obtained, and the fundamental frequency sequence of the user's singing voice is obtained.
- this embodiment uses multiple candidate melody files as a reference, and the multiple candidate melody files identify the pitch values of the melody, when comparing the candidate melody files with the fundamental frequency sequence of the user's singing, It is necessary to convert the frequency value of the target fundamental frequency point in the fundamental frequency sequence into a pitch value, and the target fundamental frequency point includes the fundamental frequency point in the fundamental frequency sequence corresponding to the notes of the candidate melody file in time, so that the The pitch value of the fundamental frequency point is compared with the pitch value identified by the candidate melody file, and the comparison result can be used as the basis for the pitch adjustment of the accompaniment.
- the pitch value identified by the candidate melody file is also the pitch value of the note.
- you can calculate The pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point, where the corresponding time point means that the fundamental frequency point of the fundamental frequency sequence falls within the position of a certain note in the candidate melody file The time range at which the fundamental frequency corresponds to the note in time. For example, the duration of a note is 1s. If a fundamental frequency point falls within the time range of the 1s note, the fundamental frequency point corresponds to the note in time, and the sound of the two can be calculated. High value difference.
- the numerical value of the sum of the pitch value differences can reflect the size of the gap between the pitch value of the candidate melody file and the pitch value of the fundamental frequency sequence of the user's singing voice, that is, the larger the value of the sum, the greater the gap.
- the pitch of the candidate melody file does not match the pitch of the user's singing voice; the smaller the value of the sum is, the smaller the gap is, and the higher the degree of fit between the pitch of the candidate melody file and the pitch of the user's singing voice, Then, the pitch of the accompaniment is adjusted according to the candidate melody file, and the accompaniment matching the pitch of the user's singing voice can be obtained.
- the original melody file of the target song is used to identify the pitch values of the notes in the original melody of the target song
- the original melody may be the singing melody of the original singer of the target song, because the original singer is generally a relatively professional singer Therefore, the pitch of the original melody will generally match the pitch of the accompaniment of the target song, and the pitch value identified by the original melody file will also match the pitch value of the accompaniment. Therefore, the pitch of the accompaniment file of the target song can be adjusted according to the pitch value difference between the target melody file and the original melody file.
- the accompaniment obtained by adjusting the pitch according to the target melody file will also match the pitch of the user's singing voice, thereby making the adjustment
- the accompaniment after the pitch and the user's singing voice have a good sense of hearing.
- the pitch values of the notes identified by a certain candidate melody file are 24, 25, 29, 31, 34, and 27 respectively (in practical applications, the number of notes identified by the candidate melody file is determined according to the target song , here only a limited number of notes are exemplified), and the pitch values of the target fundamental frequency points corresponding to the above-mentioned notes in the fundamental frequency sequence of the target song are 24, 25, 28, 31, 34, and 27 respectively.
- Calculate the pitch value difference between the corresponding target fundamental frequency point and the note as 0, 0, 1, 0, 0, 0 (the absolute value of the pitch value difference), and statistically obtain the pitch value difference
- the sum is 1.
- the sum of the pitch value differences of other candidate melody files can be calculated.
- the pitch of the accompaniment file of the target song can be adjusted according to the pitch difference between the target melody file and the original melody file of the target song, so that The accompaniment after adjusting the pitch can match the pitch of the user's singing voice and enhance the sense of hearing.
- the fundamental frequency sequence of the user's singing is obtained, the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point is calculated, and all the data of each candidate melody file are counted separately.
- the sum of the pitch value differences, the candidate melody file with the smallest sum is determined as the target melody file, and the pitch of the accompaniment file of the target song is adjusted according to the pitch value difference between the target melody file and the original melody file of the target song, Since the pitch identified by the target melody file has the highest matching degree with the pitch of the user's singing voice, the accompaniment after the pitch adjustment can match the pitch of the user's singing voice, and the resulting mixed work can have a good sense of hearing. .
- FIG. 2 another embodiment of the pitch adjustment method in the embodiment of the present application includes:
- the multiple candidate melody files may be any files used to identify the pitch values of the melody of the target song, as long as the pitch values identified by each candidate melody file are different.
- the multiple candidate melody files can be obtained by transforming the original melody files of the target song.
- the original melody file is used to identify the pitch value of the original melody of the target song, and the original melody may be the singing melody of the original singer of the target song. Since the melody is composed of musical notes, when the original melody file is transformed into a rising or falling key, the transformed value can be added to the pitch values of all the notes in the original melody file, thereby obtaining a transformed melody file. Therefore, the transformed melody file and the original melody file can be respectively used as candidate melody files, and both can be used as reference for adjusting the pitch of the accompaniment.
- the transformation of the original melody file may be a rising key transformation or a falling key transformation
- the transformation value may be a positive value or a negative value.
- the transformation value is +1, it means that the pitch value of the original melody file is increased by 1 unit, which is a rising transformation; the transformation value is -2, which means that the pitch value of the original melody file is reduced by 2 units. for the downshifting transformation.
- the transformation may be specifically performed based on the principle of twelve equal laws.
- the twelve equal temperament is a method of music law, dividing a pure octave into twelve equal parts, each equal part is called a semitone, and it is the most important tuning method. Therefore, based on the twelve equal temperament, an octave where the original melody file is located can be equally divided to obtain twelve semitone intervals, wherein the original melody file corresponds to one semitone interval in the twelve semitone intervals; After that, according to the interval relationship between the chromatic interval corresponding to the original melody file and other chromatic intervals, the pitch values of all the notes in the original melody file are respectively added and transformed 11 times, thereby obtaining 11 transformed melody files.
- the transformed melody file Since the transformation value is added according to chromatic intervals, the transformed melody file also corresponds to one chromatic interval in the twelve chromatic intervals, that is, each transformed melody file corresponds to the twelve chromatic intervals respectively. a semitone interval in .
- the 11 transformed melody files together with the original melody file constitute 12 candidate melody files.
- the pitch values of all the notes in the original melody file are respectively added and transformed 11 times, which are +1, +2, +3, ..., +9, +10, +11, then the original melody file's The pitch value is the smallest, and the pitch value of the melody file plus the transform value +11 is the largest.
- the specific algorithm content of the preset algorithm is not limited, as long as it is an algorithm that can convert the frequency value of the fundamental frequency point into a pitch value.
- the preset algorithm can be the following formula:
- hz_value is the frequency value of the fundamental frequency point.
- the frequency value of the fundamental frequency point can be converted into a pitch value through the above formula.
- the target fundamental frequency point may include all fundamental frequency points in the fundamental frequency sequence, or may only include the target fundamental frequency point corresponding in time to the notes of the candidate melody file.
- one way may be to traverse each fundamental frequency point of the fundamental frequency sequence, and convert the frequency value of each fundamental frequency point into a pitch value according to a preset algorithm, and then, Then, from all the fundamental frequency points of the fundamental frequency sequence, determine the target fundamental frequency point corresponding to the notes of the candidate melody file in time; another way can also be, first determine from all the fundamental frequency points of the fundamental frequency sequence.
- the target fundamental frequency point corresponding to the note in the alternative melody file in time is obtained.
- When converting the frequency value to the pitch value only the frequency value of the target fundamental frequency point is converted. Compared with the previous method, no conversion is required.
- the frequency values of other fundamental frequency points greatly reduce the operation of calculating the pitch value and reduce the pressure of data processing.
- the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point when calculating the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point, obtain the temporally related target fundamental frequency point in each candidate melody file.
- the pitch value of the corresponding note that is, when a certain fundamental frequency point falls within the time value range of a certain note, the fundamental frequency point is the target fundamental frequency point corresponding to the note in time.
- the pitch value difference between the corresponding target fundamental frequency point and the note in time is calculated, so as to obtain the pitch value difference between the candidate melody file and the fundamental frequency sequence at each corresponding time point.
- the candidate melody file determines whether the note in the candidate melody file corresponds to the fundamental frequency point of the fundamental frequency sequence in time, and the specific method may be that the candidate melody file also identifies the start time and end time of the note in the melody of the target song.
- the note corresponding to the target fundamental frequency point in time can be determined according to the start time and end time of the note, that is, the fundamental frequency point falls within the time period from the start time to the end time of a certain note, then the target fundamental frequency is determined. The point corresponds in time to that note. After the corresponding note is determined, the pitch value of the corresponding note is obtained.
- the candidate melody file with the smallest sum of pitch value differences has the highest pitch matching degree with the user's singing voice. Therefore, the candidate melody file with the smallest sum of pitch value differences is determined as the reference for accompaniment pitch adjustment. .
- the degree of pitch matching between the target melody file and the user's singing voice can be further determined, that is, the note with a pitch value difference of 0 in the target melody file is among all the notes.
- the note with a pitch value difference of 0 in the target melody file accounts for 100% of all notes, it means that there is no difference in pitch between the entire target melody file and the user's singing voice.
- the pitch value can well match the user's singing voice, which also shows that the user has a strong ability to grasp the pitch from another perspective.
- the proportion of notes with a pitch value difference of 0 in the target melody file is extremely low among all notes, it means that there are many differences in pitch between the target melody file and the user's singing voice, and the matching degree between the two is not high. It may be because the user's ability to grasp the pitch is not strong, and they often go out of tune when singing, and cannot sing according to a certain pitch.
- the preset threshold can be set arbitrarily, and can be obtained by summarizing experimental data. For example, it can be set to any value between 80% and 100%.
- the proportion of notes with a pitch value difference of 0 in the target melody file among all notes is greater than the preset threshold, it indicates that the target melody file and the user's singing have a high degree of pitch matching.
- the pitch value difference of the original melody file of the target song adjusts the pitch of the accompaniment file of the target song.
- the operations performed in this step are similar to the operations performed in step 104 in the aforementioned embodiment shown in FIG. 1 .
- the target melody file is one of the multiple candidate melody files obtained in step 201, if the multiple candidate melody files are obtained by transforming the original melody file of the target song, the The transformation relationship of the melody file directly determines the pitch value difference between the target melody file and the original melody file.
- step 201 transforms the original melody file based on the twelve equal temperament to obtain 12 candidate melody files, and each candidate melody file corresponds to a chromatic interval, therefore, the difference between the target melody file and the original melody file is There is an interval relationship, that is, how many semitone intervals are different, and when it is specifically expressed in pitch, it is the pitch difference between the melody corresponding to the target melody file and the melody corresponding to the original melody file. Therefore, the pitch of the accompaniment file of the target song can be adjusted according to the interval relationship between the target melody file and the original melody file.
- the proportion of notes with a pitch value difference of 0 in the target melody file is less than the preset threshold, it indicates that there are many differences in pitch between the target melody file and the user's singing voice, and the matching degree between the two is not high. At this time, it is considered that the user has a poor grasp of the pitch of the target song. Even if the pitch of the accompaniment file is adjusted according to the target melody file, the accompaniment cannot fit the user's singing well. Therefore, the pitch of the accompaniment file is not adjusted and the accompaniment is not changed. the pitch.
- the pitch match between the target melody file and the user's singing voice can be further determined by judging whether the proportion of notes with a pitch value difference of 0 in all notes in the target melody file is greater than a preset threshold. degree, and improve the feasibility of the program.
- an embodiment of the pitch adjustment device in the embodiment of the present application includes:
- the first obtaining unit 301 is used to obtain a plurality of candidate melody files, the candidate melody files are used to identify the pitch values of notes in the melody of the target song, and the pitch values identified by each candidate melody file are different;
- the second obtaining unit 302 is used to obtain the fundamental frequency sequence of the singing voice of the user singing the target song
- the conversion unit 303 is used to convert the frequency value of the target fundamental frequency point of the fundamental frequency sequence into a pitch value according to a preset algorithm, and the target fundamental frequency point includes the time corresponding to the note of the candidate melody file in the fundamental frequency sequence. fundamental frequency point;
- the calculation unit 304 is used to calculate the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point, and count the difference of all the pitch value differences of each candidate melody file respectively. sum;
- the pitch adjustment unit 305 is configured to determine the candidate melody file with the smallest sum as the target melody file, and adjust the pitch of the accompaniment file of the target song according to the pitch value difference between the target melody file and the original melody file of the target song.
- the first obtaining unit 301 is specifically configured to obtain the original melody file of the target song, and add transformation values to the pitch values of all the notes in the original melody file to obtain the transformed melody file,
- the original melody file and the transformed melody file are respectively used as alternative melody files.
- the first obtaining unit 301 is specifically configured to equally divide the octave corresponding to the original melody file based on the twelve equal temperament to obtain twelve semitone intervals, and the original melody file corresponds to twelve one of the semitone intervals;
- the pitch values of all the notes in the original melody file are respectively added and transformed 11 times to obtain 11 transformed melody files;
- each transformed melody file corresponds to one chromatic interval in the twelve chromatic intervals respectively.
- the pitch adjustment unit 305 is specifically configured to adjust the pitch of the accompaniment file of the target song according to the interval relationship between the target melody file and the original melody file high.
- the pitch adjustment device further includes:
- Judging unit 306 for judging whether the proportion of notes with a pitch value difference of 0 in all notes in the target melody file is greater than a preset threshold
- the pitch adjustment unit 305 is specifically configured to, when the proportion of notes with a pitch value difference of 0 in all the notes in the target melody file is greater than a preset threshold, execute the pitch of the original melody file according to the target melody file and the target song. The steps of adjusting the pitch of the accompaniment file of the target song by the difference value; when the proportion of notes with a pitch value difference of 0 in the target melody file is not greater than the preset threshold, the pitch of the accompaniment file is not adjusted .
- the conversion unit 303 is specifically configured to traverse each fundamental frequency point of the fundamental frequency sequence, convert the frequency value of each fundamental frequency point into a pitch value according to a preset algorithm, and convert the frequency value of each fundamental frequency point into a pitch value from the fundamental frequency point. Determine the target fundamental frequency point among all fundamental frequency points of the frequency sequence;
- the calculation unit 304 is specifically used to obtain the pitch value of the note corresponding to the target fundamental frequency point in time in each candidate melody file, and calculate the pitch value difference between the corresponding target fundamental frequency point and the note in time. value.
- the alternative melody file is also used to identify the start time and end time of the notes in the melody of the target song;
- the calculation unit 304 is specifically configured to determine the note corresponding to the target fundamental frequency point in time according to the start time and the end time of the note in each candidate melody file, and obtain the sound of the note corresponding to the target fundamental frequency point in time. high value.
- each unit in the pitch adjustment device the operations performed by each unit in the pitch adjustment device are similar to those described in the foregoing embodiments shown in FIG. 1 to FIG. 2 , and details are not repeated here.
- the first obtaining unit 301 obtains the fundamental frequency sequence of the user's singing voice
- the calculating unit 304 calculates the pitch value difference between each candidate melody file and the fundamental frequency sequence at each corresponding time point, and counts the differences respectively.
- the sum of all pitch value differences of each candidate melody file, the pitch adjustment unit 305 determines the candidate melody file with the smallest sum as the target melody file, and according to the pitch of the target melody file and the original melody file of the target song The value difference adjusts the pitch of the accompaniment file of the target song. Since the pitch identified by the target melody file has the highest matching degree with the pitch of the user's singing voice, the accompaniment after the pitch adjustment can match the pitch of the user's singing voice. Fit, the resulting mixed works can get a good sense of hearing.
- an embodiment of the pitch adjustment device in the embodiment of the present application includes:
- the pitch adjustment device 400 may include one or more central processing units (CPUs) 401 and a memory 405, where one or more application programs or data are stored in the memory 405.
- CPUs central processing units
- memory 405 where one or more application programs or data are stored in the memory 405.
- the memory 405 may be volatile storage or persistent storage.
- the program stored in the memory 405 may include one or more modules, each module may include a series of instruction operations on the pitch adjustment apparatus.
- the central processing unit 401 may be configured to communicate with the memory 405 to execute a series of instruction operations in the memory 405 on the pitch adjustment device 400 .
- the pitch adjustment device 400 may also include one or more power supplies 402, one or more wired or wireless network interfaces 403, one or more input and output interfaces 404, and/or, one or more operating systems, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
- one or more operating systems such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, etc.
- the central processing unit 401 can perform the operations performed by the pitch adjustment apparatus in the embodiments shown in FIG. 1 to FIG. 2 , and details are not repeated here.
- an embodiment of the pitch adjustment device in the embodiment of the present application includes:
- the terminal can be any terminal device including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a POS (Point ofSales, a sales terminal), a vehicle-mounted computer, etc.
- the terminal is a mobile phone as an example:
- FIG. 5 is a block diagram showing a partial structure of a mobile phone related to a terminal provided by an embodiment of the present application.
- the mobile phone includes: a radio frequency (Radio Frequency, RF) circuit 510 , a memory 520 , an input unit 530 , a display unit 540 , a sensor 550 , an audio circuit 560 , a wireless fidelity (WiFi) module 570 , and a processor 580 , and power supply 590 and other components.
- RF Radio Frequency
- the RF circuit 510 can be used for receiving and sending signals during sending and receiving of information or during a call. In particular, after receiving the downlink information of the base station, it is processed by the processor 580; in addition, the designed uplink data is sent to the base station.
- the RF circuit 510 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like.
- LNA Low Noise Amplifier
- RF circuitry 510 may also communicate with networks and other devices via wireless communications.
- the above-mentioned wireless communication can use any communication standard or protocol, including but not limited to Global System of Mobile communication (GSM), General Packet Radio Service (General Packet Radio Service, GPRS), Code Division Multiple Access (Code Division) Multiple Access, CDMA), Wideband Code Division Multiple Access (Wideband Code Division Multiple Access, WCDMA), Long Term Evolution (Long Term Evolution, LTE), email, Short Messaging Service (Short Messaging Service, SMS), etc.
- GSM Global System of Mobile communication
- General Packet Radio Service General Packet Radio Service
- GPRS General Packet Radio Service
- Code Division Multiple Access Code Division Multiple Access
- CDMA Code Division Multiple Access
- Wideband Code Division Multiple Access Wideband Code Division Multiple Access
- WCDMA Wideband Code Division Multiple Access
- LTE Long Term Evolution
- SMS Short Messaging Service
- the memory 520 can be used to store software programs and modules, and the processor 580 executes various functional applications and data processing of the mobile phone by running the software programs and modules stored in the memory 520.
- the memory 520 may mainly include a stored program area and a stored data area, wherein the stored program area may store an operating system, an application program (such as a sound playback function, an image playback function, etc.) required for at least one function, and the like; Data created by the use of the mobile phone (such as audio data, phone book, etc.), etc.
- memory 520 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
- the input unit 530 may be used for receiving inputted numerical or character information, and generating key signal input related to user setting and function control of the mobile phone.
- the input unit 530 may include a touch panel 531 and other input devices 532 .
- the touch panel 531 also referred to as a touch screen, can collect the user's touch operations on or near it (such as the user's finger, stylus, etc., any suitable object or accessory on or near the touch panel 531). operation), and drive the corresponding connection device according to the preset program.
- the touch panel 531 may include two parts, a touch detection device and a touch controller.
- the touch detection device detects the user's touch orientation, detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, and then sends it to the touch controller.
- the touch panel 531 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves.
- the input unit 530 may further include other input devices 532 .
- other input devices 532 may include, but are not limited to, one or more of physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, joysticks, and the like.
- the display unit 540 may be used to display information input by the user or information provided to the user and various menus of the mobile phone.
- the display unit 540 may include a display panel 541, and optionally, the display panel 541 may be configured in the form of a liquid crystal display (Liquid Crystal Display, LCD), an organic light-emitting diode (Organic Light-Emitting Diode, OLED), and the like.
- the touch panel 531 may cover the display panel 541, and when the touch panel 531 detects a touch operation on or near it, it transmits it to the processor 580 to determine the type of the touch event, and then the processor 580 determines the type of the touch event according to the touch event. Type provides corresponding visual output on display panel 541 .
- the touch panel 531 and the display panel 541 are used as two independent components to realize the input and input functions of the mobile phone, in some embodiments, the touch panel 531 and the display panel 541 can be integrated to form Realize the input and output functions of the mobile phone.
- the cell phone may also include at least one sensor 550, such as a light sensor, a motion sensor, and other sensors.
- the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 541 according to the brightness of the ambient light, and the proximity sensor may turn off the display panel 541 and/or when the mobile phone is moved to the ear. or backlight.
- the accelerometer sensor can detect the magnitude of acceleration in all directions (usually three axes), and can detect the magnitude and direction of gravity when it is stationary. games, magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tapping), etc.; as for other sensors such as gyroscope, barometer, hygrometer, thermometer, infrared sensor, etc. Repeat.
- the audio circuit 560, the speaker 561, and the microphone 562 can provide an audio interface between the user and the mobile phone.
- the audio circuit 560 can transmit the received audio data converted electrical signal to the speaker 561, and the speaker 561 converts it into a sound signal for output; on the other hand, the microphone 562 converts the collected sound signal into an electrical signal, which is converted by the audio circuit 560 After receiving, it is converted into audio data, and then the audio data is output to the processor 580 for processing, and then sent to, for example, another mobile phone through the RF circuit 510, or the audio data is output to the memory 520 for further processing.
- WiFi is a short-distance wireless transmission technology.
- the mobile phone can help users to send and receive emails, browse web pages, and access streaming media through the WiFi module 570, which provides users with wireless broadband Internet access.
- FIG. 5 shows the WiFi module 570, it can be understood that it is not a necessary component of the mobile phone.
- the processor 580 is the control center of the mobile phone, using various interfaces and lines to connect various parts of the entire mobile phone, by running or executing the software programs and/or modules stored in the memory 520, and calling the data stored in the memory 520.
- the processor 580 may include one or more processing units; preferably, the processor 580 may integrate an application processor and a modem processor, wherein the application processor mainly processes the operating system, user interface, and application programs, etc. , the modem processor mainly deals with wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 580 .
- the mobile phone also includes a power supply 590 (such as a battery) for supplying power to various components.
- a power supply 590 (such as a battery) for supplying power to various components.
- the power supply can be logically connected to the processor 580 through a power management system, so as to manage charging, discharging, and power consumption management functions through the power management system.
- the mobile phone may also include a camera, a Bluetooth module, and the like, which will not be repeated here.
- the processor 580 included in the terminal may perform the functions in the foregoing embodiments shown in FIG. 1 to FIG. 2 , and details are not described herein again.
- An embodiment of the present application further provides a computer storage medium, wherein an embodiment includes: an instruction is stored in the computer storage medium, and when the instruction is executed on a computer, causes the computer to execute the foregoing embodiments shown in FIG. 1 to FIG. 2 .
- the disclosed system, apparatus and method may be implemented in other manners.
- the apparatus embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods.
- multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented.
- the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
- the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
- the integrated unit if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium.
- the technical solutions of the present application can be embodied in the form of software products in essence, or the parts that contribute to the prior art, or all or part of the technical solutions, and the computer software products are stored in a storage medium , including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application.
- the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM, read-only memory), random access memory (RAM, random access memory), magnetic disk or optical disk and other media that can store program codes .
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Electrophonic Musical Instruments (AREA)
- Auxiliary Devices For Music (AREA)
Abstract
Description
Claims (11)
- 一种音高调节方法,其特征在于,包括:获取多个备选旋律文件,所述备选旋律文件用于标识目标歌曲的旋律中音符的音高值,每个所述备选旋律文件所标识的音高值不同;获取用户歌唱所述目标歌曲的歌声的基频序列,并根据预设算法将所述基频序列的目标基频点的频率值转换为音高值,所述目标基频点包括所述基频序列中与所述备选旋律文件的音符在时间上相对应的基频点;分别计算每个所述备选旋律文件与所述基频序列在每一个相对应时间点上的音高值差值,并分别统计每个所述备选旋律文件的所有音高值差值的总和;将所述总和最小的备选旋律文件确定为目标旋律文件,并根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高。
- 根据权利要求1所述的音高调节方法,其特征在于,所述获取多个备选旋律文件,包括:获取所述目标歌曲的所述原始旋律文件;对所述原始旋律文件的所有音符的音高值加上变换值,得到变换后的旋律文件;分别将所述原始旋律文件以及所述变换后的旋律文件作为所述备选旋律文件。
- 根据权利要求2所述的音高调节方法,其特征在于,所述对所述原始旋律文件的所有音符的音高值加上变换值,得到变换后的旋律文件,包括:基于十二平均律,将所述原始旋律文件对应的八度音阶平均分割,得到十二个半音音程,所述原始旋律文件对应所述十二个半音音程中的一个半音音程;按照所述原始旋律文件对应的半音音程与其它所述半音音程之间的音程关系,分别对所述原始旋律文件的所有音符的音高值执行11次的加变换值,得到11个所述变换后的旋律文件;其中,每个所述变换后的旋律文件分别对应所述十二个半音音程中的一个半音音程。
- 根据权利要求3所述的音高调节方法,其特征在于,当所述目标旋律文件不是所述原始旋律文件时,所述根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高,包括:根据所述目标旋律文件与所述原始旋律文件之间的音程关系调节所述目标歌曲的伴奏文件的音高。
- 根据权利要求1所述的音高调节方法,其特征在于,所述将所述总和最小的备选旋律文件确定为目标旋律文件之后,所述方法还包括:判断所述目标旋律文件中音高值差值为0的音符在所有音符中的占比是否大于预设阈值;若是,则执行所述根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高的步骤;若否,则不调节所述伴奏文件的音高。
- 根据权利要求1所述的音高调节方法,其特征在于,所述根据预设算法将所述基频序列的目标基频点的频率值转换为音高值,包括:确定所述基频序列中与所述备选旋律文件的音符在时间上相对应的所述目标基频点;根据所述预设算法将所述目标基频点的频率值转换为音高值;所述分别计算每个所述备选旋律文件与所述基频序列在每一个相对应时间点上的音高值差值,包括:获取每个所述备选旋律文件中与所述目标基频点在时间上相对应的音符 的音高值,计算时间上相对应的目标基频点与音符之间的音高值差值。
- 根据权利要求6所述的音高调节方法,其特征在于,所述备选旋律文件还用于标识所述目标歌曲的旋律中音符的开始时间及结束时间;所述获取每个所述备选旋律文件中与所述目标基频点在时间上相对应的音符的音高值,包括:根据每个所述备选旋律文件中音符的开始时间及结束时间确定与所述目标基频点在时间上相对应的音符;获取与所述目标基频点在时间上相对应的音符的音高值。
- 一种音高调节装置,其特征在于,包括:第一获取单元,用于获取多个备选旋律文件,所述备选旋律文件用于标识目标歌曲的旋律中音符的音高值,每个所述备选旋律文件所标识的音高值不同;第二获取单元,用于获取用户歌唱所述目标歌曲的歌声的基频序列;转换单元,用于根据预设算法将所述基频序列的目标基频点的频率值转换为音高值,所述目标基频点包括所述基频序列中与所述备选旋律文件的音符在时间上相对应的基频点;计算单元,用于分别计算每个所述备选旋律文件与所述基频序列在每一个相对应时间点上的音高值差值,并分别统计每个所述备选旋律文件的所有音高值差值的总和;音高调节单元,用于将所述总和最小的备选旋律文件确定为目标旋律文件,并根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高。
- 根据权利要求8所述的音高调节装置,其特征在于,所述音高调节装置还包括:判断单元,用于判断所述目标旋律文件中音高值差值为0的音符在所有音 符中的占比是否大于预设阈值;音高调节单元具体用于当所述目标旋律文件中音高值差值为0的音符在所有音符中的占比大于预设阈值时,执行所述根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高的步骤;当所述目标旋律文件中音高值差值为0的音符在所有音符中的占比不大于预设阈值时,不调节所述伴奏文件的音高。
- 一种音高调节装置,其特征在于,包括:处理器、存储器、总线、输入输出设备;所述处理器与所述存储器、输入输出设备相连;所述总线分别连接所述处理器、存储器以及输入输出设备;所述处理器用于获取多个备选旋律文件,所述备选旋律文件用于标识目标歌曲的旋律中音符的音高值,每个所述备选旋律文件所标识的音高值不同,获取用户歌唱所述目标歌曲的歌声的基频序列,并根据预设算法将所述基频序列的目标基频点的频率值转换为音高值,所述目标基频点包括所述基频序列中与所述备选旋律文件的音符在时间上相对应的基频点,分别计算每个所述备选旋律文件与所述基频序列在每一个相对应时间点上的音高值差值,并分别统计每个所述备选旋律文件的所有音高值差值的总和,将所述总和最小的备选旋律文件确定为目标旋律文件,并根据所述目标旋律文件与所述目标歌曲的原始旋律文件的音高值差值调节所述目标歌曲的伴奏文件的音高。
- 一种计算机存储介质,其特征在于,所述计算机存储介质中存储有指令,所述指令在计算机上执行时,使得所述计算机执行如权利要求1至7中任一项所述的方法。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/034,031 US20230395051A1 (en) | 2020-10-27 | 2021-09-22 | Pitch adjustment method and device, and computer storage medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011163021.7 | 2020-10-27 | ||
CN202011163021.7A CN112270913B (zh) | 2020-10-27 | 2020-10-27 | 音高调节方法、装置及计算机存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022089098A1 true WO2022089098A1 (zh) | 2022-05-05 |
Family
ID=74342914
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/119571 WO2022089098A1 (zh) | 2020-10-27 | 2021-09-22 | 音高调节方法、装置及计算机存储介质 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20230395051A1 (zh) |
CN (1) | CN112270913B (zh) |
WO (1) | WO2022089098A1 (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112270913B (zh) * | 2020-10-27 | 2022-11-18 | 腾讯音乐娱乐科技(深圳)有限公司 | 音高调节方法、装置及计算机存储介质 |
CN113192477A (zh) * | 2021-04-28 | 2021-07-30 | 北京达佳互联信息技术有限公司 | 音频处理方法及装置 |
CN113178183B (zh) * | 2021-04-30 | 2024-05-14 | 杭州网易云音乐科技有限公司 | 音效处理方法、装置、存储介质和计算设备 |
CN113314093B (zh) * | 2021-06-01 | 2024-04-12 | 广州酷狗计算机科技有限公司 | 音频合成方法、装置、终端及存储介质 |
CN114566191A (zh) * | 2022-02-25 | 2022-05-31 | 腾讯音乐娱乐科技(深圳)有限公司 | 录音的修音方法及相关装置 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015225332A (ja) * | 2014-05-30 | 2015-12-14 | カシオ計算機株式会社 | 楽音発生装置、電子楽器、楽音発生方法およびプログラム |
CN108206026A (zh) * | 2017-12-05 | 2018-06-26 | 北京小唱科技有限公司 | 确定音频内容音高偏差的方法及装置 |
CN109272975A (zh) * | 2018-08-14 | 2019-01-25 | 无锡冰河计算机科技发展有限公司 | 演唱伴奏自动调整方法、装置及ktv点唱机 |
CN111785238A (zh) * | 2020-06-24 | 2020-10-16 | 腾讯音乐娱乐科技(深圳)有限公司 | 音频校准方法、装置及存储介质 |
CN112270913A (zh) * | 2020-10-27 | 2021-01-26 | 腾讯音乐娱乐科技(深圳)有限公司 | 音高调节方法、装置及计算机存储介质 |
-
2020
- 2020-10-27 CN CN202011163021.7A patent/CN112270913B/zh active Active
-
2021
- 2021-09-22 US US18/034,031 patent/US20230395051A1/en active Pending
- 2021-09-22 WO PCT/CN2021/119571 patent/WO2022089098A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015225332A (ja) * | 2014-05-30 | 2015-12-14 | カシオ計算機株式会社 | 楽音発生装置、電子楽器、楽音発生方法およびプログラム |
CN108206026A (zh) * | 2017-12-05 | 2018-06-26 | 北京小唱科技有限公司 | 确定音频内容音高偏差的方法及装置 |
CN109272975A (zh) * | 2018-08-14 | 2019-01-25 | 无锡冰河计算机科技发展有限公司 | 演唱伴奏自动调整方法、装置及ktv点唱机 |
CN111785238A (zh) * | 2020-06-24 | 2020-10-16 | 腾讯音乐娱乐科技(深圳)有限公司 | 音频校准方法、装置及存储介质 |
CN112270913A (zh) * | 2020-10-27 | 2021-01-26 | 腾讯音乐娱乐科技(深圳)有限公司 | 音高调节方法、装置及计算机存储介质 |
Also Published As
Publication number | Publication date |
---|---|
US20230395051A1 (en) | 2023-12-07 |
CN112270913A (zh) | 2021-01-26 |
CN112270913B (zh) | 2022-11-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022089098A1 (zh) | 音高调节方法、装置及计算机存储介质 | |
CN107705778B (zh) | 音频处理方法、装置、存储介质以及终端 | |
CN109166593B (zh) | 音频数据处理方法、装置及存储介质 | |
CN109256146B (zh) | 音频检测方法、装置及存储介质 | |
CN106782600B (zh) | 音频文件的评分方法及装置 | |
EP3493198A1 (en) | Method and device for determining delay of audio | |
WO2018072543A1 (zh) | 模型生成方法、语音合成方法及装置 | |
WO2014008843A1 (zh) | 一种声纹特征模型更新方法及终端 | |
US20180342231A1 (en) | Sound effect parameter adjustment method, mobile terminal and storage medium | |
WO2020103550A1 (zh) | 音频信号的评分方法、装置、终端设备及计算机存储介质 | |
WO2018223837A1 (zh) | 音乐播放方法及相关产品 | |
WO2017215660A1 (zh) | 一种场景音效的控制方法、及电子设备 | |
CN111883091A (zh) | 音频降噪方法和音频降噪模型的训练方法 | |
CN106847307B (zh) | 信号检测方法及装置 | |
CN108668024B (zh) | 一种语音处理方法及终端 | |
CN111785238B (zh) | 音频校准方法、装置及存储介质 | |
CN107229629B (zh) | 音频识别方法及装置 | |
CN109616135B (zh) | 音频处理方法、装置及存储介质 | |
CN109085985B (zh) | 发声控制方法、装置、电子装置以及存储介质 | |
CN110599989B (zh) | 音频处理方法、装置及存储介质 | |
WO2017215511A1 (zh) | 一种场景音效的控制方法、及相关产品 | |
WO2020228226A1 (zh) | 一种纯音乐检测方法、装置及存储介质 | |
CN110378677B (zh) | 一种红包领取方法、装置、移动终端及存储介质 | |
WO2016155527A1 (zh) | 一种流媒体对齐方法,设备以及存储介质 | |
CN112667844A (zh) | 检索音频的方法、装置、设备和存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21884828 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 18034031 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16.08.2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21884828 Country of ref document: EP Kind code of ref document: A1 |