CN104599692A

CN104599692A - Recording method and device and recording content searching method and device

Info

Publication number: CN104599692A
Application number: CN201410774335.9A
Authority: CN
Inventors: 陈青山
Original assignee: Shanghai Hehe Information Technology Development Co Ltd
Current assignee: Shanghai Hehe Information Technology Development Co Ltd
Priority date: 2014-12-16
Filing date: 2014-12-16
Publication date: 2015-05-06
Anticipated expiration: 2034-12-16
Also published as: CN104599692B

Abstract

The invention discloses a recording method. The recording method comprises step one, performing audio recording through a recording device; step two, performing once or multi-time image shooting through the recording device during audio recording as the recording device comprises an image shooting part, and recording the positions of all image shooting moments on a recorded audio time axis; step three, saving recorded audio, shot images and the positions of the image shooting moments on the recorded audio time axis. The invention further discloses the recording device for implementing the recording method, a searching method for recording contents from the recording method and a recording content searching device. By means of the recording method and device the recording content searching method and device, images related to the recording contents are obtained in a shooting mode during recording, the images or information recorded on the images are or is used as the recording content searching basis, searching of the recording contents is achieved, the step is simple and convenient, usage is facilitated, and the accuracy is high.

Description

The way of recording and device, recording substance searching method and device

Technical field

The present invention relates to a kind of way of recording.The invention still further relates to a kind of recording device.The present invention relates to again a kind of data search method, especially a kind of recording substance searching method.The present invention relates to a kind of data serching device again, especially a kind of recording substance searcher.

Background technology

Along with the development of infotech, people can touch increasing electronic data in life.In order to obtain the data that oneself needs from the electronic data of magnanimity, data searching technology just becomes a kind of vital technology.But due to the standardization of text code, the search for text data is fairly simple comparatively speaking, and the result of retrieval is also more accurate.Such as, but just more difficult for the search of voice data, current existing way identifies audio data content, identifies the language content in audio frequency, and then search in the mode of text search.Like this, user just in a section audio, can search the fragment oneself needed.Instant audio content is very long, and user need not all play, and also can obtain the fragment required for oneself, improves the efficiency of voice data search greatly.Because identified content comes from voice data itself, the voice data current with this identification content has natural relevance, therefore, it is possible to be positioned to the audio content place of user's needs according to the content of this identification.But, can ground unrest be there is when voice data is recorded, sometimes also can along with the change of the factor such as intonation, tone, these all can cause audio content identification to there is a large amount of mistakes, cause searching for audio content accurately.

Summary of the invention

Technical matters to be solved by this invention is to provide a kind of way of recording, and realize the recording device that this way of recording adopts, the recording data being convenient to carry out searching for location to recorded audio data content can be produced, the present invention also will provide a kind of recording substance searching method, and realize the recording substance searcher that this recording substance searching method adopts, can search for recording substance easily, help user navigates to the position in the recording data of needs fast.

For solving the problems of the technologies described above, the technical scheme of the way of recording of the present invention comprises the following steps:

Step one, adopts recording device to carry out audio recording;

Step 2, described recording device includes image pickup section, uses described recording device to carry out the image taking of one or many in the process of audio recording, record each shooting image time be engraved in position on recorded audio timeline;

Step 3, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.

The invention discloses a kind of recording device realizing the above-mentioned way of recording and adopt, its technical scheme is, based on computer system, comprising:

Taping component, is used for carrying out audio recording;

Shooting parts, are used for carrying out image taking in the process of audio recording, and record each shooting image time be engraved in position on recorded audio timeline;

First storage component, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.

The invention also discloses a kind of recording searching method, its technical scheme is, the above-mentioned way of recording of described recording is recorded, comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline, described recording substance searching method comprises the following steps:

Step one, selects in described image;

Step 2, according to selected image, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.

The invention also discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, its technical scheme is, based on computer system, comprising:

Second storage component, stores described recording, described recording comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline;

Image selection unit, is used for selecting in described image;

First search parts, according to the image that described image selection unit is selected, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.

The present invention discloses a kind of way of recording again, and its technical scheme comprises the following steps:

Step one, adopts sound pick-up outfit to carry out audio recording;

Step 2, described sound pick-up outfit includes image pickup section, in the process of audio recording, use described sound pick-up outfit to carry out the image taking of one or many, in captured image, include word, record each shooting image time be engraved in position on recorded audio timeline;

Step 3, Text region is carried out in captured image, to identify that each word of obtaining is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place, preserve the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.

The present invention discloses a kind of recording device realizing the above-mentioned way of recording and adopt again, and its technical scheme is, based on computer system, comprising:

Taping component, is used for carrying out audio recording;

3rd Text region parts, carry out Text region to the image of described shooting parts shooting, will identify that each word obtained is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place;

3rd memory unit, preserves the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.

The present invention again discloses a kind of recording substance searching method, its technical scheme is, described recording is recorded according to the above-mentioned way of recording, comprise the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word, described recording substance searching method comprises the following steps:

Step one, the word that the word selecting an identification to obtain or the one group of identification corresponding to same position on audio timeline obtain;

Step 2, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.

The present invention again discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, and its technical scheme is, based on computer system, comprising:

4th memory unit, stores described recording, and described recording comprises the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word,

Word alternative pack, is used for word that selection identification obtains or the word that one group of identification corresponding to same position on audio timeline obtains;

Second search parts, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.

The present invention passes through technique scheme, the image relevant to recording substance is obtained by the mode of shooting when recording, and the foundation that the information recorded on this image or this image is searched for as recording substance, achieve the search of recording substance, step is easy, be easy to use, and accuracy rate is high.

Accompanying drawing explanation

Below in conjunction with drawings and Examples, the present invention is further detailed explanation:

Fig. 1 is the schematic diagram of the way of recording of the present invention and a recording substance searching method embodiment;

Fig. 2 is the schematic diagram of recording device of the present invention and a recording substance searcher embodiment;

Fig. 3 is the schematic diagram of the way of recording of the present invention and another embodiment of recording substance searching method.

Embodiment

The invention discloses a kind of way of recording, comprise the following steps:

Step one, adopts recording device to carry out audio recording;

Word is included in described image.

After carrying out image taking, also comprise the step of image being carried out to Text region, and in step 3, preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.

The invention also discloses a kind of recording substance searching method, the above-mentioned way of recording of described recording is recorded, comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline, described recording substance searching method comprises the following steps:

Step one, selects in described image;

Include word in described image, Text region is carried out to described image; Whether input or selection keyword, include described keyword in retrieval Text region result, if comprised, then in described step one, selects the image comprising keyword.

Described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline, input or selection keyword, described keyword whether is included in retrieval Text region result, if comprised, then in described step one, select the image comprising keyword.

If described keyword is included in the Text region result of multiple image, then in described step one, select one of them in the plurality of comprising in the image of keyword.

The audio content of the position of described image capture moment on recorded audio timeline refers at least one in following three kinds:

Audio content in the previous time period of the position of described image capture moment on recorded audio timeline;

Audio content after the position of described image capture moment on recorded audio timeline in the time period;

Audio content in the time period of the position of described image capture moment on recorded audio timeline, this audio content contains the position of described image capture moment on recorded audio timeline on described audio timeline.

User adopts the way of recording provided by the present invention to carry out recording audio, carries out the image taking of one or many in the process of recording audio.Because recording audio and shooting image carry out with same place at one time, therefore captured image is inevitable has natural contacting with audio content.Such as, in a meeting, spokesman makes a speech in conjunction with the lantern slide of projection, the speech of user to spokesman is recorded, in the process of recording, user can take the lantern slide of current projection, because the speech content of projection and the spokesman of institute's lantern slide is synchronously carried out, therefore take the moment of often opening lantern slide, necessarily spokesman explains the moment of this lantern slide related content.The audio frequency of recorded spokesman being made a speech, the slide image of shooting and the position of each image capture moment on recorded audio timeline are preserved as recording data together, as shown in Figure 1.In Fig. 1, namely image 1 and image 2 can be captured slide images, and shooting time is respectively 00:30 and 01:40.

When user needs to obtain the fragment of certain in spokesman's speech, namely the image of each lantern slide preserved in recording can be checked, the audio content of the position on the time shaft corresponding to it is learnt from slide image, thus by selecting specific image, be positioned to the recording fragment required for user.

Owing to may comprise word in slide image, user can carry out Text region to each slide image.In recording as shown in Figure 1, image 1 contains the character image of keyword 1, keyword 2 and keyword 3, by carrying out Text region, obtains " keyword 1 ", " keyword 2 " and " keyword 3 " word of the text formatting corresponding with image 1; Image 2 contains the character image of keyword 2, by carrying out Text region, obtains " keyword 2 " word of the text formatting corresponding with image 2.User is when searching for the recording substance needed, can select or input keyword, such as input or selection " keyword 1 ", find through search to include " keyword 1 " in the Text region result of image 1, therefore image 1 is just selected, and be positioned to the recording fragment be engraved in when image 1 is taken on audio timeline, i.e. the recording fragment of 00:30 position.

In addition to the above, user can also making recording in, carry out Text region to image, and by recognition result also with the slide image at its place and corresponding being kept in recording data of the shooting time of this slide image.In recording as shown in Figure 1, when recording by carrying out Text region, obtain " keyword 1 ", " keyword 2 " and " keyword 3 " word of the text formatting corresponding with image 1, and " keyword 2 " word of the text formatting corresponding with image 2, and by these words and the slide image at its place and corresponding being kept in recording data of the shooting time of this slide image.When user carries out recording substance search, also can select or input keyword, such as input or selection " keyword 1 ", find through search to include " keyword 1 " in the Text region result of the image 1 preserved, therefore image 1 is just selected, and is positioned to the recording fragment be engraved in when image 1 is taken on audio timeline.

In the process of above-mentioned keyword search, likely there will be in multiple image and all include same keyword.Such as, if user's input or selection " keyword 2 ", all include " keyword 2 " in the Text region result that search finds image 1 and image 2, now these can be included " keyword 2 " image and be supplied to user, selected in these specific images by user, thus be positioned to the recording substance of user's needs.

In the present invention, the audio content of the position of described image capture moment on recorded audio timeline can be the audio content in the previous time period of the position of described image capture moment on recorded audio timeline, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:20 to the 00:30 time period.

In the present invention, the audio content of the position of described image capture moment on recorded audio timeline can also be the position of described image capture moment on recorded audio timeline after audio content in a time period, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:30 to the 00:40 time period.

In the present invention, audio content in the time period of all right position of described image capture moment on recorded audio timeline of audio content of the position of described image capture moment on recorded audio timeline, this audio content contains the position of described image capture moment on recorded audio timeline on described audio timeline, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:25 to the 00:35 time period.

The invention also discloses a kind of recording device realizing the above-mentioned way of recording and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording device of the present invention also comprises:

Taping component, is used for carrying out audio recording;

Described recording device also comprises the first Text region parts, Text region is carried out to the image of described shooting parts shooting, described first storage component preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.

The invention also discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording substance searcher of the present invention also comprises:

Image selection unit, is used for selecting in described image;

Described recording substance searcher also comprises:

Second Text region parts, carry out Text region to described image;

First keyword parts, input or selection keyword, retrieve in the Text region result of described second Text region parts whether include described keyword, if comprised, then selects by described image selection unit the image comprising keyword.

In described recording substance searcher, store described recording in described second storage component, described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline; Described recording substance searcher also comprises the second keyword parts, and input or selection keyword, retrieve in described each pictograph recognition result whether include described keyword, if comprised, then selects by described image selection unit the image comprising keyword.

The present invention discloses a kind of way of recording again, comprises the following steps:

Step one, adopts sound pick-up outfit to carry out audio recording;

The present invention discloses a kind of recording substance searching method again, the above-mentioned way of recording of described recording is recorded, comprise the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word, described recording substance searching method comprises the following steps:

Also comprise input in the step of described recording substance searching method or select keyword, retrieving in the word that described identification obtains whether include described keyword, if comprised, then in described step one, selecting the word that the described identification comprising keyword obtains.

If described keyword is included in the word that the many groups identification corresponding to diverse location on audio timeline obtains, then, in described step one, in the word that the plurality of many groups identification comprising keyword obtains, select a group wherein.

The audio content of the position on the audio timeline that described word is corresponding refers at least one in following three kinds:

Similar to previous embodiment, user adopts the way of recording provided by the present invention to carry out recording audio, carries out the image taking of one or many in the process of recording audio, and carries out Text region to captured image, obtains Text region result.Because recording audio and shooting image carry out with same place at one time, therefore captured image is inevitable has natural contacting with audio content, and identifying in image that the word obtained is also inevitable has natural contacting with audio content.Such as, in a meeting, spokesman makes a speech in conjunction with the lantern slide of projection, the speech of user to spokesman is recorded, in the process of recording, user can take the lantern slide of current projection, because the speech content of projection and the spokesman of institute's lantern slide is synchronously carried out, therefore take the moment of often opening lantern slide, necessarily spokesman explains the moment of this lantern slide related content.Text region is carried out by often opening slide image, and identify each word of obtaining be engraved in when being corresponded to the shooting of the image at its place the position on the audio timeline recorded, the audio frequency that recorded spokesman is made a speech, identify corresponding to the word that obtains and each word audio timeline on position preserve as recording data together, as shown in Figure 3.In Fig. 3, user have taken an image at 00:30, and identification obtains " keyword 1 ", " keyword 2 " and keyword 3; User have taken an image at 01:40, and identification obtains " keyword 2 ".

When user needs to obtain the fragment of certain in spokesman's speech, namely the word that the identification of preserving in recording obtains can be checked, the audio content of the position on the time shaft corresponding to it can be learnt from these words, thus by selecting specific word, be positioned to the recording fragment required for user.

User is when searching for the recording substance needed, can select or input keyword, such as input or selection " keyword 1 ", then " keyword 1 " in Fig. 3 is just selected, and the recording fragment be engraved in when being positioned to " keyword 1 " shooting on audio timeline, i.e. the recording fragment of 00:30 position.

In the process of above-mentioned keyword search, likely there will be in multiple image and all include same keyword.Such as, if user's input or selection " keyword 2 ", all include " keyword 2 " in search finds the word that identification corresponding to recording 00:30 position and 00:40 position obtains, the word that many groups identification that now these can be included " keyword 2 " obtains all is supplied to user, selected in these specific group of text by user, thus be positioned to the recording substance of user's needs.

In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the audio content in the previous time period of position on audio timeline that described word is corresponding, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:20 to the 00:30 time period.

In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the position on audio timeline that described word is corresponding after audio content in a time period, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:30 to the 00:40 time period.

In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the audio content in the time period of position on audio timeline that described word is corresponding, this audio content contains the position on audio timeline corresponding to described word on described audio timeline, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:25 to the 00:35 time period.

The present invention again discloses a kind of recording device realizing the above-mentioned way of recording and adopt, and based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording device of the present invention also comprises:

Taping component, is used for carrying out audio recording;

The present invention again discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording substance searcher of the present invention also comprises:

Described recording substance searcher, also comprise: the 3rd keyword parts, input or select keyword, retrieve in the word that described identification obtains whether include described keyword, if comprised, then the word selecting the described identification comprising keyword to obtain by described word alternative pack.

The foregoing is only preferred embodiment of the present invention, and be not used to limit substantial technological context of the present invention, substantial technological content of the present invention be broad sense be defined in the right of application, any technology entities that other people complete or method, if with application right define identical, also or a kind of change of equivalence, be all covered by being regarded as among this right.

Claims

1. a way of recording, is characterized in that, comprises the following steps:

Step one, adopts recording device to carry out audio recording;

2. the way of recording according to claim 1, is characterized in that, includes word in described image.

3. the way of recording according to claim 2, it is characterized in that, after carrying out image taking, also comprise the step of image being carried out to Text region, and in step 3, preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.

4. realize as the way of recording in claim 1-3 as described in any one the recording device that adopts, it is characterized in that, based on computer system, comprising:

Taping component, is used for carrying out audio recording;

5. recording device according to claim 4, it is characterized in that, also comprise the first Text region parts, Text region is carried out to the image of described shooting parts shooting, described first storage component preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.

6. a recording substance searching method, it is characterized in that, described recording is recorded according to the way of recording in claim 1-3 described in any one, comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline, described recording substance searching method comprises the following steps:

Step one, selects in described image;

7. recording substance searching method according to claim 6, is characterized in that, includes word in described image, carries out Text region to described image; Whether input or selection keyword, include described keyword in retrieval Text region result, if comprised, then in described step one, selects the image comprising keyword.

8. recording substance searching method according to claim 6, it is characterized in that, described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline, input or selection keyword, described keyword whether is included in retrieval Text region result, if comprised, then in described step one, select the image comprising keyword.

9. the recording substance searching method according to claim 7 or 8, is characterized in that, if described keyword is included in the Text region result of multiple image, then in described step one, selects one of them in the plurality of comprising in the image of keyword.

10. recording substance searching method according to claim 6, is characterized in that, the audio content of the position of described image capture moment on recorded audio timeline refers at least one in following three kinds:

11. 1 kinds realize as the recording substance searching method in claim 6-10 as described in any one the recording substance searcher that adopts, it is characterized in that, based on computer system, comprising:

Image selection unit, is used for selecting in described image;

12. recording substance searchers according to claim 11, is characterized in that, also comprise:

Second Text region parts, carry out Text region to described image;

13. recording substance searchers according to claim 11, it is characterized in that, store described recording in described second storage component, described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline; Described recording substance searcher also comprises the second keyword parts, and input or selection keyword, retrieve in described each pictograph recognition result whether include described keyword, if comprised, then selects by described image selection unit the image comprising keyword.

14. 1 kinds of ways of recording, is characterized in that, comprise the following steps:

Step one, adopts sound pick-up outfit to carry out audio recording;

15. 1 kinds of recording devices realizing the way of recording as claimed in claim 14 and adopt, is characterized in that, based on computer system, comprising:

Taping component, is used for carrying out audio recording;

16. 1 kinds of recording substance searching methods, it is characterized in that, the described recording way of recording according to claim 14 is recorded, and comprise the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word, described recording substance searching method comprises the following steps:

17. recording substance searching methods according to claim 16, it is characterized in that input or select keyword retrieves in the word that described identification obtains whether include described keyword, if comprised, then in described step one, select the word that the described identification comprising keyword obtains.

18. recording substance searching methods according to claim 17, it is characterized in that, if described keyword is included in the word that the many groups identification corresponding to diverse location on audio timeline obtains, then in described step one, in the word that the plurality of many groups identification comprising keyword obtains, select a group wherein.

19. recording substance searching methods according to claim 16, is characterized in that, the audio content of the position on the audio timeline that described word is corresponding refers at least one in following three kinds:

20. 1 kinds realize as the recording substance searching method in claim 16-19 as described in any one the recording substance searcher that adopts, it is characterized in that, based on computer system, comprising:

21. recording substance searchers according to claim 20, is characterized in that, also comprise:

3rd keyword parts, input or select keyword, retrieves in the word that described identification obtains whether include described keyword, if comprised, then the word selecting the described identification comprising keyword to obtain by described word alternative pack.