Summary of the invention
Technical matters to be solved by this invention is to provide a kind of way of recording, and realize the recording device that this way of recording adopts, the recording data being convenient to carry out searching for location to recorded audio data content can be produced, the present invention also will provide a kind of recording substance searching method, and realize the recording substance searcher that this recording substance searching method adopts, can search for recording substance easily, help user navigates to the position in the recording data of needs fast.
For solving the problems of the technologies described above, the technical scheme of the way of recording of the present invention comprises the following steps:
Step one, adopts recording device to carry out audio recording;
Step 2, described recording device includes image pickup section, uses described recording device to carry out the image taking of one or many in the process of audio recording, record each shooting image time be engraved in position on recorded audio timeline;
Step 3, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.
The invention discloses a kind of recording device realizing the above-mentioned way of recording and adopt, its technical scheme is, based on computer system, comprising:
Taping component, is used for carrying out audio recording;
Shooting parts, are used for carrying out image taking in the process of audio recording, and record each shooting image time be engraved in position on recorded audio timeline;
First storage component, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.
The invention also discloses a kind of recording searching method, its technical scheme is, the above-mentioned way of recording of described recording is recorded, comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline, described recording substance searching method comprises the following steps:
Step one, selects in described image;
Step 2, according to selected image, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.
The invention also discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, its technical scheme is, based on computer system, comprising:
Second storage component, stores described recording, described recording comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline;
Image selection unit, is used for selecting in described image;
First search parts, according to the image that described image selection unit is selected, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.
The present invention discloses a kind of way of recording again, and its technical scheme comprises the following steps:
Step one, adopts sound pick-up outfit to carry out audio recording;
Step 2, described sound pick-up outfit includes image pickup section, in the process of audio recording, use described sound pick-up outfit to carry out the image taking of one or many, in captured image, include word, record each shooting image time be engraved in position on recorded audio timeline;
Step 3, Text region is carried out in captured image, to identify that each word of obtaining is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place, preserve the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.
The present invention discloses a kind of recording device realizing the above-mentioned way of recording and adopt again, and its technical scheme is, based on computer system, comprising:
Taping component, is used for carrying out audio recording;
Shooting parts, are used for carrying out image taking in the process of audio recording, and record each shooting image time be engraved in position on recorded audio timeline;
3rd Text region parts, carry out Text region to the image of described shooting parts shooting, will identify that each word obtained is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place;
3rd memory unit, preserves the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.
The present invention again discloses a kind of recording substance searching method, its technical scheme is, described recording is recorded according to the above-mentioned way of recording, comprise the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word, described recording substance searching method comprises the following steps:
Step one, the word that the word selecting an identification to obtain or the one group of identification corresponding to same position on audio timeline obtain;
Step 2, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.
The present invention again discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, and its technical scheme is, based on computer system, comprising:
4th memory unit, stores described recording, and described recording comprises the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word,
Word alternative pack, is used for word that selection identification obtains or the word that one group of identification corresponding to same position on audio timeline obtains;
Second search parts, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.
The present invention passes through technique scheme, the image relevant to recording substance is obtained by the mode of shooting when recording, and the foundation that the information recorded on this image or this image is searched for as recording substance, achieve the search of recording substance, step is easy, be easy to use, and accuracy rate is high.
Embodiment
The invention discloses a kind of way of recording, comprise the following steps:
Step one, adopts recording device to carry out audio recording;
Step 2, described recording device includes image pickup section, uses described recording device to carry out the image taking of one or many in the process of audio recording, record each shooting image time be engraved in position on recorded audio timeline;
Step 3, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.
Word is included in described image.
After carrying out image taking, also comprise the step of image being carried out to Text region, and in step 3, preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.
The invention also discloses a kind of recording substance searching method, the above-mentioned way of recording of described recording is recorded, comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline, described recording substance searching method comprises the following steps:
Step one, selects in described image;
Step 2, according to selected image, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.
Include word in described image, Text region is carried out to described image; Whether input or selection keyword, include described keyword in retrieval Text region result, if comprised, then in described step one, selects the image comprising keyword.
Described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline, input or selection keyword, described keyword whether is included in retrieval Text region result, if comprised, then in described step one, select the image comprising keyword.
If described keyword is included in the Text region result of multiple image, then in described step one, select one of them in the plurality of comprising in the image of keyword.
The audio content of the position of described image capture moment on recorded audio timeline refers at least one in following three kinds:
Audio content in the previous time period of the position of described image capture moment on recorded audio timeline;
Audio content after the position of described image capture moment on recorded audio timeline in the time period;
Audio content in the time period of the position of described image capture moment on recorded audio timeline, this audio content contains the position of described image capture moment on recorded audio timeline on described audio timeline.
User adopts the way of recording provided by the present invention to carry out recording audio, carries out the image taking of one or many in the process of recording audio.Because recording audio and shooting image carry out with same place at one time, therefore captured image is inevitable has natural contacting with audio content.Such as, in a meeting, spokesman makes a speech in conjunction with the lantern slide of projection, the speech of user to spokesman is recorded, in the process of recording, user can take the lantern slide of current projection, because the speech content of projection and the spokesman of institute's lantern slide is synchronously carried out, therefore take the moment of often opening lantern slide, necessarily spokesman explains the moment of this lantern slide related content.The audio frequency of recorded spokesman being made a speech, the slide image of shooting and the position of each image capture moment on recorded audio timeline are preserved as recording data together, as shown in Figure 1.In Fig. 1, namely image 1 and image 2 can be captured slide images, and shooting time is respectively 00:30 and 01:40.
When user needs to obtain the fragment of certain in spokesman's speech, namely the image of each lantern slide preserved in recording can be checked, the audio content of the position on the time shaft corresponding to it is learnt from slide image, thus by selecting specific image, be positioned to the recording fragment required for user.
Owing to may comprise word in slide image, user can carry out Text region to each slide image.In recording as shown in Figure 1, image 1 contains the character image of keyword 1, keyword 2 and keyword 3, by carrying out Text region, obtains " keyword 1 ", " keyword 2 " and " keyword 3 " word of the text formatting corresponding with image 1; Image 2 contains the character image of keyword 2, by carrying out Text region, obtains " keyword 2 " word of the text formatting corresponding with image 2.User is when searching for the recording substance needed, can select or input keyword, such as input or selection " keyword 1 ", find through search to include " keyword 1 " in the Text region result of image 1, therefore image 1 is just selected, and be positioned to the recording fragment be engraved in when image 1 is taken on audio timeline, i.e. the recording fragment of 00:30 position.
In addition to the above, user can also making recording in, carry out Text region to image, and by recognition result also with the slide image at its place and corresponding being kept in recording data of the shooting time of this slide image.In recording as shown in Figure 1, when recording by carrying out Text region, obtain " keyword 1 ", " keyword 2 " and " keyword 3 " word of the text formatting corresponding with image 1, and " keyword 2 " word of the text formatting corresponding with image 2, and by these words and the slide image at its place and corresponding being kept in recording data of the shooting time of this slide image.When user carries out recording substance search, also can select or input keyword, such as input or selection " keyword 1 ", find through search to include " keyword 1 " in the Text region result of the image 1 preserved, therefore image 1 is just selected, and is positioned to the recording fragment be engraved in when image 1 is taken on audio timeline.
In the process of above-mentioned keyword search, likely there will be in multiple image and all include same keyword.Such as, if user's input or selection " keyword 2 ", all include " keyword 2 " in the Text region result that search finds image 1 and image 2, now these can be included " keyword 2 " image and be supplied to user, selected in these specific images by user, thus be positioned to the recording substance of user's needs.
In the present invention, the audio content of the position of described image capture moment on recorded audio timeline can be the audio content in the previous time period of the position of described image capture moment on recorded audio timeline, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:20 to the 00:30 time period.
In the present invention, the audio content of the position of described image capture moment on recorded audio timeline can also be the position of described image capture moment on recorded audio timeline after audio content in a time period, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:30 to the 00:40 time period.
In the present invention, audio content in the time period of all right position of described image capture moment on recorded audio timeline of audio content of the position of described image capture moment on recorded audio timeline, this audio content contains the position of described image capture moment on recorded audio timeline on described audio timeline, the image such as selected is image 1, then above-mentioned audio content is the audio content in 00:25 to the 00:35 time period.
The invention also discloses a kind of recording device realizing the above-mentioned way of recording and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording device of the present invention also comprises:
Taping component, is used for carrying out audio recording;
Shooting parts, are used for carrying out image taking in the process of audio recording, and record each shooting image time be engraved in position on recorded audio timeline;
First storage component, the audio frequency that preservation is recorded, the image of shooting and the position of each image capture moment on recorded audio timeline.
Described recording device also comprises the first Text region parts, Text region is carried out to the image of described shooting parts shooting, described first storage component preserve record audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline.
The invention also discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording substance searcher of the present invention also comprises:
Second storage component, stores described recording, described recording comprise record to some extent audio frequency, image and each position of described image capture moment on recorded audio timeline;
Image selection unit, is used for selecting in described image;
First search parts, according to the image that described image selection unit is selected, is positioned to the audio content of the position of this image capture moment on recorded audio timeline.
Described recording substance searcher also comprises:
Second Text region parts, carry out Text region to described image;
First keyword parts, input or selection keyword, retrieve in the Text region result of described second Text region parts whether include described keyword, if comprised, then selects by described image selection unit the image comprising keyword.
In described recording substance searcher, store described recording in described second storage component, described recording comprise record to some extent audio frequency, the image of shooting, each pictograph recognition result and the position of each image capture moment on recorded audio timeline; Described recording substance searcher also comprises the second keyword parts, and input or selection keyword, retrieve in described each pictograph recognition result whether include described keyword, if comprised, then selects by described image selection unit the image comprising keyword.
The present invention discloses a kind of way of recording again, comprises the following steps:
Step one, adopts sound pick-up outfit to carry out audio recording;
Step 2, described sound pick-up outfit includes image pickup section, in the process of audio recording, use described sound pick-up outfit to carry out the image taking of one or many, in captured image, include word, record each shooting image time be engraved in position on recorded audio timeline;
Step 3, Text region is carried out in captured image, to identify that each word of obtaining is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place, preserve the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.
The present invention discloses a kind of recording substance searching method again, the above-mentioned way of recording of described recording is recorded, comprise the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word, described recording substance searching method comprises the following steps:
Step one, the word that the word selecting an identification to obtain or the one group of identification corresponding to same position on audio timeline obtain;
Step 2, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.
Also comprise input in the step of described recording substance searching method or select keyword, retrieving in the word that described identification obtains whether include described keyword, if comprised, then in described step one, selecting the word that the described identification comprising keyword obtains.
If described keyword is included in the word that the many groups identification corresponding to diverse location on audio timeline obtains, then, in described step one, in the word that the plurality of many groups identification comprising keyword obtains, select a group wherein.
The audio content of the position on the audio timeline that described word is corresponding refers at least one in following three kinds:
Audio content in the previous time period of the position of described image capture moment on recorded audio timeline;
Audio content after the position of described image capture moment on recorded audio timeline in the time period;
Audio content in the time period of the position of described image capture moment on recorded audio timeline, this audio content contains the position of described image capture moment on recorded audio timeline on described audio timeline.
Similar to previous embodiment, user adopts the way of recording provided by the present invention to carry out recording audio, carries out the image taking of one or many in the process of recording audio, and carries out Text region to captured image, obtains Text region result.Because recording audio and shooting image carry out with same place at one time, therefore captured image is inevitable has natural contacting with audio content, and identifying in image that the word obtained is also inevitable has natural contacting with audio content.Such as, in a meeting, spokesman makes a speech in conjunction with the lantern slide of projection, the speech of user to spokesman is recorded, in the process of recording, user can take the lantern slide of current projection, because the speech content of projection and the spokesman of institute's lantern slide is synchronously carried out, therefore take the moment of often opening lantern slide, necessarily spokesman explains the moment of this lantern slide related content.Text region is carried out by often opening slide image, and identify each word of obtaining be engraved in when being corresponded to the shooting of the image at its place the position on the audio timeline recorded, the audio frequency that recorded spokesman is made a speech, identify corresponding to the word that obtains and each word audio timeline on position preserve as recording data together, as shown in Figure 3.In Fig. 3, user have taken an image at 00:30, and identification obtains " keyword 1 ", " keyword 2 " and keyword 3; User have taken an image at 01:40, and identification obtains " keyword 2 ".
When user needs to obtain the fragment of certain in spokesman's speech, namely the word that the identification of preserving in recording obtains can be checked, the audio content of the position on the time shaft corresponding to it can be learnt from these words, thus by selecting specific word, be positioned to the recording fragment required for user.
User is when searching for the recording substance needed, can select or input keyword, such as input or selection " keyword 1 ", then " keyword 1 " in Fig. 3 is just selected, and the recording fragment be engraved in when being positioned to " keyword 1 " shooting on audio timeline, i.e. the recording fragment of 00:30 position.
In the process of above-mentioned keyword search, likely there will be in multiple image and all include same keyword.Such as, if user's input or selection " keyword 2 ", all include " keyword 2 " in search finds the word that identification corresponding to recording 00:30 position and 00:40 position obtains, the word that many groups identification that now these can be included " keyword 2 " obtains all is supplied to user, selected in these specific group of text by user, thus be positioned to the recording substance of user's needs.
In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the audio content in the previous time period of position on audio timeline that described word is corresponding, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:20 to the 00:30 time period.
In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the position on audio timeline that described word is corresponding after audio content in a time period, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:30 to the 00:40 time period.
In the present invention, the audio content of the position on the audio timeline that described word is corresponding can be the audio content in the time period of position on audio timeline that described word is corresponding, this audio content contains the position on audio timeline corresponding to described word on described audio timeline, the word such as selected is " keyword 1 ", then above-mentioned audio content is the audio content in 00:25 to the 00:35 time period.
The present invention again discloses a kind of recording device realizing the above-mentioned way of recording and adopt, and based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording device of the present invention also comprises:
Taping component, is used for carrying out audio recording;
Shooting parts, are used for carrying out image taking in the process of audio recording, and record each shooting image time be engraved in position on recorded audio timeline;
3rd Text region parts, carry out Text region to the image of described shooting parts shooting, will identify that each word obtained is engraved in the position on recorded audio timeline when corresponding to the shooting of the image at its place;
3rd memory unit, preserves the audio frequency recorded, the position identified on the word that obtains and the audio timeline corresponding to each word.
The present invention again discloses a kind of recording substance searcher realizing above-mentioned recording substance searching method and adopt, based on computer system, described computer system can be PC, also can be smart mobile phone, as shown in Figure 2, can also be panel computer, recording substance searcher of the present invention also comprises:
4th memory unit, stores described recording, and described recording comprises the audio frequency recorded to some extent, the position identified on the word that obtains and the audio timeline corresponding to each word,
Word alternative pack, is used for word that selection identification obtains or the word that one group of identification corresponding to same position on audio timeline obtains;
Second search parts, according to selected word, is positioned to the audio content of the position on audio timeline corresponding to this word.
Described recording substance searcher, also comprise: the 3rd keyword parts, input or select keyword, retrieve in the word that described identification obtains whether include described keyword, if comprised, then the word selecting the described identification comprising keyword to obtain by described word alternative pack.
The present invention passes through technique scheme, the image relevant to recording substance is obtained by the mode of shooting when recording, and the foundation that the information recorded on this image or this image is searched for as recording substance, achieve the search of recording substance, step is easy, be easy to use, and accuracy rate is high.
The foregoing is only preferred embodiment of the present invention, and be not used to limit substantial technological context of the present invention, substantial technological content of the present invention be broad sense be defined in the right of application, any technology entities that other people complete or method, if with application right define identical, also or a kind of change of equivalence, be all covered by being regarded as among this right.