CN1949227A - Searching method, system and apparatus for playing media file - Google Patents

Publication number
CN1949227A
Authority
CN
China
Prior art keywords
media file
playable media
fragment
file
time code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200610150708
Other languages
Chinese (zh)
Other versions
CN100501738C (en)
Inventor
王喆
贾玉龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CNB2006101507080A priority Critical patent/CN100501738C/en
Publication of CN1949227A publication Critical patent/CN1949227A/en
Application granted granted Critical
Publication of CN100501738C publication Critical patent/CN100501738C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a search method for playable media files, comprising the following steps: an association step of establishing an association between an external time code file and a playable media file on the Internet; a cutting scheme determination step of analyzing the external time code file to determine the required fragment and, from the start and end time points of that fragment, obtaining a cutting scheme for the associated playable media file; and an information acquisition step of obtaining the fragment, or combination of fragments, of the associated playable media file according to the cutting scheme. By using the external time code file to obtain or analyze the content of each associated playable media file, the invention cuts out finer and more accurate content fragments or combinations of them, which improves the search for playable media files, enriches the search results, and increases the accuracy and personalization of media file search.

Description

A search method, system and apparatus for playable media files
Technical field
The present invention relates to the field of Internet information processing, and in particular to a search method and system for playable media files on the Internet, a method for building a search database for playable media files, and a search server and an information acquisition client for playable media files.
Background art
Retrieving information and resources from the Internet through a search engine has become one of the fastest and most effective ways to obtain them. With a search engine, people can find the content they need, such as news, films and music, almost instantly. Retrieval of playable media files is among the most frequent uses.
Users get results quickly because the search engine has already collected and organized Internet resources. When a user enters a keyword, the search engine selects matching content from what it has collected and organized and presents it to the user. Collecting and organizing Internet resources is therefore the precondition and basis for presenting retrieved content correctly.
In the prior art, playable media files are retrieved as follows: the search engine collects textual information from the link information or context of a playable media file, such as the song title and the singer, and obtains the link to the corresponding playable media file; then, according to the user's search keyword, it presents the corresponding playable media file links in the user's search result list. Clearly, when a search engine of this kind searches playable media content on the Internet, it finds only whole-file links and usually cannot obtain information about fragments of a playable media file.
Moreover, when a user wants only a certain fragment of a playable media file, for example a fragment of an audio file to use as a mobile phone ringtone or ring-back tone, the prior art offers only audio files that have already been cut. Such files, physically cut with software in advance and then uploaded, are rare in search results, and the part that was cut out is often not the part the user wants.
Chinese patent application No. 200510097847.7 discloses a music search system and a music search device. The system comprises a music search device and a music search terminal. The music search device comprises: an input unit for inputting a timing signal represented by on/off signals; a data storage unit for storing a plurality of pieces of rhythm information in association with music-related information, the music-related information relating to the music that corresponds to the rhythm information; a search unit for searching the pieces of rhythm information stored in the data storage unit for rhythm information whose fluctuation pattern is identical or similar to that of the timing signal input to the input unit; and a search result output unit for reading from the data storage unit the music-related information stored in association with the rhythm information found by the search unit, and outputting it as the search result. The music search terminal comprises: a communication unit for communicating with the music search device over a communication line; an operating unit for inputting the timing signal to the input unit through the communication unit; a receiving unit for receiving, through the communication unit, the search result for the input timing signal from the search result output unit; and a display unit for displaying the received search result.
However, although that invention subdivides playable media files, its music search terminal is a mobile phone, and its purpose is to make a music search device based on rhythm input easier to use and to implement. Its scope is therefore limited, and it is not suited to obtaining fragment information of playable media files on the Internet according to user requirements or other requirements.
In short, in existing playable media file search, a technical problem that those skilled in the art urgently need to solve is how to build a search database that segments playable media files, obtains the resources of each part, and combines them into the corresponding search target according to user needs or other requirements, so as to satisfy users' differing needs.
Summary of the invention
The technical problem to be solved by the present invention is to provide a search method and system for playable media files, so as to solve the prior-art problem that one or more fragments of a playable media file cannot be provided according to user needs.
Another object of the present invention is to provide a method for building a search database for playable media files, so that one or more fragments, or fragment combinations, of a playable media file can be provided according to user needs.
A further object of the present invention is to provide a search server and an information acquisition client for playable media files that can segment playable media files, obtain the resources of each part, and combine them into the corresponding search target according to user needs or other requirements.
To solve the above technical problem, the present invention discloses a search method for playable media files, comprising the following steps:
Association step: establishing an association between an external time code file and a playable media file on the Internet;
Cutting scheme determination step: analyzing the external time code file to determine the required fragment of the external time code file, and obtaining, from the start and end time points of the determined fragment, a cutting scheme for the playable media file associated with it;
Information acquisition step: obtaining the fragment or fragment combination of the associated playable media file according to the cutting scheme.
The association step and the cutting scheme determination step may both be completed at the server side, in which case the cutting scheme comprises the link to the associated playable media file and the information acquisition endpoints; the client obtains the playable media file link and the information acquisition endpoints that the user needs, and then completes the information acquisition. Preferably, the endpoints are the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file; alternatively, the endpoints are the start and end positions, in the associated playable media file, of the data packets or bytes that correspond to that fragment.
In another implementation, the association step and the cutting scheme determination step are both completed at the server side, and the cutting scheme comprises links to fragments or fragment combinations of the associated playable media file; these links are obtained by actually cutting the associated playable media file according to the fragments of the external time code file and storing the results separately. The client obtains the link to the fragment or fragment combination that the user needs, and then completes the information acquisition.
In yet another feasible implementation, the cutting scheme determination step and the information acquisition step are both completed at the client; before the cutting scheme determination step, the client obtains the playable media file link that the user needs and the external time code file associated with it.
Preferably, external time code files are associated with playable media files by text clustering. The association step may further comprise: selecting, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
Preferably, the analysis is text analysis, semantic analysis, language model analysis or word segmentation analysis of the external time code file. For example, when the climax fragment of an audio file is to be found, the analysis proceeds as follows: identification parameters, including the number of repetitions, are determined from the external time code file by text analysis; each fragment of the external time code file is then given a weight according to the identification parameters, and the weight determines the priority with which the fragment is treated as the climax fragment.
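The repetition-based weighting can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `rank_fragments`, the tuple layout, and the use of the raw repetition count as the weight are hypothetical, since the source names the number of repetitions as one identification parameter without fixing a formula.

```python
from collections import Counter


def rank_fragments(fragments):
    """Rank time code fragments as climax candidates.

    fragments: list of (start_seconds, end_seconds, text) tuples.
    Returns a list of (weight, start_seconds, end_seconds, text) sorted by
    descending weight, then by start time. The weight (an assumption of this
    sketch) is the number of times the fragment's text occurs in the file.
    """
    def normalize(text):
        return text.strip().lower()

    counts = Counter(normalize(text) for _, _, text in fragments)
    weighted = [
        (counts[normalize(text)], start, end, text)
        for start, end, text in fragments
    ]
    weighted.sort(key=lambda w: (-w[0], w[1]))
    return weighted
```

The highest-ranked entries give the start and end time points from which a cutting scheme for the associated media file could then be derived.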
In the above method, the acquisition is either a download to the local machine or online playback.
The present invention also provides a search system for playable media files, comprising:
an association unit, configured to establish an association between an external time code file and a playable media file on the Internet;
a cutting scheme determination unit, configured to analyze the external time code file, determine the required fragment of the external time code file, and obtain, according to the determined fragment, a cutting scheme for the playable media file associated with it;
an information acquisition unit, configured to obtain the fragment or fragment combination of the associated playable media file according to the cutting scheme.
Preferably, the association unit and the cutting scheme determination unit are located at the server side, and the cutting scheme comprises the link to the associated playable media file and the information acquisition endpoints. The information acquisition unit is located in the client, which connects to the server side and obtains the playable media file link and information acquisition endpoints that the user needs. The endpoints are the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file; alternatively, the endpoints are the start and end positions, in the associated playable media file, of the data packets or bytes that correspond to that fragment.
In another feasible implementation, the association unit and the cutting scheme determination unit are located at the server side; the cutting scheme comprises links to fragments or fragment combinations of the associated playable media file, the links being obtained by actually cutting the associated playable media file according to the fragments of the external time code file and storing the results separately; the information acquisition unit is located in the client, which connects to the server side, obtains the link to the fragment or fragment combination that the user needs, and then completes the information acquisition.
In yet another feasible implementation, the association unit is located at the server side, and the cutting scheme determination unit and the information acquisition unit are located in the client, which connects to the server side and obtains the playable media file link that the user needs and the external time code file associated with it.
Preferably, the association unit associates external time code files with playable media files by text clustering. The association unit may further comprise an external time code file consolidation module, configured to select, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
The invention also discloses a method for building a search database for playable media files, comprising the following steps:
Association step: establishing an association between an external time code file and a playable media file on the Internet;
Cutting scheme determination step: analyzing the external time code file to determine the required fragment of the external time code file, and obtaining, according to the determined fragment, a cutting scheme for the playable media file associated with it;
Storing step: storing the cutting scheme in the search database, the cutting scheme being provided with a search identifier.
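As a rough sketch of the storing step, the Python example below keeps cutting schemes in an in-memory SQLite table keyed by a search identifier. The table name, the column names, the example URLs and the choice of SQLite are all assumptions made for illustration; the source does not specify a storage engine or schema.

```python
import sqlite3


def build_database(schemes):
    """Create an in-memory search database of cutting schemes.

    schemes: iterable of (search_id, media_url, start_s, end_s) tuples,
    where search_id is the search identifier attached to the scheme.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE cutting_scheme ("
        "search_id TEXT, media_url TEXT, start_s REAL, end_s REAL)"
    )
    conn.executemany(
        "INSERT INTO cutting_scheme VALUES (?, ?, ?, ?)", schemes
    )
    conn.commit()
    return conn


def lookup(conn, search_id):
    """Return (media_url, start_s, end_s) rows matching a search identifier."""
    cur = conn.execute(
        "SELECT media_url, start_s, end_s FROM cutting_scheme "
        "WHERE search_id = ? ORDER BY start_s",
        (search_id,),
    )
    return cur.fetchall()
```

Here each row is a virtual cutting scheme (a media link plus start and end time points); a scheme based on actual cutting would store the link to the separately stored fragment instead.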
The cutting scheme may comprise the link to the associated playable media file and the information acquisition endpoints. Alternatively, the cutting scheme may comprise links to the playable media file fragments or fragment combinations obtained by actually cutting the associated playable media file according to the fragments of the external time code file.
The present invention also claims a search server for playable media files, comprising:
a first memory, configured to store the association information between external time code files and playable media files on the Internet, and a first cutting scheme for a playable media file, the first cutting scheme comprising the link to the associated playable media file and the information acquisition endpoints, the endpoints being obtained by analyzing the external time code file;
a querier, configured to retrieve the corresponding cutting scheme according to the user's query conditions;
an interface unit, configured to receive the user's query request and return the corresponding cutting scheme.
The endpoints may be the start and end positions, in the associated playable media file, of the data packets or bytes that correspond to the fragment of the external time code file.
The endpoints may also be the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file; in that case the search server further comprises a start/end position calculation unit, configured to calculate, from the start and end time points and the compression format of the playable media file, the actual storage start and end positions of the corresponding data packets or bytes in the playable media file.
Alternatively, where the endpoints are the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file, the search server further comprises a start/end position determination unit, configured to call the corresponding player program, first read the file header information and part of the media information of the associated playable media file, then seek according to the start and end time points and obtain byte offset information, thereby determining the actual storage start and end positions of the corresponding data packets or bytes in the playable media file.
Preferably, the search server may further comprise a second memory, configured to store a second cutting scheme, the second cutting scheme comprising links to fragments or fragment combinations of the associated playable media file, the links being obtained by actually cutting the associated playable media file according to the fragments of the external time code file and storing the results separately.
Preferably, the search server may further comprise a download unit, configured to obtain the required fragments according to the user's query request and either generate a corresponding playlist and return it to the client together with the fragments, or combine the fragments and return the combination to the client.
The present invention also provides an information acquisition client for playable media files, comprising:
a cutting scheme acquisition unit, configured to receive a cutting scheme for a playable media file, the cutting scheme comprising the link to the associated playable media file and the information acquisition endpoints, the endpoints being the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file;
a start/end position determination unit, configured to determine, from the start and end time points, the actual storage start and end positions of the corresponding data packets or bytes in the playable media file;
an information acquisition unit, configured to complete the information acquisition.
The start and end positions may be determined by calculating the actual storage start and end positions of the corresponding data packets or bytes in the playable media file from the start and end time points and the compression format of the playable media file.
The start and end positions may also be determined by calling the corresponding player program, first reading the file header information and part of the media information of the associated playable media file, then seeking according to the start and end time points and obtaining byte offset information, thereby determining the actual storage start and end positions of the corresponding data packets or bytes in the playable media file.
Preferably, the information acquisition client may further comprise: an integrator, configured to integrate a plurality of locally downloaded playable media file fragments; a wrapper, configured to encapsulate the locally downloaded playable media file fragments as a new playable media file in the required compression format; and a cutting scheme determination unit, configured to analyze the received external time code file, determine the required fragment of the external time code file, and obtain, according to the determined fragment, a cutting scheme for the playable media file associated with it.
When the climax fragment of an audio file is to be found, the cutting scheme determination unit specifically comprises: a text analysis module, configured to determine identification parameters, including the number of repetitions, from the external time code file by text analysis; a weighting module, configured to give each fragment of the external time code file a weight according to the identification parameters, the weight determining the priority with which the fragment is treated as the climax fragment; and a cutting scheme determination module, configured to obtain, according to the determined fragment of the external time code file, a cutting scheme for the playable media file associated with it.
The information acquisition client may further comprise a playlist generation unit, configured to create a playlist according to which a corresponding player can play the fragments of a plurality of playable media files continuously.
Compared with the prior art, the present invention has the following advantages:
The present invention uses the external time code file to obtain or analyze the content of each associated playable media file, and cuts out more detailed and more accurate content fragments, or combinations of fragments, so that playable media files are searched more effectively and the search results are richer.
Furthermore, the present invention can greatly improve the accuracy of searching playable media resources: it can subdivide a playable media file, obtain the resources of each part, and combine them into the corresponding search target on demand, satisfying users' differing needs and thereby improving the accuracy and personalization of media file search.
Brief description of the drawings
Fig. 1 is a flowchart of an embodiment of the search method for playable media files according to the present invention;
Fig. 2 is a flowchart of searching for the climax part of an audio file using the method shown in Fig. 1;
Fig. 3 is a schematic diagram of the search result information returned in the embodiment shown in Fig. 2;
Fig. 4 is a schematic diagram of the lyrics information returned in the embodiment shown in Fig. 2;
Figs. 5a and 5b are structural block diagrams of embodiments of the search system for playable media files according to the present invention;
Fig. 6 is a flowchart of an embodiment of the method for building a search database for playable media files according to the present invention;
Fig. 7 is a structural block diagram of an embodiment of the search server for playable media files;
Fig. 8 is a structural block diagram of an embodiment of an information acquisition client according to the present invention.
Detailed description of the embodiments
Before the method of the present invention is described with reference to specific embodiments, several concepts related to playable media files are introduced.
A playable media file on the Internet is digital playable media stored as data on a computer-readable medium. The playable media is a data sequence that is discrete in time: through sampling and quantization, an analog media signal is converted into a digital media signal composed of many binary digits 1 and 0. Playable media files on the Internet typically include audio files, video files, Flash files and so on; for example, files with the common suffixes swf, mpg, mp4, rm, mp3, wma, wmv and asf are types of playable media file.
A time code is the unique time encoding recorded for each section or segment of playable media information. An external time code file is a file, corresponding to a piece of playable media, that carries time codes together with the corresponding text. It usually takes the form of the lyrics of audio content, or the dialogue or subtitle information of video content, for example an .LRC lyrics file or a .SUB subtitle file.
For example, a lyrics file:
[ti: fruit juice divides you half]
[ar: flower]
[by: catkin wafts]
[offset:500]
[00:08.01] editor: catkin wafts
[00:10.01] I want that that that that that
[02:24.74][02:14.73][01:24.74][00:14.76] You want that that that that that
……
In the example above, the first part contains textual attribute information such as the song title and author, and every lyric line in the later part carries a timestamp. A lyric line with several timestamps is displayed each time one of those timestamps is reached. As soon as the timestamp of the next lyric line is reached, the currently displayed line is no longer shown; in other words, the start time of the next line is the end time of the previous line.
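The rule that the next start time ends the previous line can be sketched in Python as below. The function name `parse_lrc` and the output layout are illustrative assumptions; the sketch ignores attribute tags such as `[ti:...]` and `[offset:...]` rather than applying them, and it accepts timestamps placed anywhere on a line.

```python
import re

# Matches one LRC timestamp such as [01:24.74] (minutes:seconds).
TIMESTAMP = re.compile(r"\[(\d+):(\d+(?:\.\d+)?)\]")


def parse_lrc(text):
    """Return a list of (start_seconds, end_seconds, lyric) sorted by start.

    The end time of a line is the start time of the next displayed line.
    The last line gets end_seconds = None, because the file does not say
    when it stops being displayed.
    """
    entries = []
    for line in text.splitlines():
        stamps = TIMESTAMP.findall(line)
        if not stamps:
            continue  # attribute tags, blank lines, or untimed text
        lyric = TIMESTAMP.sub("", line).strip()
        # A line with several timestamps is displayed once per timestamp.
        for minutes, seconds in stamps:
            entries.append((int(minutes) * 60 + float(seconds), lyric))
    entries.sort(key=lambda e: e[0])

    segments = []
    for i, (start, lyric) in enumerate(entries):
        end = entries[i + 1][0] if i + 1 < len(entries) else None
        segments.append((start, end, lyric))
    return segments
```

Each resulting tuple is a candidate fragment whose start and end time points can feed the cutting scheme determination step.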
For example, a subtitle file:
……
00:01:23,249 --> 00:01:27,276
Do not do evil with Extra Sensory Perception, I have known, the charles
……
The subtitle file in the example above contains the start time and end time of each subtitle, and the subtitle text on the following line is displayed during the interval given by those timestamps.
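A timing line of the kind shown in the subtitle example can be converted to seconds with a short Python sketch such as the following. The function name `parse_timing` is hypothetical, and the sketch assumes the `hours:minutes:seconds,milliseconds` layout of the example with a three-digit millisecond field; other subtitle formats would need their own pattern.

```python
import re

# hours:minutes:seconds,milliseconds --> hours:minutes:seconds,milliseconds
TIMING = re.compile(
    r"(\d+):(\d+):(\d+),(\d+)\s*-->\s*(\d+):(\d+):(\d+),(\d+)"
)


def parse_timing(line):
    """Return (start_seconds, end_seconds) for a timing line, or None."""
    match = TIMING.search(line)
    if match is None:
        return None
    h1, m1, s1, ms1, h2, m2, s2, ms2 = (int(g) for g in match.groups())
    start = h1 * 3600 + m1 * 60 + s1 + ms1 / 1000
    end = h2 * 3600 + m2 * 60 + s2 + ms2 / 1000
    return start, end
```

Unlike the lyrics format, each subtitle states its own end time, so no look-ahead to the next entry is needed.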
The two examples above are merely two commonly seen kinds of external time code file for media files. Such external time code files can also be created for other playable media files, and different time code files can be created for different needs; that is, a playable media file of any format can have a corresponding external time code file.
Referring to Fig. 1, a flowchart of a search method for playable media files according to the present invention, the method comprises the following steps:
Step 101, association step: establishing an association between an external time code file and a playable media file on the Internet;
Step 102, cutting scheme determination step: analyzing the external time code file to determine the required fragment of the external time code file, and obtaining, from the start and end time points of the determined fragment, a cutting scheme for the playable media file associated with it;
Step 103, information acquisition step: obtaining the fragment or fragment combination of the associated playable media file according to the cutting scheme.
Information acquisition generally takes one of two forms, downloading to the local machine or online playback; in the description below, downloading is generally used to stand for information acquisition.
The search method shown in Fig. 1 involves a server side and a client, and can be divided into the following two cases:
In the first implementation, the association step 101 and the cutting scheme determination step 102 are both completed at the server side; the client obtains the cutting scheme that the user needs and then completes the download according to the fragment information in the cutting scheme, obtaining the fragment or fragment combination of the required playable media file.
In the second implementation, only the association step 101 is completed at the server side, while the cutting scheme determination step 102 and the download step 103 are both completed by the client: the client obtains the required playable media file link and its associated external time code file, then performs the analysis, determines the cutting scheme, and downloads the fragment or fragment combination of the required playable media file.
Of course, in both cases, to help the user retrieve what is needed, the server side needs to set keywords or a classification index for the playable media file links and their associated external time code files, or for the cutting schemes. Because this is well known in the search field, it is not described further here. For example, the user can retrieve the required information through any browser and then trigger the client, so that the client obtains the cutting scheme the user needs, or the playable media file link and its associated external time code file.
For the first implementation, virtual cutting can be used; that is, the fragment information in the cutting scheme can be the link to the associated playable media file together with download endpoints. The client obtains the playable media file link and download endpoints that the user needs and then completes the download. With virtual cutting, the media linked to an external time code file is downloaded and played at the client only when the user downloads or plays that fragment; the actual file is not changed in any way. Because users have different needs and different combination requirements, the cutting scheme is not unique: every cutting scheme can be stored in the search database, and when the user searches, the cutting scheme matching the keyword is provided. For example, suppose the user wants to find, in the video file of a live concert, the fragments corresponding to the songs of a particular album. After virtual cutting, the download endpoints of the video fragments that match the song titles of that album can be provided to the user, so that the user obtains the live concert versions of the songs on that album. Since different albums contain different songs, and the same song may appear on different albums, the user can obtain the video file endpoints of all fragments that match the song titles of the album.
The endpoints are the start and end time points, in the associated playable media file, that correspond to the fragment of the external time code file; alternatively, the endpoints are the start and end positions, in the associated playable media file, of the data packets or bytes that correspond to that fragment. Of course, when the endpoints are start and end time points, the client must be able to determine from them the actual storage start and end positions of the corresponding data packets or bytes in the playable media file.
The start and end positions can be determined by calculating the actual storage start and end positions of the corresponding data packets or bytes in the playable media file from the start and end time points and the compression format of the file, because standard audio formats have a defined sampling frequency and storage standard. For example, when an audio file is compressed at a fixed bit rate, the position in the audio file that corresponds to an external time code can be calculated by the reverse conversion, giving a linear correspondence. Preferably, the file header information of the required playable media file is obtained first and the actual storage start and end positions are then calculated in combination with the compression format information; the file header is small but provides more detailed information, so this improves both the speed and the precision of the calculation.
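For a file encoded at a constant bit rate, the linear correspondence between time and byte position reduces to simple arithmetic, sketched below in Python. The function names and the `header_bytes` parameter are illustrative assumptions; the sketch does not align positions to frame or packet boundaries, which a real format would normally require, and it does not apply to variable-bit-rate files.

```python
def time_to_byte_offset(seconds, bitrate_bps, header_bytes=0):
    """Map a playback time to a byte position in a constant-bit-rate file.

    bitrate_bps is the bit rate in bits per second; header_bytes is the
    size of any file header that precedes the media data.
    """
    return header_bytes + int(seconds * bitrate_bps / 8)


def fragment_byte_range(start_s, end_s, bitrate_bps, header_bytes=0):
    """Return (start_byte, end_byte) for a fragment given in seconds."""
    return (
        time_to_byte_offset(start_s, bitrate_bps, header_bytes),
        time_to_byte_offset(end_s, bitrate_bps, header_bytes),
    )
```

At 128 kbit/s, for instance, one second of audio occupies 16,000 bytes, so a fragment from 10 s to 20 s maps to bytes 160,000 through 320,000 of the media data.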
In addition, the start and end positions can also be determined as follows: a corresponding player program is called, which first reads the file header information and part of the media information of the associated playable media file and then seeks according to the start and end time points to obtain byte offset information, thereby determining the actual stored start and end positions of the corresponding data packets or bytes in the playable media file. For example, Windows Media streaming technology can carry out this process.
Because the file header of an audio or video file is very small, it can be ignored in processing, or a certain offset can be added to ensure an accurate linear correspondence between the audio file and the time code. In practice, the corresponding byte information can be obtained by dedicated software or by the preprocessing of an existing media player in order to guarantee this linear correspondence.
For the first implementation described above, actual cutting can also be used. That is, the fragment information in the cutting scheme can also be links to fragments, or fragment combinations, of the associated playable media file; these links are obtained by actually cutting the associated playable media file according to the fragments of the external time code file and storing the results separately. The client obtains the link of the fragment or fragment combination that the user requires and then completes the download. In this process the server stores the corresponding links in advance, which avoids real-time computation and improves retrieval speed. However, it places higher demands on server performance and storage space and may waste some resources; it is nonetheless quite feasible for special download scenarios or for content with a high click rate. In short, those skilled in the art can use either of the two cutting schemes proposed above, or a combination of both, as required; all such variants fall within the concept of the present invention.
Actual cutting means physically cutting the playable media file with a software tool at the terminals obtained from the analysis, dividing the playable media file into a plurality of file fragments that are stored separately and given new links. The user can click the corresponding link in the retrieved cutting scheme to download the required fragment of the playable media file directly.
Preferably, the playable media file fragments obtained by actual cutting can be combined into a new playable media file. This combination is physical: the fragments are merged into one new playable media file, which is provided to the user. Alternatively, the fragments need not be combined; a playlist is simply created and provided to the media player, which plays the files in the playlist continuously so that they are treated as a single file.
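The playlist alternative can be illustrated with a short sketch. The original text does not name a playlist format; the extended M3U format is assumed here purely for illustration, and the function name and the fragment URLs are hypothetical.

```python
def build_playlist(fragment_urls):
    """Return the text of an extended M3U playlist that lists the
    fragment links in playback order (M3U is an assumed format)."""
    if not fragment_urls:
        raise ValueError("at least one fragment link is required")
    lines = ["#EXTM3U"]  # header line of the extended M3U format
    lines.extend(url.strip() for url in fragment_urls)
    return "\n".join(lines) + "\n"


# Hypothetical links to two fragments stored after actual cutting
playlist_text = build_playlist([
    "http://example.com/fragments/song_part1.wma",
    "http://example.com/fragments/song_part2.wma",
])
```

A player that accepts such a playlist plays the entries back to back, which gives the user the impression of a single file without any physical merging.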
For the second implementation described above, the client obtains from the server the link of the playable media file that the user requires and its associated external time code file, and then performs the analysis, determines the cutting scheme, and downloads the fragment or fragment combination of the required playable media file. The fragment information that the client needs for the download should include the link of the associated playable media file and the download terminal. Because the external time code file contains only the required start and end time points, the client must be able to determine from them the actual stored start and end positions of the corresponding data packets or bytes in the playable media file before it can complete the download. The relevant techniques have been described above and can be referred to.
The association step 101 in the method shown in Figure 1 is described in detail below.
The association can be established in various feasible ways. For example, through cooperation with other companies, the associated external time code file can be published together with the playable media file when it is released, which yields a fairly accurate association. Alternatively, associated external time code files can be produced specifically for existing playable media files, which also yields an accurate association. The above describes establishing the association database manually; the association can also be performed automatically by computer, for example by associating playable media files with external time code files through text clustering.
Generally, playable media files obtained from the Internet, such as mp3 and Windows Media files, store text attribute information related to the file at fixed positions within the file itself, for example author, composer, singer, album, song title, and URL. The text attribute information can be obtained from the context of the related web page and/or the link information of the playable media file; preferably, it can also be extracted directly from the playable media file, because the text attribute information inside the file is generally more accurate than the web page context or link information. Text clustering means comparing the file information in the database and marking playable media files whose text attribute information is identical or similar as belonging to the same class.
An external time code file is itself in text format or stores various text information, so certain text attribute information, such as song title, author, and album name, can be read directly from the file; external time code files whose text attribute information is identical or similar are marked as the same class. Text clustering is performed on the external time code files and the playable media files together, so that the two are associated.
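A minimal sketch of this association is given below. It groups files by a normalized (title, artist) key, which only covers the "identical" case; handling "similar" attributes would need a fuzzy comparison that is not shown. The function names, the dictionary fields, and the example URLs are assumptions made for illustration.

```python
from collections import defaultdict


def normalize(value):
    """Lower-case a text attribute and drop all whitespace before comparison."""
    return "".join(value.lower().split())


def associate_by_attributes(timecode_files, media_files):
    """Put external time code files and playable media files that share the
    same normalized (title, artist) key into the same class. Only classes
    that contain both kinds of file are returned."""
    clusters = defaultdict(lambda: {"timecode": [], "media": []})
    for item in timecode_files:
        key = (normalize(item["title"]), normalize(item["artist"]))
        clusters[key]["timecode"].append(item["url"])
    for item in media_files:
        key = (normalize(item["title"]), normalize(item["artist"]))
        clusters[key]["media"].append(item["url"])
    return {key: group for key, group in clusters.items()
            if group["timecode"] and group["media"]}


# Hypothetical input records
lyrics = [
    {"title": "The Silk Road", "artist": "Liang Jingru", "url": "http://example.com/a.lrc"},
    {"title": "Another Song", "artist": "Someone", "url": "http://example.com/b.lrc"},
]
audio = [
    {"title": "the silk road", "artist": "Liang  Jingru", "url": "http://example.com/a.wma"},
]
associations = associate_by_attributes(lyrics, audio)
```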
In practice, the association method above usually aggregates several external time code files into one class, and the association between playable media files and external time code files is relatively disordered, which affects the accuracy of the resulting cutting scheme. In this case, the association method can further include selecting, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
Specifically, the external time code files of the same class are analyzed together to identify the correct text attribute information among them. One file is then selected as the cutting sample, or a new external time code file is assembled from the text attribute information judged to be correct and used as the cutting sample.
The present invention can judge whether text attribute information is correct according to various methods or rules. For example, under a majority rule, the text attribute information that occurs most frequently (that is, with the highest proportion) among the external time code files of the same class is considered correct. Alternatively, the text attribute information of each external time code file is analyzed; if it fully matches a standard information library (a library built with manual intervention and regarded as highly accurate), it is given a higher weight, and when the weight exceeds a certain threshold the information is judged to be correct. A single external time code file that contains the most correct information can then be used as the cutting sample. Of course, the present invention places no restriction on how the cutting sample is chosen; the above are only several preferred approaches.
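The majority rule and the choice of a cutting sample can be sketched as follows. The field names ("title", "artist", "album") and the record layout are assumptions for illustration; the weighted variant based on a standard information library is not shown.

```python
from collections import Counter


def majority_attributes(files, fields=("title", "artist", "album")):
    """For each attribute, treat the most frequent value within the class
    as the correct one (majority rule)."""
    correct = {}
    for name in fields:
        values = [f[name] for f in files if f.get(name)]
        if values:
            correct[name] = Counter(values).most_common(1)[0][0]
    return correct


def pick_cutting_sample(files, fields=("title", "artist", "album")):
    """Return the file that agrees with the majority on the most attributes."""
    correct = majority_attributes(files, fields)
    return max(files, key=lambda f: sum(f.get(k) == v for k, v in correct.items()))


# Hypothetical class of three external time code files for the same song
same_class = [
    {"name": "a.lrc", "title": "The Silk Road", "artist": "Liang Jingru"},
    {"name": "b.lrc", "title": "The Silk Road", "artist": "Liang Jing Ru"},
    {"name": "c.lrc", "title": "Silk Road", "artist": "Liang Jingru"},
]
sample = pick_cutting_sample(same_class)
```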
Because the above method only performs text clustering on the playable media files, a class may still contain playable media files that differ in detail. So that all associated playable media files can be cut at the start and end time points of the typical external time code file, the present invention can additionally apply a rule to filter the playable media files on top of the association method above. The filtering rule can be set according to various search needs, and the present invention places no restriction on it. Alternatively, the present invention can restrict the sources of the playable media files used for searching, for example to files taken only from specific websites or provided officially, which ensures a higher consistency among the playable media files.
The analysis of the external time code file in the method shown in Figure 1 is described in detail below.
Because the external time code file contains text information such as song lyrics or video subtitles, it can be regarded as a description of the corresponding playable media file. By analyzing this information, the general content of the audio or video at a given moment can be determined, which makes fragment search over the associated playable media file possible. In the present invention, the methods for analyzing the external time code file can include text analysis, semantic analysis, language model analysis, or word segmentation analysis.
Text analysis analyzes the text information of the external time code file to obtain its key elements. In a practical application, for example when the climax fragment of an audio file is to be found, the analysis includes the following steps. First, identification parameters, including the number of repetitions, are determined for the external time code file by text analysis; the identification parameters can also include the repeated passage, the duration of the repeated passage, and so on, and can be set as required. Then, a weight is assigned to each fragment of the external time code file according to the identification parameters, and the climax fragments are ranked by weight. For example, each fragment is scored by its number of repetitions and the duration of the repeated passage, the scores are sorted, and the fragment with the highest score is the preferred climax fragment. The present invention also supports audio files that contain two or more different climax fragments. Of course, the key fragments of various playable media files can also be analyzed as required with this kind of text analysis, for example identifying the different chapters of an audio or video lecture from the pauses between them.
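The repetition-based scoring can be sketched as below. The scoring formula (a weighted sum of repetition count and total duration) and the weights are assumptions chosen for illustration; the original text only states that fragments are scored by repetition count and duration and then sorted.

```python
from collections import defaultdict


def rank_climax_fragments(lines, w_repeat=1.0, w_duration=0.1):
    """Rank repeated lyric passages as climax candidates.

    lines: list of (start_ms, end_ms, text) tuples taken from the
    external time code file. Returns (score, text, spans) tuples,
    highest score first. Passages that occur only once are skipped."""
    occurrences = defaultdict(list)
    for start_ms, end_ms, text in lines:
        key = text.strip().lower()
        if key:
            occurrences[key].append((start_ms, end_ms))
    ranked = []
    for text, spans in occurrences.items():
        if len(spans) < 2:
            continue
        duration_s = sum(end - start for start, end in spans) / 1000.0
        score = w_repeat * len(spans) + w_duration * duration_s
        ranked.append((score, text, spans))
    ranked.sort(key=lambda item: item[0], reverse=True)
    return ranked


# Hypothetical timed lines: a verse, a chorus sung three times, a bridge sung twice
timed_lines = [
    (0, 5000, "verse line"),
    (5000, 9000, "chorus line"),
    (9000, 12000, "bridge line"),
    (12000, 16000, "chorus line"),
    (16000, 19000, "bridge line"),
    (19000, 23000, "chorus line"),
]
climax_candidates = rank_climax_fragments(timed_lines)
```

The time spans attached to the top-ranked passage are the start and end time points from which the cutting scheme for the associated audio file can be derived.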
Semantic analysis means analyzing the semantic relationships between content words in the text structure of the external time code file to judge the approximate meaning of one or more passages of text, and assigning semantic keywords to each passage. A semantic keyword can be a word that appears in the passage or a word closest in meaning to the passage; the keywords correspond to the search keywords of users.
Word segmentation analysis means segmenting the text information of the external time code file into words and assigning corresponding keywords, which correspond to the search keywords of users; it can also be combined with semantic analysis to form related keywords.
Language model analysis is similar to semantic analysis and is not described again here.
In short, the analysis above can determine a plurality of required fragments of the external time code file and can also assign corresponding keywords to each fragment. Determining the fragments and assigning the keywords can be realized through multiple analysis steps. Of course, when the analysis is performed by the client, keywords need not be assigned.
The analysis above can also determine multiple required fragments for a single external time code file; that is, there can be multiple cutting schemes for each associated playable media file. For example, one scheme provides the combination of fragments about a certain product in a playable media file, and another provides the combination of fragments about a certain person. Multiple keywords, or keywords of multiple kinds, can also be assigned to each fragment, and fragment classification information can be regarded as one kind of keyword; that is, each fragment can have multiple search conditions, for example "apple" and "number". The classification can be set as required, for example music climax passage, agriculture, industry, or people.
The analysis can also include, as required, repeated analysis of the external time code file, the arrangement of search identifiers such as keywords, and so on; the term "analysis" here should be interpreted broadly.
Fig. 2 shows an embodiment of the present invention that uses the search method shown in Figure 1: a flowchart for searching for the climax portion of an audio file after a user query request is received. It includes the following steps:
Step 201: an association is established between lyrics files and audio files on the Internet. In this embodiment the association is established at the server by the following substeps:
First substep: the song title and singer name in the lyrics file are obtained, and text clustering is performed on the lyrics files according to the song title and singer name;
For example, the song title and singer name obtained from a lyrics file are "the Silk Road" and "Liang Jingru", and a typical result obtained by text clustering is as follows:
The Silk Road | Liang Jingru |::: the Silk Road-towards the road of liking | http://ww.jpwy.net/gc/down.php?id=2317&ps=8&ti=%CB%BF%C2%B7&key=8c2cc747756b0ac7486c52bdf7640a30 |
[ti: the Silk Road]
[ar: Liang Jingru]
[by: Sun Xu]
[offset:500]
[00:03.75] Liang Jingru--the Silk Road
[00:08.94] editor: Sun Xu (www.jpwy.net)
[00:12.00]
[00:17.20] is if roam is your talent
[00:22.50] you must be my the most beautiful chasing so
… … …
[00:56.80][02:37.50] I become lake above the Tianshan Mountains with tears stream
[01:00.50][02:41.00] can encamp when making you tired and park
… … …
[01:42.00][03:22.00][04:35.00] journey that who is who
[01:45.00][03:25.30] I if you remember
… … …
[04:22.00] I never cried (no~oh~) after fell in love with you
[04:31.20] I never lie in (I don't care)
[04:38.00] I need only
[04:48.80] You remember
Second substep: the song title and singer name (text attribute information) of audio files on the Internet are obtained, and text clustering is performed on the audio files according to the song title and singer name;
Third substep 2013: lyrics files and audio files of the same class are associated. In this embodiment, an audio file and its corresponding lyrics file can be linked through text information such as song title, singer, and album.
For example, the song title and singer name obtained from an audio file are "the Silk Road" and "Liang Jingru" respectively, and one audio file result in that class obtained by text clustering is:
The Silk Road | Liang Jingru | the Silk Road | http://yy.9yy.com:8080/Song/ahn/q/qq/9/7.wma
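The lyrics file in the example above uses bracketed tags: identifier tags such as [ti:], [ar:] and [offset:], and time tags of the form [mm:ss.xx], where one lyric line can carry several time tags when it is repeated. A minimal parsing sketch follows; the function name, the regular expressions, and the sample text are assumptions for illustration, and the offset value is only recorded, not applied.

```python
import re

TIME_TAG = re.compile(r"\[(\d+):(\d{2})(?:\.(\d{1,3}))?\]")
ID_TAG = re.compile(r"^\[([A-Za-z]+):(.*)\]$")


def parse_timecode_text(text):
    """Parse lyrics-style time code text into (metadata, entries).

    entries is a list of (time_ms, lyric) tuples sorted by time; a line
    with several leading time tags yields one entry per tag."""
    meta, entries = {}, []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        id_match = ID_TAG.match(line)
        if id_match:
            meta[id_match.group(1).lower()] = id_match.group(2).strip()
            continue
        stamps, pos = [], 0
        while True:
            tag = TIME_TAG.match(line, pos)
            if not tag:
                break
            minutes, seconds, frac = tag.groups()
            millis = int((frac or "0").ljust(3, "0"))  # "75" -> 750 ms
            stamps.append(int(minutes) * 60000 + int(seconds) * 1000 + millis)
            pos = tag.end()
            while pos < len(line) and line[pos].isspace():
                pos += 1  # allow whitespace between consecutive tags
        lyric = line[pos:].strip()
        entries.extend((ms, lyric) for ms in stamps)
    entries.sort(key=lambda entry: entry[0])
    return meta, entries


# Hypothetical sample in the same tag style as the example above
sample_text = (
    "[ti:Silk Road]\n[ar:Liang Jingru]\n[offset:500]\n"
    "[00:03.75]intro\n[00:56.80][02:37.50]chorus line\n[00:12.00]\n"
)
metadata, timed_entries = parse_timecode_text(sample_text)
```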
In real-world applications, the lyrics files found by searching are often of fairly high quality, so one lyrics file can be picked at random as the typical lyrics file. In this embodiment it is also possible, and preferred, to select the typical lyrics file by some analysis strategy. For example, the degree of repetition among the lyrics from each source is examined, and if more than a certain threshold percentage of the lyrics are the same, any one of them can be used as the typical lyrics file. Alternatively, the ratings given to each lyrics file by users on the web page can be accumulated, and the file with the highest score becomes the typical lyrics file. Alternatively, the song length is compared with the lyric time points, and the best-matching file becomes the typical lyrics file.
For example, after the text clustering above, the lyrics files and audio files associated by clustering can be stored in a database and assigned a corresponding group number or association identifier.
At this point, when a user query request is received, search result information can be returned, for example as shown in Figure 3. When the user clicks the lyrics icon in this interface, the lyrics are displayed, as shown in Figure 4.
Step 202: the typical lyrics file is analyzed to determine the climax fragment in it, and the start and end cutting time points of the associated audio file are obtained from the determined fragment.
This step includes the following substeps:
First substep: identification parameters are determined for the extracted lyrics file by text analysis, for example the repeated passage, the number of repetitions, and the duration of the repeated passage;
Second substep: a weight is assigned to each fragment of the lyrics file according to the identification parameters; the weight determines the priority with which the fragment is regarded as a climax fragment. A song can contain several different climax fragments.
Third substep: the climax point of the associated audio file is obtained from the climax fragment determined in the lyrics file.
Step 202 can be performed at the server or at the client, as described in detail in connection with Fig. 1, and is not repeated here. Of course, the server can also directly perform actual cutting on the associated audio file according to retrieval frequency (for example, for the TOP100 most retrieved audio files) and then return to the client the link under which the result is stored in the server database.
Step 203: when the user clicks the ringtone DIY icon, the server automatically extracts from the database the link of the audio file and the corresponding lyrics file and analyzes them to obtain the climax point. The server keeps an identifier for each song, so when the user clicks ringtone DIY the server knows which song is requested and retrieves the lyrics of that song. From the lyrics content, by segmenting the time fragments, the time points of the climax fragment are obtained and returned to the client together with the link.
Step 204: the client completes the download or plays the audio first. After the client receives the information from the server, it can automatically mark the position of the climax fragment on the player and download the song in segments from the specified URL. The player can be any of the various players in the prior art. Of course, the user can also drag the player's progress bar to listen and select a fragment from another time period to download as needed.
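The original text does not specify the transfer mechanism used for segmented download. One possible realization, assuming the file is served over HTTP by a server that supports byte-range requests, is to ask only for the bytes between the start and end positions. The sketch below builds such a request with the Python standard library; the function name and the URL are hypothetical, and no network access is performed.

```python
import urllib.request


def build_range_request(url, start_byte, end_byte):
    """Build an HTTP request for the bytes start_byte..end_byte (inclusive).

    Assumption: the server honors the Range header and answers with
    status 206 (Partial Content); otherwise it may send the whole file."""
    if start_byte < 0 or end_byte < start_byte:
        raise ValueError("invalid byte range")
    headers = {"Range": "bytes=%d-%d" % (start_byte, end_byte)}
    return urllib.request.Request(url, headers=headers)


# Hypothetical byte terminal for a climax fragment
request = build_range_request("http://example.com/song.wma", 912896, 1612095)
# Passing `request` to urllib.request.urlopen would perform the download.
```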
Step 205: the client stores the fragment or fragment combination of the associated audio file and encapsulates it as a new playable media file. The format after encapsulation can be the format of the original media file, or the file can be transcoded to another media file format.
For example, for the streaming media files of Microsoft's existing ASF family (including wma audio and wmv video), the header information of the original file can be read in advance, and then a jump is made to the specified time point to obtain the file position information, which is stored. On storage, the original header must be encapsulated; that is, the download is only considered complete once the intercepted file has been downloaded locally and a header has been encapsulated for it. Other streaming media can be handled by analogy: the format of the file can be determined during the spider search, either by reading the header information in advance or by examining the file extension, and the cut file is then encapsulated according to the encapsulation strategy of that format.
Preferably, the present invention can also merge the climax portions of several downloaded audio files into one complete audio file and provide it to the user. Alternatively, no merging is done; a playlist is simply created and provided to the player, which plays the files continuously according to the playlist.
Figs. 5a-5b are block diagrams of embodiments of the search system for playable media files of the present invention. The system includes the following units:
Association unit 501, used to establish an association between external time code files and playable media files on the Internet;
Cutting scheme determining unit 502, used to analyze the external time code file and determine the required fragments of the external time code file according to the analysis result, and to obtain, from the determined fragments, the cutting scheme of the playable media file associated with it;
Information acquisition unit 503 is used for obtaining according to described cutting scheme the fragment or the fragment combination of the described playable media file that is associated.
Fig. 5a is the system block diagram of embodiment 1 and embodiment 2.
In embodiment 1, the association unit 501 and the cutting scheme determining unit 502 are located at the server, and the cutting scheme includes the link of the associated playable media file and the download terminal. The download unit 503 (that is, the information acquisition unit 503) is located in the client, which connects to the server and obtains the link and download terminal of the playable media file that the user requires. The terminal is the start and end time points, in the associated playable media file, that correspond to the external time code file fragment; alternatively, the terminal is the start and end positions of the data packets or bytes, in the associated playable media file, that correspond to that fragment.
In embodiment 2, the association unit 501 and the cutting scheme determining unit 502 are located at the server. The cutting scheme includes links to fragments or fragment combinations of the associated playable media file; these links are obtained by actually cutting the associated playable media file according to the fragments of the external time code file and storing the results separately. The download unit 503 is located in the client, which connects to the server, obtains the link of the fragment or fragment combination that the user requires, and then completes the download.
Fig. 5b is the system block diagram of embodiment 3.
In embodiment 3, the association unit 501 is located at the server, and the cutting scheme determining unit 502 and the download unit 503 are located in the client. The client connects to the server, obtains the link of the playable media file that the user requires and its associated external time code file, and then performs the analysis, determines the cutting scheme, and downloads the fragment or fragment combination of the required playable media file.
In the three embodiments above, the start and end positions can be determined as follows: the actual stored start and end positions of the corresponding data packets or bytes in the playable media file are calculated from the start and end time points and the compression format of the playable media file; for greater accuracy, certain file header information can also be used in the calculation. The start and end positions can also be determined as follows: a corresponding player program is called, which first reads the file header information and part of the media information of the associated playable media file and then seeks according to the start and end time points to obtain byte offset information, thereby determining the actual stored start and end positions of the corresponding data packets or bytes in the playable media file. The corresponding player program is any program that can play the media file, for example the player for that media file format, a player plug-in, or another application capable of playback.
In the three embodiments above, the association unit can associate external time code files with playable media files by text clustering. For example, the association unit can specifically include:
External time code file text cluster module, used to obtain the text attribute information of the external time code files and to perform text clustering on the external time code files according to that text attribute information;
Playable media file text cluster module, used to obtain the text attribute information of the playable media files and to perform text clustering on the playable media files according to that text attribute information;
Association module, used to associate external time code files and playable media files of the same class.
Preferably, the association unit can also include an external time code file consolidation module, used to select, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
When the climax fragment of an audio file is to be searched for, the cutting scheme determining unit can include the following modules for analyzing the external time code file:
Text analysis module, used to determine, by text analysis of the external time code file, identification parameters that include the number of repetitions;
Weight assignment module, used to assign a weight to each fragment of the external time code file according to the identification parameters; the weight determines the priority with which the fragment is regarded as a climax fragment.
Fig. 6 is a block diagram of a method of establishing a search database for playable media files according to the present invention. Establishing the search database at the server is an important step of the present invention. For the embodiments in which the cutting scheme determining step is performed at the server, the search database is established by the following steps:
Association step 601: an association is established between external time code files and playable media files on the Internet;
Cutting scheme determining step 602: the external time code file is analyzed, and the required fragments of the external time code file are determined according to the analysis result; from the determined fragments, the cutting scheme for the associated playable media file is obtained;
Storing step 603: the cutting scheme is stored in the search database, and a search identifier is set for it. By means of the search identifier, the required cutting scheme can be retrieved from the search database; the search identifier can be a keyword, a category, and so on.
Here, the cutting scheme can be a virtual cutting, comprising the link of the associated playable media file and the download terminal. The cutting scheme can also be an actual cutting, comprising links to the playable media file fragments or fragment combinations obtained by actually cutting the associated playable media file according to the fragments of the external time code file.
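The storing and retrieval of cutting schemes by search identifier can be sketched with a small in-memory index. The class names, field names, keywords, and URL below are hypothetical; a real search database would be persistent, and only the virtual cutting case (link plus time terminal) is modelled.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CuttingScheme:
    """A virtual cutting: the media link plus a start/end time terminal."""
    media_url: str
    start_ms: int
    end_ms: int


class SchemeIndex:
    """Stores cutting schemes under search identifiers (keywords or categories)."""

    def __init__(self):
        self._by_identifier = defaultdict(list)

    def store(self, scheme, identifiers):
        for identifier in identifiers:
            self._by_identifier[identifier.strip().lower()].append(scheme)

    def search(self, identifier):
        return list(self._by_identifier.get(identifier.strip().lower(), []))


index = SchemeIndex()
chorus = CuttingScheme("http://example.com/song.wma", 56800, 100500)
index.store(chorus, ["Silk Road", "music climax"])
hits = index.search("silk road")
```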
Fig. 7 is a structural block diagram of a search server for playable media files according to the present invention. The server includes:
First memory 701, used to store the association information between external time code files and playable media files on the Internet and the first cutting scheme for the playable media files; the first cutting scheme includes the link of the associated playable media file and the download terminal, the download terminal being obtained from the external time code file;
Requestor 702 is used for retrieving corresponding cutting scheme according to user's querying condition.
Interface unit 703 is used to receive user's query requests and returns corresponding cutting scheme.
The search server shown in Figure 7 performs the association step and the cutting scheme determining step of the search process for playable media files; the actual information obtaining step still has to be performed by the client. The retrievable information stored in the search server shown in Figure 7, such as the cutting schemes, can be obtained by the method of establishing a search database shown in Figure 6, and can of course also be obtained in other ways. In short, the search server shown in Figure 7 only needs to store this retrievable information; how the retrievable information such as the cutting schemes was obtained is not restricted.
Depending on the kind of terminal, the connection between the search server shown in Figure 7 and the client, and the composition of their units, can fall into the following four cases:
(1) When the terminal is the start and end positions of the data packets or bytes, in the associated playable media file, that correspond to the external time code file fragment, the client can connect directly to the server and complete the information acquisition.
(2) When the terminal is the start and end time points, in the associated playable media file, that correspond to the external time code file fragment, the search server also needs to include a start and end position calculation unit, used to calculate the actual stored start and end positions of the corresponding data packets or bytes in the playable media file from the start and end time points and the compression format of the playable media file; the client can then connect directly to the server and complete the information acquisition. Preferably, the start and end position calculation unit can first obtain the file header information of the required playable media file and then combine it with the compression format information to calculate the actual stored start and end positions. Because the file header is very small yet provides more detailed information, this improves both the speed and the precision of the calculation.
(3) When the terminal is the start and end time points, in the associated playable media file, that correspond to the external time code file fragment, the search server also includes a start and end position determining unit, used to call a corresponding player program, which first reads the file header information and part of the media information of the associated playable media file and then seeks according to the start and end time points to obtain byte offset information, thereby determining the actual stored start and end positions of the corresponding data packets or bytes in the playable media file; the client can then connect directly to the server and complete the information acquisition.
(4) When the terminal is the start and end time points, in the associated playable media file, that correspond to the external time code file fragment, the search server provides the start and end time point information to the client, the client converts it into start and end positions, and the information acquisition is then completed according to the positions obtained. In the first three cases an existing information acquisition client is sufficient; in the fourth case the client has to be extended with a component that performs the conversion into start and end positions.
To satisfy certain special search requirements, the search server shown in Fig. 7 may further comprise a second memory 704 for storing a second cutting scheme. The second cutting scheme results from an actual cutting process on the search server: it comprises links to fragments, or fragment combinations, of the associated playable media file, where each link is obtained by actually cutting the associated playable media file according to the fragments of the external time code and storing the result separately. With both the first cutting scheme and the second cutting scheme stored on the search server, different cutting schemes can be applied to different audio files. For example, actual cutting can be used for the audio files in the TOP100 of fragment retrieval and virtual cutting for all other audio files, which better meets users' search requirements and improves retrieval efficiency.
Because a server generally has better computing performance than a user's client computer, the search server shown in Fig. 7 may further comprise a download unit 705, which first downloads the required data to the server according to the user's query request and then sends it to the client. This is particularly suitable when the returned data consists of multiple fragments. In that case the server can combine the obtained fragments into a single file and send that file to the client, or it can generate a playlist file and send it to the client together with the obtained fragments. The playlist approach is particularly suitable for online playing: the user perceives the linked result as a single file, which improves the user experience.
Fig. 8 is a structural block diagram of an information acquisition client for playable media files according to the present invention, comprising:
A cutting scheme acquiring unit 801, for receiving a cutting scheme for a playable media file, the cutting scheme comprising a link to the associated playable media file and download terminals, the terminals being the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
A start-stop location point determining unit 802, for determining, according to the start and end time points, the actual storage start and stop location points of the corresponding packets or bytes in the playable media file;
An information acquisition unit 803, for completing information acquisition. The information may be downloaded to local storage for repeated use or played online.
The start-stop location points may be determined by calculating the actual storage start and stop location points of the corresponding packets or bytes in the playable media file from the start and end time points and the compressed format of the playable media file. Preferably, for accuracy, certain file header information is also used in the calculation.
Alternatively, the start-stop location points may be determined by calling the corresponding playing program, first reading the file header information and part of the media information of the associated playable media file, then seeking to the start and end time points to obtain byte offset information, and thereby determining the actual storage start and stop location points of the corresponding packets or bytes in the playable media file.
When the client shown in Fig. 8 is used to acquire fragment information of a playable media file, the corresponding search server only needs to return a cutting scheme containing the start and end time point information for the required playable media file. The client shown in Fig. 8 can then determine the required start and stop location points through its start-stop location point determining unit and complete information acquisition.
When the search server can provide only the association information between the required audio file and the external time code file, the client shown in Fig. 8 can analyze the external time code file, automatically or manually, to obtain the required fragments and then the required cutting scheme. If the client is to perform this analysis automatically, it further comprises a cutting scheme determining unit 804, for analyzing the received external time code file and determining the required fragments of the external time code file according to the analysis result, and for obtaining, according to the determined fragments, the cutting scheme for the associated playable media file.
For example, when the climax fragment of an audio file is to be searched for, the cutting scheme determining unit specifically comprises: a text analysis module, for determining, by text analysis of the external time code file, identification parameters that include the number of repetitions; a weight assigning module, for assigning a weight value to each fragment of the external time code file according to the identification parameters, the magnitude of the weight value determining the priority with which the fragment is treated as a climax fragment; and a cutting scheme determination module, for obtaining the cutting scheme for the associated playable media file according to the determined fragments of the external time code file. Depending on the specific analysis requirement, the cutting scheme determining unit may comprise different submodules.
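The repetition-based weighting described above can be sketched as follows. The sketch assumes, purely for illustration, that the external time code file uses LRC-style lines of the form `[mm:ss.xx]text`, and it uses the repetition count of a line's text as its only identification parameter and weight. The function names and the weighting rule are hypothetical; the specification does not prescribe a particular file format or formula.

```python
import re
from collections import Counter

# Assumed line format: "[mm:ss.xx]text" (LRC-style; an assumption, not from the source)
LINE_RE = re.compile(r"\[(\d+):(\d+(?:\.\d+)?)\](.*)")


def parse_time_code(text):
    """Return a time-sorted list of (start_seconds, line_text) tuples."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            minutes, seconds, content = match.groups()
            entries.append((int(minutes) * 60 + float(seconds), content.strip()))
    return sorted(entries)


def rank_climax_candidates(entries):
    """Weight each non-empty line by how often its text repeats in the file.

    Returns (weight, start, end, text) tuples, highest weight first; `end` is
    the start time of the next line, or None for the last line.
    """
    counts = Counter(content for _, content in entries if content)
    ranked = []
    for i, (start, content) in enumerate(entries):
        if not content:
            continue  # empty lines only mark timing, they carry no text
        end = entries[i + 1][0] if i + 1 < len(entries) else None
        ranked.append((counts[content], start, end, content))
    # Sort by descending weight, then by ascending start time
    return sorted(ranked, key=lambda item: (-item[0], item[1]))
```

The start and end times of the top-ranked entries can then serve as the start and end time points of a cutting scheme for the associated media file.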
So that the acquired media fragments can be reused, the information acquisition client shown in Fig. 8 further comprises a wrapper 805, for encapsulating the downloaded playable media file fragments as new playable media files in the format of the original file.
To avoid the trouble of playing or storing multiple media fragments, the information acquisition client shown in Fig. 8 further comprises an integrator 806, for integrating multiple downloaded playable media file fragments. How to integrate multiple fragments, and how to transcode between different media formats, is well known to those skilled in the art and is not described in detail here.
Similarly, for simplicity, the information acquisition client may also comprise a playlist generation unit 807, for building a playlist according to which a corresponding player can play the fragments of multiple playable media files continuously. That is, the information acquisition client only needs to build a virtual combination such as a playlist, and does not need to perform the actual combination carried out by the integrator 806.
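A virtual combination of this kind can be sketched as a small playlist writer. The example below emits an extended M3U playlist, which is one common playlist format; the choice of M3U, the function name, and the fragment tuple layout are assumptions made for illustration, since the specification does not name a playlist format.

```python
def build_m3u(fragments):
    """Build extended M3U playlist text from (path, duration_seconds, title) tuples.

    The fragments are listed in playing order; no media data is merged.
    """
    lines = ["#EXTM3U"]
    for path, duration_s, title in fragments:
        # #EXTINF carries the duration in whole seconds and a display title
        lines.append(f"#EXTINF:{int(round(duration_s))},{title}")
        lines.append(path)
    return "\n".join(lines) + "\n"
```

A player that understands M3U would then play the listed fragment files back to back, which gives the user the impression of a single file without any re-encoding.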
The information acquisition client shown in Fig. 8 may use existing resumable download technology, or may acquire data in a peer-to-peer (P2P) manner; the present invention does not limit the specific download process.
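Where the media file is served over HTTP and the server supports range requests, the start and stop location points can be turned into a partial download request, as in the sketch below. The use of HTTP and the function name are assumptions for illustration only; the specification leaves the download mechanism open. The request object is only constructed here, not sent.

```python
from urllib.request import Request


def build_fragment_request(url: str, start_byte: int, stop_byte: int) -> Request:
    """Build an HTTP request for bytes [start_byte, stop_byte) of a media file."""
    if start_byte < 0 or stop_byte <= start_byte:
        raise ValueError("require 0 <= start_byte < stop_byte")
    # HTTP byte ranges are inclusive at both ends, hence stop_byte - 1
    return Request(url, headers={"Range": f"bytes={start_byte}-{stop_byte - 1}"})
```

Passing the returned object to `urllib.request.urlopen` would perform the download; a server that honors the range answers with status 206 and only the requested bytes.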
For details not covered in the above description of Fig. 5 to Fig. 8, refer to the relevant parts of the description of Fig. 1 to Fig. 2 earlier in this specification.
The foregoing describes in detail the searching method and system for playable media files on the Internet, the method for building a search database for playable media files, and the search server and information acquisition client for playable media files provided by the present invention. Specific examples have been used herein to explain the principle and embodiments of the present invention, and the description of the above embodiments is intended only to help in understanding the method of the present invention and its core idea. At the same time, a person of ordinary skill in the art may, following the idea of the present invention, make changes to the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (34)

1. A searching method for playable media files, characterized in that it comprises the following steps:
An association step: establishing an association between an external time code file and a playable media file on the Internet;
A cutting scheme determining step: analyzing the external time code file and determining the required fragments of the external time code file; and obtaining, according to the start and end time points of the determined fragments, a cutting scheme for the playable media file associated with it;
An information obtaining step: obtaining fragments or fragment combinations of the associated playable media file according to the cutting scheme.
2. The method according to claim 1, characterized in that:
The association step and the cutting scheme determining step are both performed at the server end, and the cutting scheme comprises a link to the associated playable media file and information acquisition terminals;
The client obtains the link to the playable media file required by the user and the information acquisition terminals, and then completes information acquisition.
3. The method according to claim 2, characterized in that:
The terminals are the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
Or, the terminals are the start and stop location points of the packets or bytes, in the associated playable media file, that correspond to the external time code file fragment.
4. The method according to claim 1, characterized in that:
The association step and the cutting scheme determining step are both performed at the server end, the cutting scheme comprises links to fragments or fragment combinations of the associated playable media file, and each link is obtained by actually cutting the associated playable media file according to the fragments of the external time code and storing the result separately;
The client obtains the link to the fragment or fragment combination required by the user, and then completes information acquisition.
5. The method according to claim 1, characterized in that:
The cutting scheme determining step and the information obtaining step are both performed at the client;
And before the cutting scheme determining step, the method further comprises: the client obtaining the link to the playable media file required by the user and the external time code file associated with it.
6. The method according to claim 1, characterized in that the association step associates playable media files with external time code files by means of text clustering.
7. The method according to claim 6, characterized in that the association step further comprises:
Selecting, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
8. The method according to claim 1, characterized in that the analysis is text analysis, semantic analysis, language model analysis, or word segmentation analysis of the external time code file.
9. The method according to claim 1, characterized in that, when the climax fragment of an audio file is to be searched for, the analysis is performed by the following steps:
Determining, by text analysis of the external time code file, identification parameters that include the number of repetitions;
Assigning a weight value to each fragment of the external time code file according to the identification parameters, the magnitude of the weight value determining the priority with which the fragment is treated as a climax fragment.
10. The method according to claim 1, characterized in that the obtaining is downloading to local storage or online playing.
11. A search system for playable media files, characterized in that it comprises:
An association unit, for establishing an association between an external time code file and a playable media file on the Internet;
A cutting scheme determining unit, for analyzing the external time code file and determining the required fragments of the external time code file; and for obtaining, according to the determined fragments of the external time code file, a cutting scheme for the playable media file associated with it;
An information acquisition unit, for obtaining fragments or fragment combinations of the associated playable media file according to the cutting scheme.
12. The system according to claim 11, characterized in that:
The association unit and the cutting scheme determining unit are located at the server end, and the cutting scheme comprises a link to the associated playable media file and information acquisition terminals;
The information acquisition unit is located in the client, and the client connects to the server end to obtain the link to the playable media file required by the user and the information acquisition terminals.
13. The system according to claim 12, characterized in that:
The terminals are the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
Or, the terminals are the start and stop location points of the packets or bytes, in the associated playable media file, that correspond to the external time code file fragment.
14. The system according to claim 11, characterized in that:
The association unit and the cutting scheme determining unit are located at the server end; the cutting scheme comprises links to fragments or fragment combinations of the associated playable media file, and each link is obtained by actually cutting the associated playable media file according to the fragments of the external time code and storing the result separately;
The information acquisition unit is located in the client, the client connects to the server end, obtains the link to the fragment or fragment combination required by the user, and then completes information acquisition.
15. The system according to claim 11, characterized in that:
The association unit is located at the server end, the cutting scheme determining unit and the information acquisition unit are located in the client, and the client connects to the server end to obtain the link to the playable media file required by the user and the external time code file associated with it.
16. The system according to claim 11, characterized in that the association unit associates external time code files with playable media files by means of text clustering.
17. The device according to claim 16, characterized in that the association unit further comprises:
An external time code file consolidation module, for selecting, from the external time code files of the same class, one typical external time code file as the basis for subsequent analysis.
18. A method for building a search database for playable media files, characterized in that it comprises the following steps:
An association step: establishing an association between an external time code file and a playable media file on the Internet;
A cutting scheme determining step: analyzing the external time code file and determining the required fragments of the external time code file; and obtaining, according to the determined fragments of the external time code file, a cutting scheme for the playable media file associated with it;
A storing step: storing the cutting scheme in the search database, the cutting scheme being provided with a search identifier.
19. The method according to claim 18, characterized in that the cutting scheme comprises a link to the associated playable media file and information acquisition terminals.
20. The method according to claim 18, characterized in that the cutting scheme comprises links to fragments or fragment combinations of the playable media file, obtained by actually cutting the associated playable media file according to the fragments of the external time code.
21. A search server for playable media files, characterized in that it comprises:
A first memory, for storing association information between external time code files and playable media files on the Internet, and a first cutting scheme for the playable media files, the first cutting scheme comprising a link to the associated playable media file and information acquisition terminals, the terminals being obtained by analyzing the external time code file;
A requestor, for retrieving the corresponding cutting scheme according to the user's query condition;
An interface unit, for receiving the user's query request and returning the corresponding cutting scheme.
22. The search server according to claim 21, characterized in that the terminals are the start and stop location points of the packets or bytes, in the associated playable media file, that correspond to the external time code file fragment.
23. The search server according to claim 21, characterized in that:
The terminals are the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
The search server further comprises a start-stop location point computing unit, for calculating the actual storage start and stop location points of the corresponding packets or bytes in the playable media file from the start and end time points and the compressed format of the playable media file.
24. The search server according to claim 21, characterized in that:
The terminals are the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
The search server further comprises a start-stop location point determining unit, for calling the corresponding playing program, first reading the file header information and part of the media information of the associated playable media file, then seeking to the start and end time points to obtain byte offset information, and thereby determining the actual storage start and stop location points of the corresponding packets or bytes in the playable media file.
25. The search server according to claim 21, characterized in that it further comprises:
A second memory, for storing a second cutting scheme, the second cutting scheme comprising links to fragments or fragment combinations of the associated playable media file, each link being obtained by actually cutting the associated playable media file according to the fragments of the external time code and storing the result separately.
26. The search server according to claim 21, characterized in that it further comprises:
A download unit, for obtaining the required fragments according to the user's query request, and either generating a corresponding playlist and returning it to the client together with the fragments, or combining the fragments and returning the result to the client.
27. An information acquisition client for playable media files, characterized in that it comprises:
A cutting scheme acquiring unit, for receiving a cutting scheme for a playable media file, the cutting scheme comprising a link to the associated playable media file and information acquisition terminals, the terminals being the start and end time points, in the associated playable media file, that correspond to the external time code file fragment;
A start-stop location point determining unit, for determining, according to the start and end time points, the actual storage start and stop location points of the corresponding packets or bytes in the playable media file;
An information acquisition unit, for completing information acquisition.
28. The information acquisition client according to claim 27, characterized in that:
The start-stop location points are determined by calculating the actual storage start and stop location points of the corresponding packets or bytes in the playable media file from the start and end time points and the compressed format of the playable media file.
29. The information acquisition client according to claim 27, characterized in that:
The start-stop location points are determined by calling the corresponding playing program, first reading the file header information and part of the media information of the associated playable media file, then seeking to the start and end time points to obtain byte offset information, and thereby determining the actual storage start and stop location points of the corresponding packets or bytes in the playable media file.
30. The information acquisition client according to claim 27, characterized in that it further comprises:
An integrator, for integrating multiple playable media file fragments downloaded to local storage.
31. The information acquisition client according to claim 27 or 30, characterized in that it further comprises:
A wrapper, for encapsulating the playable media file fragments downloaded to local storage as new playable media files in the required compressed format.
32. The information acquisition client according to claim 27, characterized in that it further comprises:
A cutting scheme determining unit, for analyzing the received external time code file and determining the required fragments of the external time code file; and for obtaining, according to the determined fragments of the external time code file, the cutting scheme for the associated playable media file.
33. The information acquisition client according to claim 32, characterized in that, when the climax fragment of an audio file is to be searched for, the cutting scheme determining unit specifically comprises:
A text analysis module, for determining, by text analysis of the external time code file, identification parameters that include the number of repetitions;
A weight assigning module, for assigning a weight value to each fragment of the external time code file according to the identification parameters, the magnitude of the weight value determining the priority with which the fragment is treated as a climax fragment;
A cutting scheme determination module, for obtaining the cutting scheme for the associated playable media file according to the determined fragments of the external time code file.
34. The information acquisition client according to claim 27, characterized in that it further comprises:
A playlist generation unit, for building a playlist according to which a corresponding player can play the fragments of multiple playable media files continuously.
CNB2006101507080A 2006-10-24 2006-10-24 Searching method, system and apparatus for playing media file Active CN100501738C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006101507080A CN100501738C (en) 2006-10-24 2006-10-24 Searching method, system and apparatus for playing media file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006101507080A CN100501738C (en) 2006-10-24 2006-10-24 Searching method, system and apparatus for playing media file

Publications (2)

Publication Number Publication Date
CN1949227A true CN1949227A (en) 2007-04-18
CN100501738C CN100501738C (en) 2009-06-17

Family

ID=38018743

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006101507080A Active CN100501738C (en) 2006-10-24 2006-10-24 Searching method, system and apparatus for playing media file

Country Status (1)

Country Link
CN (1) CN100501738C (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101895736A (en) * 2010-07-07 2010-11-24 杭州华三通信技术有限公司 Media stream data processing method and device thereof
CN102377914A (en) * 2011-09-22 2012-03-14 宇龙计算机通信科技(深圳)有限公司 Terminal and multimedia file management method
CN102882900A (en) * 2011-07-11 2013-01-16 阿里巴巴集团控股有限公司 Application and deployment method for large-scale server cluster and large-scale server cluster
CN104735545A (en) * 2015-03-31 2015-06-24 努比亚技术有限公司 Audio/video file playing method and system
CN105302839A (en) * 2014-07-31 2016-02-03 腾讯科技(深圳)有限公司 File filtration method and system
CN105843393A (en) * 2016-03-30 2016-08-10 苏州合欣美电子科技有限公司 Video player for self-adaptively adjusting subtitles
WO2018113535A1 (en) * 2016-12-22 2018-06-28 阿里巴巴集团控股有限公司 Method and apparatus for automatically generating dubbing characters, and electronic device
CN108513140A (en) * 2018-03-05 2018-09-07 北京电广聪信息技术有限公司 A method of it repeating advertising segment for screening in audio and generates a mao audio
CN108694203A (en) * 2017-04-11 2018-10-23 北京雷石天地电子技术有限公司 A kind of continuous method and apparatus for playing song portions content

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101895736A (en) * 2010-07-07 2010-11-24 杭州华三通信技术有限公司 Media stream data processing method and device thereof
CN101895736B (en) * 2010-07-07 2013-03-13 浙江宇视科技有限公司 Media stream data processing method and device thereof
CN102882900A (en) * 2011-07-11 2013-01-16 阿里巴巴集团控股有限公司 Application and deployment method for large-scale server cluster and large-scale server cluster
CN102882900B (en) * 2011-07-11 2016-06-22 阿里巴巴集团控股有限公司 Large-scale server cluster application section arranging method and large-scale server cluster
CN102377914A (en) * 2011-09-22 2012-03-14 宇龙计算机通信科技(深圳)有限公司 Terminal and multimedia file management method
CN105302839A (en) * 2014-07-31 2016-02-03 腾讯科技(深圳)有限公司 File filtration method and system
CN104735545A (en) * 2015-03-31 2015-06-24 努比亚技术有限公司 Audio/video file playing method and system
CN105843393A (en) * 2016-03-30 2016-08-10 苏州合欣美电子科技有限公司 Video player for self-adaptively adjusting subtitles
WO2018113535A1 (en) * 2016-12-22 2018-06-28 阿里巴巴集团控股有限公司 Method and apparatus for automatically generating dubbing characters, and electronic device
CN108694203A (en) * 2017-04-11 2018-10-23 北京雷石天地电子技术有限公司 A kind of continuous method and apparatus for playing song portions content
CN108513140A (en) * 2018-03-05 2018-09-07 北京电广聪信息技术有限公司 A method of it repeating advertising segment for screening in audio and generates a mao audio

Also Published As

Publication number Publication date
CN100501738C (en) 2009-06-17

Similar Documents

Publication Publication Date Title
CN1949227A (en) Searching method, system and apparatus for playing media file
CN1227613C (en) Database annotation and retrieval
CN1176432C (en) Method and system for providing national language inquiry service
CN1242337C (en) Method and system for accessing information on a network
CN1533163A (en) Free text and attribute search of electronic program guide data
CN1731396A (en) System and method for implementing network resource search by mobile terminal
CN1799051A (en) Method for browsing contents using page storing file
CN1647528A (en) Meta data edition device, meta data reproduction device, meta data distribution device, meta data search device, meta data reproduction condition setting device, and meta data distribution method
CN1530926A (en) Phonetic recognizing dictionary producer and information search device
CN1757020A (en) Information processing device, information processing method, and computer program
CN1877582A (en) Advertisement information retrieval system and method therefor
CN1327195A (en) Method and system for accessing information on network
WO2015101155A1 (en) Method for recommending information to user
CN1912872A (en) Method and system for abstracting new word
CN1932756A (en) Dynamically generating a voice navigable menu for synthesized data
CN1328668A (en) System and method for specifying www site
CN1625740A (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
CN101048773A (en) Document analysis system and document adaptation system
CN1667609A (en) Document information management system and document information management method
CN1924860A (en) Search engine based search result fast pre-reading device
CN101080927A (en) Content recommendation device
CN1871603A (en) System and method for processing a query
CN1768373A (en) Information processing device, information processing method, and computer program
CN1867142A (en) Method and system for mobile terminal device obtaining computer information
CN101069181A (en) Storage device and recording medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant