CN110134801A - A kind of matching process and storage medium of work title and multimedia file - Google Patents

A kind of matching process and storage medium of work title and multimedia file Download PDF

Info

Publication number
CN110134801A
CN110134801A CN201910349555.XA CN201910349555A CN110134801A CN 110134801 A CN110134801 A CN 110134801A CN 201910349555 A CN201910349555 A CN 201910349555A CN 110134801 A CN110134801 A CN 110134801A
Authority
CN
China
Prior art keywords
title
multimedia file
character
matching
work
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910349555.XA
Other languages
Chinese (zh)
Inventor
李震宇
周兴华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Star Net eVideo Information Systems Co Ltd
Original Assignee
Fujian Star Net eVideo Information Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Star Net eVideo Information Systems Co Ltd filed Critical Fujian Star Net eVideo Information Systems Co Ltd
Priority to CN201910349555.XA priority Critical patent/CN110134801A/en
Publication of CN110134801A publication Critical patent/CN110134801A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43Querying
    • G06F16/435Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to multimedia technology field, the matching process and storage medium of a kind of work title and multimedia file are provided, the matching process of the work title and multimedia file is the following steps are included: obtain work title and multimedia file to be matched;Processing is filtered to the title of the multimedia file, removes preset characters or character string wherein included;The work title is matched with the title of the multimedia file after filtration treatment, obtains matching score;It is associated with the work title that the highest multimedia file of score will be matched.When the invention is filtered processing to the title of multimedia file, useless character therein can be filtered out, promotes the readability and matching accuracy rate of the title of multimedia file, while also reducing the data processing amount and time-consuming of matching process, improves matching efficiency.

Description

A kind of matching process and storage medium of work title and multimedia file
Technical field
The present invention relates to multimedia technology fields, more particularly to the matching process of a kind of work title and multimedia file And storage medium.
Background technique
In the physical storage medium of the multimedia software such as movie library or music libraries, it is stored with a large amount of multimedia file, And it needs constantly to update the multimedia file (i.e. new works) issued in the recent period.However the multimedia file updated is by not What same channel was collected into, since naming rule of the different data providings to multimedia file is usually different, often Character string containing various complexity;Meanwhile there is also multiple format, such as mp4, avi, mkv, iso-dvd for multimedia file;Cause This causes the multimedia file for being difficult to be collected into be mapped with the work title in list.
Patent No. CN201810814634.9, patent name are " film information acquisition methods, device and mobile terminal " A kind of method that shadow library is matched according to movie name is disclosed, this method uses correlation machine learning method, does participle construction, will Each participle matches one by one with movie name in shadow library, re-defines a threshold value, assesses the matching degree of each participle.This method To complicated character string such as " ten face x .y bury z volt ", discrimination is low and time-consuming higher, and traditional similarity of character string algorithm The title that (Jaro-Winkler, editing distance etc.) is not suitable for this multimedia file yet contains the scene of complex characters string.
Due to without mature effective matching scheme, can accurately and rapidly by multimedia file and make in the prior art The name of an article claims to be mapped, and therefore, in the data preparation or renewal process of multimedia software, require a great deal of time progress Match multimedia file work.
Summary of the invention
For this reason, it may be necessary to the matching process of a kind of work title and multimedia file be provided, for solving the above-mentioned prior art It is difficult to accurate, the technical issues of quickly being matched multimedia file with work title.
To achieve the above object, the present invention provides the matching process of a kind of work title and multimedia file, including with Lower step:
Obtain work title and multimedia file;
Processing is filtered to the title of the multimedia file, removes preset characters or character string wherein included;
The work title is matched with the title of the multimedia file after filtration treatment, obtains matching score;
It is associated with the work title that the highest multimedia file of score will be matched.
Further, the step " is filtered processing to the title of the multimedia file, removes wherein included pre- If character or character string " the following steps are included:
Processing is filtered to the title of the multimedia file, included in the title for removing the multimedia file Character in addition to Chinese, English character, number, ' (', ') ' and ': '.
Further, the step " is filtered processing to the title of the multimedia file, removes wherein included pre- If character or character string " the following steps are included:
Capitalization English character in the title of the multimedia file is replaced with into corresponding small English character.
Further, the step " is filtered processing to the title of the multimedia file, removes wherein included pre- If character or character string " the following steps are included:
Processing is filtered to the title of the multimedia file, included in the title for removing the multimedia file Year information and file format information.
Further, the step " is filtered processing to the title of the multimedia file, removes the multimedia text Year information included in the title of part and file format information " the following steps are included:
It is boundary with blank character, the title of the multimedia file is divided into more than two substrings;
Judge whether each substring is time character, if second substring or the substring after it are the time Character then removes the substring and other substrings later.
Further, the step is " by the title progress of the multimedia file after the work title and filtration treatment Match, obtain matching score " the following steps are included:
The title of the work title character and overanxious treated the multimedia file one by one is compared, is judged Character in the work title whether there is in the multimedia title;
It is scored one by one according to the comparing result of preset matching degree scoring mechanism and each character, by each scoring It is added and obtains the matching score of the title of the multimedia file.
Further, the preset matching degree scoring mechanism includes:
According to the type of character matched in the title of the multimedia file, different the are added up to the matching score One score value, wherein the first score value is positive number, and the first score value of Chinese character is greater than the first of English character or numerical character Score value;
When the character in the title of the multimedia file there are continuous coupling, then the is added up into the matching score again Two score values, the second score value are positive number and are positively correlated with the number of continuous coupling character;
When unmatched character occurs in the title of the multimedia file, add up third score value into the matching score, Wherein, third score value is negative.
Further, the preset matching degree scoring mechanism further include:
Add up in the title of the multimedia file to match the interval times of character or character string, when the interval times are super Crossing preset value then terminates matching degree score.
Further, the multimedia names are film or song, the match party of the work title and multimedia file Method further comprises the steps of:
The multimedia file or work title are shown from high to low according to the matching score.
Further, in order to solve the above technical problems, the present invention also provides another technical solutions:
A kind of computer readable storage medium is stored thereon with computer program, real when described program is executed by processor Step described in existing any of the above item technical solution.
It is different from the prior art, above-mentioned technical proposal is when for multimedia file match information, to the name of multimedia file Title is filtered processing, then again will filtering to remove preset characters or character string included in the title of multimedia file The title of treated multimedia file is matched with work title, will match the highest multimedia file of score and institute It is associated to state work title.Wherein, when being filtered processing to the title of the multimedia file retrieved, nothing therein can be filtered out With character, its readable and matching accuracy rate of file is promoted, while also reducing the data processing amount and time-consuming of matching process, is improved Matching efficiency.
Detailed description of the invention
Fig. 1 is the step flow chart of the matching process of work title described in specific embodiment and multimedia file;
Fig. 2 is the flow chart that specific embodiment carries out the name-matches of the multimedia file;
Fig. 3 is the flow chart of matching degree scoring mechanism described in specific embodiment;
Fig. 4 is computer readable storage medium block diagram described in specific embodiment.
Specific embodiment
Technology contents, construction feature, the objects and the effects for detailed description technical solution, below in conjunction with specific reality It applies example and attached drawing is cooperated to be explained in detail.
It please refers to Fig.1 to Fig.3, present embodiments provides the matching process of a kind of work title and multimedia file.It is described The matching process of work title and multimedia file can be used in the storage medium to multimedia software such as shadow library, music libraries being deposited The multimedia files such as film, music, the MV of storage are matched with the standard work title in works list.The work title With the matching process of multimedia file comprising steps of
S101, work title and multimedia file are obtained;Wherein, work title is being obtained and when multimedia file, it can one The multiple work titles of secondary acquisition and multiple multimedia files.The work title is the title of the media work of standard, such as The work title can in the playlist of softwares such as shadow library, music libraries work title (such as movie name: ambushing on all sides, Song title: children's stories town).Wherein, work title can be collected into together when collecting multimedia file, for example, passing through a certain canal When road is collected into a collection of movie file, which provides the works list of this batch of film simultaneously.The multimedia file can be with For different types such as film, music, MV, also, every kind multimedia file can there are many different formats, for example, film is literary The format of part may include mp4, avi, mkv, iso-dvd etc..For ease illustration and understanding, we determine the work title Justice is the first title, is the second title by the name definition of the multimedia file.Due to the name of newly-increased multimedia file Be it is irregular, can not be directly linked with the work title in newly-increased works list, therefore can not directly pass through when in use Work title in works list is directly linked to corresponding multimedia file.Therefore, it is necessary to will be in works list before using Work title matched with the multimedia file in storage medium.After obtaining work title and multimedia file, i.e., S102 can be entered step.
S102, processing is filtered to the title of the multimedia file, removes preset characters wherein included or character String;Wherein, due to containing the other information in addition to work title in the title of the multimedia file, in order to mention High matched accuracy and timeliness, the information unrelated with work title should be able to be filtered out in filtration treatment, and can guarantee not Mistake filters out work title.Therefore, the preset characters or character string should be the name of expressed information and multimedia file Claim unrelated character or character string, such as most spcial character, or includes the character string of these spcial characters.With film For, it may include time character, Video coding character, resolution character or tray in the preset characters or character string Formula character etc..S103 is entered step after the filtration treatment.
S103, by the title (i.e. second of the multimedia file after the work title (i.e. the first title) and filtration treatment Title) it is matched, obtain matching score;It wherein, can be by the title of the multimedia file after filtration treatment when being matched (i.e. filtered second title) character is matched one by one with the work title (i.e. the first title), and according to matching degree It obtains matching score accordingly, matching score and matching degree are positively correlated, and matching degree is higher, and matching score is also higher, on the contrary ?.By the title of matching treatment you can get it each multimedia file and the matching score of the work title, and can will be each Multimedia file is arranged from high to low by matching score;Obtain the title of each work title Yu filtered multimedia file Matching score, and can by work title by matching score arrange from high to low.It is entered step after obtaining matching score S104。
S104, will to match the highest multimedia file of score associated with the work title, or will match score The highest work title is associated with the multimedia file.Therefore, the name of an article is made with described by the multimedia file Title is associated, and can be positioned by work title to corresponding multimedia file, is carried out by softwares such as shadow library, music libraries When multimedia file plays, corresponding multimedia file may link to by the work title in selection works list and broadcast It puts.And the information can be shown in terminal, so that users can browse., it can be achieved that it automatically will be in storage medium by the above method Multimedia file be associated with work title, without user be manually operated, improve matching efficiency.
Title (i.e. second place of the matching process of the work title and multimedia file to the multimedia file retrieved Claim) when being filtered processing, useless character therein can be filtered out, readable and subsequent of the title of multimedia file is promoted With accuracy rate matched in step, while also reducing the data processing amount and time-consuming of matching process, improves matching efficiency.
In one embodiment, the step: in S102 " processing is filtered to the title of the multimedia file, is removed Preset characters or character string wherein included " the following steps are included:
Processing is filtered to the title of the multimedia file, included in the title for removing the multimedia file Character in addition to Chinese, English character, number, ' (', ') ' and ': '.Filtering step is poly- also referred to as shallowly to be filtered, and is mainly The filter type is more for all spcial characters (retaining ' (' and ') ' and ': ') for filtering out in title in addition to Chinese and English character and number Simply, it is easy to implement, while original information can be retained again.For the ease of subsequent matching step, in this embodiment, can also incite somebody to action All capitalization English character letters are changed into lowercase in information.
In another embodiment, the step: in S102 " processing is filtered to the title of the multimedia file, is gone Except preset characters wherein included or character string " the following steps are included: removing included in the title of the multimedia file Year information and file format information.Due to the year information or tray in the title (i.e. the second title) of multimedia file Formula information have relatively fixed format, convenient for identification and it is uncorrelated to work title, so by filter out year information and File format information can be further simplified the title (i.e. the second title) of the multimedia file, to reduce the number of matching step According to treating capacity, and reduce its influence with matching precision, improves matching accuracy.
For inventor by a large amount of observations, discovery is stored in local multimedia file especially movie file, name rule Then have a common ground: before the title of multimedia file often appears in publication year, and centre is if there is blank character (such as ' .' or space), then subsequent field be almost and the title garbage of file (such as Video coding, resolution ratio, compression Group, caption information etc.).Therefore, in one embodiment, the institute in the title (i.e. the second title) for removing the multimedia file It can be specifically boundary with blank character, by the name of the multimedia file when year information and file format information that include (i.e. the second title) is claimed to be divided into more than two substrings;And judge whether comprising time character in each substring, if It include time character in second substring or substring after it, then removal includes the son of the time character Character string and other substrings later, if before time character appears in movie name as the first substring, such as " 2017. ambush on all sides .1080p " then all retain, wherein time character can be identified by regular expression.It can delete in this way Except the time character in the title (i.e. the second title) of multimedia file, but can remain using the time as the file of movie name such as "2012".Therefore, the filtering method data processing amount is small, and can filter a large amount of garbages by the filter type, and to the greatest extent Work title may only be retained, greatly accelerate matching efficiency and accuracy.It is situated between for the storage of movie library, this kind of software of music libraries There are a large amount of multimedia files to need to match in matter, above-mentioned matching process is with regard to especially suitable.
As shown in Fig. 2, in one embodiment, the step S103 is " by the work title (i.e. the first title) and filtering The title (i.e. the second title) of treated multimedia file is matched, and matching score is obtained " the following steps are included:
S201, by the work title (i.e. the first title) character and overanxious treated the multimedia file one by one Title (i.e. the second title) compares, and judges in the work title that the character of (i.e. the first title) whether there is in described more In the title (i.e. the second title) of media;
S202, it is scored one by one according to the comparing result of preset matching degree scoring mechanism and each character, Jiang Gesuo Commentary split-phase adds the matching score for obtaining the title of the multimedia file.
Specifically, in title (the i.e. second place for carrying out work title (i.e. first title) and the multimedia file Claim) comparison when, settable two pointers, pointer is used for the work title (i.e. the first title), another pointer is used for The title (i.e. the second title) of the multimedia file, the initial position of two pointers are respectively directed to initial position (i.e. first A character position).
In comparison, the first character of the work title (i.e. the first title) is found first in the multimedia file The position that occurs of title (i.e. the second title), if occurring, the title (i.e. the first title) of the multimedia file is corresponding Pointer is directed toward the position, and if it does not exist, then pointer moves down one to the work title (i.e. the first title), and it is current to find pointer Whether signified character occurs in the title (i.e. the second title) of the multimedia file, the then title of the multimedia file The pointer of (i.e. the second title) is directed toward the position.The rest may be inferred by each character in the work title (i.e. the first title) by One compares with each character in the media work title, until finding initial matched character.
If it is identical during the comparison process, to encounter character pointed by two pointers, then two pointers move down one;Encounter two When the character that pointer is directed toward is inconsistent, the character of the work title (i.e. the first title) direction is found in the multimedia file Title (i.e. the second title) current location after occur position, and if it exists, the then title (i.e. second of the multimedia file Title) pointer is directed toward the position;If it does not exist, work title (i.e. the first title) pointer moves down one, then finds current Character that work title (i.e. the second title) pointer is directed toward is worked as in title (i.e. the second title) of the multimedia file The position occurred after front position, and if it exists, then title (i.e. the second title) pointer of the multimedia file is directed toward the position, Otherwise matching terminates.Two pointers have been directed toward end if there is one of pointer, then matching terminates.
In other embodiments, it additionally provides and realizes the work title and multimedia file by way of array Title comparison, wherein settable two arrays, and the work title and more matchmakers are respectively directed to by the subscript of change array Kinds of characters in the title of body file realizes the scheme of character comparison and the side that comparison is realized above by pointer by array Case is approximate, just repeats no more here.As can be seen from the above description, in the matching process, can guarantee in the work title Each character can be scanned and compare one by one with each character in the title of multimedia file, and matching degree is scored Mechanism one by one scores to each character in the work title, to guarantee the reliability of obtained matching score.
In view of character different types of in work title has different weights to matching degree, in order to improve work title Matched accuracy, as shown in figure 3, a kind of matching degree scoring mechanism is provided in one embodiment, the preset matching degree Scoring mechanism includes:
The type of matched character in S301, the title according to the multimedia file, it is cumulative not to the matching score The first same score value, wherein the first score value is positive number, and the first score value of Chinese character is greater than English character or numerical character The first score value;
S302, when the character in the title of the multimedia file there are continuous coupling, then again into the matching score Cumulative second score value, the second score value are positive number and are positively correlated with the number of continuous coupling character;
S303, the unmatched character of title appearance when the multimedia file, add up third into the matching score Score value, wherein third score value is negative.
For the ease of score, an initial matching score and interval times (latter two i.e. first matched word can be set Symbol is intermediate to be there is mismatching character to be just calculated as interval primary), such as initial matching score is set as 10 points, no maximum, interval meter Number is initially 0.
When additional character (i.e. (', ') ' and ': ' for matching a Chinese character or reservation) when, add the basis point is upper One score value (such as 5 points);
When matching an English character or numerical character, above add the first score value (such as 2 points) on basis point;
When non-English character or numerical character, every continuous coupling, before basis point upper bonus point (i.e. the first score value) It puts on, then plus continuous coupling number point (i.e. the second score value).Three times such as continuous coupling, then add 5+ (5+1)+(5+2) in total three times =18 points;
When encountering a mismatch character (not distinguishing Chinese and English character, number and other symbols), plus third score value (such as -5 points, that is, detain 5 points).
At the end of matching, score is matched if more than 0, returns to the number of current matching score and identical characters (referred to as With number);Otherwise it fails to match.The work title or multimedia file that it fails to match can be abandoned.
In the matching degree scoring mechanism, assigned first point of the additional character of matched Chinese character and reservation Value, greater than the second score value that matched English character or numerical character are assigned, therefore prominent Chinese character is in work title With importance in degree;And when the character of continuous coupling occur, on the basis of cumulative first score value, it is additionally accumulated second Score value protrudes character continuous coupling to the importance of work title matching degree, which combines the two, improves The accuracy of the name-matches of multimedia file.
In an implementation column, the preset matching degree scoring mechanism further include:
Add up matching character or the interval times of character string in the title (i.e. the second title) of the multimedia file, works as institute Stating interval times then terminates matching degree score more than preset value.
For example, in title (i.e. the second title) comparison process of the work title (i.e. the first title) and multimedia file In, it is mismatched if encountering two character strings, space-number adds 1, once interval times reach preset value such as 3 times, then determines For it fails to match, and terminate the secondary matching.By the matching degree scoring mechanism accumulation interval number, matching knot can be determined in advance Whether fruit fails, and terminates in advance the matched scoring process, to reduce time waste, improves matching timeliness.
When system batch downloads a large amount of multimedia files, and obtains corresponding work title simultaneously, in above-mentioned matching degree It, can be with work title (i.e. the first title) for standard, to the title of multiple and different multimedia files (i.e. the in scoring mechanism Two titles) being given a mark from the matching degree of the work title (i.e. the first title) (calculates the title of different multimedia files Matching score), and the highest multimedia file of score will be matched in the database and built with the work title (i.e. the first title) Vertical association, i.e., the highest multimedia file of matching score is exactly file corresponding to the work title.Using the technical solution, The matching for carrying out work title to batch multimedia file automatically can be achieved.
It, can also be with multimedia text in above-mentioned matching degree scoring mechanism when system only increases a multimedia file The title (i.e. the second title) of part is standard, to the name of multiple and different work title (i.e. the first title) and the multimedia file Claim the matching degree of (i.e. the second title) to be given a mark and (calculate the matching score of different work titles), and in the database will The matching highest work title of score is associated with multimedia file foundation.Using the technical solution, it can be achieved that newly-increased more Media file Auto-matching work title.
In some embodiments, the preset matching degree scoring mechanism may also include prefix constraint, the prefix constraint Specifically: judge that the prefix of work title (i.e. the first title) mismatches whether number is more than preset value (such as twice), if so, Directly determine that it fails to match;If it is not, then each mismatches the certain score value (such as 4 points) of deduction.Wherein, when comparing beginning, If from first character continuously multiple characters, the name in the multimedia file in the work title (i.e. the first title) Claim to can not find matched character in (i.e. the second title), then the number for continuously mismatching character in work title is the works The prefix of title mismatches number, for example, work title is " the World Without Thieve ", and entitled " the ambushing on all sides " of multimedia file, Each character in work title matches difference at this time, and it is 4 that prefix, which mismatches number,;When work title " New Police Story ", and Entitled " story of dolphin " of multimedia file, the 4th character is just matched in work title at this time arrives, and before three Character match less than, therefore its prefix mismatch number be 3.It can also determine that in advance it fails to match by prefix constraint, and The matched scoring process is terminated in advance, to reduce time waste, improves matching timeliness.
In the matching degree scoring mechanism, can also title to the multimedia file of exact matching in continuous coupling bonus point base On plinth, separately add certain score value (such as 10 points).
In one embodiment, it is entitled standard with multimedia file, multiple and different work titles is matched Score, therefore in this embodiment, after carrying out matching score to each work title according to the matching degree scoring mechanism, use By matching and having the work title of matching score by matching, score is high to Low sorts by all for heapsort method, and before selecting Several results;Then each work title is traversed, calculates the quantity (being denoted as a) of matched work title, and by each matching Work title in matched character number (being denoted as l) divided by matched work title quantity (i.e. l/a), if obtained value Less than some threshold value (being typically designed to 0.8, can also design by actual conditions), then the result is abandoned.It will will finally obtain After work title reorders, and the work title to make number one is associated with multimedia file.
It in another embodiment, is to be carried out using work title as standard to the title of multiple and different multimedia files With score, therefore in this embodiment, the title of each multimedia file is matched according to the matching degree scoring mechanism After score, using heapsort method by all by matching and having the title of the multimedia file of matching score high by matching score It sorts to low, and selects several preceding results;Then the title for traversing each multimedia file calculates matched multimedia text The quantity (being denoted as a) of the title of part, and character number (being denoted as l) matched in the title of each matched multimedia file is removed With the quantity (i.e. l/a) of the title of matched multimedia file, if obtained value be less than some threshold values (be typically designed to 0.8, Can also be designed by actual conditions), then abandon the result.After finally obtained multimedia title is reordered, and will The multimedia to make number one is associated with work title.
In another embodiment, a kind of computer readable storage medium 400 is provided, is deposited on computer readable storage medium Computer program is contained, the step in any of the above item embodiment is realized when described program is executed by processor.Preferably, described Computer readable storage medium 400 is KTV, the readable storage medium storing program for executing of multimedia order programme system in digital entertainments place such as drinks.
It should be noted that being not intended to limit although the various embodiments described above have been described herein Scope of patent protection of the invention.Therefore, it based on innovative idea of the invention, change that embodiment described herein is carried out and is repaired Change, or using equivalent structure or equivalent flow shift made by description of the invention and accompanying drawing content, it directly or indirectly will be with Upper technical solution is used in other related technical areas, is included within scope of patent protection of the invention.

Claims (10)

1. the matching process of a kind of work title and multimedia file, which comprises the following steps:
Obtain work title and multimedia file to be matched;
Processing is filtered to the title of the multimedia file, removes preset characters or character string wherein included;
The work title is matched with the title of the multimedia file after filtration treatment, obtains matching score;
It is associated with the work title that the highest multimedia file of score will be matched.
2. the matching process of work title according to claim 1 and multimedia file, which is characterized in that the step " being filtered processing to the title of the multimedia file, remove preset characters or character string wherein included " includes following step It is rapid:
Processing is filtered to the title of the multimedia file, is removed included in the title for removing the multimedia file Character except text, English character, number, ' (', ') ' and ': '.
3. the matching process of work title according to claim 2 and multimedia file, which is characterized in that the step " being filtered processing to the title of the multimedia file, remove preset characters or character string wherein included " includes following step It is rapid:
Capitalization English character in the title of the multimedia file is replaced with into corresponding small English character.
4. the matching process of work title according to claim 1 and multimedia file, which is characterized in that the step " being filtered processing to the title of the multimedia file, remove preset characters or character string wherein included " includes following step It is rapid:
Processing is filtered to the title of the multimedia file, removes the time included in the title of the multimedia file Information and file format information.
5. the matching process of work title according to claim 4 and multimedia file, which is characterized in that the step " processing is filtered to the title of the multimedia file, removes the letter of time included in the title of the multimedia file Breath and file format information " the following steps are included:
It is boundary with blank character, the title of the multimedia file is divided into more than two substrings;
Judge in each substring whether to be time character string, if second substring or the substring after it are the time Character string then removes the substring and other substrings later.
6. according to the matching process of claim 2 to the 5 any work title and multimedia file, which is characterized in that institute Stating step " matching the work title with the title of the multimedia file after filtration treatment, obtain matching score " includes Following steps:
The title of the work title character and overanxious treated the multimedia file one by one is compared, described in judgement Character in work title whether there is in the title of the multimedia file;
It is scored one by one according to the comparing result of preset matching degree scoring mechanism and each character, each scoring is added Obtain the matching score of the title of the multimedia file.
7. the matching process of work title according to claim 6 and multimedia file, which is characterized in that described preset Matching degree scoring mechanism includes:
According to the type of character matched in the title of the multimedia file, add up different first points to the matching score Value, wherein the first score value is positive number, and the first score value of Chinese character is greater than first point of English character or numerical character Value;
When the character in the title of the multimedia file there are continuous coupling, then add up second point into the matching score again Value, the second score value are positive number and are positively correlated with the number of continuous coupling character;
When unmatched character occurs in the title of the multimedia file, add up third score value into the matching score, wherein Third score value is negative.
8. the matching process of work title according to claim 7 and multimedia file, which is characterized in that described preset Matching degree scoring mechanism further include:
Add up in the title of the multimedia file to match the interval times of character or character string, when the discrete data is more than pre- If value then terminates matching degree score.
9. the matching process of work title according to claim 1 and multimedia file, which is characterized in that the multimedia The matching process of entitled film or song, the work title and multimedia file further comprises the steps of:
The multimedia file or work title are shown from high to low according to the matching score.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that described program is processed Device realizes step as described in any one of claim 1 to 9 when executing.
CN201910349555.XA 2019-04-28 2019-04-28 A kind of matching process and storage medium of work title and multimedia file Pending CN110134801A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910349555.XA CN110134801A (en) 2019-04-28 2019-04-28 A kind of matching process and storage medium of work title and multimedia file

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910349555.XA CN110134801A (en) 2019-04-28 2019-04-28 A kind of matching process and storage medium of work title and multimedia file

Publications (1)

Publication Number Publication Date
CN110134801A true CN110134801A (en) 2019-08-16

Family

ID=67575442

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910349555.XA Pending CN110134801A (en) 2019-04-28 2019-04-28 A kind of matching process and storage medium of work title and multimedia file

Country Status (1)

Country Link
CN (1) CN110134801A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612912A (en) * 2021-01-06 2021-04-06 当趣网络科技(杭州)有限公司 Method and system for automatically generating film cover wall

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103942254A (en) * 2014-03-18 2014-07-23 电子科技大学 Pirated video resource discovery method oriented to network disk share service
CN104504021A (en) * 2014-12-11 2015-04-08 北京国双科技有限公司 Data matching method and device
CN107133218A (en) * 2017-05-26 2017-09-05 北京惠商之星网络科技有限公司 Trade name intelligent Matching method, system and computer-readable recording medium
CN107656958A (en) * 2017-06-09 2018-02-02 平安科技(深圳)有限公司 A kind of classifying method and server of multi-data source data
US9892094B2 (en) * 2010-12-28 2018-02-13 Amazon Technologies, Inc. Electronic book pagination
US20180225864A1 (en) * 2015-08-03 2018-08-09 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for reconstructing scene, terminal device, and storage medium
CN109636476A (en) * 2018-12-17 2019-04-16 山东浪潮云信息技术有限公司 A kind of brand name data standardization processing method and device
CN109635276A (en) * 2018-11-12 2019-04-16 厦门市美亚柏科信息股份有限公司 A kind of information matching method and terminal

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9892094B2 (en) * 2010-12-28 2018-02-13 Amazon Technologies, Inc. Electronic book pagination
CN103942254A (en) * 2014-03-18 2014-07-23 电子科技大学 Pirated video resource discovery method oriented to network disk share service
CN104504021A (en) * 2014-12-11 2015-04-08 北京国双科技有限公司 Data matching method and device
US20180225864A1 (en) * 2015-08-03 2018-08-09 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for reconstructing scene, terminal device, and storage medium
CN107133218A (en) * 2017-05-26 2017-09-05 北京惠商之星网络科技有限公司 Trade name intelligent Matching method, system and computer-readable recording medium
CN107656958A (en) * 2017-06-09 2018-02-02 平安科技(深圳)有限公司 A kind of classifying method and server of multi-data source data
CN109635276A (en) * 2018-11-12 2019-04-16 厦门市美亚柏科信息股份有限公司 A kind of information matching method and terminal
CN109636476A (en) * 2018-12-17 2019-04-16 山东浪潮云信息技术有限公司 A kind of brand name data standardization processing method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
邓红卫 等: "《数据结构(C语言版)》", 31 August 2017 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612912A (en) * 2021-01-06 2021-04-06 当趣网络科技(杭州)有限公司 Method and system for automatically generating film cover wall

Similar Documents

Publication Publication Date Title
CN102156751B (en) Method and device for extracting video fingerprint
CN109033086A (en) A kind of address resolution, matched method and device
CN101719167B (en) Interactive movie searching method
CN110362824B (en) Automatic error correction method, device, terminal equipment and storage medium
CN103049568A (en) Method for classifying documents in mass document library
CN102999625A (en) Method for realizing semantic extension on retrieval request
CN106682012A (en) Commodity object information searching method and device
US20100274781A1 (en) Ordered index
CN103514236A (en) Retrieval condition error correction prompt processing method based on Pinyin in retrieval application
CN112883734B (en) Block chain security event public opinion monitoring method and system
CN116722876B (en) Intelligent storage method for user data for format light reading
CN102193995B (en) Method and device for establishing multimedia data index and retrieval
CN112083812A (en) Associative word determining method and device, storage medium and electronic equipment
CN105279281A (en) Internet-of-things data access method
CN103761286B (en) A kind of Service Source search method based on user interest
CN105488471B (en) A kind of font recognition methods and device
CN104615782B (en) Address matching process based on sliding window maximum matching algorithm
CN107219935B (en) Chinese character input system and method for continuously writing Chinese characters and supporting interaction
KR101358793B1 (en) Method of forming index file, Method of searching data and System for managing data using dictionary index file, Recoding medium
CN110134801A (en) A kind of matching process and storage medium of work title and multimedia file
CN103294670A (en) Searching method and system based on word list
CN109254962B (en) Index optimization method and device based on T-tree and storage medium
CN105025013A (en) A dynamic IP coupling model based on a priority Trie tree
CN105740374B (en) Three-dimensional platform data fuzzy query method based on distributed memory
CN102004598B (en) Media player and character input method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190816