CN110019817A - A kind of detection method, device and the electronic equipment of text in video information - Google Patents
A kind of detection method, device and the electronic equipment of text in video information Download PDFInfo
- Publication number
- CN110019817A CN110019817A CN201811473997.7A CN201811473997A CN110019817A CN 110019817 A CN110019817 A CN 110019817A CN 201811473997 A CN201811473997 A CN 201811473997A CN 110019817 A CN110019817 A CN 110019817A
- Authority
- CN
- China
- Prior art keywords
- sentence
- text
- video
- participle
- target photo
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 45
- 238000013145 classification model Methods 0.000 claims abstract description 41
- 238000006243 chemical reaction Methods 0.000 claims abstract description 23
- 238000000034 method Methods 0.000 claims abstract description 22
- 238000000605 extraction Methods 0.000 claims abstract description 9
- 230000015654 memory Effects 0.000 claims description 14
- 239000000284 extract Substances 0.000 claims description 13
- 238000013527 convolutional neural network Methods 0.000 claims description 9
- 238000012549 training Methods 0.000 claims description 5
- 238000012546 transfer Methods 0.000 claims description 4
- 238000004590 computer program Methods 0.000 claims 1
- 238000007689 inspection Methods 0.000 abstract description 4
- 238000012512 characterization method Methods 0.000 abstract description 3
- 238000012545 processing Methods 0.000 description 9
- 238000004891 communication Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 230000009286 beneficial effect Effects 0.000 description 5
- 230000011218 segmentation Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000000712 assembly Effects 0.000 description 2
- 238000000429 assembly Methods 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 230000007787 long-term memory Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000012015 optical character recognition Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- KLDZYURQCUYZBL-UHFFFAOYSA-N 2-[3-[(2-hydroxyphenyl)methylideneamino]propyliminomethyl]phenol Chemical compound OC1=CC=CC=C1C=NCCCN=CC1=CC=CC=C1O KLDZYURQCUYZBL-UHFFFAOYSA-N 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 201000001098 delayed sleep phase syndrome Diseases 0.000 description 1
- 208000033921 delayed sleep phase type circadian rhythm sleep disease Diseases 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7844—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Character Discrimination (AREA)
Abstract
Subject description discloses detection method, device and the electronic equipments of a kind of text in video information.The inspection method of the text in video information includes: to extract Target Photo from video to be detected, including the key frame in video to be detected;Text information is extracted from Target Photo, and the sentence after text sentence is segmented is carried out to the text information;Vector conversion further is carried out to the sentence after participle, obtains the term vector segmented in sentence;The term vector of sentence and conversion acquisition after participle is finally inputted into textual classification model, semantics recognition is carried out by textual classification model and whether is exported in characterization text information comprising with the semantics recognition result for presetting semantic text, to realize the detection of violation video text, the extraction of key frame and the semantics recognition of text information avoid not detecting violation video text because of the variation of simple literal expression mode i.e. in video, improve the accuracy of violation video text detection and check efficiency.
Description
Technical field
This specification is related to software technology field, in particular to a kind of detection method of text in video information, device and
Electronic equipment.
Background technique
With the continuous development of network technique, multimedia resource is in explosive growth, wherein the growth of video resource is especially
Rapidly, the supervision of video quality is particularly important.Video quality includes the supervision of picture material and the supervision of video text, figure
As the interception of content supervised mainly to violation picture material, have been able to be blocked well by image recognition technology
It cuts, but the supervision for video text, due to the variability and the uncertainty of appearance position in video of text, video text
Violation be difficult to be detected, a kind of detection method of text in video information is needed, to realize the inspection of violation video text
It surveys.
Summary of the invention
This specification embodiment provides detection method, device and the electronic equipment of a kind of text in video information, for real
Now to the detection of violation video text, the accuracy of violation video text detection is improved.
In a first aspect, this specification embodiment provides a kind of detection method of text in video information, comprising:
Target Photo is extracted from video to be detected, wherein the Target Photo includes the pass in the video to be detected
Key frame;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By after the participle sentence and the term vector input textual classification model, by the textual classification model into
Row semantics recognition simultaneously exports semantics recognition as a result, whether the semantics recognition result is for characterizing in the text information comprising tool
There is the text for presetting semanteme.
Second aspect, this specification embodiment provide a kind of detection device of text in video information, comprising:
Picture extraction unit, for extracting Target Photo from video to be detected, wherein the Target Photo includes described
Key frame in video to be detected;
Word Input unit, for extracting text information from the Target Photo;
Participle unit, for carrying out text sentence participle to the text information, the sentence after being segmented;
Vector transduced cell is obtained and is segmented in the sentence for carrying out vector conversion to the sentence after the participle
Term vector;
Recognition unit, for by after the participle sentence and the term vector input textual classification model, by described
Textual classification model carries out semantics recognition and exports semantics recognition as a result, the semantics recognition result is for characterizing the text letter
Whether comprising with the text for presetting semanteme in breath.
The third aspect, this specification embodiment provide a kind of computer readable storage medium, are stored thereon with computer journey
Sequence, the program perform the steps of when being executed by processor
Target Photo is extracted from video to be detected, wherein the Target Photo includes the pass in the video to be detected
Key frame;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By after the participle sentence and the term vector input textual classification model, by the textual classification model into
Row semantics recognition simultaneously exports semantics recognition as a result, whether the semantics recognition result is for characterizing in the text information comprising tool
There is the text for presetting semanteme.
Fourth aspect, this specification embodiment provide a kind of electronic equipment, include memory and one or one
Above program, one of them perhaps more than one program be stored in memory and be configured to by one or one with
It includes the instruction for performing the following operation that upper processor, which executes the one or more programs:
Target Photo is extracted from video to be detected, wherein the Target Photo includes the pass in the video to be detected
Key frame;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By after the participle sentence and the term vector input textual classification model, by the textual classification model into
Row semantics recognition simultaneously exports semantics recognition as a result, whether the semantics recognition result is for characterizing in the text information comprising tool
There is the text for presetting semanteme.
Said one or multiple technical solutions in this specification embodiment, at least have the following technical effect that
This specification embodiment provides a kind of detection method of text in video information, extracts target from video to be detected
Picture, the Target Photo include the key frame in the video to be detected;Text information is extracted from Target Photo;Text is believed
Breath carries out text sentence participle, the sentence after being segmented;Word in vector conversion acquisition sentence is carried out to the sentence after participle
Vector;By after participle sentence and term vector input textual classification model, semantics recognition and defeated is carried out by textual classification model
Characterize in text information whether the semantics recognition comprising the text with default semanteme is as a result, and by believing text in video out
The semantics recognition of breath, to realize the detection of violation video text, avoid because simple literal expression mode variation and can not
It detects violation video text, improves the accuracy of violation video text detection.Further, this illustrates that embodiment provides upper
Method is stated, it is obvious due to covering in key frame by extracting key frame in video when carrying out the detection of text in video information
The video content of variation extracts text information from key frame and is identified, can greatly reduce the calculating of video text identification
Amount reaches the beneficial effect for improving violation video text detection efficiency.
Detailed description of the invention
In order to illustrate more clearly of the technical solution in this specification embodiment, embodiment or the prior art will be retouched below
Attached drawing needed in stating is briefly described, it should be apparent that, the accompanying drawings in the following description is the one of this specification
A little embodiments for those of ordinary skill in the art without any creative labor, can also be according to this
A little attached drawings obtain other attached drawings.
Fig. 1 is a kind of flow diagram of the detection method for text in video information that this specification embodiment provides;
Fig. 2 is a kind of schematic diagram of the detection device for text in video information that this specification embodiment provides;
Fig. 3 is the schematic diagram for a kind of electronic equipment that this specification embodiment provides.
Specific embodiment
To keep the purposes, technical schemes and advantages of this specification embodiment clearer, below in conjunction with this specification reality
The attached drawing in example is applied, the technical solution in this specification embodiment is clearly and completely described, it is clear that described reality
Applying example is this specification a part of the embodiment, instead of all the embodiments.The embodiment of base in this manual, this field are general
Logical technical staff every other embodiment obtained without creative efforts belongs to this specification protection
Range.
Detection method, device and the electronic equipment of a kind of text in video information are provided in this specification embodiment, is used for
It realizes the detection to violation video text, improves the accuracy of violation video text detection.
With reference to the accompanying drawing to the main realization principle of this specification embodiment technical solution, specific embodiment and its right
The beneficial effect that should be able to reach is explained in detail.
Embodiment
Referring to FIG. 1, this specification embodiment provides a kind of detection method of text in video information, this method comprises:
S10: extracting Target Photo from video to be detected, and the Target Photo includes the key in the video to be detected
Frame;
S12: text information is extracted from the Target Photo;
S14: text sentence participle is carried out to the text information, the sentence after being segmented;
S16: vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
S18: by the sentence and term vector input textual classification model after the participle, pass through the text classification mould
Type carries out semantics recognition and exports semantics recognition as a result, whether the semantics recognition result wraps for characterizing in the text information
Semantic text is preset containing having.
It in specific implementation process, executes S10 and extracts Target Photo from video to be detected, which is in video
Part picture, for improving the detection efficiency of violation text in video.Wherein, Target Photo may include in video to be detected
Hot spot frame and/or random frame in key frame and video to be detected.
One video is made of more than ten supreme thousand sheets pictures, and the picture in video is otherwise known as frame, in video role or
It is any that frame locating for key operations in person's object of which movement or variation is referred to as a definition in key frame or video
The frame of beginning and end smooth transition is key frame, and a series of key frames define the motion process that viewer will be seen that.Relatively
Normal frames in video, the change information amount that key frame includes is larger, can quickly and effectively obtain video by key frame
In key message, repetition meter in video detection can be effectively reduced by extracting key frame to carry out the detection of video violation text
It calculates, and can be avoided omission key message, effectively improve the efficiency and accuracy rate of video violation text detection.
Specifically, the pass in video to be detected can be extracted according to the similarity between frame picture every in video to be detected
Key frame is as Target Photo.For example, the average similarity between the frame and frame in video to be detected in one section of duration can be calculated;
Then, the frame for being less than the presupposition multiple of average value in this section of duration with the similarity of former frame is found out;If without such frame,
The intermediate frame in this section of duration can be extracted as key frame;If there is such frame, then can extract in this section of duration with
The similarity of former frame is less than the frame of the presupposition multiple of average value as key frame.For the key frame of extraction, it is small to exclude brightness
In the frame that the frame of a certain threshold value is excessively dark.By the above-mentioned means, the present embodiment dynamic carries out key-frame extraction, according to video
In in one section of duration picture variation severe degree, current image variation is more violent, more key frames is extracted, even if duration is simultaneously
It does not grow;On the contrary, even one section of very long video also only extracts less key frame if video pictures are essentially unchanged;Such as
Fruit video pictures be it is completely black, then any key frame is not extracted, to improve the quality of key frame.
The present embodiment considers the uncertainty that violation text occurs, and further extracts hot spot frame supplementary target picture, mentions
The accuracy of high video violation text detection.Hot spot frame refers to that comment amount is greater than the frame of given threshold.Wherein, hot spot frame mentions
It takes, the point of video playback time corresponding to the time point of comment and/or barrage can be obtained, count the comment of each frame picture
And/or barrage item number is the comment amount of the frame.Comment amount is greater than given threshold characterization user to its picture material, word content
Attention rate it is bigger, a possibility that there are violation texts in this kind of frames, is larger, for this purpose, extracting comment amount is greater than given threshold
Frame as hot spot frame, be used to supplementary target picture.Certainly, in the specific implementation process, can also from video to be detected with
Machine extracts frame, that is, random frame of preset quantity, also regard random frame as Target Photo, improves the popularity of the distribution of Target Photo,
Further increase the accuracy of video violation text detection.
After S10 extracts Target Photo, continues to execute S12 and extract text information from Target Photo.
Specifically, can use OCR (Optical Character Recognition, optical character identification) technology will
Text in Target Photo is converted to text structural information, to extract text information, facilitates the subsequent participle to text and identification.
Wherein, S12 can carry out text information extraction to each Target Photo, but may be without corresponding text in certain Target Photos
Word does not do subsequent processing for this kind of Target Photo.
S14 carries out text sentence participle to the text information that S12 is extracted, the sentence after being segmented.Wherein, this implementation
Example is not intended to limit the specific algorithm of text sentence participle, existing any segmentation methods can be used, such as mechanical Chinese word segmentation algorithm, base
Segmentation methods in n-gram, segmentation methods based on hidden Markov model etc..
After S14, further executes S16 and vector conversion is carried out to the sentence after participle, obtain the word segmented in sentence
Vector.Specifically, can by Chinese word vector algorithm such as cw2vec or natural language vector transfer algorithm such as word2vec,
Vector conversion is carried out to the sentence after participle, obtains the term vector segmented in sentence.Preferably, text can be inputted using cw2vec
This sequence exports in text sequence and respectively segments corresponding semantic vector, more accurate for the semantic meaning representation of Chinese word segmentation.
Sentence after the participle that the term vector and S14 obtained based on S16 conversion is obtained, executes S18 for the sentence after participle
Textual classification model is inputted with term vector, semantics recognition is carried out by textual classification model and exports semantics recognition result.Specifically
, it can be by the sentence and term vector input textual classification model after participle, textual classification model can be according to the sentence after participle
Whether each term vector is formed vector matrix, and carries out semantics recognition to vector matrix by minor structure, detect and wrap in vector matrix
Containing the default semantic i.e. corresponding semanteme of violation text, to obtain recognition result and export.Pass through the combination of sentence and term vector
The semantics recognition of progress, can be more accurate identify it is each participle and sentence expression semanteme, so as to more accurate
The detection for carrying out violation text avoids because variation, the use of near synonym etc. of literal expression mode cause violation text detection
Missing inspection.
Textual classification model in the present embodiment can be two disaggregated model of text, in violation detection scene, such as
Fruit part text in violation of rules and regulations, is then considered as in violation of rules and regulations, output can also be further labeled to violation text.Textual classification model is specific
It can be instructed for the textual classification model Text-CNN based on convolutional neural networks training acquisition or based on shot and long term memory network
Practice the textual classification model LSTM+Softmax obtained.Preferably, cw2vec can be used when term vector is converted in the present embodiment,
Text classification is carried out in conjunction with Text-CNN, on the basis of promoting the accuracy of semantic conversion, in conjunction with the powerful office of Text-CNN
Portion's information detection capability reaches the beneficial effect for improving the accuracy of violation text detection in video.
In the above-described embodiments, Target Photo is extracted from video to be detected, which includes the view to be detected
Key frame in frequency;Text information is extracted from Target Photo;Text sentence participle is carried out to text information, after being segmented
Sentence;Term vector in vector conversion acquisition sentence is carried out to the sentence after participle;By after participle sentence and term vector input
Whether textual classification model carries out semantics recognition by textual classification model and exports default comprising having in characterization text information
The semantics recognition of semantic text as a result, and by the semantics recognition to text in video information, to realize violation video text
Detection, avoid because simple literal expression mode variation and can not detect violation video text, improve violation video
The accuracy of text detection.Further, this illustrates the above method that embodiment provides, in the inspection for carrying out text in video information
When survey, by extracting key frame in video, due to covering the video content of significant change in key frame, text is extracted from key frame
Word information is identified, the calculation amount of video text identification can be greatly reduced, and is reached and is improved violation video text detection efficiency
Beneficial effect.
A kind of detection method of text in video information is provided based on the above embodiment, and the present embodiment also correspondence provides one kind
The detection device of text in video information, referring to FIG. 2, the device further include:
Picture extraction unit 20, for extracting Target Photo from video to be detected, wherein the Target Photo includes institute
State the key frame in video to be detected;
Word Input unit 22, for extracting text information from the Target Photo;
Participle unit 24, for carrying out text sentence participle to the text information, the sentence after being segmented;
Vector transduced cell 26 is obtained and is segmented in the sentence for carrying out vector conversion to the sentence after the participle
Term vector;
Recognition unit 28, for passing through institute for the sentence and term vector input textual classification model after the participle
Textual classification model is stated to carry out semantics recognition and export semantics recognition as a result, the semantics recognition result is for characterizing the text
Whether comprising with the text for presetting semanteme in information.
As an alternative embodiment, the picture extraction unit 20 can use any one or more following side
Formula extracts Target Photo:
Mode one, according to the similarity between frame picture every in video to be detected, extract the pass in the video to be detected
Key frame is as the Target Photo;
Mode two extracts the hot spot frame that comment amount is greater than given threshold from the video to be detected, by the hot spot frame
As the Target Photo;
Mode three, the random frame for extracting preset quantity at random from the video to be detected, using the random frame as institute
State Target Photo.
Wherein, relative to the normal frames in video, the change information amount that key frame includes is larger, can be fast by key frame
Speed, the effective key message obtained in video, extracting key frame can effectively reduce to carry out the detection of video violation text
Computing repeatedly in video detection, and can be avoided omission key message, effectively improve video violation text detection efficiency and
Accuracy rate.
As an alternative embodiment, the vector transduced cell 26, can by Chinese word vector algorithm or
Natural language vector transfer algorithm carries out vector conversion to the sentence after the participle, obtain the word that is segmented in the sentence to
Amount.Preferably, be for text information it is Chinese, can be obtained by Chinese word vector algorithm cw2vec semantic more accurate
Term vector.
As an alternative embodiment, the textual classification model 28 can be to be obtained based on convolutional neural networks training
The textual classification model obtained or the textual classification model obtained based on the training of shot and long term memory network.Preferably, in term vector
Cw2vec is used when conversion, carries out text classification in conjunction with Text-CNN, on the basis of promoting the accuracy of semantic conversion, in conjunction with
Text-CNN powerful local message detectability reaches the beneficial effect for improving the accuracy of violation text detection in video.
About the device in above-described embodiment, wherein each unit executes the concrete mode of operation in method
It is described in detail in embodiment, no longer elaborates herein.
Referring to FIG. 3, being that a kind of electronics for realizing data query method shown according to an exemplary embodiment is set
Standby 700 block diagram.For example, electronic equipment 700 can be computer, database console, tablet device, personal digital assistant
Deng.
Referring to Fig. 3, electronic equipment 700 may include following one or more components: processing component 702, memory 704,
Power supply module 706, multimedia component 708, the interface 710 and communication component 712 of input/output (I/O).
The integrated operation of the usual controlling electronic devices 700 of processing component 702 is such as grasped with display, data communication, and record
Make associated operation.Processing element 702 may include one or more processors 720 to execute instruction, above-mentioned to complete
The all or part of the steps of method.In addition, processing component 702 may include one or more modules, it is convenient for 702 He of processing component
Interaction between other assemblies.
Memory 704 is configured as storing various types of data to support the operation in equipment 700.These data are shown
Example includes the instruction of any application or method for operating on electronic equipment 700, contact data, telephone directory number
According to, message, picture, video etc..Memory 704 can by any kind of volatibility or non-volatile memory device or they
Combination realize, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM) is erasable
Programmable read only memory (EPROM), programmable read only memory (PROM), read-only memory (ROM), magnetic memory, quick flashing
Memory, disk or CD.
Power supply module 706 provides electric power for the various assemblies of electronic equipment 700.Power supply module 706 may include power supply pipe
Reason system, one or more power supplys and other with for electronic equipment 700 generate, manage, and distribute the associated component of electric power.
I/O interface 710 provides interface between processing component 702 and peripheral interface module, and above-mentioned peripheral interface module can
To be keyboard, click wheel, button etc..These buttons may include, but are not limited to: home button, volume button, start button and lock
Determine button.
Communication component 712 is configured to facilitate the communication of wired or wireless way between electronic equipment 700 and other equipment.
Electronic equipment 700 can access the wireless network based on communication standard, such as WiFi, 2G or 3G or their combination.Show at one
In example property embodiment, communication component 712 receives broadcast singal or broadcast from external broadcasting management system via broadcast channel
Relevant information.In one exemplary embodiment, the communication component 712 further includes near-field communication (NFC) module, short to promote
Cheng Tongxin.For example, radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra wide band can be based in NFC module
(UWB) technology, bluetooth (BT) technology and other technologies are realized.
In the exemplary embodiment, electronic equipment 700 can be by one or more application specific integrated circuit (ASIC), number
Word signal processor (DSP), digital signal processing appts (DSPD), programmable logic device (PLD), field programmable gate array
(FPGA), controller, microcontroller, microprocessor or other electronic components are realized, for executing the above method.
In the exemplary embodiment, a kind of non-transitorycomputer readable storage medium including instruction, example are additionally provided
It such as include the memory 704 of instruction, above-metioned instruction can be executed by the processor 720 of electronic equipment 700 to complete the above method.Example
Such as, the non-transitorycomputer readable storage medium can be ROM, random access memory (RAM), CD-ROM, tape, soft
Disk and optical data storage devices etc..
A kind of non-transitorycomputer readable storage medium, when the instruction in the storage medium is by the processing of mobile terminal
When device executes, so that electronic equipment is able to carry out a kind of data query method, which comprises
Target Photo is extracted from video to be detected, wherein the Target Photo includes the pass in the video to be detected
Key frame;Text information is extracted from the Target Photo;Text sentence participle is carried out to the text information, after being segmented
Sentence;Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;After the participle
Sentence and the term vector input textual classification model, carry out semantics recognition by the textual classification model and export semantic knowledge
Not as a result, whether the semantics recognition result is used to characterize in the text information comprising with default semantic text.
It should be understood that the present invention is not limited to the precise structure already described above and shown in the accompanying drawings, and
And various modifications and changes may be made without departing from the scope thereof.The scope of the present invention is limited only by the attached claims
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and
Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.
Claims (10)
1. a kind of detection method of text in video information, comprising:
Target Photo is extracted from video to be detected, wherein the Target Photo includes the key frame in the video to be detected;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By the sentence and term vector input textual classification model after the participle, passes through the textual classification model and carry out language
Justice identifies and exports semantics recognition as a result, whether the semantics recognition result is pre- comprising having in the text information for characterizing
If semantic text.
2. the method as described in claim 1 extracts Target Photo from video to be detected, comprising:
The hot spot frame that comment amount is greater than given threshold is extracted from the video to be detected, using the hot spot frame as the target
Picture;And/or
The random frame for extracting preset quantity at random from the video to be detected, using the random frame as the Target Photo.
3. method according to claim 2, the sentence to after the participle carries out vector conversion, obtains in the sentence
The term vector of participle, comprising:
By Chinese word vector algorithm or natural language vector transfer algorithm, vector is carried out to the sentence after the participle and is turned
It changes, obtains the term vector segmented in the sentence.
4. the method as described in claims 1 to 3 is any, the textual classification model is to be obtained based on convolutional neural networks training
Textual classification model.
5. a kind of detection device of text in video information, comprising:
Picture extraction unit, for extracting Target Photo from video to be detected, wherein the Target Photo includes described to be checked
Survey the key frame in video;
Word Input unit, for extracting text information from the Target Photo;
Participle unit, for carrying out text sentence participle to the text information, the sentence after being segmented;
Vector transduced cell, for carrying out vector conversion to the sentence after the participle, obtain the word that is segmented in the sentence to
Amount;
Recognition unit, for passing through the text for the sentence and term vector input textual classification model after the participle
Disaggregated model carries out semantics recognition and exports semantics recognition as a result, the semantics recognition result is for characterizing in the text information
Whether comprising with the text for presetting semanteme.
6. device as claimed in claim 5, the picture extraction unit, are also used to:
The hot spot frame that comment amount is greater than given threshold is extracted from the video to be detected, using the hot spot frame as the target
Picture;And/or
The random frame for extracting preset quantity at random from the video to be detected, using the random frame as the Target Photo.
7. device as claimed in claim 6, the vector transduced cell, are used for:
By Chinese word vector algorithm or natural language vector transfer algorithm, vector is carried out to the sentence after the participle and is turned
It changes, obtains the term vector segmented in the sentence.
8. the device as described in claim 5~7 is any, the textual classification model is to be obtained based on convolutional neural networks training
Textual classification model.
9. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is held by processor
It is performed the steps of when row
Target Photo is extracted from video to be detected, wherein the Target Photo includes the key frame in the video to be detected;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By the sentence and term vector input textual classification model after the participle, passes through the textual classification model and carry out language
Justice identifies and exports semantics recognition as a result, whether the semantics recognition result is pre- comprising having in the text information for characterizing
If semantic text.
10. a kind of electronic equipment, which is characterized in that include memory and one or more than one program, wherein one
A perhaps more than one program is stored in memory and is configured to execute described one by one or more than one processor
A or more than one program includes the instruction for performing the following operation:
Target Photo is extracted from video to be detected, wherein the Target Photo includes the key frame in the video to be detected;
Text information is extracted from the Target Photo;
Text sentence participle is carried out to the text information, the sentence after being segmented;
Vector conversion is carried out to the sentence after the participle, obtains the term vector segmented in the sentence;
By the sentence and term vector input textual classification model after the participle, passes through the textual classification model and carry out language
Justice identifies and exports semantics recognition as a result, whether the semantics recognition result is pre- comprising having in the text information for characterizing
If semantic text.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811473997.7A CN110019817A (en) | 2018-12-04 | 2018-12-04 | A kind of detection method, device and the electronic equipment of text in video information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811473997.7A CN110019817A (en) | 2018-12-04 | 2018-12-04 | A kind of detection method, device and the electronic equipment of text in video information |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110019817A true CN110019817A (en) | 2019-07-16 |
Family
ID=67188592
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811473997.7A Pending CN110019817A (en) | 2018-12-04 | 2018-12-04 | A kind of detection method, device and the electronic equipment of text in video information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110019817A (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110351596A (en) * | 2019-07-17 | 2019-10-18 | 刘进 | A kind of internet stream media big data barrage processing system and processing method |
CN110458162A (en) * | 2019-07-25 | 2019-11-15 | 上海兑观信息科技技术有限公司 | A kind of method of intelligent extraction pictograph information |
CN110781345A (en) * | 2019-10-31 | 2020-02-11 | 北京达佳互联信息技术有限公司 | Video description generation model acquisition method, video description generation method and device |
CN110852231A (en) * | 2019-11-04 | 2020-02-28 | 云目未来科技(北京)有限公司 | Illegal video detection method and device and storage medium |
CN110866543A (en) * | 2019-10-18 | 2020-03-06 | 支付宝(杭州)信息技术有限公司 | Picture detection and picture classification model training method and device |
CN110991175A (en) * | 2019-12-10 | 2020-04-10 | 爱驰汽车有限公司 | Text generation method, system, device and storage medium under multiple modes |
CN111274389A (en) * | 2020-01-14 | 2020-06-12 | 北京松果电子有限公司 | Information processing method and device, computer equipment and storage medium |
CN111401368A (en) * | 2020-03-24 | 2020-07-10 | 武汉大学 | News video title extraction method based on deep learning |
CN112367491A (en) * | 2020-10-30 | 2021-02-12 | 贵州乐诚技术有限公司 | Remote call monitoring system based on AI identification |
CN112381091A (en) * | 2020-11-23 | 2021-02-19 | 北京达佳互联信息技术有限公司 | Video content identification method and device, electronic equipment and storage medium |
CN112464927A (en) * | 2020-11-25 | 2021-03-09 | 苏宁金融科技(南京)有限公司 | Information extraction method, device and system |
CN113038153A (en) * | 2021-02-26 | 2021-06-25 | 深圳道乐科技有限公司 | Financial live broadcast violation detection method, device and equipment and readable storage medium |
CN113360374A (en) * | 2021-07-30 | 2021-09-07 | 中电福富信息科技有限公司 | Test method for automatically detecting adverse information of APP |
CN113591743A (en) * | 2021-08-04 | 2021-11-02 | 中国人民大学 | Calligraphy video identification method, system, storage medium and computing device |
CN113641854A (en) * | 2021-07-28 | 2021-11-12 | 上海影谱科技有限公司 | Method and system for converting characters into video |
CN113642422A (en) * | 2021-07-27 | 2021-11-12 | 东北电力大学 | Continuous Chinese sign language recognition method |
CN113794624A (en) * | 2021-09-07 | 2021-12-14 | 广州华多网络科技有限公司 | Instant message transmission control method and device, computer equipment and storage medium |
CN113850184A (en) * | 2021-09-22 | 2021-12-28 | 支付宝(杭州)信息技术有限公司 | Method, device, equipment and readable medium for detecting video content |
CN113901814A (en) * | 2021-10-11 | 2022-01-07 | 国网电子商务有限公司 | Neural network word segmentation method and device for energy E-commerce field |
WO2022042609A1 (en) * | 2020-08-31 | 2022-03-03 | 北京字节跳动网络技术有限公司 | Hot word extraction method, apparatus, electronic device, and medium |
CN114241367A (en) * | 2021-12-02 | 2022-03-25 | 北京智美互联科技有限公司 | Visual semantic detection method and system |
CN114239570A (en) * | 2021-12-02 | 2022-03-25 | 北京智美互联科技有限公司 | Sensitive data identification method and system based on semantic analysis |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102609407A (en) * | 2012-02-16 | 2012-07-25 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN102622373A (en) * | 2011-01-31 | 2012-08-01 | 中国科学院声学研究所 | Statistic text classification system and statistic text classification method based on term frequency-inverse document frequency (TF*IDF) algorithm |
CN103281177A (en) * | 2013-04-10 | 2013-09-04 | 广东电网公司信息中心 | Method and system for detecting hostile attack on Internet information system |
CN105847993A (en) * | 2016-04-19 | 2016-08-10 | 乐视控股(北京)有限公司 | Method and device for sharing video clip |
CN106445998A (en) * | 2016-05-26 | 2017-02-22 | 达而观信息科技(上海)有限公司 | Text content auditing method and system based on sensitive word |
CN106973305A (en) * | 2017-03-20 | 2017-07-21 | 广东小天才科技有限公司 | Method and device for detecting bad content in video |
CN107862258A (en) * | 2017-10-24 | 2018-03-30 | 广东小天才科技有限公司 | Method, device and equipment for checking text content in video and storage medium |
CN108040262A (en) * | 2018-01-25 | 2018-05-15 | 湖南机友科技有限公司 | Live audio and video are reflected yellow method and device in real time |
CN108124191A (en) * | 2017-12-22 | 2018-06-05 | 北京百度网讯科技有限公司 | A kind of video reviewing method, device and server |
CN108446388A (en) * | 2018-03-22 | 2018-08-24 | 平安科技(深圳)有限公司 | Text data quality detecting method, device, equipment and computer readable storage medium |
CN108536815A (en) * | 2018-04-08 | 2018-09-14 | 北京奇艺世纪科技有限公司 | A kind of file classification method and device |
CN108537139A (en) * | 2018-03-20 | 2018-09-14 | 校宝在线(杭州)科技股份有限公司 | A kind of Online Video wonderful analysis method based on barrage information |
CN108595634A (en) * | 2018-04-25 | 2018-09-28 | 腾讯科技(深圳)有限公司 | Message management method, device and electronic equipment |
CN108628868A (en) * | 2017-03-16 | 2018-10-09 | 北京京东尚科信息技术有限公司 | File classification method and device |
-
2018
- 2018-12-04 CN CN201811473997.7A patent/CN110019817A/en active Pending
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102622373A (en) * | 2011-01-31 | 2012-08-01 | 中国科学院声学研究所 | Statistic text classification system and statistic text classification method based on term frequency-inverse document frequency (TF*IDF) algorithm |
CN102609407A (en) * | 2012-02-16 | 2012-07-25 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN103281177A (en) * | 2013-04-10 | 2013-09-04 | 广东电网公司信息中心 | Method and system for detecting hostile attack on Internet information system |
CN105847993A (en) * | 2016-04-19 | 2016-08-10 | 乐视控股(北京)有限公司 | Method and device for sharing video clip |
CN106445998A (en) * | 2016-05-26 | 2017-02-22 | 达而观信息科技(上海)有限公司 | Text content auditing method and system based on sensitive word |
CN108628868A (en) * | 2017-03-16 | 2018-10-09 | 北京京东尚科信息技术有限公司 | File classification method and device |
CN106973305A (en) * | 2017-03-20 | 2017-07-21 | 广东小天才科技有限公司 | Method and device for detecting bad content in video |
CN107862258A (en) * | 2017-10-24 | 2018-03-30 | 广东小天才科技有限公司 | Method, device and equipment for checking text content in video and storage medium |
CN108124191A (en) * | 2017-12-22 | 2018-06-05 | 北京百度网讯科技有限公司 | A kind of video reviewing method, device and server |
CN108040262A (en) * | 2018-01-25 | 2018-05-15 | 湖南机友科技有限公司 | Live audio and video are reflected yellow method and device in real time |
CN108537139A (en) * | 2018-03-20 | 2018-09-14 | 校宝在线(杭州)科技股份有限公司 | A kind of Online Video wonderful analysis method based on barrage information |
CN108446388A (en) * | 2018-03-22 | 2018-08-24 | 平安科技(深圳)有限公司 | Text data quality detecting method, device, equipment and computer readable storage medium |
CN108536815A (en) * | 2018-04-08 | 2018-09-14 | 北京奇艺世纪科技有限公司 | A kind of file classification method and device |
CN108595634A (en) * | 2018-04-25 | 2018-09-28 | 腾讯科技(深圳)有限公司 | Message management method, device and electronic equipment |
Cited By (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110351596B (en) * | 2019-07-17 | 2021-07-27 | 上海播呗网络科技有限公司 | Internet streaming media big data bullet screen processing system and processing method |
CN110351596A (en) * | 2019-07-17 | 2019-10-18 | 刘进 | A kind of internet stream media big data barrage processing system and processing method |
CN110458162A (en) * | 2019-07-25 | 2019-11-15 | 上海兑观信息科技技术有限公司 | A kind of method of intelligent extraction pictograph information |
CN110866543A (en) * | 2019-10-18 | 2020-03-06 | 支付宝(杭州)信息技术有限公司 | Picture detection and picture classification model training method and device |
CN110866543B (en) * | 2019-10-18 | 2022-07-15 | 支付宝(杭州)信息技术有限公司 | Picture detection and picture classification model training method and device |
CN110781345A (en) * | 2019-10-31 | 2020-02-11 | 北京达佳互联信息技术有限公司 | Video description generation model acquisition method, video description generation method and device |
CN110852231A (en) * | 2019-11-04 | 2020-02-28 | 云目未来科技(北京)有限公司 | Illegal video detection method and device and storage medium |
CN110991175A (en) * | 2019-12-10 | 2020-04-10 | 爱驰汽车有限公司 | Text generation method, system, device and storage medium under multiple modes |
CN110991175B (en) * | 2019-12-10 | 2024-04-09 | 爱驰汽车有限公司 | Method, system, equipment and storage medium for generating text in multi-mode |
CN111274389A (en) * | 2020-01-14 | 2020-06-12 | 北京松果电子有限公司 | Information processing method and device, computer equipment and storage medium |
CN111274389B (en) * | 2020-01-14 | 2023-10-24 | 北京小米松果电子有限公司 | Information processing method, device, computer equipment and storage medium |
CN111401368A (en) * | 2020-03-24 | 2020-07-10 | 武汉大学 | News video title extraction method based on deep learning |
WO2022042609A1 (en) * | 2020-08-31 | 2022-03-03 | 北京字节跳动网络技术有限公司 | Hot word extraction method, apparatus, electronic device, and medium |
CN112367491A (en) * | 2020-10-30 | 2021-02-12 | 贵州乐诚技术有限公司 | Remote call monitoring system based on AI identification |
CN112381091A (en) * | 2020-11-23 | 2021-02-19 | 北京达佳互联信息技术有限公司 | Video content identification method and device, electronic equipment and storage medium |
CN112381091B (en) * | 2020-11-23 | 2024-07-12 | 北京达佳互联信息技术有限公司 | Video content identification method, device, electronic equipment and storage medium |
CN112464927B (en) * | 2020-11-25 | 2023-10-31 | 苏宁金融科技(南京)有限公司 | Information extraction method, device and system |
CN112464927A (en) * | 2020-11-25 | 2021-03-09 | 苏宁金融科技(南京)有限公司 | Information extraction method, device and system |
CN113038153A (en) * | 2021-02-26 | 2021-06-25 | 深圳道乐科技有限公司 | Financial live broadcast violation detection method, device and equipment and readable storage medium |
CN113642422A (en) * | 2021-07-27 | 2021-11-12 | 东北电力大学 | Continuous Chinese sign language recognition method |
CN113642422B (en) * | 2021-07-27 | 2024-05-24 | 东北电力大学 | Continuous Chinese sign language recognition method |
CN113641854A (en) * | 2021-07-28 | 2021-11-12 | 上海影谱科技有限公司 | Method and system for converting characters into video |
CN113641854B (en) * | 2021-07-28 | 2023-09-26 | 上海影谱科技有限公司 | Method and system for converting text into video |
CN113360374A (en) * | 2021-07-30 | 2021-09-07 | 中电福富信息科技有限公司 | Test method for automatically detecting adverse information of APP |
CN113591743A (en) * | 2021-08-04 | 2021-11-02 | 中国人民大学 | Calligraphy video identification method, system, storage medium and computing device |
CN113591743B (en) * | 2021-08-04 | 2023-11-24 | 中国人民大学 | Handwriting video identification method, system, storage medium and computing device |
CN113794624A (en) * | 2021-09-07 | 2021-12-14 | 广州华多网络科技有限公司 | Instant message transmission control method and device, computer equipment and storage medium |
CN113850184A (en) * | 2021-09-22 | 2021-12-28 | 支付宝(杭州)信息技术有限公司 | Method, device, equipment and readable medium for detecting video content |
CN113901814A (en) * | 2021-10-11 | 2022-01-07 | 国网电子商务有限公司 | Neural network word segmentation method and device for energy E-commerce field |
CN114239570A (en) * | 2021-12-02 | 2022-03-25 | 北京智美互联科技有限公司 | Sensitive data identification method and system based on semantic analysis |
CN114241367A (en) * | 2021-12-02 | 2022-03-25 | 北京智美互联科技有限公司 | Visual semantic detection method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110019817A (en) | A kind of detection method, device and the electronic equipment of text in video information | |
CN107995536B (en) | Method, device and equipment for extracting video preview and computer storage medium | |
US11394675B2 (en) | Method and device for commenting on multimedia resource | |
CN110225387A (en) | A kind of information search method, device and electronic equipment | |
EP3872652A2 (en) | Method and apparatus for processing video, electronic device, medium and product | |
CN111259148B (en) | Information processing method, device and storage medium | |
CN111723784B (en) | Risk video identification method and device and electronic equipment | |
CN102663015A (en) | Video semantic labeling method based on characteristics bag models and supervised learning | |
US11481563B2 (en) | Translating texts for videos based on video context | |
Kieran et al. | A framework for an event driven video surveillance system | |
CN107491435A (en) | Method and device based on Computer Automatic Recognition user feeling | |
CN112883734B (en) | Block chain security event public opinion monitoring method and system | |
Logan IV et al. | Multimodal attribute extraction | |
CN106708905A (en) | Video content searching method and device | |
CN108345612A (en) | A kind of question processing method and device, a kind of device for issue handling | |
CN112765974A (en) | Service assisting method, electronic device and readable storage medium | |
CN113282754A (en) | Public opinion detection method, device, equipment and storage medium for news events | |
CN113869063A (en) | Data recommendation method and device, electronic equipment and storage medium | |
CN113392195A (en) | Public opinion monitoring method and device, electronic equipment and storage medium | |
CN112861510A (en) | Summary processing method, apparatus, device and storage medium | |
CN111552819B (en) | Entity extraction method and device and readable storage medium | |
CN116912478A (en) | Object detection model construction, image classification method and electronic equipment | |
CN111708444A (en) | Input method, input device and input device | |
WO2022134700A1 (en) | Method and apparatus for identifying target object | |
CN115169345A (en) | Training method, device and equipment for text emotion analysis model and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190716 |