CN109145284A - Information processing method and device - Google Patents


Publication number
CN109145284A
Authority
CN
China
Prior art keywords
text
information
urtext
identified
feature set
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710464769.2A
Other languages
Chinese (zh)
Inventor
李大霞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Application filed by Alibaba Group Holding Ltd
Priority to CN201710464769.2A
Publication of CN109145284A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01 Social networking

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Economics (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses an information processing method and device. The method comprises: identifying an original text to obtain a number feature set and a letter feature set, wherein the number feature set includes identification information associated with a contact method, and the letter feature set includes the letter combinations corresponding to the characters that appear in the original text; and combining the number feature set with the letter feature set to obtain a result text. The invention solves the technical problem that prior-art methods for identifying text advertisements have weak recognition capability.

Description

Information processing method and device
Technical field
The present invention relates to the field of data processing, and in particular to an information processing method and device.
Background
A text advertisement is an advertisement delivered in text form. Text advertisements typically appear in the comments on trending news or in the chat groups of instant messaging software, usually as a product name followed by a contact method, for example: "xxx increases product, if needed please add vx2516372819". To intercept such advertisements, the prior art usually relies on regular expressions: if a text contains a number or alphanumeric string of a certain length, or an 11-digit number beginning with 1, and the surrounding text contains a prompt word such as "add", "vx", or "qq", the text is treated as an advertisement; otherwise it is treated as normal information.
However, regular-expression matching can only identify advertisements that contain a number or alphanumeric string of a certain length, or an 11-digit number beginning with 1, together with prompt words such as "add", "vx", or "qq". Its recognition pattern is narrow. Mutated text advertisements use variant forms such as "family's common vetch" (in Chinese, a homophone of "add WeChat") or "acrid flavour" (a homophone of "WeChat"); such variants cannot be enumerated exhaustively, so they are difficult to recognize.
Therefore, the prior-art means of identifying text advertisements relies on a single approach, cannot counter mutated text advertisements, and has a narrow recognition range. As a result, the content environment of the platform cannot be purified effectively, and the information security of users is seriously affected.
No effective solution has yet been proposed for the problem that prior-art methods for identifying text advertisements have weak recognition capability.
Summary of the invention
The embodiments of the present invention provide an information processing method and device, so as to at least solve the technical problem that prior-art methods for identifying text advertisements have weak recognition capability.
According to one aspect of the embodiments of the present invention, an information processing method is provided, comprising: identifying an original text to obtain a number feature set and a letter feature set, wherein the number feature set includes identification information associated with a contact method, and the letter feature set includes the letter combinations corresponding to the characters that appear in the original text; and combining the number feature set with the letter feature set to obtain a result text.
According to another aspect of the embodiments of the present invention, an information processing device is further provided, comprising: an identification module, configured to identify an original text to obtain a number feature set and a letter feature set, wherein the number feature set includes identification information associated with a contact method, and the letter feature set includes the letter combinations corresponding to the characters that appear in the original text; and an obtaining module, configured to combine the number feature set with the letter feature set to obtain a result text.
According to another aspect of the embodiments of the present invention, a storage medium is further provided. The storage medium includes a stored program, wherein, when the program runs, the device on which the storage medium resides is controlled to execute the above information processing method.
According to another aspect of the embodiments of the present invention, a processor is further provided. The processor is configured to run a program, wherein the program executes the above information processing method when it runs.
According to another aspect of the embodiments of the present invention, a system is further provided, comprising: a processor; and a memory, connected to the processor and configured to provide the processor with instructions for the following processing steps: identifying an original text to obtain a number feature set and a letter feature set, wherein the number feature set includes identification information associated with a contact method, and the letter feature set includes the letter combinations corresponding to the characters that appear in the original text; and combining the number feature set with the letter feature set.
In the embodiments of the present invention, the original text is identified to determine the number feature set and the letter feature set, and the two sets are then combined to obtain a result text. Junk information usually evades the patterns defined by regular expressions through mutation and similar means, so it cannot be identified by prior-art junk information recognition methods. The present application does not apply regular expressions directly to the original text; instead, it generates a result text corresponding to the original text. Because the result text is composed of the number feature set and the letter feature set of the original text, even if the junk information has been mutated, its features are still reflected in the result text through the number feature set or the letter feature set and can therefore be identified.
The above solution of the present application thus solves the technical problem that prior-art methods for identifying text advertisements have weak recognition capability, and achieves the technical effect of identifying mutated junk information.
Brief description of the drawings
The drawings described herein are provided for a further understanding of the present invention and constitute a part of this application. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute an improper limitation of it. In the drawings:
Fig. 1 is a flowchart of an information processing method according to Embodiment 1 of the present application;
Fig. 2 is a hardware block diagram of a computer terminal (or mobile device) for implementing an information processing method according to Embodiment 2 of the present application;
Fig. 3 is a flowchart of an information processing method according to Embodiment 2 of the present application;
Fig. 4 is a schematic diagram of an information processing device according to Embodiment 3 of the present application;
Fig. 5 is a schematic diagram of a system according to Embodiment 4 of the present application;
Fig. 6 is a flowchart of an information processing method according to Embodiment 5 of the present application; and
Fig. 7 is a structural block diagram of a computer terminal according to Embodiment 6 of the present application.
Detailed description of the embodiments
To enable those skilled in the art to better understand the solution of the present invention, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are obviously only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on these embodiments without creative effort shall fall within the scope of protection of the present invention.
It should be noted that the terms "first", "second", and so on in the description, the claims, and the above drawings are used to distinguish similar objects and do not describe a particular order or sequence. It should be understood that data so termed are interchangeable where appropriate, so that the embodiments of the present invention described herein can be implemented in orders other than those illustrated or described herein. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, and may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
First, some of the nouns or terms that appear in the description of the embodiments of the present application are explained as follows:
Regular expression: a logical formula for operating on character strings. Predefined specific characters, and combinations of those characters, form a "rule string" that expresses a filtering logic for strings. Regular expressions are commonly used to retrieve or replace text that matches a given pattern (rule).
Mutated text advertisement: a text advertisement in which the characters or numbers have been altered in order to evade ad blocking. For example, after mutation, "xxx increases product, if needed please add vx2516372819" becomes "xxx increases product, if needed please add dimension+heart two May Day six Radix Notoginseng 2819". In the mutated form, "vx" is replaced by two Chinese characters pronounced "wei" and "xin" (machine-rendered here as "dimension" and "heart"), and the leading digits 251637 are written as Chinese numerals.
Embodiment 1
Text advertisements usually appear in popular forums, in the comments on popular microblog posts, or in the chat groups of instant messaging software, and they inconvenience users. To block them, the prior art usually uses regular expressions to intercept text with fixed patterns. To evade interception, however, current text advertisements mutate their information into forms that regular expressions cannot identify but that users can still read, so the user experience is still affected.
To solve the problem of identifying mutated text advertisements, this application provides a corresponding solution, as shown in Fig. 1:
Step S11: input the original text.
Specifically, the original text may be text that users can see directly, published through functions of an application such as commenting or posting updates.
In an optional embodiment, taking a microblog as an example, each comment on a popular microblog post may be used as an original text on which text-advertisement recognition is performed.
Step S12: preprocess the original text.
In this step, the original text may be preprocessed in one or more of the following ways:
(1) Convert the uppercase letters that appear in the original text into the corresponding lowercase letters.
For example, if the original text is "interested parties, please add WEi54&tri-device*#742", the uppercase-to-lowercase preprocessing yields "interested parties, please add wei54&tri-device*#742".
(2) Convert the traditional Chinese characters that appear in the original text into the corresponding simplified Chinese characters.
For example, if the original text "interested parties, please add wei54&tri-device*#742" is written in traditional Chinese characters, the traditional-to-simplified preprocessing yields the same sentence in simplified Chinese characters.
(3) Extract the corresponding numbers from the characters in the original text that stand for numbers.
For example, with the same original text "interested parties, please add wei54&tri-device*#742", the two Chinese characters rendered here as "tri-device" are converted into the letters "san qi", and the pinyin is then mapped to the digits it sounds like, giving "37".
(4) Replace one or more characters of preset types that appear in the original text with a unified character of a specific type.
For example, if the original text is "interested parties, please add wei54&37*#742", this preprocessing yields "interested parties, please add wei54 37 742". In this example, "&", "*", and "#" are characters of the preset type and are replaced with the space character. They may also be replaced with the null character, that is, deleted, which yields "interested parties, please add wei5437742".
(5) Extract the content contained in any picture that appears in the original text, wherein the content includes at least one of: letters, characters, numbers.
For example, text recognition is performed on a picture that appears in the original text to identify the letters, characters, and numbers in the picture.
It should be noted that the preprocessing methods above may be applied simultaneously or one after another in a set order. The result is the text obtained after the selected preprocessing methods have been applied.
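Preprocessing methods (1) and (4) can be illustrated with a minimal Python sketch. The function name `preprocess` and the constant `PRESET_PATTERN` are hypothetical names chosen for illustration, and the set of preset characters ("&", "*", "#") is taken from the example above. The sketch assumes that a run of adjacent preset characters collapses into a single unified character, which matches the example output "wei54 37 742".

```python
import re

# Assumption: the preset special characters are the three from the example.
PRESET_PATTERN = re.compile(r"[&*#]+")


def preprocess(text: str, unified: str = " ") -> str:
    """Lowercase the text, then replace each run of preset characters
    with one unified character (a space by default, or "" to delete)."""
    lowered = text.lower()                       # method (1)
    return PRESET_PATTERN.sub(unified, lowered)  # method (4)


print(preprocess("WEi54&37*#742"))      # unified character is a space
print(preprocess("WEi54&37*#742", ""))  # unified character is the null character
```

Traditional-to-simplified conversion, homophone-to-digit mapping, and picture text recognition are omitted here because they would need external resources that the source does not name.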
Step S13: obtain the number feature set.
Specifically, the number feature set contains identification information associated with contact methods, such as a WeChat ID, a mobile phone number, or a QQ number. For the preprocessed text, the length range of the corresponding character string can be determined from the category identifier of the contact method. Numbers are then identified in the preprocessed text using the category identifier and the determined length range, and the information identifier of each number is added to the number feature set. The information identifier of a number may be qqNum, vxNum, telNum, and so on, denoting a QQ number, a WeChat ID, a mobile phone number, or the number of another contact method.
In an optional embodiment, it is first determined that the string length of a mobile phone number is 11, that of a QQ number is 9 or 10, and that of a WeChat ID is 6 to 10. Taking "interested parties, please add micro-5437742" as an example ("micro" renders the Chinese character pronounced "wei"), the text to be identified is found to contain the 7-digit number "5437742", which falls within the length range of a WeChat ID, so the information identifier "vxNum" is added to the number feature set.
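The length-range check described above can be sketched as follows. This is a simplified illustration, not the patented procedure: the names `LENGTH_RANGES`, `digit_runs`, and `number_features` are hypothetical, the length ranges are the ones given in this embodiment, and the sketch ignores prompt words, so a digit run whose length fits several contact methods receives every matching identifier.

```python
from itertools import groupby

# Length ranges from the embodiment above; identifiers follow the text.
LENGTH_RANGES = {
    "telNum": (11, 11),  # mobile phone number
    "qqNum": (9, 10),    # QQ number
    "vxNum": (6, 10),    # WeChat ID
}


def digit_runs(text: str) -> list[str]:
    """Return every maximal run of ASCII digits in the text."""
    is_digit = lambda ch: ch in "0123456789"
    return ["".join(group) for flag, group in groupby(text, key=is_digit) if flag]


def number_features(text: str) -> set[str]:
    """Collect the identifier of every contact method whose length range
    contains the length of some digit run in the text."""
    features = set()
    for run in digit_runs(text):
        for tag, (low, high) in LENGTH_RANGES.items():
            if low <= len(run) <= high:
                features.add(tag)
    return features


print(number_features("interested parties, please add wei 5437742"))
```

Because the QQ and WeChat ranges overlap, a 9-digit or 10-digit run is tagged with both identifiers in this sketch; a fuller implementation would presumably use the category identifier to disambiguate.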
Step S14: obtain the letter feature set.
Specifically, the letter feature set includes the letter combinations corresponding to the characters that appear in the original text. The letter feature set may be obtained in any one or more of the following ways: converting the Chinese characters that appear in the text to be identified into the corresponding pinyin and adding the converted pinyin to the letter feature set; adding the English text that appears in the text to be identified to the letter feature set; adding the letters that appear in the text to be identified to the letter feature set.
For example, if the text to be identified is "interested parties, please add wei5437742", conversion to pinyin gives "you yi zhe qing jia wei 5437742".
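The construction of the letter feature set can be sketched with a toy lookup table. Everything below is an assumption made for illustration: `TOY_PINYIN` is a hypothetical five-entry table (a real system would need a full character-to-pinyin dictionary), `letter_features` is a hypothetical name, and the Chinese input string is a guess at a sentence that reads "you yi zhe qing jia" followed by "wei5437742".

```python
# Hypothetical, deliberately tiny character-to-pinyin table.
TOY_PINYIN = {"有": "you", "意": "yi", "者": "zhe", "请": "qing", "加": "jia"}


def letter_features(text: str) -> list[str]:
    """Return pinyin for known Chinese characters and lowercase runs of
    Latin letters, in the order they appear in the text."""
    features: list[str] = []
    latin: list[str] = []

    def flush() -> None:
        # Emit the pending run of Latin letters, if any.
        if latin:
            features.append("".join(latin))
            latin.clear()

    for ch in text:
        if ch.isascii() and ch.isalpha():
            latin.append(ch.lower())
        else:
            flush()
            if ch in TOY_PINYIN:
                features.append(TOY_PINYIN[ch])
    flush()
    return features


print(letter_features("有意者请加wei5437742"))
```

Digits are skipped here because they belong to the number feature set rather than the letter feature set.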
Step S15: recognize numbers written as Chinese characters.
To avoid interception, a common variant of text advertisements expresses digits using homophonous Chinese characters, for example writing "interested parties, please add wei54&37*#742" as "interested parties, please add wei54&tri-device*#742". To counter this kind of mutation, after the letter feature set is obtained it is also necessary to determine whether the letter feature set contains letter combinations whose pronunciation matches that of a sequence of digits. If so, the determined letter combinations are converted into the corresponding number set, and then, in the same way as when obtaining the number feature set, the information identifiers of the numbers converted from letters are added to the number feature set.
In an optional embodiment, taking the text to be identified "interested parties, please add wei54&tri-device*#742" as an example, the text is converted into letter combinations, giving "you yi zhe qing jia wei wu si san qi qi si er", in which the two parts "yi" and "wu si san qi qi si er" are pronounced the same as digits. The number corresponding to "yi" is "1", which does not fall within the length range of the string for any contact method, so "1" is not added to the number feature set. The number corresponding to "wu si san qi qi si er" is "5437742", which falls within the length range of a WeChat ID, so the information identifier "vxNum" corresponding to "5437742" is added to the number feature set.
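The conversion of digit-sounding pinyin into numbers can be sketched as follows. The names `PINYIN_DIGITS`, `pinyin_digit_runs`, and `contact_like` are hypothetical, the mapping covers only the standard pinyin readings of the ten digits, and the default bounds of 6 and 11 are an assumption that spans the WeChat, QQ, and mobile-phone length ranges given in the text.

```python
# Standard pinyin readings of the digits 0-9 (tone marks omitted).
PINYIN_DIGITS = {
    "ling": "0", "yi": "1", "er": "2", "san": "3", "si": "4",
    "wu": "5", "liu": "6", "qi": "7", "ba": "8", "jiu": "9",
}


def pinyin_digit_runs(syllables: list[str]) -> list[str]:
    """Convert each maximal run of digit-sounding syllables into a digit string."""
    runs: list[str] = []
    current: list[str] = []
    for syllable in syllables:
        if syllable in PINYIN_DIGITS:
            current.append(PINYIN_DIGITS[syllable])
        elif current:
            runs.append("".join(current))
            current = []
    if current:
        runs.append("".join(current))
    return runs


def contact_like(runs: list[str], low: int = 6, high: int = 11) -> list[str]:
    """Keep only the digit strings whose length could be a contact number."""
    return [run for run in runs if low <= len(run) <= high]


syllables = "you yi zhe qing jia wei wu si san qi qi si er".split()
runs = pinyin_digit_runs(syllables)
print(runs)
print(contact_like(runs))
```

As in the embodiment, the isolated "yi" becomes "1" and is discarded by the length check, while the seven-syllable run becomes "5437742" and is kept.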
Step S16: output the prediction score.
Specifically, the number feature set and the letter feature set may be combined; that is, the obtained number feature set and letter feature set are combined in the order in which the original text was input, giving the result text corresponding to the original text. The result text is input into a preset evaluation model, which outputs the prediction score corresponding to the original text, where the prediction score represents the probability that the original text belongs to each evaluation type.
In an optional embodiment, the evaluation types are A: advertisement text containing a contact method; B: advertisement text not containing a contact method; and C: normal text. The preset evaluation model detects whether the result text contains a number feature and whether the letter feature set in the result text contains preset letters, and derives scores for the three evaluation types from the two detection results. Take the result text "you yi zhe qing jia WEi5437742" as an example. The result text contains a number feature, which is scored as 1 in the calculation below, and the weight of this detection result is w1 = 0.7. The letter feature set contains only one preset letter combination, "jia wei", which scores 0.3, and the weight of this detection result is w2 = 0.3.
The probability that the evaluation type of the result text is A is therefore 1*0.7 + 0.3*0.3 = 0.79. Because evaluation type B denotes advertisement text that does not contain a contact method, and the result text has been determined to contain a contact method, the probability that its evaluation type is B is 0. For evaluation type C, a result text that contains no number feature scores 1 on this item, with weight 0.7, so this result text scores 0 on it. The letter feature set in the result text contains only one preset letter combination, which scores 0.7 with weight 0.3. The probability that the evaluation type of the result text is C is therefore 0*0.7 + 0.7*0.3 = 0.21.
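The arithmetic above is a two-term weighted sum, which can be sketched as follows. The function name `weighted_score` is hypothetical; the weights 0.7 and 0.3 and the item scores are the ones given in this embodiment. How the evaluation model arrives at the item scores is not specified in the text, so they are passed in directly.

```python
def weighted_score(number_score: float, letter_score: float,
                   w1: float = 0.7, w2: float = 0.3) -> float:
    """Combine the number-feature score and the letter-feature score
    using the weights from the embodiment."""
    return number_score * w1 + letter_score * w2


score_a = weighted_score(1.0, 0.3)  # type A: number feature present, one preset letter combination
score_c = weighted_score(0.0, 0.7)  # type C: number feature present scores 0 for "normal text"
print(round(score_a, 2), round(score_c, 2))
```

The scores are rounded only for display, since binary floating-point arithmetic does not represent 0.79 and 0.21 exactly.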
In an optional embodiment, the probability value of the text for evaluation type A is 0.79, for evaluation type B is 0, and for evaluation type C is 0.21, and the threshold is set to 0.5. From the prediction scores of the text and the preset threshold, the evaluation type of the text is type A, that is, advertisement text containing a contact method. After the evaluation type of the original text is determined, if it is type A or type B, the original text is determined to contain junk information and is intercepted.
In another optional embodiment, the threshold is again set to 0.5, but the probability value of the original text for evaluation type A is 0.4, for type B is 0.35, and for type C is 0.25. Because all three probability values are below the threshold of 0.5, it cannot be determined whether the original text contains junk information, so the original text is not intercepted.
In another optional embodiment, the threshold is set to 0.4, and the probability value of the original text for evaluation type A is 0.1, for type B is 0.42, and for type C is 0.48. The probability values for types B and C both exceed the threshold of 0.4. In this case, the evaluation type with the largest probability value, type C, is selected as the type of the original text. It is therefore determined that the original text does not contain junk information, and the original text is not intercepted.
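The three threshold cases above follow one decision rule, sketched below. The names `decide` and `should_intercept` are hypothetical, and the use of a strict "greater than" comparison is an assumption, since the text does not say how a probability exactly equal to the threshold is treated.

```python
from typing import Optional


def decide(probabilities: dict[str, float], threshold: float) -> Optional[str]:
    """Return the most probable evaluation type among those above the
    threshold, or None when no type exceeds it."""
    candidates = {t: p for t, p in probabilities.items() if p > threshold}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


def should_intercept(probabilities: dict[str, float], threshold: float) -> bool:
    """Intercept only when the decided type is A or B (advertisement text)."""
    return decide(probabilities, threshold) in ("A", "B")


cases = [
    ({"A": 0.79, "B": 0.0, "C": 0.21}, 0.5),
    ({"A": 0.4, "B": 0.35, "C": 0.25}, 0.5),
    ({"A": 0.1, "B": 0.42, "C": 0.48}, 0.4),
]
for probabilities, threshold in cases:
    print(decide(probabilities, threshold), should_intercept(probabilities, threshold))
```

Returning `None` for the undecided case keeps "cannot be determined" distinct from "normal text", even though neither leads to interception.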
It should be noted that the above embodiment of the present application identifies the original text to determine the number feature set and the letter feature set, combines the two sets, and assesses from the combined result whether the original text contains information of a predefined type. Junk information usually evades the patterns defined by regular expressions through mutation and similar means, so it cannot be identified by prior-art junk information recognition methods. The present application does not apply regular expressions directly to the original text; instead, it performs recognition on the result text corresponding to the original text. Because the result text is composed of the number feature set and the letter feature set of the original text, even if the junk information has been mutated, its features are still reflected in the result text through the number feature set or the letter feature set and can therefore be identified.
The above solution of the present application thus solves the technical problem that prior-art methods for identifying text advertisements have weak recognition capability, and achieves the technical effect of identifying mutated junk information.
Embodiment 2
According to an embodiment of the present invention, an embodiment of an information processing method is further provided. It should be noted that the steps shown in the flowcharts of the drawings may be executed in a computer system, for example as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be executed in a different order.
The method embodiment provided in this embodiment of the application may be executed on a mobile terminal, a computer terminal, or a similar computing device. Fig. 2 shows a hardware block diagram of a computer terminal (or mobile device) for implementing the information processing method. As shown in Fig. 2, the computer terminal 20 (or mobile device 20) may include one or more processors 202 (shown in the figure as 202a, 202b, ..., 202n; the processor 202 may include, but is not limited to, a processing device such as a microprocessor (MCU) or a programmable logic device (FPGA)), a memory 204 for storing data, and a transmission module 206 for communication functions. In addition, it may include: a display, an input/output interface (I/O interface), a universal serial bus (USB) port (which may be included as one of the ports of the I/O interface), a network interface, a power supply, and/or a camera. Those of ordinary skill in the art will understand that the structure shown in Fig. 2 is only illustrative and does not limit the structure of the above electronic device. For example, the computer terminal 20 may include more or fewer components than shown in Fig. 2, or have a configuration different from that shown in Fig. 2.
It should be noted that the one or more processors 202 and/or other data processing circuits may generally be referred to herein as "data processing circuits". A data processing circuit may be embodied in whole or in part as software, hardware, firmware, or any combination of these. In addition, the data processing circuit may be a single independent processing module, or may be integrated in whole or in part into any of the other elements of the computer terminal 20 (or mobile device). As involved in the embodiments of the present application, the data processing circuit acts as a kind of processor control (for example, selection of a variable-resistance terminal path connected to an interface).
The memory 204 may be used to store the software programs and modules of application software, such as the program instructions/data storage corresponding to the information processing method in the embodiments of the present invention. By running the software programs and modules stored in the memory 204, the processor 202 executes various functional applications and data processing, that is, implements the above information processing method. The memory 204 may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 204 may further include memory located remotely from the processor 202, and such remote memory may be connected to the computer terminal 20 through a network. Examples of such networks include, but are not limited to, the internet, an intranet, a local area network, a mobile communication network, and combinations of these.
The transmission device 206 is used to receive or send data via a network. Specific examples of the network may include a wireless network provided by the communication provider of the computer terminal 20. In one example, the transmission device 206 includes a network adapter (Network Interface Controller, NIC), which can be connected to other network devices through a base station so as to communicate with the internet. In one example, the transmission device 206 may be a radio frequency (RF) module, which is used to communicate with the internet wirelessly.
The display may be, for example, a touch-screen liquid crystal display (LCD), which enables the user to interact with the user interface of the computer terminal 20 (or mobile device).
It should be noted here that, in some optional embodiments, the computer device (or mobile device) shown in Fig. 2 may include hardware elements (including circuits), software elements (including computer code stored on a computer-readable medium), or a combination of both. It should be pointed out that Fig. 2 is only one example of a particular embodiment and is intended to show the types of components that may be present in the computer device (or mobile device).
Under above-mentioned running environment, this application provides information processing methods as shown in Figure 3.Fig. 3 is according to the present invention A kind of flow chart of information processing method of embodiment 2.As shown in connection with fig. 3, this method comprises:
Step S31, identifies urtext, obtains number characteristic set and alphabetic feature set, wherein described number Code characteristic set include with the associated identification information of contact method, the alphabetic feature set include the urtext in occur The corresponding monogram of text.Specifically, above-mentioned urtext can be through the comment function in application program, publication is new The functions such as fresh thing allow users to the text directly seen.It is above-mentioned to can be contact method with the associated identification information of contact method Number, such as: WeChat ID, QQ number, telephone number, phone number etc..
As a kind of optional embodiment, OK in order to obtain urtext characteristic set and alphabetic feature set are needed The urtext is pre-processed, obtain text to be identified.Wherein, carrying out pretreatment to the urtext includes At least one of: corresponding lowercase is converted by the capitalization occurred in the urtext;By the original text The complex form of Chinese characters occurred in this is converted into corresponding simplified Chinese character;It will be in the character of the characterization numerology occurred in the urtext Isolate corresponding number;The one or more preset kind characters occurred in the urtext are replaced with into the specific of unification Type character;The content that the picture includes is extracted from the picture occurred in the urtext, wherein the content includes At least one of: letter, text, number.
In the above embodiment, after the pre-processed text to be identified is obtained, identifying the number characteristic set from the text to be identified may include: determining the length range of the corresponding character string according to the category identifier of the contact method; identifying a number from the text to be identified by means of the category identifier and the determined length range, and adding the identification information of the number to the number characteristic set. Specifically, each category of contact method has a length range for its corresponding character string. For example, if the contact method is a telephone number, the corresponding string length is 11 digits; if the contact method is a QQ number, the corresponding string length ranges from 8 to 10 digits; if the contact method is a WeChat ID, the corresponding string length ranges from 6 to 10 characters. These contact methods are only examples; any other contact method usable for communication, together with its corresponding string length, can be applied to the above embodiment.
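A rough sketch of length-based number identification is given below. The length ranges are the ones stated above; the category keys (`"phone"`, `"qq"`, `"wechat"`), the function name, and the choice to match digit runs only are assumptions made for illustration (a real WeChat ID, for example, may also contain letters).

```python
import re

# Length ranges taken from the description; the dictionary keys are hypothetical
# category identifiers.
LENGTH_RANGES = {"phone": (11, 11), "qq": (8, 10), "wechat": (6, 10)}


def identify_numbers(text: str, category: str) -> list[tuple[str, str]]:
    """Find digit runs whose full length falls in the range for `category`.

    Each hit is returned as (category, number); this pair stands in for the
    identification information added to the number characteristic set.
    """
    lo, hi = LENGTH_RANGES[category]
    # The lookarounds ensure the whole digit run is measured, not a slice of a longer run.
    pattern = re.compile(rf"(?<!\d)\d{{{lo},{hi}}}(?!\d)")
    return [(category, m.group()) for m in pattern.finditer(text)]
```

Under these assumptions an 11-digit run is accepted for the `"phone"` category, while a 7-digit or 12-digit run is rejected.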
In the above embodiment, identifying the alphabetic feature set from the text to be identified includes at least one of the following: converting the text appearing in the text to be identified into the corresponding letters, and adding the letter combinations obtained by the conversion to the alphabetic feature set; adding the English information appearing in the text to be identified to the alphabetic feature set; adding the letter information appearing in the text to be identified to the alphabetic feature set.
Because some text presents a contact number by way of Chinese characters or letters, after the alphabetic feature set is obtained, it is determined that the alphabetic feature set contains letter combinations whose pronunciation is the same as that of multiple digits; the determined letter combinations are converted into the corresponding set of digits; if a number is identified from the set of digits by means of the category identifier and the determined length range, the identification information of the number is added to the number characteristic set.
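The homophone step can be illustrated with a small sketch. It assumes the letter combinations are pinyin syllables, that `PINYIN_TO_DIGIT` (a hypothetical table) lists the syllables that sound like digits, and that consecutive digit-sounding syllables form one candidate number; none of these details are specified in the text.

```python
# Hypothetical table of pinyin syllables that are pronounced like digits.
PINYIN_TO_DIGIT = {"ling": "0", "yi": "1", "yao": "1", "er": "2", "san": "3",
                   "si": "4", "wu": "5", "liu": "6", "qi": "7", "ba": "8", "jiu": "9"}


def letters_to_digits(letter_combinations: list[str]) -> list[str]:
    """Turn each run of consecutive digit-sounding syllables into a digit string."""
    runs, current = [], []
    for syllable in letter_combinations:
        digit = PINYIN_TO_DIGIT.get(syllable)
        if digit is not None:
            current.append(digit)
        elif current:
            runs.append("".join(current))
            current = []
    if current:
        runs.append("".join(current))
    return runs


def filter_by_length(candidates: list[str], lo: int, hi: int) -> list[str]:
    """Keep only candidates whose length lies in the range for the contact category."""
    return [c for c in candidates if lo <= len(c) <= hi]
```

The length filter matters because ordinary words can collide with digit syllables: in "you yi zhe qing jia wei wu si san qi qi si er" the word "yi" yields a stray one-digit candidate, which the 6-to-10 range discards while keeping "5437742".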
Step S33: combine the number characteristic set and the alphabetic feature set to obtain the resulting text. Specifically, the number characteristic set and the alphabetic feature set of the urtext may be combined according to a preset rule to form a new text, namely the resulting text. For example, the obtained number characteristic set and alphabetic feature set may be combined in the order in which their elements were input in the urtext.
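One possible reading of "combined in input order" is sketched below. It assumes each feature is stored together with the offset at which it was found, which the text does not state; the function name and the space-joined output format are likewise illustrative.

```python
def combine(number_features: list[tuple[int, str]],
            letter_features: list[tuple[int, str]]) -> str:
    """Merge two feature sets into one resulting text, ordered by input position.

    Each feature is a (start_offset, text) pair, where the offset refers to the
    position in the text to be identified (an assumption of this sketch).
    """
    merged = sorted(number_features + letter_features, key=lambda feature: feature[0])
    return " ".join(text for _, text in merged)
```

With letter features found at offsets 0 and 4 and a number feature found at offset 8, the resulting text lists the letters first and the number last.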
As an optional embodiment, after the number characteristic set and the alphabetic feature set are combined to obtain the resulting text, whether the resulting text contains preset type information is assessed. Specifically, the preset type information may be advertising information, uncivil information, and the like.
As an optional embodiment, the resulting text may be input as a test case into a preset assessment model to obtain the classification information output by the preset assessment model and the probability value corresponding to the classification information; the classification information and the probability value are then used to assess whether the resulting text contains the preset type information. Specifically, the preset assessment model may be used to score the resulting text and obtain the assessment type corresponding to the resulting text.
In the above embodiment, using the classification information and the probability value to assess whether the resulting text contains the preset type information includes: selecting the classification information whose probability value is greater than a preset threshold, and using it to assess whether the resulting text contains the preset type information.
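The threshold rule can be sketched as follows. The model itself is not specified in the text, so its output is represented here simply as a mapping from category to probability; the function name, the category labels, and the default threshold are assumptions.

```python
def contains_preset_type(scores: dict[str, float],
                         preset_categories: set[str],
                         threshold: float = 0.5) -> bool:
    """Decide whether the resulting text contains preset type information.

    `scores` maps each category reported by a (hypothetical) assessment model
    to its probability value. Only categories whose probability exceeds the
    threshold are considered.
    """
    selected = {category for category, p in scores.items() if p > threshold}
    return bool(selected & preset_categories)
```

For example, with a threshold of 0.8, an "advertisement" score of 0.91 flags the text, whereas a score of 0.4 does not.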
It should be noted that, for simplicity of description, each of the foregoing method embodiments is expressed as a series of combined actions, but those skilled in the art should understand that the present invention is not limited by the described order of actions, because according to the present invention some steps may be performed in other orders or simultaneously. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
Through the above description of the embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, and of course also by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to execute the methods described in the embodiments of the present invention.
Embodiment 3
According to an embodiment of the present invention, an information processing apparatus for implementing the above information processing method is also provided. As shown in Fig. 4, the apparatus 400 includes:
Identification module 402, configured to identify the urtext and obtain a number characteristic set and an alphabetic feature set, where the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext.
Obtaining module 404, configured to combine the number characteristic set and the alphabetic feature set to obtain the resulting text.
It should be noted here that the identification module 402 and the obtaining module 404 correspond to steps S31 to S33 in Embodiment 1. These modules share the examples and application scenarios of the corresponding steps, but are not limited to the content disclosed in Embodiment 1. It should also be noted that, as part of the apparatus, the above modules may run in the computer terminal 20 provided in Embodiment 1.
Optionally, according to the above embodiments of the present application, identification module 402 includes:
Processing unit, configured to pre-process the urtext to obtain the text to be identified.
Recognition unit, configured to identify the number characteristic set and the alphabetic feature set from the text to be identified.
Optionally, according to the above embodiments of this application, the processing unit is configured to: convert the capital letters appearing in the urtext into the corresponding lowercase letters; or convert the traditional Chinese characters appearing in the urtext into the corresponding simplified characters; or separate the corresponding digits from characters appearing in the urtext that represent numerals; or replace one or more preset types of characters appearing in the urtext with a unified character of a specific type; or extract, from a picture appearing in the urtext, the content that the picture contains, where the content includes at least one of the following: letters, text, digits.
Optionally, according to the above embodiments of the present application, recognition unit includes:
Determining subunit, configured to determine the length range of the corresponding character string according to the category identifier of the contact method.
Identifying subunit, configured to identify a number from the text to be identified by means of the category identifier and the determined length range, and to add the identification information of the number to the number characteristic set.
Optionally, according to the above embodiments of this application, the identifying subunit is configured to: convert the text appearing in the text to be identified into the corresponding letters, and add the letter combinations obtained by the conversion to the alphabetic feature set; and/or add the English information appearing in the text to be identified to the alphabetic feature set; and/or add the letter information appearing in the text to be identified to the alphabetic feature set.
Optionally, according to the above embodiments of the present application, above-mentioned apparatus further include:
Determining module, configured to determine that the alphabetic feature set contains letter combinations whose pronunciation is the same as that of multiple digits.
Conversion module, configured to convert the determined letter combinations into the corresponding set of digits.
The identification module is further configured to, if a number is identified from the set of digits by means of the category identifier and the determined length range, add the identification information of the number to the number characteristic set.
Optionally, according to the above embodiments of the present application, evaluation module includes:
Acquiring unit, configured to input the resulting text as a test case into a preset assessment model, and to obtain the classification information output by the preset assessment model and the probability value corresponding to the classification information.
Assessment unit, configured to use the classification information and the probability value to assess whether the resulting text contains preset type information.
Optionally, according to the above embodiments of this application, the assessment unit is configured to select the classification information whose probability value is greater than a preset threshold and use it to assess whether the resulting text contains preset type information.
Embodiment 4
According to embodiments of the present invention, a kind of system is additionally provided, as shown in figure 5, the system includes:
Processor 50;And
Memory 52, connected to the processor and configured to provide the processor with instructions for the following processing steps: identifying the urtext to obtain a number characteristic set and an alphabetic feature set, where the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext; combining the number characteristic set and the alphabetic feature set to obtain the resulting text.
Specifically, the above processor may also execute the other steps in Embodiment 1, which are not described again here.
Embodiment 5
An embodiment of the present invention may provide an information processing method. Fig. 6 is a flowchart of an optional information processing method according to Embodiment 5 of the present invention. With reference to Fig. 6, the method includes the following steps:
Step S61: obtain input information, where the input information includes data of at least one of the following data types: digits, text, letters, pictures, audio, video.
Specifically, the input information may be digits, text, letters, or pictures that users can see directly, published through functions of an application program such as commenting or posting updates; it may also be audio inserted into online music, or video inserted into online video.
Step S63: convert the data in the input information into letter-type data to obtain the set of letters corresponding to the input information.
Specifically, input information of the text type can be converted directly into pinyin, and the pinyin is used as the letter-type data to obtain the set of letters corresponding to the input information. For input information of the picture type, image recognition can be performed to obtain the digits, text, and letters in the picture; the digits and text are then converted into pinyin, and the pinyin obtained from the digits and text, together with the letters obtained by image recognition, is used as the letter-type data. For input information of the audio type, speech recognition can first be performed on the audio information to obtain text information, the text information is then converted into pinyin, and the pinyin is used as the letter-type data. For input information of the video type, each frame image in the video can be converted in the same way as picture-type input information to obtain the letter combinations corresponding to the video information.
Step S65: match the set of letters against preset letter samples to judge whether the input information contains preset type information. Specifically, the preset type information may be advertising information, uncivil information, and the like.
In the above step, whether the input information contains preset type information may be judged by comparing the letters in the set of letters with the preset letter samples. Specifically, the preset letter samples may be obtained by analyzing a large amount of preset type information.
In an optional embodiment, taking advertising information as the preset type information, the preset letter samples contain the sets of letters corresponding to various kinds of advertising information collected from experience. When the input information is "interested parties, please add wei5437742", it is converted into "you yi zhe, qing jia wei wu si san qi qi si er". The set of letters in "you yi zhe, qing jia wei wu si san qi qi si er" is then matched against the preset sample library, and the successfully matched sets of letters are "jiawe" and "wu si san qi qi si er". The input information "interested parties, please add wei5437742" is therefore judged to be advertising information.
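The matching step in this example might look roughly like the sketch below. The text does not say how samples are compared, so plain substring matching on the space-stripped pinyin is assumed here, and the sample list is made up for illustration.

```python
def match_samples(letter_text: str, samples: list[str]) -> list[str]:
    """Return the preset letter samples that occur in the converted input.

    Spaces and commas are ignored on both sides, so "jia wei" matches "jiawei".
    Substring matching is an assumption of this sketch; it can produce false
    positives across syllable boundaries.
    """
    compact = "".join(letter_text.replace(",", " ").split())
    return [sample for sample in samples if sample.replace(" ", "") in compact]


# Hypothetical sample library for advertising information.
AD_SAMPLES = ["jia wei", "wu si san qi qi si er", "mian fei"]
```

Calling `match_samples("you yi zhe, qing jia wei wu si san qi qi si er", AD_SAMPLES)` returns the first two samples, so the input would be judged to be advertising information under this rule.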
As an optional embodiment, the letter-type data includes pinyin, and the letter samples include pinyin samples. Specifically, the pinyin samples may be used to represent the pinyin of characters and words that carry practical meaning, as well as pinyin combinations.
As an optional embodiment, converting the data in the input information into letter-type data to obtain the set of letters corresponding to the input information includes at least one of the following:
Converting the digits appearing in the input information into the corresponding pinyin, and adding the pinyin obtained from the digits to the set of letters. Taking "interested parties, please add wei5437742" as an example, the conversion yields "interested parties, please add wei wu si san qi qi si er".
Converting the text appearing in the input information into the corresponding pinyin, and adding the pinyin obtained from the text to the set of letters. Taking "interested parties, please add wei wu si san qi qi si er" as an example, the conversion yields "you yi zhe, qing jia wei wu si san qi qi si er".
Converting the content extracted from picture information appearing in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the set of letters, where the content includes at least one of the following: letters, text, digits.
Converting the content extracted from audio information appearing in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the set of letters, where the content includes at least one of the following: letters, text, digits. Specifically, the content extracted from audio information is usually speech. In this step, the speech is converted into the corresponding pinyin, and the pinyin obtained from the speech is added to the set of letters, so that it can be matched against the preset letter samples and the audio information can be judged as to whether it is preset type information.
Converting the content extracted from video information appearing in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the set of letters, where the content includes at least one of the following: letters, text, digits. In this step, each frame image in the video information can be processed to extract the content of the frame and convert it into the corresponding pinyin; the pinyin obtained from the video information is added to the set of letters, so that it can be matched against the preset letter samples and the video information can be judged as to whether it is preset type information.
Embodiment 6
An embodiment of the present invention may provide a computer terminal, which may be any computer terminal in a group of computer terminals. Optionally, in this embodiment, the computer terminal may also be replaced with a terminal device such as a mobile terminal.
Optionally, in this embodiment, the computer terminal may be located in at least one of multiple network devices of a computer network.
In this embodiment, the computer terminal may execute the program code of the following steps of the information processing method of an application program: identifying the urtext to obtain a number characteristic set and an alphabetic feature set, where the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext; combining the number characteristic set and the alphabetic feature set to obtain the resulting text.
Optionally, Fig. 7 is a structural block diagram of a computer terminal according to an embodiment of the present invention. As shown in Fig. 7, the computer terminal 70 may include: one or more processors 72 (only one is shown in the figure), a memory 74, and a peripheral interface 76.
The memory may be used to store software programs and modules, such as the program instructions/modules corresponding to the information processing method and apparatus in the embodiments of the present invention. By running the software programs and modules stored in the memory, the processor executes various functional applications and data processing, that is, implements the above information processing method. The memory may include high-speed random access memory and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory may further include memory located remotely from the processor, and such remote memory may be connected to terminal A through a network. Examples of such networks include but are not limited to the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The processor may call, through a transmission device, the information and application programs stored in the memory to execute the following steps: identifying the urtext to obtain a number characteristic set and an alphabetic feature set, where the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext; combining the number characteristic set and the alphabetic feature set to obtain the resulting text.
Optionally, the above processor may also execute the program code of the following steps: pre-processing the urtext to obtain the text to be identified; identifying the number characteristic set and the alphabetic feature set from the text to be identified.
Optionally, the above processor may also execute the program code of the following steps: converting the capital letters appearing in the urtext into the corresponding lowercase letters; converting the traditional Chinese characters appearing in the urtext into the corresponding simplified characters; separating the corresponding digits from characters appearing in the urtext that represent numerals; replacing one or more preset types of characters appearing in the urtext with a unified character of a specific type; extracting, from a picture appearing in the urtext, the content that the picture contains, where the content includes at least one of the following: letters, text, digits.
Optionally, the above processor may also execute the program code of the following steps: determining the length range of the corresponding character string according to the category identifier of the contact method; identifying a number from the text to be identified by means of the category identifier and the determined length range, and adding the identification information of the number to the number characteristic set.
Optionally, the above processor may also execute the program code of the following steps: converting the text appearing in the text to be identified into the corresponding letters, and adding the letter combinations obtained by the conversion to the alphabetic feature set; adding the English information appearing in the text to be identified to the alphabetic feature set; adding the letter information appearing in the text to be identified to the alphabetic feature set.
Optionally, the above processor may also execute the program code of the following steps: determining that the alphabetic feature set contains letter combinations whose pronunciation is the same as that of multiple digits; converting the determined letter combinations into the corresponding set of digits; if a number is identified from the set of digits by means of the category identifier and the determined length range, adding the identification information of the number to the number characteristic set.
Optionally, the above processor may also execute the program code of the following step: assessing whether the resulting text contains preset type information.
Optionally, the above processor may also execute the program code of the following steps: inputting the resulting text as a test case into a preset assessment model to obtain the classification information output by the preset assessment model and the probability value corresponding to the classification information; using the classification information and the probability value to assess whether the resulting text contains preset type information.
Optionally, the above processor may also execute the program code of the following step: selecting the classification information whose probability value is greater than a preset threshold and using it to assess whether the resulting text contains preset type information.
With the embodiments of the present invention, the above embodiments of this application identify the urtext to determine the number characteristic set and the alphabetic feature set, combine the number characteristic set and the alphabetic feature set, and assess, according to the combined result, whether the urtext contains preset type information. Take junk information as an example: junk information usually evades the logic defined by regular expressions by means such as variation, so it cannot be identified by the junk-information identification methods of the prior art. This application does not use regular expressions to identify the urtext directly; instead, it identifies the resulting text corresponding to the urtext. Because the resulting text is composed of the number characteristic set and the alphabetic feature set of the urtext, even if the junk information has been subjected to variation, its features can still be reflected in the resulting text through the number characteristic set or the alphabetic feature set, and can therefore be identified.
The above solution of this application thus solves the technical problem that prior-art methods for identifying text advertisements have weak identification capability, and achieves the technical effect of identifying junk information after variation.
Those of ordinary skill in the art will understand that the structure shown in Fig. 7 is only illustrative. The computer terminal may also be a terminal device such as a smartphone (for example an Android phone or an iOS phone), a tablet computer, a palmtop computer, a mobile Internet device (Mobile Internet Devices, MID), or a PAD. Fig. 7 does not limit the structure of the above electronic device. For example, the computer terminal 70 may include more or fewer components than shown in Fig. 7 (such as a network interface or a display device), or have a configuration different from that shown in Fig. 7.
Those of ordinary skill in the art will understand that all or part of the steps in the methods of the above embodiments may be completed by a program instructing the hardware related to the terminal device. The program may be stored in a computer-readable storage medium, and the storage medium may include: a flash disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), a magnetic disk, an optical disc, or the like.
Embodiment 7
An embodiment of the present invention also provides a storage medium. Optionally, in this embodiment, the storage medium may be used to store the program code executed by the information processing method provided in Embodiment 1 above.
Optionally, in this embodiment, the storage medium may be located in any computer terminal of a group of computer terminals in a computer network, or in any mobile terminal of a group of mobile terminals.
Optionally, in this embodiment, the storage medium is configured to store program code for executing the following steps: identifying the urtext to obtain a number characteristic set and an alphabetic feature set, where the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext; combining the number characteristic set and the alphabetic feature set to obtain the resulting text.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
In the above embodiments of the present invention, the description of each embodiment has its own emphasis. For a part that is not described in detail in one embodiment, reference may be made to the related descriptions of other embodiments.
In the several embodiments provided in this application, it should be understood that the disclosed technical content may be implemented in other ways. The apparatus embodiments described above are only illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, units, or modules, and may be electrical or take other forms.
The units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically alone, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or all or part of the technical solution, can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), a removable hard disk, a magnetic disk, or an optical disc.
The above is only a preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications are also answered It is considered as protection scope of the present invention.

Claims (16)

1. An information processing method, characterized by comprising:
Identifying the urtext to obtain a number characteristic set and an alphabetic feature set, wherein the number characteristic set includes identification information associated with a contact method, and the alphabetic feature set includes the letter combinations corresponding to the text that appears in the urtext;
Combining the number characteristic set and the alphabetic feature set to obtain the resulting text.
2. The method according to claim 1, characterized in that identifying the urtext to obtain the number characteristic set and the alphabetic feature set comprises:
Pre-processing the urtext to obtain the text to be identified;
Identifying the number characteristic set and the alphabetic feature set from the text to be identified.
3. The method according to claim 2, characterized in that pre-processing the urtext comprises at least one of the following:
Converting the capital letters appearing in the urtext into the corresponding lowercase letters;
Converting the traditional Chinese characters appearing in the urtext into the corresponding simplified characters;
Separating the corresponding digits from characters appearing in the urtext that represent numerals;
Replacing one or more preset types of characters appearing in the urtext with a unified character of a specific type;
Extracting, from a picture appearing in the urtext, the content that the picture contains, wherein the content includes at least one of the following: letters, text, digits.
4. according to the method described in claim 2, it is characterized in that, identifying that the number is special from the text to be identified Collection is closed
The length range of corresponding character string is determined according to the classification logotype of contact method;
Number is identified from the text to be identified by the classification logotype and the length range determined, and will be described number The message identification of code is added to the number characteristic set.
5. The method according to claim 4, wherein identifying the alphabetic feature set from the text to be identified comprises at least one of the following:
converting the text occurring in the text to be identified into the corresponding letters, and adding the letter combination obtained from the text to the alphabetic feature set;
adding the English information occurring in the text to be identified to the alphabetic feature set;
adding the letter information occurring in the text to be identified to the alphabetic feature set.
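The options of claim 5 can be illustrated with a small sketch. It assumes the text-to-letter conversion is a character-to-pinyin lookup (the `PINYIN` table is hypothetical and covers only four characters) and treats English words and other letter strings alike as runs of lowercase ASCII letters.

```python
import re

# Hypothetical, tiny character-to-pinyin table for illustration.
PINYIN = {"加": "jia", "微": "wei", "信": "xin", "我": "wo"}


def identify_letters(text):
    """Collect letter combinations from converted characters and from letter runs."""
    features = set()
    # Convert each run of Chinese characters into one letter combination.
    for run in re.findall(r"[\u4e00-\u9fff]+", text):
        converted = "".join(PINYIN.get(ch, "") for ch in run)
        if converted:
            features.add(converted)
    # Add English words and other letter strings as they appear.
    features.update(re.findall(r"[a-z]+", text))
    return features


if __name__ == "__main__":
    print(identify_letters("加微信 abc123"))
```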
6. The method according to claim 5, wherein, after identifying the alphabetic feature set from the text to be identified, the method further comprises:
determining that the alphabetic feature set contains a letter combination whose pronunciation is identical to that of multiple digits;
converting the determined letter combination into a corresponding digit set;
if a number is identified from the digit set by means of the category identifier and the determined length range, adding the identification information of the number to the number feature set.
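Claim 6 can be pictured as decoding a letter combination that spells out digits phonetically, then applying the same length check as for ordinary numbers. The sketch below assumes pinyin syllables for the digits (including the colloquial "yao" for 1); the syllable table, the function names, and the (category, number) representation are assumptions made for the example.

```python
# Assumed pinyin readings of the digits; "yao" is a common spoken variant of 1.
SYLLABLE_TO_DIGIT = {"ling": "0", "yi": "1", "yao": "1", "er": "2", "san": "3",
                     "si": "4", "wu": "5", "liu": "6", "qi": "7", "ba": "8", "jiu": "9"}
SYLLABLES = sorted(SYLLABLE_TO_DIGIT, key=len, reverse=True)


def letters_to_digits(combination):
    """Return the digit string if the whole combination reads as several digits, else None."""
    digits = []
    position = 0
    while position < len(combination):
        for syllable in SYLLABLES:
            if combination.startswith(syllable, position):
                digits.append(SYLLABLE_TO_DIGIT[syllable])
                position += len(syllable)
                break
        else:
            return None  # some part of the combination is not a digit syllable
    return "".join(digits) if len(digits) > 1 else None


def add_homophone_numbers(letter_features, number_features, category, length_range):
    """Add decoded numbers that fit the length range to the number feature set."""
    shortest, longest = length_range
    for combination in letter_features:
        digits = letters_to_digits(combination)
        if digits is not None and shortest <= len(digits) <= longest:
            number_features.add((category, digits))
    return number_features


if __name__ == "__main__":
    print(add_homophone_numbers({"yiersansiwu", "abc"}, set(), "qq", (5, 11)))
```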
7. The method according to claim 1, wherein, after combining the number feature set and the alphabetic feature set to obtain the result text, the method further comprises:
assessing whether the result text contains information of a preset type.
8. The method according to claim 7, wherein assessing whether the result text contains the information of the preset type comprises:
inputting the result text as a test case into a preset assessment model, and obtaining category information output by the preset assessment model and a probability value corresponding to the category information;
assessing, using the category information and the probability value, whether the result text contains the information of the preset type.
9. The method according to claim 8, wherein assessing, using the category information and the probability value, whether the result text contains the information of the preset type comprises:
selecting the category information corresponding to a probability value greater than a preset threshold to assess whether the result text contains the information of the preset type.
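The assessment of claims 8 and 9 amounts to running a classifier and keeping only the categories whose probability exceeds a threshold. In the sketch below the model is a stand-in keyword rule, not a trained model; the category names, the threshold of 0.8, and the function names are all hypothetical.

```python
def toy_model(result_text):
    """Stand-in for the preset assessment model: returns (category, probability) pairs."""
    score = 0.9 if "qq:" in result_text or "weixin" in result_text else 0.1
    return [("advertisement", score), ("normal", 1.0 - score)]


def contains_preset_type(result_text, model, preset_category="advertisement", threshold=0.8):
    """Keep categories whose probability exceeds the threshold and check for the preset one."""
    predictions = model(result_text)
    confident = {category for category, probability in predictions if probability > threshold}
    return preset_category in confident


if __name__ == "__main__":
    print(contains_preset_type("qq:12345 jiaweixin", toy_model))
```

Any callable that returns (category, probability) pairs can replace `toy_model`, so the thresholding logic stays independent of the model behind it.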
10. An information processing device, comprising:
an identification module, configured to identify an original text to obtain a number feature set and an alphabetic feature set, wherein the number feature set contains identification information associated with a contact method, and the alphabetic feature set contains the letter combinations corresponding to the text occurring in the original text;
an obtaining module, configured to combine the number feature set and the alphabetic feature set to obtain a result text.
11. A system, comprising:
a processor; and
a memory, connected to the processor and configured to provide the processor with instructions for performing the following processing steps:
identifying an original text to obtain a number feature set and an alphabetic feature set, wherein the number feature set contains identification information associated with a contact method, and the alphabetic feature set contains the letter combinations corresponding to the text occurring in the original text;
combining the number feature set and the alphabetic feature set to obtain a result text.
12. An information processing method, comprising:
obtaining input information, wherein the input information comprises data of at least one of the following data types: number, text, letter, picture, audio, video;
converting the data in the input information into letter-type data to obtain a letter set corresponding to the input information;
matching the letter set against preset letter samples to determine whether the input information contains information of a preset type.
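The matching step of claim 12 is not specified further. One simple reading, sketched below, treats a match as a preset sample occurring as a substring of any entry in the letter set; the function name and the sample values are hypothetical.

```python
def matches_samples(letter_set, letter_samples):
    """Return True if any preset sample occurs inside any entry of the letter set."""
    return any(sample in letters for letters in letter_set for sample in letter_samples)


if __name__ == "__main__":
    print(matches_samples({"jiaweixin", "abc"}, ["weixin", "koukou"]))
```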
13. The method according to claim 12, wherein the letter-type data comprises pinyin, and the letter samples comprise pinyin samples.
14. The method according to claim 13, wherein converting the data in the input information into letter-type data to obtain the letter set corresponding to the input information comprises at least one of the following:
converting the numbers occurring in the input information into the corresponding pinyin, and adding the pinyin obtained by the conversion to the letter set;
converting the text occurring in the input information into the corresponding pinyin, and adding the pinyin obtained from the text to the letter set;
converting the content extracted from picture information occurring in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the letter set, wherein the content comprises at least one of: letters, text, numbers;
converting the content extracted from audio information occurring in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the letter set, wherein the content comprises at least one of: letters, text, numbers;
converting the content extracted from video information occurring in the input information into the corresponding pinyin, and adding the pinyin obtained from the content to the letter set, wherein the content comprises at least one of: letters, text, numbers.
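For the number and text cases of claim 14, the conversion can be sketched as a per-character pinyin lookup. The tables below are hypothetical and tiny, letters already present are kept as they are (an assumption), and content from pictures, audio or video is assumed to have been extracted to a string beforehand.

```python
# Hypothetical lookup tables for illustration; real tables would be far larger.
DIGIT_PINYIN = {"0": "ling", "1": "yi", "2": "er", "3": "san", "4": "si",
                "5": "wu", "6": "liu", "7": "qi", "8": "ba", "9": "jiu"}
CHAR_PINYIN = {"加": "jia", "微": "wei", "信": "xin"}


def to_letter_set(input_text):
    """Convert digits and known characters to pinyin; split entries at other characters."""
    letters = set()
    current = []
    for ch in input_text.lower():
        if ch in DIGIT_PINYIN:
            current.append(DIGIT_PINYIN[ch])
        elif ch in CHAR_PINYIN:
            current.append(CHAR_PINYIN[ch])
        elif ch.isascii() and ch.isalpha():
            current.append(ch)  # plain letters are kept unchanged
        elif current:
            letters.add("".join(current))  # any other character ends the current entry
            current = []
    if current:
        letters.add("".join(current))
    return letters


if __name__ == "__main__":
    print(to_letter_set("加微信 vx123"))
```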
15. A storage medium, wherein the storage medium comprises a stored program, and wherein, when the program runs, a device on which the storage medium resides is controlled to execute the information processing method according to any one of claims 1 to 9 or the information processing method according to any one of claims 12 to 14.
16. A processor, wherein the processor is configured to run a program, and wherein, when the program runs, the information processing method according to any one of claims 1 to 9 or the information processing method according to any one of claims 12 to 14 is executed.
CN201710464769.2A 2017-06-19 2017-06-19 Information processing method and device Pending CN109145284A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710464769.2A CN109145284A (en) 2017-06-19 2017-06-19 Information processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710464769.2A CN109145284A (en) 2017-06-19 2017-06-19 Information processing method and device

Publications (1)

Publication Number Publication Date
CN109145284A true CN109145284A (en) 2019-01-04

Family

ID=64804566

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710464769.2A Pending CN109145284A (en) 2017-06-19 2017-06-19 Information processing method and device

Country Status (1)

Country Link
CN (1) CN109145284A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0303312D0 (en) * 2003-02-13 2003-03-19 Brooks Robert E Advertising response system
CN102231873A (en) * 2011-06-22 2011-11-02 中兴通讯股份有限公司 Method and system for monitoring garbage message and monitor processing apparatus
CN102591854A (en) * 2012-01-10 2012-07-18 凤凰在线(北京)信息技术有限公司 Advertisement filtering system and advertisement filtering method specific to text characteristics
CN102761872A (en) * 2012-08-01 2012-10-31 成都四方信息技术有限公司 Spam message intercepting method
CN103415004A (en) * 2013-07-26 2013-11-27 中国联合网络通信集团有限公司 Method and device for detecting junk short message
CN104346337A (en) * 2013-07-24 2015-02-11 腾讯科技(深圳)有限公司 Method and device for intercepting junk information

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110457597A (en) * 2019-08-08 2019-11-15 中科鼎富(北京)科技发展有限公司 A kind of advertisement recognition method and device
CN112560855A (en) * 2020-12-18 2021-03-26 平安银行股份有限公司 Image information extraction method and device, electronic equipment and storage medium
CN112560855B (en) * 2020-12-18 2022-10-14 平安银行股份有限公司 Image information extraction method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
US10210865B2 (en) Method and apparatus for inputting information
US11074279B2 (en) Method for providing chatting service with chatbot assisted by human counselor
CN108460026B (en) Translation method and device
CN108710647B (en) Data processing method and device for chat robot
KR100695392B1 (en) A method for converting SMS message to multimedia message and sending the multimedia message and text-image converting server
CN109218390A (en) User's screening technique and device
US11010687B2 (en) Detecting abusive language using character N-gram features
CN107592255B (en) Information display method and equipment
CN104184653B (en) A kind of method and apparatus of message screening
CN106792250A (en) Barrage information interacting method and device
CN109635080A (en) Acknowledgment strategy generation method and device
CN105929980A (en) Method and device for inputting information
CN107832941A (en) Order processing method and device
WO2016203805A1 (en) Information processing device, information processing system, information processing method, and program
CN110880324A (en) Voice data processing method and device, storage medium and electronic equipment
CN112291423A (en) Intelligent response processing method and device for communication call, electronic equipment and storage medium
CN114969352B (en) Text processing method, system, storage medium and electronic equipment
CN112188232A (en) Video generation method, video display method and device
CN112447073A (en) Explanation video generation method, explanation video display method and device
CN112765364A (en) Group chat session ordering method and device, storage medium and electronic equipment
CN109145284A (en) Information processing method and device
CN110970030A (en) Voice recognition conversion method and system
KR20190134100A (en) Method and apparatus for providing chatting service
CN108090044A (en) The recognition methods of contact method and device
KR101986153B1 (en) System and method for communication service using webtoon identification technology

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190104