CN109784038A - Detecting black chain method, apparatus, system and computer readable storage medium - Google Patents

Detecting black chain method, apparatus, system and computer readable storage medium Download PDF

Info

Publication number
CN109784038A
CN109784038A CN201811653703.9A CN201811653703A CN109784038A CN 109784038 A CN109784038 A CN 109784038A CN 201811653703 A CN201811653703 A CN 201811653703A CN 109784038 A CN109784038 A CN 109784038A
Authority
CN
China
Prior art keywords
chain
exterior chain
exterior
black
tested
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811653703.9A
Other languages
Chinese (zh)
Inventor
顾鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qianxin Technology Co Ltd
Original Assignee
Beijing Qianxin Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qianxin Technology Co Ltd filed Critical Beijing Qianxin Technology Co Ltd
Priority to CN201811653703.9A priority Critical patent/CN109784038A/en
Publication of CN109784038A publication Critical patent/CN109784038A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Present disclose provides a kind of detecting black chain methods, including obtain exterior chain to be detected, are based on the exterior chain, determine content information associated with exterior chain, and in the case where content information meets preset condition, determine that exterior chain is black chain.The disclosure additionally provides a kind of detecting black chain device, a kind of detecting black chain system and a kind of computer readable storage medium.

Description

Detecting black chain method, apparatus, system and computer readable storage medium
Technical field
This disclosure relates to a kind of detecting black chain method, apparatus, system and computer readable storage medium.
Background technique
Black chain refers to the backward chaining of some other websites artificially obtained with improper means, so that it is corresponding to improve black chain Website ranking in a search engine.The link of the second website is imported on the first website, which is known as on the first website Exterior chain.
Currently, whether the exterior chain in detection webpage is black chain mainly according to perceptual property external in webpage, such as hide, Positional shift etc..But this detecting black chain method can be in the presence of much reporting by mistake and failing to report, testing result is inaccurate.
Summary of the invention
An aspect of this disclosure provides a kind of detecting black chain method, including obtains exterior chain to be detected, based on described Exterior chain determines content information associated with the exterior chain, and in the case where the content information meets preset condition, really The fixed exterior chain is black chain.
Optionally, determine that content information associated with the exterior chain includes, and is based on the exterior chain, really based on the exterior chain Whether the label of content or the exterior chain in webpage corresponding to the fixed exterior chain includes black word, wherein the black word is The frequency of occurrences is greater than the word of first threshold in black chain.
Optionally, determine that content information associated with the exterior chain includes, and is based on the exterior chain, really based on the exterior chain The Internet protocol address of the fixed exterior chain whether be marked as black station or the exterior chain Internet protocol address whether Within the scope of preset address, wherein the black station is that the server of black chain was implanted into webpage, the preset address range for institute State the corresponding Internet protocol address range in black station.
Optionally, determine that content information associated with the exterior chain includes, and is based on the exterior chain, really based on the exterior chain Website where the fixed exterior chain is implanted the historical data of black chain, and/or is based on the exterior chain, determines corresponding to the exterior chain Webpage in content and the content in the webpage where the exterior chain correlation.
Optionally, obtaining exterior chain to be detected includes the change information for obtaining the content of multiple webpages, is believed in the variation Breath shows in the case where there is at least one exterior chain in the multiple webpage, judges whether there is the exterior chain not being tested, In the case where in the presence of the exterior chain not being tested, using the exterior chain not being tested as exterior chain to be detected.
Optionally, the exterior chain not being tested include it is multiple, will be described in the case where there is the exterior chain not being tested The exterior chain not being tested includes determining the similarity of multiple exterior chains not being tested as exterior chain to be detected, will Similarity is greater than multiple exterior chains not being tested of second threshold as exterior chain to be detected.
Optionally, it in the case where the content information meets preset condition, determines that the exterior chain is that black chain includes, obtains The content information is input in the machine learning model by machine learning model, is exported as a result, and described defeated In the case that result meets preset condition out, determine that the exterior chain is black chain.
Another aspect of the disclosure provides a kind of detecting black chain device, including obtain module, the first determining module and Second determining module.Module is obtained for obtaining exterior chain to be detected.First determining module be used for be based on the exterior chain, determine with The associated content information of exterior chain.Second determining module is used in the case where the content information meets preset condition, Determine that the exterior chain is black chain.
Optionally, module is obtained to include the first acquisition submodule, judging submodule and determine submodule.First obtains son Module is used to obtain the change information of the content of multiple webpages.Judging submodule is the multiple for showing in the change information In the case where occurring at least one exterior chain in webpage, the exterior chain not being tested is judged whether there is.Determine that submodule is used for In the case where in the presence of the exterior chain not being tested, using the exterior chain not being tested as exterior chain to be detected.
Another aspect of the present disclosure provides a kind of computer readable storage medium, is stored with computer executable instructions, Described instruction is when executed for realizing method as described above.
Another aspect of the present disclosure provides a kind of computer program, and the computer program, which includes that computer is executable, to be referred to It enables, described instruction is when executed for realizing method as described above.
Detailed description of the invention
In order to which the disclosure and its advantage is more fully understood, referring now to being described below in conjunction with attached drawing, in which:
Fig. 1 diagrammatically illustrates the application scenarios of detecting black chain method according to an embodiment of the present disclosure;
Fig. 2 diagrammatically illustrates the flow chart of the detecting black chain method according to the embodiment of the present disclosure;
Fig. 3 diagrammatically illustrates the flow chart of the acquisition exterior chain to be detected according to the embodiment of the present disclosure;
Fig. 4 is diagrammatically illustrated according to the embodiment of the present disclosure in the case where content information meets preset condition, is determined Exterior chain is the flow chart of black chain;
Fig. 5 diagrammatically illustrates the flow chart of the detecting black chain method according to another embodiment of the disclosure;
Fig. 6 diagrammatically illustrates the block diagram of the detecting black chain device according to the embodiment of the present disclosure;
Fig. 7 diagrammatically illustrates the block diagram of the acquisition module according to the embodiment of the present disclosure;
Fig. 8 diagrammatically illustrates the block diagram of the second determining module according to the embodiment of the present disclosure;And
Fig. 9 diagrammatically illustrates the block diagram of the detecting black chain system according to the embodiment of the present disclosure.
Specific embodiment
Hereinafter, will be described with reference to the accompanying drawings embodiment of the disclosure.However, it should be understood that these descriptions are only exemplary , and it is not intended to limit the scope of the present disclosure.In the following detailed description, to elaborate many specific thin convenient for explaining Section is to provide the comprehensive understanding to the embodiment of the present disclosure.It may be evident, however, that one or more embodiments are not having these specific thin It can also be carried out in the case where section.In addition, in the following description, descriptions of well-known structures and technologies are omitted, to avoid Unnecessarily obscure the concept of the disclosure.
Term as used herein is not intended to limit the disclosure just for the sake of description specific embodiment.It uses herein The terms "include", "comprise" etc. show the presence of the feature, step, operation and/or component, but it is not excluded that in the presence of Or add other one or more features, step, operation or component.
There are all terms (including technical and scientific term) as used herein those skilled in the art to be generally understood Meaning, unless otherwise defined.It should be noted that term used herein should be interpreted that with consistent with the context of this specification Meaning, without that should be explained with idealization or excessively mechanical mode.
It, in general should be according to this using statement as " at least one in A, B and C etc. " is similar to Field technical staff is generally understood the meaning of the statement to make an explanation (for example, " system at least one in A, B and C " Should include but is not limited to individually with A, individually with B, individually with C, with A and B, with A and C, have B and C, and/or System etc. with A, B, C).Using statement as " at least one in A, B or C etc. " is similar to, generally come Saying be generally understood the meaning of the statement according to those skilled in the art to make an explanation (for example, " having in A, B or C at least One system " should include but is not limited to individually with A, individually with B, individually with C, with A and B, have A and C, have B and C, and/or the system with A, B, C etc.).
Shown in the drawings of some block diagrams and/or flow chart.It should be understood that some sides in block diagram and/or flow chart Frame or combinations thereof can be realized by computer program instructions.These computer program instructions can be supplied to general purpose computer, The processor of special purpose computer or other programmable data processing units, so that these instructions are when executed by this processor can be with Creation is for realizing function/operation device illustrated in these block diagrams and/or flow chart.The technology of the disclosure can be hard The form of part and/or software (including firmware, microcode etc.) is realized.In addition, the technology of the disclosure, which can be taken, is stored with finger The form of computer program product on the computer readable storage medium of order, the computer program product is for instruction execution system System uses or instruction execution system is combined to use.
Embodiment of the disclosure provides a kind of detecting black chain method, including obtains exterior chain to be detected, based on described outer Chain determines content information associated with exterior chain, and in the case where content information meets preset condition, determines that exterior chain is black Chain.
Fig. 1 diagrammatically illustrates the application scenarios of detecting black chain method according to an embodiment of the present disclosure.It should be noted that Being is only the example that can apply the scene of the embodiment of the present disclosure shown in Fig. 1, to help skilled in the art to understand the disclosure Technology contents, but be not meant to that the embodiment of the present disclosure may not be usable for other equipment, system, environment or scene.
As shown in Figure 1, including electronic equipment 100 in the application scenarios, electronic equipment 100 is able to access that webpage 110.
Electronic equipment 100 can be the various electronic equipments with display screen and supported web page browsing, including but unlimited In smart phone, tablet computer, pocket computer on knee and desktop computer etc..
As shown in Figure 1, in webpage 101 include black chain 111, black chain 111 refer to be hidden on webpage 101, user can not Depending on and can be improved the link of itself ranking in a search engine.
Black chain is usually the dangerous websites such as some game websites, Trojan for stealing numbers website, fishing website, advertiser website.Such as Fruit electronic equipment does not carry out security protection work, is easy virus infection.Therefore it needs the detecting black chain on website accurately Out, to prevent from bringing user economic or body and mind injury.
Accurately the detecting black chain in website can be come out according to the detection method of the embodiment of the present disclosure, prevent black chain It fails to report and reports by mistake.In accordance with an embodiment of the present disclosure, detecting black chain method includes obtaining exterior chain to be detected, determination and the exterior chain phase Associated content information determines whether the exterior chain is black chain according to content information.
Fig. 2 diagrammatically illustrates the flow chart of the detecting black chain method according to the embodiment of the present disclosure.
As shown in Fig. 2, this method includes operation S210~S230.
In operation S210, exterior chain to be detected is obtained.
In operation S220, it is based on the exterior chain, determines content information associated with the exterior chain.
In operation S203, in the case where the content information meets preset condition, determine that the exterior chain is black chain.
In accordance with an embodiment of the present disclosure, this method can be judged according to content information associated with exterior chain exterior chain whether be Black chain, so that the process for detecting black chain is more comprehensive, obtained result is also more accurate, reduces the probability failed to report and reported by mistake.
In accordance with an embodiment of the present disclosure, in operation S210, such as crawler can be carried out to certain website, to obtain on the website Exterior chain to be detected.
Fig. 3 diagrammatically illustrates the flow chart of the acquisition exterior chain to be detected according to the embodiment of the present disclosure.
As shown in figure 3, this method includes operation S211~S213.
In operation S211, the change information of the content of multiple webpages is obtained.
In operation S212, in the case where the change information shows at least one exterior chain occur in the multiple webpage, Judge whether there is the exterior chain not being tested.
Operation S213, exist be not tested exterior chain in the case where, using the exterior chain not being tested as Exterior chain to be detected.
In accordance with an embodiment of the present disclosure, this method can monitor the content change of multiple webpages, to find net in time The black chain occurred in page, improves the real-time of detecting black chain.
In accordance with an embodiment of the present disclosure, in operation S211, such as crawler operation can be executed to webpage, passes through comparison front and back Crawler twice is as a result, obtain the content change information of webpage.Certainly, those skilled in the art can also by other methods come Obtain the content change of webpage.
In accordance with an embodiment of the present disclosure, in operation S212, for example, when this crawler result is than previous crawler result When at least one more exterior chain, at least one extra exterior chain is judged.Such as can be, successively check at least one Whether exterior chain is in black chain, wherein black chain is the set of the black chain detected.If at least one exterior chain can be in black chain It is found in library, then there is no the black chain not being tested, the exterior chain not having in black chain if it exists, then presence is not tested Exterior chain.
In accordance with an embodiment of the present disclosure, in operation S213, such as it can be and will not be tested in operation S212 Exterior chain as exterior chain to be detected.
In accordance with an embodiment of the present disclosure, if the exterior chain not being tested have it is multiple, using the exterior chain not being tested as to The exterior chain of detection includes determining the similarity of multiple exterior chains not being tested, by similarity be greater than second threshold it is multiple not The exterior chain being tested is as exterior chain to be detected.This method can connect multiple webpages of monitoring, so that synthesis is more The content change information of a webpage, screens multiple exterior chains, selects the exterior chain for being most likely to be black chain, is guaranteeing accurate inspection On the basis of survey, the efficiency of detection is improved.
In accordance with an embodiment of the present disclosure, it determines the similarity of multiple exterior chains not being tested, such as can be root It gives a mark according to the calculating factor of exterior chain to exterior chain, the similarity of exterior chain is determined according to the degree of closeness of score.Wherein, the factor is calculated Such as can be domain name, the format of exterior chain, the text in exterior chain etc..It in accordance with an embodiment of the present disclosure, can be by difference Calculating Factor minute with different weights come give exterior chain marking.
Referring back to Fig. 2, in accordance with an embodiment of the present disclosure, in operation S220, such as can be corresponding to determining exterior chain Whether the label of content and/or exterior chain in webpage includes black word.Wherein, black word is that the frequency of occurrences is greater than first in black chain The word of threshold value.Webpage corresponding to exterior chain, such as can be the webpage jumped to when clicking exterior chain.According to the reality of the disclosure Example is applied, black word for example can be the word that the frequency of occurrences in black chain that those skilled in the art come out is greater than first threshold.
In accordance with an embodiment of the present disclosure, in operation S220, such as it can be the Internet protocol address for determining the exterior chain Whether (IP address) is marked as the IP address of black station or exterior chain whether within the scope of preset address.Wherein, black station is to net Page was implanted into the server of black chain, and preset address range is IP address range corresponding with black station.For example, exterior chain http: // The IP address of www.example.com/foo/bar is 1.1.1.1, which was once implanted into black chain to other webpages.Root According to embodiment of the disclosure, the IP address of the server of black chain was implanted into usually in certain address range to webpage, ability Whether field technique personnel can rule of thumb set preset address range, to judge the IP address of exterior chain in preset address range It is interior.
In accordance with an embodiment of the present disclosure, it is implanted in operation S220, such as the website that can be where determining the exterior chain The historical data of black chain.Historical data for example can be the frequency the waiting whether website was implanted black chain, is implanted black chain. For example, determining that webpage 110 was once implanted black chain to judge whether another exterior chain in webpage 110 shown in FIG. 1 is black chain The case where.
In accordance with an embodiment of the present disclosure, in operation S220, such as it can be and determine in webpage corresponding to the exterior chain The correlation of content and the content in the webpage where the exterior chain.For example, the webpage 110 in scene shown in Fig. 1 includes Content is all and educates related, and the content of webpage corresponding to another exterior chain on webpage 110 is entertainment city advertisement, then Determine that the correlation of the two is lower.
In accordance with an embodiment of the present disclosure, it in operation S230, such as can be at least one described in the aforesaid operations S220 In the case that a content information meets preset condition, determine that exterior chain is black chain.
Fig. 4 is diagrammatically illustrated according to the embodiment of the present disclosure in the case where content information meets preset condition, is determined Exterior chain is the flow chart of black chain.
As shown in figure 4, this method includes operation S231~S233.
In operation S231, machine learning model is obtained.
In operation S232, the content information is input in the machine learning model, obtains output result.
In operation S233, in the case where the output result meets preset condition, determine that the exterior chain is black chain.
In accordance with an embodiment of the present disclosure, in operation S231, machine learning model for example be can be according to a large amount of black chain instruction Practice the machine learning model of sample training.
In accordance with an embodiment of the present disclosure, it in operation S232, such as can be by " exterior chain includes black word ", " corresponding to exterior chain Webpage in content and the content in the webpage where exterior chain correlation it is very low " etc. content information be input to the machine In device learning model.Machine learning model calculates output result according to the content information of input.Output result for example can be one A numerical value.
In accordance with an embodiment of the present disclosure, it can be in operation S232, such as preset condition more than threshold value 0.4, it is assumed that output As a result it is 0.9, then exports result and meet preset condition, so that it is determined that exterior chain is black chain.
In accordance with an embodiment of the present disclosure, in the correct feelings of result for judging to determine exterior chain according to machine learning model as black chain Under condition, the characteristic of the exterior chain can be continued on for into training machine learning model.
Fig. 5 diagrammatically illustrates the flow chart of the detecting black chain method according to another embodiment of the disclosure.
As shown in figure 5, this method includes operation S501~S507.
In operation S501, the content change information of multiple webpages is monitored.Such as it can execute above with reference to described in Fig. 3 Operate S211.
In operation S502, when occurring the exterior chain newly increased in webpage, the exterior chain newly increased is obtained.
In operation S503, the exterior chain not being tested is judged whether there is.Such as it can execute above with reference to described by Fig. 3 Operation S212.The exterior chain not being tested if it exists then executes operation S504.The exterior chain not being tested if it does not exist, S507 is executed, terminates detection, continues to test content change.
In operation S504, content information associated with one of exterior chain is determined.Such as it can execute above with reference to Fig. 2 Described operation S220.For example, determining whether the IP address of exterior chain is marked as black station, whether the IP address of exterior chain is being preset Content in webpage corresponding to the inside and outside chain of address range whether include black word, exterior chain label whether include black word, comprising should Most of website of exterior chain was once implanted the web page contents of web page contents corresponding to black chain, exterior chain and the webpage where exterior chain It is unrelated.
In operation S505, judge whether the score for operating the content information that S504 is determined reaches threshold value.If reaching threshold value, hold Row operation S506 determines that exterior chain is black chain, and returns again to operation S503, to continue to judge next emerging exterior chain.If Not up to threshold value executes operation S503, to continue to judge next emerging exterior chain.
Fig. 6 diagrammatically illustrates the block diagram of the detecting black chain device 600 according to the embodiment of the present disclosure.
As shown in fig. 6, detecting black chain device 600 is including obtaining module 610, the first determining module 620 and second determines Module 630.
Module 610 is obtained, such as executes the operation S210 described above with reference to Fig. 2, for obtaining exterior chain to be detected.
First determining module 620, such as the operation S220 described above with reference to Fig. 2 is executed, for being based on the exterior chain, really Fixed content information associated with the exterior chain.
Second determining module 630, such as the operation S230 described above with reference to Fig. 2 is executed, in the content information In the case where meeting preset condition, determine that the exterior chain is black chain.
In accordance with an embodiment of the present disclosure, it is based on exterior chain, determines that content information associated with exterior chain includes, it is true based on exterior chain Whether the label for determining the content and/or exterior chain in webpage corresponding to exterior chain includes black word, wherein black word is in black chain The frequency of occurrences is greater than the word of first threshold.
In accordance with an embodiment of the present disclosure, it is based on exterior chain, determines that content information associated with exterior chain includes, it is true based on exterior chain Whether whether the Internet protocol address for determining exterior chain be marked as the Internet protocol address of black station or exterior chain in preset address In range, wherein black station is the server for being implanted into black chain to webpage, and preset address range is internet corresponding with black station is stated Protocol address range.
In accordance with an embodiment of the present disclosure, it is based on the exterior chain, determines that content information associated with the exterior chain includes base Website where exterior chain determines exterior chain is implanted the historical data of black chain, and/or is based on exterior chain, determines net corresponding to exterior chain The correlation of content and the content in the webpage where exterior chain in page.
Fig. 7 diagrammatically illustrates the block diagram of the acquisition module 610 according to the embodiment of the present disclosure.
Include the first acquisition submodule 611, judging submodule 612 and determine submodule as shown in fig. 7, obtaining module 610 Block 613.
First obtains submodule 611, such as executes the operation S211 described above with reference to Fig. 3, for obtaining multiple webpages Content change information.
Judging submodule 612, such as the operation S212 described above with reference to Fig. 3 is executed, in the change information table In the case where there is at least one exterior chain in bright the multiple webpage, the exterior chain not being tested is judged whether there is.
It determines submodule 613, such as executes the operation S213 described above with reference to Fig. 3, for not being tested in presence Exterior chain in the case where, using the exterior chain not being tested as exterior chain to be detected.
In accordance with an embodiment of the present disclosure, determine that submodule 613 includes the first determination unit and the second determination unit.Wherein, First determination unit is used to determine the similarity of multiple exterior chains not being tested.Second determination unit is used for similarity Greater than second threshold multiple exterior chains not being tested as exterior chain to be detected.
Fig. 8 diagrammatically illustrates the block diagram of the second determining module 630 according to the embodiment of the present disclosure.
As shown in figure 8, the second determining module 630 include second obtain submodule 631, third obtain submodule 632 and Determine black chain submodule 633.
Second obtains submodule 631, such as executes the operation S231 described above with reference to Fig. 4, for obtaining machine learning Model.
Third obtains submodule 632, such as executes the operation S232 described above with reference to Fig. 4, for believing the content Breath is input in the machine learning model, obtains output result.
It determines black chain submodule 633, such as executes the operation S233 described above with reference to Fig. 4, for being tied in the output In the case that fruit meets preset condition, determine that the exterior chain is black chain.
It is module according to an embodiment of the present disclosure, submodule, unit, any number of or in which any more in subelement A at least partly function can be realized in a module.It is single according to the module of the embodiment of the present disclosure, submodule, unit, son Any one or more in member can be split into multiple modules to realize.According to the module of the embodiment of the present disclosure, submodule, Any one or more in unit, subelement can at least be implemented partly as hardware circuit, such as field programmable gate Array (FPGA), programmable logic array (PLA), system on chip, the system on substrate, the system in encapsulation, dedicated integrated electricity Road (ASIC), or can be by the hardware or firmware for any other rational method for integrate or encapsulate to circuit come real Show, or with any one in three kinds of software, hardware and firmware implementations or with wherein any several appropriately combined next reality It is existing.Alternatively, can be at least by part according to one or more of the module of the embodiment of the present disclosure, submodule, unit, subelement Ground is embodied as computer program module, when the computer program module is run, can execute corresponding function.
For example, obtaining module 610, any number of in the first determining module 620 and the second determining module 630 can close And it is realized in a module or any one module therein can be split into multiple modules.Alternatively, in these modules At least partly functions of one or more modules can be combined at least partly function of other modules, and in a module Middle realization.In accordance with an embodiment of the present disclosure, it obtains in module 610, the first determining module 620 and the second determining module 630 At least one can at least be implemented partly as hardware circuit, such as field programmable gate array (FPGA), programmable logic Array (PLA), system on chip, the system on substrate, the system in encapsulation, specific integrated circuit (ASIC), or can be by right Circuit carries out the hardware such as any other rational method that is integrated or encapsulating or firmware to realize, or with software, hardware and consolidates Any one in three kinds of implementations of part several appropriately combined is realized with wherein any.Alternatively, obtaining module 610, the At least one of one determining module 620 and the second determining module 630 can at least be implemented partly as computer program Module can execute corresponding function when the computer program module is run.
Fig. 9 diagrammatically illustrates the detecting black chain system for being adapted for carrying out method as described above according to the embodiment of the present disclosure Block diagram.Detecting black chain system shown in Fig. 9 is only an example, function to the embodiment of the present disclosure and should not use model Shroud carrys out any restrictions.
As shown in figure 9, detecting black chain system 900 includes processor 910, computer readable storage medium 920.Detecting black chain System 900 can execute the method according to the embodiment of the present disclosure.
Specifically, processor 910 for example may include general purpose microprocessor, instruction set processor and/or related chip group And/or special microprocessor (for example, specific integrated circuit (ASIC)), etc..Processor 910 can also include using for caching The onboard storage device on way.Processor 910 can be the different movements for executing the method flow according to the embodiment of the present disclosure Single treatment unit either multiple processing units.
Computer readable storage medium 920, such as can be non-volatile computer readable storage medium, specific example Including but not limited to: magnetic memory apparatus, such as tape or hard disk (HDD);Light storage device, such as CD (CD-ROM);Memory, such as Random access memory (RAM) or flash memory;Etc..
Computer readable storage medium 920 may include computer program 921, which may include generation Code/computer executable instructions execute processor 910 according to the embodiment of the present disclosure Method or its any deformation.
Computer program 921 can be configured to have the computer program code for example including computer program module.Example Such as, in the exemplary embodiment, the code in computer program 921 may include one or more program modules, for example including 921A, module 921B ....It should be noted that the division mode and number of module are not fixation, those skilled in the art can To be combined according to the actual situation using suitable program module or program module, when these program modules are combined by processor 910 When execution, processor 910 is executed according to the method for the embodiment of the present disclosure or its any deformation.
In accordance with an embodiment of the present disclosure, processor 910 can be executed according to the method for the embodiment of the present disclosure or its any change Shape.
According to an embodiment of the invention, obtaining in module 610, the first determining module 620 and the second determining module 630 At least one can be implemented as the computer program module with reference to Fig. 9 description may be implemented when being executed by processor 910 Corresponding operating described above.
The disclosure additionally provides a kind of computer readable storage medium, which can be above-mentioned reality It applies included in equipment/device/system described in example;Be also possible to individualism, and without be incorporated the equipment/device/ In system.Above-mentioned computer readable storage medium carries one or more program, when said one or multiple program quilts When execution, the method according to the embodiment of the present disclosure is realized.
In accordance with an embodiment of the present disclosure, computer readable storage medium can be non-volatile computer-readable storage medium Matter, such as can include but is not limited to: portable computer diskette, hard disk, random access storage device (RAM), read-only memory (ROM), erasable programmable read only memory (EPROM or flash memory), portable compact disc read-only memory (CD-ROM), light Memory device, magnetic memory device or above-mentioned any appropriate combination.In the disclosure, computer readable storage medium can With to be any include or the tangible medium of storage program, the program can be commanded execution system, device or device use or Person is in connection.
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the disclosure, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation A part of one module, program segment or code of table, a part of above-mentioned module, program segment or code include one or more Executable instruction for implementing the specified logical function.It should also be noted that in some implementations as replacements, institute in box The function of mark can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are practical On can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it wants It is noted that the combination of each box in block diagram or flow chart and the box in block diagram or flow chart, can use and execute rule The dedicated hardware based systems of fixed functions or operations is realized, or can use the group of specialized hardware and computer instruction It closes to realize.
It will be understood by those skilled in the art that the feature recorded in each embodiment and/or claim of the disclosure can To carry out multiple combinations and/or combination, even if such combination or combination are not expressly recited in the disclosure.Particularly, exist In the case where not departing from disclosure spirit or teaching, the feature recorded in each embodiment and/or claim of the disclosure can To carry out multiple combinations and/or combination.All these combinations and/or combination each fall within the scope of the present disclosure.
Although the disclosure, art technology has shown and described referring to the certain exemplary embodiments of the disclosure Personnel it should be understood that in the case where the spirit and scope of the present disclosure limited without departing substantially from the following claims and their equivalents, A variety of changes in form and details can be carried out to the disclosure.Therefore, the scope of the present disclosure should not necessarily be limited by above-described embodiment, But should be not only determined by appended claims, also it is defined by the equivalent of appended claims.

Claims (11)

1. a kind of detecting black chain method, comprising:
Obtain exterior chain to be detected;
Based on the exterior chain, content information associated with the exterior chain is determined;And
In the case where the content information meets preset condition, determine that the exterior chain is black chain.
2. according to the method described in claim 1, wherein, described to be based on the exterior chain, determination is associated with the exterior chain interior Holding information includes:
Based on the exterior chain, determine whether the label of the content in webpage corresponding to the exterior chain and/or the exterior chain wraps Include black word, wherein the black word is the word that the frequency of occurrences is greater than first threshold in black chain.
3. according to the method described in claim 1, wherein, described to be based on the exterior chain, determination is associated with the exterior chain interior Holding information includes:
Based on the exterior chain, determine whether the Internet protocol address of the exterior chain is marked as black station or the exterior chain Internet protocol address whether within the scope of preset address,
Wherein, the black station is the server that black chain was implanted into webpage, and the preset address range is corresponding with the black station Internet protocol address range.
4. according to the method described in claim 1, wherein, described to be based on the exterior chain, determination is associated with the exterior chain interior Holding information includes:
Based on the exterior chain, the website where determining the exterior chain is implanted the historical data of black chain;And/or
Based on the exterior chain, the content in webpage corresponding to the exterior chain and the content in the webpage where the exterior chain are determined Correlation.
5. according to the method described in claim 1, wherein, the acquisition exterior chain to be detected includes:
Obtain the change information of the content of multiple webpages;
In the case where the change information shows at least one exterior chain occur in the multiple webpage, judge whether there is not by The exterior chain detected;
In the case where there is the exterior chain not being tested, using the exterior chain not being tested as exterior chain to be detected.
6. according to the method described in claim 5, wherein, the exterior chain not being tested include it is multiple, it is described to exist not In the case where the exterior chain being tested, include: using the exterior chain not being tested as exterior chain to be detected
Determine the similarity of multiple exterior chains not being tested;
Similarity is greater than multiple exterior chains not being tested of second threshold as exterior chain to be detected.
7. it is described in the case where the content information meets preset condition according to the method described in claim 1, wherein, really The fixed exterior chain is that black chain includes:
Obtain machine learning model;
The content information is input in the machine learning model, output result is obtained;And
In the case where the output result meets preset condition, determine that the exterior chain is black chain.
8. a kind of detecting black chain device, comprising:
Module is obtained, for obtaining exterior chain to be detected;
First determining module determines content information associated with the exterior chain for being based on the exterior chain;And
Second determining module, in the case where the content information meets preset condition, determining that the exterior chain is black chain.
9. device according to claim 8, wherein the acquisition module includes:
First obtains submodule, the change information of the content for obtaining multiple webpages;
Judging submodule, in the case where the change information shows at least one exterior chain occur in the multiple webpage, Judge whether there is the exterior chain not being tested;
Determine submodule, for exist be not tested exterior chain in the case where, using the exterior chain not being tested as Exterior chain to be detected.
10. a kind of detecting black chain system, comprising:
One or more processors;
Storage device, for storing one or more programs,
Wherein, when one or more of programs are executed by one or more of processors, so that one or more of The method that processor executes any one as described in claim 1~7.
11. a kind of computer readable storage medium, is stored thereon with executable instruction, which makes to handle when being executed by processor The method that device executes any one as described in claim 1~7.
CN201811653703.9A 2018-12-29 2018-12-29 Detecting black chain method, apparatus, system and computer readable storage medium Pending CN109784038A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811653703.9A CN109784038A (en) 2018-12-29 2018-12-29 Detecting black chain method, apparatus, system and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811653703.9A CN109784038A (en) 2018-12-29 2018-12-29 Detecting black chain method, apparatus, system and computer readable storage medium

Publications (1)

Publication Number Publication Date
CN109784038A true CN109784038A (en) 2019-05-21

Family

ID=66499799

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811653703.9A Pending CN109784038A (en) 2018-12-29 2018-12-29 Detecting black chain method, apparatus, system and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN109784038A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111680252A (en) * 2020-06-05 2020-09-18 腾讯科技(深圳)有限公司 External link identification method, device, equipment and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622435A (en) * 2012-02-29 2012-08-01 百度在线网络技术(北京)有限公司 Method and device for detecting black chain
CN103856442A (en) * 2012-11-30 2014-06-11 腾讯科技(深圳)有限公司 Black chain detection method, apparatus and system
CN105488402A (en) * 2014-12-23 2016-04-13 哈尔滨安天科技股份有限公司 Dark link detection method and system
CN107273416A (en) * 2017-05-05 2017-10-20 深信服科技股份有限公司 The dark chain detection method of webpage, device and computer-readable recording medium
CN107370718A (en) * 2016-05-12 2017-11-21 深圳市深信服电子科技有限公司 The detection method and device of black chain in webpage
CN107948168A (en) * 2017-11-29 2018-04-20 四川无声信息技术有限公司 Page detection method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622435A (en) * 2012-02-29 2012-08-01 百度在线网络技术(北京)有限公司 Method and device for detecting black chain
CN103856442A (en) * 2012-11-30 2014-06-11 腾讯科技(深圳)有限公司 Black chain detection method, apparatus and system
CN105488402A (en) * 2014-12-23 2016-04-13 哈尔滨安天科技股份有限公司 Dark link detection method and system
CN107370718A (en) * 2016-05-12 2017-11-21 深圳市深信服电子科技有限公司 The detection method and device of black chain in webpage
CN107273416A (en) * 2017-05-05 2017-10-20 深信服科技股份有限公司 The dark chain detection method of webpage, device and computer-readable recording medium
CN107948168A (en) * 2017-11-29 2018-04-20 四川无声信息技术有限公司 Page detection method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111680252A (en) * 2020-06-05 2020-09-18 腾讯科技(深圳)有限公司 External link identification method, device, equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN107729475B (en) Webpage element acquisition method, device, terminal and computer-readable storage medium
US20150082280A1 (en) Automatic verification by comparing user interface images
JP6036100B2 (en) Step-by-step visual comparison of web browser screens
CN105868096B (en) For showing the method, device and equipment of web page test result in a browser
CN106022123A (en) Multi-file malware analysis
CN106385640B (en) Video annotation method and device
CN103617213B (en) Method and system for identifying newspage attributive characters
US11580444B2 (en) Data visualization machine learning model performance
EP3064133B1 (en) Specifying apparatus and specifying method
CN109905385A (en) A kind of webshell detection method, apparatus and system
CN103077254B (en) Webpage acquisition methods and device
CA3120833C (en) Identifying equivalent links on a page
KR102224731B1 (en) Search engine optimization through selective indexing
CN107622008A (en) The traversal method and device of a kind of five application page
CN108959550B (en) User focus mining method, device, equipment and computer readable medium
JP7236694B2 (en) Information processing method and information processing system
CN107992402A (en) Blog management method and log management apparatus
US10671456B2 (en) Method and device for acquiring application information
CN109784038A (en) Detecting black chain method, apparatus, system and computer readable storage medium
US20160239552A1 (en) Data Display System
CN104699843B (en) The method and device of product comparative information is provided in Search Results
CN108052927A (en) Gesture processing method and processing device based on video data, computing device
CN108369647A (en) Quality control based on image
KR20180015491A (en) Method and apparatus for storing log of access based on kewords
CN110990558B (en) Electronic book content display method, computing equipment and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100088 Building 3 332, 102, 28 Xinjiekouwai Street, Xicheng District, Beijing

Applicant after: Qianxin Technology Group Co.,Ltd.

Address before: 100088 Building 3 332, 102, 28 Xinjiekouwai Street, Xicheng District, Beijing

Applicant before: BEIJING QIANXIN TECHNOLOGY Co.,Ltd.

CB02 Change of applicant information
RJ01 Rejection of invention patent application after publication

Application publication date: 20190521

RJ01 Rejection of invention patent application after publication