WO2015003556A1 - Device and method for acquiring second-level domain name information associated with main domain name - Google Patents

Info

Publication number
WO2015003556A1
Authority
WO
WIPO (PCT)
Prior art keywords
domain name
primary
level domain
level
information associated
Prior art date
Application number
PCT/CN2014/080720
Other languages
French (fr)
Chinese (zh)
Inventor
苏洵
董方
赵武
胡振勇
Original Assignee
北京奇虎科技有限公司
奇智软件(北京)有限公司
Priority date
Filing date
Publication date
Application filed by 北京奇虎科技有限公司, 奇智软件(北京)有限公司
Publication of WO2015003556A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques

Definitions

  • the present invention relates to the field of the Internet, and in particular, to an apparatus and method for acquiring second-level domain name information associated with a primary domain name.
  • in view of the above problems, the present invention is proposed to provide an apparatus and method for acquiring second-level domain name information associated with a primary domain name that overcome the above problems or at least partially solve or mitigate them.
  • a device for obtaining second-level domain name information associated with a primary domain name, including: a webpage crawling module configured to crawl the webpage content whose web address is the primary domain name; and a first domain name obtaining module configured to analyze the crawled webpage content and obtain the second-level domain name information associated with the primary domain name in the webpage content.
  • the device further includes: a search result obtaining module configured to obtain search results that are included in a search engine and associated with the primary domain name; and a second domain name obtaining module configured to analyze the search results obtained by the search result obtaining module and obtain the second-level domain name information associated with the primary domain name.
  • the foregoing search result obtaining module includes: a generating unit configured to combine the primary domain name with a preset identifier to generate query information; and an acquiring unit configured to input the query information into at least one search engine to obtain the search results that are included in the at least one search engine and associated with the primary domain name.
  • the generating unit is further configured to add a preset identifier before the primary domain name to generate query information.
  • the preset identifier added by the generating unit includes: site: ; correspondingly, the query information generated by the generating unit is: site:a.com, where a.com is the primary domain name.
  • the second domain name obtaining module is configured to acquire the second-level domain name information associated with the primary domain name as follows: merge the search results associated with the primary domain name obtained from different search engines; and obtain, from the merged search results, the second-level domain name information associated with the primary domain name.
  • the device further includes: a merging module configured to merge the second-level domain name information associated with the primary domain name obtained by the first domain name obtaining module and the second domain name obtaining module respectively; and an output module configured to output the merged second-level domain name information as a second-level domain name record.
  • the merging module performs the merging operation in the following manner: the second-level domain name information associated with the primary domain name obtained by the first domain name obtaining module and the second domain name obtaining module respectively is de-duplicated and then merged.
  • the device further includes: a list establishing module configured to use a preset dictionary to establish a predetermined second-level domain name list associated with the primary domain name; and a third domain name obtaining module configured to acquire, based on the predetermined second-level domain name list, the second-level domain name information associated with the primary domain name.
  • the third domain name obtaining module is further configured to access each second-level domain name in the predetermined second-level domain name list, keep the second-level domain names that are accessed successfully, and thereby obtain the second-level domain name information associated with the primary domain name.
  • the merging module is further configured to merge the acquired second-level domain name information with the second-level domain name information associated with the primary domain name obtained by the third domain name obtaining module.
  • the device further includes: a dictionary supplementing module configured to, for any second-level domain name associated with the primary domain name obtained by the merging module, add the second-level domain name to the preset dictionary if it does not already exist in the preset dictionary.
  • the device further includes: a domain name accessing module configured to access, through different line nodes, at least one of the one or more second-level domain names in the obtained second-level domain name information, and to acquire the response results returned through the different line nodes and the delays with which the different line nodes return the response results;
  • an information acquisition module configured to obtain, from the response results, the IP (Internet Protocol) address obtained by accessing each second-level domain name through the different line nodes, and to confirm the line type of each second-level domain name according to the delays with which the different line nodes return the response results;
  • and an IP address output module configured to output the IP address and line type of each second-level domain name.
  • the information obtaining module includes: an IP address obtaining unit configured to obtain, from the response results, the IP address obtained by accessing each second-level domain name through the different line nodes; and a line type confirming unit configured to compare, when each second-level domain name is accessed, whether the delays with which the different line nodes return the response results are the same; when the delays differ, to confirm that the line type of the line node with the shortest delay is the line type of the second-level domain name; and when the delays are the same, to confirm that the line type of the second-level domain name is the default type.
  • a method for obtaining second-level domain name information associated with a primary domain name, including: crawling the webpage content whose web address is the primary domain name; and analyzing the crawled webpage content to obtain the second-level domain name information associated with the primary domain name in the webpage content.
  • the method further includes: obtaining a search result that is included in the search engine and associated with the primary domain name; analyzing the obtained search result to obtain the second-level domain name information associated with the primary domain name.
  • the method further includes: establishing, by using a preset dictionary, a predetermined second-level domain name list associated with the primary domain name; and acquiring second-level domain name information associated with the primary domain name based on the predetermined secondary domain name list.
  • obtaining the second-level domain name information associated with the primary domain name based on the predetermined second-level domain name list includes: accessing each second-level domain name in the predetermined second-level domain name list, and keeping the second-level domain names that are accessed successfully as the second-level domain name information associated with the primary domain name.
  • the method further includes: merging all the second-level domain name information that is currently acquired and associated with the primary domain name.
  • the method further includes: if a second-level domain name associated with the primary domain name obtained by the merge processing does not exist in the preset dictionary, adding it to the preset dictionary.
  • the method further includes: accessing, through different line nodes, at least one of the one or more second-level domain names in the obtained second-level domain name information, and obtaining the response results returned through the different line nodes and the delays with which the different line nodes return the response results; obtaining, from the response results, the IP address obtained by accessing each second-level domain name through the different line nodes, and confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results; and outputting the IP address and line type of each second-level domain name.
  • confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results includes: comparing, when each second-level domain name is accessed, whether the delays with which the different line nodes return the response results are the same; when the delays differ, confirming that the line type of the line node with the shortest delay is the line type of the second-level domain name; and when the delays are the same, confirming that the line type of the second-level domain name is the default type.
  • a computer program comprising computer readable code which, when run on a server, causes the server to perform the method for obtaining second-level domain name information associated with a primary domain name according to any of claims 15-28.
  • a computer readable medium in which the computer program according to claim 29 is stored.
  • the present invention provides an apparatus and method for acquiring second-level domain name information associated with a primary domain name.
  • the second-level domain name information associated with the primary domain name of the website can be obtained efficiently and comprehensively. It can also obtain the IP address and line type of each second-level domain name in the second-level domain name information, and can display the obtained information, so that the webmaster user can more fully understand the domain name access status of the website.
  • FIG. 1 is a block diagram showing the structure of a device for acquiring a second-level domain name associated with a primary domain name according to an embodiment of the present invention
  • FIG. 2 shows a schematic diagram of a website selection list according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram showing initialization of acquiring secondary domain name information associated with a website primary domain name according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of specific parameters associated with a website primary domain name according to an embodiment of the present invention
  • FIG. 5 is a flow chart of a method for acquiring secondary domain name information associated with a primary domain name according to an embodiment of the present invention
  • FIG. 6 is a flow chart of a method for obtaining search results associated with a primary domain name in a search engine, in accordance with one embodiment of the present invention
  • FIG. 7 is a block diagram schematically showing a server for performing a method of acquiring second-level domain name information associated with a primary domain name according to the present invention
  • FIG. 8 is a view schematically showing a storage unit for holding or carrying program code for realizing a method of acquiring second-level domain name information associated with a primary domain name according to the present invention.
  • FIG. 1 is a structural diagram of a device for acquiring second-level domain name information associated with a primary domain name according to an embodiment of the present invention.
  • the device 100 can be used to obtain second-level domain name information associated with the website primary domain name and other information corresponding to the second-level domain name information.
  • the structure of the device, the connection relationships between its parts, and the role of each part are described in detail below. Referring to FIG. 1, the webpage crawling module 110 and the first domain name obtaining module 120 of the device 100 cooperate with each other to obtain second-level domain name information associated with the website primary domain name.
  • the webpage crawling module 110 first obtains, according to a user instruction, the primary domain name of the website whose second-level domain name information is to be queried, and crawls the webpage content of the website according to that primary domain name.
  • the first domain name obtaining module 120 then analyzes the webpage content crawled by the webpage crawling module 110 and obtains the second-level domain name information associated with the primary domain name of the website.
  • the primary domain name of the website mentioned above is usually the website address.
  • the primary domain name of Baidu website is its website ***.com.
  • the website primary domain name obtained by the webpage crawling module 110 may be entered by the user in an input box provided by the device, or may be selected, according to the user instruction, from a pre-saved list of website primary domain names.
  • FIG. 2 shows a schematic diagram of a website selection list according to an embodiment of the present invention.
  • the user can trigger the operation of the webpage crawling module 110 by clicking the "initialize" button shown for the primary domain name "kafei.com" in the list.
  • the page then displayed is as shown in FIG. 3, an initialization diagram for acquiring the second-level domain name information associated with the website primary domain name.
  • because the webpage content crawled by the webpage crawling module 110 contains web links, the first domain name obtaining module 120 can obtain the second-level domain name information from that webpage content.
  • for example, the second-level domain name information obtained by the first domain name obtaining module 120 from a web link is "news.***.com".
  • the device 100 obtains the second-level domain name information by crawling the webpage content of the website main domain name.
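The analysis step performed on the crawled webpage content can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the HTML is assumed to have been fetched already, the names `LinkCollector` and `extract_second_level_domains` are hypothetical, and `abc.com` stands in for the primary domain name.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_second_level_domains(html, primary_domain):
    """Return the hosts found in the page's links that sit under primary_domain."""
    collector = LinkCollector()
    collector.feed(html)
    suffix = "." + primary_domain.lower()
    found = set()
    for link in collector.links:
        host = urlparse(link).hostname  # None for relative links such as "/about"
        # Assumption: any host ending in ".<primary domain>" counts as a match,
        # which also admits deeper levels such as a.b.abc.com.
        if host and host.endswith(suffix):
            found.add(host)
    return found


page = (
    '<a href="http://news.abc.com/a.html">News</a>'
    '<a href="/about">About</a>'
    '<a href="http://other.org/">Other</a>'
)
print(extract_second_level_domains(page, "abc.com"))
```

Only links present in the crawled page can be found this way, which is the limitation the description goes on to address with the other module groups.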
  • however, because the webpage crawling module 110 usually crawls only the webpage content of the website's primary domain name, if links associated with some second-level domain names do not exist in that webpage content, the second-level domain name information obtained by the first domain name obtaining module 120 is not comprehensive.
  • therefore, in addition to the webpage crawling module 110 and the first domain name obtaining module 120, the device 100 uses other modules to obtain the second-level domain name information. It should be noted that when the device 100 obtains the second-level domain name information through different modules, the order in which the second-level domain name information is obtained is not specifically limited.
  • the following describes the connection relationship and functions between other modules in the device 100 for obtaining the second-level domain name information.
  • the device 100 can also obtain the second-level domain name information associated with the website primary domain name by using the search result obtaining module 130 and the second domain name obtaining module 140 in cooperation.
  • the search result obtaining module 130 first obtains, according to the user instruction, the primary domain name of the website whose second-level domain name information is to be queried, and then obtains the search results that are included in the search engine and associated with that primary domain name.
  • the manner in which the search result obtaining module 130 obtains the website primary domain name is the same as that of the webpage crawling module 110, and is not described again here.
  • the second domain name obtaining module 140 obtains the second-level domain name information in the search result obtained by the search result obtaining module 130.
  • the search result obtaining module 130 includes a generating unit 131 and an obtaining unit 132.
  • the connection relationship and function of the generating unit 131 and the obtaining unit 132 will be specifically described below.
  • after receiving the user indication, the generating unit 131 obtains the website primary domain name and combines it with the preset identifier to generate query information.
  • one side of the obtaining unit 132 is coupled to the generating unit 131 to obtain the query information from the generating unit 131, and the other side is coupled to the search engine side to input the query information into the search engine and obtain the search results included in the search engine for the website primary domain name.
  • the obtaining unit 132 can be coupled with a plurality of search engines at the same time, input the query information into each of them, and obtain the search results for the website primary domain name included in the plurality of search engines.
  • the generating unit 131 generates the query information as follows.
  • the preset identifier may be added before the primary domain name to form the query information.
  • the preset identifier can be site: , in which case the query information is site:a.com, where a.com is an example of the website's primary domain name.
  • the preset identifier can also be other characters, which will not be exemplified here.
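As a small illustration of how the query information is formed, the sketch below prefixes the primary domain name with the preset identifier. The function name `build_query` is hypothetical; only the `site:` identifier and the `a.com` example come from the description.

```python
def build_query(primary_domain, identifier="site:"):
    """Prefix the primary domain name with the preset identifier."""
    return f"{identifier}{primary_domain}"


print(build_query("a.com"))  # site:a.com
```

Keeping the identifier as a parameter reflects the statement that other preset identifiers may be used.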
  • the following takes site: as the preset identifier and a search engine S as a specific example.
  • the primary domain name of the website whose second-level domain name information is to be obtained is ABC.com.
  • the query information generated by the generating unit 131 is site:ABC.com.
  • the obtaining unit 132 provides the search engine S with the search keyword site:ABC.com and obtains several search results, each of which includes a URL link containing a domain name of some level of the website; the content of the search results is shown in Table 1.
  • from these search results, the obtaining unit 132 can obtain the URL links that include the domain names of the website. Thereby, the obtaining unit 132 can acquire the second-level domain name information associated with the website primary domain name.
  • the obtaining unit 132 can obtain the search result through multiple search engines.
  • the search results obtained from different search engines overlap to a large extent.
  • therefore, the second domain name obtaining module 140 needs to de-duplicate and merge the search results obtained from different search engines, and then obtain the second-level domain name information associated with the primary domain name from the merged search results.
  • the de-duplication and merge processing avoids duplicate search results and reduces the processing load when acquiring the second-level domain name information.
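The merging of results from several search engines can be sketched as below. This is an illustrative sketch, assuming the result URLs have already been retrieved from each engine; the function name `merge_search_results`, the engine labels, and the sample URLs are all hypothetical.

```python
from urllib.parse import urlparse


def merge_search_results(results_by_engine, primary_domain):
    """Drop duplicate result URLs across engines, then collect the distinct
    hosts under primary_domain in first-seen order."""
    suffix = "." + primary_domain.lower()
    seen_urls = set()
    domains = []
    for urls in results_by_engine.values():
        for url in urls:
            if url in seen_urls:
                continue  # the same result returned by another engine
            seen_urls.add(url)
            host = urlparse(url).hostname
            if host and host.endswith(suffix) and host not in domains:
                domains.append(host)
    return domains


results = {
    "engine_s": ["http://news.abc.com/1", "http://www.abc.com/"],
    "engine_t": ["http://news.abc.com/1", "http://mail.abc.com/login"],
}
print(merge_search_results(results, "abc.com"))
```

Removing duplicate URLs before extracting hosts means each result is parsed only once, which matches the stated aim of reducing processing load.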
  • the second-level domain name information associated with the website primary domain name can be obtained fairly comprehensively through the above two methods for acquiring the second-level domain name information.
  • however, some second-level domain name information may still be missed; for example, some second-level domain names are used exclusively for internal management and do not appear on the website homepage or in search engines.
  • the embodiment of the present invention therefore also uses other modules to obtain the second-level domain name information. It should be noted that when the device 100 obtains the second-level domain name information through different modules, the order of obtaining the second-level domain name information is not specifically limited.
  • the device 100 can also obtain the second-level domain name information associated with the website primary domain name by using the list establishing module 170 and the third domain name obtaining module 180 to cooperate with each other.
  • the premise of obtaining the second-level domain name information by the list establishing module 170 and the third domain name obtaining module 180 is that the device 100 pre-stores a dictionary, which may include content commonly used in the second-level domain names of websites, such as "mail", "bbs", and "blog".
  • the process of using the dictionary to obtain second-level domain name information is described below.
  • the list establishing module 170 first combines each entry in the dictionary with the website primary domain name to obtain a list of URLs (Uniform Resource Locators) associated with the website primary domain name, and the third domain name obtaining module 180 then acquires the second-level domain name information associated with the website primary domain name based on the URL list.
  • the content included in the URL list established by the list building module 170 is as shown in Table 2.
  • the above mentioned third domain name obtaining module 180 acquires second-level domain name information associated with the primary domain name based on the URL list.
  • the following describes the process of obtaining second-level domain name information based on the URL list.
  • the third domain name obtaining module 180 accesses each URL in the URL list in turn. When a URL is accessed successfully, the second-level domain name indicated by the URL exists and is kept. When access to a URL fails, the second-level domain name indicated by the URL does not exist and is not saved. Finally, all successfully accessed URLs in the URL list are combined as the acquisition result.
  • the access described above includes resolving the URL address; if the resolution succeeds, the URL is considered accessible.
  • a certain port of the URL address (for example, port 80 or port 443) may also be accessed, and if the port is accessible, the second-level domain name is considered to exist.
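The dictionary-based approach can be sketched as follows. This is a minimal sketch, not the patent's implementation: `build_candidates`, `resolves`, and `probe` are hypothetical names, candidates are built as host names rather than full URLs, and DNS resolution via `socket.getaddrinfo` is used as the access check (a port check on 80 or 443 would be an alternative). The usage example passes a stub check so that it runs without network access.

```python
import socket


def build_candidates(dictionary, primary_domain):
    """Combine each dictionary entry with the primary domain name."""
    return [f"{prefix}.{primary_domain}" for prefix in dictionary]


def resolves(hostname):
    """Return True if DNS resolution of hostname succeeds."""
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except (socket.gaierror, UnicodeError):
        return False


def probe(candidates, is_reachable=resolves):
    """Keep only the candidates for which the access check succeeds."""
    return [host for host in candidates if is_reachable(host)]


candidates = build_candidates(["mail", "bbs", "blog"], "abc.com")
# Stub check standing in for real DNS resolution (assumption for the demo).
existing = {"mail.abc.com", "blog.abc.com"}
print(probe(candidates, is_reachable=lambda host: host in existing))
```

Passing the access check as a parameter keeps the enumeration logic separate from the choice between name resolution and port probing.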
  • above, the embodiment of the present invention has introduced the connection relationships and functions of the modules that obtain the second-level domain name information in three different ways.
  • the use of the above three sets of modules to obtain secondary domain name information is only a preferred embodiment.
  • the user can also select the modules for obtaining the second-level domain name information according to the specific application environment: the user can obtain the second-level domain name information by using only one or any two of the sets of modules mentioned above, or by using modules other than those mentioned above.
  • such other modules for obtaining the second-level domain name information are not described in this embodiment.
  • when the device 100 uses the above multiple modules to obtain the second-level domain name information, the second-level domain names obtained are duplicated to some extent. Therefore, the merging module 150 of the device 100 is further configured to de-duplicate and merge all the second-level domain name information acquired by the first domain name obtaining module 120, the second domain name obtaining module 140, and the third domain name obtaining module 180. The de-duplication and merge processing reduces the processing load of subsequent modules that operate on the second-level domain name information.
  • the dictionary mentioned above is an important factor in obtaining the second-level domain name information, and its content affects how comprehensively the second-level domain name information is obtained. Therefore, after the merging module 150 obtains the merged second-level domain name information, the dictionary supplementing module 190 can add to the dictionary any content in the second-level domain name information that does not yet exist in the dictionary. This improves the content of the dictionary, so that the second-level domain name information subsequently obtained through the dictionary is more comprehensive.
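The dictionary supplementing step can be illustrated with the sketch below. It assumes the dictionary stores only the prefix of each second-level domain name (such as "mail"), as the dictionary examples in the description suggest; the function name `supplement_dictionary` and the sample data are hypothetical.

```python
def supplement_dictionary(dictionary, merged_domains, primary_domain):
    """Append to the dictionary every prefix found in merged_domains
    that the dictionary does not contain yet."""
    suffix = "." + primary_domain
    for domain in merged_domains:
        if not domain.endswith(suffix):
            continue  # not under the primary domain name
        prefix = domain[: -len(suffix)]
        if prefix and prefix not in dictionary:
            dictionary.append(prefix)
    return dictionary


dictionary = ["mail", "bbs"]
merged = ["mail.abc.com", "news.abc.com", "shop.abc.com"]
print(supplement_dictionary(dictionary, merged, "abc.com"))
```

Prefixes learned from one website then become candidates when the dictionary is used against other primary domain names.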
  • the device 100 may display the merged second-level domain name information to the user, as shown in FIG. 1 .
  • the device 100 further includes a record output module 160, coupled to the merge module 150, for displaying the second-level domain name information processed by the merge module 150 to the user for viewing by the user.
  • the second-level domain name information displayed by the record output module 160 to the user is as shown in Table 3 below.
  • each displayed entry includes "ABC.com", which makes the display cumbersome and hard for the user to read.
  • therefore, when the record output module 160 outputs the second-level domain name information, it only needs to display each entry with the website primary domain name (ABC.com) removed; the information actually displayed is shown in Table 4.
  • the device 100 further includes a domain name access module 200 and an information acquiring module 210.
  • the domain name accessing module 200 is coupled to the merging module 150, accesses each second-level domain name in the second-level domain name information obtained by the merging module 150, and submits the response result obtained when accessing each second-level domain name to the information acquisition module 210 for further processing. The process by which the domain name accessing module 200 accesses the second-level domain names is described below.
  • the domain name accessing module 200 accesses each second-level domain name in the second-level domain name information through different line nodes, obtains the response result returned for each second-level domain name through each line node together with the delay of that response, and then passes these access results to the information acquisition module 210 for further processing.
  • the different line nodes mentioned above may be, for example, Netcom nodes and Telecom nodes.
  • the processing operation process of the information acquisition module 210 is also described in conjunction with FIG. 1.
  • the processing operation process of the information obtaining module 210 can be divided into two aspects.
  • the IP address obtaining unit 211 in FIG. 1 obtains the IP address of each second-level domain name at different line nodes in the response result.
  • the line type confirming unit 212 in Fig. 1 confirms the line type of each second-level domain name based on the delay of the response result returned by the different line node. The operation of the line type confirming unit 212 will be specifically described below.
  • the line type confirming unit 212 can confirm the line type of each second-level domain name by comparing whether the delays with which the different line nodes return the response results for that second-level domain name are the same. When the delays differ, the line type of the line node with the shortest delay is the line type of the second-level domain name. When the delays are the same, the line type of the second-level domain name is confirmed as the default type. In a specific embodiment of the invention, it is assumed that the line nodes comprise a Netcom node and a Telecom node.
  • when the line type confirming unit 212 acquires the line type of a certain second-level domain name, it first accesses the second-level domain name through the Telecom and Netcom line nodes respectively, where the delay with which the Telecom node returns the response result is delay 1 and the delay with which the Netcom node returns the response result is delay 2. The line type confirming unit 212 then compares delay 1 with delay 2: if delay 1 is less than delay 2, the line type of the second-level domain name is Telecom; by the same rule, if delay 2 is less than delay 1, the line type is Netcom; and if delay 1 is equal to delay 2, the line type of the second-level domain name is the default type.
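The delay comparison rule can be written out as a short sketch. The function name `confirm_line_type`, the label strings, and the delay values are hypothetical; the delays are assumed to have been measured already by accessing the second-level domain name through each line node.

```python
def confirm_line_type(delays, default="default"):
    """delays maps a line type (e.g. "telecom") to the response delay
    measured through that line node."""
    if len(set(delays.values())) == 1:
        return default  # all nodes returned with the same delay
    # Otherwise pick the line type of the node with the shortest delay.
    # With more than two nodes, a tie for shortest resolves to the first one
    # listed; the description does not cover that case.
    return min(delays, key=delays.get)


print(confirm_line_type({"telecom": 20, "netcom": 35}))
print(confirm_line_type({"telecom": 30, "netcom": 30}))
```

In practice measured delays are rarely exactly equal, so an implementation might treat delays within some tolerance as the same; that tolerance is not specified in the description.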
  • after the IP address and the line type of each second-level domain name in the second-level domain name information are obtained, the IP address output module 220 outputs the IP address and line type of each second-level domain name obtained by the information obtaining module 210 to the user.
  • the device 100 displays to the user the second-level domain name information associated with the website primary domain name, together with the IP address and line type of each second-level domain name in that information, so that the user can conveniently gain a comprehensive understanding of the website and access better services for the website.
  • the embodiment of the present invention further provides FIG. 4, which shows a schematic diagram of the specific parameters associated with the website primary domain name.
  • the device 100 for acquiring the second-level domain name information associated with the primary domain name can obtain the second-level domain name information associated with the website primary domain name effectively and comprehensively. It can also obtain the IP address and line type of each second-level domain name in the second-level domain name information and display the obtained information, so that the webmaster user can more fully understand the domain name access status of the website and the website can effectively access website protection services and CDN services.
  • The device 100 for obtaining the second-level domain name information associated with a primary domain name described above is only a preferred embodiment in terms of architecture and does not limit the actual structure of the device 100. Depending on the circumstances, the device 100 may also contain other devices (modules) or interfaces.
  • FIG. 5 is a flow chart showing a method for obtaining secondary domain name information associated with a primary domain name according to an embodiment of the present invention.
  • The method includes steps S501 to S513 and starts at step S501.
  • In step S501, the primary domain name of the website whose second-level domain name information is to be queried is obtained according to a user indication.
  • The second-level domain name information associated with the website primary domain name is then obtained according to that primary domain name.
  • The user indication in step S501 may be sent when the user enters the website primary domain name in an input box, or may be sent on the basis of a pre-stored website domain name list.
  • The following describes a method for obtaining the second-level domain name information associated with the website primary domain name according to the website primary domain name.
  • The method includes steps S502 and S503.
  • In step S502, webpage content related to the website primary domain name is crawled according to the website primary domain name acquired in step S501. The crawled webpage content contains a number of links whose URLs include the website primary domain name.
  • Step S503 is then performed.
  • In step S503, the webpage content crawled in step S502 is analyzed, and the second-level domain name information associated with the website primary domain name is obtained from it.
  • In this way, step S503 obtains the second-level domain name information associated with the website primary domain name from the webpage content.
  • Steps S502 and S503 constitute the first method introduced in this embodiment for obtaining the second-level domain name information according to the website primary domain name, namely analyzing the webpage content crawled according to the website primary domain name.
  • This method usually crawls only the webpage content of the website primary domain name. If links for some of the associated second-level domain names do not appear in that content, the second-level domain name information obtained may be incomplete.
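  • The first method (steps S502 and S503) can be sketched as follows. This is a minimal illustration in Python using only the standard library, not the implementation of the embodiment. It assumes the page has already been fetched into a string, and the names LinkCollector and extract_subdomains are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_subdomains(html, primary_domain):
    """Return the sorted host names in `html` that end with '.<primary_domain>'."""
    collector = LinkCollector()
    collector.feed(html)
    suffix = "." + primary_domain.lower()
    found = set()
    for link in collector.links:
        # Relative links have no host name and are skipped.
        host = (urlparse(link).hostname or "").lower()
        if host.endswith(suffix):
            found.add(host)
    return sorted(found)
```

  • Matching on the suffix ".a.com" rather than "a.com" keeps unrelated hosts such as nota.com out of the result, and links that never appear in the page are simply not found, which is the limitation noted above.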
  • The embodiment of the present invention therefore further provides a second method for obtaining the second-level domain name information according to the website primary domain name. It should be noted that, in this embodiment, the two methods have no required order of execution: the second method may be performed before, after, or at the same time as the first method. The second method is described below with reference to FIG. 5.
  • In step S504, search results associated with the website primary domain name are obtained from a search engine according to the website primary domain name acquired in step S501.
  • the process of specifically obtaining the search result in step S504 will be described in detail in the method shown in FIG.
  • In step S601, the website primary domain name acquired in step S501 is combined with a preset identifier to generate query information.
  • Specifically, the preset identifier may be added before the website primary domain name to form the query information.
  • The preset identifier may be: site:
  • For example, if the website primary domain name is a.com, the generated query information is: site:a.com.
  • The preset identifier may also be other characters, which are not enumerated here.
  • In step S602, the query information is provided to the search engine, and the search results for the website primary domain name indexed by the search engine are obtained.
  • The query information may be provided to multiple search engines at the same time, and the search results for the website primary domain name indexed by each of them obtained, to ensure that the search results for the website primary domain name are obtained comprehensively.
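  • Steps S601 and S602 can be illustrated with the short Python sketch below. It is an assumption-laden example rather than the embodiment's code: the function names, the endpoint https://search.example/find and the q parameter name are placeholders, since the text does not name any particular search engine or request format.

```python
from urllib.parse import urlencode


def build_query(primary_domain, preset_identifier="site:"):
    """Step S601: prepend the preset identifier to the primary domain name."""
    return preset_identifier + primary_domain


def build_search_urls(query, engine_endpoints):
    """Step S602: form one request URL per search engine for the same query.

    `engine_endpoints` maps an engine name to its search endpoint. The
    endpoints and the `q` parameter name are placeholders, not real APIs.
    """
    return {
        name: endpoint + "?" + urlencode({"q": query})
        for name, endpoint in engine_endpoints.items()
    }


query = build_query("a.com")
urls = build_search_urls(query, {"engine-one": "https://search.example/find"})
```

  • Passing several endpoints in engine_endpoints yields one request URL per engine for the same query information, which corresponds to providing the query to multiple search engines at the same time.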
  • In step S505, the second-level domain name information associated with the primary domain name is obtained from the search results associated with the primary domain name that were obtained in step S504.
  • When step S505 is executed, the search results obtained from the different search engines first need to be deduplicated and merged, and the second-level domain name information associated with the primary domain name is then obtained from the merged search results. The deduplicating merge avoids duplicate search results and reduces the processing load when the second-level domain name information is acquired in step S505.
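  • The merge in step S505 might look like the following Python sketch. It assumes the search results have already been reduced to lists of result URLs keyed by engine name; that input shape and the function names are assumptions made for illustration.

```python
from urllib.parse import urlparse


def merge_results(results_by_engine):
    """Merge result URLs from several engines, dropping duplicates.

    The first occurrence of each URL is kept, in the order encountered.
    """
    seen = set()
    merged = []
    for urls in results_by_engine.values():
        for url in urls:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


def subdomains_from_results(urls, primary_domain):
    """Return the sorted host names under the primary domain found in `urls`."""
    suffix = "." + primary_domain.lower()
    hosts = {(urlparse(url).hostname or "").lower() for url in urls}
    return sorted(host for host in hosts if host.endswith(suffix))
```

  • Because duplicates are removed before the host names are extracted, each result URL is parsed only once, which is the reduction in processing load that the merge is meant to provide.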
  • Two methods for obtaining second-level domain name information have been introduced above.
  • With them, the second-level domain name information associated with the website primary domain name can be obtained fairly comprehensively.
  • However, whether it is obtained through the first method or the second, some second-level domain name information may still be missed. For example, some second-level domain names are used exclusively for internal management and appear neither on the website home page nor in search engines.
  • The embodiment of the present invention therefore also provides a third method for obtaining the second-level domain name information associated with the website primary domain name. It should be noted that, in this embodiment, the third method has no required order of execution relative to the two methods above. The third method is described in detail below with reference to FIG. 5.
  • The third method is performed after step S501 and includes steps S506 and S507.
  • A dictionary is stored in advance. The dictionary stores content that can be used as a second-level domain name of a website, such as "mail", "bbs" and "blog".
  • In step S506, each entry in the dictionary is combined with the website primary domain name to obtain a URL list associated with the website primary domain name.
  • In step S507, each URL in the URL list is accessed in turn, and it is determined whether the access succeeds.
  • If a URL is accessed successfully, the second-level domain name represented by the URL exists and is retained.
  • If the access fails, the second-level domain name represented by the URL does not exist.
  • After all the URLs in the URL list have been accessed, step S507 combines all the successfully accessed URLs as the obtained second-level domain name information.
  • The access described above includes resolving the URL address; if the resolution succeeds, the URL can be accessed.
  • Alternatively, a port of the URL address (for example, port 80 or port 443) may be accessed, and if the port is accessible, the second-level domain name is considered to exist.
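  • The dictionary-based third method (steps S506 and S507) can be sketched in Python as below. This is an illustrative sketch under stated assumptions: candidates are represented as host names rather than full URLs, the function names are hypothetical, and the access check is passed in as a parameter so that either name resolution or a port check can be used.

```python
import socket


def build_candidates(dictionary, primary_domain):
    """Step S506: combine every dictionary entry with the primary domain name."""
    return [f"{label}.{primary_domain}" for label in dictionary]


def resolves(host):
    """Return True if the host name resolves to an address."""
    try:
        socket.gethostbyname(host)
        return True
    except OSError:  # socket.gaierror is a subclass of OSError
        return False


def port_open(host, port=80, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def probe(candidates, is_accessible=resolves):
    """Step S507: keep only the candidates for which the access check succeeds."""
    return [host for host in candidates if is_accessible(host)]
```

  • For example, probe(build_candidates(["mail", "bbs", "blog"], "a.com")) would use name resolution, while passing is_accessible=port_open switches to the port-based check. The 3-second timeout is an arbitrary value chosen for the sketch.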
  • This embodiment thus introduces three different methods for obtaining second-level domain name information.
  • These three methods are only preferred embodiments.
  • The user may also select the method for obtaining the second-level domain name information according to the specific application environment.
  • The user may obtain the second-level domain name information by using only one of the above methods or any two of them, or by using methods other than the above three. Such other methods are not described one by one in this embodiment.
  • Step S508 is then performed. In step S508, all the second-level domain name information acquired in steps S503, S505 and S507 is deduplicated and merged.
  • In step S509, content of the second-level domain name information that does not yet exist in the dictionary may be added to the dictionary, so as to complete the dictionary content.
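  • Steps S508 and S509 can be illustrated with the following Python sketch. It is not the embodiment's code: it assumes each method returns a list of host names and that the dictionary is a plain list of labels, and the function names are hypothetical.

```python
def merge_and_dedupe(*sources):
    """Step S508: merge the results of all methods and drop duplicates."""
    merged = set()
    for source in sources:
        merged.update(host.lower() for host in source)
    return sorted(merged)


def supplement_dictionary(dictionary, subdomains, primary_domain):
    """Step S509: append labels found in `subdomains` that the dictionary lacks."""
    suffix = "." + primary_domain.lower()
    known = set(dictionary)
    for host in subdomains:
        if not host.endswith(suffix):
            continue
        label = host[: -len(suffix)]
        if label and label not in known:
            known.add(label)
            dictionary.append(label)
    return dictionary
```

  • Feeding newly discovered labels back into the dictionary means that a label found on one website by crawling or searching can later be tried against other websites by the third method.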
  • Step S510 is then performed, in which the second-level domain name information deduplicated and merged in step S508 is displayed to the user.
  • After step S508, in order to enable the website to be connected to the protection services and CDN services more effectively, other relevant information about the second-level domain name information needs to be obtained once the second-level domain name information associated with the website primary domain name has been obtained, so that website managers can understand the use of the website more fully.
  • In step S511, each second-level domain name in the deduplicated and merged second-level domain name information is accessed. Then, in step S512, the response results obtained by accessing each second-level domain name in step S511 are further processed.
  • Step S511 may access each second-level domain name in the second-level domain name information through different line nodes (such as a Netcom node and a Telecom node) and obtain, for each second-level domain name, the response result from each line node and the delay with which that response result is returned. Processing then continues in step S512.
  • The processing in step S512 can be divided into two processes: IP address acquisition and line type acquisition. The two processes are described below.
  • When the IP address is obtained in step S512, the IP address of each second-level domain name at each line node may be extracted from the response results of each second-level domain name obtained in step S511.
  • When the line type is obtained in step S512, the line type of a second-level domain name can be confirmed by comparing whether the delays with which the different line nodes return its response results are the same.
  • Specifically, when the delays differ, the line type of the line node that returns the response result with the shortest delay is confirmed as the line type of the second-level domain name; when the delays are the same, the line type of the second-level domain name is confirmed as the default type.
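  • The line type rule of step S512 can be expressed as the short Python sketch below. It is illustrative only: the function names and the "default" label are assumptions, and treating a tie for the shortest delay among more than two nodes as "the same" is an interpretation, since the text only discusses the two-node case.

```python
import time


def timed_call(access):
    """Call `access()` and return (result, delay in seconds)."""
    start = time.monotonic()
    result = access()
    return result, time.monotonic() - start


def confirm_line_type(delays, default_type="default"):
    """Pick the line type of the fastest node, or the default type on a tie.

    `delays` maps a line type (for example "telecom") to the delay with which
    that line node returned its response result.
    """
    shortest = min(delays.values())
    fastest = [line for line, delay in delays.items() if delay == shortest]
    return fastest[0] if len(fastest) == 1 else default_type
```

  • In practice two measured delays are rarely exactly equal, so an implementation might compare them within a tolerance. That refinement is not described in the text and is left out of the sketch.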
  • After the IP address and line type of each second-level domain name in the second-level domain name information have been obtained, the process proceeds to step S513, in which the IP address and line type of each second-level domain name obtained in step S512 are output to the user.
  • The method provided in this embodiment can, on the one hand, effectively and comprehensively obtain the second-level domain name information associated with the website primary domain name and, on the other hand, obtain the IP address and line type of each second-level domain name in that information and display the obtained information. This gives the webmaster a more complete understanding of the domain name access status of the website, so that the website can be effectively connected to protection services and CDN services.
  • The modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • The modules or units or components of the embodiments may be combined into one module or unit or component, and they may further be divided into a plurality of sub-modules or sub-units or sub-components.
  • Any combination may be used to combine all features disclosed in this specification (including the accompanying claims, abstract and drawings) and all processes or units of any method or device so disclosed.
  • Each feature disclosed in the specification (including the accompanying claims, the abstract and the drawings) may be replaced by an alternative feature that provides the same, equivalent or similar purpose.
  • The various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof.
  • Those skilled in the art will appreciate that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the apparatus for acquiring second-level domain name information associated with a primary domain name in accordance with an embodiment of the present invention.
  • The invention may also be implemented as a device or apparatus program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
  • Such a program implementing the present invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • FIG. 7 illustrates a server, such as an application server, that can implement the method for acquiring second-level domain name information associated with a primary domain name in accordance with the present invention.
  • The server conventionally includes a processor 710 and a computer program product or computer readable medium in the form of a memory 720.
  • Memory 720 can be an electronic memory such as flash memory, EEPROM (Electrically Erasable Programmable Read Only Memory), EPROM, hard disk, or ROM.
  • Memory 720 has a storage space 730 for program code 731 for performing any of the method steps described above.
  • For example, the storage space 730 for program code may include individual program codes 731 for implementing the various steps of the above methods, respectively.
  • The program code can be read from or written to one or more computer program products.
  • Such computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG.
  • the storage unit may have a storage section, a storage space, and the like arranged similarly to the storage 720 in the server of FIG.
  • The program code may, for example, be compressed in an appropriate form.
  • Typically, the storage unit includes computer readable code 731', i.e., code that can be read by a processor such as the processor 710, which, when executed by the server, causes the server to perform the various steps of the methods described above.
  • “An embodiment” or “one or more embodiments” as used herein means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the invention.
  • The phrase “in one embodiment” herein does not necessarily refer to the same embodiment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Disclosed are a device and method for acquiring second-level domain name information associated with a main domain name. The device comprises: a webpage crawling module which is configured to crawl a webpage content of which the URL is set as a main domain name; and a first domain name acquisition module which is configured to analyze the crawled webpage content, so as to acquire second-level domain name information associated with the main domain name in the webpage content. The present invention can effectively and comprehensively acquire the second-level domain name information associated with the main domain name, so that a webmaster user can learn about the domain name access condition of a website more comprehensively.

Description

Apparatus and method for obtaining second-level domain name information associated with a primary domain name
Technical Field
The present invention relates to the field of the Internet, and in particular to an apparatus and method for obtaining second-level domain name information associated with a primary domain name.
Background
With the development of Internet technology, there are more and more webmaster users on the Internet, and more and more network protection and CDN (Content Delivery Network) services are offered to them. How to connect a user's website to protection services and CDN services conveniently, quickly and accurately has become an urgent problem.
Connecting a user's website to protection services and CDN services conveniently, quickly and accurately presupposes that the webmaster user has a comprehensive understanding of the domain name access status of the website, which in turn requires the webmaster user to accurately obtain the second-level domain name information associated with the primary domain name of the website.
However, the prior art offers no effective and comprehensive way of obtaining the second-level domain name information associated with a website primary domain name, so webmaster users cannot gain a fuller understanding of the domain name access status of their websites.
Summary of the Invention
In view of the above problems, the present invention is proposed in order to provide an apparatus and method for obtaining second-level domain name information associated with a primary domain name that overcome the above problems or at least partially solve or mitigate them.
According to one aspect of the present invention, a device for obtaining second-level domain name information associated with a primary domain name is provided, comprising: a webpage crawling module configured to crawl webpage content whose URL is set to the primary domain name; and a first domain name obtaining module configured to analyze the crawled webpage content and obtain, from the webpage content, the second-level domain name information associated with the primary domain name.
Optionally, the device further comprises: a search result obtaining module configured to obtain search results that are indexed by a search engine and associated with the primary domain name; and a second domain name obtaining module configured to analyze the search results obtained by the search result obtaining module so as to obtain the second-level domain name information associated with the primary domain name.
Optionally, the search result obtaining module comprises: a generating unit configured to combine the primary domain name with a preset identifier to generate query information; and an obtaining unit configured to enter the query information in at least one search engine and obtain the search results that are indexed by the at least one search engine and associated with the primary domain name.
Optionally, the generating unit is further configured to add the preset identifier before the primary domain name to generate the query information.
Optionally, the preset identifier added by the generating unit includes: site:; correspondingly, the query information generated by the generating unit is: site:a.com, where a.com is the primary domain name.
Optionally, the second domain name obtaining module is configured to obtain the second-level domain name information associated with the primary domain name as follows: merging the search results associated with the primary domain name obtained through different search engines; and obtaining, from the merged search results, the second-level domain name information associated with the primary domain name.
Optionally, the device further comprises: a merging module configured to merge the second-level domain name information associated with the primary domain name obtained by the first domain name obtaining module and by the second domain name obtaining module respectively; and a record output module configured to output the merged second-level domain name information as second-level domain name records.
Optionally, the merging module performs the merging as follows: the second-level domain name information associated with the primary domain name obtained by the first domain name obtaining module and by the second domain name obtaining module respectively is deduplicated and then merged.
Optionally, the device further comprises: a list establishing module configured to use a preset dictionary to establish a predetermined second-level domain name list associated with the primary domain name; and a third domain name obtaining module configured to obtain, based on the predetermined second-level domain name list, the second-level domain name information associated with the primary domain name.
Optionally, the third domain name obtaining module is further configured to access each second-level domain name in the predetermined second-level domain name list and take the second-level domain names that are accessed successfully as the second-level domain name information associated with the primary domain name.
Optionally, the merging module is further configured to merge the second-level domain name information already obtained with the second-level domain name information associated with the primary domain name obtained by the third domain name obtaining module.
Optionally, the device further comprises: a dictionary supplementing module configured to, for any second-level domain name in the second-level domain name information associated with the primary domain name obtained by the merging module, add that second-level domain name to the preset dictionary if it does not exist in the preset dictionary.
Optionally, the device further comprises: a domain name access module configured to access, through different line nodes, at least one of the one or more second-level domain names in the obtained second-level domain name information, and to obtain the response results returned through the different line nodes and the delays with which the different line nodes return the response results; an information obtaining module configured to obtain, from the response results, the IP (Internet Protocol) address obtained by accessing each second-level domain name through the different line nodes, and to confirm the line type of each second-level domain name according to the delays with which the different line nodes return the response results; and an IP address output module configured to output the IP address and line type of each second-level domain name.
Optionally, the information obtaining module comprises: an IP address obtaining unit configured to obtain, from the response results, the IP address obtained by accessing each second-level domain name through the different line nodes; and a line type confirmation unit configured to compare whether the delays with which the different line nodes return response results are the same when each second-level domain name is accessed, to confirm, when the delays differ, the line type of the line node that returns the response result with the shortest delay as the line type of the second-level domain name, and to confirm, when the delays are the same, that the line type of the second-level domain name is the default type.
According to another aspect of the present invention, a method for obtaining second-level domain name information associated with a primary domain name is provided, comprising: crawling webpage content whose URL is set to the primary domain name; and analyzing the crawled webpage content to obtain, from the webpage content, the second-level domain name information associated with the primary domain name.
Optionally, the method further comprises: obtaining search results that are indexed by a search engine and associated with the primary domain name; and analyzing the obtained search results to obtain the second-level domain name information associated with the primary domain name.
Optionally, the method further comprises: using a preset dictionary to establish a predetermined second-level domain name list associated with the primary domain name; and obtaining, based on the predetermined second-level domain name list, the second-level domain name information associated with the primary domain name.
Optionally, obtaining the second-level domain name information associated with the primary domain name based on the predetermined second-level domain name list comprises: accessing each second-level domain name in the predetermined second-level domain name list and taking the second-level domain names that are accessed successfully as the second-level domain name information associated with the primary domain name.
Optionally, the method further comprises: merging all the second-level domain name information associated with the primary domain name that has been obtained so far.
Optionally, the method further comprises: for any second-level domain name in the second-level domain name information associated with the primary domain name obtained after the merging, adding that second-level domain name to the preset dictionary if it does not exist in the preset dictionary.
Optionally, the method further comprises: accessing, through different line nodes, at least one of the one or more second-level domain names in the obtained second-level domain name information, and obtaining the response results returned through the different line nodes and the delays with which the different line nodes return the response results; obtaining, from the response results, the IP address obtained by accessing each second-level domain name through the different line nodes, and confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results; and outputting the IP address and line type of each second-level domain name.
Optionally, confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results comprises: comparing whether the delays with which the different line nodes return response results are the same when each second-level domain name is accessed; when the delays differ, confirming the line type of the line node that returns the response result with the shortest delay as the line type of the second-level domain name; and when the delays are the same, confirming that the line type of the second-level domain name is the default type.
According to yet another aspect of the present invention, a computer program is provided, comprising computer readable code which, when run on a server, causes the server to perform the method for obtaining second-level domain name information associated with a primary domain name according to any one of claims 15-28.
According to a further aspect of the present invention, a computer readable medium is provided, in which the computer program according to claim 29 is stored.
The present invention provides an apparatus and method for obtaining second-level domain name information associated with a primary domain name. With the present invention, on the one hand, the second-level domain name information associated with a website primary domain name can be obtained effectively and comprehensively; on the other hand, the IP address and line type of each second-level domain name in the second-level domain name information can be obtained and the obtained information displayed, so that the webmaster user can understand the domain name access status of the website more fully.
The above description is only an overview of the technical solution of the present invention. In order that the technical means of the present invention may be understood more clearly and implemented in accordance with the content of the specification, and in order that the above and other objects, features and advantages of the present invention may be more readily apparent, specific embodiments of the present invention are set forth below.
Brief Description of the Drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for the purpose of illustrating the preferred embodiments and are not to be considered as limiting the invention. Throughout the drawings, the same reference symbols denote the same parts. In the drawings:
FIG. 1 shows a structural diagram of a device for obtaining second-level domain name information associated with a primary domain name according to an embodiment of the present invention;
FIG. 2 shows a schematic diagram of a website selection list according to an embodiment of the present invention;
FIG. 3 shows a schematic diagram of initialization while the second-level domain name information associated with a website primary domain name is being obtained, according to an embodiment of the present invention;
FIG. 4 shows a schematic diagram of the specific parameters associated with a website primary domain name according to an embodiment of the present invention;
FIG. 5 shows a flow chart of a method for obtaining second-level domain name information associated with a primary domain name according to an embodiment of the present invention;
FIG. 6 shows a flow chart of a method for obtaining, in a search engine, search results associated with a primary domain name according to an embodiment of the present invention;
FIG. 7 schematically shows a block diagram of a server for performing the method for obtaining second-level domain name information associated with a primary domain name according to the present invention; and
FIG. 8 schematically shows a storage unit for holding or carrying program code that implements the method for obtaining second-level domain name information associated with a primary domain name according to the present invention.
Detailed Description
The algorithms and displays provided herein are not inherently related to any particular computer, virtual system, or other device. Various general-purpose systems may also be used with the teachings herein. From the description above, the structure required to construct such systems is apparent. Moreover, the present invention is not directed to any particular programming language. It should be understood that the content of the invention described herein may be implemented in a variety of programming languages, and that the description of a specific language above is given to disclose the best mode of the invention.
As noted in the related art, there is currently no effective and comprehensive way to acquire the second-level domain name information associated with a website's primary domain name, so webmasters cannot gain a full picture of how the website's domain names are accessed.
To solve the above technical problem, an embodiment of the present invention provides a device for acquiring second-level domain name information associated with a primary domain name. FIG. 1 shows a structural diagram of such a device according to an embodiment of the present invention. The device 100 can be used to acquire the second-level domain name information associated with a website's primary domain name, together with other information corresponding to that second-level domain name information. The structure of the device, the connections between its parts, and the role of each part are described in detail below.
Referring to FIG. 1, the web page crawling module 110 and the first domain name obtaining module 120 of the device 100 cooperate to acquire the second-level domain name information associated with the website's primary domain name. The web page crawling module 110 first obtains, according to a user instruction, the primary domain name of the website whose second-level domain name information is to be queried, and crawls the web page content related to that primary domain name. The first domain name obtaining module 120 then analyzes the crawled web page content and obtains the second-level domain name information associated with the primary domain name.
The primary domain name of a website, as mentioned above, is usually the website's address; for example, the primary domain name of the Baidu website is its address ***.com. In a specific embodiment of the present invention, the primary domain name obtained by the web page crawling module 110 may be entered by the user in an input box provided by the device, or selected, according to a user instruction, from a pre-saved list of website domain names. For the latter case, see FIG. 2, which shows a schematic diagram of a website selection list according to an embodiment of the present invention.
For example, the user can trigger the web page crawling module 110 by clicking the "Initialize" button shown for the list entry whose primary domain name is "kafei.com". After the user clicks the "Initialize" button, the page shown in FIG. 3 is displayed, which illustrates the initialization in progress while the second-level domain name information associated with the website's primary domain name is being acquired.
In this embodiment of the present invention, the first domain name obtaining module 120 can obtain second-level domain name information from the web page content crawled by the web page crawling module 110 because that content contains a number of URL links that include the website's primary domain name. The first domain name obtaining module 120 can therefore extract the second-level domain name information associated with the primary domain name from those links. In a specific embodiment, suppose one of the URL links in the crawled content is "news.***.com/question/559520798.html?push=core&group=l". The second-level domain name information that the first domain name obtaining module 120 obtains from this link is "news.***.com".
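The link-based extraction described above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the implementation of the first domain name obtaining module 120: the function name extract_subdomains, the regular expression used to find links, and the placeholder domain example.com are all hypothetical and do not appear in the specification.

```python
import re
from urllib.parse import urlsplit


# Hypothetical helper; the name and the link-matching regex are assumptions.
def extract_subdomains(page_html: str, primary_domain: str) -> list[str]:
    """Return host names found in page links that end with the primary domain."""
    primary = primary_domain.lower()
    found: list[str] = []
    # Collect the values of href="..." and src="..." attributes.
    for link in re.findall(r'(?:href|src)\s*=\s*["\']([^"\']+)["\']', page_html, re.I):
        # A link without a scheme (e.g. "news.example.com/a") needs a leading
        # "//" so that urlsplit treats its first part as the host.
        if "://" not in link and not link.startswith("//"):
            link = "//" + link
        try:
            host = (urlsplit(link).hostname or "").lower()
        except ValueError:
            continue  # skip malformed links
        # Keep only hosts that are strictly below the primary domain.
        if host.endswith("." + primary) and host not in found:
            found.append(host)
    return found
```

Called with the crawled page content and the primary domain name, the function returns each matching host name once, in the order in which it first appears in the page.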
The above describes how the device 100 acquires second-level domain name information by crawling the web page content of the website's primary domain name. However, because the web page crawling module 110 usually crawls only the web page content of the primary domain name, any second-level domain name whose links do not appear in that content will be missed, and the information obtained by the first domain name obtaining module 120 will be incomplete. To acquire the second-level domain name information more comprehensively, the device 100 therefore provides, in addition to the web page crawling module 110 and the first domain name obtaining module 120, other modules for acquiring second-level domain name information. Note that when the device 100 uses different modules to acquire second-level domain name information, the order in which they do so is not specifically limited.
The connections between, and roles of, the other modules of the device 100 that acquire second-level domain name information are described below.
Referring to FIG. 1, the device 100 can also acquire the second-level domain name information associated with the website's primary domain name through cooperation between the search result obtaining module 130 and the second domain name obtaining module 140. The search result obtaining module 130 first obtains, according to a user instruction, the primary domain name of the website whose second-level domain name information is to be queried, and obtains from a search engine the search results associated with that primary domain name. The search result obtaining module 130 obtains the primary domain name in the same two ways as the web page crawling module 110, which are not repeated here. The second domain name obtaining module 140 then obtains second-level domain name information from the search results obtained by the search result obtaining module 130.
To better describe how the search result obtaining module 130 obtains the search results associated with the primary domain name, this embodiment is explained with reference to FIG. 1. As shown in FIG. 1, the search result obtaining module 130 includes a generating unit 131 and an obtaining unit 132, whose connections and roles are described below. In this embodiment of the present invention, after a user instruction is received, the generating unit 131 obtains the website's primary domain name and combines it with a preset identifier to generate query information. One side of the obtaining unit 132 is coupled to the generating unit 131 and receives the query information from it; the other side is coupled to a search engine, into which it enters the query information in order to obtain the search results that the search engine has indexed for the website's primary domain name. To obtain more comprehensive search results, the obtaining unit 132 in this embodiment may be coupled to multiple search engines at once, enter the query information into each of them, and obtain the search results that each has indexed for the primary domain name.
Further, regarding how the generating unit 131 generates the query information: in a specific embodiment, the preset identifier may be prepended to the primary domain name to form the query information. For example, if the preset identifier is "site:", the query information is "site:a.com", where a.com is an example primary domain name. The preset identifier may of course be another string; further examples are not given here.
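The construction of the query information can be sketched in Python as follows. This is a minimal sketch, not the generating unit 131 itself: the function names build_query and encode_for_url are hypothetical, and the percent-encoding step is an added assumption about how the query might be placed in a search URL, which the specification does not describe.

```python
from urllib.parse import quote_plus


# Hypothetical helper: prepend the preset identifier to the primary domain name.
def build_query(primary_domain: str, preset_identifier: str = "site:") -> str:
    """Return query information such as 'site:a.com'."""
    return preset_identifier + primary_domain.strip()


# Assumption: the query is later embedded in a URL query string, so it is
# percent-encoded here. No particular search engine URL is implied.
def encode_for_url(query: str) -> str:
    """Percent-encode the query information for use in a URL."""
    return quote_plus(query)
```

Keeping the preset identifier as a parameter reflects the statement that identifiers other than "site:" may be used.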
To explain more concretely how the obtaining unit 132 obtains the search results that search engines have indexed for the website's primary domain name, the following uses the preset identifier "site:" and a single search engine S as an example. In a specific embodiment of the present invention, suppose the primary domain name of the website whose second-level domain name information is to be acquired is ABC.com. The query information generated by the generating unit 131 is then site:ABC.com. Once the query information has been generated, the obtaining unit 132 submits the search keyword site:ABC.com to search engine S and receives a number of search results, each of which contains a URL link that includes a domain name of the website at some level. The content of the search results is shown in Table 1.
Table 1
www.ABC.com
college.ABC.com
zhilu.vip.ABC.com
newcoding.vip.ABC.com
s1.wb1.wanwan.sina.com
As Table 1 shows, the obtaining unit 132 can obtain URL links that contain the website's domain names at every level, and can therefore obtain the second-level domain name information associated with the website's primary domain name.
As noted above, the obtaining unit 132 can obtain search results from multiple search engines. In general, the results from different search engines overlap heavily, so the second domain name obtaining module 140 first deduplicates and merges the search results obtained from the different search engines, and then obtains the second-level domain name information associated with the primary domain name from the merged results. Deduplicating and merging avoids repeated search results and reduces the processing load when acquiring second-level domain name information.
In this embodiment, the two approaches above already allow the second-level domain name information associated with the website's primary domain name to be acquired fairly comprehensively. However, whether the information is obtained by the first domain name obtaining module 120 or by the second domain name obtaining module 140, some second-level domain names may still be missed; for example, second-level domain names used only for internal management appear neither on the website's home page nor in search engines. To acquire the information more comprehensively still, this embodiment of the present invention uses further modules. As before, when the device 100 uses different modules to acquire second-level domain name information, the order in which they do so is not specifically limited.
The connections between, and roles of, these further modules are described below.
Referring to FIG. 1, the device 100 can also acquire the second-level domain name information associated with the website's primary domain name through cooperation between the list establishing module 170 and the third domain name obtaining module 180. This approach presupposes that the device 100 stores a dictionary in advance, containing entries such as "mail", "bbs", and "blog" that are likely to be second-level domain names of a website. The process of using the dictionary to acquire second-level domain name information is described below.
When the dictionary is used, the list establishing module 170 first combines each dictionary entry with the website's primary domain name to obtain a list of URLs (Uniform Resource Locators) associated with the primary domain name. The third domain name obtaining module 180 then acquires the second-level domain name information associated with the primary domain name on the basis of this URL list. In a specific embodiment, the URL list built by the list establishing module 170 contains the entries shown in Table 2.
Table 2
mail.ABC.com
bbs.ABC.com
blog.ABC.com
news.ABC.com
As mentioned above, the third domain name obtaining module 180 acquires the second-level domain name information associated with the primary domain name on the basis of the URL list. The process is as follows. The third domain name obtaining module 180 accesses each URL in the list in turn. If access to a URL succeeds, the second-level domain name it represents exists and is kept; if access fails, the second-level domain name does not exist and is discarded. Finally, all successfully accessed URLs in the list are combined as the acquisition result. The access described here includes resolving the URL address: if resolution succeeds, the URL is considered accessible. The invention is of course not limited to this; for example, after resolution succeeds, a particular port of the URL address (such as port 80 or port 443) may also be accessed, and the second-level domain name is considered to exist only if that port is reachable.
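The dictionary-based procedure above can be sketched in Python using only the standard socket module. This is a sketch under stated assumptions rather than the implementation of modules 170 and 180: the function names build_candidates, resolves, port_open, and probe are hypothetical, "access" is interpreted as DNS resolution optionally followed by a TCP connection, and the three-second timeout is an arbitrary choice.

```python
import socket
from typing import Callable, Iterable


# Hypothetical helper: combine each dictionary entry with the primary domain.
def build_candidates(prefixes: Iterable[str], primary_domain: str) -> list[str]:
    return [f"{prefix}.{primary_domain}" for prefix in prefixes]


def resolves(host: str) -> bool:
    """Return True if DNS resolution of the host succeeds."""
    try:
        socket.getaddrinfo(host, None)
        return True
    except socket.gaierror:
        return False


def port_open(host: str, port: int = 80, timeout: float = 3.0) -> bool:
    """Optional stricter check: return True if a TCP connection succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Keep only the candidates for which the chosen check succeeds.
def probe(candidates: Iterable[str],
          is_alive: Callable[[str], bool] = resolves) -> list[str]:
    return [host for host in candidates if is_alive(host)]
```

Passing the check as a parameter lets a caller choose between resolution alone and the stricter port test, for example probe(candidates, is_alive=lambda h: resolves(h) and port_open(h, 443)).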
In summary, the embodiments of the present invention describe the connections between, and roles of, the modules that acquire second-level domain name information in three different ways. Using all three groups of modules is merely the preferred implementation. A user may choose the modules according to the specific application environment: the user may use any one or any two of the three groups described above, or may use modules other than those described above. Such other modules are not described one by one in this embodiment.
Further, because the device 100 uses several of the above modules at once, the same second-level domain name may be obtained more than once. To avoid such duplicates, referring to FIG. 1, the device 100 further includes a merge module 150, which deduplicates and merges all the second-level domain name information obtained by the first domain name obtaining module 120, the second domain name obtaining module 140, and the third domain name obtaining module 180. Deduplicating and merging reduces the processing load on subsequent modules that operate on the second-level domain name information.
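A deduplicating merge of the results from several sources might look like the following Python sketch. The function name merge_subdomains is hypothetical, and the normalization applied before comparison (lower-casing and removing a trailing dot) is an assumption; the specification states only that duplicates are removed.

```python
from typing import Iterable


# Hypothetical helper: merge several result lists, keeping first-seen order.
def merge_subdomains(*sources: Iterable[str]) -> list[str]:
    seen: set[str] = set()
    merged: list[str] = []
    for source in sources:
        for name in source:
            # Assumed normalization: domain names are case-insensitive, and a
            # trailing dot denotes the same fully qualified name.
            key = name.strip().lower().rstrip(".")
            if key and key not in seen:
                seen.add(key)
                merged.append(key)
    return merged
```

Each argument stands for the output of one obtaining module, so any one, two, or all three groups of modules can feed the same merge step.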
In this embodiment, the dictionary mentioned above is an important factor in acquiring second-level domain name information, and its contents affect how comprehensive the acquired information is. Therefore, after the merge module 150 has obtained the second-level domain name information, the dictionary supplementing module 190 may add to the dictionary any content in that information that the dictionary does not yet contain. This improves the dictionary and makes later dictionary-based acquisition more comprehensive.
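The dictionary supplementing step can be sketched in Python as below. This is only one plausible reading of the text: it assumes that the "content" added to the dictionary is the part of each second-level domain name that precedes the primary domain name, and the function name supplement_dictionary is hypothetical.

```python
from typing import Iterable


# Hypothetical helper: add prefixes of discovered names to the dictionary.
def supplement_dictionary(dictionary: set[str],
                          subdomains: Iterable[str],
                          primary_domain: str) -> list[str]:
    """Add missing prefixes to the dictionary in place; return those added."""
    suffix = "." + primary_domain.lower()
    added: list[str] = []
    for name in subdomains:
        name = name.lower()
        if not name.endswith(suffix):
            continue  # not below the primary domain, so nothing to learn
        prefix = name[: -len(suffix)]
        if prefix and prefix not in dictionary:
            dictionary.add(prefix)
            added.append(prefix)
    return added
```

Returning the newly added prefixes makes it easy to log how the dictionary grows from one acquisition run to the next.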
In addition, after the merge module 150 has merged all the second-level domain name information, the device 100 may display the merged information to the user for convenience. Referring to FIG. 1, the device 100 further includes a record output module 160, coupled to the merge module 150, which displays the merged second-level domain name information to the user for viewing. In this embodiment, the second-level domain name information that the record output module 160 displays is shown in Table 3 below.
Table 3
[Table 3 appears only as an image in the source (Figure imgf000010_0001).]
As Table 3 shows, every displayed entry contains "ABC.com", which makes the displayed information cumbersome and hard for the user to read. To make the information more concise, in this embodiment the record output module 160 displays each second-level domain name with the website's primary domain name (ABC.com) removed. The information actually displayed is shown in Table 4.
Table 4
passport
sdxl
www
shenmo
w2i
In this embodiment, for the website to use protection services and CDN services more effectively, other information about the second-level domain names should be acquired once the second-level domain name information associated with the primary domain name has been obtained, so that the website administrator gains a fuller understanding of how the website is used. To this end, referring to FIG. 1, the device 100 further includes a domain name access module 200 and an information acquiring module 210. The domain name access module 200 is coupled to the merge module 150; it accesses each second-level domain name in the merged second-level domain name information and passes the response results to the information acquiring module 210 for further processing. The process by which the domain name access module 200 accesses second-level domain names is described below.
In this embodiment, the domain name access module 200 accesses each second-level domain name in the second-level domain name information through different line nodes, obtaining for each second-level domain name the response result from each line node and the delay with which that result was returned. It then passes these access results to the information acquiring module 210 for further processing. The line nodes may be, for example, Netcom and Telecom nodes.
The processing performed by the information acquiring module 210 is also described with reference to FIG. 1 and has two aspects. First, the IP address obtaining unit 211 in FIG. 1 obtains, from the response results, the IP address of each second-level domain name at each line node. Second, the line type confirming unit 212 in FIG. 1 determines the line type of each second-level domain name from the delays with which the different line nodes returned their response results. The operation of the line type confirming unit 212 is described in detail below.
The line type confirming unit 212 determines the line type of a second-level domain name by comparing the delays with which the response results for that domain name were returned through the different line nodes. If the delays differ, the line type of the node that returned its response with the shortest delay is taken as the line type of the second-level domain name; if the delays are the same, the line type is set to the default type. In a specific embodiment of the present invention, suppose the line nodes comprise a Netcom node and a Telecom node. To obtain the line type of a second-level domain name, the line type confirming unit 212 first accesses the domain name through the Telecom node and through the Netcom node; the response returned through the Telecom node has delay 1, and the response returned through the Netcom node has delay 2. The line type confirming unit 212 then compares delay 1 with delay 2: if delay 1 is less than delay 2, the line type of the second-level domain name is Telecom; if delay 1 equals delay 2, the line type is the default type.
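The delay comparison can be expressed as a short Python sketch. It is not the line type confirming unit 212 itself: the function name classify_line_type, the label "default", and the tolerance parameter are assumptions. The tolerance is added because two measured delays are rarely exactly equal in practice; with the default of 0.0 the function follows the rule exactly as stated.

```python
DEFAULT_LINE_TYPE = "default"  # assumed label for the default line type


# Hypothetical helper: map per-node delays to a line type.
def classify_line_type(delays: dict[str, float], tolerance: float = 0.0) -> str:
    """Return the node with the shortest delay, or the default type if all
    delays are equal (within the given tolerance)."""
    if not delays:
        return DEFAULT_LINE_TYPE
    values = list(delays.values())
    # All delays the same: no line is preferred.
    if max(values) - min(values) <= tolerance:
        return DEFAULT_LINE_TYPE
    # Otherwise the line of the fastest-responding node is the line type.
    return min(delays, key=delays.get)
```

With delays keyed by node name, a call such as classify_line_type({"telecom": delay_1, "netcom": delay_2}) mirrors the two-node example in the text.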
Once the IP address and line type of each second-level domain name in the second-level domain name information have been obtained, the IP address output module 220 outputs to the user the IP address and line type of each second-level domain name obtained by the information acquiring module 210.
In summary, the device 100 displays to the user the acquired second-level domain name information associated with the website's primary domain name, together with the IP address and line type of each second-level domain name in that information. This gives the user a comprehensive view of the website and allows the user to connect the website to better services. To show more clearly the effect of displaying this information to the user, the embodiment of the present invention also provides FIG. 4, a schematic diagram of the specific parameters associated with the website's primary domain name.
It can be seen that the device 100 for acquiring second-level domain name information associated with a primary domain name, as provided by the embodiment of the present invention, can on the one hand efficiently and comprehensively acquire the second-level domain name information associated with a website's primary domain name. On the other hand, it can acquire the IP address and line type of each second-level domain name in that information and display what it has acquired, so that a webmaster gains a fuller picture of how the website's domain names are accessed and the website can be connected effectively to protection services and CDN services.
It should be noted that the architecture of the device 100 described above is merely a preferred embodiment and does not limit the actual structure of the device 100. Depending on the circumstances, the device 100 may contain further components (modules) or interfaces.
Corresponding to the device 100 described above, the present invention further provides a method for acquiring second-level domain name information associated with a primary domain name. The method is used to acquire the second-level domain name information associated with the primary domain name of a website, as well as other related information. FIG. 5 shows a flowchart of such a method according to one embodiment of the present invention. Referring to FIG. 5, the method includes steps S501 to S513. The method starts at step S501, in which the primary domain name of the website whose second-level domain name information is to be queried is obtained according to a user instruction. Once the primary domain name has been obtained, the second-level domain name information associated with it is acquired on the basis of that primary domain name. The user instruction in step S501 may be issued when the user types the website's primary domain name into an input box, or when the user selects a domain name from a pre-stored list of website domain names.
The following describes how the second-level domain name information associated with the website's primary domain name is acquired from the primary domain name. The method includes steps S502 and S503. In step S502, webpage content related to the website's primary domain name is crawled according to the primary domain name obtained in step S501. The crawled webpage content contains at least several URL links that include the website's primary domain name.
Next, step S503 is performed. In step S503, the webpage content crawled in step S502 is analyzed and the second-level domain name information associated with the website's primary domain name is extracted from it. In this embodiment, because the webpage content crawled in step S502 contains several URL links that include the website's primary domain name, step S503 can obtain the associated second-level domain name information from the webpage content.
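The analysis in step S503 can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the implementation described in the specification: the names `_LinkCollector` and `extract_subdomains` are hypothetical, only `<a href>` links are inspected, and any host ending in `.` plus the primary domain name is treated as an associated second-level domain name.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class _LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_subdomains(html, primary_domain):
    """Return the hosts linked from `html` that sit under `primary_domain`."""
    collector = _LinkCollector()
    collector.feed(html)
    suffix = "." + primary_domain.lower()
    found = set()
    for link in collector.links:
        # Relative links have no hostname and are skipped.
        host = (urlparse(link).hostname or "").lower()
        # Requiring the leading dot excludes the primary domain itself
        # and unrelated hosts such as "nota.com".
        if host.endswith(suffix):
            found.add(host)
    return found
```

Passing the crawled page of `a.com` to `extract_subdomains(page, "a.com")` would yield hosts such as `mail.a.com` when the page links to them.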
Steps S502 and S503 constitute the first method described in this embodiment for acquiring second-level domain name information from the website's primary domain name, namely analyzing the webpage content crawled for the primary domain name. This method usually crawls only the webpage content of the website's primary domain name, so if the links of some second-level domain names do not appear in that content, the acquired second-level domain name information will be incomplete to some extent. To acquire the second-level domain name information associated with the website's primary domain name more comprehensively, the embodiment of the present invention provides a second method for acquiring second-level domain name information from the website's primary domain name. It should be noted that in this embodiment the two methods have no required execution order: the second method may be performed before, after, or concurrently with the first. The second method is described below with reference to FIG. 5.
Referring to FIG. 5, the second method is performed after step S501 and includes steps S504 and S505. In step S504, search results associated with the website's primary domain name are obtained from a search engine according to the primary domain name obtained in step S501. The process by which step S504 obtains the search results is described in detail in the method shown in FIG. 6.
FIG. 6 shows a flowchart of a method for obtaining, from a search engine, search results associated with a primary domain name according to one embodiment of the present invention. The flow starts at step S601, in which the website's primary domain name obtained in step S501 is combined with a preset identifier to generate query information. In this embodiment, the preset identifier may be placed in front of the website's primary domain name to form the query information. For example, if the preset identifier is site: and the website's primary domain name is a.com, the generated query information is site:a.com. The preset identifier may also be another string; further examples are not given here.
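Step S601 amounts to a string concatenation. A minimal sketch, assuming a hypothetical function name `build_query` and taking the `site:` identifier from the example above as the default:

```python
def build_query(primary_domain, preset_identifier="site:"):
    """Place the preset identifier in front of the primary domain name."""
    return f"{preset_identifier}{primary_domain}"


print(build_query("a.com"))  # prints: site:a.com
```

Keeping the identifier as a parameter reflects the statement that other preset identifiers may be used.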
After the query information is generated in step S601, the flow proceeds to step S602, in which the query information is submitted to a search engine and the search results that the search engine has indexed for the website's primary domain name are obtained. In this embodiment, to obtain more comprehensive search results associated with the primary domain name, step S602 may submit the query information to multiple search engines at the same time and obtain the results indexed by each of them.
The specific operation of step S504 has been described above with reference to FIG. 6; step S505 is described next. In step S505, the second-level domain name information associated with the primary domain name is extracted from the search results obtained in step S504. It should be noted that when step S504 obtains search results from several different search engines, the results are very likely to overlap. To avoid duplicates, step S505 first deduplicates and merges the search results obtained from the different search engines, and then extracts the second-level domain name information associated with the primary domain name from the merged results. Deduplicating and merging removes repeated search results and reduces the processing load of step S505 when it extracts the second-level domain name information.
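The deduplicate-then-extract order of step S505 can be sketched as below. This is a sketch under assumptions: each search engine's results are represented simply as a list of result URLs, and the function names `merge_search_results` and `subdomains_from_results` are hypothetical.

```python
from urllib.parse import urlparse


def merge_search_results(results_per_engine):
    """Merge result URLs from several engines, dropping duplicates, keeping order."""
    seen = set()
    merged = []
    for results in results_per_engine:
        for url in results:
            if url not in seen:
                seen.add(url)
                merged.append(url)
    return merged


def subdomains_from_results(urls, primary_domain):
    """Return the sorted hosts among `urls` that sit under `primary_domain`."""
    suffix = "." + primary_domain.lower()
    hosts = set()
    for url in urls:
        host = (urlparse(url).hostname or "").lower()
        if host.endswith(suffix):
            hosts.add(host)
    return sorted(hosts)
```

Merging first means each distinct result URL is parsed only once, which is the reduction in processing load that the paragraph above describes.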
The two methods for acquiring second-level domain name information have now been described. Together they can acquire the second-level domain name information associated with the website's primary domain name fairly comprehensively. However, both methods may still miss some second-level domain names; for example, second-level domain names used exclusively for internal management appear neither on the website's home page nor in search engines. To acquire the second-level domain name information more comprehensively, the embodiment of the present invention provides a third method for acquiring the second-level domain name information associated with the website's primary domain name. It should be noted that in this embodiment the third method likewise has no required execution order relative to the first two. The third method is described below with reference to FIG. 5.
Referring to FIG. 5, the third method is performed after step S501 and includes steps S506 and S507. A precondition of the third method is that a dictionary has been stored in advance. The dictionary holds strings that can serve as second-level domain name labels of a website, such as "mail", "bbs", and "blog". In step S506, each entry in the dictionary is combined with the website's primary domain name to obtain a list of URLs associated with the website's primary domain name.
After the URL list is obtained, step S507 is performed. In step S507, each URL in the URL list is accessed in turn and it is determined whether the access succeeds. If access to a URL succeeds, the second-level domain name represented by that URL exists and is retained; if access fails, the second-level domain name does not exist and is not saved. After all URLs in the list have been accessed, step S507 combines all successfully accessed URLs as the acquired second-level domain name information. It should be noted that the access described here includes resolving the URL address: if resolution succeeds, the URL is considered accessible. The present invention is not limited to this; for example, after successful resolution, a port of the URL address (such as port 80 or port 443) may additionally be accessed, and the second-level domain name is considered to exist only if that port can be reached.
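Steps S506 and S507 can be sketched as follows, using DNS resolution as the success criterion. This is an illustrative sketch, not the specification's implementation: the names `resolves` and `probe_dictionary` are hypothetical, candidates are built as bare host names rather than full URLs, and the check is injectable so that a port probe could be substituted.

```python
import socket


def resolves(host):
    """Return True if DNS resolution of `host` succeeds."""
    try:
        socket.gethostbyname(host)
        return True
    except (OSError, UnicodeError):
        # socket.gaierror is a subclass of OSError.
        return False


def probe_dictionary(primary_domain, dictionary, is_reachable=resolves):
    """Combine each dictionary entry with the primary domain and keep the hits."""
    candidates = [f"{label}.{primary_domain}" for label in dictionary]
    return [host for host in candidates if is_reachable(host)]
```

A call such as `probe_dictionary("a.com", ["mail", "bbs", "blog"])` performs one DNS lookup per entry; passing a different `is_reachable` callable would implement the stricter variant that also requires a reachable port.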
In summary, this embodiment describes three different methods for acquiring second-level domain name information. Using all three is merely a preferred implementation of the present invention. The user may also choose how to acquire second-level domain name information according to the specific application environment: the user may use any one or any two of the three methods, or may use methods other than these three. Such other methods are not described one by one in this embodiment.
Further, because this embodiment uses all three methods to acquire second-level domain name information, the acquired second-level domain names will overlap to some extent. To avoid duplicates, after the three methods have been performed, step S508 is performed, in which all second-level domain name information acquired in steps S503, S505, and S507 is deduplicated and merged.
After the deduplicated and merged second-level domain name information is obtained in step S508, step S509 may add to the dictionary any content in the second-level domain name information that is not yet in the dictionary, so as to improve the dictionary.
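Steps S508 and S509 can be sketched together. The sketch assumes that each method returns a set of host names and that the dictionary is a set of labels; the function name `merge_and_update_dictionary` is hypothetical.

```python
def merge_and_update_dictionary(primary_domain, sources, dictionary):
    """Deduplicate and merge host sets, then add unseen labels to `dictionary`."""
    suffix = "." + primary_domain
    # The set union removes hosts found by more than one method (step S508).
    merged = sorted(set().union(*sources))
    for host in merged:
        if host.endswith(suffix):
            label = host[: -len(suffix)]
            # Step S509: record labels the dictionary does not yet contain.
            if label and label not in dictionary:
                dictionary.add(label)
    return merged
```

Because the dictionary is updated in place, labels discovered for one website become candidates when the dictionary method is later applied to another.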
In addition, after the deduplicated and merged second-level domain name information is obtained in step S508, step S510 is performed to display that information to the user so that the user can conveniently review it.
Further, in this embodiment, for the website to connect to protection services and CDN services more effectively, other information related to the second-level domain names should also be acquired once the second-level domain name information associated with the website's primary domain name has been obtained, so that the website administrator can understand the website's usage more fully. Accordingly, after step S508 is completed, the flow proceeds to step S511, in which each second-level domain name in the deduplicated and merged information is accessed. Step S512 then further processes the response results obtained when step S511 accesses each second-level domain name.
As noted above, step S511 may access each second-level domain name in the second-level domain name information through different line nodes (for example, a China Netcom node and a China Telecom node), obtaining for each second-level domain name the response result from each line node and the latency with which that response was returned. Step S512 then performs the next stage of processing, which can be divided into two processes, IP address acquisition and line type acquisition. Each is described below.
For IP address acquisition, step S512 extracts, from the response results obtained in step S511 for each second-level domain name at the different line nodes, the IP address of each second-level domain name at each line node. For line type acquisition, step S512 determines the line type of a second-level domain name by comparing whether the latencies of the responses returned through the different line nodes are the same. Specifically, if the latencies differ, the line type of the line node that returned its response with the shortest latency is taken as the line type of the second-level domain name; if the latencies are the same, the line type of the second-level domain name is determined to be the default type.
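The line type decision in step S512 can be sketched as follows. The function name `classify_line_type`, the representation of latencies as a mapping from line type to seconds, and the `tolerance` parameter are assumptions introduced here; the specification only compares whether the latencies are the same, which corresponds to the default `tolerance=0.0`.

```python
def classify_line_type(latencies, default="default", tolerance=0.0):
    """Pick a line type from per-node latencies (line type -> seconds)."""
    if not latencies:
        return default
    fastest = min(latencies, key=latencies.get)
    slowest_latency = max(latencies.values())
    # Latencies within `tolerance` of each other count as "the same".
    if slowest_latency - latencies[fastest] <= tolerance:
        return default
    return fastest


print(classify_line_type({"telecom": 0.020, "netcom": 0.080}))  # prints: telecom
```

Measured latencies are rarely exactly equal, so a small non-zero tolerance may be a practical choice; that is a design suggestion rather than something the specification states.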
After the IP address and line type of each second-level domain name in the second-level domain name information have been obtained in step S512, step S513 is performed to output to the user the IP address and line type of each second-level domain name obtained in step S512.
As the above description shows, the method provided by this embodiment can, on the one hand, effectively and comprehensively acquire the second-level domain name information associated with the website's primary domain name. On the other hand, it can acquire the IP address and line type of each second-level domain name and display the acquired information. The webmaster can therefore understand the website's domain name access situation more fully and connect the website effectively to protection services and CDN services.
Numerous specific details are set forth in the description provided here. It will be understood, however, that embodiments of the present invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid understanding of one or more of the various inventive aspects, the features of the present invention are sometimes grouped together in a single embodiment, figure, or description thereof in the above description of exemplary embodiments. This manner of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single foregoing disclosed embodiment. The claims following the detailed description are therefore expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the present invention.
Those skilled in the art will appreciate that the modules in the device of an embodiment may be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units, or components of an embodiment may be combined into a single module, unit, or component, and may furthermore be divided into a plurality of sub-modules, sub-units, or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract, and drawings), and all processes or units of any method or device so disclosed, may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract, and drawings) may be replaced by an alternative feature serving the same, an equivalent, or a similar purpose.
Furthermore, those skilled in the art will understand that although some embodiments described here include some features that are included in other embodiments but not others, combinations of features of different embodiments are meant to be within the scope of the present invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination of the two. Those skilled in the art will understand that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the device for acquiring second-level domain name information associated with a primary domain name according to embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program or computer program product) for performing part or all of the method described here. Such a program implementing the present invention may be stored on a computer-readable medium or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
For example, FIG. 7 shows a server, such as an application server, that can implement the method according to the present invention for obtaining, from a search engine, search results associated with a primary domain name. The server conventionally includes a processor 710 and a computer program product or computer-readable medium in the form of a memory 720. The memory 720 may be an electronic memory such as flash memory, EEPROM (electrically erasable programmable read-only memory), EPROM, a hard disk, or ROM. The memory 720 has a storage space 730 for program code 731 for performing any of the method steps described above. For example, the storage space 730 may include individual program codes 731, each implementing one of the steps of the above method. The program code can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact discs (CDs), memory cards, or floppy disks. Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG. 8. The storage unit may have storage segments, storage spaces, and the like arranged similarly to the memory 720 in the server of FIG. 7. The program code may, for example, be compressed in a suitable form. Typically, the storage unit includes computer-readable code 731', that is, code that can be read by a processor such as the processor 710 and that, when run by the server, causes the server to perform the steps of the method described above.
References here to "one embodiment", "an embodiment", or "one or more embodiments" mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Note also that instances of the phrase "in one embodiment" do not necessarily all refer to the same embodiment.
It should be noted that the above embodiments illustrate rather than limit the present invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The present invention may be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, and so on does not indicate any order; these words may be interpreted as names.
In addition, it should be noted that the language used in this specification has been selected principally for readability and instructional purposes, and not to delineate or circumscribe the inventive subject matter. Accordingly, many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the appended claims. With respect to the scope of the present invention, the disclosure made here is illustrative rather than restrictive, and the scope of the present invention is defined by the appended claims.

Claims

1. A device for acquiring second-level domain name information associated with a primary domain name, comprising:
a webpage crawling module configured to crawl webpage content whose URL is set to the primary domain name; and
a first domain name acquisition module configured to analyze the crawled webpage content and acquire, from the webpage content, the second-level domain name information associated with the primary domain name.
2. The device according to claim 1, further comprising:
a search result acquisition module configured to obtain search results that are indexed by a search engine and associated with the primary domain name; and
a second domain name acquisition module configured to analyze the search results obtained by the search result acquisition module to acquire second-level domain name information associated with the primary domain name.
3. The device according to claim 2, wherein the search result acquisition module comprises:
a generation unit configured to combine the primary domain name with a preset identifier to generate query information; and
an acquisition unit configured to enter the query information into at least one search engine and obtain the search results that are indexed by the at least one search engine and associated with the primary domain name.
4. The device according to claim 3, wherein the generation unit is further configured to add the preset identifier in front of the primary domain name to generate the query information.
5. The device according to claim 4, wherein the preset identifier added by the generation unit comprises site: and, correspondingly, the query information generated by the generation unit is site:a.com, where a.com is the primary domain name.
6. The device according to any one of claims 2 to 5, wherein the second domain name acquisition module is configured to acquire the second-level domain name information associated with the primary domain name by:
merging the search results associated with the primary domain name that were obtained through different search engines; and
acquiring, from the merged search results, the second-level domain name information associated with the primary domain name.
7. The device according to any one of claims 2 to 6, further comprising:
a merging module configured to merge the second-level domain name information associated with the primary domain name that was acquired by the first domain name acquisition module and by the second domain name acquisition module respectively; and
a record output module configured to output the merged second-level domain name information as a second-level domain name record.
8. The device according to claim 7, wherein the merging module performs the merging by deduplicating the second-level domain name information associated with the primary domain name that was acquired by the first domain name acquisition module and by the second domain name acquisition module respectively, and then merging it.
9. The device according to claim 7 or 8, further comprising:
a list building module configured to use a preset dictionary to build a predetermined second-level domain name list associated with the primary domain name; and
a third domain name acquisition module configured to acquire, based on the predetermined second-level domain name list, second-level domain name information associated with the primary domain name.
10. The device according to claim 9, wherein the third domain name acquisition module is further configured to access each second-level domain name in the predetermined second-level domain name list and collect the second-level domain names that are accessed successfully, thereby obtaining the second-level domain name information associated with the primary domain name.
11. The device according to claim 9 or 10, wherein the merging module is further configured to merge the second-level domain name information already acquired with the second-level domain name information associated with the primary domain name that was acquired by the third domain name acquisition module.
12. The device according to claim 11, further comprising:
a dictionary supplement module, configured to, for any second-level domain name in the second-level domain name information associated with the primary domain name obtained by the merging module, add that second-level domain name to the preset dictionary if it does not already exist in the preset dictionary.
13. The device according to any one of claims 1 to 12, further comprising:
a domain name access module, configured to use different line nodes to access at least one of the one or more second-level domain names in the acquired second-level domain name information, and to obtain the response results returned through the different line nodes and the delays with which the different line nodes return those response results;
an information acquisition module, configured to obtain, from the response results, the IP address obtained for each second-level domain name when accessed through the different line nodes, and to confirm the line type of each second-level domain name according to the delays with which the different line nodes return the response results; and
an IP address output module, configured to output the IP address and line type of each second-level domain name.
14. The device according to claim 13, wherein the information acquisition module comprises:
an IP address acquisition unit, configured to obtain, from the response results, the IP address obtained for each second-level domain name when accessed through the different line nodes;
a line type confirmation unit, configured to compare, for each second-level domain name accessed, whether the delays with which the different line nodes return the response results are the same; when the delays differ, to confirm that the line type of the line node with the shortest delay is the line type of the second-level domain name; and when the delays are the same, to confirm that the line type of the second-level domain name is the default type.
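The line type rule of claim 14 can be illustrated with a minimal Python sketch. The function name `confirm_line_type`, the label `"default"`, the example line names `telecom` and `unicom`, and the `tolerance` parameter are assumptions made for illustration; the claim itself only distinguishes "same" from "different" delays.

```python
from typing import Mapping

DEFAULT_LINE_TYPE = "default"  # hypothetical label for the default type


def confirm_line_type(delays: Mapping[str, float],
                      default: str = DEFAULT_LINE_TYPE,
                      tolerance: float = 0.0) -> str:
    """Pick a line type from per-line response delays (in seconds).

    `delays` maps a line type to the delay measured through a node on
    that line. `tolerance` (an assumption, not in the claim) is the
    spread below which delays are treated as "the same".
    """
    if not delays:
        return default
    values = list(delays.values())
    # All delays equal (within tolerance): no line stands out.
    if max(values) - min(values) <= tolerance:
        return default
    # Otherwise the line whose node answered fastest wins.
    return min(delays, key=delays.get)
```

With measured delays, a nonzero `tolerance` is probably needed in practice, since two real measurements are rarely exactly equal.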
15. A method for acquiring second-level domain name information associated with a primary domain name, comprising:
crawling the content of a web page whose URL is set to the primary domain name;
analyzing the crawled web page content to obtain the second-level domain name information in the web page content that is associated with the primary domain name.
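The analysis step of claim 15 could, for instance, be a pattern search over the crawled text. The following Python sketch assumes the page content has already been fetched; the function name and the regular expression are illustrative choices, not the method fixed by the claim. Note that it also returns deeper names such as `x.y.a.com`.

```python
import re
from typing import Set


def extract_second_level_domains(page_content: str,
                                 primary_domain: str) -> Set[str]:
    """Collect host names of the form <label>.<primary_domain> in text."""
    label = r"[A-Za-z0-9](?:[A-Za-z0-9-]*[A-Za-z0-9])?"
    pattern = re.compile(
        r"(?<![A-Za-z0-9.-])"            # not in the middle of a host name
        r"((?:" + label + r"\.)+"        # one or more labels before the domain
        + re.escape(primary_domain) + r")"
        r"(?![A-Za-z0-9-]|\.[A-Za-z0-9])",  # host name must end here
        re.IGNORECASE,
    )
    # Host names are case-insensitive, so normalise to lower case.
    return {match.lower() for match in pattern.findall(page_content)}
```

The bare primary domain and unrelated hosts that merely end in the same characters (for example `www.ba.com` when the primary domain is `a.com`) are not reported.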
16. The method according to claim 15, further comprising:
acquiring search results that are indexed by a search engine and associated with the primary domain name;
analyzing the acquired search results to obtain second-level domain name information associated with the primary domain name.
17. The method according to claim 16, wherein acquiring the search results that are indexed by a search engine and associated with the primary domain name comprises:
combining the primary domain name with a preset identifier to generate query information;
entering the query information into at least one search engine, and acquiring the search results that are indexed by the at least one search engine and associated with the primary domain name.
18. The method according to claim 17, wherein combining the primary domain name with the preset identifier to generate the query information comprises:
adding the preset identifier before the primary domain name to generate the query information.
19. The method according to claim 18, wherein the preset identifier comprises: site:;
correspondingly, the generated query information is: site:a.com, where a.com is the primary domain name.
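Claims 17 to 19 amount to prefixing the primary domain name with the identifier. A minimal Python sketch follows; the search endpoint `https://search.example/search` and the parameter name `q` are hypothetical placeholders, since the claims do not name a search engine or its URL format.

```python
from urllib.parse import urlencode


def build_query(primary_domain: str, identifier: str = "site:") -> str:
    """Prepend the preset identifier to the primary domain name."""
    return f"{identifier}{primary_domain}"


def build_search_url(primary_domain: str,
                     endpoint: str = "https://search.example/search") -> str:
    """Build a request URL for a hypothetical search endpoint."""
    # urlencode percent-encodes the colon in "site:".
    return endpoint + "?" + urlencode({"q": build_query(primary_domain)})
```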
20. The method according to any one of claims 16 to 19, wherein analyzing the acquired search results to obtain second-level domain name information associated with the primary domain name comprises:
merging the search results associated with the primary domain name that were acquired through different search engines;
obtaining the second-level domain name information associated with the primary domain name from the merged search results.
21. The method according to any one of claims 16 to 20, further comprising:
merging all second-level domain name information associated with the primary domain name that has been acquired so far;
outputting the merged second-level domain name information as a second-level domain name record.
22. The method according to claim 21, wherein merging all second-level domain name information associated with the primary domain name that has been acquired so far comprises:
deduplicating and then merging all second-level domain name information associated with the primary domain name that has been acquired so far.
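The deduplicate-then-merge step of claim 22 can be sketched in Python as follows. The function name and the normalisation (lower-casing, trimming whitespace and a trailing dot) are assumptions added for illustration; the claim only requires that duplicates be removed before merging.

```python
from typing import Iterable, List


def merge_second_level_domains(*sources: Iterable[str]) -> List[str]:
    """Merge several result lists, keeping the first copy of each name."""
    seen = set()
    merged = []
    for source in sources:
        for name in source:
            # Treat "IMG.a.com" and "img.a.com." as the same host name.
            key = name.strip().lower().rstrip(".")
            if key and key not in seen:
                seen.add(key)
                merged.append(key)
    return merged
```

Each argument would be the output of one acquisition path, for example page crawling and search engine results.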
23. The method according to claim 21 or 22, further comprising:
using a preset dictionary to build a predetermined second-level domain name list associated with the primary domain name;
acquiring second-level domain name information associated with the primary domain name based on the predetermined second-level domain name list.
24. The method according to claim 23, wherein acquiring the second-level domain name information associated with the primary domain name based on the predetermined second-level domain name list comprises:
accessing each second-level domain name in the predetermined second-level domain name list and collecting the second-level domain names that are accessed successfully, thereby obtaining the second-level domain name information associated with the primary domain name.
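Claims 23 and 24 describe a dictionary-driven enumeration. The Python sketch below assumes that the preset dictionary holds prefixes such as `www` or `mail`, and that "access succeeded" can be approximated by a successful DNS lookup; both are assumptions, since the claims do not define the dictionary format or the access check. The check is injectable so that another test (for example an HTTP request) can be substituted.

```python
import socket
from typing import Callable, Iterable, List


def build_candidate_list(dictionary: Iterable[str],
                         primary_domain: str) -> List[str]:
    """Combine each dictionary prefix with the primary domain name."""
    return [f"{prefix}.{primary_domain}" for prefix in dictionary]


def resolves(host: str) -> bool:
    """Assumed access check: the name resolves to an IPv4 address."""
    try:
        socket.gethostbyname(host)
        return True
    except OSError:  # socket.gaierror is a subclass of OSError
        return False


def probe_candidates(candidates: Iterable[str],
                     is_accessible: Callable[[str], bool] = resolves
                     ) -> List[str]:
    """Keep only the candidates for which the access check succeeds."""
    return [host for host in candidates if is_accessible(host)]
```

A call such as `probe_candidates(build_candidate_list(["www", "mail"], "a.com"))` would perform real DNS lookups, so results depend on the network.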
25. The method according to claim 23 or 24, further comprising:
merging all second-level domain name information associated with the primary domain name that has been acquired so far.
26. The method according to claim 25, further comprising:
for any second-level domain name in the second-level domain name information associated with the primary domain name obtained after the merging, adding that second-level domain name to the preset dictionary if it does not already exist in the preset dictionary.
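The dictionary supplement step of claim 26 can be sketched as below. The sketch assumes, as a design choice not stated in the claim, that the preset dictionary stores only the prefix of each name (the part before the primary domain name) so that it can be reused for other primary domain names.

```python
from typing import Iterable, Set


def supplement_dictionary(dictionary: Set[str],
                          merged_domains: Iterable[str],
                          primary_domain: str) -> Set[str]:
    """Add unseen prefixes to `dictionary` in place; return what was added."""
    suffix = "." + primary_domain.lower()
    added = set()
    for name in merged_domains:
        name = name.lower()
        if not name.endswith(suffix):
            continue  # not under this primary domain name
        prefix = name[:-len(suffix)]
        if prefix and prefix not in dictionary:
            dictionary.add(prefix)
            added.add(prefix)
    return added
```

Names discovered through crawling or search thus become candidates in later dictionary-based enumeration runs.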
27. The method according to any one of claims 15 to 26, further comprising:
using different line nodes to access at least one of the one or more second-level domain names in the acquired second-level domain name information, and obtaining the response results returned through the different line nodes and the delays with which the different line nodes return those response results;
obtaining, from the response results, the IP address obtained for each second-level domain name when accessed through the different line nodes, and confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results;
outputting the IP address and line type of each second-level domain name.
28. The method according to claim 27, wherein confirming the line type of each second-level domain name according to the delays with which the different line nodes return the response results comprises:
comparing, for each second-level domain name accessed, whether the delays with which the different line nodes return the response results are the same; when the delays differ, confirming that the line type of the line node with the shortest delay is the line type of the second-level domain name; and when the delays are the same, confirming that the line type of the second-level domain name is the default type.
29. A computer program comprising computer-readable code which, when run on a server, causes the server to perform the method for acquiring second-level domain name information associated with a primary domain name according to any one of claims 15 to 28.
30. A computer-readable medium storing the computer program according to claim 29.
PCT/CN2014/080720 2013-07-09 2014-06-25 Device and method for acquiring second-level domain name information associated with main domain name WO2015003556A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310286579.8A CN103399871B (en) 2013-07-09 2013-07-09 Obtain the device and method of an associated second-level domain information of Main Domain
CN201310286579.8 2013-07-09

Publications (1)

Publication Number Publication Date
WO2015003556A1 (en) 2015-01-15

Family

ID=49563501

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/080720 WO2015003556A1 (en) 2013-07-09 2014-06-25 Device and method for acquiring second-level domain name information associated with main domain name

Country Status (2)

Country Link
CN (1) CN103399871B (en)
WO (1) WO2015003556A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4036760A1 (en) * 2021-02-01 2022-08-03 Deutsche Telekom AG Method and system for automated and dynamic main domain determination and identification

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103399871B (en) * 2013-07-09 2018-06-12 北京奇安信科技有限公司 Obtain the device and method of an associated second-level domain information of Main Domain
CN103595827B (en) * 2013-11-29 2017-06-06 北京奇安信科技有限公司 A kind of IP address recognition methods of CDN source station and device
CN103618742B (en) * 2013-12-09 2017-10-27 北京奇安信科技有限公司 Webmaster's method for verifying authority
CN105407186A (en) * 2015-12-23 2016-03-16 北京奇虎科技有限公司 Method and device for acquiring subdomain names
CN105740361B (en) * 2016-01-26 2019-01-29 上海晶赞科技发展有限公司 The detection method and device of full dose data integrity degree
CN108900581A (en) * 2018-06-12 2018-11-27 恒安嘉新(北京)科技股份公司 A kind of method for building up of the key feature knowledge base of large-scale website

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8078707B1 (en) * 2004-11-12 2011-12-13 Juniper Networks, Inc. Network management using hierarchical domains
CN102571404A (en) * 2010-12-31 2012-07-11 北京新媒传信科技有限公司 Website access statistical method and website access statistical system
CN102970339A (en) * 2012-10-30 2013-03-13 北京奇虎科技有限公司 Method for displaying web address and browser
CN103399871A (en) * 2013-07-09 2013-11-20 北京奇虎科技有限公司 Equipment and method for capturing second-level domain information associated with main domain

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102882991B (en) * 2012-09-29 2016-03-30 北京奇虎科技有限公司 A kind of browser and carry out the method for domain name mapping

Also Published As

Publication number Publication date
CN103399871A (en) 2013-11-20
CN103399871B (en) 2018-06-12

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14822233

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14822233

Country of ref document: EP

Kind code of ref document: A1