CN106776915A - A kind of new clustering algorithm realizes that search engine keywords optimize - Google Patents
A kind of new clustering algorithm realizes that search engine keywords optimize
- Publication number
- CN106776915A (application number CN201611086516.8A)
- Authority
- CN
- China
- Prior art keywords
- keyword
- search engine
- cluster
- follows
- clustering algorithm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A new clustering algorithm is applied to search engine keyword optimization. Core keywords are determined from the enterprise's business, and the data items that a search engine reports for each keyword are collected, such as national monthly search volume, degree of competition, and estimated cost per click (CPC). The collected keyword set is screened to reduce it. Each keyword is then represented as a five-dimensional vector by adding the number of homepage webpages and the total number of search result pages, and this vector is reduced from five dimensions to four. Finally, the keywords are clustered with the new clustering algorithm, which uses an information flow function defined on each ε-neighborhood. According to the applicant, the algorithm is simpler and more effective, has lower running-time complexity and faster processing, and produces classification results that agree better with empirical values. It is intended to help a website raise the ranking of its keywords in a short time and to bring traffic and inquiries to the enterprise website, thereby reaching the website optimization target.
Description
Technical field
The present invention relates to the field of Semantic Web technology, and in particular to a method of search engine keyword optimization using a new clustering algorithm.
Background
Search engines have become an important tool with which many Internet users obtain information. Search engine optimization (SEO) refers to applying a series of optimizations to a website using relevant techniques so as to improve the ranking of the corresponding keywords in a search engine, with the ultimate aim of marketing the website. SEO is, in the end, the optimization of keywords. A keyword is a word or phrase that a user enters when searching for relevant pages, and it is also the term a search engine uses when building its index. Using keywords well helps a site obtain a higher ranking for search queries; note that the purpose of keyword research is to find the most valuable keywords.
Search engine optimization techniques are divided into black-hat and white-hat techniques. Black-hat techniques are malicious optimization techniques that violate search engine optimization rules. In keyword optimization they take the form of stuffing keywords into a page or placing unrelated keywords in it to raise its ranking. Search engines have introduced techniques and rules to penalize websites that use black-hat techniques. White-hat techniques are the optimization techniques that search engines accept.
There is a considerable body of theoretical research on keyword optimization and of practical application in China and abroad, but no effective method has yet been proposed to simplify the keyword analysis workflow, and no complete mechanism exists for managing keyword optimization strategy and progress. To address this need, the invention provides search engine keyword optimization using a new clustering algorithm.
Summary of the invention
To address the technical problem of achieving search engine optimization through keyword optimization, the invention provides search engine keyword optimization using a new clustering algorithm.
To solve the above problem, the invention is realized by the following technical solution:
Step 1: Determine the core keywords from the enterprise's business and collect related keywords using a search engine. Each of these keywords has corresponding data items in the search engine, such as national monthly search volume, degree of competition, and estimated cost per click (CPC).
Step 2: Taking the enterprise's products and market analysis into account, screen the collected set of related keywords to reduce it.
Step 3: For the screened keyword set, search each keyword in the search engine and record the number of homepage webpages and the total number of search result pages from the returned pages. Each keyword is thus described by a five-dimensional vector, which is then reduced to four dimensions.
Step 4: Cluster the keywords using the new clustering algorithm. The sub-steps are as follows:
Step 4.1: Initialize the clusters using a k-means algorithm based on ε-neighborhoods.
Step 4.2: Initialize the information flow function of each ε-neighborhood, and select k initial cluster centers from the data object set D according to a decision condition.
Step 4.3: Reassign each keyword i (i ∈ {1, 2, ..., m}), selecting its cluster center j′ by the probability function p(i).
Step 4.4: Recompute each cluster center according to the result of the decision function Δ(I).
Step 4.5: If any cluster center has changed, go to step 4.2; otherwise the iteration ends and the clustering result is output.
Step 5: According to the enterprise's specific situation, weigh keyword efficiency optimization against value-rate optimization and select a suitable keyword optimization strategy to reach the website optimization target.
The beneficial effects of the invention are:
1. The algorithm simplifies the keyword analysis workflow and thereby reduces the overall website optimization workload.
2. The algorithm has low running-time complexity and processes data faster.
3. The algorithm has greater practical value.
4. It can help a website raise the ranking of its keywords in a short time.
5. It brings traffic and inquiries to the enterprise website, thereby reaching the website optimization target.
6. The classification results of the algorithm agree better with empirical values.
7. The algorithm is simpler and more effective.
8. The data processing results are better.
Brief description of the drawings
Fig. 1 is a structural flow chart of search engine keyword optimization using the new clustering algorithm.
Fig. 2 is a flow chart of the application of the new clustering algorithm in cluster analysis.
Detailed description
To solve the technical problem of achieving search engine optimization through keyword optimization, the invention is described in detail below with reference to Fig. 1 and Fig. 2. The implementation steps are as follows:
Step 1: Determine the core keywords from the enterprise's business and collect related keywords using a search engine. Each of these keywords has corresponding data items in the search engine, such as national monthly search volume, degree of competition, and estimated cost per click (CPC).
Step 2: Taking the enterprise's products and market analysis into account, screen the collected set of related keywords to reduce it.
Step 3: For the screened keyword set, search each keyword in the search engine and record the number of homepage webpages and the total number of search result pages from the returned pages. Each keyword is thus described by a five-dimensional vector, which is then reduced to four dimensions. The calculation is as follows:
Let the number of related keywords be m. The data then form the following m × 5 matrix:
$$\begin{pmatrix} N_1 & Ld_1 & CPC_1 & N_{1S} & N_{1Y} \\ \vdots & \vdots & \vdots & \vdots & \vdots \\ N_m & Ld_m & CPC_m & N_{mS} & N_{mY} \end{pmatrix}$$
Here $N_i$, $Ld_i$, $CPC_i$, $N_{iS}$, $N_{iY}$ are, in order, the national monthly search volume, degree of competition, estimated cost per click (CPC), number of homepage webpages, and total number of search result pages for the i-th keyword.
Each row is then reduced to four dimensions, where $X_i$ (i ∈ {1, 2, ..., m}) is the search efficiency and $Z_i$ (i ∈ {1, 2, ..., m}) is the value rate.
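The data layout in Step 3 can be sketched in code. This is only a minimal sketch: the formulas that define the search efficiency and the value rate are not reproduced in this text, so `placeholder_reducer` below uses stand-in ratios that are purely an assumption, as are all class, field, and function names.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Row5 = Tuple[float, float, float, float, float]
Vec4 = Tuple[float, float, float, float]


@dataclass(frozen=True)
class KeywordStats:
    """One row of the m x 5 matrix (field names are illustrative)."""
    keyword: str
    monthly_searches: float  # N_i
    competition: float       # Ld_i
    cpc: float               # CPC_i
    homepage_count: float    # N_iS
    total_pages: float       # N_iY

    def as_row(self) -> Row5:
        return (self.monthly_searches, self.competition, self.cpc,
                self.homepage_count, self.total_pages)


def placeholder_reducer(row: Row5) -> Vec4:
    """ASSUMPTION: stand-in ratios, not the patent's formulas for X_i and Z_i."""
    n, ld, cpc, n_s, n_y = row       # n_s is unused by this placeholder
    x = n / n_y if n_y else 0.0      # stand-in for search efficiency X_i
    z = cpc / ld if ld else 0.0      # stand-in for value rate Z_i
    return (n, ld, x, z)


def reduce_keywords(stats: List[KeywordStats],
                    reducer: Callable[[Row5], Vec4] = placeholder_reducer
                    ) -> List[Vec4]:
    """Map every five-dimensional keyword row to a four-dimensional vector."""
    return [reducer(s.as_row()) for s in stats]
```

Passing the reducer as a parameter keeps the data handling separate from the reduction formula, so the actual definitions of $X_i$ and $Z_i$ can be substituted once they are known.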
Step 4: Cluster the keywords using the new clustering algorithm. The sub-steps are as follows:
Step 4.1: Initialize the clusters using a k-means algorithm based on ε-neighborhoods.
Step 4.2: Initialize the information flow function of each ε-neighborhood, and select k initial cluster centers from the data object set D according to a decision condition. The calculation is as follows:
In the information flow function, $n_\varepsilon$ is the number of data objects in each ε-neighborhood, and the remaining term is the inner product of the i-th keyword vector in the space and its cluster-center vector.
The decision condition compares against a preset threshold γ: only data objects that satisfy the condition are grouped into a cluster, and k classes are then selected.
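A rough sketch of choosing initial centers from ε-neighborhoods follows. The information flow function and the exact decision condition are not reproduced in this text, so the sketch substitutes a simple neighbor count for the density measure and treats γ as a minimum count. Both substitutions, the use of Euclidean distance, and the function names are assumptions.

```python
import math
from typing import List, Sequence

Point = Sequence[float]


def neighbor_counts(points: List[Point], eps: float) -> List[int]:
    """Count the points within distance eps of each point (itself included)."""
    return [sum(1 for q in points if math.dist(p, q) <= eps) for p in points]


def pick_initial_centers(points: List[Point], k: int,
                         eps: float, gamma: int) -> List[List[float]]:
    """Pick up to k dense, mutually separated points as initial centers.

    ASSUMPTION: the neighbor count stands in for the information flow
    function, and gamma is used as a minimum-count threshold.
    """
    counts = neighbor_counts(points, eps)
    # Candidates whose eps-neighborhood is dense enough, densest first.
    candidates = sorted((i for i, c in enumerate(counts) if c >= gamma),
                        key=lambda i: -counts[i])
    centers: List[List[float]] = []
    for i in candidates:
        # Skip candidates that fall inside an already chosen neighborhood.
        if all(math.dist(points[i], c) > eps for c in centers):
            centers.append(list(points[i]))
        if len(centers) == k:
            break
    return centers
```

If fewer than k candidates pass the threshold, the function returns fewer than k centers; a caller would then need to relax γ or ε.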
Step 4.3: Reassign each keyword i (i ∈ {1, 2, ..., m}), selecting its cluster center j′ by the probability function p(i). The calculation is as follows:
The cluster center j′ selected is the one for which the value of p(i) is largest.
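The reassignment step can be illustrated with a probability over cluster centers followed by an arg-max. The definition of p(i) is not reproduced in this text, so the softmax over inner products used here is only an assumption, chosen because the text mentions the inner product of a keyword vector and a cluster-center vector. The function names are hypothetical.

```python
import math
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def assignment_probabilities(vec: Sequence[float],
                             centers: List[Sequence[float]]) -> List[float]:
    """ASSUMPTION: softmax over inner products stands in for p(i)."""
    scores = [dot(vec, c) for c in centers]
    top = max(scores)                           # subtract max for stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def assign(vec: Sequence[float], centers: List[Sequence[float]]) -> int:
    """Return the index j' of the center with the largest probability."""
    probs = assignment_probabilities(vec, centers)
    return max(range(len(centers)), key=probs.__getitem__)
```

Any monotone scoring of centers would give the same arg-max; the probabilities matter only if a later step uses their values.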
Step 4.4: Recompute each cluster center according to the result of the decision function Δ(I). The calculation is as follows:
When the decision condition on Δ(I) is satisfied, each cluster center is recomputed.
Step 4.5: If any cluster center has changed, go to step 4.2; otherwise the iteration ends and the clustering result is output.
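The loop formed by steps 4.3 to 4.5 (reassign, recompute centers, stop when the centers no longer change) has the same shape as a standard k-means iteration. The sketch below shows that control flow only: it uses nearest-center assignment and the plain mean as stand-ins for p(i) and the decision function Δ(I), whose formulas are not reproduced in this text. The function name and the `max_iter` cap are assumptions.

```python
import math
from typing import List, Sequence, Tuple

Point = Sequence[float]


def cluster(points: List[Point], centers: List[Point],
            max_iter: int = 100) -> Tuple[List[int], List[List[float]]]:
    """Iterate assignment and center updates until the centers stop changing.

    ASSUMPTION: nearest-center assignment and mean updates replace the
    patent's probability function and decision function.
    """
    current = [list(c) for c in centers]
    labels: List[int] = []
    for _ in range(max_iter):
        # Reassign every point to its closest center.
        labels = [min(range(len(current)),
                      key=lambda j: math.dist(p, current[j]))
                  for p in points]
        # Recompute each center as the mean of its members.
        updated = []
        for j, old in enumerate(current):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append([sum(col) / len(members)
                                for col in zip(*members)])
            else:
                updated.append(old)  # keep an empty cluster's center
        if updated == current:       # no center moved: converged
            break
        current = updated
    return labels, current
```

The iteration cap guards against a run that never settles, which the step description itself does not address.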
Step 5: According to the enterprise's specific situation, weigh keyword efficiency optimization against value-rate optimization and select a suitable keyword optimization strategy to reach the website optimization target.
The pseudocode of search engine keyword optimization using the new clustering algorithm is summarized as follows:
Input: the core keywords extracted for the website; the clusters initialized on the basis of ε-neighborhoods; the initialized information flow function of each ε-neighborhood.
Output: the k clusters for which the sum of the global information flow function $I_{i \to j}$ is largest.
Claims (2)
1. A method of search engine keyword optimization using a new clustering algorithm, relating to the field of Semantic Web technology, characterized in that it comprises the following steps:
Step 1: Determine the core keywords from the enterprise's business and collect related keywords using a search engine. Each of these keywords has corresponding data items in the search engine, such as national monthly search volume, degree of competition, and estimated cost per click (CPC).
Step 2: Taking the enterprise's products and market analysis into account, screen the collected set of related keywords to reduce it.
Step 3: For the screened keyword set, search each keyword in the search engine and record the number of homepage webpages and the total number of search result pages from the returned pages. Each keyword is thus described by a five-dimensional vector, which is then reduced to four dimensions. The calculation is as follows:
Let the number of related keywords be m. The data then form an m × 5 matrix, in which $N_i$, $Ld_i$, $CPC_i$, $N_{iS}$, $N_{iY}$ are, in order, the national monthly search volume, degree of competition, estimated cost per click, number of homepage webpages, and total number of search result pages for the i-th keyword.
Each row is then reduced to four dimensions, where $X_i$ is the search efficiency and $Z_i$ is the value rate.
Step 4: Cluster the keywords using the new clustering algorithm. The sub-steps are as follows:
Step 4.1: Initialize the clusters using a k-means algorithm based on ε-neighborhoods.
Step 4.2: Initialize the information flow function of each ε-neighborhood, and select k initial cluster centers from the data object set D according to a decision condition.
Step 4.3: Reassign each keyword i (i ∈ {1, 2, ..., m}), selecting its cluster center j′ by the probability function p(i).
Step 4.4: Recompute each cluster center according to the result of the decision function Δ(I).
Step 4.5: If any cluster center has changed, go to step 4.2; otherwise the iteration ends and the clustering result is output.
Step 5: According to the enterprise's specific situation, weigh keyword efficiency optimization against value-rate optimization and select a suitable keyword optimization strategy to reach the website optimization target.
2. The method of search engine keyword optimization using a new clustering algorithm according to claim 1, characterized in that the calculations in step 4 are as follows:
Step 4: Cluster the keywords using the new clustering algorithm. The sub-steps are as follows:
Step 4.1: Initialize the clusters using a k-means algorithm based on ε-neighborhoods.
Step 4.2: Initialize the information flow function of each ε-neighborhood, and select k initial cluster centers from the data object set D according to a decision condition. The calculation is as follows:
In the information flow function, $n_\varepsilon$ is the number of data objects in each ε-neighborhood, and the remaining term is the inner product of the i-th keyword vector in the space and its cluster-center vector.
The decision condition compares against a preset threshold γ: only data objects that satisfy the condition are grouped into a cluster, and k classes are then selected.
Step 4.3: Reassign each keyword i (i ∈ {1, 2, ..., m}), selecting its cluster center j′ by the probability function p(i). The cluster center j′ selected is the one for which the value of p(i) is largest.
Step 4.4: Recompute each cluster center according to the result of the decision function Δ(I). When the decision condition on Δ(I) is satisfied, each cluster center is recomputed.
Step 4.5: If any cluster center has changed, go to step 4.2; otherwise the iteration ends and the clustering result is output.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611086516.8A CN106776915A (en) | 2016-11-30 | 2016-11-30 | A kind of new clustering algorithm realizes that search engine keywords optimize |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611086516.8A CN106776915A (en) | 2016-11-30 | 2016-11-30 | A kind of new clustering algorithm realizes that search engine keywords optimize |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106776915A true CN106776915A (en) | 2017-05-31 |
Family
ID=58914906
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611086516.8A Pending CN106776915A (en) | 2016-11-30 | 2016-11-30 | A kind of new clustering algorithm realizes that search engine keywords optimize |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106776915A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110059725A (en) * | 2019-03-21 | 2019-07-26 | 中国科学院计算技术研究所 | A kind of detection malicious searches system and method based on search key |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103218435A (en) * | 2013-04-15 | 2013-07-24 | 上海嘉之道企业管理咨询有限公司 | Method and system for clustering Chinese text data |
CN103258000A (en) * | 2013-03-29 | 2013-08-21 | 北界创想(北京)软件有限公司 | Method and device for clustering high-frequency keywords in webpages |
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103258000A (en) * | 2013-03-29 | 2013-08-21 | 北界创想(北京)软件有限公司 | Method and device for clustering high-frequency keywords in webpages |
CN103218435A (en) * | 2013-04-15 | 2013-07-24 | 上海嘉之道企业管理咨询有限公司 | Method and system for clustering Chinese text data |
Non-Patent Citations (2)
Title |
---|
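林元国 等 (Lin Yuanguo et al.): "K-means算法在关键词优化中的应用" (Application of the K-means algorithm in keyword optimization), 《计算机***应用》 * |
邓健爽 等 (Deng Jianshuang et al.): "基于搜索引擎的关键词自动聚类法" (An automatic keyword clustering method based on search engines), 《计算机科学》 (Computer Science) * |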
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110059725A (en) * | 2019-03-21 | 2019-07-26 | 中国科学院计算技术研究所 | A kind of detection malicious searches system and method based on search key |
CN110059725B (en) * | 2019-03-21 | 2021-07-09 | 中国科学院计算技术研究所 | Malicious search detection system and method based on search keywords |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhong et al. | Large patch convolutional neural networks for the scene classification of high spatial resolution imagery | |
CN106649616A (en) | Clustering algorithm achieving search engine keyword optimization | |
CN106933954A (en) | Search engine optimization technology is realized based on Decision Tree Algorithm | |
CN103761286B (en) | A kind of Service Source search method based on user interest | |
Harakawa et al. | Accurate and efficient extraction of hierarchical structure of Web communities for Web video retrieval | |
Huang et al. | Multilabel remote sensing image annotation with multiscale attention and label correlation | |
Chen et al. | Deep net architectures for visual-based clothing image recognition on large database | |
Nezamabadi-pour et al. | Concept learning by fuzzy k-NN classification and relevance feedback for efficient image retrieval | |
CN106909626A (en) | Improved Decision Tree Algorithm realizes search engine optimization technology | |
CN106933953A (en) | A kind of fuzzy K mean cluster algorithm realizes search engine optimization technology | |
Li et al. | Self-supervised learning-based weight adaptive hashing for fast cross-modal retrieval | |
Saikumar et al. | A Lite-SVM Based Semantic Search Model for Bigdata Analytics in Smart Cities | |
CN106874376A (en) | A kind of method of verification search engine keyword optimisation technique | |
CN111061939A (en) | Scientific research academic news keyword matching recommendation method based on deep learning | |
CN106776915A (en) | A kind of new clustering algorithm realizes that search engine keywords optimize | |
Landolsi et al. | Image annotation in social networks using graph and multimodal deep learning features | |
CN106897356A (en) | Improved Fuzzy C mean algorithm realizes that search engine keywords optimize | |
CN106802945A (en) | Fuzzy c-Means Clustering Algorithm based on VSM realizes that search engine keywords optimize | |
CN106874377A (en) | The improved clustering algorithm based on constraints realizes that search engine keywords optimize | |
CN106776923A (en) | Improved clustering algorithm realizes that search engine keywords optimize | |
CN106599118A (en) | Method for realizing search engine keyword optimization by improved density clustering algorithm | |
CN106649537A (en) | Search engine keyword optimization technology based on improved swarm intelligence algorithm | |
CN106933950A (en) | New Model tying algorithm realizes search engine optimization technology | |
Lu et al. | Data mining and social networks processing method based on support vector machine and k-nearest neighbor | |
CN106649536A (en) | Achievement of optimization of search engine keywords based on improved k Means algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170531 |
|
WD01 | Invention patent application deemed withdrawn after publication |