CN103324637A - Method and system for mining hotspot message - Google Patents

Method and system for mining hotspot message

Info

Publication number
CN103324637A
CN103324637A CN2012100790913A CN201210079091A CN103324637A CN 103324637 A CN103324637 A CN 103324637A CN 2012100790913 A CN2012100790913 A CN 2012100790913A CN 201210079091 A CN201210079091 A CN 201210079091A CN 103324637 A CN103324637 A CN 103324637A
Authority
CN
China
Prior art keywords
information
reprinting
hot
page source
intelligence page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012100790913A
Other languages
Chinese (zh)
Other versions
CN103324637B (en)
Inventor
姚磊
何军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201210079091.3A (CN103324637B)
Priority to PCT/CN2013/073011 (WO2013139290A1)
Publication of CN103324637A
Application granted
Publication of CN103324637B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An embodiment of the invention provides a method and a system for mining a hotspot message. The method comprises steps as follows: relative hot values of message webpage sources are calculated according to the access times of the message webpage sources; reproduced weights of reproduction messages in the message webpage sources where the reproduction messages are reproduced are calculated according to the relative hot values of message webpage sources; and the reproduced weights of the reproduction messages in the message webpage sources are summed, the message hot value of each reproduction message is calculated, and the hotspot message is determined from the reproduction messages according to the sequence of the message hot values. With the adoption of the implementation method, the hotspot message can be automatically generated from a whole internet on the basis of the message hot values of the reproduction messages, so that the message pushing efficiency can be improved, labor and cost are saved, an inferior website can be dynamically eliminated, a superior website can be strengthened, and the mining quality can be optimized continuously.

Description

A hotspot information mining method and system
Technical field
Embodiments of the present invention relate to the technical field of Internet applications and, more specifically, to a hotspot information mining method and system.
Background
With the rapid development of computer and network technology, the Internet plays an ever larger role in daily life, study, and work. People have become accustomed to reading Internet news through many channels, such as portal websites and news search websites.
Internet news is news carried over the network. It is fast, many-sided, multi-channel, multimedia, and interactive. It breaks with the traditional concept of news dissemination and offers its audience a new experience in viewing, listening, and feeling. It integrates disordered news in an orderly way and greatly reduces the volume of information, so that people obtain the most useful news in the shortest time. Moreover, future Internet news will no longer be restricted to traditional news publishers: members of the audience will be able to publish their own news and have it spread quickly, and news will become a platform for interaction. As people's understanding grows, Internet news will develop at a deeper level, which will completely overturn the traditional concept of news.
At present, most portal websites and news search websites select some hotspot information and place it on the home page to guide users' reading. For example, some portal websites divide news into categories such as domestic, international, and entertainment, and then present hot news within each category.
However, such hotspot information is generally selected manually by editors or generated by aggregating the home-page articles of several portal websites. Pushing hotspot information in this way is inefficient, wastes manpower, and is strongly influenced by subjective factors.
In addition, in the prior art the news is selected only from a few authoritative websites, so the range of data is small and the hotspot information cannot be guaranteed to be identified accurately.
Summary of the invention
Embodiments of the present invention propose a hotspot information mining method that generates hotspot information automatically and thereby improves the efficiency of information pushing.
Embodiments of the present invention also propose a hotspot information mining system that generates hotspot information automatically and thereby improves the efficiency of information pushing.
The specific scheme of the embodiments of the present invention is as follows:
A hotspot information mining method, comprising:
calculating relative hotness values of information webpage sources according to the access counts of the information webpage sources;
calculating, according to the relative hotness values of the information webpage sources, the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it;
summing the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and determining hotspot information from the reprinted information according to the ordering of the information hotness values.
A hotspot information mining system, comprising:
a relative hotness value calculation unit, configured to calculate relative hotness values of information webpage sources according to the access counts of the information webpage sources;
a reprint weight calculation unit, configured to calculate, according to the relative hotness values of the information webpage sources, the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it;
a hotspot information determining unit, configured to sum the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and to determine hotspot information from the reprinted information according to the ordering of the information hotness values.
As can be seen from the above technical scheme, in the embodiments of the present invention the relative hotness values of the information webpage sources are first calculated from their access counts. The reprint weight of each piece of reprinted information in each source that has reprinted it is then calculated from those relative hotness values. The reprint weights of each piece are summed to give its information hotness value, and hotspot information is determined from the reprinted information according to the ordering of the information hotness values. Hotspot information can therefore be generated automatically from the whole Internet on the basis of the information hotness values of reprinted information, which improves the efficiency of information pushing.
Description of drawings
Fig. 1 is a schematic flowchart of the hotspot information mining method according to an embodiment of the present invention;
Fig. 2 is a schematic diagram of the hotspot information mining system according to an embodiment of the present invention;
Fig. 3 is a schematic diagram of an exemplary hotspot information mining process according to an embodiment of the present invention;
Fig. 4 is a schematic diagram of a reprinted-information identification result according to an embodiment of the present invention;
Fig. 5 is a schematic diagram of a hotspot information display according to an embodiment of the present invention.
Detailed description
To make the purpose, technical scheme, and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings.
In the embodiments of the present invention, each information webpage source is treated as a voter, each piece of reprinted information as the subject of the vote, and the popularity of each information webpage source as the weight of its vote. The voting score of each piece of reprinted information is calculated comprehensively; pieces with high scores are regarded as hotspot information and ranked first. Because news takes time to spread, the publication time of the reprinted information can be used as a correction factor to adjust the voting score and obtain the final hotness ranking.
Fig. 1 is a schematic flowchart of the hotspot information mining method according to an embodiment of the present invention.
As shown in Fig. 1, the method comprises:
Step 101: Calculate the relative hotness values of the information webpage sources according to their access counts.
Here, the access hotness of each information webpage source can be calculated from the access log of hotspot information in that source together with the access log of its other news. For example, the access count recorded in the hotspot information access log and the access count recorded in the access log of other news are added together to give the access count of the information webpage source.
Preferably, the information webpage sources can be news websites of various types.
The access hotness of an information webpage source can be calculated in many ways. The principle is that the more often a source is accessed, the higher its relative hotness value should be. For example:
For the k-th information webpage source, its relative hotness value SiteHotness_k is calculated as:
$$\mathrm{SiteHotness}_k = \mathrm{norm} \cdot \frac{\log(\mathrm{AccessCount}_k)}{\sum_{j \in K} \log(\mathrm{AccessCount}_j)}$$
where norm is a normalization coefficient, AccessCount_k is the access count of the k-th information webpage source, and K is the set of all information webpage sources.
For example, suppose a search engine has collected news from three information webpage sources A, B, and C, whose access counts (AccessCount) in the search engine are 50, 20, and 30 respectively.
Then the hotness of website C is SiteHotness_C = norm * (log(30) / log(50+20+30));
the hotness of website B is SiteHotness_B = norm * (log(20) / log(50+20+30));
the hotness of website A is SiteHotness_A = norm * (log(50) / log(50+20+30)).
The base of the logarithm may be 10 or e. This guarantees that the hotness SiteHotness_A of website A is greater than the hotness SiteHotness_C of website C, and that SiteHotness_C is greater than the hotness SiteHotness_B of website B.
The specific value of norm may be changed or adjusted according to practical experience.
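The calculation in Step 101 can be sketched in a few lines of Python. This is a minimal illustration, not the patented implementation: the function name `site_hotness` and the default `norm=1.0` are assumptions, and every access count is assumed to be greater than 1 so that each logarithm is positive. The sketch follows the formula as stated (the denominator is the sum of the logarithms), whereas the worked example in the text writes the denominator as log(50+20+30); the resulting order A > C > B is the same either way.

```python
import math

def site_hotness(access_counts, norm=1.0):
    """Relative hotness per source: norm * log(count) / sum of log(count) over all sources."""
    # Assumption: every count is greater than 1, so each logarithm is positive.
    logs = {site: math.log(count) for site, count in access_counts.items()}
    total = sum(logs.values())
    return {site: norm * value / total for site, value in logs.items()}

# Access counts of sources A, B, and C taken from the example in the text.
hotness = site_hotness({"A": 50, "B": 20, "C": 30})
ranking = sorted(hotness, key=hotness.get, reverse=True)
print(ranking)
```

With this normalization the relative hotness values of all sources add up to norm, which makes the values comparable between runs with different numbers of sources.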
Step 102: According to the relative hotness values of the information webpage sources, calculate the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it.
Here, the reprinted information can be identified in the information webpage sources by a similarity algorithm based on text features. The algorithm identifies how each news item has been republished, that is, which news items are reprints of the same story.
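The text does not say which text-feature similarity algorithm is used. As one possible illustration, the sketch below groups articles by the Jaccard similarity of their word 3-gram sets. The functions `shingles`, `jaccard`, and `group_reprints`, the threshold of 0.6, and the sample articles are all assumptions made for this example.

```python
def shingles(text, size=3):
    """Return the set of overlapping word n-grams in the text."""
    words = text.lower().split()
    if len(words) < size:
        return {tuple(words)}
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}

def jaccard(a, b):
    """Jaccard similarity of two sets, in the range 0 to 1."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def group_reprints(articles, threshold=0.6):
    """Greedily group (source, text) pairs whose text is similar to a group's first member."""
    groups = []  # each group keeps the features of its first article and its sources
    for source, text in articles:
        features = shingles(text)
        for group in groups:
            if jaccard(features, group["features"]) >= threshold:
                group["sources"].append(source)
                break
        else:
            groups.append({"features": features, "sources": [source]})
    return [group["sources"] for group in groups]

# Hypothetical articles from three sources; A and B carry the same story.
articles = [
    ("A", "China software market expected to reach 71.5 billion yuan in 2015"),
    ("B", "China software market expected to reach 71.5 billion yuan in 2015 report says"),
    ("C", "Local football team wins the national championship after extra time"),
]
groups = group_reprints(articles)
print(groups)
```

Each resulting group corresponds to one piece of reprinted information, and the list of sources in the group plays the role of the set K used in the hotness formula.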
Preferably, a time factor can further be determined from the publication time of each piece of reprinted information and used to correct each information hotness value. As an example, the time at which the information was reprinted can also be used as the time factor.
For example, for the i-th piece of reprinted information, its information hotness value NewsHotness_i is calculated as:
$$\mathrm{NewsHotness}_i = f(\mathrm{PublishTime}) \cdot \sum_{k \in K} \mathrm{CitationHotness}_k$$
$$\mathrm{CitationHotness}_k = g(\mathrm{SiteHotness}_k)$$
where K is the set of all information webpage sources that have reprinted the i-th piece of reprinted information; PublishTime is the publication time of the i-th piece of reprinted information; f(PublishTime) is a time weighting function of PublishTime; CitationHotness_k is the reprint weight of the i-th piece of reprinted information in the k-th information webpage source that has reprinted it; and g(SiteHotness_k) is a hotness weighting function of SiteHotness_k.
The time weighting function f(PublishTime) ensures the timeliness of the information hotness value NewsHotness_i. In general, the closer the publication time PublishTime is to the current time, the larger the value of f(PublishTime) should be.
The time weighting function f(PublishTime) can take many specific forms, linear or nonlinear. As long as it follows the basic principle that the closer PublishTime is to the current time, the larger the value of f(PublishTime) (and hence the larger the value of NewsHotness_i), the embodiments of the present invention do not limit the specific functional form of f(PublishTime).
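Since the text leaves the form of f(PublishTime) open, the sketch below shows one linear and one nonlinear candidate that both satisfy the stated principle. The function names, the 48-hour window, and the 12-hour half-life are assumptions chosen only for illustration.

```python
from datetime import datetime, timedelta

def age_in_hours(publish_time, now):
    """Hours elapsed since publication, never negative."""
    return max(0.0, (now - publish_time).total_seconds() / 3600.0)

def f_linear(publish_time, now, window_hours=48.0):
    """Linear decay: 1.0 for a brand-new item, 0.0 once it is window_hours old."""
    return max(0.0, 1.0 - age_in_hours(publish_time, now) / window_hours)

def f_exponential(publish_time, now, half_life_hours=12.0):
    """Exponential decay: the weight halves every half_life_hours."""
    return 0.5 ** (age_in_hours(publish_time, now) / half_life_hours)

now = datetime(2012, 3, 23, 12, 0)
recent = now - timedelta(hours=2)
older = now - timedelta(hours=24)
print(f_linear(recent, now), f_linear(older, now))
print(f_exponential(recent, now), f_exponential(older, now))
```

Both functions return a larger value for the more recent item. The linear form drops items entirely after the window, while the exponential form never reaches zero, so the choice affects how long an old but widely reprinted item can stay in the ranking.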
g(SiteHotness_k) is the hotness weighting function, which ensures that the reprint weight CitationHotness_k reflects the quality of the source. In general, the higher the relative hotness value SiteHotness_k of a website, the larger its reprint weight CitationHotness_k should be.
Similarly, the hotness weighting function g(SiteHotness_k) can take many specific forms, linear or nonlinear. As long as it follows the basic principle that the higher the relative hotness value SiteHotness_k of a website, the larger the reprint weight CitationHotness_k, the embodiments of the present invention do not limit the specific functional form of g(SiteHotness_k).
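The combination of the two weighting functions can be sketched as follows. This is an illustration under stated assumptions: `g` is taken to be a simple linear function, the time weight f(PublishTime) is passed in as a precomputed number, and the site hotness values are made-up inputs rather than values derived from real access logs.

```python
def g(site_hotness_value, scale=1.0):
    """Hypothetical linear hotness weighting: the reprint weight grows with site hotness."""
    return scale * site_hotness_value

def news_hotness(reprinting_sites, site_hotness, time_weight=1.0):
    """NewsHotness_i = f(PublishTime) * sum of CitationHotness_k over the sources in K."""
    citation_sum = sum(g(site_hotness[site]) for site in reprinting_sites)
    return time_weight * citation_sum

# Illustrative relative hotness values for three sources.
site_hotness = {"A": 0.5, "B": 0.2, "C": 0.3}

# An older item reprinted by two popular sources versus a fresh item on one minor source.
score_wide = news_hotness({"A", "C"}, site_hotness, time_weight=0.5)
score_narrow = news_hotness({"B"}, site_hotness, time_weight=1.0)
print(score_wide, score_narrow)
```

In this example the widely reprinted item still scores higher despite its lower time weight, which shows how the sum over K and the time factor trade off against each other.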
Step 103: Sum the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and determine hotspot information from the reprinted information according to the ordering of the information hotness values.
Here, the reprint weights of each piece of reprinted information are summed to obtain its information hotness score. After the pieces are sorted by score, a suitable number of news items can be selected for display.
For example, the system can be preset to display 10 pieces of hotspot information. After the reprinted information is sorted by information hotness value, the 10 highest-scoring news items are selected and displayed as hotspot information.
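Selecting the highest-scoring items in Step 103 is a standard top-N selection. A minimal sketch using Python's `heapq.nlargest` is shown below; the function name `top_hotspots`, the item identifiers, and the scores are hypothetical.

```python
import heapq

def top_hotspots(news_hotness, n=10):
    """Return the n (item, score) pairs with the highest hotness, highest first."""
    return heapq.nlargest(n, news_hotness.items(), key=lambda pair: pair[1])

# Hypothetical information hotness values for four pieces of reprinted information.
scores = {"news-1": 0.40, "news-2": 0.95, "news-3": 0.10, "news-4": 0.70}
top_two = top_hotspots(scores, n=2)
print(top_two)
```

If fewer than n items exist, all of them are returned in descending order of hotness.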
Preferably, in the embodiments of the present invention, all news can first be classified, for example into domestic, international, and entertainment news, and the method can then be applied within each category to mine the hotspot information of that category.
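Per-category mining can be sketched by bucketing the scored items by category before ranking. The function name `hotspots_by_category`, the tuple layout, and the sample data are assumptions for illustration; how the categories are assigned is outside the scope of this sketch.

```python
from collections import defaultdict

def hotspots_by_category(items, n=10):
    """items: iterable of (title, category, hotness); returns the top n titles per category."""
    buckets = defaultdict(list)
    for title, category, hotness in items:
        buckets[category].append((hotness, title))
    return {
        category: [title for _, title in sorted(entries, reverse=True)[:n]]
        for category, entries in buckets.items()
    }

# Hypothetical scored items in the three categories named in the text.
items = [
    ("d1", "domestic", 0.4),
    ("d2", "domestic", 0.9),
    ("i1", "international", 0.7),
    ("e1", "entertainment", 0.2),
    ("d3", "domestic", 0.1),
]
result = hotspots_by_category(items, n=2)
print(result)
```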
Based on the above analysis, embodiments of the present invention also propose a hotspot information mining system.
Fig. 2 is a schematic diagram of the hotspot information mining system according to an embodiment of the present invention.
As shown in Fig. 2, the system comprises a relative hotness value calculation unit 201, a reprint weight calculation unit 202, and a hotspot information determining unit 203.
Wherein:
the relative hotness value calculation unit 201 is configured to calculate relative hotness values of information webpage sources according to the access counts of the information webpage sources;
the reprint weight calculation unit 202 is configured to calculate, according to the relative hotness values of the information webpage sources, the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it;
the hotspot information determining unit 203 is configured to sum the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and to determine hotspot information from the reprinted information according to the ordering of the information hotness values.
Preferably, the hotspot information determining unit 203 is further configured to determine a time factor from the publication time of each piece of reprinted information and to use the time factor to correct each information hotness value.
Preferably, the reprint weight calculation unit 202 is further configured to identify the reprinted information in the information webpage sources by a similarity algorithm based on text features.
In one embodiment, the relative hotness value calculation unit 201 is configured to calculate, for the k-th information webpage source, its relative hotness value SiteHotness_k as:
$$\mathrm{SiteHotness}_k = \mathrm{norm} \cdot \frac{\log(\mathrm{AccessCount}_k)}{\sum_{j \in K} \log(\mathrm{AccessCount}_j)}$$
where norm is a normalization coefficient, AccessCount_k is the access count of the k-th information webpage source, and K is the set of all information webpage sources.
In one embodiment, the reprint weight calculation unit 202 is configured to calculate, for the i-th piece of reprinted information, its information hotness value NewsHotness_i as:
$$\mathrm{NewsHotness}_i = f(\mathrm{PublishTime}) \cdot \sum_{k \in K} \mathrm{CitationHotness}_k$$
$$\mathrm{CitationHotness}_k = g(\mathrm{SiteHotness}_k)$$
where K is the set of all information webpage sources that have reprinted the i-th piece of reprinted information; PublishTime is the publication time of the i-th piece of reprinted information; f(PublishTime) is a time weighting function of PublishTime; CitationHotness_k is the reprint weight of the i-th piece of reprinted information in the k-th information webpage source that has reprinted it; and g(SiteHotness_k) is a hotness weighting function of SiteHotness_k.
Similarly, the time weighting function f(PublishTime) ensures the timeliness of the information hotness value NewsHotness_i. In general, the closer the publication time PublishTime is to the current time, the larger the value of f(PublishTime) should be.
The time weighting function f(PublishTime) can take many specific forms, linear or nonlinear. As long as it follows the basic principle that the closer PublishTime is to the current time, the larger the value of f(PublishTime), the embodiments of the present invention do not limit the specific functional form of f(PublishTime).
g(SiteHotness_k) is the hotness weighting function, which ensures that the reprint weight CitationHotness_k reflects the quality of the source. In general, the higher the relative hotness value SiteHotness_k of a website, the larger its reprint weight CitationHotness_k should be.
Similarly, the hotness weighting function g(SiteHotness_k) can take many specific forms, linear or nonlinear. As long as it follows the basic principle that the higher the relative hotness value SiteHotness_k of a website, the larger the reprint weight CitationHotness_k, the embodiments of the present invention do not limit the specific functional form of g(SiteHotness_k).
In one embodiment, the system further comprises a hotspot information display unit 204, configured to display the hotspot information determined from the reprinted information. For example, the hotspot information display unit 204 can be preset to display 10 pieces of hotspot information: after the reprinted information is sorted by information hotness value, the 10 highest-scoring news items are selected and displayed as hotspot information.
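The division of the system into units 201, 202, and 203 can be mirrored by a small set of cooperating classes. The sketch below is only one way to arrange them: the class and method names are hypothetical, g is taken to be the identity function, the time factor is omitted for brevity, and every access count is assumed to be greater than 1.

```python
import math

class RelativeHotnessUnit:
    """Unit 201: relative hotness of each source from its access count."""
    def compute(self, access_counts, norm=1.0):
        total = sum(math.log(count) for count in access_counts.values())
        return {site: norm * math.log(count) / total
                for site, count in access_counts.items()}

class ReprintWeightUnit:
    """Unit 202: reprint weight of each item in each source that carries it."""
    def compute(self, reprints, site_hotness):
        # Assumption: g is the identity, so the weight equals the site hotness.
        return {item: {site: site_hotness[site] for site in sites}
                for item, sites in reprints.items()}

class HotspotDeterminingUnit:
    """Unit 203: sum the reprint weights and keep the highest-scoring items."""
    def compute(self, reprint_weights, n=10):
        totals = {item: sum(weights.values())
                  for item, weights in reprint_weights.items()}
        return sorted(totals, key=totals.get, reverse=True)[:n]

class HotspotMiningSystem:
    """Wires the three units together in the order described in the text."""
    def __init__(self):
        self.hotness_unit = RelativeHotnessUnit()
        self.weight_unit = ReprintWeightUnit()
        self.hotspot_unit = HotspotDeterminingUnit()

    def mine(self, access_counts, reprints, n=10):
        site_hotness = self.hotness_unit.compute(access_counts)
        weights = self.weight_unit.compute(reprints, site_hotness)
        return self.hotspot_unit.compute(weights, n)

system = HotspotMiningSystem()
hot = system.mine(
    access_counts={"A": 50, "B": 20, "C": 30},
    reprints={"x": ["A", "C"], "y": ["B"], "z": ["A", "B", "C"]},
    n=2,
)
print(hot)
```

A display unit such as unit 204 would simply consume the list returned by `mine`.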
According to the embodiments of the present invention, hot news can be mined from the many news sources on the Internet. Based on the detailed analysis above, Fig. 3 is a schematic diagram of an exemplary hot news mining process according to an embodiment of the present invention.
As shown in Fig. 3, at processing block 1, a massive volume of news is crawled from the many news sources on the Internet (such as news websites), and the republication of each news item is identified, that is, which news items are reprints of the same story.
For example, the identification here can use a similarity calculation based on text features.
As an example, Fig. 4 is a schematic diagram of a reprinted-news identification result according to an embodiment of the present invention.
The news items titled "China's software market is expected to reach 71.5 billion yuan in 2015" from the different sources shown in Fig. 4 are in fact reprints of the same news item.
In processing block 2, the relative hotness value (that is, the access hotness) of each news website is calculated from the hot news access logs of the news websites and the access logs of their other news.
The relative hotness value of each website is calculated as follows:
$$\mathrm{SiteHotness}_k = \mathrm{norm} \cdot \frac{\log(\mathrm{AccessCount}_k)}{\sum_{j \in K} \log(\mathrm{AccessCount}_j)}$$
where K is the set of all websites, norm is a normalization coefficient, and AccessCount_k is the access count of the k-th news website.
In processing block 3, the news hotness value is calculated by combining the reprint identification result of processing block 1, the publication time of the reprinted news, and the relative hotness value of each news website calculated in processing block 2.
For example, for the i-th reprinted news item, its news hotness value NewsHotness_i is calculated as:
$$\mathrm{NewsHotness}_i = f(\mathrm{PublishTime}) \cdot \sum_{k \in K} \mathrm{CitationHotness}_k$$
$$\mathrm{CitationHotness}_k = g(\mathrm{SiteHotness}_k)$$
where K is the set of all news websites that have reprinted the i-th reprinted news item; PublishTime is the publication time of the i-th reprinted news item; f(PublishTime) is a time weighting function of PublishTime; CitationHotness_k is the reprint weight of the i-th reprinted news item in the k-th news website that has reprinted it; and g(SiteHotness_k) is a hotness weighting function of SiteHotness_k.
The time weighting function f(PublishTime) ensures the timeliness of the news hotness value NewsHotness_i. In general, the closer the publication time PublishTime is to the current time, the larger the value of f(PublishTime) should be.
The time weighting function f(PublishTime) can take many specific forms, linear or nonlinear. As long as it follows the basic principle that the closer PublishTime is to the current time, the larger the value of f(PublishTime), the embodiments of the present invention do not limit the specific functional form of f(PublishTime).
In processing block 4, hot news is determined from the calculation results of processing block 3 and pushed to users in various ways, such as microblog, webpage, and email. Once determined, the hot news can be stored in the hot news access log so that users can retrieve it at any time.
For example, Fig. 5 is a schematic diagram of a hotspot information display according to an embodiment of the present invention. Preferably, the pushed result also shows the specific source of each piece of hotspot information.
In the embodiments of the present invention, the relative hotness values of the information webpage sources are first calculated from their access counts. The reprint weight of each piece of reprinted information in each source that has reprinted it is then calculated from those relative hotness values. The reprint weights of each piece are summed to give its information hotness value, and hotspot information is determined from the reprinted information according to the ordering of the information hotness values. Hotspot information can therefore be generated automatically from the whole Internet on the basis of the information hotness values of reprinted information, which saves manpower and reduces cost.
Moreover, the embodiments of the present invention can support displaying any number of hot news items and can support calculation over the news of the whole Internet. By automatically mining the authority of the sources, low-quality websites can be dynamically eliminated and high-quality websites strengthened, so that the mining quality is continuously optimized.
The above are only preferred embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (12)

1. A hotspot information mining method, characterized in that the method comprises:
calculating relative hotness values of information webpage sources according to the access counts of the information webpage sources;
calculating, according to the relative hotness values of the information webpage sources, the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it;
summing the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and determining hotspot information from the reprinted information according to the ordering of the information hotness values.
2. The hotspot information mining method according to claim 1, characterized in that the method further comprises: determining a time factor from the publication time of each piece of reprinted information, and using the time factor to correct each information hotness value.
3. The hotspot information mining method according to claim 1, characterized in that the method further comprises: identifying the reprinted information in the information webpage sources by a similarity algorithm based on text features.
4. The hotspot information mining method according to claim 1, characterized in that
calculating the relative hotness values of the information webpage sources according to the access counts of the information webpage sources comprises:
for the k-th information webpage source, calculating its relative hotness value SiteHotness_k as:
$$\mathrm{SiteHotness}_k = \mathrm{norm} \cdot \frac{\log(\mathrm{AccessCount}_k)}{\sum_{j \in K} \log(\mathrm{AccessCount}_j)}$$
where norm is a normalization coefficient, AccessCount_k is the access count of the k-th information webpage source, and K is the set of all information webpage sources.
5. The hotspot information mining method according to claim 1, characterized in that calculating the information hotness value comprises:
for the i-th piece of reprinted information, calculating its information hotness value NewsHotness_i as:
$$\mathrm{NewsHotness}_i = f(\mathrm{PublishTime}) \cdot \sum_{k \in K} \mathrm{CitationHotness}_k$$
$$\mathrm{CitationHotness}_k = g(\mathrm{SiteHotness}_k)$$
where K is the set of all information webpage sources that have reprinted the i-th piece of reprinted information; PublishTime is the publication time of the i-th piece of reprinted information; f(PublishTime) is a time weighting function of PublishTime; CitationHotness_k is the reprint weight of the i-th piece of reprinted information in the k-th information webpage source that has reprinted it; and g(SiteHotness_k) is a hotness weighting function of SiteHotness_k.
6. The hotspot information mining method according to any one of claims 1 to 5, characterized in that the method further comprises:
displaying the hotspot information determined from the reprinted information.
7. A hotspot information mining system, characterized in that the system comprises:
a relative hotness value calculation unit, configured to calculate relative hotness values of information webpage sources according to the access counts of the information webpage sources;
a reprint weight calculation unit, configured to calculate, according to the relative hotness values of the information webpage sources, the reprint weight of each piece of reprinted information in each information webpage source that has reprinted it;
a hotspot information determining unit, configured to sum the reprint weights of each piece of reprinted information over the information webpage sources to obtain the information hotness value of each piece of reprinted information, and to determine hotspot information from the reprinted information according to the ordering of the information hotness values.
8. The hotspot information mining system according to claim 7, characterized in that the hotspot information determining unit is further configured to determine a time factor from the publication time of each piece of reprinted information and to use the time factor to correct each information hotness value.
9. The hotspot information mining system according to claim 7, characterized in that the reprint weight calculation unit is further configured to identify the reprinted information in the information webpage sources by a similarity algorithm based on text features.
10. The hotspot information mining system according to claim 7, characterized in that
the relative hotness value calculation unit is configured to calculate, for the k-th information webpage source, its relative hotness value SiteHotness_k as:
$$\mathrm{SiteHotness}_k = \mathrm{norm} \cdot \frac{\log(\mathrm{AccessCount}_k)}{\sum_{j \in K} \log(\mathrm{AccessCount}_j)}$$
where norm is a normalization coefficient, AccessCount_k is the access count of the k-th information webpage source, and K is the set of all information webpage sources.
11. The hot information mining system according to claim 7, characterized in that
the reprint weight calculation unit is configured to calculate, for the i-th reprinted information item, its hotness value NewsHotness_i:
$$\mathrm{NewsHotness}_i = f(\mathrm{PublishTime}) \cdot \sum_{k \in K} \mathrm{CitationHotness}_k;$$
$$\mathrm{CitationHotness}_k = g(\mathrm{SiteHotness}_k);$$
where K is the set of all information webpage sources that have reprinted the i-th reprinted information item; PublishTime is the publication time of the i-th reprinted information item; f(PublishTime) is a time weight-adjustment function of PublishTime; CitationHotness_k is the reprint weight of the i-th reprinted information item in the k-th information webpage source that has reprinted it; and g(SiteHotness_k) is a hotness weight-adjustment function of SiteHotness_k.
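Claim 11 leaves the functions f and g unspecified. The sketch below shows one way the formula could be evaluated, assuming an exponential decay with a 24-hour half-life for f and the identity function for g; both choices, and all function names, are hypothetical and serve only to make the example concrete.

```python
from datetime import datetime, timedelta


def time_weight(publish_time, now, half_life_hours=24.0):
    """Assumed f(PublishTime): exponential decay, halving every half_life_hours."""
    age_hours = max((now - publish_time).total_seconds() / 3600.0, 0.0)
    return 0.5 ** (age_hours / half_life_hours)


def citation_hotness(site_hotness_k):
    """Assumed g(SiteHotness_k): identity, so the weight equals the site hotness."""
    return site_hotness_k


def news_hotness(publish_time, reprinting_site_hotness, now):
    """NewsHotness_i = f(PublishTime) * sum over K of CitationHotness_k."""
    total = sum(citation_hotness(h) for h in reprinting_site_hotness)
    return time_weight(publish_time, now) * total


# Usage: the same item scored at publication time and one day later.
now = datetime(2012, 3, 23, 12, 0)
fresh = news_hotness(now, [0.5, 0.3], now)
day_old = news_hotness(now - timedelta(hours=24), [0.5, 0.3], now)
```

Under these assumptions, an item reprinted by two sources with relative hotness 0.5 and 0.3 scores 0.8 when just published and half of that after one half-life, so older items gradually drop out of the hot list.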
12. The hot information mining system according to any one of claims 7-10, characterized in that the system further comprises a hot information display unit;
the hot information display unit is configured to display the hot information determined from the reprinted information.
CN201210079091.3A 2012-03-23 2012-03-23 Hot information mining method and system Active CN103324637B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201210079091.3A CN103324637B (en) 2012-03-23 2012-03-23 Hot information mining method and system
PCT/CN2013/073011 WO2013139290A1 (en) 2012-03-23 2013-03-21 Hot information mining method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210079091.3A CN103324637B (en) 2012-03-23 2012-03-23 Hot information mining method and system

Publications (2)

Publication Number Publication Date
CN103324637A (en) 2013-09-25
CN103324637B (en) 2017-12-12

Family

ID=49193384

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210079091.3A Active CN103324637B (en) 2012-03-23 2012-03-23 Hot information mining method and system

Country Status (2)

Country Link
CN (1) CN103324637B (en)
WO (1) WO2013139290A1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714132A (en) * 2013-12-17 2014-04-09 北京本果信息技术有限公司 Method and equipment used for mining hot events based on regions and industries
CN104504059A (en) * 2014-12-22 2015-04-08 合一网络技术(北京)有限公司 Multimedia resource recommending method
CN105045890A (en) * 2015-07-29 2015-11-11 百度在线网络技术(北京)有限公司 Method and device for determining hot news in target news source
CN105450608A (en) * 2014-08-28 2016-03-30 华为技术有限公司 Digital media content pushing method and digital media content pushing device
CN105630929A (en) * 2015-12-22 2016-06-01 北京奇虎科技有限公司 Comment based news recommendation weight determination method and apparatus
WO2016091051A1 (en) * 2014-12-12 2016-06-16 北京奇虎科技有限公司 Method and device for identifying web page type
CN105843963A (en) * 2016-04-19 2016-08-10 北京金山安全软件有限公司 Website selection method and server
CN106383919A (en) * 2016-11-21 2017-02-08 青岛农业大学 Method and system for determining news transmission effect
CN107179996A (en) * 2016-03-10 2017-09-19 爱思开海力士有限公司 Data storage device and its operating method
CN109145246A (en) * 2018-07-31 2019-01-04 成都华栖云科技有限公司 A kind of news virtual click amount implementation method based on paas media cloud multi-tenant platform
CN112202889A (en) * 2020-09-30 2021-01-08 深圳前海微众银行股份有限公司 Information pushing method and device and storage medium
CN113987372A (en) * 2021-12-27 2022-01-28 昆仑智汇数据科技(北京)有限公司 Hot data acquisition method, device and equipment of domain business object model
CN114221988A (en) * 2021-11-03 2022-03-22 新浪网技术(中国)有限公司 Content distribution network hotspot analysis method and system
US11360640B2 (en) 2017-03-22 2022-06-14 Alibaba Group Holding Limited Method, device and browser for presenting recommended news, and electronic device

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108205589B (en) * 2017-12-29 2022-02-15 成都优易数据有限公司 Heat iterative calculation method
CN113468428A (en) * 2021-07-16 2021-10-01 中国银行股份有限公司 Hotspot acquisition method, device and equipment and readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101127046A (en) * 2007-09-25 2008-02-20 腾讯科技(深圳)有限公司 Method and system for sequencing to blog article
CN101140587A (en) * 2007-10-15 2008-03-12 深圳市迅雷网络技术有限公司 Searching method and apparatus
CN101246498A (en) * 2008-03-27 2008-08-20 腾讯科技(深圳)有限公司 News web page searching method
CN101814171A (en) * 2009-02-24 2010-08-25 李晓萌 Media-oriented network influence index calculation method
US9418114B1 (en) * 2013-06-19 2016-08-16 Google Inc. Augmenting a content item using search results content
US9460198B1 (en) * 2012-07-26 2016-10-04 Google Inc. Process for serializing and deserializing data described by a schema

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101409634B (en) * 2007-10-10 2011-04-13 中国科学院自动化研究所 Quantitative analysis tools and method for internet news influence based on information retrieval
CN101625693A (en) * 2009-08-10 2010-01-13 北京精讯云顿数据软件有限公司 Method and system of online article statistics

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103714132A (en) * 2013-12-17 2014-04-09 北京本果信息技术有限公司 Method and equipment used for mining hot events based on regions and industries
CN105450608A (en) * 2014-08-28 2016-03-30 华为技术有限公司 Digital media content pushing method and digital media content pushing device
WO2016091051A1 (en) * 2014-12-12 2016-06-16 北京奇虎科技有限公司 Method and device for identifying web page type
CN104504059A (en) * 2014-12-22 2015-04-08 合一网络技术(北京)有限公司 Multimedia resource recommending method
CN104504059B (en) * 2014-12-22 2018-03-27 合一网络技术(北京)有限公司 Multimedia resource recommends method
CN105045890A (en) * 2015-07-29 2015-11-11 百度在线网络技术(北京)有限公司 Method and device for determining hot news in target news source
CN105630929A (en) * 2015-12-22 2016-06-01 北京奇虎科技有限公司 Comment based news recommendation weight determination method and apparatus
CN107179996A (en) * 2016-03-10 2017-09-19 爱思开海力士有限公司 Data storage device and its operating method
CN107179996B (en) * 2016-03-10 2020-12-08 爱思开海力士有限公司 Data storage device and method of operating the same
CN105843963A (en) * 2016-04-19 2016-08-10 北京金山安全软件有限公司 Website selection method and server
CN106383919A (en) * 2016-11-21 2017-02-08 青岛农业大学 Method and system for determining news transmission effect
US11360640B2 (en) 2017-03-22 2022-06-14 Alibaba Group Holding Limited Method, device and browser for presenting recommended news, and electronic device
CN109145246A (en) * 2018-07-31 2019-01-04 成都华栖云科技有限公司 A kind of news virtual click amount implementation method based on paas media cloud multi-tenant platform
CN112202889A (en) * 2020-09-30 2021-01-08 深圳前海微众银行股份有限公司 Information pushing method and device and storage medium
WO2022068659A1 (en) * 2020-09-30 2022-04-07 深圳前海微众银行股份有限公司 Information pushing method and apparatus and storage medium
CN112202889B (en) * 2020-09-30 2023-05-23 深圳前海微众银行股份有限公司 Information pushing method, device and storage medium
CN114221988A (en) * 2021-11-03 2022-03-22 新浪网技术(中国)有限公司 Content distribution network hotspot analysis method and system
CN114221988B (en) * 2021-11-03 2024-05-03 新浪技术(中国)有限公司 Content distribution network hotspot analysis method and system
CN113987372A (en) * 2021-12-27 2022-01-28 昆仑智汇数据科技(北京)有限公司 Hot data acquisition method, device and equipment of domain business object model

Also Published As

Publication number Publication date
WO2013139290A1 (en) 2013-09-26
CN103324637B (en) 2017-12-12

Similar Documents

Publication Publication Date Title
CN103324637A (en) Method and system for mining hotspot message
US10698960B2 (en) Content validation and coding for search engine optimization
CN102929959B (en) A kind of book recommendation method based on user behavior
US8990208B2 (en) Information management and networking
CN102254038B (en) System and method for analyzing network comment relevance
CN105389389B (en) A kind of network public-opinion propagation situation medium control analysis method
CN105247507A (en) Influence score of a brand
CN103077172B (en) A kind of method and apparatus for excavating cheating user
US20120042020A1 (en) Micro-blog message filtering
US20130173574A1 (en) Search engine optimization with secured search
CN103324666A (en) Topic tracing method and device based on micro-blog data
CN102088419A (en) Method and system for searching information of good friends in social network
CN103886501B (en) Post-loan risk early warning system based on semantic emotion analysis
CN105389329A (en) Open source software recommendation method based on group comments
CN107526718A (en) Method and apparatus for generating text
CN104182457A (en) Poisson-process-model-based method for predicting event popularity in social network
CN105988975A (en) Automatic chapter cutting method
US20130054591A1 (en) Search engine optimization recommendations based on social signals
CN103577504A (en) Method and device for putting personalized contents
US20140214788A1 (en) Analyzing uniform resource locators
CN103631946A (en) Content pushing system based on geographic positions
CN104281619A (en) System and method for ordering search results
CN102664744A (en) Group-sending recommendation method in network message communication
CN107862555A (en) Forecasting system and method based on exponential smoothing
CN102999576A (en) Method and equipment for confirming page description information corresponding to target pages

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131021

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518044 SHENZHEN, GUANGDONG PROVINCE TO: 518057 SHENZHEN, GUANGDONG PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20131021

Address after: 518057 Tencent Building, 16, Nanshan District hi tech park, Guangdong, Shenzhen

Applicant after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518044 Zhenxing Road, SEG Science Park 2 East Room 403

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant