CN103473271A - Optimized storing method for mass data - Google Patents

Optimized storing method for mass data Download PDF

Info

Publication number
CN103473271A
CN103473271A CN2013103631307A CN201310363130A CN103473271A CN 103473271 A CN103473271 A CN 103473271A CN 2013103631307 A CN2013103631307 A CN 2013103631307A CN 201310363130 A CN201310363130 A CN 201310363130A CN 103473271 A CN103473271 A CN 103473271A
Authority
CN
China
Prior art keywords
data
magnitudes
time period
cutting
reach
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013103631307A
Other languages
Chinese (zh)
Other versions
CN103473271B (en
Inventor
董营
孟诗寂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SUZHOU MAIKE NETWORK SAFETY TECHNOLOGY Co Ltd
Original Assignee
SUZHOU MAIKE NETWORK SAFETY TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SUZHOU MAIKE NETWORK SAFETY TECHNOLOGY Co Ltd filed Critical SUZHOU MAIKE NETWORK SAFETY TECHNOLOGY Co Ltd
Priority to CN201310363130.7A priority Critical patent/CN103473271B/en
Publication of CN103473271A publication Critical patent/CN103473271A/en
Application granted granted Critical
Publication of CN103473271B publication Critical patent/CN103473271B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an optimized storing method for mass data, which is carried out according to the following steps of S1. dividing data subjected to validity checking into small blocks according to a certain rule, and storing each small block of data into a corresponding list; S2. judging the lists for storing the data in S1, and if the data in the lists reach the magnitude order of millions, dividing the data of the lists by using set time periods as time intervals, and dividing the lists into small lists for storage again, wherein each small list is used for only storing the data within the corresponding time period. Through the optimized storing method for the mass data, the problems of rapid database expansion, limitless enlargement and low inquiry speed caused by the accumulation of the mass data, and the disadvantage of reducing data-utilizing efficiency and the like are solved, and the effects of reducing the load of a database and improving the data-utilizing efficiency are achieved.

Description

A kind of storage method of the optimization for mass data
Technical field
The present invention relates to a kind of optimization storage means of data, be specifically related to a kind of optimization storage means of mass data, belong to the microcomputer data processing field.
Background technology
At present, due to the develop rapidly of data, data volume is increasing, and the storage of large data becomes a very large difficult problem with inquiry.Cloud, as the treatment technology of large data and fast data, more and more is applied in a lot of message areas now, how to process as required huge cloud data and has become natural problem.Processing the most direct method of mass data is directly to deposit corresponding database in, this method requires high to server performance, if each use all will be inquired about required data in huge database, inquiry velocity is slow, efficiency is low, and if only make simple submeter and also can't reach desirable effect, the load of database is still very large, can not position data fast, can't improve data query speed.
Summary of the invention
Goal of the invention: the object of the invention is to for the deficiencies in the prior art, a kind of storage means of the optimization for mass data that reduces database loads, improves data query speed and utilization ratio is provided.
Technical scheme: the invention provides a kind of storage means of the optimization for mass data, carry out as follows:
S1, will be divided into fritter according to certain rule through the data of legitimacy verification, the data of each fritter will be deposited in corresponding table;
S2, in step S1 the storage data table judged, if the data in table reach 1,000,000 magnitudes, this table is carried out to the data cutting according to the time period of setting as the time interval, again be divided into little table storage, only store the data in the corresponding time period in each little table.
Being further defined to of technical solution of the present invention, in step S2, the time period of described setting is a time interval i from short to long time period, while according to the time period of setting, carrying out the data cutting, first selects i time period to carry out the data cutting as the time interval; After the data cutting, the little table generated is judged, if also exist data to reach the table of 1,000,000 magnitudes, select the table that i-1 time period reaches 1,000,000 magnitudes to data again as the time interval to carry out the data cutting, circulation is operation as above, until countless certificate reaches, the table of 1,000,000 magnitudes exists or the time period of setting is all used.
Further, in step S2, the time period of described setting is one day, one week and one month, while according to the time period of setting, carrying out the data cutting, first selects within one month, as the time interval, carry out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, the table of selecting the week age section again data to be reached to 1,000,000 magnitudes carries out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, select the table that a time period reaches 1,000,000 magnitudes to data again to carry out the data cutting, complete this secondary data storage.
Further, if the data in the little table again be divided into exceed the scope of the time period of setting, delete automatically.
Further, in step S2, after the table that data are reached to 1,000,000 magnitudes carries out the data cutting, the data of cutting apart are judged, if the table after two and plural data unit merge still is less than 1,000,000 magnitudes, first table is merged, and then deposited in little table.
Beneficial effect: a kind of storage means of the optimization for mass data provided by the invention, logical table that at times data is reached to 1,000,000 magnitudes carries out the data cutting, data to each time period are safeguarded separately, the accumulation that has solved mass data causes the database rapid expansion, unlimited expansion, inquiry velocity is slow, has reduced the drawbacks such as data utilization ratio, reach the reduction database loads, improve the effect of data utilization ratio.
The accompanying drawing explanation
The process flow diagram that Fig. 1 is a kind of storage means of the optimization for mass data provided by the invention.
Embodiment
Below technical solution of the present invention is elaborated, but protection scope of the present invention is not limited to described embodiment.
embodiment:the present embodiment provides a kind of storage means of the optimization for mass data, the present invention is applied in the reception & disposal of cloud to large data, after Cloud Server reception mass data is resolved data, to be optimized storage to data, the method flow diagram of concrete optimization storage as shown in Figure 1, carries out as follows:
S1, will be divided into fritter according to certain rule through the data of legitimacy verification, the data of each fritter will be deposited in corresponding table.
The legitimacy verification comprises verification to receiving packet and to the verification of data in bag, whether packet is mainly to come the source of verification bag correct by the md5 value, whether be that network attack injects bag, if packet is legal, packet is resolved, to the data analysis of the inside, whether be needed correct data.
The rule of Data Segmentation, the table corresponding according to data determined, all data can not leave in a table, data after parsing have fixing sign show the each several part data this exist which the table in, such as user profile exists in subscriber's meter, network traffic information exists in the corresponding table of network traffics, and fixing rule is the sign of appointing.
The table of storage data is just built up when building storehouse, does not need dynamically to build table.Different list data structure is different, table be table with show between be interdependence again separately independently, the data in table are that in another table, data deposit in after merging by calculating, during use, independently are used separately.If the data that the data in table are not processed in table can increase always, the purpose of the method is exactly to process the data that always increase, and makes it can infinitely not increase.
To the verification of network data legitimacy, to data cut apart and database is built table for current proven technique, those skilled in the art can adopt conventional knowledge to realize, does not do discussion herein.
S2, in step S1 the storage data table judged, if the data in table reach 1,000,000 magnitudes, this table being carried out to the data cutting according to the time period of setting as the time interval is judged the data of cutting apart, if the table after two and plural data unit merge still is less than 1,000,000 magnitudes, first table is merged, and then deposited in little table.The data of only storage correspondence in the time period in each little table.Such as within every 15 minutes, receiving data, these data are deposited in the table of nearest a day, then the synthetic data of every four data deposit in the tables of data of nearest a week, by that analogy, the data that exceed time range in each table are deleted automatically, and so size and the data volume of database can invalidly not increase.
Concrete method of carrying out at times Interval data is: the time period of setting is a time interval i from short to long time period, while according to the time period of setting, carrying out the data cutting, first selects i time period to carry out the data cutting as the time interval; After the data cutting, the little table generated is judged, if also exist data to reach the table of 1,000,000 magnitudes, select the table that i-1 time period reaches 1,000,000 magnitudes to data again as the time interval to carry out the data cutting, circulation is operation as above, until countless certificate reaches, the table of 1,000,000 magnitudes exists or the time period of setting is all used.If the data in the little table again be divided into exceed the scope of the time period of setting, delete automatically.
In the present embodiment, the time period of setting is one day, one week and one month, while according to the time period of setting, carrying out the data cutting, first selects within one month, as the time interval, carry out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, the table of selecting the week age section again data to be reached to 1,000,000 magnitudes carries out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, select the table that a time period reaches 1,000,000 magnitudes to data again to carry out the data cutting, complete this secondary data storage.If the data in the little table again be divided into exceed the scope of the time period of setting, delete automatically.
To data, huge table carries out the data cutting to logical of the present invention at times, and the data of each time period are safeguarded separately, and the accumulation that has solved mass data causes the database rapid expansion, unlimited expansion, and inquiry velocity is slow.
As mentioned above, although meaned and explained the present invention with reference to specific preferred embodiment, it shall not be construed as the restriction to the present invention self.Under the spirit and scope of the present invention prerequisite that does not break away from the claims definition, can make in the form and details various variations to it.

Claims (5)

1. the storage means of the optimization for mass data, is characterized in that, carries out as follows:
S1, will be divided into fritter according to certain rule through the data of legitimacy verification, the data of each fritter will be deposited in corresponding table;
S2, in step S1 the storage data table judged, if the data in table reach 1,000,000 magnitudes, this table is carried out to the data cutting according to the time period of setting as the time interval, again be divided into little table storage, only store the data in the corresponding time period in each little table.
2. a kind of storage means of the optimization for mass data according to claim 1, it is characterized in that, in step S2, the time period of described setting is a time interval i from short to long time period, while according to the time period of setting, carrying out the data cutting, first select i time period to carry out the data cutting as the time interval; After the data cutting, the little table generated is judged, if also exist data to reach the table of 1,000,000 magnitudes, select the table that i-1 time period reaches 1,000,000 magnitudes to data again as the time interval to carry out the data cutting, circulation is operation as above, until countless certificate reaches, the table of 1,000,000 magnitudes exists or the time period of setting is all used.
3. a kind of storage means of the optimization for mass data according to claim 2, it is characterized in that, in step S2, the time period of described setting is one day, one week and one month, while according to the time period of setting, carrying out the data cutting, first select within one month, as the time interval, carry out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, the table of selecting the week age section again data to be reached to 1,000,000 magnitudes carries out the data cutting; After the data cutting, the little table generated is judged, if there is no data reach the table of 1,000,000 magnitudes, complete this secondary data storage; If also exist data to reach the table of 1,000,000 magnitudes, select the table that a time period reaches 1,000,000 magnitudes to data again to carry out the data cutting, complete this secondary data storage.
4. a kind of storage means of the optimization for mass data according to claim 1, is characterized in that, if the data in the little table again be divided into exceed the scope of the time period of setting, deletes automatically.
5. a kind of storage means of the optimization for mass data according to claim 1, it is characterized in that, in step S2, after the table that data are reached to 1,000,000 magnitudes carries out the data cutting, the data of cutting apart are judged, if the table after two and plural data unit merge still is less than 1,000,000 magnitudes, first table is merged, and then deposited in little table.
CN201310363130.7A 2013-08-20 2013-08-20 A kind of optimization storage method for mass data Active CN103473271B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310363130.7A CN103473271B (en) 2013-08-20 2013-08-20 A kind of optimization storage method for mass data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310363130.7A CN103473271B (en) 2013-08-20 2013-08-20 A kind of optimization storage method for mass data

Publications (2)

Publication Number Publication Date
CN103473271A true CN103473271A (en) 2013-12-25
CN103473271B CN103473271B (en) 2017-09-26

Family

ID=49798119

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310363130.7A Active CN103473271B (en) 2013-08-20 2013-08-20 A kind of optimization storage method for mass data

Country Status (1)

Country Link
CN (1) CN103473271B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182546A (en) * 2014-09-09 2014-12-03 北京国双科技有限公司 Method and device for querying data in databases
CN106233287A (en) * 2015-03-02 2016-12-14 微软技术许可有限责任公司 Management to the data base querying of large data collection
CN106600735A (en) * 2016-12-14 2017-04-26 天津飞鸟科技有限公司 Fingerprint time recorder and mobile phone communication recognition intelligent system
CN107798048A (en) * 2017-07-28 2018-03-13 昆明理工大学 A kind of negative data library management method for radio heliograph Mass Data Management
CN109800252A (en) * 2019-03-05 2019-05-24 深圳市国晨工程造价咨询有限公司 A kind of engineering project Records Information Management System

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1747398A (en) * 2004-09-08 2006-03-15 大唐移动通信设备有限公司 Mass performance data statistical method in network element management system
CN101207513A (en) * 2006-12-22 2008-06-25 中兴通讯股份有限公司 Apparatus and method for saving historical data
CN101566986A (en) * 2008-04-21 2009-10-28 阿里巴巴集团控股有限公司 Method and device for processing data in online business processing
CN101697152A (en) * 2009-10-23 2010-04-21 金蝶软件(中国)有限公司 Database storage system and method and device for splitting data thereof
US20130066882A1 (en) * 2011-09-09 2013-03-14 Onzo Limited Data storage method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1747398A (en) * 2004-09-08 2006-03-15 大唐移动通信设备有限公司 Mass performance data statistical method in network element management system
CN101207513A (en) * 2006-12-22 2008-06-25 中兴通讯股份有限公司 Apparatus and method for saving historical data
CN101566986A (en) * 2008-04-21 2009-10-28 阿里巴巴集团控股有限公司 Method and device for processing data in online business processing
CN101697152A (en) * 2009-10-23 2010-04-21 金蝶软件(中国)有限公司 Database storage system and method and device for splitting data thereof
US20130066882A1 (en) * 2011-09-09 2013-03-14 Onzo Limited Data storage method and system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182546A (en) * 2014-09-09 2014-12-03 北京国双科技有限公司 Method and device for querying data in databases
CN104182546B (en) * 2014-09-09 2017-10-27 北京国双科技有限公司 The data query method and device of database
CN106233287A (en) * 2015-03-02 2016-12-14 微软技术许可有限责任公司 Management to the data base querying of large data collection
CN106233287B (en) * 2015-03-02 2019-07-02 微软技术许可有限责任公司 Management to the data base querying of large data collection
CN106600735A (en) * 2016-12-14 2017-04-26 天津飞鸟科技有限公司 Fingerprint time recorder and mobile phone communication recognition intelligent system
CN107798048A (en) * 2017-07-28 2018-03-13 昆明理工大学 A kind of negative data library management method for radio heliograph Mass Data Management
CN109800252A (en) * 2019-03-05 2019-05-24 深圳市国晨工程造价咨询有限公司 A kind of engineering project Records Information Management System

Also Published As

Publication number Publication date
CN103473271B (en) 2017-09-26

Similar Documents

Publication Publication Date Title
CN103473271A (en) Optimized storing method for mass data
CN103902593B (en) A kind of method and apparatus of Data Migration
CN105912397A (en) Resources management method and device
CN102780641B (en) Flow table aging method and device of quick forwarding engine, and switch
CN104363282B (en) A kind of cloud computing resource scheduling method and device
CN104134173A (en) Main and distribution network graph/model/data integrated system based on remote retrieval and information interaction
CN106649646A (en) Method and device for deleting duplicated data
CN103914265A (en) Cluster fine-grained memory management method
CN104765749A (en) Data storage method and device
CN103970853A (en) Method and device for optimizing search engine
CN102404411A (en) Data synchronization method of cloud storage system
CN104298687A (en) Hash partitioning management method and device
CN108174235A (en) A kind of video loading method and device
CN103853838A (en) Data processing method and device
CN103607731A (en) Method and device for processing measurement reports
Sun et al. A cost-efficient scheduling algorithm of on-demand broadcasts
CN104978695A (en) Electricity charge data processing method and apparatus
CN104123329B (en) Searching method and device
CN102685222B (en) A kind of cloud SRM device for electric power system
CN103297542A (en) Operating system bus and balancing method supporting online expansion and retraction of components
CN104242993A (en) Medium-low voltage power communication access network bandwidth predication method
CN104679821A (en) Data access processing method and data access processing device
CN104992698A (en) Song on-demand arrangement method and device for KTV song-on-demand system
CN106612298B (en) A kind of content distribution method and system based on large-scale network node
CN103796226A (en) Network optimizing method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: Room 301-302, 3rd Floor, Tiancheng Information Building, No. 88 South Tiancheng Road, High Speed Rail New City, Xiangcheng District, Suzhou City, Jiangsu Province, 215133

Patentee after: SUZHOU MAXNET NETWORK SAFETY TECHNOLOGY Co.,Ltd.

Address before: 215021 International Science and Technology Park Phase III 8B, No. 1355 Jinjihu Avenue, Industrial Park, Suzhou City, Jiangsu Province

Patentee before: SUZHOU MAXNET NETWORK SAFETY TECHNOLOGY Co.,Ltd.