CN108363710A - Cloud system catalogue with intelligent reconstruction Data View - Google Patents
- Publication number
- CN108363710A (application CN201710505969.8A)
- Authority
- CN
- China
- Prior art keywords
- catalogue
- data
- attribute
- user
- file
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3322—Query formulation using system suggestions
- G06F16/3323—Query formulation using system suggestions using document space presentation or visualization, e.g. category, hierarchy or range presentation and selection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/22—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A cloud system directory whose data view can be intelligently reconstructed. Built on a distributed file system, a database, and a search engine, the invention provides a cloud directory scheme in which data can be flexibly distributed and displayed and the directory structure can be intelligently recombined, replacing cloud directory schemes whose structure is relatively fixed. The invention can provide multiple cloud directory views for a single set of cloud data; the rules for generating a directory view are defined by the user to suit the user's own classification logic and display needs. When first creating a directory, the user adds an attribute to it as needed, and the directory's attribute and attribute value are stored in the database and the search engine. When the user uploads new data to the directory, the data automatically inherits the directory's attribute features. The user can then form directories with various different views according to different attribute values.
Description
The invention is characterized in that, based on a distributed file system, a database, and a search engine, it provides a cloud directory scheme in which data can be flexibly distributed and displayed and the directory structure can be intelligently recombined, replacing cloud directory schemes whose structure is relatively fixed. The invention can provide multiple cloud directory views for a single set of cloud data; the rules for generating a directory view are defined by the user to suit the user's own classification logic and display needs.
1 Technical field
The present invention relates to several technical fields, chiefly the following:
A. Distributed file systems or cluster file systems
B. Database technology, covering both structured databases and key-value (k/v) unstructured databases
C. Search engine technology
D. Tomcat server technology
E. The Java programming language
F. The Linux operating system
2 Background
As the cloud system market flourishes, many domestic companies have developed widely known and widely used cloud products, such as Baidu Netdisk and 360 Cloud Disk. The data directory of most existing cloud products resembles the familiar file system of the Windows operating system. Users can create cloud directories, store files in them, and operate on the stored data, for example by adding, deleting, copying, and moving it.
However, the data directories of existing cloud products have limitations, for example:
Fig. 1 shows an established directory structure. The user organized the directories this way because, when creating them, the user wanted to classify data first by "place" and then subdivide it by "time". But a user's query needs change over time. When the user only needs to read all data for "2015" (and no longer cares which place the data belongs to), such a relatively fixed data view forces the user to enter the "2015" directory under each of the three top-level directories "Chengdu", "Beijing", and "Nanjing", which makes querying very inefficient. Suppose the user needs to query data both through the original directory hierarchy and, frequently, by "place" alone without regard to the "time" classification. Unless the user manually rearranges a large amount of cloud data into two separate directory trees that store the same data, querying through the original directory is painful. Unfortunately, maintaining two data views (directory trees) by hand is nearly impossible, because every piece of data the user uploads must be written into both views. And if the user also needs to retrieve data first by "time" and then by "place" (the exact reverse of the hierarchy in Fig. 1), current cloud file systems are even less able to cope.
In summary, the main problem with the data directories of current cloud products is this: once the user has established a cloud directory structure, adjusting the directory view requires manually moving and regrouping a large amount of cloud data to form a new structure. The classification and display logic of the data in a directory cannot be adjusted conveniently and flexibly, and the same cloud data cannot be presented through different logical views. A relatively fixed directory structure, an inflexible way of adjusting directories, and a single way of presenting the data view cannot meet the deeper needs of most users.
3 Summary of the invention
The problem to be solved by the invention is to provide a cloud data directory that can be intelligently reconstructed. It meets the user's functional requirements for creating, deleting, modifying, and querying directories, and it also lets the user generate multiple different directory views of the same data set, according to the user's habits and needs, without large amounts of manual adjustment.
The software and hardware architecture of the invention is shown in Fig. 3 and is described as follows:
(1) The computer hardware is based on the x86 platform and must support the Linux operating system. It needs a network port for external access, sufficient CPU capacity, and enough disk storage to hold file data, database data, and search engine data. Because this is a cloud cluster system, multiple computers must be networked through a switch or router. The figure illustrates the system with 3 computers.
(2) The invention uses a distributed file system as the actual storage for file data; the distributed file system can be deployed on the computer hardware described in (1).
(3) The invention uses the MySQL and Redis databases and the open-source search engine Solr, all of which must be deployed as clusters, together with the distributed file system, on the computers described in (1).
(4) The invention uses an application written in Java, deployed with the Tomcat application server on the same computers as the distributed file system.
(5) In addition, a complete front-end interface must be provided so that users can view and use the data through the web, an app, or other means.
The technical problem is solved as follows:
Step 1: The user creates a cloud directory and, when creating it, adds an attribute to it. For example, for a directory named "Beijing", the user can add the attribute "place", because Beijing is a specific value of "place"; defining it this way better matches how the directory is used. The system stores the directory's attribute and attribute value in the database and the search engine. (The directory name automatically becomes the attribute value.)
Step 2: When the user uploads a file to a specified cloud directory, the application receives the file and stores it in the distributed file system. The application then reads the attributes and attribute values of the directory the file belongs to and stores the file's unique identifier, together with those attributes and attribute values, in the database and the search engine. In this way a file uploaded to the cloud naturally carries the attribute features of its directory. At this point the relationship between file and directory has been established, and repeating the steps above builds a Windows-like data view.
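The first two steps can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: `Directory`, `AttributeIndex`, `upload`, and `find` are hypothetical names, a pair of in-memory dictionaries stands in for the database and the search engine, and the placement of files E and C is assumed (the text only states where file A lives).

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class Directory:
    name: str                      # the directory name doubles as the attribute value
    attribute: str                 # user-chosen attribute, e.g. "place"
    parent: Optional["Directory"] = None

    def features(self) -> dict:
        """Own attribute plus everything inherited from ancestor directories."""
        inherited = self.parent.features() if self.parent else {}
        return {**inherited, self.attribute: self.name}


class AttributeIndex:
    """In-memory stand-in for the database and search engine (an assumption)."""

    def __init__(self) -> None:
        self.file_features = {}              # file id -> {attribute: value}
        self.by_value = defaultdict(set)     # (attribute, value) -> file ids

    def upload(self, file_id: str, directory: Directory) -> None:
        # Step 2: record the file id together with its directory's features.
        features = directory.features()
        self.file_features[file_id] = features
        for pair in features.items():
            self.by_value[pair].add(file_id)

    def find(self, attribute: str, value: str) -> set:
        return set(self.by_value.get((attribute, value), set()))


# Step 1: directories are created with an attribute; the name is the value.
chengdu = Directory("Chengdu", "place")
beijing = Directory("Beijing", "place")

index = AttributeIndex()
index.upload("A", Directory("2015", "time", parent=chengdu))
index.upload("E", Directory("2015", "time", parent=beijing))   # assumed placement
index.upload("C", Directory("2016", "time", parent=chengdu))   # assumed placement

files_2015 = index.find("time", "2015")   # no need to walk each "place" directory
```

Because every file is indexed by each of its attribute values, a query such as "all data for 2015" becomes a single lookup rather than a visit to every top-level directory.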
Step 3: As noted above, when the user needs a new data directory, there is no longer any need to sort through and reorganize the original cloud directory; the user only needs to specify a new view presentation logic for the original data directory. Specifically, the user specifies the attributes of the new directory view and the level at which each attribute is displayed. For example, the first level of the new view can be directories for the attribute "place" and the second level directories for the attribute "time" (the user can equally make "time" the first level and "place" the second; the presentation logic is set according to the user's requirements). The application reads and analyzes all values of the "place" and "time" attributes from the database and the search engine, generates the new directory structure level by level according to the specified attribute levels, and fills the matching file data into the new directories, creating a new data view. The directory generation rule and the data matching rule involve more complex logic and are described in detail in the embodiments.
Finally: By simply specifying an attribute display hierarchy, the user obtains a new data directory without affecting the original one. The new data directory is used in exactly the same way as the original cloud directory. For the same set of cloud data, the user can create multiple data directories with different presentation logic.
The invention proposes a cloud directory that can be intelligently reconstructed. Its beneficial effect is:
● The cloud data view can be reconstructed according to file attributes, providing multiple data views of the same data set for browsing and use. This greatly improves the user's efficiency in managing data.
4 Description of the drawings
Fig. 1 is the original directory view of the invention, that is, the view structure as arranged by directory. A-L represent files belonging to different directories, the remaining nodes represent directories, and the italic text on a directory or file represents its attribute features;
Fig. 2 is the smart directory view of the invention. It shows the directory view structure obtained by reconstructing the data of Fig. 1 according to attributes. A-L represent files belonging to different directories, the remaining nodes represent directories, and the italic text on a directory or file represents its attribute features;
Fig. 3 shows the basic software and hardware composition of the invention.
5 Detailed description of embodiments
We illustrate with the simple original directory structure of Fig. 1.
In Fig. 1, 1 denotes the first-level subdirectories of the original directory, which are also the top-level directories in the system. The attribute the user assigned when creating them is "place", and this level contains three directories. 2 denotes the second-level subdirectories of the original directory; their attribute is "time", and this level contains six directories.
Before explaining how the user applies the invention to turn the data of Fig. 1 into the new data directory of Fig. 2, three main rules must be explained individually:
A. Attribute inheritance rule
When directories are created, a lower-level directory automatically has the attributes of all its higher-level directories. For example, in Fig. 1 the directory named "2015" under the directory named "Chengdu" has the attribute feature "place:Chengdu & time:2015", and so on. The italic text on a directory or file in the figure represents its attribute features.
When a file is uploaded, it automatically inherits the attributes of the directory it is placed in. For example, when the user uploads file A into the directory named "2015" under the directory named "Chengdu" in Fig. 1, the system automatically records the attribute feature of file A as "place:Chengdu & time:2015", identical to that of the directory containing it, and so on.
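The inheritance rule can be illustrated with a short sketch. The function names `inherit` and `describe` are hypothetical, and representing a directory path as a list of (attribute, directory name) pairs is an assumption made for illustration; the patent does not specify a data format.

```python
def inherit(path):
    """Merge (attribute, directory name) pairs from the top-level directory down."""
    features = {}
    for attribute, name in path:
        features[attribute] = name  # the directory name acts as the attribute value
    return features


def describe(features):
    """Render features in the "place:Chengdu & time:2015" style used in the text."""
    return " & ".join(f"{attribute}:{value}" for attribute, value in features.items())


# The "2015" directory under "Chengdu" inherits "place" from its parent.
directory_2015 = inherit([("place", "Chengdu"), ("time", "2015")])

# An uploaded file copies the features of the directory it lands in.
file_a = dict(directory_2015)

print(describe(file_a))  # place:Chengdu & time:2015
```

Copying the features onto the file, rather than pointing back at the directory, is what later lets the file be matched into a different view without consulting its original location.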
B. New data directory generation rule
Fig. 1 is the user's original data directory. As Fig. 1 shows, the attribute "place" has three values, "Chengdu", "Beijing", and "Nanjing", and the attribute "time" has two values, "2015" and "2016". When the attribute hierarchy the user selects runs from "time" to "place", the system first reads the "time" attribute values in all the original data and generates the first-level directories, then reads the "place" attribute values in all the original data and generates second-level directories under each first-level directory. Taking Fig. 2 as an example, the system first reads all "time" attributes. As Fig. 1 shows, "time" has only two values in the original data, so the system generates two first-level directories named "2015" and "2016". It then generates, under each of these two directories, three subdirectories named "Chengdu", "Beijing", and "Nanjing" according to the values of the attribute "place", for six subdirectories in total.
The system automatically adds attribute features to each new directory. For example, the directory "2015" carries the attribute feature "time:2015". Following the attribute inheritance rule, the system also automatically adds the attribute features of the higher-level directories.
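A rough sketch of this generation rule follows. The names `distinct_values` and `generate_levels` are hypothetical, and representing each generated directory as a tuple of (attribute, value) pairs is an assumption; the sketch takes the cross product of attribute values, which is how the text arrives at two first-level and six second-level directories.

```python
from itertools import product


def distinct_values(feature_sets, attribute):
    """Collect the distinct values of one attribute, in first-seen order."""
    seen = []
    for features in feature_sets:
        value = features.get(attribute)
        if value is not None and value not in seen:
            seen.append(value)
    return seen


def generate_levels(feature_sets, attribute_order):
    """Return one list of directories per level of the new view.

    Each directory is a tuple of (attribute, value) pairs, so a second-level
    directory already carries the feature inherited from its parent.
    """
    values = [distinct_values(feature_sets, a) for a in attribute_order]
    levels = []
    for depth in range(1, len(attribute_order) + 1):
        level = [
            tuple(zip(attribute_order[:depth], combo))
            for combo in product(*values[:depth])
        ]
        levels.append(level)
    return levels


# Attribute features of the six second-level directories of Fig. 1.
legacy = [
    {"place": place, "time": year}
    for place in ("Chengdu", "Beijing", "Nanjing")
    for year in ("2015", "2016")
]

levels = generate_levels(legacy, ["time", "place"])
print(len(levels[0]), len(levels[1]))  # 2 6
```

Swapping the order to `["place", "time"]` would reproduce the shape of the original hierarchy: three first-level and six second-level directories.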
C. Data matching rule
As described in the inheritance rule above, once file A in Fig. 1 has been uploaded to its directory, it automatically has the attribute feature "place:Chengdu & time:2015". Based on this feature, in the new data view file A looks for the new view directory whose attribute "place" is "Chengdu" and whose attribute "time" is "2015", and is automatically added to it.
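The matching rule amounts to comparing a file's attribute features with those of each directory in the new view. The sketch below is illustrative only: `matches` and `assign_files` are hypothetical names, directories are keyed by a path-like label chosen for readability, and the placement of file B is assumed.

```python
def matches(file_features, directory_features):
    """True when the file carries every attribute value the directory requires."""
    return all(
        file_features.get(attribute) == value
        for attribute, value in directory_features.items()
    )


def assign_files(files, directories):
    """Place each file into every new-view directory whose features it matches."""
    placement = {label: [] for label in directories}
    for file_id in sorted(files):
        for label, directory_features in directories.items():
            if matches(files[file_id], directory_features):
                placement[label].append(file_id)
    return placement


files = {
    "A": {"place": "Chengdu", "time": "2015"},
    "B": {"place": "Beijing", "time": "2016"},  # assumed placement
}

# Three of the second-level directories of the reconstructed view.
directories = {
    "2015/Chengdu": {"time": "2015", "place": "Chengdu"},
    "2015/Beijing": {"time": "2015", "place": "Beijing"},
    "2016/Beijing": {"time": "2016", "place": "Beijing"},
}

placement = assign_files(files, directories)
```

Here file A lands in "2015/Chengdu" because both of its attribute values agree with that directory's features, while "2015/Beijing" stays empty.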
When the user wants to access data classified first by "time" and then subdivided by "place", the user orders the "time" attribute before the "place" attribute in the system and submits a request for the system to reconstruct the data view. The system generates a new data directory according to the directory generation rule; the directory structure after intelligent reconstruction is shown in Fig. 2. There, 3 denotes the first-level subdirectories of the reconstructed directory, with attribute "time" and two directories in total, and 4 denotes the second-level subdirectories, with attribute "place" and six directories in total. When generating each new directory, the system also generates its matching attribute features.
As in Fig. 2, in the new data directory the system follows the data matching rule to redistribute the 12 original files into the directories whose attribute features match theirs, and then presents the result to the user.
For the user, obtaining a different data view only requires re-permuting the attributes that were set originally. The attribute features of the files themselves do not change; what changes is only how the attribute features are permuted and combined to form different data views for the user.
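To make the permutation idea concrete, the sketch below builds one nested view per ordering of the attributes. It is a simplification under stated assumptions: `build_view` is a hypothetical name, nested dictionaries stand in for directories, the file placements are invented for illustration, and, unlike the generation rule described earlier, only directories that end up containing a file are created.

```python
from itertools import permutations


def build_view(file_features, attribute_order):
    """Nest files by attribute values in the requested order.

    Inner levels are dictionaries (directories); the last level maps an
    attribute value to the list of file ids stored there.
    """
    tree = {}
    *upper, last = attribute_order
    for file_id, features in sorted(file_features.items()):
        node = tree
        for attribute in upper:
            node = node.setdefault(features[attribute], {})
        node.setdefault(features[last], []).append(file_id)
    return tree


# Assumed file placements; the attribute features never change between views.
files = {
    "A": {"place": "Chengdu", "time": "2015"},
    "B": {"place": "Chengdu", "time": "2016"},
    "C": {"place": "Beijing", "time": "2015"},
}

# One view per ordering of the attributes: n attributes give n! orderings.
views = {order: build_view(files, order) for order in permutations(("place", "time"))}

by_time = views[("time", "place")]
by_place = views[("place", "time")]
```

With two attributes there are two orderings, corresponding to the layouts of Fig. 1 and Fig. 2; both views are derived from the same unchanged `files` mapping.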
6 Evaluation
In conventional data views, including those of operating systems, a given data set is always presented with one and the same presentation logic. The advantage is that this standardizes user habits, but the drawback is equally obvious: it stifles users' need to see data presented with multiple logics and along multiple dimensions. Now that unstructured data keeps growing, users increasingly want multiple ways of presenting data and multiple display logics. Examples include sales staff, whose sales data carry sales region and product information, and finance staff, whose financial reports differ in type and cover different periods. Whenever a user's data need attribute features along several dimensions to describe them, a flexible data directory that is not limited to a single presentation logic is required. Given these needs, we believe that providing a flexible, intelligently reconstructable cloud data directory scheme is very important.
Claims (4)
1. A cloud system directory with an intelligently reconstructable structure and automatic data classification, characterized in that: based on classification attributes previously defined by the user on cloud directories and their subdirectories, the directory structure is reconstructed by permuting and combining the order of the attributes as the user desires, and the data are reclassified, so that multiple data directories can be provided simultaneously for the same data set.
2. The pre-defined classification attributes according to claim 1, characterized in that: directories in the system have preset classification attribute features; a directory's classification attribute can be customized, and the directory name is the attribute value. Directory classification attributes are inheritable: a lower-level directory automatically inherits the classification attribute features of its higher-level directories.
3. The pre-defined classification attributes according to claim 1, characterized in that: files in the system also have preset classification attribute features, and a file's classification attributes are inherited directly from the directory containing the file.
4. The file reclassification according to claim 1, characterized in that: after reconstruction, each file is automatically placed, according to its own preset classification features, in the directory of the new structure that matches those features.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710505969.8A CN108363710A (en) | 2017-06-20 | 2017-06-20 | Cloud system catalogue with intelligent reconstruction Data View |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710505969.8A CN108363710A (en) | 2017-06-20 | 2017-06-20 | Cloud system catalogue with intelligent reconstruction Data View |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108363710A true CN108363710A (en) | 2018-08-03 |
Family
ID=63010114
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710505969.8A Pending CN108363710A (en) | 2017-06-20 | 2017-06-20 | Cloud system catalogue with intelligent reconstruction Data View |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108363710A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1131767A (en) * | 1995-11-11 | 1996-09-25 | 广州市金资信息技术有限公司 | Information processing method |
US6243815B1 (en) * | 1997-04-25 | 2001-06-05 | Anand K. Antur | Method and apparatus for reconfiguring and managing firewalls and security devices |
US20080021883A1 (en) * | 2006-07-21 | 2008-01-24 | Stephen Alstrin | Methods, systems and computer program products for monitoring changes to objects on a computer system using information models and baselines |
CN101226552A (en) * | 2008-02-01 | 2008-07-23 | 北京乾坤化物数字技术有限公司 | Method for management of magnanimity information using directory composed of multidimensional structure tree |
CN101334785A (en) * | 2008-07-30 | 2008-12-31 | 浙江大学 | Method for organizing dummy catalog and managing naming space for distributed file systems |
CN103116618A (en) * | 2013-01-28 | 2013-05-22 | 南开大学 | Telefile system mirror image method and system based on lasting caching of client-side |
CN104360982A (en) * | 2014-11-21 | 2015-02-18 | 浪潮(北京)电子信息产业有限公司 | Implementation method and system for host system directory structure based on reconfigurable chip technology |
CN105025086A (en) * | 2015-06-11 | 2015-11-04 | 江南大学 | A cloud safe storing/retrieving method targeted at mobile intelligent terminals |
CN105760781A (en) * | 2016-03-02 | 2016-07-13 | 四川师范大学 | Storage method, restoration method and operation method of ordered and derivable large-data files |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20180803 |