CN108363710A - Cloud system catalogue with intelligent reconstruction Data View - Google Patents

Cloud system catalogue with intelligent reconstruction Data View Download PDF

Info

Publication number
CN108363710A
CN108363710A CN201710505969.8A CN201710505969A CN108363710A CN 108363710 A CN108363710 A CN 108363710A CN 201710505969 A CN201710505969 A CN 201710505969A CN 108363710 A CN108363710 A CN 108363710A
Authority
CN
China
Prior art keywords
catalogue
data
attribute
user
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710505969.8A
Other languages
Chinese (zh)
Inventor
孔朝晖
母亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201710505969.8A priority Critical patent/CN108363710A/en
Publication of CN108363710A publication Critical patent/CN108363710A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions
    • G06F16/3323Query formulation using system suggestions using document space presentation or visualization, e.g. category, hierarchy or range presentation and selection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/22Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Cloud system catalogue with intelligent reconstruction Data View.It is characterized in that:Based on distributed file system, database and search engine technique, provide a kind of data can flexible distributed, displaying, the cloud system Directory technology scheme that structure can be recombinated intelligently, with the relatively-stationary cloud system Directory technology scheme of alternative structure.The invention can provide more set high in the clouds data directory views for a set of high in the clouds data, and the create-rule of data directory meets its own different data sort logic by User Defined and shows needs.User adds the attribute of catalogue when initially setting up catalogue according to the demand of oneself, and attribute and the attribute value of the catalogue are stored in database and search engine;When user uploads new data to the catalogue, the attributive character of catalogue can be inherited automatically;User can form the catalogue of various different views according to different attribute values.

Description

Cloud system catalogue with intelligent reconstruction Data View
It is characterized in that:Based on distributed file system, database and search engine technique, providing a kind of data can be flexible Distribution, displaying, the cloud system Directory technology scheme that structure can be recombinated intelligently, with the relatively-stationary cloud system catalogue skill of alternative structure Art scheme.The invention can be provided for a set of high in the clouds data cover high in the clouds data directory views, the create-rule of data directory by with Family is self-defined to meet its own different data sort logic and displaying needs.
1 technical field
The present invention relates to a series of technical fields, main technical field is as follows:
A, distributed file system or cluster file system
B, database technology, wherein relating separately to structured database and k/v type unstructured databases
C, search engine technique
D, tomcat server technology
E, java programming language
F, linux operating system
2 background technologies
Currently, as cloud system in the market flourishes, the country has had a large amount of enterprise and company to develop a system Arrange cloud product well known and application, such as Baidu's Dropbox, 360 cloud disks etc..The data of most of existing cloud product Catalogue is similar with the well known file system of windows operating systems.User can establish high in the clouds catalogue, and by high in the clouds File data storing can operate the data for having stored into cloud catalogue, for example increase newly, delete in cloud catalogue The operations such as remove, replicate, moving.
But existing cloud product data catalogue has certain limitation, is exemplified below:
The reason of Fig. 1 is set bibliographic structure, user's such organization directory is when user establishes catalogue, and user wishes Data classification is carried out first, in accordance with " place ", and then is finely divided according to " time ".But user needs the inquiry of data What Seeking Truth changed at any time, when user only needs all data of reading " 2015 " (at this point, user no longer needs to focus first on Data belong to where), in Data View relatively-stationary in this way, need to respectively enter " Chengdu ", " Beijing ", " Nanjing " this three " 2015 " catalogue in a different top-level directory structure goes to inquire, and efficiency data query is very low.If user both needs to lead to The logical level for crossing original catalogue goes inquiry data, and needs frequently by only " place " without concern for the classification of " time " When mode goes inquiry data, unless user is allowed to carry out rearranging two sets of different data mesh of formation manually to a large amount of high in the clouds data It records to store identical data, otherwise goes inquiry data can be very painful by original data directory.Unfortunately, once allowing User oneself goes to safeguard two sets of data views (catalogue) simultaneously by way of Manual arranging, that is hardly possible thing again Feelings, this will imply that user often uploads a data, is required for this part of data while being written in two views.If it is assumed that User was also needed to simultaneously first, in accordance with " time ", then retrieved data (the bibliographic structure logical level with Fig. 1 according still further to " place " It is exactly the opposite), then current high in the clouds file system is just more difficult to meet.
As described above, the main problem of current cloud product data catalogue is:After user establishes high in the clouds bibliographic structure, when When needing to adjust catalog views, need move combining by a large amount of high in the clouds data manually to form new bibliographic structure, it can not Data mode classification and displaying logic, identical high in the clouds data content in convenient, flexible adjustment catalogue can not use difference Logical view presented.Bibliographic structure relatively-stationary in this way, inflexible catalogue adjustment mode, single Data View Presentation mode can not meet the profound needs of most users.
3 invention contents
Problem to be solved by this invention is:There is provided it is a kind of can intelligent reconstruction cloud data directory, both met user for While the additions and deletions of catalogue change the functional requirement looked into, and allow user according to the custom and needs of oneself, to same sets of data, Under the premise of a large amount of manual settings, generates a variety of different data directory views and used.
The software and hardware architecture of the present invention is as shown in Fig. 3, is described as follows:
(1) computer hardware that the present invention uses is based on x86 platforms, needs to support linux operating systems, needs to have net Mouth needs to have certain cpu operational capabilities and a degree of disk storage to storage file to supply outside access Data, database data and search engine data.Due to being cloud cluster system, thus need multiple stage computers by interchanger or Person's router carries out network connection deployment.It is illustrated with 3 computers in attached drawing.
(2) actual storage present invention uses distributed file system as file data, distributed file system can To be deployed in the computer hardware that (1) is previously mentioned.
(3) present invention uses mysql databases and redis databases, the search engine solr to increase income has been used, has been both needed to It to be carried out together with distributed file system in the computer that clustered deploy(ment) is previously mentioned with (1).
(4) present invention uses the application programs that java language is write, and will apply journey using tomcat application servers Sequence is deployed on the computer where distributed file system simultaneously.
(5) in addition to this, it is also necessary to provide complete set front-end interface give user access, allow user by web, App or other modes are checked using data.
Technical problems to be solved needed for the present invention are realized in:
The first step:User can establish high in the clouds catalogue, and when establishing catalogue, user needs to add the attribute of catalogue, such as one A entitled Beijing of catalogue, the attribute that can add this catalogue is place, because Beijing is a specific object value in place, this Sample definition, which is more in line with, uses needs.The attribute of this catalogue and attribute value can be stored in database and search engine by system In.(directory name will automatically become attribute value)
Second step:When user uploads files under some specified high in the clouds catalogue, application program can receive file, and File is stored in distributed file system.It is handled by application program, reads the attribute and attribute of its affiliated catalogue of file Value, the attribute and attribute value of the unique identifier of high in the clouds file and subordinate catalogue are stored in database and search engine, this As soon as sample, the file uploaded on high in the clouds has the attributive character of its catalogue naturally.So far, it has formd The relationship of file and catalogue, the Data View of a set of class windows can be constituted by repeating above step.
Third walks:As mentioned before, when user needs new data directory, it is no longer necessary to original high in the clouds catalogue It is combed and is integrated, it is only necessary to specify new view that logic is presented for original data directory.Specifically, user needs It is the attribute and attribute displaying level of specified new directory view, for example, it is attribute " place " to specify new view first layer catalogue File, second layer catalogue are that (it is first layer catalogue, " place " that user, which can also specify " time ", for the file of attribute " time " For second layer catalogue, logic is presented and is voluntarily arranged according to the requirement of user).Application system will divide from database and search engine Whole attribute values of " place " attribute and " time " attribute are read in analysis, and new catalogue knot is successively generated according to attribute displaying level Structure, and matched file data is filled into new catalogue, create a set of new Data View.Wherein catalogue create-rule sum number It is more complicated according to matching rule, it is related to more complicated logic rules, is described in detail being put into embodiment.
Finally:User only needs simply to specify a properties presentation layer, you can is not influencing original data directory Under the premise of, obtain a new data directory.In this new data directory, occupation mode and original high in the clouds catalogue It is completely the same.For same set of high in the clouds data, user can create different more parts of the data directories that logic is presented.
The present invention propose it is a kind of can intelligent reconstruction high in the clouds catalogue, the beneficial effects of the invention are as follows:
● high in the clouds Data View can be reconstructed according to file attribute, to same sets of data provide a variety of Data Views into Row, which is consulted, to be used.The efficiency of management of the user for data will be greatly improved.
4 description of the drawings
Fig. 1 is the former catalog views of the present invention, this figure is the catalog views structure arranged according to catalogue, wherein wherein A-L It represents and belongs to file in different directories, remaining node on behalf catalogue, italic character segment generation in catalogue or file in figure The attributive character of list file or catalogue;
Fig. 2 is the Smart Directory view of the present invention, which shows in the data basis belonging to Fig. 1, passes through attribute Catalog views structure after reconstruct, wherein A-L represent the file belonged in different directories, remaining node on behalf catalogue, figure Either italic character segment represents the attributive character of file or catalogue to middle catalogue in file;
Fig. 3 is the basic structure composed of software and hardware of the present invention;
5 specific implementation modes
We are illustrated with simple original bibliographic structure such as Fig. 1.
Such as Fig. 1, wherein the 1 first layer subdirectory for former catalogue and the top-level directory in system, when user creaties directory Its attribute is " place ", this layer of catalogue is altogether there are three catalogue, and 2 be the second layer subdirectory of former catalogue, and attribute is " time ", this There are six catalogues for layer.
Illustrating how user utilizes the present invention so that the data in current Fig. 1 formed a new data directory Fig. 2 it Before, there are three big rules to have to individually be illustrated:
A, inheritance of attribute principle
When createing directory, lower directory has an attribute of all higher level's catalogues automatically, for example, in Fig. 1 file it is entitled " at The catalogue of the subordinate entitled " 2015 " of catalogue all ", attributive character are " place:The Chengdu & times:2015 ", and with such It pushes away.Either italic character segment represents the attributive character of file or catalogue to catalogue in file in figure.
When upper transmitting file, the attribute of file where file is inherited automatically, for example enter in Fig. 1 when user uploads A files When folder name is the catalogue of the subordinate entitled " 2015 " of the catalogue in " Chengdu ", system will equally will record the category of A files automatically Property is characterized as:" place:The Chengdu & times:2015 ", it is consistent with the file where it, and so on.
B, new data catalogue create-rule
Fig. 1 is user's legacy data catalogue, as shown in Figure 1, attribute " place " be respectively there are three attribute value altogether " at All ", " Beijing ", " Nanjing ", there are two attribute value " 2015 ", " 2016 " altogether for attribute " time ".When the attribute layer of user's selection It is secondary when being from " time " to " place ", " time " attribute value that system will first be read in all legacy datas, and generate level-one Catalogue, then " place " attribute value in all legacy datas is read, generate second-level directory respectively again under first class catalogue.To scheme For 2, system has read all " time " attributes first, as shown in Figure 1, in our legacy data, " time " only has Two attribute values, then system generate two first class catalogues of entitled " 2015 ", " 2016 ", then will respectively " 2015 ", It is divided into according to the attribute value of attribute " place " in " 2016 " the two catalogues and does not generate entitled " Chengdu ", " Beijing ", " Nanjing " three A subprime directory generates altogether six subprime directories.
System will add attributive character, such as " 2015 " this catalogue for new catalogue automatically, carry " the time:2015 " This attributive character, and inheritance of attribute principle is deferred to, also it is its automatic addition higher level's directory attribute feature.
C, Data Matching rule
As described in inheritance rules above, after the file A in Fig. 1 has uploaded to catalogue, have " automatically Point:The Chengdu & times:2015 " attributive character.According to this attributive character, in new Data View, this document A will It is " Chengdu " that searching, which is provided simultaneously with attribute " place ", and attribute " time " is that the new view directory of " 2015 " match and adds automatically Enter.
When user wishes first to be finely divided data according to " time " access data according still further to " place ", pass through system " time " attribute is ordered into before " place " attribute, and Request System is submitted to reconstruct Data View.System is generated according to catalogue Rule generates new data directory, and the bibliographic structure after intelligent reconstruction is as shown in Fig. 2.Wherein 3 be the first layer of catalogue after reconstruct Subdirectory, attribute are " time ", and there are two catalogues altogether for this catalogue, wherein 4 be the second layer subdirectory of catalogue after reconstruct, category Property be " place ", there are six catalogues altogether for this layer of catalogue.All new data directories are it by system when generating Produce match attribute feature.
Such as Fig. 2, in new data directory, system is according to Data Matching principle, by 12 original files, redistribution Into the matched catalogue of attributive character therewith, then it is presented to user.
For a user, when needing different Data Views, it is only necessary to the category that will simply set originally Property carry out permutation and combination again and can be realized.The attributive character of file itself does not change, the only attributive character row of variation It arranges the mode of combination and is used with forming different Data View supply users.
6 evaluations
In original Data View, or even in the Data View including operating system, for same sets of data, all use Identical presentation logic carries out data presentation.The advantage of doing so is that the use habit of user is defined, but harm is same Obviously, that is, the demand for having strangled user that the more logics of data, various dimensions are presented.It is more and more huger in unstructured data Today, for user for a variety of data presentation modes, a variety of data show logics, increasingly thirst for, for example sales force includes The sales figure of sales territory and sale product information, the finance report comprising different type and comprising different time of financial staff Table etc., as long as the data of user need the attributive character of various dimensions to illustrate, then needing flexibility and changeability, being not limited to list One is presented the data directory of logic.Needed based on such, it is believed that provide it is a kind of flexibly, can intelligent reconstruction cloud data mesh Record technical solution is very important.

Claims (4)

1. can intelligent reconstruction structure and automatic grouped data cloud system catalogue, it is characterised in that:It is previously defined according to user Categorical attribute in cloud catalogue and on its subdirectory reconstructs mesh by the sequence of permutation and combination attribute according to the expectation of user Directory structures, and classification is re-started to data, a variety of data directories can be provided simultaneously for the same data set.
2. according to claim 1, pre-defined categorical attribute is characterized in that:Catalogue in system has prefabricated classification The categorical attribute of attributive character, catalogue can customize, and directory name is attribute value.Catalog classification attribute has inherited characteristics, under Grade catalogue inherits the categorical attribute feature of higher level's catalogue automatically.
3. according to claim 1, pre-defined categorical attribute is characterized in that:File in system also has prefabricated point Generic attribute feature, the categorical attribute of file are directly inherited from the catalogue where file.
It is characterized in that 4. according to claim 1, file reclassifies:File after reconstruct will be according to the prefabricated of its own Characteristic of division automatically to be stored in the new directory structure met with its prefabricated characteristic of division.
CN201710505969.8A 2017-06-20 2017-06-20 Cloud system catalogue with intelligent reconstruction Data View Pending CN108363710A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710505969.8A CN108363710A (en) 2017-06-20 2017-06-20 Cloud system catalogue with intelligent reconstruction Data View

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710505969.8A CN108363710A (en) 2017-06-20 2017-06-20 Cloud system catalogue with intelligent reconstruction Data View

Publications (1)

Publication Number Publication Date
CN108363710A true CN108363710A (en) 2018-08-03

Family

ID=63010114

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710505969.8A Pending CN108363710A (en) 2017-06-20 2017-06-20 Cloud system catalogue with intelligent reconstruction Data View

Country Status (1)

Country Link
CN (1) CN108363710A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131767A (en) * 1995-11-11 1996-09-25 广州市金资信息技术有限公司 Information processing method
US6243815B1 (en) * 1997-04-25 2001-06-05 Anand K. Antur Method and apparatus for reconfiguring and managing firewalls and security devices
US20080021883A1 (en) * 2006-07-21 2008-01-24 Stephen Alstrin Methods, systems and computer program products for monitoring changes to objects on a computer system using information models and baselines
CN101226552A (en) * 2008-02-01 2008-07-23 北京乾坤化物数字技术有限公司 Method for management of magnanimity information using directory composed of multidimensional structure tree
CN101334785A (en) * 2008-07-30 2008-12-31 浙江大学 Method for organizing dummy catalog and managing naming space for distributed file systems
CN103116618A (en) * 2013-01-28 2013-05-22 南开大学 Telefile system mirror image method and system based on lasting caching of client-side
CN104360982A (en) * 2014-11-21 2015-02-18 浪潮(北京)电子信息产业有限公司 Implementation method and system for host system directory structure based on reconfigurable chip technology
CN105025086A (en) * 2015-06-11 2015-11-04 江南大学 A cloud safe storing/retrieving method targeted at mobile intelligent terminals
CN105760781A (en) * 2016-03-02 2016-07-13 四川师范大学 Storage method, restoration method and operation method of ordered and derivable large-data files

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131767A (en) * 1995-11-11 1996-09-25 广州市金资信息技术有限公司 Information processing method
US6243815B1 (en) * 1997-04-25 2001-06-05 Anand K. Antur Method and apparatus for reconfiguring and managing firewalls and security devices
US20080021883A1 (en) * 2006-07-21 2008-01-24 Stephen Alstrin Methods, systems and computer program products for monitoring changes to objects on a computer system using information models and baselines
CN101226552A (en) * 2008-02-01 2008-07-23 北京乾坤化物数字技术有限公司 Method for management of magnanimity information using directory composed of multidimensional structure tree
CN101334785A (en) * 2008-07-30 2008-12-31 浙江大学 Method for organizing dummy catalog and managing naming space for distributed file systems
CN103116618A (en) * 2013-01-28 2013-05-22 南开大学 Telefile system mirror image method and system based on lasting caching of client-side
CN104360982A (en) * 2014-11-21 2015-02-18 浪潮(北京)电子信息产业有限公司 Implementation method and system for host system directory structure based on reconfigurable chip technology
CN105025086A (en) * 2015-06-11 2015-11-04 江南大学 A cloud safe storing/retrieving method targeted at mobile intelligent terminals
CN105760781A (en) * 2016-03-02 2016-07-13 四川师范大学 Storage method, restoration method and operation method of ordered and derivable large-data files

Similar Documents

Publication Publication Date Title
US11816126B2 (en) Large scale unstructured database systems
US11341171B2 (en) Method and apparatus for implementing a set of integrated data systems
US11468103B2 (en) Relational modeler and renderer for non-relational data
US8996453B2 (en) Distribution of data in a lattice-based database via placeholder nodes
US7043472B2 (en) File system with access and retrieval of XML documents
CN100468402C (en) Sort data storage and split catalog inquiry method based on catalog tree
US7487191B2 (en) Method and system for model-based replication of data
CN111259006A (en) Universal distributed heterogeneous data integrated physical aggregation, organization, release and service method and system
US20080249995A1 (en) Method and system for attribute management in a namespace
CN104462185B (en) A kind of digital library's cloud storage system based on mixed structure
US20170372043A1 (en) Satellite and central asset registry systems and methods and rights management systems
US10229185B2 (en) Method and system for configuration management of hierarchically-organized unstructured data using associative templates
KR101689782B1 (en) Method for accessing files of a file system according to metadata and device implementing the method
JP2000090077A (en) Dynamic changing method for system configuration of document and document managing method
EP3864504B1 (en) Changeset conflict rebasing
Srivastava et al. Analysis of various NoSql database
CN109542861A (en) File management method, device and system
CN111274294A (en) Universal distributed heterogeneous data integrated logic convergence organization, release and service method and system
JP4167359B2 (en) Data management system and data management method
US20150058363A1 (en) Cloud-based enterprise content management system
KR20220143696A (en) Multi-Valued Primary Keys for Unique Identifiers of Multiple Subjects
CN108363710A (en) Cloud system catalogue with intelligent reconstruction Data View
US8386517B2 (en) System and method for brokering information between a plurality of commercially distinct clients
Atzeni et al. Data modeling across the evolution of database technology
Krstić et al. Testing the performance of NoSQL databases via the database benchmark tool

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20180803

WD01 Invention patent application deemed withdrawn after publication