CN107402988A - A kind of distributed NewSQL Database Systems and Query semi-structured for data method - Google Patents

A kind of distributed NewSQL Database Systems and Query semi-structured for data method Download PDF

Info

Publication number
CN107402988A
CN107402988A CN201710580456.3A CN201710580456A CN107402988A CN 107402988 A CN107402988 A CN 107402988A CN 201710580456 A CN201710580456 A CN 201710580456A CN 107402988 A CN107402988 A CN 107402988A
Authority
CN
China
Prior art keywords
data
user
json
executive plan
hbase
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710580456.3A
Other languages
Chinese (zh)
Other versions
CN107402988B (en
Inventor
晋彤
谭恒亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunrun Da Data Service Co ltd
Original Assignee
Guangzhou Special Road Mdt Infotech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Special Road Mdt Infotech Ltd filed Critical Guangzhou Special Road Mdt Infotech Ltd
Publication of CN107402988A publication Critical patent/CN107402988A/en
Application granted granted Critical
Publication of CN107402988B publication Critical patent/CN107402988B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9017Indexing; Data structures therefor; Storage structures using directory or table look-up
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2379Updates performed during online database operations; commit processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2219Large Object storage; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2272Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2433Query languages
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F16/24534Query rewriting; Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F16/24534Query rewriting; Transformation
    • G06F16/24542Plan optimisation
    • G06F16/24545Selectivity estimation or determination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2471Distributed queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/252Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/319Inverted lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/466Transaction processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • G06F9/5088Techniques for rebalancing the load in a distributed system involving task migration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5022Workload threshold

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Operations Research (AREA)
  • Computing Systems (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Computer And Data Communications (AREA)
  • Devices For Executing Special Programs (AREA)

Abstract

The invention discloses a kind of distributed NewSQL Database Systems, including:Control unit, in a manner of database interface accessing user ask, and by the user request be sent to planning unit;Wherein, user's request includes the querying condition for the JSON data that needs are inquired about;Planning unit, for parsing user's request, executive plan corresponding to compiling and customization;Execution unit, for according to executive plan, starting collaboration processing module and obtaining index data;And tables of data is inquired about according to the index data, obtain Query Result;And the Query Result is returned to described control unit;Hbase units, store the tables of data and concordance list;The Hbase units include the collaboration processing module, for according to the querying condition search index table, obtaining the corresponding index data.The invention also discloses Query semi-structured for data method.The present invention realizes the data query of JSON forms, and effect and performance are bad when solving the problems, such as to handle semi-structured data.

Description

A kind of distributed NewSQL Database Systems and Query semi-structured for data method
Technical field
The present invention relates to big data technical field, more particularly to a kind of distributed NewSQL Database Systems and semi-structured Data query method.
Background technology
Hbase units are one of foremost distributed NoSQL databases in Hadoop ecosystems at present.Hbase is mono- First primary clustering includes HMaster and HRegionsever, provides the user the data model of form types, is drawn by major key scope It is divided into multiple region, HMaster is responsible for and distributed region, and HRegionserver is responsible for the read-write of region data. The data of existing Hbase units storage do not have point of data type, are byte arrays, therefore such as this to storage JSON Semi-structured data can there are problems that in query aspects.JSON formatted datas are stored in Hbase units, then conventional meeting Whole JSON objects are stored as character string.Following defect be present in which:
Want when filter record, it is necessary to which all records are all read out and then filtered in client, in number According to measure it is larger in the case of the performance can not be received.
Will more new record when, it is necessary to record is read out be updated again for specific field after re-write Hbase units are covered.
The content of the invention
The purpose of the embodiment of the present invention is to provide a kind of distributed NewSQL Database Systems and data query method, can be real The data query of existing JSON forms, effect and performance are bad when solving the problems, such as to handle semi-structured data.
To achieve the above object, the embodiments of the invention provide a kind of distributed NewSQL databases, including:
Control unit, in a manner of database interface accessing user ask, and by the user request be sent to meter Draw unit;It is additionally operable to Query Result returning to user;Wherein, user's request includes the inquiry for the JSON data that needs are inquired about Condition, the Query Result are the JSON data that are obtained according to the querying condition;
Planning unit, for parsing user's request, executive plan corresponding to compiling and customization;
Execution unit, for according to executive plan, starting collaboration processing module and obtaining with being looked into described in user request The corresponding index data of inquiry condition;And tables of data is inquired about according to the index data of acquisition, it is corresponding described so as to obtain Query Result;And the Query Result is returned to described control unit;
Hbase units, for storing the tables of data and concordance list, wherein, the bottom increase JSON types of Hbase units Data, the JSON data are stored entirely in bottom HFile;
The Hbase units also include the collaboration processing module, and the collaboration processing module is used for according to the inquiry Condition query concordance list, obtain the corresponding index data;Wherein, stored in the concordance list by the JSON data The index data for the inverted index form that the type nested as one is generated.
Compared with prior art, a kind of distributed NewSQL Database Systems disclosed by the invention, it is single by controlling first Member accessing user in a manner of database interface is asked, and user's request is sent into planning unit;Then planning unit is passed through Parse user's request, executive plan corresponding to compiling and generation;Then, association is started according to executive plan by execution unit Index number corresponding with the querying condition of user request in the concordance list of Hbase units is obtained with processing module According to;And the tables of data of Hbase units is inquired about according to the index data of acquisition, so as to obtain the corresponding Query Result, Obtain JSON data;And the Query Result is returned to described control unit, finally by the skill of control unit return user Art scheme, the data query of JSON forms can be realized, effect and performance are bad when solving the problems, such as to handle semi-structured data.
Further, the distributed NewSQL Database Systems also include:Distributed transaction management device, for when described When being related to distributed transaction in executive plan, coordinate the multi-party completion distributed transaction management in the executive plan.
Further, the Hbase units also include Hbase unit api interfaces, and the execution unit is used for according to acquisition The index data tables of data is inquired about by the Hbase units api interface, so as to the Query Result corresponding to obtaining.
Further, the database interface is JDBC or ODBC.
The embodiment of the invention also discloses a kind of Query semi-structured for data method, based on described in the embodiments of the present invention Distributed NewSQL Database Systems, including:
Control unit, accessing user is asked in a manner of database interface, and user request is sent into plan Unit;Wherein, user's request includes the querying condition for the JSON data that needs are inquired about, and the Query Result is according to The JSON data that querying condition is obtained;
The user is parsed by planning unit to ask, executive plan corresponding to compiling and customization;
By execution unit according to executive plan, start querying condition inquiry described in the collaboration processing module of Hbase units Concordance list, obtain the corresponding index data;Wherein, stored in the concordance list by the JSON data as one The index data for the inverted index form that nested type is generated;The concordance list is stored in the Hbase units;
Tables of data is inquired about according to the index data of acquisition by the execution unit, looked into so as to described corresponding to obtaining Ask result;And Query Result is returned to described control unit;Wherein, the tables of data is stored in the Hbase units; The bottom increase JSON categorical datas of Hbase units, the JSON data are stored entirely in bottom HFile;
The Query Result is returned to by user by described control unit.
Compared with prior art, a kind of Query semi-structured for data method disclosed by the invention, passes through control unit first Accessing user is asked in a manner of database interface, and user's request is sent into planning unit;Then planning unit solution is passed through Analyse user's request, executive plan corresponding to compiling and generation;Then, collaboration is started according to executive plan by execution unit The index data corresponding with the querying condition of user request in the concordance list of processing module acquisition Hbase units; And the tables of data of Hbase units is inquired about according to the index data of acquisition, so as to obtain the corresponding Query Result, that is, obtain Obtain JSON data;And the Query Result is returned to described control unit, finally by the technical side of control unit return user Case, the data query of JSON forms can be realized, effect and performance are bad when solving the problems, such as to handle semi-structured data.
Further, by distributed transaction management device when being related to distributed transaction in the executive plan, institute is coordinated State the multi-party completion distributed transaction management in executive plan.
Further, the Hbase unit APs I of the Hbase units is passed through during the execution unit inquiry tables of data Interface polls tables of data, so as to obtain corresponding Query Result;
Further, the database interface is JDBC or ODBC.
Brief description of the drawings
Fig. 1 is a kind of structural representation for distributed NewSQL databases that the embodiment of the present invention 1 provides;
Fig. 2 is a kind of schematic flow sheet for Query semi-structured for data method that the embodiment of the present invention 2 provides;
Fig. 3 is to generate to perform meter in a kind of step S2 for Query semi-structured for data method that the embodiment of the present invention 2 provides The schematic flow sheet drawn.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, rather than whole embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other under the premise of creative work is not made Embodiment, belong to the scope of protection of the invention.
Referring to Fig. 1, Fig. 1 is a kind of structural representation for distributed NewSQL Database Systems that the embodiment of the present invention 1 provides Figure;The present embodiment 1 includes following structures:
Control unit 1, in a manner of database interface accessing user ask, and by the user request be sent to meter Draw unit 2;It is additionally operable to Query Result returning to user;Wherein, user's request includes looking into for the JSON data that needs are inquired about Inquiry condition, the Query Result are the JSON data that are obtained according to the querying condition;
Planning unit 2, for parsing user's request, executive plan corresponding to compiling and customization;
Execution unit 3, for obtaining described with user request according to executive plan, startup collaboration processing module 41 The corresponding index data of querying condition;And tables of data is inquired about according to the index data of acquisition, so as to obtain corresponding institute State Query Result;And the Query Result is returned to described control unit 1;
Hbase units 4, for storing the tables of data and concordance list, wherein, the bottom increase JSON classes of Hbase units 4 Type data, the JSON data are stored entirely in bottom HFile;
The Hbase units 4 also include the collaboration processing module 41, and the collaboration processing module 41 is used for according to Querying condition search index table, obtain the corresponding index data;Wherein, stored in the concordance list by the JSON The index data for the inverted index form that the data type nested as one is generated.
The present embodiment has increased JSON categorical datas newly in the bottom of Hbase units 4, and in bottom HFile, JSON data are whole Body is stored, and JSON index column is also served as into a nested type when building secondary index and is indexed, therefore energy The arbitrary fields inquiry for JSON is supported, index is created and revises.
Further, the distributed NewSQL Database Systems also include:Distributed transaction management device, for when execution When being related to affairs in the works, coordinate the multi-party completion distributed transaction management in executive plan.Distributed transaction management device utilizes Java issued transactions API (JTA) realizes distributing real time system and transaction management;Wherein, JTA, i.e. Java Transaction API, JTA allow application program perform distributing real time system --- on two or more network computer resources access and Update the data.
Specifically, after user's request of the planning unit 2 for receiving control unit 1, parsing user's request, and pass through height Fast SQL engines compile SQL, then regenerate executive plan.In addition, execution unit 2 returns to after being additionally operable to executive plan generation Control unit 1.And control unit 1 is additionally operable to judge whether needs according to the content of executive plan after executive plan is received The intervention of distributed transaction management device, if it is desired, then start distributed transaction management device.
Further, the Hbase units 4 also include Hbase unit api interfaces, and the execution unit 3 is used for basis and obtained The index data taken inquires about tables of data by the Hbase units api interface, so as to obtain the corresponding inquiry knot Fruit.
Further, the database interface is JDBC or ODBC.
Further, control unit 1 is also connected with a monitor, for being responsible for metadata management and for monitoring bottom Hbase Region load, avoids specific region load too high, and using cooperateing with processing module 41 to redistribute Region。
In addition, control unit 1 is additionally operable to coordinate data communication, the management overall flow between multiple roles.
Wherein, planning unit 2 is used for the process for generating executive plan, specifically includes:
Judge to whether there is the prestore SQL statement corresponding with SQL statement in common buffer pool, if so, then output and SQL Executive plan corresponding to sentence, if it is not, then
Syntax check is carried out to SQL statement, if syntax error returns to error message to user, otherwise,
Semantic test is carried out to SQL statement, if semantic error returns to error message to user, otherwise,
View and expression formula conversion, conversion results corresponding to acquisition are carried out to SQL statement;
Optimizer, optimizer selection result corresponding to acquisition are selected according to transformation result;
According to data connection approach and the order of connection corresponding to the selection of optimizer selection result;
According to connected mode and the path of order of connection selection search;
Executive plan is generated according to searching route, and exports executive plan.
When it is implemented, control unit 1, accessing user asks in a manner of database interface first, and please by user Ask and be sent to planning unit 2;Then parse user by planning unit 2 to ask, executive plan corresponding to compiling and generation;Connect , judged whether according to the content of executive plan to need the intervention of distributed transaction management device by control unit 1, if needed Will, then start distributed transaction management device, the multi-party completion coordinated by distributed transaction management device in executive plan is distributed Transaction management;Then, the rope that collaboration processing module 41 obtains Hbase units 4 is started according to executive plan by execution unit 3 Draw index data corresponding with the querying condition of user request in table;And according to the index number of acquisition it is investigated that The tables of data of Hbase units 4 is ask, so as to obtain the corresponding Query Result, that is, obtains JSON data;And return to the inquiry As a result to described control unit 1, user is returned finally by control unit 1.
The distributed NewSQL Database Systems of the present embodiment can realize the data query of JSON forms, solve processing half hitch The problem of structure data age fruit and bad performance.
Referring to Fig. 2, Fig. 2 is a kind of schematic flow sheet for Query semi-structured for data method that the embodiment of the present invention 2 provides; The distributed NewSQL Database Systems provided based on the embodiment of the present invention 1, the present embodiment 2 are comprised the steps:
S1, control unit 1, accessing user is asked in a manner of database interface, and user request is sent to Planning unit 2;Wherein, user's request includes the querying condition for the JSON data that needs are inquired about, and the Query Result is root The JSON data obtained according to the querying condition;
S2, the user is parsed by planning unit 2 asked, executive plan corresponding to compiling and customization;
S3, by execution unit 3 according to executive plan, inquire about bar described in the collaboration processing module 41 that starts Hbase units 4 Part search index table, obtain the corresponding index data;Wherein, stored in the concordance list and made by the JSON data The index data of the inverted index form generated by a nested type;The concordance list is stored in the Hbase units 4 In;
S4, by the execution unit 3 according to the index data of acquisition inquire about tables of data, so as to obtain corresponding institute State Query Result;And Query Result is returned to described control unit 1;Wherein, the tables of data is stored in the Hbase units 4;The bottom increase JSON categorical datas of Hbase units 4, the JSON data are stored entirely in bottom HFile;
S5, by described control unit 1 by the Query Result return user.
The present embodiment has increased JSON categorical datas newly in the bottom of Hbase units 4, and in bottom HFile, JSON data are whole Body is stored, and JSON index column is also served as into a nested type when building secondary index and is indexed, therefore energy The arbitrary fields inquiry for JSON is supported, index is created and revises.
Further, the present embodiment step S2 is completed after generating executive plan, in addition to executive plan is returned into control Unit 1, by control unit 1 after executive plan is received, it is additionally operable to judge whether to need to be distributed according to the content of executive plan The intervention of formula task manager, if it is desired, then start distributed transaction management device, specifically, passing through distributed transaction management Device coordinates the multi-party completion distributed transaction management in executive plan when being related to affairs in executive plan;If it is not required, then Directly perform step S3.
Further, the Hbase units of the Hbase units 4 are passed through when the execution unit 3 inquires about the tables of data 4API interface polls tables of data, so as to obtain corresponding Query Result;
Further, the database interface is JDBC or ODBC.
Wherein, referring to Fig. 3, Fig. 3 is to be used for the schematic flow sheet that generates executive plan by planning unit 2 in step S2, Specifically include:
S201, judge to whether there is the prestore SQL statement corresponding with SQL statement in common buffer pool, if so, then exporting Executive plan corresponding with SQL statement, if it is not, then
S202, syntax check is carried out to SQL statement, if syntax error returns to error message to user, otherwise,
S203, semantic test is carried out to SQL statement, if semantic error returns to error message to user, otherwise,
S204, view and expression formula conversion, conversion results corresponding to acquisition are carried out to SQL statement;
S205, according to transformation result select optimizer, optimizer selection result corresponding to acquisition;
S206, data connection approach and the order of connection according to corresponding to the selection of optimizer selection result;
S207, the path for selecting to search for according to connected mode and the order of connection;
S208, executive plan generated according to searching route, and export executive plan.
When it is implemented, control unit 1, accessing user asks in a manner of database interface first, and please by user Ask and be sent to planning unit 2;Then parse user by planning unit 2 to ask, executive plan corresponding to compiling and generation;Connect , judged whether according to the content of executive plan to need the intervention of distributed transaction management device by control unit 1, if needed Will, then start distributed transaction management device, the multi-party completion coordinated by distributed transaction management device in executive plan is distributed Transaction management;Then, the rope that collaboration processing module 41 obtains Hbase units 4 is started according to executive plan by execution unit 3 Draw index data corresponding with the querying condition of user request in table;And according to the index number of acquisition it is investigated that The tables of data of Hbase units 4 is ask, so as to obtain the corresponding Query Result, that is, obtains JSON data;And return to the inquiry As a result to described control unit 1, user is returned finally by control unit 1.
The distributed NewSQL Database Systems of the present embodiment can realize the data query of JSON forms, solve processing half hitch The problem of structure data age fruit and bad performance.
Described above is the preferred embodiment of the present invention, it is noted that for those skilled in the art For, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications are also considered as Protection scope of the present invention.

Claims (8)

  1. A kind of 1. distributed NewSQL Database Systems, it is characterised in that including:
    Control unit, in a manner of database interface accessing user ask, and it is single that user request is sent into plan Member;It is additionally operable to Query Result returning to user;Wherein, user's request includes the inquiry bar for the JSON data that needs are inquired about Part, the Query Result are the JSON data that are obtained according to the querying condition;
    Planning unit, for parsing user's request, executive plan corresponding to compiling and customization;
    Execution unit, for according to executive plan, starting collaboration processing module and obtaining the inquiry bar asked with the user The corresponding index data of part;And tables of data is inquired about according to the index data of acquisition, so as to obtain the corresponding inquiry As a result;And the Query Result is returned to described control unit;
    Hbase units, for storing the tables of data and concordance list, wherein, the bottom increase JSON number of types of Hbase units According to the JSON data are stored entirely in bottom HFile;
    The Hbase units also include the collaboration processing module, and the collaboration processing module is used for according to the querying condition Search index table, obtain the corresponding index data;Wherein, stored in the concordance list by the JSON data conduct The index data for the inverted index form that one nested type is generated.
  2. 2. distributed NewSQL Database Systems as claimed in claim 1, it is characterised in that also include:Distributed transaction pipe Device is managed, for when being related to distributed transaction in the executive plan, coordinating the distribution of the multi-party completion in the executive plan Transaction management.
  3. 3. distributed NewSQL Database Systems as claimed in claim 2, it is characterised in that the Hbase units also include Hbase unit api interfaces, the execution unit are used to be connect by the Hbase unit APs I according to the index data of acquisition Mouth inquiry tables of data, so as to obtain the corresponding Query Result.
  4. 4. distributed NewSQL Database Systems as claimed in claim 3, it is characterised in that the database interface is JDBC Or ODBC.
  5. A kind of 5. Query semi-structured for data method, based on the distributed NewSQL numbers described in any one of the claims 1~4 According to storehouse system, it is characterised in that including:
    Control unit, accessing user is asked in a manner of database interface, and user request is sent into plan list Member;Wherein, user's request includes the querying condition for the JSON data that needs are inquired about, and the Query Result is to be looked into according to The JSON data that inquiry condition is obtained;
    The user is parsed by planning unit to ask, executive plan corresponding to compiling and customization;
    By execution unit according to executive plan, start querying condition search index described in the collaboration processing module of Hbase units Table, obtain the corresponding index data;Wherein, stored in the concordance list by the JSON data as a nesting The index data of inverted index form that is generated of type;The concordance list is stored in the Hbase units;
    Tables of data is inquired about according to the index data of acquisition by the execution unit, so as to obtain the corresponding inquiry knot Fruit;And Query Result is returned to described control unit;Wherein, the tables of data is stored in the Hbase units;Hbase is mono- The bottom increase JSON categorical datas of member, the JSON data are stored entirely in bottom HFile;
    The Query Result is returned to by user by described control unit.
  6. 6. a kind of Query semi-structured for data method as claimed in claim 5, it is characterised in that pass through distributed transaction management Device coordinates the multi-party completion distributed transaction pipe in the executive plan when being related to distributed transaction in the executive plan Reason.
  7. 7. a kind of Query semi-structured for data method as claimed in claim 6, it is characterised in that the execution unit inquires about institute Tables of data is inquired about by the Hbase units api interface of the Hbase units when stating tables of data, so as to obtain corresponding inquiry knot Fruit.
  8. 8. a kind of Query semi-structured for data method as claimed in claim 7, it is characterised in that the database interface is JDBC or ODBC.
CN201710580456.3A 2016-09-21 2017-07-17 Distributed NewSQL database system and semi-structured data query method Expired - Fee Related CN107402988B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2016108423997 2016-09-21
CN201610842399.7A CN106446153A (en) 2016-09-21 2016-09-21 Distributed newSQL database system and method

Publications (2)

Publication Number Publication Date
CN107402988A true CN107402988A (en) 2017-11-28
CN107402988B CN107402988B (en) 2020-01-03

Family

ID=58166840

Family Applications (24)

Application Number Title Priority Date Filing Date
CN201610842399.7A Pending CN106446153A (en) 2016-09-21 2016-09-21 Distributed newSQL database system and method
CN201710585103.2A Expired - Fee Related CN107402995B (en) 2016-09-21 2017-07-17 Distributed newSQL database system and method
CN201710580431.3A Active CN107491485B (en) 2016-09-21 2017-07-17 Method for generating execution plan, plan unit device and distributed NewSQ L database system
CN201710580791.3A Active CN107291948B (en) 2016-09-21 2017-07-17 Access method of distributed newSQL database
CN201710580416.9A Expired - Fee Related CN107291947B (en) 2016-09-21 2017-07-17 Semi-structured data query method and distributed NewSQL database system
CN201710580456.3A Expired - Fee Related CN107402988B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and semi-structured data query method
CN201710580403.1A Expired - Fee Related CN107368575B (en) 2016-09-21 2017-07-17 Load-balanced distributed NewSQL database system
CN201710581275.2A Active CN107329837B (en) 2016-09-21 2017-07-17 Load balancing method and unit and distributed NewSQL database system
CN201710581193.8A Expired - Fee Related CN107451219B (en) 2016-09-21 2017-07-17 Method for analyzing second index and distributed New SQL database
CN201710581273.3A Expired - Fee Related CN107451221B (en) 2016-09-21 2017-07-17 Database interface unit device and distributed NewSQL database system
CN201710580435.1A Expired - Fee Related CN107480198B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and full-text retrieval method
CN201710580796.6A Expired - Fee Related CN107402992B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and full-text retrieval establishing method
CN201710580423.9A Active CN107402987B (en) 2016-09-21 2017-07-17 Full-text retrieval method and distributed NewSQL database system
CN201710580417.3A Expired - Fee Related CN107463632B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and data query method
CN201710580739.8A Expired - Fee Related CN107402990B (en) 2016-09-21 2017-07-17 Distributed New SQL database system and semi-structured data storage method
CN201710580720.3A Expired - Fee Related CN107402989B (en) 2016-09-21 2017-07-17 Full-text retrieval establishing method and distributed NewSQL database system
CN201710580794.7A Expired - Fee Related CN107451214B (en) 2016-09-21 2017-07-17 Non-primary key query method and distributed NewSQL database system
CN201710581291.1A Expired - Fee Related CN107463637B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and data storage method
CN201710581229.2A Expired - Fee Related CN107491345B (en) 2016-09-21 2017-07-17 Method for writing picture data and distributed NewSQ L database system
CN201710581256.XA Expired - Fee Related CN107391653B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and picture data storage method
CN201710581237.7A Expired - Fee Related CN107463635B (en) 2016-09-21 2017-07-17 Method for inquiring picture data and distributed NewSQL database system
CN201710580754.2A Expired - Fee Related CN107402991B (en) 2016-09-21 2017-07-17 Method for writing semi-structured data and distributed NewSQL database system
CN201710580752.3A Expired - Fee Related CN107247808B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and picture data query method
CN201710581195.7A Expired - Fee Related CN107451220B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system

Family Applications Before (5)

Application Number Title Priority Date Filing Date
CN201610842399.7A Pending CN106446153A (en) 2016-09-21 2016-09-21 Distributed newSQL database system and method
CN201710585103.2A Expired - Fee Related CN107402995B (en) 2016-09-21 2017-07-17 Distributed newSQL database system and method
CN201710580431.3A Active CN107491485B (en) 2016-09-21 2017-07-17 Method for generating execution plan, plan unit device and distributed NewSQ L database system
CN201710580791.3A Active CN107291948B (en) 2016-09-21 2017-07-17 Access method of distributed newSQL database
CN201710580416.9A Expired - Fee Related CN107291947B (en) 2016-09-21 2017-07-17 Semi-structured data query method and distributed NewSQL database system

Family Applications After (18)

Application Number Title Priority Date Filing Date
CN201710580403.1A Expired - Fee Related CN107368575B (en) 2016-09-21 2017-07-17 Load-balanced distributed NewSQL database system
CN201710581275.2A Active CN107329837B (en) 2016-09-21 2017-07-17 Load balancing method and unit and distributed NewSQL database system
CN201710581193.8A Expired - Fee Related CN107451219B (en) 2016-09-21 2017-07-17 Method for analyzing second index and distributed New SQL database
CN201710581273.3A Expired - Fee Related CN107451221B (en) 2016-09-21 2017-07-17 Database interface unit device and distributed NewSQL database system
CN201710580435.1A Expired - Fee Related CN107480198B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and full-text retrieval method
CN201710580796.6A Expired - Fee Related CN107402992B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and full-text retrieval establishing method
CN201710580423.9A Active CN107402987B (en) 2016-09-21 2017-07-17 Full-text retrieval method and distributed NewSQL database system
CN201710580417.3A Expired - Fee Related CN107463632B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and data query method
CN201710580739.8A Expired - Fee Related CN107402990B (en) 2016-09-21 2017-07-17 Distributed New SQL database system and semi-structured data storage method
CN201710580720.3A Expired - Fee Related CN107402989B (en) 2016-09-21 2017-07-17 Full-text retrieval establishing method and distributed NewSQL database system
CN201710580794.7A Expired - Fee Related CN107451214B (en) 2016-09-21 2017-07-17 Non-primary key query method and distributed NewSQL database system
CN201710581291.1A Expired - Fee Related CN107463637B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and data storage method
CN201710581229.2A Expired - Fee Related CN107491345B (en) 2016-09-21 2017-07-17 Method for writing picture data and distributed NewSQ L database system
CN201710581256.XA Expired - Fee Related CN107391653B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and picture data storage method
CN201710581237.7A Expired - Fee Related CN107463635B (en) 2016-09-21 2017-07-17 Method for inquiring picture data and distributed NewSQL database system
CN201710580754.2A Expired - Fee Related CN107402991B (en) 2016-09-21 2017-07-17 Method for writing semi-structured data and distributed NewSQL database system
CN201710580752.3A Expired - Fee Related CN107247808B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system and picture data query method
CN201710581195.7A Expired - Fee Related CN107451220B (en) 2016-09-21 2017-07-17 Distributed NewSQL database system

Country Status (1)

Country Link
CN (24) CN106446153A (en)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107391744B (en) * 2017-08-10 2020-06-16 东软集团股份有限公司 Data storage method, data reading method, data storage device, data reading device and equipment
CN107480260B (en) * 2017-08-16 2021-02-23 北京奇虎科技有限公司 Big data real-time analysis method and device, computing equipment and computer storage medium
CN107688660B (en) * 2017-09-08 2020-03-13 上海达梦数据库有限公司 Parallel execution plan execution method and device
CN107766572A (en) * 2017-11-13 2018-03-06 北京国信宏数科技有限责任公司 Distributed extraction and visual analysis method and system based on economic field data
CN108228750A (en) * 2017-12-21 2018-06-29 浪潮软件股份有限公司 A kind of distributed data base and its method that data are managed
CN108038215A (en) * 2017-12-22 2018-05-15 上海达梦数据库有限公司 Data processing method and system
CN109992409B (en) * 2018-01-02 2021-07-30 ***通信有限公司研究院 Method, device and system for segmenting data storage area, electronic equipment and medium
CN108829507B (en) * 2018-03-30 2019-07-26 北京百度网讯科技有限公司 The resource isolation method, apparatus and server of distributed data base system
CN108664616A (en) * 2018-05-14 2018-10-16 浪潮软件集团有限公司 ROWID-based Oracle data batch acquisition method
CN108846044A (en) * 2018-05-30 2018-11-20 浪潮软件股份有限公司 A kind of map application dispositions method and device
CN108920519A (en) * 2018-06-04 2018-11-30 贵州数据宝网络科技有限公司 One-to-many data supply system and method
CN109033209B (en) * 2018-06-29 2021-12-31 新华三大数据技术有限公司 Spark storage process processing method and device
CN109241076A (en) * 2018-08-01 2019-01-18 上海依图网络科技有限公司 A kind of data query method and device
CN109271428A (en) * 2018-09-11 2019-01-25 北京市计算中心 Data pick-up method and method for exhibiting data based on geography information
CN109408591B (en) * 2018-10-12 2021-11-09 北京聚云位智信息科技有限公司 Decision-making distributed database system supporting SQL (structured query language) driven AI (Artificial Intelligence) and feature engineering
CN109298976B (en) * 2018-10-17 2022-04-12 成都索贝数码科技股份有限公司 Heterogeneous database cluster backup system and method
CN109408515A (en) * 2018-11-01 2019-03-01 郑州云海信息技术有限公司 A kind of index execution method and apparatus
CN109684412A (en) * 2018-12-25 2019-04-26 成都虚谷伟业科技有限公司 A kind of distributed data base system
CN109726250B (en) * 2018-12-27 2020-01-17 星环信息科技(上海)有限公司 Data storage system, metadata database synchronization method and data cross-domain calculation method
CN111488340B (en) * 2019-01-29 2023-09-12 菜鸟智能物流控股有限公司 Data processing method and device and electronic equipment
CN110046161A (en) * 2019-03-18 2019-07-23 平安普惠企业管理有限公司 Method for writing data and device, storage medium, electronic equipment
CN110086602B (en) * 2019-04-16 2022-02-11 上海交通大学 Rapid implementation method of SM3 password hash algorithm based on GPU
CN110110234B (en) * 2019-05-13 2020-10-16 重庆天蓬网络有限公司 Big data real-time searching system and method
CN110275901B (en) * 2019-06-25 2021-08-24 北京创鑫旅程网络技术有限公司 Cache data calling method and device
CN110457363B (en) * 2019-07-05 2023-11-21 中国平安人寿保险股份有限公司 Query method, device and storage medium based on distributed database
CN110413642B (en) * 2019-08-02 2022-05-27 北京快立方科技有限公司 Application-unaware fragmentation database parsing and optimizing method
CN110569257B (en) * 2019-09-16 2022-04-01 上海达梦数据库有限公司 Data processing method, corresponding device, equipment and storage medium
CN110704437B (en) * 2019-09-26 2022-05-20 上海达梦数据库有限公司 Method, device, equipment and storage medium for modifying database query statement
CN112688976A (en) * 2019-10-17 2021-04-20 广州迈安信息科技有限公司 Data processing transmission service system adopting JDBC/HTTP standard
CN110888919B (en) * 2019-12-04 2023-06-30 阳光电源股份有限公司 HBase-based method and device for statistical analysis of big data
CN113032479A (en) * 2019-12-24 2021-06-25 上海昂创信息技术有限公司 HBase non-primary key indexing method and HBase system
CN111309581B (en) * 2020-02-28 2023-09-12 中国工商银行股份有限公司 Application performance detection method and device in database upgrading scene
CN111651453B (en) * 2020-04-30 2024-02-06 中国平安财产保险股份有限公司 User history behavior query method and device, electronic equipment and storage medium
CN113760960A (en) * 2020-06-01 2021-12-07 北京搜狗科技发展有限公司 Information generation method and device for generating information
CN111797112B (en) * 2020-06-05 2022-04-01 武汉大学 PostgreSQL preparation statement execution optimization method
CN113806611A (en) * 2020-06-17 2021-12-17 海信集团有限公司 Method and equipment for storing search engine results
CN111930705B (en) * 2020-07-07 2023-03-14 中国电子科技集团公司电子科学研究院 Binary message protocol data processing method and device
CN112148792B (en) * 2020-09-16 2024-04-12 鹏城实验室 Partition data adjustment method, system and terminal based on HBase
CN112052347B (en) * 2020-10-09 2024-06-04 北京百度网讯科技有限公司 Image storage method and device and electronic equipment
CN112416925B (en) * 2020-11-02 2024-04-09 浙商银行股份有限公司 Query method based on ordered distributed index structure and distributed database system
CN112364033B (en) * 2021-01-13 2021-04-13 北京云真信科技有限公司 Data retrieval system
CN113760900A (en) * 2021-02-19 2021-12-07 西安京迅递供应链科技有限公司 Method and device for real-time data summarization and interval summarization
CN112905615B (en) * 2021-03-02 2023-03-24 浪潮云信息技术股份公司 Distributed consistency protocol submission method and system based on sequence verification
CN112925841B (en) * 2021-03-26 2022-11-08 瀚高基础软件股份有限公司 Distributed JDBC implementation method, device and computer-readable storage medium
CN113407662B (en) * 2021-08-19 2021-12-14 深圳市明源云客电子商务有限公司 Sensitive word recognition method, system and computer readable storage medium
CN113742370B (en) * 2021-11-02 2022-04-19 阿里云计算有限公司 Data query method and statistical information ciphertext generation method of full-encryption database
CN115129724A (en) * 2022-08-29 2022-09-30 畅捷通信息技术股份有限公司 Statistical report paging method, system, equipment and medium
CN116861455B (en) * 2023-06-25 2024-04-26 上海数禾信息科技有限公司 Event data processing method, system, electronic device and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902932A (en) * 2012-09-18 2013-01-30 武汉华工安鼎信息技术有限责任公司 Structured query language (SQL) rewrite based database external encryption/decryption system and usage method thereof
CN104731945A (en) * 2015-03-31 2015-06-24 浪潮集团有限公司 Full-text searching method and device based on HBase

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101477568A (en) * 2009-02-12 2009-07-08 清华大学 Integrated retrieval method for structured data and non-structured data
CN101567006B (en) * 2009-05-25 2012-07-04 中兴通讯股份有限公司 Database system and distributed SQL statement execution plan reuse method
CN102163195B (en) * 2010-02-22 2013-04-24 北京东方通科技股份有限公司 Query optimization method based on unified view of distributed heterogeneous database
CN102375853A (en) * 2010-08-24 2012-03-14 ***通信集团公司 Distributed database system, method for building index therein and query method
CN102201010A (en) * 2011-06-23 2011-09-28 清华大学 Distributed database system without sharing structure and realizing method thereof
CN102289482A (en) * 2011-08-02 2011-12-21 北京航空航天大学 Unstructured data query method
CN103150304B (en) * 2011-12-06 2016-11-23 郑红云 Cloud Database Systems
CN103577407B (en) * 2012-07-19 2016-10-12 国际商业机器公司 Querying method and inquiry unit for distributed data base
US20140074860A1 (en) * 2012-09-12 2014-03-13 Pingar Holdings Limited Disambiguator
CN103092970A (en) * 2013-01-24 2013-05-08 华为技术有限公司 Database operation method and device
US9773021B2 (en) * 2013-01-30 2017-09-26 Hewlett-Packard Development Company, L.P. Corrected optical property value-based search query
CN103377292B (en) * 2013-07-02 2017-02-15 华为技术有限公司 Database result set caching method and device
US20150039587A1 (en) * 2013-07-31 2015-02-05 Oracle International Corporation Generic sql enhancement to query any semi-structured data and techniques to efficiently support such enhancements
CN103473321A (en) * 2013-09-12 2013-12-25 华为技术有限公司 Database management method and system
CN104794123B (en) * 2014-01-20 2018-07-27 阿里巴巴集团控股有限公司 A kind of method and device building NoSQL database indexes for semi-structured data
CN103984726B (en) * 2014-05-16 2017-03-29 上海新炬网络信息技术有限公司 A kind of local correction method of data base's implement plan
CN104133858B (en) * 2014-07-15 2017-08-01 武汉邮电科学研究院 Intelligence analysis system with double engines and method based on row storage
CN104503985A (en) * 2014-12-03 2015-04-08 浪潮电子信息产业股份有限公司 Method for automatically creating Solr index file by Hbase data
CN104572895B (en) * 2014-12-24 2018-02-23 天津南大通用数据技术股份有限公司 MPP databases and Hadoop company-datas interoperability methods, instrument and implementation method
CN104731922A (en) * 2015-03-26 2015-06-24 江苏物联网研究发展中心 System and method for rapidly retrieving structural data based on distributed type database HBase
CN104750815B (en) * 2015-03-30 2017-11-03 浪潮集团有限公司 The storage method and device of a kind of Lob data based on HBase
CN105389375B (en) * 2015-11-18 2018-10-02 福建师范大学 A kind of image index setting method, system and search method based on visible range
CN105740410A (en) * 2016-01-29 2016-07-06 浪潮电子信息产业股份有限公司 Data statistics method based on Hbase secondary index

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902932A (en) * 2012-09-18 2013-01-30 武汉华工安鼎信息技术有限责任公司 Structured query language (SQL) rewrite based database external encryption/decryption system and usage method thereof
CN104731945A (en) * 2015-03-31 2015-06-24 浪潮集团有限公司 Full-text searching method and device based on HBase

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
APACHEPHOENIX,APACHE.ORG: "Apache Phoenix", 《HTTP://PHOENIX.APACHE.ORG/PRESENTATIONS/OC-HUG-2014-10-4X3.PDF》 *
吴国泉: "基于HBase的全文索引及检索技术的研究", 《万方数据库学位论文》 *

Also Published As

Publication number Publication date
CN107247808B (en) 2020-01-10
CN107402992B (en) 2020-06-09
CN107480198A (en) 2017-12-15
CN107402991A (en) 2017-11-28
CN107451219B (en) 2020-06-09
CN107451220A (en) 2017-12-08
CN107451220B (en) 2020-06-09
CN107402990A (en) 2017-11-28
CN107451214A (en) 2017-12-08
CN107402992A (en) 2017-11-28
CN107463637B (en) 2020-05-19
CN107391653B (en) 2020-05-19
CN107451221B (en) 2020-09-04
CN107463632B (en) 2020-06-09
CN107480198B (en) 2020-05-19
CN107402990B (en) 2020-06-09
CN107402987A (en) 2017-11-28
CN107402987B (en) 2020-04-03
CN107391653A (en) 2017-11-24
CN107491485B (en) 2020-08-04
CN107291947B (en) 2020-03-10
CN107291948A (en) 2017-10-24
CN107291947A (en) 2017-10-24
CN107329837B (en) 2020-06-09
CN107402995B (en) 2020-06-09
CN107451221A (en) 2017-12-08
CN107491345A (en) 2017-12-19
CN107491485A (en) 2017-12-19
CN107451219A (en) 2017-12-08
CN107463637A (en) 2017-12-12
CN107463635B (en) 2020-09-25
CN107368575B (en) 2020-06-09
CN107491345B (en) 2020-08-04
CN107402988B (en) 2020-01-03
CN107463632A (en) 2017-12-12
CN107247808A (en) 2017-10-13
CN107402995A (en) 2017-11-28
CN107463635A (en) 2017-12-12
CN107451214B (en) 2020-05-19
CN107329837A (en) 2017-11-07
CN107368575A (en) 2017-11-21
CN107402989B (en) 2020-10-27
CN107291948B (en) 2020-05-19
CN106446153A (en) 2017-02-22
CN107402991B (en) 2020-05-19
CN107402989A (en) 2017-11-28

Similar Documents

Publication Publication Date Title
CN107402988A (en) A kind of distributed NewSQL Database Systems and Query semi-structured for data method
US11755575B2 (en) Processing database queries using format conversion
US11681702B2 (en) Conversion of model views into relational models
CN109299102B (en) HBase secondary index system and method based on Elastcissearch
CN106547796B (en) Database execution method and device
CN104123374B (en) The method and device of aggregate query in distributed data base
US10585887B2 (en) Multi-system query execution plan
US8650181B2 (en) OLAP execution model using relational operations
CN103455540B (en) The system and method for generating memory model from data warehouse model
US20110010379A1 (en) Database system with query interception and redirection
US20120215810A1 (en) Database query mechanism using links as an aggregate base
CN106484694B (en) Full-text search method and system based on distributed data base
US20170068703A1 (en) Local database cache
US20170060539A1 (en) Native access plan source code generation
CN107368477A (en) The method and system of class SQL query based on HBase coprocessors
CN110413642A (en) A kind of parsing of fragment data library and optimization method using unaware

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20191204

Address after: Room 5303, 1023 Gaopu Road, Tianhe Software Park, Tianhe District, Guangzhou City, Guangdong 510000

Applicant after: Yunrun Da Data Service Co.,Ltd.

Address before: 510000 Yuexiu District, Guangzhou Province, north of the text of the text of the North Road, No. 68, the east wing of the text of the building on the ground floor, No. six, No. 602, No.

Applicant before: GUANGZHOU TEDAO INFORMATION TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200103

Termination date: 20210717

CF01 Termination of patent right due to non-payment of annual fee