CN111400556A - Data query method and device, computer equipment and storage medium - Google Patents

Data query method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN111400556A
CN111400556A CN202010152795.3A CN202010152795A CN111400556A CN 111400556 A CN111400556 A CN 111400556A CN 202010152795 A CN202010152795 A CN 202010152795A CN 111400556 A CN111400556 A CN 111400556A
Authority
CN
China
Prior art keywords
data
data set
organization
field
retrieval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010152795.3A
Other languages
Chinese (zh)
Inventor
汤奇峰
汤明嘉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Data Exchange Corp
Original Assignee
Shanghai Data Exchange Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Data Exchange Corp filed Critical Shanghai Data Exchange Corp
Priority to CN202010152795.3A priority Critical patent/CN111400556A/en
Publication of CN111400556A publication Critical patent/CN111400556A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9035Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A data query method and device, computer equipment and storage medium are provided, the method comprises: receiving a retrieval request, and extracting a retrieval keyword from the retrieval request; identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data; inquiring a data set corresponding to the target data according to the identified data organization and the data field; and obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set. By the method, an efficient data management system can be established to improve the retrieval efficiency of the data.

Description

Data query method and device, computer equipment and storage medium
Technical Field
The invention relates to the technical field of computer data processing, in particular to a data query method and device, computer equipment and a storage medium.
Background
With the development of computer technology and big data technology, data query and management are applied in the aspects of people's life. However, in the existing data query method, the corresponding full-text retrieval is usually performed directly according to the keywords, and the search engine needs to perform full-text traversal on the document to be retrieved, so that the retrieval efficiency is low.
Disclosure of Invention
The invention solves the technical problem of how to establish an efficient data management system to improve the retrieval and search efficiency.
In order to solve the above technical problem, an embodiment of the present invention provides a data query method, where the method includes: receiving a retrieval request, and extracting a retrieval keyword from the retrieval request; identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data; inquiring a data set corresponding to the target data according to the identified data organization and the data field; and obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set.
Optionally, after identifying the data organization and the data field corresponding to the target data according to the query keyword, the method further includes: determining the limit condition of the retrieval request according to the identified data organization and data field; and checking the retrieval key words according to the limiting conditions, and if the checking is passed, continuously executing the data set corresponding to the target data inquired according to the identified data organization and data field.
Optionally, after the checking the search keyword according to the limiting condition, the method further includes: and if the verification fails, generating an error message and outputting the error message.
Optionally, the querying a data set corresponding to the target data according to the identified data organization and data field includes: and querying a data set corresponding to the query key by using a search server by taking the identified data organization and data field as the query key, wherein the search server comprises the corresponding relation between the query key and the data set.
Optionally, the method further includes: and querying a data set corresponding to the retrieval keyword by using the search server.
Optionally, the method further includes: receiving a data set to be input; identifying data organization and data fields corresponding to the data set to be input; and inputting the data set to be input according to the identified data organization and the data field, and generating an access link of the data set to be input.
Optionally, after identifying the data organization and the data field corresponding to the data set to be entered, the method further includes: carrying out validity check on the data set to be input; and if the verification is passed, continuing to execute the data set to be recorded according to the identified data organization and data field.
Optionally, the entering the data set to be entered according to the identified data organization and data field includes: and inputting the data set to be input through a stream processing platform, wherein the stream processing platform is used for executing distributed data input.
Optionally, after the data set to be entered is entered according to the identified data organization and data field, the method further includes: and if the logging is not successful, executing rollback operation and outputting a logging failure message.
An embodiment of the present invention further provides a data query device, where the device includes: the request receiving module is used for receiving a retrieval request and extracting a retrieval keyword from the retrieval request; the identification module is used for identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field to which the target data belongs; the data set query module is used for querying a data set corresponding to the target data according to the identified data organization and the data field; and the result feedback module is used for acquiring an access link of the data set and outputting the access link, and the access link is used for accessing the data set to acquire the target data from the data set.
An embodiment of the present invention further provides a computer device, which includes a memory and a processor, where the memory stores a computer program, and the processor implements the steps of any one of the above methods when executing the computer program.
An embodiment of the present invention further provides a storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of any one of the methods described above.
Compared with the prior art, the technical scheme of the embodiment of the invention has the following beneficial effects:
the embodiment of the invention provides a data query method, which comprises the following steps: receiving a retrieval request, and extracting a retrieval keyword from the retrieval request; identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data; inquiring a data set corresponding to the target data according to the identified data organization and the data field; and obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set. Compared with the prior art, in the scheme of the invention, when a user needs to inquire certain target data, only the retrieval request needs to be lifted to the data management platform, and the data management platform can automatically extract the retrieval key words according to the request to obtain the data organization and the data field corresponding to the target data, so as to identify the data set corresponding to the target data and return the retrieval result to the user in the mode of the access link corresponding to the obtained data. According to the scheme, data organization, a data field and a data set are taken as units for cross data management, data entry, storage and retrieval are provided from dimensions such as data attributes, data fields, data sources and the like, and an efficient data management system is established so as to improve the data retrieval efficiency.
Further, for the received retrieval request, after identifying the corresponding data organization and data field, the data query platform can first determine whether the retrieval request is legal according to the corresponding limiting conditions, and for the illegal request, the data query platform ends the query process and outputs an error message, so as to save the data calculation amount of the platform and improve the running speed of the platform.
Furthermore, the relation among data organization, data fields and data sets is established through the search server, and the data query operation is realized according to the characteristics of full-text search, accurate search and the like of the search server.
Furthermore, when data is input into the platform, the legality of the data to be input can be checked firstly, so that the legality and the queryability of the data input into the platform are guaranteed.
Furthermore, the data management efficiency is improved by adopting an advanced distributed open source system architecture, and a multi-layer data set management system is convenient for data query.
Drawings
FIG. 1 is a flow chart of a data query method according to an embodiment of the present invention;
FIG. 2 is a partial flow diagram of a data query method according to an embodiment of the present invention;
FIG. 3 is a partial flow diagram of another data query method according to an embodiment of the invention;
fig. 4 is a schematic structural diagram of a data query device according to an embodiment of the present invention.
Detailed Description
As a background, the existing data query methods are inefficient in retrieval.
For example, an open data platform (opendata in china) built by the shanghai data transaction center is a portal platform for open data resources and industry applications in china, and the platform provides an intelligent data search engine, can provide users with intelligent search of open data resources in different fields such as government affairs and enterprises, and helps a data resource demander to solve the problem of 'where data is'. The platform also provides an industry information and data reference scene solution related to the open data, and provides industry news and application scenes using the open data to industry companies to help enterprises to utilize and create the social value of the open data. In order to meet the requirements of data set management and data information release of an open data platform and realize a service function, an open data set management background with large-scale data set management capability needs to be constructed.
Based on the current situation, an open data set management background with large-scale data set management capability needs to be established, and the following problems mainly exist:
problem 1: data entry and storage problems; as open source data resources of various government and civil websites need to be crawled and updated regularly, the platform needs to realize the function of crawling and entering updated data regularly, and meanwhile, a manual data entry interface is reserved. Meanwhile, the original opendata website does not provide the cloud storage and downloading functions of the data set, and with the development of services, the cloud data set and algorithm resources need to be integrated, and a background is required to have the cloud storage and calling functions aiming at the large-capacity data set.
Problem 2: a data management problem; the opendata website provides intelligent retrieval functions of a plurality of open data sets, a complete and efficient data management system needs to be established, newly and variably adding and changing of the open data sets are effectively managed, and retrieval and searching efficiency is improved.
Problem 3: issue of information issue;
at present, the information of the opendata website depends on a Content Management System (CMS), which is completely separated from the data set Management, so that the later operation and maintenance are greatly inconvenient, and the information Management function needs to be integrated into a new Management background.
To solve the above technical problem, an embodiment of the present invention provides a data query method, including: receiving a retrieval request, and extracting a retrieval keyword from the retrieval request; identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data; inquiring a data set corresponding to the target data according to the identified data organization and the data field; and obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set. By the method, an efficient data management system can be established to improve the retrieval efficiency of the data.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail below.
In an embodiment, please refer to fig. 1, fig. 1 is a schematic flow chart of a data query method in an embodiment, where the method may include the following steps:
s101, receiving a retrieval request, and extracting a retrieval keyword from the retrieval request;
the retrieval request is a request for acquiring target data from the data query platform, may be a request input by a user at an input end of the data query platform, and may also be a request initiated by the user to the data query platform through a terminal. The data query platform is a platform integrating data entry, data processing and storage and data query, and is the basis for realizing data query.
After the data query platform receives the retrieval request, the retrieval key word needs to be extracted from the retrieval request according to the established identification rule, so as to continue the following steps. The recognition rule can be realized according to technologies such as natural language recognition or keyword recognition.
S102, identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data;
the data organization is a logic main body for managing the input data of the data query platform, the data organization is used for representing the attributes of the data, the attributes are used for describing the characteristics of the data so as to classify the data according to the characteristics, and the data characteristics represented by the attributes are irrelevant to the field to which the data belong. For example, the attribute of the data may be a subject related to the data (such as a company, an organization name, etc.), or may be a function related to the data (such as financing, etc.).
The data field is another logic main body for managing the entered data by the data query platform, and represents the field of the data, and the field of the data can be set according to the industry field (such as finance, news information, government affair data and the like).
The data query platform can acquire the category of the target data queried this time in the platform, namely the data organization and the data field according to the retrieval key words so as to acquire the corresponding data from the platform.
S103, inquiring a data set corresponding to the target data according to the identified data organization and the data field;
the data set is the minimum unit of data recorded by the data query platform management. The data entered by the data query platform is mostly obtained by crawling from websites or other databases, and the data set can be divided according to the source (such as websites or databases) of the crawled data and can also be divided according to the entered data volume. The data set contains detailed information and storage addresses of a group of data.
The data organization is an upper level directory of data sets, each data set having one and only one corresponding data organization. The data fields are a tagged management system of the data sets, each data set can correspond to 0, 1 or more data fields, and rapid retrieval and query are facilitated.
And the data query platform positions the target data to a corresponding data set according to the identified data organization and the data field and uses the target data as a return value of the retrieval request.
S104, obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set.
The data retrieval platform outputs the access link of the located data set to complete the operation of the data query, wherein the access link of the data set can be a webpage (such as the Hypertext Markup language (HTM L) format) or a data storage path, and the like.
In the data query method, when a user needs to query certain target data, the data management platform can automatically extract the search keyword according to the request only by lifting the search request to the data management platform so as to obtain the data organization and the data field corresponding to the target data, thereby identifying the data set corresponding to the target data and returning the search result to the user in the mode of the access link corresponding to the obtained data. According to the scheme, data organization, a data field and a data set are taken as units for cross data management, data entry, storage and retrieval are provided from dimensions such as data attributes, data fields, data sources and the like, and an efficient data management system is established so as to improve the data retrieval efficiency.
In the data query platform, the data organization and the data field in the data query platform can be modified and deleted according to requirements, or the existing limit conditions corresponding to the data organization and the data field can be modified and deleted.
In one example, the new addition operation for the new addition data organization is: and inputting the name of the data organization in the newly added page, wherein the name is not empty and unique, and popping up a dialog box after clicking a 'confirm newly added' button to prompt the information of success or failure of newly added. If the new adding is successful, recording the operation information in the operation record of the data query platform, and clicking a 'confirm' button to jump to a page of 'data organization retrieval'; and if the new addition fails, clicking a 'confirm' button to close the dialog box and staying at the current new page.
The retrieval operation of the data organization may include: the data organization retrieval function only retrieves the data organization name field and supports fuzzy search; if the search box is empty, searching the full data organization by default, and displaying the full data organization when the page is loaded; each page displays 16 rows of data organization records, if exceeding, page display and the like
The data organization modification operation may include: the name of the modified data organization is entered in a page of the data organization, and the name field is non-empty and unique. And clicking a 'modification confirmation' button to pop up a dialog box to prompt the modification success or failure information. If the modification is successful, recording the operation information in the operation record, and clicking a 'confirm' button to jump to a 'data organization retrieval' page; and if the modification fails, clicking an 'confirm' button to close the dialog box, and staying at the current page.
The data organization deletion operation may include: and after clicking a 'delete' button on a page of a data organization, popping up a dialog box to prompt the information of successful or failed deletion. If any data set exists in the current data field, the deletion fails. If the deletion is successful, recording the operation information in the operation record, and clicking a 'confirm' button to jump to a 'data organization retrieval' page; and if the deletion fails, clicking a 'confirm' button to close the dialog box and staying at the current page. The delete action is a logical delete.
Operations of adding, modifying, retrieving and deleting in the data domain can be referred to corresponding operations of data organization.
In addition, the data query platform can also perform operations such as adding, modifying, querying and deleting on the data set.
For example: the operation of adding a new data set may include: and inputting a data set name on the newly added page, wherein the data set name and the data organization field are not null. The selectable value of the data organization is all data organizations in the current platform, the selectable value of the data field label is all data fields in the current platform, and the selectable value of the data format is pre-bottoming. The data field label and the data format support a plurality of labels, and a selection box is added by clicking an 'add label' button. When the new adding action is executed, the selecting box with the value of 'please select' is regarded as invalid, if the values of a plurality of selecting boxes are equal, the duplication is removed, and only one of the selecting boxes is regarded as valid. And clicking a 'confirm newly added' button to pop up a dialog box to prompt the information of success or failure of newly added. If the new adding is successful, recording the operation information in the operation record, updating the system modification time field, and clicking the 'confirm' button to jump to a 'data set retrieval' page; and if the new adding fails, clicking a 'confirm' button to close the dialog box and staying at the current page.
The operation of data set retrieval may include: inputting a searched keyword in a search box; the retrieval conditions may include: in the function of retrieving the data set, any query element can be set to be empty, if a certain element is set to be empty, the element is not used as a constraint condition in the query, and if all the elements are set to be empty, the whole data set is queried.
The data set modification time refers to the time when a record corresponding to a certain data set is modified in the data query platform, for example, the time field is updated by importing, updating, adding, deleting and other operations from a crawler database, and the time can be accurate to seconds.
As a non-limiting example, the search results of the data set may be displayed 16 per page, with more than 16 pages being displayed in pages. Because of more fields, the query result table needs to support a lower sliding bar, and the width of each column of the table can be manually adjusted at the front end as much as possible. Clicking the 'modification' in the operation column to enter a modification interface of the corresponding data set. Four fields of the data set name, the data source, the data field label and the data format support fuzzy search, and other fields do not support fuzzy search.
The data set modification operation may include:
and (3) entering the name of the modified data set and the data organization field in the page of the data set to be modified, wherein the fields of the two items are non-null. The selectable value of the data organization is all data organizations in the current system, the selectable value of the data field label is all data fields in the current system, and the selectable value of the data format is pre-bottoming. The data field label and the data format support a plurality of labels, and a selection box is added by clicking an 'add label' button. When the new adding action is executed, the selecting box with the value of 'please select' is regarded as invalid, if the values of a plurality of selecting boxes are equal, the duplication is removed, and only one of the selecting boxes is regarded as valid. And clicking a 'confirm modification' button to pop up a dialog box to prompt new addition success or failure information. If the new adding is successful, recording the operation information in the operation record, updating the system modification time field, and clicking the 'confirm' button to jump to a 'data set retrieval' page; and if the new adding fails, clicking a 'confirm' button to close the dialog box and staying at the current page. The "delete" button functions in the same way. The delete action is a logical delete.
In an embodiment, please refer to fig. 1 and fig. 2, fig. 2 is a partial flow diagram of a data query method in an embodiment. After the step S102 in fig. 1 of identifying the data organization and the data field corresponding to the target data according to the query keyword, the method may further include:
step S201, determining the limit condition of the retrieval request according to the identified data organization and data field;
step S202, checking the search keyword according to the limiting condition;
step S203, if the verification is passed, continuing to execute the data set corresponding to the target data inquired according to the identified data organization and data field.
The limitation condition of the data organization and the data field is a condition for determining whether the retrieval request satisfies the retrieval requirement according to the data organization and the data field. The data query platform may determine whether the current search request is legal according to the identified data organization and data field, if the search keyword is missing, the corresponding data organization or data field cannot be located, and if the data organization and data field corresponding to the search keyword are queriable, the search keyword is verified, and if the search keyword passes the verification, the corresponding data set may be continuously obtained, that is, step S103 in fig. 1 is executed.
Optionally, after the step S202 of checking the search keyword according to the limiting condition, the method may further include: and step S204, if the verification fails, generating an error message and outputting the error message.
And for the search key words which fail to pass the verification, the data query platform outputs an error message to the search request. The error message is used for reminding a user that the data query is wrong, can be directly displayed on the output end (such as a display screen) of the platform, and can also be sent to the terminal of the user.
In this embodiment, after identifying the corresponding data organization and data field for the received retrieval request, the data query platform may first determine whether the retrieval request is legal according to the corresponding constraint condition, and for an illegal request, end the query process and output an error message, so as to save the data calculation amount of the platform and improve the operating speed of the platform.
In an embodiment, with continuing reference to fig. 1, the step S103 in fig. 1 querying the data set corresponding to the target data according to the identified data organization and data field may include: and querying a data set corresponding to the query key by using a search server by taking the identified data organization and data field as the query key, wherein the search server comprises the corresponding relation between the query key and the data set.
The query operation of the data query platform can be realized by a search server, namely a search engine, and the search server executes query through key-value matching relation. In the invention, the data organization and the data field are taken as query keys, the data set is taken as a value corresponding to the query key, and the data set corresponding to the data organization and the data field is obtained through a search server.
The method comprises the steps of integrating a search server in a data query platform, realizing the storage, retrieval and use problems of a large number of open data sets and industry information, and providing high-quality and high-efficiency open data resources and services for the society.
Optionally, the data query platform may also query the data set corresponding to the search keyword by using the search server. I.e. by the search server, in the data entered by the platform, the corresponding data set is queried.
In the embodiment, the relation among the data organization, the data field and the data set is established through the search server, and the data query operation is realized according to the characteristics of full-text search, accurate search and the like of the search server.
In one embodiment, please refer to fig. 3, fig. 3 is a partial flow diagram illustrating a data query method in another embodiment, and the method may further include:
step S301, receiving a data set to be input;
step S302, identifying data organization and data field corresponding to the data set to be input;
step S303, inputting the data set to be input according to the identified data organization and data field, and generating an access link of the data set to be input.
The data query platform can also perform data entry, data is entered by taking a data set as a unit, a data set is newly added in the platform, data organization and a data field corresponding to the data set are identified, a corresponding relation among the three logic main bodies is established, and the three logic main bodies are stored in the data query platform to complete the data entry. After completing the data entry, an access link may also be generated to access the data set for access by a user of the data query.
Optionally, the entered data may be information crawled from a website, and the data query platform further includes an information management function. A background manager of the platform can perform information entry management and full-text information retrieval, and data entry and data retrieval processes are realized.
In this embodiment, the data query platform further includes a data entry process, which provides a data source for data query, and establishes a data entry logic of the platform corresponding to the data query process.
In an embodiment, with continuing reference to fig. 3, after the step S302 in fig. 3 of identifying the data organization and the data field corresponding to the data set to be entered, the method may further include: carrying out validity check on the data set to be input; and if the verification is passed, continuing to execute the data set to be recorded according to the identified data organization and data field.
Optionally, after the data set to be entered is entered according to the identified data organization and data field in step S303, the method further includes: and if the logging is not successful, executing rollback operation and outputting a logging failure message.
When data is recorded, validity check needs to be performed on the recorded data set, for example, whether the recorded data set meets the condition of a newly added data set is checked, and the content of the data to be recorded can also be checked to ensure that the recorded data meets the recording condition. And outputting a message of failed logging for different data to be logged for validity check. And rolling back the data in the platform to the state before the entry.
Optionally, the recorded data can be written into a MySQ L database, and after the data is successfully written, the data is transmitted into an ElasticSearch system for structured storage.
In this embodiment, when data is entered into the platform, the validity of the data to be entered may be checked first, so as to ensure the validity and queryability of the data entered into the platform.
In one embodiment, with continuing reference to fig. 3, the step S303 of entering the data set to be entered according to the identified data organization and data domain includes: and inputting the data set to be input through a stream processing platform, wherein the stream processing platform is used for executing distributed data input.
The stream processing platform can execute distributed stream processing functions, and can be a Kafka stream processing platform and the like. The distributed data entry operation is executed, the data of a plurality of open websites and open data resource websites can be crawled regularly by the front end, the timeliness of data updating is greatly improved, and the work of manual operation and data maintenance is reduced.
In the embodiment, the advanced distributed open source system architecture is adopted to improve the data management efficiency, and the multi-layer data set management system is convenient for data query.
Fig. 4 is a schematic structural diagram of a data query device according to an embodiment of the present invention. Those skilled in the art understand that the data query apparatus of the present embodiment can be used to implement the method technical solutions in the embodiments shown in fig. 1 to fig. 3. The device may specifically include:
a request receiving module 401, configured to receive a search request, and extract a search keyword from the search request;
an identifying module 402, configured to identify, according to the search keyword, a data organization and a data field corresponding to target data to be acquired, where the data organization is used to represent an attribute of the target data, and the data field is used to represent a field to which the target data belongs;
a data set query module 403, configured to query a data set corresponding to the target data according to the identified data organization and data field;
a result feedback module 404, configured to obtain an access link of the data set, and output the access link, where the access link is used to access the data set to obtain the target data from the data set.
In one embodiment, the data query apparatus further includes:
a retrieval current determination module for determining a restriction condition of the retrieval request according to the identified data organization and data domain;
and the verification passing module is used for verifying the retrieval key words according to the limiting conditions, and if the verification passes, continuing to execute the query of the data set corresponding to the target data according to the identified data organization and the data field.
In one embodiment, the data query apparatus further includes:
and the verification failure module is used for generating an error message and outputting the error message if the verification fails.
In an embodiment, please continue to refer to fig. 4, the identifying module 402 is further configured to query the data set corresponding to the query key by using a search server with the identified data organization and data field as the query key, where the search server includes a corresponding relationship between the query key and the data set.
In one embodiment, the data query apparatus further includes:
and the data set query module is used for querying the data set corresponding to the retrieval keyword by using the search server.
In one embodiment, the data query apparatus further includes:
the recording initiating module is used for receiving a data set to be recorded;
the input identification module is used for identifying the data organization and the data field corresponding to the data set to be input;
and the entry module is used for entering the data set to be entered according to the identified data organization and the data field and generating an access link of the data set to be entered.
In one embodiment, the data query apparatus further includes:
the input checking module is used for checking the legality of the data set to be input;
and the continuous execution module is used for continuously executing the data set to be recorded according to the identified data organization and the data field if the verification passes.
In one embodiment, the logging module is further configured to log the data set to be logged through a stream processing platform, where the stream processing platform is configured to perform distributed data logging.
In one embodiment, the data query apparatus further includes:
and the logging failure processing module is used for executing rollback operation and outputting a logging failure message if the logging is unsuccessful.
For more details about the operation principle and the operation mode of the data query apparatus, reference may be made to the related descriptions in fig. 1 to fig. 3, which are not described herein again.
Further, the embodiment of the present invention also discloses a computer device, which includes a memory and a processor, where the memory stores computer instructions capable of running on the processor, and the processor executes the technical solutions of the methods in the embodiments shown in fig. 1 to fig. 3 when executing the computer instructions.
Further, the embodiment of the present invention also discloses a storage medium, on which computer instructions are stored, and when the computer instructions are executed, the technical solutions of the methods in the embodiments shown in fig. 1 to fig. 3 are executed. Preferably, the storage medium may include a computer-readable storage medium such as a non-volatile (non-volatile) memory or a non-transitory (non-transient) memory. The storage medium may include ROM, RAM, magnetic or optical disks, and the like.
Although the present invention is disclosed above, the present invention is not limited thereto. Various changes and modifications may be effected therein by one skilled in the art without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (12)

1. A method for data query, the method comprising:
receiving a retrieval request, and extracting a retrieval keyword from the retrieval request;
identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field of the target data;
inquiring a data set corresponding to the target data according to the identified data organization and the data field;
and obtaining an access link of the data set, and outputting the access link, wherein the access link is used for accessing the data set to obtain the target data from the data set.
2. The method according to claim 1, wherein after identifying the data organization and the data field corresponding to the target data according to the query keyword, the method further comprises:
determining the limit condition of the retrieval request according to the identified data organization and data field;
and checking the retrieval key words according to the limiting conditions, and if the checking is passed, continuously executing the data set corresponding to the target data inquired according to the identified data organization and data field.
3. The method according to claim 2, wherein after the checking the search keyword according to the restriction condition, the method further comprises:
and if the verification fails, generating an error message and outputting the error message.
4. The method of claim 1, wherein the querying the data set corresponding to the target data according to the identified data organization and data domain comprises:
and querying a data set corresponding to the query key by using a search server by taking the identified data organization and data field as the query key, wherein the search server comprises the corresponding relation between the query key and the data set.
5. The method of claim 4, further comprising:
and querying a data set corresponding to the retrieval keyword by using the search server.
6. The method of claim 1, further comprising:
receiving a data set to be input;
identifying data organization and data fields corresponding to the data set to be input;
and inputting the data set to be input according to the identified data organization and the data field, and generating an access link of the data set to be input.
7. The method according to claim 6, wherein after identifying the data organization and the data domain corresponding to the data set to be entered, the method further comprises:
carrying out validity check on the data set to be input;
and if the verification is passed, continuing to execute the data set to be recorded according to the identified data organization and data field.
8. The method according to claim 6, wherein said entering the data set to be entered according to the identified data organization and data realm comprises:
and inputting the data set to be input through a stream processing platform, wherein the stream processing platform is used for executing distributed data input.
9. The method according to claim 7, wherein after entering the data set to be entered according to the identified data organization and data domain, further comprising:
and if the logging is not successful, executing rollback operation and outputting a logging failure message.
10. A data query apparatus, characterized in that the apparatus comprises:
the request receiving module is used for receiving a retrieval request and extracting a retrieval keyword from the retrieval request;
the identification module is used for identifying a data organization and a data field corresponding to target data to be acquired according to the retrieval key words, wherein the data organization is used for representing the attribute of the target data, and the data field is used for representing the field to which the target data belongs;
the data set query module is used for querying a data set corresponding to the target data according to the identified data organization and the data field;
and the result feedback module is used for acquiring an access link of the data set and outputting the access link, and the access link is used for accessing the data set to acquire the target data from the data set.
11. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor implements the steps of the method of any one of claims 1 to 9 when executing the computer program.
12. A storage medium having a computer program stored thereon, the computer program, when being executed by a processor, realizing the steps of the method of any one of claims 1 to 9.
CN202010152795.3A 2020-03-06 2020-03-06 Data query method and device, computer equipment and storage medium Withdrawn CN111400556A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010152795.3A CN111400556A (en) 2020-03-06 2020-03-06 Data query method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010152795.3A CN111400556A (en) 2020-03-06 2020-03-06 Data query method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111400556A true CN111400556A (en) 2020-07-10

Family

ID=71434131

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010152795.3A Withdrawn CN111400556A (en) 2020-03-06 2020-03-06 Data query method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111400556A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101364223A (en) * 2007-06-25 2009-02-11 法国电信公司 Method and system for providing navigable search results
CN107729336A (en) * 2016-08-11 2018-02-23 阿里巴巴集团控股有限公司 Data processing method, equipment and system
WO2018156551A1 (en) * 2017-02-22 2018-08-30 Data.World, Inc. Platform management of integrated access datasets utilizing federated query generation and schema rewriting optimization
CN110321408A (en) * 2019-05-30 2019-10-11 重庆金融资产交易所有限责任公司 Searching method, device, computer equipment and the storage medium of knowledge based map

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101364223A (en) * 2007-06-25 2009-02-11 法国电信公司 Method and system for providing navigable search results
CN107729336A (en) * 2016-08-11 2018-02-23 阿里巴巴集团控股有限公司 Data processing method, equipment and system
WO2018156551A1 (en) * 2017-02-22 2018-08-30 Data.World, Inc. Platform management of integrated access datasets utilizing federated query generation and schema rewriting optimization
CN110321408A (en) * 2019-05-30 2019-10-11 重庆金融资产交易所有限责任公司 Searching method, device, computer equipment and the storage medium of knowledge based map

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李益民: "一种基于关键词的大规模Deep Web信息检索***", 《图书情报工作》 *

Similar Documents

Publication Publication Date Title
US11188537B2 (en) Data processing
US8914414B2 (en) Integrated repository of structured and unstructured data
US9009201B2 (en) Extended database search
CN108509547B (en) Information management method, information management system and electronic equipment
US9122769B2 (en) Method and system for processing information of a stream of information
US9753977B2 (en) Method and system for managing database
US20050240570A1 (en) Partial query caching
US8027969B2 (en) Efficient calculation of sets of distinct results in an information retrieval service
WO2022019973A1 (en) Enterprise knowledge graphs using enterprise named entity recognition
US11573967B2 (en) Enterprise knowledge graphs using multiple toolkits
US20210357373A1 (en) Efficient indexing for querying arrays in databases
CN111191153A (en) Information technology consultation service display device
US9984108B2 (en) Database joins using uncertain criteria
WO2020192663A1 (en) Data management method and related device
US8745008B2 (en) Propagating per-custodian preservation and collection requests between ediscovery management applications and content archives
CN116414854A (en) Data asset query method, device, computer equipment and storage medium
US11847121B2 (en) Compound predicate query statement transformation
CN111400556A (en) Data query method and device, computer equipment and storage medium
US20100205197A1 (en) Two-valued logic database management system with support for missing information
US20160019204A1 (en) Matching large sets of words
KR101083425B1 (en) Database detecting system and detecting method using the same
US11675751B2 (en) Systems and methods for capturing data schema for databases during data insertion
US11954223B2 (en) Data record search with field level user access control
US11250010B2 (en) Data access generation providing enhanced search models
JP3498926B2 (en) Document database management system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20200710