CN113656530A - Intelligent storage method, system and storage medium for big data financial information - Google Patents

Intelligent storage method, system and storage medium for big data financial information Download PDF

Info

Publication number
CN113656530A
CN113656530A CN202110902557.4A CN202110902557A CN113656530A CN 113656530 A CN113656530 A CN 113656530A CN 202110902557 A CN202110902557 A CN 202110902557A CN 113656530 A CN113656530 A CN 113656530A
Authority
CN
China
Prior art keywords
chat
influence
degree
historical
records
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110902557.4A
Other languages
Chinese (zh)
Inventor
不公告发明人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiang Zhenghao
Original Assignee
Jiang Zhenghao
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiang Zhenghao filed Critical Jiang Zhenghao
Priority to CN202110902557.4A priority Critical patent/CN113656530A/en
Publication of CN113656530A publication Critical patent/CN113656530A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a big data financial information intelligent storage method, a system and a storage medium, wherein the method comprises the steps of obtaining the influence degree of historical chatting records of users in a storage block and the important index of the historical chatting records; and updating the storage position of the historical chat record based on the influence degree and the importance index. And updating the storage position of the historical chat records based on the influence degree of the historical chat records of the users in the storage block and the importance indexes of the historical chat records. The user can conveniently inquire the chat records in the corresponding storage blocks according to the importance indexes of the historical chat records. In addition, the system can directly transfer the information in the storage block according to the importance index of the information, thereby reducing the times of traversing data by the system, improving the load capacity of the system and increasing the running speed of the system.

Description

Intelligent storage method, system and storage medium for big data financial information
Technical Field
The invention relates to the technical field of information security, in particular to a big data financial information intelligent storage method, a big data financial information intelligent storage system and a big data financial information intelligent storage medium.
Background
The financial field relates to banking, insurance and security businesses, which need to communicate with people in a large amount and have various data of customers. For this reason, it is necessary to efficiently manage and store information and materials of users.
Currently, there are two ways to store user information: one is to store the user information in a form of converting the user information into electronic information by an electronic storage medium such as an electronic computer, and the other is to record the user information in a paper material for archiving. The paper archive method is easy to lose user information. The manner of electronic archiving is secure relative to the manner of paper archiving. On the one hand, in the big data era, the user information is extremely large in volume, and the electronic archive also needs extremely large storage space to accommodate the user information. However, the storage space of the electronic storage medium is limited, and it cannot satisfy the requirement of storing all the user information with huge data volume. At present, aiming at the condition of insufficient storage space, a means is adopted to delete part of user information at fixed time, but the information which is important for the user is easy to delete directly. On the other hand, when the information storage amount is too large, and a user needs to find certain information, the system resources needing to be mobilized are large, the system load is large, and the performance of the system is directly influenced.
Disclosure of Invention
The invention aims to provide a method, a system and a storage medium for intelligently storing big data financial information, which are used for solving the existing problems.
In a first aspect, an embodiment of the present invention provides a big data financial information intelligent storage method, including:
obtaining the influence degree of the historical chat records of the users in the storage block and the importance index of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content;
and updating the storage position of the historical chat record based on the influence degree and the importance index.
Optionally, the obtaining the influence of the historical chat records of the users in the storage block includes:
determining a time period from the moment of storing the historical chat records in the storage block to the current moment as an investigation time period;
obtaining a chat log of a user and an opposite user in a research time period; the chat log comprises a plurality of chat records, and each chat record is information of each speaking of the user or the opposite user in a dialog box; each chat record comprises a speaking object, speaking time and speaking content;
obtaining keywords of each chat record in the chat log and keywords of the historical chat records; the keywords can represent behavioral intents of the chat records;
obtaining the association degree between the keywords of the chat records and the keywords of the historical chat records; the relevance degree represents the relevance degree between the events mentioned by the historical chat records and the events mentioned by the chat records; each chat record and the historical chat records have one degree of association, and a plurality of chat records correspond to a plurality of degrees of association;
determining the chat records with the association degree larger than a preset value as associated chat records;
and taking the number of the associated chat records as the influence degree of the historical chat records.
Optionally, the updating the storage location of the historical chat record based on the influence degree and the importance index includes:
predicting, based on the degree of influence and the chat log, a subsequent degree of influence of the event stated by the historical chat log after the current moment; the subsequent degree of influence represents a degree of influence of an event stated in the historical chat log on the user after a current time;
and updating the storage position of the historical chat record based on the subsequent influence degree and the importance index.
Optionally, predicting a subsequent influence of the event stated by the historic chat record after the current time based on the influence and the chat log comprises:
obtaining the number of the chat records in the chat log;
taking the quotient of the influence degree and the number as an influence factor;
and taking the product of the influence factor and the influence degree as the predicted subsequent influence degree.
Optionally, the updating the storage location of the historical chat record based on the subsequent influence degree and the importance index includes:
taking the quotient of the subsequent influence degree and the number of the chat records in the chat log as an adjusting factor;
taking the product of the adjustment factor and the importance index as a new importance index of the historical chat record;
and storing the historical chat records on the storage blocks corresponding to the new importance indexes.
In a second aspect, an embodiment of the present invention provides an intelligent storage system for big data financial information, where the system includes:
the obtaining module is used for obtaining the influence degree of the historical chat records of the users in the storage block and the importance indexes of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content;
and the updating module is used for updating the storage position of the historical chat record based on the influence degree and the importance index.
Optionally, the obtaining the influence of the historical chat records of the users in the storage block includes:
determining a time period from the moment of storing the historical chat records in the storage block to the current moment as an investigation time period;
obtaining a chat log of a user and an opposite user in a research time period; the chat log comprises a plurality of chat records, and each chat record is information of each speaking of the user or the opposite user in a dialog box; each chat record comprises a speaking object, speaking time and speaking content;
obtaining keywords of each chat record in the chat log and keywords of the historical chat records; the keywords can represent behavioral intents of the chat records;
obtaining the association degree between the keywords of the chat records and the keywords of the historical chat records; the relevance degree represents the relevance degree between the events mentioned by the historical chat records and the events mentioned by the chat records; each chat record and the historical chat records have one degree of association, and a plurality of chat records correspond to a plurality of degrees of association;
determining the chat records with the association degree larger than a preset value as associated chat records;
and taking the number of the associated chat records as the influence degree of the historical chat records.
Optionally, the updating the storage location of the historical chat record based on the influence degree and the importance index includes:
predicting, based on the degree of influence and the chat log, a subsequent degree of influence of the event stated by the historical chat log after the current moment; the subsequent degree of influence represents a degree of influence of an event stated in the historical chat log on the user after a current time;
and updating the storage position of the historical chat record based on the subsequent influence degree and the importance index.
Optionally, predicting a subsequent influence of the event stated by the historic chat record after the current time based on the influence and the chat log comprises:
obtaining the number of the chat records in the chat log;
taking the quotient of the influence degree and the number as an influence factor;
and taking the product of the influence factor and the influence degree as the predicted subsequent influence degree.
In a third aspect, an embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program is configured to, when executed by a processor, implement the steps of any one of the methods described above and store the historical chat records.
Compared with the prior art, the invention has the following beneficial effects:
the embodiment of the invention provides a method, a system and a storage medium for intelligently storing big data financial information, wherein the method comprises the steps of obtaining the influence degree of historical chat records of users in a storage block and the importance index of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content; and updating the storage position of the historical chat record based on the influence degree and the importance index. Based on the influence degree of the historical chat records of the users in the storage block and the importance indexes of the historical chat records, the storage positions of the historical chat records are updated, unimportant historical chat records are stored in unimportant blocks, important chat records are stored in important storage blocks, and the important chat records need to be deleted, so that the chat records (information) of the users can be effectively managed, and meanwhile, the users can conveniently inquire the chat records in the corresponding storage blocks according to the importance indexes of the historical chat records. In addition, the system can directly transfer the information in the storage block according to the importance index of the information, thereby reducing the times of traversing data by the system, improving the load capacity of the system and increasing the running speed of the system.
Drawings
Fig. 1 is a flowchart of a big data financial information intelligent storage method according to an embodiment of the present invention.
Fig. 2 is a schematic block structure diagram of an electronic device according to an embodiment of the present invention.
The labels in the figure are: a bus 500; a receiver 501; a processor 502; a transmitter 503; a memory 504; a bus interface 505.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings.
Examples
The embodiment of the invention provides an intelligent storage method of big data financial information, as shown in fig. 1, the method comprises the following steps:
s101: and obtaining the influence degree of the historical chat records of the users in the storage block and the importance index of the historical chat records.
Wherein the influence level is indicative of a degree of influence of an event stated in the historical chat log on the user at the present time; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record includes the subject of the utterance, the time of the utterance, and the content of the utterance.
S102: and updating the storage position of the historical chat record based on the influence degree and the importance index.
By adopting the scheme, the storage position of the historical chat records is updated based on the influence degree of the historical chat records of the user in the storage block and the importance indexes of the historical chat records, the unimportant historical chat records are stored in the unimportant block, the important chat records are stored in the important storage block, and the important chat records need to be deleted, so that the chat records (information) of the user can be effectively managed, and meanwhile, the user can conveniently inquire the chat records in the corresponding storage block according to the importance indexes of the historical chat records. In addition, the system can directly transfer the information in the storage block according to the importance index of the information, thereby reducing the times of traversing data by the system, improving the load capacity of the system and increasing the running speed of the system.
Optionally, the obtaining the influence of the historical chat records of the users in the storage block includes:
determining a time period from the moment of storing the historical chat records in the storage block to the current moment as an investigation time period; the investigation period may be one hour, two hours, half a day, one week, half a month, one month, three months, half a year, one year, two years, etc. For example, if the time of storing the history chat records in the storage block is 8/4/12: 00 in 2021, and the current time is 8/5/12: 00 in 2021, the time of day between 12:00 in 8/4/2021 and 12:00 in 8/5/2021 is considered.
Obtaining a chat log of a user and an opposite user in a research time period; the chat log comprises a plurality of chat records, and each chat record is information of each speaking of the user or the opposite user in a dialog box; each chat record includes the subject of the utterance, the time of the utterance, and the content of the utterance. For example, one chat log is "user a: what 2021 year, 8 month, 6 day, 11:00 eaten today
User B (opposite user): how to make dumplings in 2021, 8 months, 6 days, 11:01
The user A: yesterday eating dumpling, today do not want to eat dumpling, 2021 year 8 month 6 day 11:03
And a user B: how to eat the noodles in 2021, 8 months, 6 days 11:04
The user A: may be prepared by the following steps. 8/6/11/05 in 2021 "
Then, there are 5 chat records "user a: what 2021 year, 8 month, 6 days 11:00 "" user B (opposite user) eat today: how the dumplings are in 2021, 8 months, 6 days 11:01 ": yesterday eat the dumpling, today do not want to eat the dumpling 8 months 6 days 11:03 "" user B: how to eat the noodles in 2021, 8 months, 6 days 11:04 ": may be prepared by the following steps. 8/6/11/05 "in 2021. The chat objects are user A and user B.
Obtaining keywords of each chat record in the chat log and keywords of the historical chat records; the keywords can represent behavioral intents of the chat records; optionally, the keywords of the chat records are extracted through a three-layer bayesian probability model (LDA). Keywords of the historical chat records are extracted through a three-layer Bayesian probability model (LDA).
And obtaining the association degree between the keywords of the chat records and the keywords of the historical chat records. The relevance degree represents the relevance degree between the events mentioned by the historical chat records and the events mentioned by the chat records; and an association degree exists between each chat record and the historical chat records, and a plurality of chat records correspond to a plurality of association degrees. Wherein, the obtaining mode of the association degree is as follows: converting the keywords of the chat records into chat word vectors, converting the keywords of the historical chat records into historical word vectors, and taking cosine values between included angles of the chat word vectors and the historical word vectors as the relevancy.
Determining the chat records with the association degree larger than a preset value as associated chat records; the value of the preset value may be 0.5.
And taking the number of the associated chat records as the influence degree of the historical chat records.
For example, if there are 5 associated chat logs, the influence of the historical chat log is 5.
Optionally, the updating the storage location of the historical chat record based on the influence degree and the importance index includes:
predicting, based on the degree of influence and the chat log, a subsequent degree of influence of the event stated by the historical chat log after the current moment; the subsequent degree of influence represents a degree of influence of an event stated in the historical chat log on the user after a current time;
and updating the storage position of the historical chat record based on the subsequent influence degree and the importance index.
Optionally, predicting a subsequent influence of the event stated by the historic chat record after the current time based on the influence and the chat log comprises:
obtaining the number of the chat records in the chat log;
taking the quotient of the influence degree and the number of the chat records in the chat log as an influence factor;
and taking the product of the influence factor and the influence degree as the predicted subsequent influence degree.
Optionally, the updating the storage location of the historical chat record based on the subsequent influence degree and the importance index includes:
taking the quotient of the subsequent influence degree and the number of the chat records in the chat log as an adjusting factor; for example, the subsequent influence degree is h, the number of chat records in the chat log is n, and the adjustment factor r = h/n.
Taking the product of the adjustment factor and the importance index as a new importance index of the historical chat record;
and storing the historical chat records on the storage blocks corresponding to the new importance indexes.
Assuming that the original significance index is 1 and the new significance index is 2, the historical chat records are moved from the storage block with the significance index of 1 to the storage block with the significance index of 2 for storage. Therefore, the intelligent and automatic effective management of the user information is realized, and the system performance is improved by colleagues.
As an alternative embodiment, the method further comprises: before the historical chat records are stored in the storage blocks, the importance indexes of the historical chat records are predicted, the historical chat records are stored in the storage blocks corresponding to the importance indexes, the mode of storing the historical chat records is the same as the mode of storing the chat records, and the prediction method of the importance indexes of the historical chat records is the same as the method of obtaining the chat records. Specifically, the method for obtaining the importance index of the chat record and storing the chat record in the storage block includes:
extracting keywords of the chat records through a three-layer Bayesian probability model; the keywords can represent behavioral intents of the chat records;
identifying the importance index of the chat records according to the keywords of the chat records and the keywords of the front and back chat records of the chat records; the keywords of the front and back chat records comprise keywords of a prepositive phrase section formed by the first N chat records of the chat records and keywords of a postive phrase section formed by the last M chat records of the chat records; n, M is 0 or a positive integer; the importance index represents a predicted degree of subsequent impact of the event stated by the chat log on the user;
and storing the chat records in a block corresponding to the importance index.
It should be noted that, when storing the chat records for the first time, before storing the chat records in the block corresponding to the importance index, the method further includes: the storage space is partitioned to obtain a plurality of blocks, and the importance index of each block is determined. In the embodiment of the present invention, the storage space may be divided into 3 blocks, and the significance indexes of the 3 blocks are 1, 2, and 3, respectively. The greater the importance index, the more important the historical chat history (chat history) stored for that tile, and the greater the subsequent impact on the user.
Before identifying the importance index of the chat log according to the keywords of the chat log and the keywords of the preceding and following chat logs, the method further comprises: and obtaining keywords of a prepositive phrase section formed by the first N chat records of the chat records, and obtaining keywords of a postive phrase section formed by the last M chat records of the chat records. The keywords of the prefix segment formed by the first N chat records of the chat records can be obtained through a three-layer bayesian probability model (LDA), and the keywords of the suffix segment formed by the last M chat records of the chat records can be obtained through a three-layer bayesian probability model (LDA).
Optionally, the obtaining of the keyword of the prefix segment formed by the first N chat records of the chat records includes:
obtaining the speaking contents of the first N chat records of the chat records, and connecting the speaking contents end to end according to the sequence of speaking time to form the preposed speech segment;
extracting keywords of the preposed language segments based on the three-layer Bayesian probability model;
obtaining keywords of a postscript section formed by the last M chat records of the chat records, including:
obtaining the speaking contents of the last M chat records of the chat records, and connecting the speaking contents end to end according to the sequence of speaking time to form the post-positioned speech segment;
and extracting the keywords of the postword section based on the three-layer Bayesian probability model.
Taking the above example as an example, the chat log is "user a: what 2021 year, 8 month, 6 day, 11:00 eaten today
User B (opposite user): how to make dumplings in 2021, 8 months, 6 days, 11:01
The user A: yesterday eating dumpling, today do not want to eat dumpling, 2021 year 8 month 6 day 11:03
And a user B: how to eat the noodles in 2021, 8 months, 6 days 11:04
The user A: may be prepared by the following steps. 8/6/11/05 in 2021 "
Then, there are 5 chat records "user a: what 2021 year, 8 month, 6 days 11:00 "" user B (opposite user) eat today: how the dumplings are in 2021, 8 months, 6 days 11:01 ": yesterday eat the dumpling, today do not want to eat the dumpling 8 months 6 days 11:03 "" user B: how to eat the noodles in 2021, 8 months, 6 days 11:04 ": may be prepared by the following steps. 8/6/11/05 "in 2021. The chat objects are user A and user B.
Chat record "user a: yesterday eating a dumpling, today the first N (N = 2) chat records that did not want to eat a dumpling by 2021 year 8, 6, 11:03 "include" user a: what 2021 year, 8 month, 6 day, 11:00 eaten today
User B (opposite user): how to make dumplings is 11:01 at 8 months and 6 days in 2021.
The last M (M = 2) chat records include "user B: how to eat the noodles in 2021, 8 months, 6 days 11:04
The user A: may be prepared by the following steps. 8/6/11/05 "in 2021.
The front language section is how to eat dumplings today, and the rear language section is how to eat noodles. ".
Optionally, the determining the importance index of the chat record according to the keyword of the chat record and the keywords of the preceding and following chat records of the chat record includes:
connecting the keywords of the chat records and the keywords of the front and back chat records of the chat records end to form a keyword entry; for example, the keywords of the chat log include "5 yuan" and "unlawful", and the keywords of the chat log before and after the chat log include "book" and "how much money". According to the speaking time of the keywords in the chat records, the keyword entries obtained by connecting the keywords end to end are 'how much money 5 yuan is not available from the beginning'.
Forming a key phrase by the key words of the chat records and the key words of the front and back chat records of the chat records; the keyword group comprises a plurality of keywords; according to the above examples, the keyword group includes keywords such as "dumpling", "do not want to eat" and "noodle".
Obtaining a first correlation index between each keyword in the keyword group and a plurality of standard keywords in a big database; it should be noted that the standard keywords are stored in the big database by the user in advance according to the individual speech habits, or are the standard keywords trained by the user according to the big data technology. The obtaining mode of the first correlation index between each keyword in the keyword group and a plurality of standard keywords in the big database is as follows: the plurality of keywords correspond to the plurality of first correlation indexes. The size of the first correlation index characterizes the size of similarity of the keyword to the standard keyword; each standard keyword in the large database corresponds to an importance index; the standard keywords are keywords which are confirmed by a user in advance to be stored in a large database.
Obtaining a first correlation index between each keyword in the keyword set and a plurality of standard keywords in a big database comprises:
the keywords are converted into keyword vectors, and the standard keywords are converted into keyword vectors.
Obtaining a cosine value of an included angle between the keyword vector and the standard keyword vector; each keyword vector corresponds to a plurality of standard keyword vectors, and each keyword vector corresponds to a plurality of cosine values; aiming at each keyword vector, obtaining a cosine value mean value and a cosine value variance of a plurality of cosine values corresponding to the keyword vector; if the cosine value variance is greater than a set value, obtaining a maximum value in the cosine values, and taking the quotient of the maximum value and the cosine value mean value and adding the cosine value variance as the first correlation index, for example, if the first correlation index is d1, the maximum value in the cosine values is max, the cosine value mean value is p, and the cosine value variance is t, then the first correlation index d1= (max/p) + t.
The optional set point value is 0.5.
Obtaining a second correlation index between the keyword entry and a plurality of standard keywords in a big database;
if there is only one keyword entry and there are multiple standard keywords, then multiple second correlation indexes will be obtained correspondingly. The magnitude of the second correlation index characterizes the magnitude of similarity of the keyword entry to the standard keyword. The obtaining of the second correlation index between the keyword entry and the plurality of standard keywords in the big database specifically comprises:
and converting the keyword into a term vector, and taking the mean value of cosine values between the term vector and the standard keyword vector as the second correlation index.
And taking the sum of the second correlation index and the first correlation index as the importance index of the chat records.
By adopting the scheme above, the important index of the chat record is identified according to the keywords of the chat record and the keywords of the front and back chat records of the chat record, the chat record is stored in the block corresponding to the important index, so that the important events are intelligently identified and stored in a partition manner, the same important events are stored in one block, a user can conveniently search and manage the chat record, and the effectiveness and convenience of user information storage, namely management are improved.
Thus, when a user needs to find a piece of information, the user can directly find the piece of information (chat log) in the block where the importance index is lost. When the user needs to delete the unimportant information, the data information (chat history) in the block with the low importance index can be directly deleted. In addition, the method also comprises the step of regularly updating the chat records (information) in the storage blocks, namely updating the storage positions of the chat records, namely moving the chat records (information) which are not important any more to the blocks with low importance indexes and moving the chat records (information) which become important to the blocks with high importance indexes. Specifically, the method for updating the storage location of the chat log includes: obtaining the influence degree of the historical chat records of the users in the storage block and the importance index of the historical chat records; it should be noted that the storage location of the historical chat history is updated based on the influence degree and the importance index. It should be noted that after storing the chat history in a block, the chat history becomes the historical chat history of the user.
The embodiment of the application also correspondingly provides an executing main body for executing the steps, and the executing main body can be an intelligent storage system for the big data financial information. Big data financial information intelligence storage system includes:
the obtaining module is used for obtaining the influence degree of the historical chat records of the users in the storage block and the importance indexes of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content;
and the updating module is used for updating the storage position of the historical chat record based on the influence degree and the importance index.
With regard to the system in the above embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
An embodiment of the present invention further provides an electronic device, as shown in fig. 2, including a memory 504, a processor 502, and a computer program stored on the memory 504 and executable on the processor 502, where the processor 502 implements the steps of any one of the above-described intelligent large-data financial information storage methods when executing the program.
Where in fig. 2 a bus architecture (represented by bus 500) is shown, bus 500 may include any number of interconnected buses and bridges, and bus 500 links together various circuits including one or more processors, represented by processor 502, and memory, represented by memory 504. The bus 500 may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. A bus interface 505 provides an interface between the bus 500 and the receiver 501 and transmitter 503. The receiver 501 and the transmitter 503 may be the same element, i.e. a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 502 is responsible for managing the bus 500 and general processing, and the memory 504 may be used for storing data used by the processor 502 in performing operations.
In the embodiment of the invention, the big data financial information intelligent storage system is installed in the robot, and particularly can be stored in a memory in the form of a software functional module and can be processed and run by a processor.
Embodiments of the present invention further provide a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of any one of the above-mentioned big data financial information intelligent storage methods and the above-mentioned historical chat records and chat records.
The algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components in an apparatus according to an embodiment of the invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

Claims (10)

1. The intelligent storage method for the big data financial information is characterized by comprising the following steps:
obtaining the influence degree of the historical chat records of the users in the storage block and the importance index of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content;
and updating the storage position of the historical chat record based on the influence degree and the importance index.
2. The method of claim 1, wherein obtaining the influence of the historical chat history of the users in the storage block comprises:
determining a time period from the moment of storing the historical chat records in the storage block to the current moment as an investigation time period;
obtaining a chat log of a user and an opposite user in a research time period; the chat log comprises a plurality of chat records, and each chat record is information of each speaking of the user or the opposite user in a dialog box; each chat record comprises a speaking object, speaking time and speaking content;
obtaining keywords of each chat record in the chat log and keywords of the historical chat records; the keywords can represent behavioral intents of the chat records;
obtaining the association degree between the keywords of the chat records and the keywords of the historical chat records; the relevance degree represents the relevance degree between the events mentioned by the historical chat records and the events mentioned by the chat records; each chat record and the historical chat records have one degree of association, and a plurality of chat records correspond to a plurality of degrees of association;
determining the chat records with the association degree larger than a preset value as associated chat records;
and taking the number of the associated chat records as the influence degree of the historical chat records.
3. The method of claim 2, wherein updating the storage location of the historical chat log based on the influence level and the importance index comprises:
predicting, based on the degree of influence and the chat log, a subsequent degree of influence of the event stated by the historical chat log after the current moment; the subsequent degree of influence represents a degree of influence of an event stated in the historical chat log on the user after a current time;
and updating the storage position of the historical chat record based on the subsequent influence degree and the importance index.
4. The method of claim 3, wherein predicting a subsequent degree of influence of the event stated by the historic chat log after the current time based on the degree of influence and the chat log comprises:
obtaining the number of the chat records in the chat log;
taking the quotient of the influence degree and the number as an influence factor;
and taking the product of the influence factor and the influence degree as the predicted subsequent influence degree.
5. The method of claim 1, wherein updating the storage location of the historical chat log based on the subsequent influence level and the importance index comprises:
taking the quotient of the subsequent influence degree and the number of the chat records in the chat log as an adjusting factor;
taking the product of the adjustment factor and the importance index as a new importance index of the historical chat record;
and storing the historical chat records on the storage blocks corresponding to the new importance indexes.
6. An intelligent storage system for big data financial information, the system comprising:
the obtaining module is used for obtaining the influence degree of the historical chat records of the users in the storage block and the importance indexes of the historical chat records; the influence degree represents the degree of influence of the events stated in the historical chat records on the user; the importance index represents the degree of influence of the events stated by the historical chat records on the user in the future, which is expected before the historical chat records are stored in the storage block; the chat records are the information of each speaking of the user or the opposite user in the dialog box; each chat record comprises a speaking object, speaking time and speaking content;
and the updating module is used for updating the storage position of the historical chat record based on the influence degree and the importance index.
7. The system of claim 6, wherein obtaining the influence of the historical chat history of the users in the storage block comprises:
determining a time period from the moment of storing the historical chat records in the storage block to the current moment as an investigation time period;
obtaining a chat log of a user and an opposite user in a research time period; the chat log comprises a plurality of chat records, and each chat record is information of each speaking of the user or the opposite user in a dialog box; each chat record comprises a speaking object, speaking time and speaking content;
obtaining keywords of each chat record in the chat log and keywords of the historical chat records; the keywords can represent behavioral intents of the chat records;
obtaining the association degree between the keywords of the chat records and the keywords of the historical chat records; the relevance degree represents the relevance degree between the events mentioned by the historical chat records and the events mentioned by the chat records; each chat record and the historical chat records have one degree of association, and a plurality of chat records correspond to a plurality of degrees of association;
determining the chat records with the association degree larger than a preset value as associated chat records;
and taking the number of the associated chat records as the influence degree of the historical chat records.
8. The system of claim 7, wherein updating the storage location of the historical chat log based on the influence level and the importance index comprises:
predicting, based on the degree of influence and the chat log, a subsequent degree of influence of the event stated by the historical chat log after the current moment; the subsequent degree of influence represents a degree of influence of an event stated in the historical chat log on the user after a current time;
and updating the storage position of the historical chat record based on the subsequent influence degree and the importance index.
9. The system of claim 8, wherein predicting a subsequent degree of influence of the event stated by the historical chat log after the current time based on the degree of influence and the chat log comprises:
obtaining the number of the chat records in the chat log;
taking the quotient of the influence degree and the number as an influence factor;
and taking the product of the influence factor and the influence degree as the predicted subsequent influence degree.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 5 and stores the history chat log.
CN202110902557.4A 2021-08-06 2021-08-06 Intelligent storage method, system and storage medium for big data financial information Pending CN113656530A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110902557.4A CN113656530A (en) 2021-08-06 2021-08-06 Intelligent storage method, system and storage medium for big data financial information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110902557.4A CN113656530A (en) 2021-08-06 2021-08-06 Intelligent storage method, system and storage medium for big data financial information

Publications (1)

Publication Number Publication Date
CN113656530A true CN113656530A (en) 2021-11-16

Family

ID=78478583

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110902557.4A Pending CN113656530A (en) 2021-08-06 2021-08-06 Intelligent storage method, system and storage medium for big data financial information

Country Status (1)

Country Link
CN (1) CN113656530A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100318575A1 (en) * 2009-06-15 2010-12-16 Microsoft Corporation Storage or removal actions based on priority
CN105812231A (en) * 2014-12-29 2016-07-27 阿里巴巴集团控股有限公司 Chatting record fast identification method and device thereof
CN108920675A (en) * 2018-07-09 2018-11-30 北京百悟科技有限公司 A kind of method, apparatus of information processing, computer storage medium and terminal
CN111767396A (en) * 2020-05-18 2020-10-13 北京沃东天骏信息技术有限公司 Data processing method, device, equipment and computer readable storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100318575A1 (en) * 2009-06-15 2010-12-16 Microsoft Corporation Storage or removal actions based on priority
CN105812231A (en) * 2014-12-29 2016-07-27 阿里巴巴集团控股有限公司 Chatting record fast identification method and device thereof
CN108920675A (en) * 2018-07-09 2018-11-30 北京百悟科技有限公司 A kind of method, apparatus of information processing, computer storage medium and terminal
CN111767396A (en) * 2020-05-18 2020-10-13 北京沃东天骏信息技术有限公司 Data processing method, device, equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
US8775410B2 (en) Method for using dual indices to support query expansion, relevance/non-relevance models, blind/relevance feedback and an intelligent search interface
US20180314727A1 (en) Cognitive deduplication-aware data placement in large scale storage systems
US20110179002A1 (en) System and Method for a Vector-Space Search Engine
US20110184981A1 (en) Personalize Search Results for Search Queries with General Implicit Local Intent
CN108255958A (en) Data query method, apparatus and storage medium
US20190138507A1 (en) Data Processing Method and System and Client
US8793246B1 (en) Identifying ranking scores for domains of interest
CN101183379A (en) Attribute level federation from multiple data sources
JP2015537283A (en) Mail indexing and retrieval using a hierarchical cache
US8140538B2 (en) System and method of data caching for compliance storage systems with keyword query based access
BRPI0714938A2 (en) sorting functions using an incrementally updatable modified bayesian query query classifier
US10817515B2 (en) Cognitive data filtering for storage environments
CN110807138B (en) Method and device for determining search object category
CN110968765B (en) Book searching method, computing device and computer storage medium
US11182438B2 (en) Hybrid processing of disjunctive and conjunctive conditions of a search query for a similarity search
CN115516447A (en) Hot news intention identification method, device and equipment and readable storage medium
US20200278989A1 (en) Information processing apparatus and non-transitory computer readable medium
CN112070550A (en) Keyword determination method, device and equipment based on search platform and storage medium
CN112256863A (en) Method and device for determining corpus intentions and electronic equipment
CN115794945A (en) Block chain data storage method and device, readable storage medium and computer equipment
US20160307000A1 (en) Index-side diacritical canonicalization
CN102129454A (en) Method and system for processing encyclopaedia data based on cloud storage
CN113656530A (en) Intelligent storage method, system and storage medium for big data financial information
US10592480B1 (en) Affinity scoring
CN103902687B (en) The generation method and device of a kind of Search Results

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20211116

WD01 Invention patent application deemed withdrawn after publication