CN114817458A - Bid-winning item retrieval method based on funnel model and cosine algorithm - Google Patents

Bid-winning item retrieval method based on funnel model and cosine algorithm Download PDF

Info

Publication number
CN114817458A
CN114817458A CN202210294741.XA CN202210294741A CN114817458A CN 114817458 A CN114817458 A CN 114817458A CN 202210294741 A CN202210294741 A CN 202210294741A CN 114817458 A CN114817458 A CN 114817458A
Authority
CN
China
Prior art keywords
bid
item
query
winning
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210294741.XA
Other languages
Chinese (zh)
Inventor
廖泽丽
陈锋
谢忠任
周剑洪
张毅
赵伟
肖佳杭
赵航翊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Dasicong Information Technology Co ltd
Original Assignee
Chongqing Dasicong Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing Dasicong Information Technology Co ltd filed Critical Chongqing Dasicong Information Technology Co ltd
Priority to CN202210294741.XA priority Critical patent/CN114817458A/en
Publication of CN114817458A publication Critical patent/CN114817458A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/08Auctions

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Computational Linguistics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Engineering & Computer Science (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Databases & Information Systems (AREA)
  • Accounting & Taxation (AREA)
  • Human Resources & Organizations (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • Artificial Intelligence (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a bid-winning project retrieval method based on a funnel model and a cosine algorithm, which can be used for associating scattered data in two links of bid inviting and bid winning by combining with the existing massive project data to obtain a basic life chain of a project, further calculating the floating rate of the bid-winning project according to the associated chain data to help a user to accurately bid, and performing macroscopic analysis or single project analysis on the bid-inviting market according to the floating rate data statistics; data dimensions for items in a project cycle can be measured, such as: the construction scale, the bid inviting range, the qualification requirement, the bid opening time and the like are supplemented, and the project information is perfected.

Description

Bid-winning item retrieval method based on funnel model and cosine algorithm
Technical Field
The invention relates to the field of bid-winning item retrieval, in particular to a bid-winning item retrieval method based on a funnel model and a cosine algorithm.
Background
At present, the industry generally has no perfect project complete cycle data, and the national management target for various bidding data is gradually refined and integrated. The data is very important for bidding practitioners, and can break two largest data islands before project implementation, so that the data of the whole project period can be accurately closed.
Disclosure of Invention
Aiming at the problems, the invention provides a bid-winning item retrieval method based on a funnel model and a cosine algorithm.
The invention is realized by the following technical scheme:
a bid-winning item retrieval method based on a funnel model and a cosine algorithm comprises the following steps:
s1, according to the item name of the bid-winning item, matching the item name of the bid-winning item, and judging the result, if the result is empty, executing the step S2;
s2, carrying out fuzzy matching on the bidding items, carrying out word reduction including query on the word stock after word segmentation and the bidding names of the bidding items, judging results, if the results include that the query returns, executing the step S3, and if the results include that the query does not return, continuing the query from the last word reduction;
s3, judging whether the text of the bidding project contains the owner unit name, if yes, executing the step S4;
s4, judging whether the text of the bidding project contains the subtracted words and sentences in the query, if so, successfully associating, and if not, executing the step S5;
s5, judging whether the text of the deleted words and the bidding items contains the keywords: marking sections and packets, judging whether the item names of the bid items and the item names of the medium items contain the subtracted words and sentences which are not deleted in the query, and if so, executing a step S6;
and S6, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the words and sentences which are not deleted in the query according to the cosine algorithm, judging the result, and extracting the association success result.
Further, step S0 is included, the related bid-closing data to be retrieved is pre-fetched, and includes the item name, the owner unit and the bid-closing time, where the owner unit of the bid-closing item is not empty.
Further, the step S1 of determining whether the owner unit of the winning bid item and the owner unit of the matching result mutually include the following sub-steps:
s101, judging whether the owner unit of the bid-winning project and the owner unit of the matching result mutually contain, if so, executing the step S102, otherwise, indicating that the association is successful;
and S102, judging whether the bid-winning time of the bid-winning item and the bid-winning deadline of the bid-winning item are less than one month or not, if so, executing the step S2, and if not, indicating that the association is successful.
Further, the step S2 specifically includes the following sub-steps:
s201, fuzzy matching is carried out, and keyword processing is carried out on the bidding items and the bidding items;
s202, carrying out word segmentation on the item name of the bid-winning item after the keyword processing;
and S203, carrying out word reduction including query on the bidding names of the word stock and the bidding items after word segmentation processing, judging results, executing the step S3 if the results include the query and return, and continuing the query from the last word reduction if the results include the query and return does not.
Further, the processing of the keywords in step S201 specifically includes the following substeps:
s2011, replacing English brackets in the item name of the winning bid item and the item name of the bidding item with Chinese brackets;
s2012, deleting the keywords in the name of the winning bid item: the name of the project, the number of the project, the non-mark section and the bid item.
Further, the word segmentation processing in step S202 specifically includes: and carrying out IK intelligent word segmentation on the project name of the bid-winning project and the project name of the bid-inviting project after keyword processing.
Further, the step S203 specifically includes the following sub-steps:
s2031, carrying out word reduction including query on the word stock after word segmentation and the bid-seeking name of the bid-seeking item, wherein when the number of the search words is less than 4, the association is failed, and when the number of the search words is more than 4, the step S2032 is executed;
s2032, judging the result containing the query, if the result contains the query and returns, executing the step S3, and if the result contains the query and does not return, continuing the query by subtracting the word from the tail.
Further, in step S5, if the first result is negative, it is determined whether the phrase deleted in the query includes the keyword.
Further, the step S5 of determining whether the subtracted word contains the deleted word or sentence in the query contains the keyword specifically includes the following steps:
s501, judging whether the subtracted words contain the deleted words in the query or not, wherein the deleted words contain the keywords: union, punctuation, supervision, design and general contract, and judges whether the project name of the bidding project contains the subtracted words and sentences which are not deleted in the query, if not, judges whether the subtracted words contain the deleted words and sentences in the query and if so, executes the step S502;
s502, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query through a cosine algorithm, and executing the step S503 when the matching degree is less than 60%; when the matching degree is more than 60%, indicating that the association is successful;
s503, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, executing the step S504 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
s504, whether the text of the bid-seeking item contains the subtracted words and sentences in the query or not is judged, if the result is negative, the association is failed, and if the result is positive, the association is successful.
Further, the step S6 specifically includes the following sub-steps:
s601, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query according to a cosine algorithm, judging the matching degree, executing the step S602 when the matching degree is less than 60%, and indicating that the association is successful when the matching degree is more than 60%;
s602, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, judging the matching degree, executing the step S603 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
and S603, judging whether the text of the bid-calling item contains the subtracted words and sentences in the query, if not, indicating that the association is failed, and if so, indicating that the association is successful.
The invention has the beneficial effects that:
(1) according to the invention, the existing massive project data is combined, scattered data in two links of bid inviting and bid winning can be correlated to obtain a basic life chain of a project, the downward floating rate of the bid inviting project can be further calculated according to the correlated chain data to help a user to accurately bid, and macroscopic analysis or single project analysis can be performed on the bid inviting and bidding market according to the downward floating rate data statistics;
(2) the invention can be used for the data dimensions in a project period, such as: supplementing the construction scale, the bid inviting range, the qualification requirement, the bid opening time and the like, and perfecting project information;
(3) the method can analyze important reference information such as bidding trends of an owner unit and a winning unit through big data;
(4) the invention provides a technical scheme for project full life cycle information management.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a flowchart of a method for searching a bid-winning item based on a funnel model and a cosine algorithm according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of a terminal device of a bid-winning item retrieval method based on a funnel model and a cosine algorithm according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a computer-readable storage medium of a bid-winning item retrieval method based on a funnel model and a cosine algorithm according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to examples and accompanying drawings, and the exemplary embodiments and descriptions thereof are only used for explaining the present invention and are not meant to limit the present invention.
Example 1
As shown in fig. 1, the embodiment provides a bid-winning item retrieval method based on a funnel model and a cosine algorithm, including the following steps:
a bid-winning item retrieval method based on a funnel model and a cosine algorithm comprises the following steps:
s1, according to the item name of the bid-winning item, matching the item name of the bid-winning item, and judging the result, if the result is empty, executing the step S2;
s2, carrying out fuzzy matching on the bidding items, carrying out word reduction including query on the word stock after word segmentation and the bidding names of the bidding items, judging results, if the results include that the query returns, executing the step S3, and if the results include that the query does not return, continuing the query from the last word reduction;
s3, judging whether the text of the bidding project contains the name of the owner unit, if not, failing to associate, if so, executing the step S4;
s4, judging whether the text of the bidding project contains the subtracted words and sentences in the query, if so, successfully associating, and if not, executing the step S5;
s5, judging whether the text of the deleted words and sentences and the bidding items contains the keywords: marking sections and packets, judging whether the item names of the bid items and the item names of the medium items contain the subtracted words and sentences which are not deleted in the query, and if so, executing a step S6;
and S6, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the words and sentences which are not deleted in the query according to the cosine algorithm, judging the result, and extracting the association success result.
Further, step S0 is included, the related bid-closing data to be retrieved is pre-fetched, and includes the item name, the owner unit and the bid-closing time, where the owner unit of the bid-closing item is not empty.
Further, the step S1 of determining whether the owner unit of the winning bid item and the owner unit of the matching result mutually include the following sub-steps:
s101, judging whether the owner unit of the bid-winning project and the owner unit of the matching result mutually contain, if so, executing the step S102, otherwise, indicating that the association is successful;
and S102, judging whether the bid-winning time of the bid-winning item and the bid-winning deadline of the bid-winning item are less than one month or not, if so, executing the step S2, and if not, indicating that the association is successful.
Further, the step S2 specifically includes the following sub-steps:
s201, fuzzy matching is carried out, and keyword processing is carried out on the bidding items and the bidding items;
s202, carrying out word segmentation on the item name of the bid-winning item after the keyword processing;
and S203, carrying out word reduction including query on the word stock after the word segmentation and the bid name of the bid item, judging the result, if the result is that the query is returned, executing the step S3, and if the result is that the query is not returned, continuously querying from the tail word reduction.
Further, the processing of the keywords in step S201 specifically includes the following substeps:
s2011, replacing English brackets in the item name of the winning bid item and the item name of the bidding item with Chinese brackets;
s2012, deleting the keywords in the name of the winning bid item: the name of the project, the number of the project, the non-mark section and the bid item.
Further, the word segmentation processing in step S202 specifically includes: and carrying out IK intelligent word segmentation on the project name of the bid-winning project and the project name of the bid-inviting project after keyword processing.
Further, the step S203 specifically includes the following sub-steps:
s2031, carrying out word reduction including query on the word stock after word segmentation and the bid-seeking name of the bid-seeking item, wherein when the number of the search words is less than 4, the association is failed, and when the number of the search words is more than 4, the step S2032 is executed;
s2032, judging the result containing the query, if the result contains the query and returns, executing the step S3, and if the result contains the query and does not return, continuing the query by subtracting the word from the tail.
Further, in step S5, if the first result is negative, it is determined whether the phrase deleted in the query includes the keyword.
Further, the step S5 of determining whether the subtracted word contains the deleted word or sentence in the query contains the keyword specifically includes the following steps:
s501, judging whether the subtracted words contain the deleted words in the query or not, wherein the deleted words contain the keywords: union, punctuation, supervision, design and general contract, and judges whether the project name of the bidding project contains the subtracted words and sentences which are not deleted in the query, if not, judges whether the subtracted words contain the deleted words and sentences in the query and if so, executes the step S502;
s502, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query through a cosine algorithm, and executing the step S503 when the matching degree is less than 60%; when the matching degree is more than 60%, indicating that the association is successful;
s503, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, executing the step S504 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
s504, whether the text of the bid-seeking item contains the subtracted words and sentences in the query or not is judged, if the result is negative, the association is failed, and if the result is positive, the association is successful.
Further, the step S6 specifically includes the following sub-steps:
s601, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query according to a cosine algorithm, judging the matching degree, executing the step S602 when the matching degree is less than 60%, and indicating that the association is successful when the matching degree is more than 60%;
s602, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, judging the matching degree, executing the step S603 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
and S603, judging whether the text of the bid-calling item contains the subtracted words and sentences in the query, if not, indicating that the association is failed, and if so, indicating that the association is successful.
In this embodiment, the matching degree is gradually adjusted and optimized according to multiple repeated experiments, and there is no specific formula.
Example 2
On the basis of embodiment 1, this embodiment further provides a bid-winning item retrieval system based on a funnel model and a cosine algorithm, including:
the full matching module is used for fully matching the item names of the bid-winning items according to the item names of the bid-winning items;
the fuzzy matching module is used for carrying out fuzzy matching on the bid-winning items, and carrying out word reduction including query on the bid-winning names of the word bank and the bid-winning items after word segmentation processing;
the word reduction containing query judging module is used for judging whether the text of the bid-calling item contains the name of the owner unit and whether the word reduction contains the deleted words and sentences in the query;
and the keyword judgment module is used for judging whether the texts of the deleted words and sentences and the bidding items contain keywords: marking sections and packets, and judging whether the item names of the bid items and the item names of the medium items contain the subtracted words and sentences which are not deleted in the query;
the cosine matching degree calculation module is used for acquiring the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query according to a cosine algorithm;
and the association judgment module is used for judging association success/failure in the full matching module, the fuzzy matching module and the word reduction containing query judgment module, the keyword judgment module and the cosine matching degree calculation module.
Example 3
Referring to fig. 2, on the basis of embodiment 1, this embodiment proposes a terminal device for winning a bid item based on a funnel model and a cosine algorithm, where the terminal device 200 includes at least one memory 210, at least one processor 220, and a bus 230 connecting different platform systems.
The memory 210 may include readable media in the form of volatile memory, such as Random Access Memory (RAM)211 and/or cache memory 212, and may further include Read Only Memory (ROM) 213.
The memory 210 further stores a computer program, and the computer program can be executed by the processor 220, so that the processor 220 executes any one of the above bid-winning item retrieval methods based on the funnel model and the cosine algorithm in this embodiment of the application, and a specific implementation manner thereof is consistent with the implementation manner and the achieved technical effect described in the above embodiment of the method, and details of some contents are not repeated. Memory 210 may also include a program/utility 214 having a set (at least one) of program modules 215, including but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Accordingly, processor 220 may execute the computer programs described above, as well as may execute programs/utilities 214.
Bus 230 may be a local bus representing one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, a processor, or any other type of bus structure.
Terminal device 200 may also communicate with one or more external devices 240, such as a keyboard, pointing device, Bluetooth device, etc., as well as with one or more devices capable of interacting with terminal device 200, and/or with any device (e.g., router, modem, etc.) that enables terminal device 200 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 250. Also, the terminal device 200 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the internet) through the network adapter 260. The network adapter 260 may communicate with other modules of the terminal device 200 via the bus 230. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with terminal device 200, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, RAID systems, tape drives, and data backup storage platforms, to name a few.
Example 4
On the basis of embodiment 1, this embodiment provides a computer-readable storage medium for bid-winning item retrieval based on a funnel model and a cosine algorithm, where the computer-readable storage medium has instructions stored thereon, and the instructions, when executed by a processor, implement any one of the above-mentioned methods for bid-winning item retrieval based on a funnel model and a cosine algorithm. The specific implementation manner is consistent with the implementation manner and the achieved technical effect described in the embodiment of the method, and some contents are not described again.
Fig. 3 shows a program product 300 provided by the present embodiment for implementing the method, which may employ a portable compact disc read only memory (CD-ROM) and include program codes, and may be run on a terminal device, such as a personal computer. However, the program product 300 of the present invention is not limited in this respect, and in this embodiment, the readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. Program product 300 may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable storage medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable storage medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable storage medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing. Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The foregoing shows and describes the general principles and broad features of the present invention and advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (10)

1. A bid-winning item retrieval method based on a funnel model and a cosine algorithm is characterized by comprising the following steps:
s1, according to the item name of the bid-winning item, matching the item name of the bid-winning item, and judging the result, if the result is empty, executing the step S2;
s2, carrying out fuzzy matching on the bidding items, carrying out word reduction including query on the word stock after word segmentation and the bidding names of the bidding items, judging results, if the results include that the query returns, executing the step S3, and if the results include that the query does not return, continuing the query from the last word reduction;
s3, judging whether the text of the bidding project contains the name of the owner unit, if so, executing the step S4;
s4, judging whether the text of the bidding project contains the subtracted words and sentences in the query, if so, successfully associating, and if not, executing the step S5;
s5, judging whether the text of the deleted words and the bidding items contains the keywords: marking sections and packets, judging whether the item names of the bid items and the item names of the medium items contain the subtracted words and sentences which are not deleted in the query, and if so, executing a step S6;
and S6, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the words and sentences which are not deleted in the query according to the cosine algorithm, judging the result, and extracting the association success result.
2. The method for retrieving a bid-closing item based on a funnel model and a cosine algorithm as claimed in claim 1, further comprising step S0, pre-fetching related bid-closing data to be retrieved, which contains item name, owner unit and bid-closing time, wherein the owner unit of the bid-closing item is not empty.
3. The method for searching for a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 1, wherein the step of determining whether the owner unit of the bid-winning item and the owner unit of the matching result mutually comprise the following sub-steps in step S1:
s101, judging whether the owner unit of the bid-winning project and the owner unit of the matching result mutually contain, if so, executing the step S102, otherwise, indicating that the association is successful;
and S102, judging whether the bid-winning time of the bid-winning item and the bid-winning deadline of the bid-winning item are less than one month or not, if so, executing the step S2, and if not, indicating that the association is successful.
4. The method for retrieving a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 1, wherein said step S2 specifically comprises the following sub-steps:
s201, fuzzy matching is carried out, and keyword processing is carried out on the bidding items and the bidding items;
s202, carrying out word segmentation on the item name of the bid-winning item after the keyword processing;
and S203, carrying out word reduction including query on the bidding names of the word stock and the bidding items after word segmentation processing, judging results, executing the step S3 if the results include the query and return, and continuing the query from the last word reduction if the results include the query and return does not.
5. The method for searching for a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 4, wherein the processing of the keyword in step S201 specifically comprises the following substeps:
s2011, replacing English brackets in the item name of the winning bid item and the item name of the bidding item with Chinese brackets;
s2012, deleting the keywords in the name of the winning bid item: the name of the project, the number of the project, the non-mark section and the bid item.
6. The method for searching for a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 4, wherein the specific steps of the word segmentation processing in step S202 are as follows: and carrying out IK intelligent word segmentation on the project name of the bid-winning project and the project name of the bid-inviting project after keyword processing.
7. The method for retrieving a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 4, wherein said step S203 specifically comprises the following substeps:
s2031, carrying out word reduction including query on the word stock after word segmentation and the bid inviting name of the bid inviting item, wherein when the number of the search words is less than 4, the association is failed, and when the number of the search words is more than 4, the step S2032 is executed;
s2032, judging the result containing the query, if the result contains the query and returns, executing the step S3, and if the result contains the query and does not return, continuing the query by subtracting the word from the tail.
8. The method as claimed in claim 1, wherein in step S5, if the result is negative, it is determined whether the subtracted word or sentence in the query contains the keyword.
9. The method for searching for a bid-winning item based on the funnel model and the cosine algorithm as claimed in claim 8, wherein the step S5 of determining whether the subtracted word or sentence in the query contains the keyword specifically comprises the steps of:
s501, judging whether the subtracted words contain the deleted words in the query or not, wherein the deleted words contain the keywords: union, punctuation, supervision, design and general contract, and judges whether the project name of the bidding project contains the subtracted words and sentences which are not deleted in the query, if not, judges whether the subtracted words contain the deleted words and sentences in the query and if so, executes the step S502;
s502, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query through a cosine algorithm, and executing the step S503 when the matching degree is less than 60%; when the matching degree is more than 60%, indicating that the association is successful;
s503, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, executing the step S504 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
s504, whether the text of the bid-seeking item contains the subtracted words and sentences in the query or not is judged, if the result is negative, the association is failed, and if the result is positive, the association is successful.
10. The method for retrieving a bid-winning item based on a funnel model and a cosine algorithm as claimed in claim 1, wherein said step S6 specifically comprises the following sub-steps:
s601, obtaining the matching degree of the item name of the bid-winning item and the subtracted words containing the undeleted words and sentences in the query according to a cosine algorithm, judging the matching degree, executing the step S602 when the matching degree is less than 60%, and indicating that the association is successful when the matching degree is more than 60%;
s602, judging the matching degree of the item name of the replaced bid-winning item and the item name of the bid-winning item, judging the matching degree, executing the step S603 when the matching degree is less than 94%, and indicating that the association is successful when the matching degree is more than 94%;
and S603, judging whether the text of the bid-calling item contains the subtracted words and sentences in the query, if not, indicating that the association is failed, and if so, indicating that the association is successful.
CN202210294741.XA 2022-03-24 2022-03-24 Bid-winning item retrieval method based on funnel model and cosine algorithm Pending CN114817458A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210294741.XA CN114817458A (en) 2022-03-24 2022-03-24 Bid-winning item retrieval method based on funnel model and cosine algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210294741.XA CN114817458A (en) 2022-03-24 2022-03-24 Bid-winning item retrieval method based on funnel model and cosine algorithm

Publications (1)

Publication Number Publication Date
CN114817458A true CN114817458A (en) 2022-07-29

Family

ID=82530230

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210294741.XA Pending CN114817458A (en) 2022-03-24 2022-03-24 Bid-winning item retrieval method based on funnel model and cosine algorithm

Country Status (1)

Country Link
CN (1) CN114817458A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115187349A (en) * 2022-09-13 2022-10-14 工保科技(浙江)有限公司 Information processing method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115187349A (en) * 2022-09-13 2022-10-14 工保科技(浙江)有限公司 Information processing method and device

Similar Documents

Publication Publication Date Title
CN109493977B (en) Text data processing method and device, electronic equipment and computer readable medium
US9864741B2 (en) Automated collective term and phrase index
US6904402B1 (en) System and iterative method for lexicon, segmentation and language model joint optimization
CN110263177B (en) Knowledge graph construction method for event prediction and event prediction method
CN110276023B (en) POI transition event discovery method, device, computing equipment and medium
Wang et al. Nested named entity recognition: a survey
WO2019217096A1 (en) System and method for automatically responding to user requests
WO2020259280A1 (en) Log management method and apparatus, network device and readable storage medium
CN108491389B (en) Method and device for training click bait title corpus recognition model
US8386238B2 (en) Systems and methods for evaluating a sequence of characters
US20120158742A1 (en) Managing documents using weighted prevalence data for statements
US9754083B2 (en) Automatic creation of clinical study reports
CN111753082A (en) Text classification method and device based on comment data, equipment and medium
CN110781669A (en) Text key information extraction method and device, electronic equipment and storage medium
Hu et al. Self-supervised synonym extraction from the web.
Abad et al. Supporting analysts by dynamic extraction and classification of requirements-related knowledge
WO2021042529A1 (en) Article abstract automatic generation method, device, and computer-readable storage medium
CN114817458A (en) Bid-winning item retrieval method based on funnel model and cosine algorithm
CN111737607B (en) Data processing method, device, electronic equipment and storage medium
US20230139642A1 (en) Method and apparatus for extracting skill label
US10534788B2 (en) Automatically determining a recommended set of actions from operational data
CN112417860A (en) Training sample enhancement method, system, device and storage medium
US11853700B1 (en) Machine learning techniques for natural language processing using predictive entity scoring
Zaraket et al. Arabic temporal entity extraction using morphological analysis
CN111126073A (en) Semantic retrieval method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination