CN103092987A - Fast document retrieval method and device - Google Patents

Fast document retrieval method and device Download PDF

Info

Publication number
CN103092987A
CN103092987A CN2013100482461A CN201310048246A CN103092987A CN 103092987 A CN103092987 A CN 103092987A CN 2013100482461 A CN2013100482461 A CN 2013100482461A CN 201310048246 A CN201310048246 A CN 201310048246A CN 103092987 A CN103092987 A CN 103092987A
Authority
CN
China
Prior art keywords
driver
file
index file
log recording
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013100482461A
Other languages
Chinese (zh)
Inventor
张影宣
王东辉
潘纯峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
UNITED ELECTRONICS CO Ltd
Original Assignee
UNITED ELECTRONICS CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by UNITED ELECTRONICS CO Ltd filed Critical UNITED ELECTRONICS CO Ltd
Priority to CN2013100482461A priority Critical patent/CN103092987A/en
Publication of CN103092987A publication Critical patent/CN103092987A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a fast document retrieval method and a device. The Fast document retrieval method comprises the steps of creating an indexed file, wherein the indexed file is obtained by extracting file information in update sequence number (USN) log records of all actuators; inputting a keyword of the name of the file during document retrieval each time and retrieving information in the indexed file; and displaying a retrieving result. The fast document retrieval method and the device are capable of reducing waste of system resources and improving retrieving speed.

Description

A kind of accelerated file access method and apparatus
Technical field
The present invention relates to field of computer technology, refer to especially a kind of accelerated file access method and apparatus.
Background technology
At present, along with the capacity of hard disk is increasing, the quantity of disk File is also more and more, we usually forget the deposit position of file, only remember name or part name or other attributes, for example file type of file, date createds etc., at this moment we just need a kind of service of document retrieval.
Carried the function of the search of file in the windows system, its supports by date created, revise date, file type, file size retrieves, and also supports the asterisk wildcard retrieval, but powerful function also bring drawback.It is totally to retrieve, and makes retrieval rate slack-off, can not rapid feedback, and need the user to wait for.For example: based on hardware environment be CPU:Inter I3, in to save as 2GB/DDR3, hard disk be in 500G/7200 situation about turning, CPU usage 24%-27% during the data of retrieval DISK to Image, system resource expends greatly.
Summary of the invention
In view of this, the object of the invention is to propose a kind of accelerated file access method and apparatus, can reduce the waste of system resource, improve retrieval rate.
Based on above-mentioned purpose accelerated file access method provided by the invention, comprise the following steps:
Create index file, described index file is the fileinfo that extracts in each driver USN log recording;
When carrying out document retrieval, the key word of input file title is retrieved in indexed file at every turn; And
Show result for retrieval.
Alternatively, create described index file, comprise the following steps:
Obtain each Driver Information;
Judge that whether each driver file system title is the NTFS form, changes the driver of non-NTFS form;
Judge on the driver of NTFS form whether opened the USN log recording, if open, obtain each driver USN log recording; If do not open, first open the USN log recording, then obtain each driver USN log recording;
Create index file according to each driver USN log information of obtaining.
Alternatively, described index file extracts file name, file address and the Establish reason in each driver USN log recording.
Further, the record in described index file is arranged sequentially according to file name letter ascending order.
Further, described index file is the fileinfo that extracts in each driver USN log recording, simultaneously can the described index file of regular update, can synchronize with the fileinfo in each driver USN log recording.
Based on above-mentioned purpose, the present invention also provides a kind of accelerated file access device, comprising:
The index file creation module is used for creating index file, and described index file is the fileinfo that extracts in each driver USN log recording;
Retrieval module for the key word of input file title, is retrieved in indexed file; And
Display module, the result that is used for retrieving shows.
Alternatively, described index file creation module comprises:
The obtaining information module is used for obtaining each Driver Information, judges whether each driver file system title is the NTFS form, if the NTFS form, the obtaining information module is connected with detection module; If not the NTFS form, the obtaining information module is connected with modular converter, and described modular converter is connected with detection module;
Modular converter is used for the driver that the file system title is not the NTFS form is changed;
Detection module, for whether having opened driver USN log recording on the driver that judges the NTFS form, if open, described detection module is connected with creation module; If do not open, described detection module is connected with opening module, and described opening module is connected with creation module;
Opening module is used for opening driver USN log recording;
Creation module is used for obtaining each driver USN log information, and creates index file.
Alternatively, described index file extracts file name, file address and the Establish reason in each driver USN log recording.
Further, the record in described index file is arranged sequentially according to file name letter ascending order.
Further, described index file is the fileinfo that extracts in each driver USN log recording, simultaneously can the described index file of regular update, can synchronize with the fileinfo in each driver USN log recording.
As can be seen from above, a kind of accelerated file access method and apparatus provided by the invention, by direct search index file, index file is the fileinfo that extracts in each driver USN log recording, simultaneously can the described index file of regular update.Thereby can save retrieval time, improve recall precision.
Description of drawings
Fig. 1 is the schematic flow sheet of a kind of accelerated file access method of the embodiment of the present invention;
Fig. 2 is the method flow schematic diagram that the embodiment of the present invention creates index file;
Fig. 3 is the structural representation of a kind of accelerated file access device of the embodiment of the present invention.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, below in conjunction with specific embodiment, and with reference to accompanying drawing, the present invention is described in more detail.
Consult shown in Figure 1ly, the schematic flow sheet for a kind of accelerated file access method of inventive embodiments comprises:
Step 101 judges whether there is index file in system, specific implementation process is as follows:
When 1) having index file in system being detected, directly carry out step 102.
When 2) not having index file in system being detected, first create index file, then execution in step 102.Consult shown in Figure 2ly, be that the embodiment of the present invention creates the method flow schematic diagram of index file, wherein:
Step 201 is obtained each Driver Information.
In one embodiment of the invention, the information by Win32API read local driver, need to utilize and obtain the driver list with the GetLogicalDriveStrings function, then obtain the file system type of specifying driver by the GetVolumelnformation function.
Step 202 judges whether each driver file system type is the NTFS form, and specific implementation process is as follows:
1) when the driver file system type being detected and be the NTFS form, directly carry out step 203.
2) when the driver file system type being detected and be not the NTFS form, can use file system converter, utilize the Convert.exe in system that the driver file system type of non-NTFS form is converted to new technology file system.And then carry out step 203.
Step 203 judges on the driver of NTFS form whether opened the USN log recording.Specific implementation process is as follows:
1) if the driver USN log recording of having opened the NTFS form detected, directly carry out step 204.
2) if the driver USN log recording of not opening the NTFS form detected, first the driver USN log recording of NTFS form is opened, then carry out step 204.
Step 204 is obtained each driver USN log recording.
Step 205 creates index file according to each driver USN log information of obtaining.In an embodiment of the present invention, extract file name, file address and Establish reason in each driver USN log recording, create index file.Preferably, the record in index file is arranged sequentially according to file name letter ascending order.
Step 102, the key word of input file title is retrieved in indexed file.Specific implementation process is as follows:
In one embodiment of the invention, described index file is the fileinfo that extracts in each driver USN log recording, in order to synchronize with the fileinfo in each driver USN log recording, and can the regular update index file.When retrieving at every turn, only need the key word of input file title, then directly retrieve in indexed file.
Step 103 shows result for retrieval.
In an embodiment of the present invention, the result of retrieval is that file name, file address and the Establish reason of this retrieving files that will record in index file shows.
Consult shown in Figure 3ly, be the structural representation of a kind of accelerated file access device of the embodiment of the present invention, wherein:
Detect index file and whether have module 301, can judge whether there is index file in system, its major function comprises:
When 1) having index file in system being detected, be connected with retrieval module 308.
When 2) not having index file in system being detected, be connected with index file creation module 302.
Index file creation module 302 is used for creating index file, comprising:
Whether obtaining information module 303 exists module 301 to be connected with described detection index file, and its major function comprises:
1) obtain each Driver Information: in one embodiment of the invention, the information by Win32API read local driver, need to utilize and obtain the driver list with the GetLogicalDriveStrings function, then obtain the file system type of specifying driver by the GetVolumelnformation function.
2) judge whether each driver file system title is the NTFS form, when the driver file system type being detected and be the NTFS form, obtaining information module 303 is connected with detection module 305.When the driver file system type being detected and be not the NTFS form, obtaining information module 303 is connected with modular converter 304, and modular converter 304 is connected with detection module 305.
Modular converter 304, in an embodiment of the present invention, the driver that the file system title is not the NTFS form is changed, can be used file system converter, utilize the Convert.exe in system that the driver file system type of non-NTFS form is converted to new technology file system.
Detection module 305, for whether having opened driver USN log recording on the driver that judges the NTFS form, its major function comprises:
1) if the driver USN log recording of having opened the NTFS form detected, detection module 305 is connected with creation module 307.
2) if the driver USN log recording of not opening the NTFS form detected, detection module 305 is connected with opening module 306, and opening module 306 and creation module 307.
Opening module 306 can opening driver USN log recording.
Creation module 307 is obtained each driver USN log recording, and creates index file according to each driver USN log information of obtaining.In an embodiment of the present invention, extract file name, file address and Establish reason in each driver USN log recording, create index file.Preferably, the record in index file is arranged sequentially according to file name letter ascending order.
Retrieval module 308, the key word that is used for the input file title, retrieve in indexed file: in one embodiment of the invention, described index file is the fileinfo that extracts in each driver USN log recording, in order to synchronize with the fileinfo in each driver USN log recording, can the regular update index file.When retrieving at every turn, only need the key word of input file title, then directly retrieve in indexed file.
Display module 309, the result that is used for retrieving shows; In an embodiment of the present invention, the result of retrieval is that file name, file address and the Establish reason of this retrieving files that will record in index file shows.
This shows.A kind of rapid file searching method and device that the present invention realizes creatively can create index file based on the USN log recording, retrieve by index file; And the speed that creates index file is very fast, and 1 minute just can 1,000,000 files of index, for a plurality of files of the hundreds of thousands on the hard disk of hundreds of G, set up the time of index needs and also only have several seconds, computing machine newly can better establishment index speed faster; The index file that creates is very little also, 160,000 files, and the size of the index file of generation can be controlled in 1MB; The speed of retrieving by index file is exceedingly fast, and index file makes search speed just can complete in second at 1-4; Meanwhile, rapid file searching method of the present invention and device implement simple and easy to do.
Those of ordinary skill in the field are to be understood that: the above only is specific embodiments of the invention; be not limited to the present invention; within the spirit and principles in the present invention all, any modification of making, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (10)

1. an accelerated file access method, is characterized in that, comprises the following steps:
Create index file, described index file is the fileinfo that extracts in each driver USN log recording;
When carrying out document retrieval, the key word of input file title is retrieved in indexed file at every turn; And
Show result for retrieval.
2. search method according to claim 1, is characterized in that, creates described index file, comprises the following steps:
Obtain each Driver Information;
Judge that whether each driver file system title is the NTFS form, changes the driver of non-NTFS form;
Judge on the driver of NTFS form whether opened the USN log recording, if open, obtain each driver USN log recording; If do not open, first open the USN log recording, then obtain each driver USN log recording;
Create index file according to each driver USN log information of obtaining.
3. search method according to claim 1, is characterized in that, described index file extracts file name, file address and the Establish reason in each driver USN log recording.
4. search method according to claim 3, is characterized in that, the record in described index file is arranged sequentially according to file name letter ascending order.
5. the described search method of any one according to claim 1 to 4, it is characterized in that, described index file is the fileinfo that extracts in each driver USN log recording, simultaneously can the described index file of regular update, can synchronize with the fileinfo in each driver USN log recording.
6. an accelerated file access device, is characterized in that, comprising:
The index file creation module is used for creating index file, and described index file is the fileinfo that extracts in each driver USN log recording;
Retrieval module for the key word of input file title, is retrieved in indexed file; And
Display module, the result that is used for retrieving shows.
7. indexing unit according to claim 6, is characterized in that, described index file creation module comprises:
The obtaining information module is used for obtaining each Driver Information, judges whether each driver file system title is the NTFS form, if the NTFS form, the obtaining information module is connected with detection module; If not the NTFS form, the obtaining information module is connected with modular converter, and described modular converter is connected with detection module;
Modular converter is used for the driver that the file system title is not the NTFS form is changed;
Detection module, for whether having opened driver USN log recording on the driver that judges the NTFS form, if open, described detection module is connected with creation module; If do not open, described detection module is connected with opening module, and described opening module is connected with creation module;
Opening module is used for opening driver USN log recording;
Creation module is used for obtaining each driver USN log information, and creates index file.
8. indexing unit according to claim 6, is characterized in that, described index file extracts file name, file address and the Establish reason in each driver USN log recording.
9. indexing unit according to claim 8, is characterized in that, the record in described index file is arranged sequentially according to file name letter ascending order.
10. the described indexing unit of any one according to claim 6 to 9, it is characterized in that, described index file is the fileinfo that extracts in each driver USN log recording, simultaneously can the described index file of regular update, can synchronize with the fileinfo in each driver USN log recording.
CN2013100482461A 2013-02-06 2013-02-06 Fast document retrieval method and device Pending CN103092987A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013100482461A CN103092987A (en) 2013-02-06 2013-02-06 Fast document retrieval method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013100482461A CN103092987A (en) 2013-02-06 2013-02-06 Fast document retrieval method and device

Publications (1)

Publication Number Publication Date
CN103092987A true CN103092987A (en) 2013-05-08

Family

ID=48205552

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013100482461A Pending CN103092987A (en) 2013-02-06 2013-02-06 Fast document retrieval method and device

Country Status (1)

Country Link
CN (1) CN103092987A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899337A (en) * 2015-07-01 2015-09-09 中国农业银行股份有限公司 File index building method and system
CN105574192A (en) * 2015-12-24 2016-05-11 张梅云 Computer document retrieval method
CN108874820A (en) * 2017-05-11 2018-11-23 广东爱瓦力科技股份有限公司 A kind of system file searching method
CN111274207A (en) * 2020-01-16 2020-06-12 青岛科技大学 Novel file retrieval method and system based on data visualization analysis
CN112380416A (en) * 2020-11-25 2021-02-19 北京慕华信息科技有限公司 Method for updating course index, course searching method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1790334A (en) * 2004-12-17 2006-06-21 佳能株式会社 Document management system and document management method
CN101197700A (en) * 2006-12-05 2008-06-11 阿里巴巴公司 Method and system for providing log service
CN102609502A (en) * 2012-02-02 2012-07-25 深圳市中兴移动通信有限公司 Method and system for mobile terminal desktop searching based on log mode

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1790334A (en) * 2004-12-17 2006-06-21 佳能株式会社 Document management system and document management method
US20060136412A1 (en) * 2004-12-17 2006-06-22 Canon Kabushiki Kaisha Document management system and document management method
CN101197700A (en) * 2006-12-05 2008-06-11 阿里巴巴公司 Method and system for providing log service
CN102609502A (en) * 2012-02-02 2012-07-25 深圳市中兴移动通信有限公司 Method and system for mobile terminal desktop searching based on log mode

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
特种兵-AK47: ""文件搜索工具-everything,相关原理"", 《HTTP://BLOG.CSDN.NET/M13666368773/ARTICLE/DETAILS/8363180》, 21 December 2012 (2012-12-21), pages 1 - 3 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899337A (en) * 2015-07-01 2015-09-09 中国农业银行股份有限公司 File index building method and system
CN104899337B (en) * 2015-07-01 2018-05-29 中国农业银行股份有限公司 The method for building up and system of a kind of file index
CN105574192A (en) * 2015-12-24 2016-05-11 张梅云 Computer document retrieval method
CN108874820A (en) * 2017-05-11 2018-11-23 广东爱瓦力科技股份有限公司 A kind of system file searching method
CN108874820B (en) * 2017-05-11 2021-11-30 广东爱瓦力科技股份有限公司 System file searching method
CN111274207A (en) * 2020-01-16 2020-06-12 青岛科技大学 Novel file retrieval method and system based on data visualization analysis
CN112380416A (en) * 2020-11-25 2021-02-19 北京慕华信息科技有限公司 Method for updating course index, course searching method and device

Similar Documents

Publication Publication Date Title
US11651149B1 (en) Event selection via graphical user interface control
US11550829B2 (en) Systems and methods for load balancing in a system providing dynamic indexer discovery
US20200034414A1 (en) Determining an extraction rule from positive and negative examples
US20160098409A1 (en) Statistics Value Chart Interface Row Mode Drill Down
US20190213765A1 (en) Systems and methods for rendering a visualization using event data
CN110633186A (en) Log monitoring system for electric power metering micro-service architecture and implementation method
JP5950282B2 (en) Apparatus and method for generating a user interface
RU2595524C2 (en) Device and method for processing content of web resource in browser
CN103092987A (en) Fast document retrieval method and device
US20190266153A1 (en) Searches of highly structured data
CN113688288B (en) Data association analysis method, device, computer equipment and storage medium
CN108628885B (en) Data synchronization method and device and storage equipment
CN103425257A (en) Method and device for prompting information of uncommon characters
US20140310324A1 (en) Systems and methods for automatically sorting and indexing electronic files
CN103092981A (en) Method and electronic equipment for building speech marks
CN102508857B (en) Desktop cloud searching method based on event correlation
CN111782731A (en) Data synchronization method and device
Han et al. Understanding and modeling behavior patterns in cross‐device web search
US20150089018A1 (en) Centralized management of webservice resources in an enterprise
CN103678383A (en) Data processing method and electronic device
CN103164491B (en) The method and apparatus of a kind of data processing and retrieval
CN102346559A (en) Method and device for deleting lexical items in input method as well as character input tool
CN105740997A (en) Method and device for controlling task flow, and database management system
CN107818126B (en) Full-text information retrieval method oriented to Mongo database
Jansen et al. Designing Scalable Cyberinfrastructure for Metadata Extraction in Billion-Record Archives.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130508