CN104823169B - Index configuration for searchable data in a network - Google Patents

Index configuration for searchable data in a network

Publication number
CN104823169B
Authority
CN
China
Prior art keywords
data
data field
search
size
subregion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380053433.7A
Other languages
Chinese (zh)
Other versions
CN104823169A (en
Inventor
J·M·高德博格
J·B·汉德勒
A·M·A·麦克哈尼
E·K·E·恩沃卡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
A9com
A9com Co Ltd
Original Assignee
A9com Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/650,931 external-priority patent/US9507750B2/en
Priority claimed from US13/650,961 external-priority patent/US9047326B2/en
Application filed by A9com Co Ltd filed Critical A9com Co Ltd
Priority to CN201811424497.4A priority Critical patent/CN110096502A/en
Publication of CN104823169A publication Critical patent/CN104823169A/en
Application granted granted Critical
Publication of CN104823169B publication Critical patent/CN104823169B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22: Indexing; Data structures therefor; Storage structures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10: File systems; File servers
    • G06F16/18: File system types
    • G06F16/182: Distributed file systems
    • G06F16/1824: Distributed file systems implemented using Network-attached Storage [NAS] architecture
    • G06F16/1827: Management specifically adapted to NAS
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22: Indexing; Data structures therefor; Storage structures
    • G06F16/2228: Indexing structures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20: Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22: Indexing; Data structures therefor; Storage structures
    • G06F16/2228: Indexing structures
    • G06F16/2272: Management thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An entity using a computing device can upload searchable data to a network service to be indexed and stored. The data can include a plurality of data fields, each data field having one or more associated values. The network service can analyze the data fields and their respective associated values to determine the data field types of the data fields and the search options to be enabled for the data fields. Based at least in part on the data field types and the search options, the network service can generate a search index configuration/schema. Based at least in part on the generated search index configuration/schema, the network service can generate a search index for the data. In some embodiments, the network service can also convert the data into a format compatible with the search index.

Description

Index configuration for searchable data in a network
Background
Computing devices are often used to communicate over a network such as the Internet. Network-based services offered by service providers are becoming more common. Computing devices are frequently used to connect to network-based services, which can provide services such as storing searchable data for use or retrieval by the computing devices, or providing additional processing power to the computing devices. With respect to network-based storage of searchable data, the user of a computing device typically has to choose a configuration and/or format for the data so that the data can be indexed and stored by the network-based service. Conventional approaches typically require the user to determine a suitable configuration for the data. Conventional approaches may also prescribe a format with which the user's data must comply, and therefore require the user to convert the data into that format. This can be inconvenient, cumbersome, or difficult for a user who wants to use a network-based service for storage and search, and it degrades the overall user experience.
Brief description of the drawings
Various embodiments in accordance with the present disclosure are described with reference to the drawings, in which:
Fig. 1 shows an example environment in which aspects of the various embodiments can be utilized;
Fig. 2 shows an example system embodiment for index configuration for searchable data in a networked environment;
Fig. 3 shows an example web browsing environment that uses index configuration for searchable data in a networked environment;
Fig. 4 shows an example search index that can be generated in accordance with the various embodiments;
Fig. 5 shows an example method embodiment for index configuration for searchable data in a networked environment;
Fig. 6 shows an example device that can be used to implement aspects of the various embodiments;
Fig. 7 shows example components of a client device such as the device shown in Fig. 6; and
Fig. 8 shows an environment in which the various embodiments can be implemented.
Detailed description
Systems and methods are described for generating an index configuration that can be used to generate a search index for data received over at least one network. At least some embodiments allow a computing device to upload data over a network (e.g., the Internet) to be stored in a storage allocation provided by a network service (i.e., a network service provider). The network service can analyze the uploaded data to determine the type of each data field (i.e., the data field type) in a plurality of data fields. The network service can also analyze the uploaded data to determine whether to enable one or more search options for each data field of the plurality of data fields included in the uploaded data.
At least some embodiments allow a computing device to upload data over a network (e.g., the Internet) to a storage allocation provided by a network service (i.e., a network service provider, network-based service, etc.). One or more users/entities (e.g., using one or more computing devices) can search the uploaded data over the network using a search index provided by the network service.
In some embodiments, the uploaded data can include a plurality of data fields. The network service can analyze the uploaded data to determine the type of each data field (i.e., the data field type) in the plurality of data fields. For example, each data field can have a type such as an integer type, a text type, or a literal type.
In addition, the network service can analyze the uploaded data to determine whether to enable one or more search options for each data field of the plurality of data fields included in the uploaded data. For example, the network service can determine, for each respective data field, whether to enable an option to include the respective data field in the search index to be generated. The network service can also determine, for each respective data field, whether to enable an option to calculate facet counts for the respective data field. Further, the network service can determine, for each respective data field, whether to enable an option to return/provide the values associated with the respective data field in response to a search query.
In some embodiments, the network service can generate an index configuration (i.e., a search index configuration, schema, index settings, etc.) for the data based at least in part on the determined data field types and the search options to be enabled. The network service can then generate a search index for the data based at least in part on the index configuration.
Various other functions and advantages are described and suggested below as may be provided in accordance with the various embodiments.
Fig. 1 shows an example environment 100 in which aspects of the various embodiments can be utilized. The example environment 100 can include at least one computing device 102, a network 104 (e.g., the Internet, an intranet, a local network, a local area network, etc.), and a network service 106 (i.e., a network service provider, network-based service, etc.). The at least one computing device 102 can be communicatively connected to the network service 106 via the network 104. In some embodiments, the computing device 102 can communicate with the network service 106 without the network 104 (e.g., without the Internet). As shown in Fig. 1, there can also be a user or other entity 108 (e.g., an individual, company, organization, group, etc.) of the at least one computing device 102. The user or entity 108 can transmit data 110 from the at least one computing device 102 to the network service 106 (and vice versa) over the network 104.
In some embodiments, the network service 106 can include and/or utilize one or more hosts or servers connected to the network 104. For example, the network service 106 can rent storage space to a customer, such as a user or other entity 108 (e.g., a company, organization, group, individual, etc.) of the device 102. Accordingly, the user/entity 108 of the computing device 102 can use the network 104 to store data from the device 102 on the network service 106. In other words, the user/entity 108 and/or the device 102 can use network-based computing storage via the network service 106.
In one example, the computing device 102 can transmit, over the network 104, data 110 to be stored on the network service 106, as shown in Fig. 1. The data 110 can be any data used for network-based computing, such as data for search, database storage, running applications, running virtual machines, running operating systems, etc. The computing device 102 can transmit the data 110 to be stored in a storage allocation provided by the service 106. For example, the user/entity 108 can purchase or rent storage space on the service 106, and a storage allocation can be assigned to the user/entity 108. In some embodiments, the user/entity 108 can have a particular account and/or storage allocation on the service 106; the storage space (e.g., storage allocation) assigned to the entity 108 can be associated with the account of the entity 108.
The entity 108 may also want the network service 106 to provide a search index for the data 110. Conventional approaches typically require the entity 108 to first provide a configuration (i.e., an index configuration, schema, index settings, etc.) for indexing the data 110, or they may prescribe a configuration/format (e.g., search data format (SDF)) with which the entity's data 110 must comply, thereby requiring the entity 108 to convert its data 110 into the required configuration. However, this can be inconvenient, cumbersome, or difficult for the entity 108.
In some embodiments, the entity 108 can transmit the data 110 to the network service 106, and the network service 106 can automatically (e.g., without an instruction or request from the entity 108) analyze the data 110 and generate an index configuration (e.g., a search index configuration, search index schema, etc.) for the data 110. For example, in some embodiments, the network service 106 can analyze the data 110 by determining the data field types 112 of the one or more data fields included in the data 110 and by determining the search options 114 to be enabled for the one or more data fields included in the data 110.
With respect to determining the data field types 112, there can be a variety of data field types associated with the data 110 (e.g., a document, a file, etc.), such as an integer-type data field, a literal-type data field, or a text-type data field. In some embodiments, the data 110 can include a plurality of data fields, each data field including a value (e.g., the data field "title" can have the value "ABCD-brand shirt"; the data field "price" can have the value "$20"; etc.). The network service 106 can analyze the plurality of data fields included in the data 110 to determine the data field type of each data field in the plurality of data fields.
For example, for each data field, the network service 106 can determine whether the values of the respective data field include a number of integers exceeding a specified integer count threshold (e.g., the values of the "price" data field are all integers); if so, the respective data field can be determined to be an integer data field type. The network service 106 can also determine whether a data field is a literal data field type, for example, by determining at least one of the following: the values associated with the data field have a number of alphabetic characters above a specified literal lower bound but below a specified literal upper bound; the number of distinct values associated with the data field is below a specified literal distinct-count threshold; the percentage of distinct values is below a specified literal distinct-percentage threshold; or the length of the values is below a specified literal length threshold. In some embodiments, the network service 106 can consider, for example, the length of the data field values and the frequency and/or percentage of distinct values among them to identify a data field as a text type; if the data field values include many distinct values and the values are long (e.g., the number of alphabetic characters exceeds a threshold), the data field is likely a text type. In some embodiments, if a data field is neither an integer type nor a literal type, the data field can be a text type.
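The type determination described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the threshold names and values are hypothetical (the description names thresholds but gives no numbers), the integer test is approximated as the fraction of values that parse as integers, and the literal test here requires both few distinct values and short values, whereas the description allows any one criterion to suffice.

```python
from typing import Sequence

# Hypothetical thresholds; the description names them but gives no values.
INTEGER_FRACTION_THRESHOLD = 0.95   # share of values that must parse as integers
LITERAL_MAX_DISTINCT = 50           # literal distinct-count threshold
LITERAL_MAX_DISTINCT_RATIO = 0.5    # literal distinct-percentage threshold
LITERAL_MAX_LENGTH = 32             # literal length threshold


def is_integer(value: str) -> bool:
    """Return True if the value parses as a base-10 integer."""
    try:
        int(value.strip())
        return True
    except ValueError:
        return False


def infer_field_type(values: Sequence[str]) -> str:
    """Classify a data field as 'integer', 'literal', or 'text' from its values."""
    if not values:
        return "text"
    integer_fraction = sum(is_integer(v) for v in values) / len(values)
    if integer_fraction >= INTEGER_FRACTION_THRESHOLD:
        return "integer"
    distinct = set(values)
    few_distinct = (
        len(distinct) <= LITERAL_MAX_DISTINCT
        and len(distinct) / len(values) <= LITERAL_MAX_DISTINCT_RATIO
    )
    short_values = max(len(v) for v in values) <= LITERAL_MAX_LENGTH
    if few_distinct and short_values:
        return "literal"
    # Neither integer nor literal: fall back to text, as the description suggests.
    return "text"
```

With these assumed thresholds, a "color" field with a handful of repeated short values comes out as a literal type, while a "description" field whose values are long and mostly unique falls through to the text type.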
With respect to determining the search options 114, the network service 106 can determine one or more search options 114 to be enabled for the data 110 (i.e., for its data fields). For example, having determined the data field type of a data field included in the data 110, the network service 106 can determine whether to enable an option to include the data field in the search index to be generated, whether to enable an option to calculate facet counts for the data field, and/or whether to enable an option to return/provide the values of the data field in search results.
For example, if the data field type of a data field is determined to be text (e.g., the data field is "product description" and the values are long paragraphs), the network service 106 can choose not to enable the option to include the data field (and its values) in the search index. In another example, for a data field having an integer data field type (e.g., the data field is "production year" and the value is a year), the network service 106 can choose to enable the option to include the data field in the search index to be generated, and to enable the option to calculate facet counts for the data field. A facet count is a count of how many search results fall under a certain category of a data field. For example, if the data field is "production year", the network service 106 can determine that facet counts should be provided, indicating how many search results are associated with each category; for example, "1984 (23), 2002 (12), 2010 (18)" shows facet counts for the "production year" data field, in which 23 search results are associated with "1984", 12 search results are associated with "2002", and 18 search results are associated with "2010".
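As a minimal sketch of the facet count idea, assuming search results are represented as plain dictionaries and using the hypothetical field name `production_year`, the counts from the example above can be computed as follows.

```python
from collections import Counter
from typing import Iterable, Mapping


def facet_counts(results: Iterable[Mapping[str, object]], field: str) -> Counter:
    """Count how many search results fall under each value of `field`."""
    return Counter(result[field] for result in results if field in result)


def format_facets(counts: Counter) -> str:
    """Render facet counts as 'value (count)' pairs ordered by value."""
    return ", ".join(f"{value} ({n})" for value, n in sorted(counts.items()))


# Hypothetical result set reproducing the counts used in the description.
results = (
    [{"production_year": 1984}] * 23
    + [{"production_year": 2002}] * 12
    + [{"production_year": 2010}] * 18
)
print(format_facets(facet_counts(results, "production_year")))
# prints: 1984 (23), 2002 (12), 2010 (18)
```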
In some embodiments, the network service 106 can also decide whether to allow the values of a data field to be returned. For example, not every searchable data field (and its values) needs to be returned (e.g., retrieved and presented) in response to a search request. The network service 106 can decide whether to return the values of a data field.
Turning now to generating the configuration for the data 110, the network service can automatically (e.g., without an instruction from the entity 108) generate a configuration (e.g., a search index configuration, schema, etc.) for the data 110. In some embodiments, the configuration can at least partly help determine how the data 110 is to be indexed; the index configuration can at least partly govern how the data 110 will be indexed. The configuration or schema can specify the data field type of each data field included in the data 110, indicate whether each data field is searchable, indicate whether each data field is sortable, and include other similar information that helps in building the index. After generating the configuration for the data 110 to be indexed, the network service 106 can generate a search index for the data 110 based at least in part on the generated configuration.
Fig. 2 shows an example web browsing environment 200 that uses index configuration for searchable data in a networked environment. The example web browsing environment 200 can include an example web page 202 rendered by an application such as a web browser. In this example, the web page 202 can be provided by a network service associated with the domain ABCD.com.
For example, a user/entity (e.g., a customer of the network service) can be a retailer and can upload data related to the sale of shirts. The data can be indexed and stored by the network service and made searchable by others, such as the user's/entity's potential customers. The network service can analyze the data to determine the type of each data field included in the data (i.e., the data field type). For example, data related to the sale of shirts can include data fields such as "color" 206, "size" 208, "price" 210, "description", and other fields. The network service can analyze the values of each data field to determine the type of each respective data field. The network service can also determine one or more options (e.g., search options) to be enabled for each data field. The network service can then generate a configuration/schema for the data to be indexed. The network service can then generate an index for the data based on the configuration/schema.
For example, the network service can recognize the data field "color" and determine that the values of that data field (e.g., "red", "blue", "white", "green", etc.) are alphabetic/textual, and can identify the "color" data field as a literal type. (In this example, the "color" data field and its values (e.g., "red", "blue", "white", "green", etc.) can be uploaded by the entity.) In another example, the network service can recognize a "size" data field in at least a portion of the uploaded data and determine that the values included in the "size" data field are numeric. In this example, the network service can determine that the "size" data field is an integer type. In another example, the network service can recognize the values of a "description" data field in at least a portion of the uploaded data and can determine that the values include both numeric and alphabetic characters, and/or that the values are long in terms of character count, and/or that the values contain many different terms/phrases/symbols. In this example, the network service can determine that the "description" data field is a text type.
With respect to search options, the network service can determine, for each of the data fields, whether to enable the option to include the respective data field in the search index to be generated. For example, in some embodiments, the "description" data field (and its values) can be omitted from the search index. If so, a query run against the search index will not search the "description" data field. However, some embodiments can include the "description" data field and its values in the search index.
In addition, the network service can determine whether to enable the option to calculate facet counts for each data field. As mentioned above, a facet count indicates how many results matching a search query have a particular value (or range of values) for a particular data field. For example, as shown in Fig. 2, the "red" value of the "color" data field has a facet count of 23 (i.e., 23 search results for "red" shirts), whereas the "blue" value of the "color" data field has a facet count of 28 (i.e., 28 search results for "blue" shirts), and so on. In some embodiments, the values can overlap (i.e., they need not be exact matches). For example, a shirt with blue and red stripes can be associated with both the "blue" and "red" values and/or with other values. In some embodiments, the network service can determine that facet counts should be calculated for some data fields but need not be calculated for all data fields. For example, the network service can determine that there should be facet counts for "color", "size", and "price", while there may be no facet counts for "description".
Furthermore, the network service can determine whether to allow the values of a data field to be returned. For example, the data can include a data field "internal product identification number" whose values are product identifiers internal to the entity and are not intended to be shown to the entity's customers; accordingly, the network service can determine not to allow the values of such a data field to be returned.
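The return option can be pictured as a projection applied to each matching record before it is presented. The following is a small sketch under assumed names: the `returnable` flag, the `CONFIG` mapping, and the field names are all hypothetical and chosen only for illustration.

```python
from typing import Any, Dict, Mapping

# Hypothetical per-field options; only "returnable" matters for this sketch.
CONFIG: Dict[str, Dict[str, bool]] = {
    "color": {"returnable": True},
    "price": {"returnable": True},
    "internal_product_id": {"returnable": False},
}


def visible_fields(
    document: Mapping[str, Any],
    config: Mapping[str, Mapping[str, bool]],
) -> Dict[str, Any]:
    """Keep only the fields whose configuration allows their values to be returned."""
    return {
        name: value
        for name, value in document.items()
        if config.get(name, {}).get("returnable", False)
    }
```

Fields missing from the configuration are withheld by default in this sketch, which is a conservative choice rather than something the description specifies.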
It is contemplated that there can be additional options, and data related to other items, as a person of ordinary skill in the art would recognize. For example, the network service can determine whether to enable an option that makes a data field sortable. Referring to Fig. 2, in some embodiments the "price" data field can be sorted/ranked by its values (e.g., from lowest price to highest price, from highest price to lowest price, etc.), the "color" data field can be sorted alphabetically (not shown in Fig. 2), and so on. In another example (not shown), there can be data related to media files such as music, videos, books, photos, etc. Example data fields for media files can include, but are not limited to, "title", "artist/author", "creation time", "price", "rating", etc.
Having determined the types of the data fields included in the data and the one or more search options for those data fields, the network service can generate a configuration (i.e., a search index configuration, schema, etc.) for the data, the generation being based at least in part on the determined data field types and search options.
After generating the configuration, the network service can generate a search index for the data based at least in part on the generated configuration. As a result, the data provided by the entity can be stored with the network service along with the search index that the network service generates for the data.
Fig. 3 shows an example system embodiment 300 for index configuration for searchable data in a networked environment. The example system embodiment 300 can include a system controller 302, at least one communication transceiver 304, a data field type analyzer 306, a search option analyzer 308, an index configuration generator 310, an index generator 312, and at least one storage allocation 314.
The system controller 302 can facilitate the system's performance of various operations for index configuration for searchable data in a networked environment. The system controller 302 can communicate with the at least one communication transceiver 304 to facilitate data transmission to, and/or data reception from, one or more sources external to the system 300, and to facilitate data communication within the system.
Data received by the system 300 via the communication transceiver 304 (e.g., from an entity) can be analyzed by the data field type analyzer 306 to determine the type associated with each of the data fields included in the data. The data can also be analyzed by the search option analyzer 308 to determine whether to enable one or more search options for each of the data fields included in the data. Based at least in part on the determined data field types and the one or more determined search options, the index configuration generator 310 can generate a search index configuration/schema. Then, based at least in part on the generated search index configuration/schema, the index generator 312 can generate a search index for the data. The data and the search index generated for the data can be stored in the one or more storage allocations 314.
It is contemplated that the various components and/or portions of the example system 300 can be implemented as hardware, software, or a combination of both. For example, various portions of the system 300 can be implemented via circuitry, a processor, an application, a portion of program code, an algorithm, or any combination thereof. It is further contemplated that Fig. 3 is an example and is intended for illustrative purposes only. For example, the components need not be configured as shown in Fig. 3. In some embodiments, the components need not be closely coupled to one another and can instead be spread across a more distributed system. For example, a component such as the index generator can reside on a separate/different network and/or system while still maintaining a communicative connection to the other components.
Fig. 4 shows an example search index 400 that can be generated in accordance with various embodiments of the present disclosure. Referring to Fig. 4, there can be a root node 402 in the search index. In the example of Fig. 4, the data can be uploaded by an entity such as a T-shirt retailer. The data can correspond to information about the T-shirts (root node 402) that the entity manufactures and makes available for sale. There can be parent nodes (e.g., 404, 406, 408) representing the data fields of the data related to the T-shirts. For example, a T-shirt can have a color data field 404, a size data field 406, and a price data field 408.
Continuing with the example of Fig. 4, the data fields can have child nodes (e.g., 410, 412, 414, 416, 418) representing the values in each respective data field. For example, there can be at least two colors (red 410 and blue 412), one size (medium 414), and two price ranges (<$10 416 and $10-$20 418). There can also be a set of search results/items (e.g., T-shirts 420, 422, 424, 426, 428, 430), which can correspond to one or more of the data fields and values.
In this example, all three data fields (color 404, size 406, and price 408) are to be included in the search index, can have facet counts, and can have their values returned/provided in response to relevant search queries. For example, as shown in Fig. 4, color: red 410 can have a facet count of three, and color: blue 412 can have a facet count of two. Size: medium 414 can have a facet count of two. Price: <$10 416 can have a facet count of one, and price: $10-$20 418 can have a facet count of two. Furthermore, a search query for color: red 410, for example, will return T-shirts 422, 424, and 428; a search for both red 410 and blue 412, for example, will return T-shirt 422; and so on. Although the example search index 400 is shown as a tree structure, it is contemplated that the search index can be generated in a number of alternative ways and/or using other structures.
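The behavior described for Fig. 4 can be reproduced with a flat inverted index instead of a tree, which is one of the alternative structures the description allows for. In this sketch, only the red results (422, 424, 428), the red-and-blue result (422), and the facet counts come from the description; the remaining item attributes are hypothetical and were chosen so that the counts match.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

Term = Tuple[str, str]  # (data field, value)

# Item attributes keyed by reference numeral; partly hypothetical (see lead-in).
ITEMS: Dict[int, Dict[str, List[str]]] = {
    420: {"price": ["<$10"]},
    422: {"color": ["red", "blue"]},
    424: {"color": ["red"], "size": ["medium"]},
    426: {"color": ["blue"]},
    428: {"color": ["red"], "price": ["$10-$20"]},
    430: {"size": ["medium"], "price": ["$10-$20"]},
}


def build_index(items: Dict[int, Dict[str, List[str]]]) -> Dict[Term, Set[int]]:
    """Map each (field, value) pair to the set of item ids that carry it."""
    index: Dict[Term, Set[int]] = defaultdict(set)
    for item_id, fields in items.items():
        for field, values in fields.items():
            for value in values:
                index[(field, value)].add(item_id)
    return index


def search(index: Dict[Term, Set[int]], *terms: Term) -> Set[int]:
    """Return the items that match every given (field, value) term."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


index = build_index(ITEMS)
print(sorted(search(index, ("color", "red"))))                     # [422, 424, 428]
print(sorted(search(index, ("color", "red"), ("color", "blue"))))  # [422]
print(len(index[("color", "blue")]))                               # facet count: 2
```

In this layout a facet count is simply the size of a posting set, and a multi-term query is the intersection of the posting sets involved.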
Fig. 5 shows an example method embodiment 500 for index configuration for searchable data in a networked environment. Again, it should be understood that, within the scope of the various embodiments, there can be additional, fewer, or alternative steps performed in similar or alternative orders, or in parallel, unless otherwise stated. At step 502, the example method embodiment 500 can receive data to be indexed. For example, the method 500 can receive data to be indexed that is uploaded by an entity, and the data can include a plurality of data fields (or at least one data field). In some embodiments, the example method can also determine the names of the data fields associated with the data. At step 504, the example method 500 can determine the types of the data fields associated with the data. For example, the method can determine, from among a plurality of field types, the field type associated with each data field in the plurality of data fields. The plurality of field types can include (but is not limited to) at least one of an integer type, a literal type, or a text type. The type of a data field can be determined from among the plurality of field types. In some embodiments, the plurality of data fields and their types and/or names can be identified based on tags, signals, or other indications. At step 506, the method 500 can determine one or more search options to be enabled for the data fields associated with the data. For example, the one or more search options can include at least one of the following: an option to include a respective data field in the search index to be generated; an option to calculate facet counts for the respective data field; or an option to provide one or more values associated with the respective data field. Step 508 can include generating an index configuration for the data based at least in part on the types of the data fields and the one or more search options. Then, at step 510, the method 500 can generate a search index for the data based at least in part on the index configuration for the data. In some embodiments, the search index can be generated based on whether the data is structured data, free-form text, or a combination of both. In some embodiments, the example method can also provide at least one of the data, the index configuration, or the index to be searchable by one or more search queries.
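Steps 502 through 510 can be strung together in a short Python sketch. Everything here is a simplified assumption for illustration: the type rule (all-digit values are integers, short strings are literals, anything longer is text), the 20-character cutoff, the option policy (text fields are left out of the index), and the option names `search`, `facet`, and `result` are not taken from the description.

```python
from collections import defaultdict
from typing import Any, Dict, List, Set, Tuple


def infer_type(values: List[Any]) -> str:
    """Step 504 (simplified): integer if all digits, literal if short, else text."""
    if all(str(v).lstrip("-").isdigit() for v in values):
        return "integer"
    longest = max(len(str(v)) for v in values)
    return "literal" if longest <= 20 else "text"


def choose_options(field_type: str) -> Dict[str, bool]:
    """Step 506 (hypothetical policy): leave long text fields out of the index."""
    indexed = field_type != "text"
    return {"search": indexed, "facet": indexed, "result": True}


def build_config(records: List[Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Step 508: combine field types and search options into an index configuration."""
    columns: Dict[str, List[Any]] = defaultdict(list)
    for record in records:
        for name, value in record.items():
            columns[name].append(value)
    config = {}
    for name, values in columns.items():
        field_type = infer_type(values)
        config[name] = {"type": field_type, **choose_options(field_type)}
    return config


def build_index(
    records: List[Dict[str, Any]], config: Dict[str, Dict[str, Any]]
) -> Dict[Tuple[str, Any], Set[int]]:
    """Step 510: index only the fields that the configuration marks as searchable."""
    index: Dict[Tuple[str, Any], Set[int]] = defaultdict(set)
    for doc_id, record in enumerate(records):
        for name, value in record.items():
            if config[name]["search"]:
                index[(name, value)].add(doc_id)
    return index


# Step 502: hypothetical uploaded records.
records = [
    {"color": "red", "price": 15,
     "description": "A soft cotton shirt with a relaxed everyday fit"},
    {"color": "blue", "price": 8,
     "description": "Slim-fit dress shirt made from breathable linen"},
]
config = build_config(records)
index = build_index(records, config)
```

Here the uploading entity supplies only the records; the configuration is derived from the data itself, which is the point of the method.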
Various other information can also be included in the index configuration. For example, the configuration can store information about whether a data field is facetable (i.e., whether facet counts should be computed for the data field), whether a data field is sortable (i.e., whether search results with the data field should be sorted), and so on.
In some embodiments, the network service can convert received/uploaded data from a first format into a second format that is compatible with the search index, and the data converted into the second format can be stored in one or more storage allocations. For example, the network service can receive data from an entity, and the data can be in any one or more of a number of different formats, such as .PDF, .DOC, .DOCX, .CSV, .JSON, or .XML. The network service can automatically convert the data into a format that is compatible with (e.g., recognizable by, usable by) the network service, such as the search data format (SDF).
In some embodiments, the network service can convert the data by comparing the first format with the second format and modifying at least one data field associated with the first format so that it corresponds to at least one data field associated with the second format. For example, the network service can compare the format of the data received from the entity and modify/update that format so that it is compatible with the network service. This can include identifying one or more data fields to be added to, removed from, or changed in the format.
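The add/remove/change comparison described above can be sketched as follows. This is only an illustration: the target schema `TARGET_FIELDS`, the rename table `RENAMES`, and both function names are hypothetical, and the output is a plain list of dictionaries rather than any real search data format.

```python
import csv
import io

# Hypothetical second-format schema: field name -> default value.
TARGET_FIELDS = {"title": "", "brand": "unknown", "price": 0}
# Hypothetical mapping of first-format field names to second-format names.
RENAMES = {"name": "title"}


def convert_record(record):
    """Align one first-format record with the hypothetical target schema."""
    converted = {}
    for key, value in record.items():
        key = RENAMES.get(key, key)   # change: rename a field
        if key in TARGET_FIELDS:      # remove: drop fields the target lacks
            converted[key] = value
    for key, default in TARGET_FIELDS.items():
        converted.setdefault(key, default)  # add: fill in missing fields
    return converted


def convert_csv(text):
    """Convert CSV text (first format) into a list of aligned records."""
    return [convert_record(row) for row in csv.DictReader(io.StringIO(text))]


rows = convert_csv("name,price,internal_code\nRed shoes,40,X1\n")
```

Here the `name` column is renamed, the unknown `internal_code` column is dropped, and the missing `brand` field is added with a default. Values stay as strings because CSV carries no type information.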
In some embodiments, the network service can determine that the type of a data field is the integer type based on determining that a value associated with the data field has a number of integer characters that exceeds a specified integer number threshold. In addition, the network service can determine that the type of the data field is the literal type by determining at least one of the following: that a value associated with the data field has a number of alphanumeric characters that exceeds a specified literal number lower bound but is below a specified literal number upper bound; that the number of distinct values associated with the data field is below a specified literal distinct number threshold; that the percentage of distinct values is below a specified literal distinct percentage threshold; or that the length of the values is below a specified literal length threshold. Further, the network service can determine that the type of the data field is the text type based on determining at least one of the following: that a value associated with the data field has a number of alphanumeric characters that exceeds a specified text number threshold; that the number of distinct values exceeds a specified text distinct number threshold; that the percentage of distinct values exceeds a specified text distinct percentage threshold; or that the length of the values exceeds a specified text length threshold.
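A simplified version of this kind of type inference might look like the sketch below. The function name, the default threshold values, and the type labels are assumptions made for illustration. The sketch also simplifies the rules: it treats a field as an integer only when every value is an integer, and it requires both literal conditions to hold, which is stricter than the "at least one of" wording above.

```python
def infer_field_type(values, literal_max_distinct_ratio=0.5, literal_max_length=30):
    """Guess "int", "literal", or "text" for a field (hypothetical heuristic)."""
    strings = [str(v).strip() for v in values]
    if not strings:
        raise ValueError("at least one value is required")
    # Integer type: every value consists of digits with an optional leading minus sign.
    if all(s.lstrip("-").isdigit() for s in strings):
        return "int"
    distinct_ratio = len(set(strings)) / len(strings)
    longest = max(len(s) for s in strings)
    # Literal type: few distinct values and short values (both thresholds are assumed).
    if distinct_ratio <= literal_max_distinct_ratio and longest <= literal_max_length:
        return "literal"
    # Otherwise treat the field as free text.
    return "text"
```

For instance, a `brand` field with a handful of repeated short values would come out as a literal, while a product description with mostly unique values would come out as text.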
In some embodiments, the network service can decide to enable the option to include a data field in the search index to be generated, the decision being based at least in part on receiving a signal included in the data field, the signal indicating that the data field is to be included in the search index. The network service can also decide to enable the option to compute facet counts for the data field, the decision being based at least in part on determining that the number of values associated with the data field exceeds a specified facet lower bound and is below a specified facet upper bound. The network service can further decide to enable providing values associated with the data field in response to relevant search queries, the decision being based at least in part on receiving a signal included in the data field, the signal indicating that values associated with the data field are to be provided.
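The three decisions above can be sketched as a single function. The names `choose_search_options`, `signals`, `facet_min`, and `facet_max`, as well as the default bounds, are hypothetical. The sketch counts distinct values for the facet check, and it also enables the search option for text-type fields, which is one reading of the clauses rather than a requirement of this paragraph.

```python
def choose_search_options(field_type, values, signals=frozenset(), facet_min=2, facet_max=100):
    """Decide which search options to enable for one field (hypothetical rules)."""
    distinct = len(set(values))
    return {
        # Include in the search index when signaled, or when the field is free text.
        "search": "search" in signals or field_type == "text",
        # Compute facet counts only when the distinct-value count is within bounds.
        "facet": facet_min < distinct < facet_max,
        # Return stored values in results only when signaled.
        "result": "result" in signals,
    }


options = choose_search_options("literal", ["a", "b", "c", "a"], signals={"result"})
```

A field with a single distinct value fails the lower bound, since a facet with one bucket gives a searcher nothing to narrow by.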
In some embodiments, one or more search queries (e.g., terms in the search queries) can be utilized by the network service. For example, the network service can infer from the search queries that searchers are faceting on a particular data field. Thus, for example, the network service can determine that the data field should be of the literal type.
In some embodiments, when a searcher enters query terms and requests a search, one or more search results can be presented according to a particular ranking expression (e.g., an ordering of results) based on relevance. The present disclosure allows more complex expressions to be created that take other factors into account, such as query-independent factors (e.g., a popularity data field may be included in the data). The present disclosure can also allow a workable ranking expression to be proposed by examining the data and determining which data fields are meaningful for popularity analysis. For example, where a body-text data field type is present, its length (or, e.g., the inverse of its length) can be taken into account and can provide useful information for the ranking expression.
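One way to picture such a ranking expression is a weighted sum of query relevance and query-independent factors. The sketch below is an assumption-laden example: the function name `rank_score`, the weights `w_pop` and `w_len`, and the linear form are all hypothetical choices, not something the disclosure specifies.

```python
def rank_score(relevance, popularity=0.0, body_length=None, w_pop=0.3, w_len=0.1):
    """Combine relevance with query-independent factors (hypothetical weights)."""
    score = relevance + w_pop * popularity
    if body_length:
        # Use the inverse of the body length so shorter bodies score slightly higher.
        score += w_len * (1.0 / body_length)
    return score


# Hypothetical results: "b" is slightly less relevant but far more popular.
results = [
    {"id": "a", "relevance": 1.0, "popularity": 0.0},
    {"id": "b", "relevance": 0.9, "popularity": 1.0},
]
ranked = sorted(
    results,
    key=lambda r: rank_score(r["relevance"], r["popularity"]),
    reverse=True,
)
```

With these weights the more popular result overtakes the more relevant one, which shows how a query-independent field can change the ordering.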
In some embodiments, the data field types can also include a geographic location type, a time type, a date type, or a floating-point type.
Various embodiments consistent with the present disclosure can also make use of sample data. For example, a user/entity can first provide sample data to the network service. The network service can analyze the sample data to determine the types of the data fields and the search options. Based on the data field types and the search options for the sample data, the network service can generate an index configuration and subsequently generate a search index based on the generated index configuration.
Fig. 6 shows an example electronic user device 600 that can be used in accordance with various embodiments. Although a portable computing device (e.g., an e-book reader or tablet computer) is shown, it should be understood that any electronic device capable of receiving, determining, and/or processing input can be used in accordance with the various embodiments discussed herein, where the devices can include, for example, desktop computers, notebook computers, personal digital assistants, smart phones, video game consoles, television set-top boxes, and portable media players. In some embodiments, the computing device 600 can be an analog device, such as a device that can perform signal processing using operational amplifiers. In this example, the computing device 600 has a display screen 602 on its front side, which under normal operation displays information to a user facing the display screen (e.g., on the same side of the computing device as the display screen). The computing device in this example includes at least one camera 604 or other imaging element for capturing still or video image information over at least a field of view of the at least one camera. In some embodiments, the computing device may contain only one imaging element, and in other embodiments the computing device may contain several imaging elements. Each image capture element can be, for example, a camera, a charge-coupled device (CCD), a motion detection sensor, or an infrared sensor, among many other possibilities. If there are multiple image capture elements on the computing device, the image capture elements can be of different types. In some embodiments, at least one imaging element can include at least one wide-angle optical element, such as a fisheye lens, that enables the camera to capture images over a wide range of angles, such as 180 degrees or more. Further, each image capture element can comprise a digital still camera configured to capture subsequent frames in rapid succession, or a video camera able to capture streaming video.
The example computing device 600 also includes at least one microphone 606 or other audio capture device capable of capturing audio data, such as words or commands spoken by a user of the device. In this example, the microphone 606 is placed on the same side of the device as the display screen 602, so that the microphone will typically be better able to capture words spoken by a user of the device. In at least some embodiments, the microphone can be a directional microphone that captures sound information substantially directly from in front of the microphone and picks up only a limited amount of sound from other directions. It should be understood that, in different embodiments, a microphone can be positioned on any appropriate surface of any region, face, or edge of the device, and that multiple microphones can be used for audio recording and filtering purposes, among others.
The example computing device 600 also includes at least one orientation sensor 608, such as a position and/or movement determination element. Such a sensor can include, for example, an accelerometer or gyroscope operable to detect an orientation and/or change in orientation of the computing device, as well as small movements of the device. An orientation sensor can also include an electronic or digital compass, which can indicate a direction (e.g., north or south) in which the device is determined to be pointing (e.g., with respect to a primary axis or other such aspect). An orientation sensor can also include or comprise a global positioning system (GPS) or similar positioning element operable to determine relative coordinates for a position of the computing device, as well as information about relatively large movements of the device. Various embodiments can include one or more such elements in any appropriate combination. As should be understood, the algorithms or mechanisms used for determining relative position, orientation, and/or movement can depend at least in part upon the selection of elements available to the device.
Fig. 7 shows a logical arrangement of a set of general components of an example computing device 700, such as the device 600 described with respect to Fig. 6. In this example, the device includes a processor 702 for executing instructions that can be stored in a memory device or element 704. As would be apparent to one of ordinary skill in the art, the device can include many types of memory, data storage, or non-transitory computer-readable storage media, such as a first data storage for program instructions for execution by the processor 702, a separate storage for images or data, a removable memory for sharing information with other devices, etc. The device typically will include some type of display element 706, such as a touch screen or liquid crystal display (LCD), although devices such as portable media players might convey information via other means, such as through audio speakers. As discussed, the device in many embodiments will include at least one image capture element 708, such as a camera or infrared sensor, that is able to image projected images or other objects in the vicinity of the device. Methods for capturing images or video using a camera element with a computing device are well known in the art and will not be discussed herein in detail. It should be understood that image capture can be performed using a single image, multiple images, periodic imaging, continuous image capturing, image streaming, etc. Further, a device can include the ability to start and/or stop image capture, such as when receiving a command from a user, application, or other device. The example device similarly includes at least one audio capture component 712, such as a mono or stereo microphone or microphone array, operable to capture audio information from at least one primary direction. A microphone can be a uni-directional or omni-directional microphone as known for such devices.
In some embodiments, the computing device 700 of Fig. 7 may include one or more communication device (not shown), such as Wi-Fi, bluetooth, RF, wired or wireless communication system.Device in many embodiments can with network such as Internet traffic, and And it can be communicated with other such devices.In some embodiments, device may include can be received from user it is conventional defeated At least one the additional input device entered.This conventional input is for example including button, touch tablet, touch screen, steering wheel, manipulation Bar, keyboard, mouse, keypad or user can be by any other such devices or element being used to device input order. However, in some embodiments, this device may not include any button, thereby increases and it is possible to only can be via vision and audio The combination of order controls so that user can without being contacted with device control device.
Device 700 may also include at least one orientation or motion sensor 710.As discussed, sensors with auxiliary electrode can wrap The accelerometer or gyroscope or electronics or digital compass for being operable to detection orientation and/or changing in orientation of are included, it is described to indicate Device be determined towards direction.Mechanism can also (or alternatively) include or including global positioning system (GPS) or can operate Come determine computing device position relative coordinate and device relatively large movement information similar setting element.Device It may include other elements, such as can allow position finding by triangulation or another such method.These mechanism can be with processor 702 communications, and then any of many action described herein or proposed can be performed in device.
As example, device of the computing device as described in relative to Fig. 6 can capture and/or track whithin a period of time user Various information.This information may include any suitable information, such as position, action (for example, send message or creation file), User behavior is (for example, how long user executes time quantum, the user's browser interface of a subtask, user effort in task Mode etc.), customer parameter (for example, user likes how receiving information), the request opening application program, being submitted, connect The calling etc. of receipts.As explained above, the information can by link information or other associations it is such in a manner of store, in turn Any suitable dimension or dimension group access information can be used in user.
As discussed, distinct methods can be realized in various environment according to the embodiment.For example, Fig. 8 is shown For realizing according to the embodiment of the environment 800 of the various aspects of each embodiment.As will be appreciated, although making for explanatory purposes With network-based environment, but each embodiment can be optionally realized using varying environment.System includes E-customer End device 802, the electronic client device may include being operable to send and receive request, message on suitable network 804 Or information and any appropriate means that information is sent back to device users.The example of such client terminal device includes individual calculus Machine, mobile phone, hand-held messaging device, laptop computer, set-top box, personal digital assistant, E-book reader etc..Net Network may include any suitable network comprising intranet, internet, Cellular Networks, local area network or any other such network or on State the combination of network.Component used in such system can be at least partially dependent on the type of selected network and/or environment.For It is well-known via the agreement and component of such network communication, thus is no longer discussed in detail herein.Communication on network can To be realized via wired or wireless connection and combinations thereof.In this embodiment, network includes internet, because environment includes It, can for receiving request and the in response to the request Web server 806 of service content, however for other networks Using the replacement device for servicing similar purpose, as the skilled person will be apparent.
Shown environment includes at least one apps server 808 and data storage 810.It should be appreciated that can With in the presence of can be chained up or in other ways come configure several apps servers, layer or other elements, process or Component, these apps servers, layer or other elements, process or component can interact to execute and such as store from suitable data The task of device acquisition data.As it is used herein, term " data storage ", which refers to, can store, access and acquisition data Any device or device combination, described device may include data server, database, data storage device and data storage medium Any combination and any number.Apps server may include any suitable hardware and software, the hardware and software view It executes the needs of the aspect of the one or more application program of client terminal device and data storage integrates and handle applications Most of data access and service logic.Apps server provides the access control service to cooperate with data storage, And such as text that will be transmitted to user, picture, audio and/or video content can be generated, in this embodiment institute Stating content can be provided services to the user in the form of the suitable structured language of HTML, XML or another by Web server.It is all Content delivery between request and the disposition and client terminal device 802 and apps server 808 of response can be taken by network Device 806 be engaged in dispose.It should be appreciated that network server and apps server are not necessary, and only exemplary group Part, because structured code discussed herein can be on any suitable device or host as noted elsewhere herein It executes.
Data storage 810 may include several independent tables of data, database or other data storage mechanisms and medium, use To store data relevant to particular aspects.For example, shown data storage includes generating data 812 for storing With the mechanism of user information 816, the mechanism can be used for the content at service creation end.Data storage is also shown as including being used for The mechanism of storage record or session data 814.It should be understood that, it is understood that there may be many being stored in data storage may be needed Other aspects, such as page image information and access right information, the aspect can be optionally stored in mechanism listed above In any mechanism or in the additional mechanism that is stored in data storage 810.Data storage 810 can be by associated with it Logic operates, and instructs to receive from apps server 808, and obtains data in response to described instruction, updates Data handle data in other ways.In an example, user can submit searching request for certain type of element. In this situation, data storage may access user information to verify the identity of user, and may have access to catalog detail information To obtain the information of the element in relation to the type.Then information can such as be returned to use in the form of the results list on webpage Family, user can check the list via the browser on user apparatus 802.It can be in the private pages or window of browser In check the information of interested particular element.
Each server is typically included operating system, and the operating system provides the general management for being used for the server With the executable program instructions of operation, and each server is typically included the computer-readable medium of store instruction, the finger Enabling can make server execute its expectation function in the processor execution by server.Operating system be suitable for realizing mode and The general utility functions of server are well-known or commercially available, and are easy to be realized by those of ordinary skill in the art, especially It is to be realized according to disclosure herein.
In one embodiment, environment is distributed computing environment, and the environment utilizes via communication link, uses one A or multiple computer networks are directly connected to several computer systems and component to interconnect.However, ordinary skill Personnel should be understood that this system can be same smooth in the system with component more less or more than component illustrated in fig. 8 Ground operation.Therefore, the description of the system 800 in Fig. 8 substantially should be regarded as illustrative, and not limit the scope of the present disclosure.
As discussed above, each embodiment can be implemented in broad range of operating environment, the environment is one May include under a little situations one or more subscriber computers, computing device or can be used for operating in multiple application programs any one The processing unit of a application program.User or client terminal device may include any universal personal in multiple general purpose personal computers Computer such as the desktop computer or notebook computer of operation standard operating system, and runs mobile software and can Support cellular devices, wireless device and the hand-held device of multiple network connections and message-sending protocol.This system may also include multiple Work station, the work station run various commercially available operating systems and for a specific purpose (as develop and data base administration) Any application program in other known application program.These devices may also include other electronic devices, such as virtual terminal, slim Client, game system and can be via other devices of network communication.
Various aspects can also be implemented as part of at least one service or Web service, such as may be part of a service-oriented architecture. Services such as Web services can communicate using any appropriate type of messaging, such as by using messages in extensible markup language (XML) format that are exchanged using an appropriate protocol such as SOAP (derived from the "Simple Object Access Protocol"). Processes provided or executed by such services can be written in any appropriate language, such as the Web Services Description Language (WSDL). Using a language such as WSDL allows for functionality such as the automated generation of client-side code in various SOAP frameworks.
Most of embodiments are the various markets of known use to those skilled in the art using at least one The network of either one or two of upper commercially available agreement for supporting communication, the agreement such as TCP/IP, OSI, FTP, UPnP, NFS, CIFS and AppleTalk.For example, network can be local area network, wide area network, Virtual Private Network, internet, intranet, external connection Any combination of net, public switch telephone network, infrared network, wireless network and above-mentioned network.
In embodiments utilizing a Web server, the Web server can run any of a variety of server or mid-tier applications, including HTTP servers, FTP servers, CGI servers, data servers, Java servers, and business application servers. The server(s) can also execute programs or scripts in response to requests from user devices, such as by executing one or more Web applications that can be implemented as one or more scripts or programs written in any programming language (such as C, C#, or C++) or any scripting language (such as Perl, Python, or TCL), as well as combinations thereof. The server(s) can also include database servers, including without limitation commercially available database servers.
Environment may include various data storages as discussed above and other memories and storage medium.These are situated between Matter can reside in various positions, as one or more computers local (and/or residing in one or more computers) Any one of computer on storage medium, or on separate network or all computers.In specific group embodiment, letter Breath may reside in storage area network familiar to those skilled in the art (" SAN ").Similarly, can it is optionally local and/ Or long-range storage is for executing any necessary file for the function of belonging to computer.In the feelings that system includes the device of computer Under condition, every kind of such device may include the hardware element that can be electrically coupled via bus, and the element includes, for example, at least One central processing unit (CPU), at least one input unit (for example, mouse, keyboard, controller, touch screen or keypad) and At least one output device (for example, display equipment, printer or loudspeaker).Such system may also include one or more and deposit Storage device, such as disc driver, optical storage and solid-state storage device, such as random access memory (" RAM ") or read-only Memory (" ROM ") and removable medium device, memory card, flash card etc..
Such device may also include computer-readable storage media reader, communication device (such as modem, network Block (wirelessly or non-wirelessly), infrared communications set etc.) and working storage, as discussed above.Computer-readable storage medium Matter reader can connect or be configured to receive computer readable storage medium with computer readable storage medium, to indicate remote Journey, local, fixation and/or mobile storage means and for interim and/or more permanently contain, store, transmit and capture The storage medium of computer-readable information.System and various devices generally also include multiple software applications, module, service or Other elements at least one working storage device, including operating system and application program, such as client application journey Sequence or Web browser.It is to be appreciated that alternate embodiment can have numerous variations compared to embodiment as described above. For example, it is possible to use custom hardware and/or particular element may be implemented in hardware, software (including portable software, it is such as small Program) or hardware and software in.Furthermore, it is possible to using the connection with such as other computing devices of network inputs/output device.
Storage medium and computer-readable medium containing code or partial code may include known in the art or used Any suitable medium, including storage medium and communication media, such as (but not limited to) by storing and/or transmitting information (as based on Calculation machine readable instruction, data structure, program module or other data) any method or technique in the volatibility implemented and non- Volatibility, removable and irremovable medium, including RAM, ROM, EEPROM, flash memory or other memory technologies, CD-ROM, number Word general optic disc (DVD) or other optical memory, magnetic holder, tape, magnetic disk storage or other magnetic storage devices, or it is available In any other medium for storing wanted information and being accessed for system and device.Based on disclosure provided in this article and religious doctrine, originally Technical field those of ordinary skill will be appreciated that the other way and/or method for realizing each embodiment.
Therefore, the description and the appended drawings should be understood in descriptive sense rather than in restrictive sense.However, it will be evident that : in the case where not departing from the broader spirit and scope of the invention as illustrated in the claims, it can be done Various modifications and change out.
The various embodiments of the disclosure can be described in view of the following clauses:
A1. A computer-implemented method for index configuration for searchable data in a networked environment, comprising:
receiving data to be indexed, the data including a plurality of data fields;
determining a name associated with each data field in the plurality of data fields;
determining, from a plurality of field types, a field type associated with each data field in the plurality of data fields, the plurality of field types including at least one of an integer type, a literal type, or a text type;
determining whether to enable one or more search options for each of the data fields, the one or more search options including at least one of: an option to include the respective data field in a search index to be generated; an option to compute facet counts for the respective data field; or an option to provide one or more values associated with the respective data field;
generating a search index configuration for the data based at least in part on the field type of each data field included in the data and the determination of whether to enable the one or more search options; and
generating the search index for the data based at least in part on the search index configuration for the data.
A2. The computer-implemented method of clause A1, wherein the data is in a first format, further comprising:
converting the data from the first format into a second format, the second format being compatible with the search index; and
storing the data converted into the second format in one or more storage allocations.
A3. The computer-implemented method of clause A2, wherein converting the data from the first format into the second format comprises:
comparing the first format with the second format; and
modifying at least one data field associated with the first format to correspond to at least one data field associated with the second format.
A4. The computer-implemented method of clause A2, wherein the second format is a search data format (SDF).
A5. A computer-implemented method, comprising:
receiving data to be indexed;
determining a type of a data field associated with the data, the type of the data field being determined from a plurality of data field types;
determining one or more search options to be enabled for the data field associated with the data;
generating an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
generating a search index for the data based at least in part on the index configuration for the data.
A6. The computer-implemented method of clause A5, wherein the data is in a first format, further comprising:
converting the data from the first format into a second format, the second format being compatible with the search index; and
storing the data converted into the second format in one or more storage allocations.
A7. The computer-implemented method of clause A6, wherein converting the data from the first format into the second format comprises:
comparing the first format with the second format; and
modifying at least one data tag associated with the first format to correspond to at least one data tag associated with the second format.
A8. The computer-implemented method of clause A5, wherein the plurality of data field types includes at least one of an integer type, a text type, a literal type, a geographic location type, a time type, a date type, or a floating-point type.
A9. the computer implemented method as described in clause A8, wherein determining that the type of the data field includes:
Determine that value associated with the data field has the integer character quantity for being higher than specified integer amount threshold value; And
The type for determining the data field is the integer type.
A10. the computer implemented method as described in clause A8, wherein determining that the type of the data field includes:
Determine at least one of the following: value associated with the data field, which has, is higher than specified amount of text threshold The alphanumeric character quantity of value;Different value number associated with the data field is higher than specified text different number threshold Value;The high percentage of different value is in specified text different weight percentage threshold value;Or the length of value is higher than specified text size threshold Value;And
The type for determining the data field is the text type.
A11. The computer-implemented method of clause A8, wherein determining the type of the data field comprises:
determining at least one of the following: a value associated with the data field has a number of alphanumeric characters above a specified text quantity lower limit but below a specified text quantity upper limit; a number of distinct values associated with the data field is below a specified text distinct-number threshold; a percentage of distinct values is below a specified text distinct-percentage threshold; or a length of the value is below a specified literal length threshold; and
determining that the type of the data field is the literal type.
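The type determinations of clauses A9 to A11 could, for example, be sketched as below. This is only one possible reading: the threshold values, the names `TypeThresholds` and `infer_field_type`, and the choice to test only value length and distinct-value count are assumptions, not details taken from the clauses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TypeThresholds:
    # All values are illustrative assumptions, not figures from the clauses.
    min_integer_digits: int = 1   # digits a value needs to count as an integer
    text_length: int = 64         # longer values suggest free text
    text_distinct: int = 1000     # more distinct values suggest free text


def infer_field_type(values, thresholds=TypeThresholds()):
    """Guess 'integer', 'text', or 'literal' for one data field."""
    values = [str(v) for v in values]
    if not values:
        return "literal"  # assumption: empty fields default to literal
    if all(v.isdigit() and len(v) >= thresholds.min_integer_digits for v in values):
        return "integer"
    longest = max(len(v) for v in values)
    distinct = len(set(values))
    if longest > thresholds.text_length or distinct > thresholds.text_distinct:
        return "text"
    return "literal"
```

Short, frequently repeated values such as color names fall through to the literal type, while long or highly varied values are treated as text.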
A12. The computer-implemented method of clause A5, wherein the one or more search options include at least one of: an option to include the data field in the search index to be generated; an option to calculate a facet count for the data field; or an option to provide a value associated with the data field in response to a relevant search query.
A13. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises deciding to allow the option to include the data field in the search index to be generated, the decision being based at least in part on at least one of receiving a signal, the signal indicating that the data field is to be included in the search index, or determining that the type of the data field is the text type.
A14. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises deciding to allow the option to calculate the facet count for the data field, the decision being based at least in part on determining that a number of values associated with the data field is below a specified facet count upper limit.
A15. The computer-implemented method of clause A12, wherein determining the one or more search options to be allowed comprises deciding to allow the option to provide the value associated with the data field in response to the relevant search query, the decision being based at least in part on at least one of receiving a signal, the signal indicating that the value associated with the data field is to be provided, or determining that a length of the value associated with the data field is below a specified return-value length threshold.
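The option decisions of clauses A13 to A15 might be sketched as follows. The option names ("search", "facet", "result"), the `requested` argument standing in for the received signal, and both default limits are hypothetical choices made for illustration.

```python
def allowed_search_options(field_type, values, requested=frozenset(),
                           facet_max_values=100, max_return_length=256):
    """Decide which search options to allow for one data field.

    requested models an explicit signal asking for an option; the two
    numeric limits are illustrative assumptions.
    """
    values = [str(v) for v in values]
    options = set()
    # A13: index the field if asked to, or if it holds free text.
    if "search" in requested or field_type == "text":
        options.add("search")
    # A14: facet counts only make sense for a bounded number of values.
    if len(set(values)) < facet_max_values:
        options.add("facet")
    # A15: return the value if asked to, or if every value is short enough.
    if "result" in requested or all(len(v) < max_return_length for v in values):
        options.add("result")
    return options
```

An explicit request overrides the length check for the "result" option but, in this sketch, nothing overrides the facet limit, which mirrors the way clause A14 names only the value count as its basis.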
A16. The computer-implemented method of clause A5, further comprising:
providing at least one of the data, the index configuration, or the search index so as to be searchable by one or more search queries.
A17. The computer-implemented method of clause A5, further comprising:
modifying the index configuration based at least in part on one or more user-initiated inputs.
A18. A system comprising:
at least one communication transceiver;
one or more storage allocations;
at least one processor; and
a memory device including instructions that, when executed by the at least one processor, cause the system to:
receive, via the at least one communication transceiver, data to be indexed;
determine a type of a data field associated with the data, the type of the data field being determined from a plurality of data field types;
determine one or more search options to be allowed for the data field associated with the data;
generate an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
generate a search index for the data based at least in part on the index configuration for the data.
A19. The system of clause A18, wherein the data is in a first format, and wherein the instructions further cause the system to:
convert the data from the first format into a second format, the second format being compatible with the search index; and
store the data converted into the second format in the one or more storage allocations.
A20. The system of clause A19, wherein the instructions cause the system to convert the data from the first format into the second format based on: comparing the first format with the second format; and modifying at least one data field associated with the first format to correspond to at least one data field associated with the second format.
A21. A non-transitory computer-readable storage medium including instructions for identifying elements, the instructions, when executed by a processor of a computing system, causing the computing system to:
receive data to be indexed;
determine a type of a data field associated with the data, the type of the data field being determined from a plurality of data field types;
determine one or more search options to be allowed for the data field associated with the data;
generate an index configuration for the data based at least in part on the type of the data field and the one or more search options; and
generate a search index for the data based at least in part on the index configuration for the data.
A22. The non-transitory computer-readable storage medium of clause A21, wherein the plurality of data field types includes at least one of an integer type, a text type, a literal type, a geographic location type, a time type, a data type, or a float type.
A23. The non-transitory computer-readable storage medium of clause A22, wherein the instructions cause the computing system to determine that the type of the data field is the literal type based on determining at least one of the following: a value associated with the data field has a number of alphanumeric characters above a specified text quantity lower limit but below a specified text quantity upper limit; a number of distinct values associated with the data field is below a specified text distinct-number threshold; a percentage of distinct values is below a specified text distinct-percentage threshold; or a length of the value is below a specified literal length threshold.
A24. The non-transitory computer-readable storage medium of clause A21, wherein the one or more search options include at least one of: an option to include the data field in the search index to be generated; an option to calculate a facet count for the data field; or an option to provide a value associated with the data field in response to a relevant search query.
A25. The non-transitory computer-readable storage medium of clause A24, wherein determining the one or more search options to be allowed comprises deciding to allow the option to calculate the facet count for the data field, the decision being based at least in part on determining that a number of values associated with the data field is above a specified facet count lower limit and below a specified facet count upper limit.
B1. A computer-implemented method for dynamic search partitioning, comprising:
monitoring at least one of a quantity of data stored or a rate at which data is operated on in a first partition provided by a network service, the first partition being included in a storage allocation provided by the network service;
detecting that the at least one of the quantity or the rate exceeds a specified quantity threshold or a specified rate threshold, respectively;
in response to the detecting, performing at least one of increasing a size of the first partition or adding at least a second partition to the storage allocation, the at least one of the increasing or the adding being based at least in part on the quantity of data stored or the rate at which data is operated on;
during the at least one of the increasing or the adding, directing network traffic associated with the storage allocation to a cache provided by the network service; and
when the at least one of the increasing or the adding is complete, directing the network traffic to the storage allocation.
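The traffic handling in the last two steps of clause B1 could be sketched, under stated assumptions, as a context manager that points traffic at a cache for the duration of a resize. The `Router` class, its `target` attribute, and `run_resize` are hypothetical names; a real deployment would involve a load balancer rather than an in-process object.

```python
from contextlib import contextmanager


class Router:
    """Toy stand-in for whatever component directs network traffic."""

    def __init__(self):
        self.target = "storage"

    @contextmanager
    def redirect_to_cache(self):
        # Serve traffic from the cache while the partition change is under way.
        self.target = "cache"
        try:
            yield self
        finally:
            # Restore normal routing once the change completes or fails.
            self.target = "storage"


def run_resize(router, resize_operation):
    """Run a partition change with traffic redirected for its duration."""
    with router.redirect_to_cache():
        return resize_operation()
```

Restoring the route in a `finally` block is a design choice of this sketch: traffic returns to the storage allocation even if the resize raises an error.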
B2. The computer-implemented method of clause B1, further comprising:
monitoring a search index for the storage allocation;
detecting that a size of the search index exceeds a specified index size threshold; and
updating the search index for the storage allocation to reflect the at least one of the increasing or the adding with respect to the storage allocation.
B3. The computer-implemented method of clause B1, wherein the increasing of the size of the first partition is performed if the size of the first partition is below a maximum partition size threshold, and wherein the adding of at least the second partition is performed if the size of the first partition is at the maximum partition size threshold.
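The choice described in clause B3 could be written, purely as an illustration, as a small planning function. The name `plan_growth`, the `step` argument, and the tuple it returns are assumptions of this sketch.

```python
def plan_growth(partition_size, max_partition_size, step):
    """Choose between growing the first partition and adding another one.

    Sizes are in arbitrary units; step is a hypothetical growth increment.
    """
    if partition_size < max_partition_size:
        # Below the maximum: grow in place, never past the maximum.
        return ("grow", min(partition_size + step, max_partition_size))
    # Already at the maximum partition size: scale out instead.
    return ("add_partition", partition_size)
```

For instance, a partition of size 6 with a maximum of 8 and a step of 4 is grown to 8, and the next request for growth adds a second partition.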
B4. A computer-implemented method comprising:
monitoring data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions including at least one partition;
determining whether the data usage in the at least one partition included in the storage allocation exceeds a specified threshold;
modifying at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
directing network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modification of the at least one of the size or the number; and
when the modification is complete, directing the network traffic to the portion of the storage allocation associated with the modification.
B5. The computer-implemented method of clause B4, further comprising:
detecting that a size of a search index for the storage allocation exceeds a specified index size threshold; and
updating the search index for the storage allocation based on the modification of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B6. The computer-implemented method of clause B5, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modification of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B7. The computer-implemented method of clause B4, wherein the data usage includes at least one of a quantity of data stored in the storage allocation or a rate at which data is operated on in the storage allocation.
B8. The computer-implemented method of clause B7, wherein the specified threshold includes at least one of a specified quantity threshold or a specified rate threshold, and wherein the data usage exceeds the specified threshold when at least one of the following occurs: the quantity of data stored exceeds the specified quantity threshold, or the rate at which data is operated on exceeds the specified rate threshold.
B9. The computer-implemented method of clause B8, wherein the specified threshold is calculated based at least in part on information about historical data usage.
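Clauses B8 and B9 do not state how a threshold is derived from historical data usage. One possible approach, shown here strictly as an assumption, is to set the threshold a fixed number of standard deviations above the historical mean; the function names and the factor `k` are hypothetical.

```python
import statistics


def threshold_from_history(samples, k=2.0):
    """Derive a threshold as mean + k population standard deviations.

    This formula is an illustrative assumption, not the method of the clauses.
    """
    if not samples:
        raise ValueError("at least one historical sample is required")
    return statistics.mean(samples) + k * statistics.pstdev(samples)


def usage_exceeds(quantity, rate, quantity_threshold, rate_threshold):
    """B8: usage exceeds the threshold if either measure exceeds its limit."""
    return quantity > quantity_threshold or rate > rate_threshold
```

A threshold computed this way rises with both the typical load and its variability, so a workload that is routinely bursty would not trigger a partition change on every burst.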
B10. The computer-implemented method of clause B4, further comprising:
determining that an amount of network traffic directed to the storage allocation is above a specified traffic threshold; and
modifying the storage allocation based on the amount of network traffic.
B11. The computer-implemented method of clause B10, wherein the network traffic includes search query traffic for searching data stored in the storage allocation.
B12. The computer-implemented method of clause B10, wherein modifying the storage allocation based on the amount of network traffic includes at least one of: modifying the size of the at least one partition; modifying the number of partitions; or replacing at least one partition included in the number of partitions with at least one partition having a different size.
B13. The computer-implemented method of clause B12, wherein the different size includes at least one of a different CPU power, a different RAM capacity, a different hard disk space capacity, or a different bandwidth capacity.
B14. The computer-implemented method of clause B4, wherein modifying the at least one of the size of the at least one partition or the number of partitions includes increasing at least one of the size of the at least one partition or the number of partitions, wherein the increase in the size of the at least one partition is performed if the size of the at least one partition is below a maximum partition size threshold, and wherein the increase in the number of partitions is performed if the size of the at least one partition is at the maximum partition size threshold.
B15. The computer-implemented method of clause B4, wherein modifying the at least one of the size of the at least one partition or the number of partitions includes decreasing at least one of the size of the at least one partition or the number of partitions, wherein the decrease in the number of partitions is performed if the number of partitions is greater than one partition, and wherein the decrease in the size of the at least one partition is performed if the number of partitions is one partition.
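The scale-down order of clause B15 could be sketched as below. The name `plan_shrink`, the `step` decrement, and the `min_partition_size` floor are assumptions added for this example; the clause itself names no minimum size.

```python
def plan_shrink(partition_count, partition_size, min_partition_size, step):
    """Scale down: drop a partition first, shrink only the last remaining one.

    min_partition_size is a hypothetical floor that keeps the sketch from
    producing a non-positive size.
    """
    if partition_count > 1:
        return ("remove_partition", partition_count - 1, partition_size)
    # Exactly one partition is left, so reduce its size instead.
    new_size = max(partition_size - step, min_partition_size)
    return ("shrink", 1, new_size)
```

This mirrors the scale-up order in reverse: capacity added last, as extra partitions, is released first.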
B16. The computer-implemented method of clause B4, further comprising:
determining CPU usage of the storage allocation, wherein modifying the at least one of the size or the number is based on at least one of the determined data usage in the storage allocation or the CPU usage of the storage allocation.
B17. The computer-implemented method of clause B4, further comprising:
modifying a configuration of the storage allocation based on at least one of a configuration associated with the data usage or a user-initiated input.
B18. The computer-implemented method of clause B4, further comprising:
determining when to perform the modification of the at least one of the size or the number based on resources available to the storage allocation.
B19. A system comprising:
a storage allocation, the storage allocation having a number of partitions including at least one partition;
at least one processor; and
a memory device including instructions that, when executed by the at least one processor, cause the system to:
monitor data usage in the storage allocation;
determine whether the data usage in the at least one partition included in the storage allocation exceeds a specified threshold;
modify at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
direct network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modification of the at least one of the size or the number; and
when the modification is complete, direct the network traffic to the portion of the storage allocation associated with the modification.
B20. The system of clause B19, further comprising:
at least one load balancer configured to facilitate directing the network traffic away from the portion of the storage allocation during the modification of the at least one of the size or the number, and to facilitate directing the network traffic to the portion of the storage allocation when the modification of the at least one of the size or the number is complete.
B21. The system of clause B20, wherein the at least one load balancer is configured to direct the network traffic across the number of partitions included in the storage allocation.
B22. The system of clause B19, further comprising:
at least one monitor module configured to facilitate monitoring the data usage in the storage allocation, and to facilitate determining whether the data usage in the at least one partition included in the storage allocation exceeds the specified threshold.
B23. A non-transitory computer-readable storage medium including instructions for identifying elements, the instructions, when executed by a processor of a computing system, causing the computing system to:
monitor data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions including at least one partition;
determine whether the data usage in the at least one partition included in the storage allocation exceeds a specified threshold;
modify at least one of a size of the at least one partition or the number of partitions included in the storage allocation;
direct network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modification of the at least one of the size or the number; and
when the modification is complete, direct the network traffic to the portion of the storage allocation associated with the modification.
B24. The non-transitory computer-readable storage medium of clause B23, wherein the instructions further cause the computing system to detect that a size of a search index for the storage allocation exceeds a specified index size threshold, and to update the search index for the storage allocation based on the modification of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.
B25. The non-transitory computer-readable storage medium of clause B24, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modification of the at least one of the size of the at least one partition or the number of partitions included in the storage allocation.

Claims (15)

1. A computer-implemented method comprising:
monitoring data usage in a storage allocation in a networked environment, the storage allocation having a number of partitions including at least one partition;
determining whether the data usage in the at least one partition included in the storage allocation exceeds a specified threshold;
modifying a size of the at least one partition;
directing network traffic associated with the storage allocation away from a portion of the storage allocation associated with the modification of the size; and
when the modification is complete, directing the network traffic to the portion of the storage allocation associated with the modification.
2. The computer-implemented method of claim 1, further comprising:
detecting that a size of a search index for the storage allocation exceeds a specified index size threshold; and
updating the search index for the storage allocation based on the modification of the size of the at least one partition.
3. The computer-implemented method of claim 2, wherein updating the search index comprises rebuilding the search index for the storage allocation to reflect the modification of the size of the at least one partition.
4. The computer-implemented method of claim 1, wherein the data usage includes at least one of a quantity of data stored in the storage allocation or a rate at which data is operated on in the storage allocation.
5. The computer-implemented method of claim 4, wherein the specified threshold includes at least one of a specified quantity threshold or a specified rate threshold, and wherein the data usage exceeds the specified threshold when at least one of the following occurs: the quantity of data stored exceeds the specified quantity threshold, or the rate at which data is operated on exceeds the specified rate threshold.
6. The computer-implemented method of claim 5, wherein the specified threshold is calculated based at least in part on information about historical data usage.
7. The computer-implemented method of claim 1, further comprising:
determining that an amount of network traffic directed to the storage allocation is above a specified traffic threshold; and
modifying the storage allocation based on the amount of network traffic.
8. The computer-implemented method of claim 7, wherein the network traffic includes search query traffic for searching data stored in the storage allocation.
9. The computer-implemented method of claim 7, wherein modifying the storage allocation based on the amount of network traffic includes at least one of: modifying the size of the at least one partition; modifying the number of partitions; or replacing at least one partition included in the number of partitions with at least one partition having a different size.
10. The computer-implemented method of claim 9, wherein the different size includes at least one of a different CPU power, a different RAM capacity, a different hard disk space capacity, or a different bandwidth capacity.
11. The computer-implemented method of claim 1, wherein modifying the size of the at least one partition includes increasing the size of the at least one partition if the size of the at least one partition is below a maximum partition size threshold.
12. The computer-implemented method of claim 1, wherein modifying the size of the at least one partition includes decreasing the size of the at least one partition, and wherein the decrease in the size of the at least one partition is performed if the number of partitions is one partition.
13. The computer-implemented method of claim 1, further comprising:
determining CPU usage of the storage allocation, wherein modifying the size is based on at least one of the data usage in the storage allocation or the determined CPU usage of the storage allocation.
14. The computer-implemented method of claim 1, further comprising:
modifying a configuration of the storage allocation based on at least one of a configuration associated with the data usage or a user-initiated input.
15. The computer-implemented method of claim 1, further comprising:
determining when to perform the modification of the size based on resources available to the storage allocation.
CN201380053433.7A 2012-10-12 2013-10-12 Index configuration for searchable data in network Active CN104823169B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811424497.4A CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system and the medium of the index configurations that can search for data in network

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US13/650,931 2012-10-12
US13/650,931 US9507750B2 (en) 2012-10-12 2012-10-12 Dynamic search partitioning
US13/650,961 2012-10-12
US13/650,961 US9047326B2 (en) 2012-10-12 2012-10-12 Index configuration for searchable data in network
PCT/US2013/064731 WO2014059394A1 (en) 2012-10-12 2013-10-12 Index configuration for searchable data in network

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201811424497.4A Division CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system and the medium of the index configurations that can search for data in network

Publications (2)

Publication Number Publication Date
CN104823169A CN104823169A (en) 2015-08-05
CN104823169B true CN104823169B (en) 2018-12-21

Family

ID=50477970

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201811424497.4A Pending CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system and the medium of the index configurations that can search for data in network
CN201380053433.7A Active CN104823169B (en) 2012-10-12 2013-10-12 For the index configurations that can search for data in network

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201811424497.4A Pending CN110096502A (en) 2012-10-12 2013-10-12 Implementation method, system and the medium of the index configurations that can search for data in network

Country Status (10)

Country Link
EP (1) EP2907034A4 (en)
JP (2) JP2015532493A (en)
KR (2) KR101737246B1 (en)
CN (2) CN110096502A (en)
AU (3) AU2013328901B2 (en)
BR (1) BR112015008146A2 (en)
CA (1) CA2888116C (en)
IN (1) IN2015DN03160A (en)
SG (2) SG10201606363SA (en)
WO (1) WO2014059394A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10289603B2 (en) 2012-10-12 2019-05-14 Amazon Technologies, Inc. Dynamic search partitioning

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9047326B2 (en) 2012-10-12 2015-06-02 A9.Com, Inc. Index configuration for searchable data in network
CN106131189A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 The network platform based on LAN
CN105979016A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Local area network data service system
CN106131188A (en) * 2016-07-15 2016-11-16 柳州健科技有限公司 LAN system
CN105979014A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data system based on local area network
CN105978913A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network service system
CN105978739A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data platform based on local area network
CN105979015A (en) * 2016-07-15 2016-09-28 柳州健科技有限公司 Network data service platform based on local area network
CN106131192A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 The network system with data monitoring function based on LAN
CN106131195A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 There is the LAN system of data monitoring function
CN106131190A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 The network platform with data monitoring function based on LAN
CN106131194A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 There is the LAN platform of self-learning function
CN106060083A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service system with data monitoring function
CN106101024A (en) * 2016-07-16 2016-11-09 柳州健科技有限公司 There is the LAN data system of data monitoring function
CN106131193A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 There is the local area network services platform of self-learning function
CN106131191A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 There is the LAN data service system of data monitoring function
CN106131196A (en) * 2016-07-16 2016-11-16 柳州健科技有限公司 The network system with self-learning function based on LAN
CN106060081A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Network service platform with data monitor function
CN106060082A (en) * 2016-07-16 2016-10-26 柳州健科技有限公司 Local area network-based network service platform with data monitoring function
CN107977381B (en) * 2016-10-24 2021-08-27 华为技术有限公司 Data configuration method, index management method, related device and computing equipment
CN110019191A (en) * 2017-09-21 2019-07-16 阿里巴巴集团控股有限公司 Database information processing method and processing device
CN108881147B (en) * 2017-12-29 2019-07-05 视联动力信息技术股份有限公司 A kind of data processing method and device of view networking
CN110134661A (en) * 2019-05-22 2019-08-16 东北大学 A kind of academic big data storage querying method towards facet
CN112306604B (en) * 2020-08-21 2022-09-23 海信视像科技股份有限公司 Progress display method and display device for file transmission
US11658917B2 (en) * 2021-04-09 2023-05-23 Tekion Corp Selective offloading of bandwidth to enable large-scale data indexing
CN117596176B (en) * 2024-01-17 2024-04-19 苏州元脑智能科技有限公司 Network state measuring method, device, equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788233B1 (en) * 2007-07-05 2010-08-31 Amazon Technologies, Inc. Data store replication for entity based partition

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1143349A1 (en) * 2000-04-07 2001-10-10 IconParc GmbH Method and apparatus for generating index data for search engines
US7716168B2 (en) * 2005-06-29 2010-05-11 Microsoft Corporation Modifying table definitions within a database application
US8341345B2 (en) * 2005-08-08 2012-12-25 International Business Machines Corporation System and method for providing content based anticipative storage management
US7668825B2 (en) * 2005-08-26 2010-02-23 Convera Corporation Search system and method
JP4772569B2 (en) * 2006-04-07 2011-09-14 株式会社日立製作所 System and method for performing directory unit migration in a common namespace
US8214345B2 (en) * 2006-10-05 2012-07-03 International Business Machines Corporation Custom constraints for faceted exploration
CN101523391A (en) * 2006-10-06 2009-09-02 日本电气株式会社 Information search system, information search method, and program
US7853900B2 (en) * 2007-05-21 2010-12-14 Amazon Technologies, Inc. Animations
US20100011368A1 (en) * 2008-07-09 2010-01-14 Hiroshi Arakawa Methods, systems and programs for partitioned storage resources and services in dynamically reorganized storage platforms
JP4762289B2 (en) * 2008-10-01 2011-08-31 株式会社日立製作所 A storage system that controls allocation of storage areas to virtual volumes that store specific pattern data
US9996572B2 (en) * 2008-10-24 2018-06-12 Microsoft Technology Licensing, Llc Partition management in a partitioned, scalable, and available structured storage
WO2010092576A1 (en) * 2009-02-11 2010-08-19 Xsignnet Ltd. Virtualized storage system and method of operating it
US8250026B2 (en) * 2009-03-06 2012-08-21 Peoplechart Corporation Combining medical information captured in structured and unstructured data formats for use or display in a user application, interface, or view
US20110131202A1 (en) * 2009-12-02 2011-06-02 International Business Machines Corporation Exploration of item consumption by customers
US8930332B2 (en) * 2010-03-12 2015-01-06 Salesforce.Com, Inc. Method and system for partitioning search indexes
JPWO2011118427A1 (en) 2010-03-24 2013-07-04 日本電気株式会社 Query device, query partitioning method, and query partitioning program
US8190593B1 (en) * 2010-04-14 2012-05-29 A9.Com, Inc. Dynamic request throttling
CN102959522B (en) * 2010-08-10 2016-01-13 株式会社日立制作所 The management method of computer system and management system
WO2012072879A1 (en) * 2010-11-30 2012-06-07 Nokia Corporation Method and apparatus for updating a partitioned index
WO2012085968A1 (en) * 2010-12-22 2012-06-28 Hitachi, Ltd. Storage apparatus and storage management method
US8620897B2 (en) * 2011-03-11 2013-12-31 Microsoft Corporation Indexing and searching features including using reusable index fields

Also Published As

Publication number Publication date
KR101737246B1 (en) 2017-05-17
CN110096502A (en) 2019-08-06
EP2907034A4 (en) 2016-05-18
CN104823169A (en) 2015-08-05
IN2015DN03160A (en) 2015-10-02
AU2017245374B2 (en) 2018-08-09
WO2014059394A1 (en) 2014-04-17
AU2013328901A1 (en) 2015-05-14
JP2015532493A (en) 2015-11-09
KR20150066575A (en) 2015-06-16
BR112015008146A2 (en) 2017-07-04
KR101782302B1 (en) 2017-09-26
AU2016231488B2 (en) 2017-09-21
CA2888116A1 (en) 2014-04-17
KR20170054579A (en) 2017-05-17
JP6339155B2 (en) 2018-06-06
AU2013328901B2 (en) 2016-07-28
AU2017245374A1 (en) 2018-01-18
SG10201606363SA (en) 2016-09-29
CA2888116C (en) 2018-03-27
EP2907034A1 (en) 2015-08-19
JP2017050012A (en) 2017-03-09
SG11201502828PA (en) 2015-05-28
AU2016231488A1 (en) 2016-10-06

Similar Documents

Publication Publication Date Title
CN104823169B (en) Index configuration for searchable data in network
US11595477B2 (en) Cloud storage methods and systems
US9922360B2 (en) Recommendations system
CN104704522B (en) Recommending native applications
US9372901B2 (en) Searching for software applications based on application attributes
US9411839B2 (en) Index configuration for searchable data in network
US10135940B2 (en) Subscribing to event notifications using object instances
US9223902B1 (en) Architectures for content identification
KR20160138261A (en) Infrastructure for synchronization of mobile device with mobile cloud service
KR20160119185A (en) Cloud service custom execution environment
US10848434B2 (en) Performance management for query processing
US8645554B2 (en) Method and apparatus for identifying network functions based on user data
US10075508B2 (en) Application-centric socialization
US11882154B2 (en) Template representation of security resources
US9852135B1 (en) Context-aware caching
US20160125034A1 (en) Annotate Apps with Entities by Fusing Heterogeneous Signals

Legal Events

Date Code Title Description
PB01 Publication
EXSB Decision made by SIPO to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant