CN110968775A

CN110968775A - Training method of commodity attribute generation model, generation method, search method and system

Info

Publication number: CN110968775A
Application number: CN201811159166.2A
Authority: CN
Inventors: 汤海萍; 陈海勇
Original assignee: Beijing Jingdong Century Trading Co Ltd; Beijing Jingdong Shangke Information Technology Co Ltd
Current assignee: Beijing Jingdong Century Trading Co Ltd; Beijing Jingdong Shangke Information Technology Co Ltd
Priority date: 2018-09-30
Filing date: 2018-09-30
Publication date: 2020-04-07

Abstract

The invention discloses a training method of a commodity attribute generation model, a generation method and a search method and a system, wherein the training method comprises the following steps: the method comprises the steps of obtaining commodity information of a plurality of commodities, wherein the commodity information comprises texts and pictures; labeling attribute information of the plurality of commodities, wherein the attribute information comprises at least one attribute word; constructing a plurality of groups of training data, wherein each group of training data comprises commodity information and attribute information of the same commodity; and training the commodity attribute generation model according to the multiple groups of training data, wherein the commodity attribute generation model is used for generating attribute information according to commodity information. According to the invention, a commodity attribute generation model is trained based on deep learning, and further, based on the commodity attribute generation module, correct commodity attribute information can be automatically generated according to commodity information, more specifically, a commodity title and a commodity picture, so that the defect that the commodity title is written in an irregular manner in the current website can be overcome, and the search result is optimized.

Description

Training method of commodity attribute generation model, generation method, search method and system

Technical Field

The invention relates to the technical field of internet, in particular to a training method of a commodity attribute generation model, a generation method and a search method and a system.

Background

With the continuous development of internet technology, online shopping has penetrated the aspects of people's lives. When shopping online, often, due to the irregular writing of the seller on the title of the commodity, the buyer encounters the situation that the search result does not meet the requirement when searching for the commodity by using the search term, which greatly affects the shopping experience of the buyer. For example, the search term is "september", the search result may be "fanciful dark-colored men doing gym shorts with september", and the search result actually points to the product but is a penta-pant, that is, since the seller mistakenly adds a penta-pant in the product title, the penta-pant is also displayed in the search result of the search term, which affects the shopping experience of the buyer.

Disclosure of Invention

The invention aims to overcome the defect of irregular writing of a commodity title in the prior art and provides a training method of a commodity attribute generation model and generation and search methods and systems.

The invention solves the technical problems through the following technical scheme:

a training method for a commodity attribute generation model is characterized by comprising the following steps:

the method comprises the steps of obtaining commodity information of a plurality of commodities, wherein the commodity information comprises texts and pictures;

labeling attribute information of the plurality of commodities, wherein the attribute information comprises at least one attribute word;

constructing a plurality of groups of training data, wherein each group of training data comprises commodity information and attribute information of the same commodity;

and training the commodity attribute generation model according to the multiple groups of training data, wherein the commodity attribute generation model is used for generating attribute information according to commodity information.

Preferably, the step of training the product attribute generation model according to the plurality of sets of training data includes:

the sets of training data are trained using an encoding-decoding model framework.

Preferably, the step of training the plurality of sets of training data using an encoding-decoding model framework comprises:

for each group of training data, performing word segmentation on the text to obtain a word segmentation sequence;

converting the word segmentation sequence into a word segmentation vector;

encoding the word segmentation vector to obtain an encoded vector;

extracting a feature vector of the picture;

and decoding according to the coding vector and the characteristic vector to obtain attribute information.

Preferably, before the decoding step according to the coding vector and the feature vector, the training method further comprises:

inputting the encoding vector and the feature vector into an attention model;

calculating attention distribution values of the coding vector and the feature vector at the current moment;

the decoding according to the encoding vector and the feature vector specifically includes:

decoding according to the attention distribution value at the current moment and the decoding participle obtained at the previous moment to obtain the decoding participle at the current moment;

wherein the decoding participles are attribute participles.

Preferably, the step of converting the word segmentation sequence into a word segmentation vector comprises:

converting the word segmentation sequence into a word segmentation vector by using a word2vec model;

the step of extracting the feature vector of the picture comprises the following steps:

and extracting the feature vector of the picture by using a resnet model.

Preferably, the step of converting the word segmentation sequence into a word segmentation vector by using a word2vec model includes:

pre-training the word2vec model by using external data;

the step of extracting the feature vector of the picture by using the resnet model comprises the following steps:

and pre-training the resnet model by utilizing imagenet data.

Preferably, the plurality of articles comprises:

the similarity between the text in the commodity information and the labeled attribute information is higher than a first threshold value;

and/or searching for commodities according to the search terms, and returning commodities with the click rate higher than a second threshold value in the search results.

An electronic device comprising a memory, a processor and a computer program stored on the memory and operable on the processor, wherein the processor implements any one of the above training methods for a product property generation model when executing the computer program.

A computer-readable storage medium, on which a computer program is stored, wherein the computer program, when being executed by a processor, implements the steps of any one of the above methods for training a product property generation model.

A method for generating commodity attributes is characterized by comprising the following steps:

training the commodity attribute generation model by using any one of the training methods of the commodity attribute generation model;

acquiring commodity information of a commodity, wherein the commodity information comprises texts and pictures;

inputting the commodity information into the commodity attribute generation model;

and outputting the attribute information of the commodity.

Preferably, the attribute information includes at least one attribute word, and the step of outputting the attribute information includes:

selecting a preset number of attribute participles;

and outputting the selected attribute participles with the preset number.

An electronic device comprising a memory, a processor and a computer program stored on the memory and operable on the processor, wherein the processor implements any one of the above-mentioned methods for generating an attribute of an article when executing the computer program.

A computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed by a processor, implements the steps of any of the above-described methods for generating an attribute of an article.

A method for searching for a commodity, the method comprising:

generating attribute information by using any one of the commodity attribute generation methods;

inputting a search term;

searching the attribute information according to the search word;

and outputting the search result.

An electronic device comprising a memory, a processor and a computer program stored on the memory and operable on the processor, wherein the processor implements the method for searching for the article when executing the computer program.

A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, carries out the steps of the method of searching for an article as described above.

A training system for a commodity attribute generation model, the training system comprising:

the system comprises a first acquisition module, a second acquisition module and a display module, wherein the first acquisition module is used for acquiring commodity information of a plurality of commodities, and the commodity information comprises texts and pictures;

the labeling module is used for labeling the attribute information of the commodities, and the attribute information comprises at least one attribute word;

the construction module is used for constructing a plurality of groups of training data, and each group of training data comprises commodity information and attribute information of the same commodity;

and the training module is used for training the commodity attribute generation model according to the plurality of groups of training data, and the commodity attribute generation model is used for generating attribute information according to commodity information.

Preferably, the training module is specifically configured to train the plurality of sets of training data using a coding-decoding model framework.

Preferably, the training module comprises:

the word segmentation unit is used for segmenting words of the text for each group of training data to obtain a word segmentation sequence;

the vector conversion unit is used for converting the word segmentation sequence into word segmentation vectors;

the text coding unit is used for coding the word segmentation vectors to obtain coding vectors;

the picture coding unit is used for extracting a feature vector of a picture;

and the decoding unit is used for decoding according to the coding vector and the characteristic vector to obtain the attribute information.

Preferably, the training module further comprises:

the attention unit is used for receiving the coding vector and the feature vector and calculating attention distribution values of the coding vector and the feature vector at the current moment;

the decoding unit is specifically used for decoding according to the attention distribution value at the current moment and the decoded participle obtained at the previous moment to obtain the decoded participle at the current moment;

wherein the decoding participles are attribute participles.

Preferably, the vector conversion unit converts the word segmentation sequence into a word segmentation vector by using a word2vec model;

the picture coding unit extracts a feature vector of a picture by using a resnet model.

Preferably, the word2vec model is obtained by using external data to train in advance, and the resnet model is obtained by using imagenet data to train in advance.

Preferably, the plurality of articles comprises:

A system for generating an attribute of a commodity, the system comprising:

the training system of any one of the commodity attribute generation models is used for training the commodity attribute generation model;

the second acquisition module is used for acquiring commodity information of commodities, wherein the commodity information comprises texts and pictures;

the commodity information input module is used for inputting the commodity information into the commodity attribute generation model;

and the attribute information output module is used for outputting the attribute information of the commodity.

Preferably, the attribute information includes at least one attribute word, and the attribute information output module includes:

the selecting unit is used for selecting attribute participles with preset quantity;

and the output unit is used for outputting the selected attribute participles with the preset number.

A search system for an article, the search system comprising:

the system for generating any one of the above commodity attributes, configured to generate attribute information;

the search word input module is used for inputting search words;

the searching module is used for searching the attribute information according to the searching words;

and the search result output module is used for outputting the search result.

The positive progress effects of the invention are as follows: according to the invention, a commodity attribute generation model is trained based on deep learning, and further, based on the commodity attribute generation module, correct commodity attribute information can be automatically generated according to commodity information, more specifically, a commodity title and a commodity picture, so that the defect that the commodity title is written in an irregular manner in the current website can be overcome, and the search result is optimized.

Drawings

Fig. 1 is a flowchart of a training method of a product attribute generation model according to embodiment 1 of the present invention.

Fig. 2 is a flowchart of step S4 in the training method of the product property generation model according to embodiment 1 of the present invention.

Fig. 3 is a schematic diagram of a hardware structure of an electronic device according to embodiment 2 of the present invention.

Fig. 4 is a flowchart of a method for generating a product attribute according to embodiment 4 of the present invention.

Fig. 5 is a flowchart of a commodity searching method according to embodiment 7 of the present invention.

Fig. 6 is a schematic block diagram of a training system for a product attribute generation model according to embodiment 10 of the present invention.

Fig. 7 is a module diagram of a training module in the training system of the commodity attribute generation model according to embodiment 10 of the present invention.

Fig. 8 is a schematic block diagram of a product attribute generation system according to embodiment 11 of the present invention.

Fig. 9 is a block diagram of a search system for merchandise according to embodiment 12 of the present invention.

Detailed Description

The invention is further illustrated by the following examples, which are not intended to limit the scope of the invention.

Example 1

The embodiment provides a training method of a commodity attribute generation model, and fig. 1 shows a flowchart of the embodiment. Referring to fig. 1, the training method of the present embodiment includes:

s1, acquiring commodity information of a plurality of commodities;

s2, labeling attribute information of a plurality of commodities;

s3, constructing a plurality of groups of training data;

and S4, training the commodity attribute generation model according to the multiple groups of training data.

In this embodiment, the merchandise information includes text and pictures, wherein the text may include, but is not limited to, a title of the merchandise, and the pictures may include, but is not limited to, a picture of a cover page showing the merchandise. The attribute information includes at least one attribute word that may be used to represent, but is not limited to, brand, genre, style of the good. Each set of training data includes commodity information and attribute information of the same commodity.

More specifically, in this embodiment, the multiple commodities may include a commodity in which the similarity between the text in the commodity information and the labeled attribute information is higher than a first threshold, where the first threshold may be set in a self-defined manner according to a specific application. For example, in the product displayed by the high-quality merchant in the shopping website, the high-quality merchant adds the product title without adding other attribute word segmentation by mistake, and the attribute information of the product can be obtained from the product title of the high-quality merchant.

In this embodiment, the plurality of commodities may further include commodities searched according to the search term, and commodities with click rates higher than a second threshold in the returned search results are obtained according to the click log, where the second threshold may be set in a self-defined manner according to a specific application. The search result with high click rate can be used for indicating that the search result meets the requirements of the user, namely, the search result can be indirectly indicated that other attribute participles are not mistakenly added in the commodity title of the search result.

For example, the commodity is divided into five pants, the commodity title of the divided five pants obtained in step S1 is "the five pants in fancy boy shorts man 2018 is trimmed in summer to display the thin leisure shorts man trousers, the color of the five pants is 31", and the commodity displayed by the obtained cover picture is divided into five pants. The attribute information marked in step S2 is "color of floral prince divided into five pants cartoons". The set of training data constructed in step S3 may include: the style male pants man 2018 can be used for slimming leisure pants male pants in summer, namely the fifth pants card color 31, the cover picture and the style male pants card color.

It should be understood that when the training data is constructed in step S3, it may further include cleaning the above commodity information, for example, case-unifying, converting the number words, model words, etc. into a preset format, setting the size of the pictures to a uniform size, etc.

In this way, a plurality of sets of training data are constructed, and further, in step S4, a product attribute generation model for generating attribute information from product information can be trained from the plurality of sets of constructed data, that is, accurate product attribute information can be automatically generated from a product title and a product picture based on the product attribute generation module.

In the present embodiment, the encoding-decoding model framework may be utilized to train the above-constructed multiple sets of training data, and specifically, referring to fig. 2, step S4 may include:

s41, performing word segmentation on the text to obtain a word segmentation sequence;

and S42, converting the word segmentation sequence into word segmentation vectors.

In the above step, the word segmentation sequence may be input into an embedding layer, and the word segmentation sequence may be converted into a word segmentation vector. More specifically, word sequences may be converted to word vectors using, but not limited to, the word2vec model. Due to the limitation of training data, unknown words are easy to encounter during word segmentation conversion, and furthermore, in the embodiment, the word2vec model can be trained in advance by further utilizing external data so as to optimize the processing of the unknown words.

And S43, coding the word vector to obtain a coded vector.

In this step, the above-mentioned word segmentation vectors may be sequentially input to an RNN layer (Recurrent neural network) to obtain a code vector. More specifically, in the present embodiment, the above-described word segmentation vectors may be sequentially input to 4-layer BI-LSTM (bidirectional LSTM, composed of Fwd LSTM (forward LSTM, features are input into the network from front to back in time series) and Bwd LSTM (backward LSTM, features are input into the network from back to front in time series), where LSTM (Long Short-Term Memory ) is a special RNN structure), to obtain the encoding vectors.

And S44, extracting the feature vector of the picture.

In this step, the feature vector of the picture can be extracted using, but not limited to, the resnet (depth residual network) model. In addition, because there are many resnet parameters and the training speed is slow, in this embodiment, the resnet model can be trained in advance by further using imagenet (the database with the largest image recognition in the world at present) data, so as to improve the training speed.

S45, inputting the coding vector and the feature vector into an attention model;

s46, calculating the attention distribution values of the coding vector and the feature vector at the current moment;

and S47, decoding according to the attention distribution value of the current moment and the decoded participle obtained at the previous moment to obtain the decoded participle at the current moment.

In the above steps, decoding may be performed according to the coding vector and the feature vector via, but not limited to, an LSTM layer and a softmax (generalization of logistic regression model to the multi-classification problem) model, so as to obtain the attribute information. More specifically, different attention distribution values may be automatically set for the coding vector and the feature vector based on the attention model, that is, different attentions may be automatically allocated to the text and the picture in the commodity information, so that the text and the picture in the commodity information are better utilized, and better attribute information can be generated, where the decoded participle obtained in step S48 is an attribute participle in the attribute information.

In this embodiment, a commodity attribute generation model is trained based on deep learning, and then based on the commodity attribute generation module, correct commodity attribute information can be automatically generated according to commodity information, more specifically, according to a commodity title and a commodity picture, so that the defect that the commodity title is written irregularly in a current website can be overcome.

Example 2

The present embodiment provides an electronic device, which may be represented in the form of a computing device (for example, may be a server device), and includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor may implement the training method of the product attribute generation model provided in embodiment 1 when executing the computer program.

Fig. 3 shows a schematic diagram of a hardware structure of the present embodiment, and as shown in fig. 3, the electronic device 9 specifically includes:

at least one processor 91, at least one memory 92, and a bus 93 for connecting the various system components (including the processor 91 and the memory 92), wherein:

the bus 93 includes a data bus, an address bus, and a control bus.

Memory 92 includes volatile memory, such as Random Access Memory (RAM)921 and/or cache memory 922, and can further include Read Only Memory (ROM) 923.

Memory 92 also includes a program/utility 925 having a set (at least one) of program modules 924, such program modules 924 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.

The processor 91 executes various functional applications and data processing, such as a training method of the product attribute generation model provided in embodiment 1 of the present invention, by executing the computer program stored in the memory 92.

The electronic device 9 may further communicate with one or more external devices 94 (e.g., a keyboard, a pointing device, etc.). Such communication may be through an input/output (I/O) interface 95. Also, the electronic device 9 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet) via the network adapter 96. The network adapter 96 communicates with the other modules of the electronic device 9 via the bus 93. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with the electronic device 9, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, RAID (disk array) systems, tape drives, and data backup storage systems, etc.

It should be noted that although in the above detailed description several units/modules or sub-units/modules of the electronic device are mentioned, such a division is merely exemplary and not mandatory. Indeed, the features and functionality of two or more of the units/modules described above may be embodied in one unit/module, according to embodiments of the application. Conversely, the features and functions of one unit/module described above may be further divided into embodiments by a plurality of units/modules.

Example 3

The present embodiment provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor, implements the steps of the training method of the product property generation model provided in embodiment 1.

More specific examples, among others, that the readable storage medium may employ may include, but are not limited to: a portable disk, a hard disk, random access memory, read only memory, erasable programmable read only memory, optical storage device, magnetic storage device, or any suitable combination of the foregoing.

In a possible implementation manner, the present invention can also be implemented in the form of a program product, which includes program code for causing a terminal device to execute the steps of the training method for implementing the commodity attribute generation model in embodiment 1 when the program product runs on the terminal device.

Where program code for carrying out the invention is written in any combination of one or more programming languages, the program code may be executed entirely on the user device, partly on the user device, as a stand-alone software package, partly on the user device and partly on a remote device or entirely on the remote device.

Example 4

The present embodiment provides a method for generating a commodity attribute, and fig. 4 shows a flowchart of the present embodiment. Referring to fig. 4, the generation method of the present embodiment includes:

s101, training a commodity attribute generation model;

s102, acquiring commodity information of commodities;

s103, inputting the commodity information into a commodity attribute generation model;

and S104, outputting the attribute information of the commodity.

Specifically, step S101 is to train the product attribute generation model by using the training method of the product attribute generation model provided in embodiment 1, and further, after inputting the product information acquired in step S102 into the product attribute generation model, the attribute information of the product can be output, where in this embodiment, the product information includes text and pictures.

In order to simplify the attribute information, in this embodiment, a preset number of attribute participles may be selected, and the selected preset number of attribute participles are output in step S104. Specifically, a preset number of attribute participles having the highest degree of correlation with the commodity may be selected using a beam-search algorithm.

Based on the product attribute generation model obtained by the training method in embodiment 1, this embodiment can automatically generate correct product attribute information according to the text and the picture of each product in the website, so as to overcome the defect that the title of the product is written irregularly in the current website.

Example 5

The present embodiment provides an electronic device, which may be represented in the form of a computing device (for example, may be a server device), and includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor may implement the method for generating the commodity attribute provided in embodiment 4 when executing the computer program.

Example 6

The present embodiment provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor, implements the steps of the generation method of the article attribute provided in embodiment 4.

Example 7

The present embodiment provides a method for searching for a commodity, and fig. 5 shows a flowchart of the present embodiment. Referring to fig. 5, the search method of the present embodiment includes:

s201, generating attribute information;

s202, inputting a search word;

s203, searching attribute information according to the search terms;

and S204, outputting the search result.

Specifically, step S201 generates attribute information of each item in the web site using the item attribute generation method provided in embodiment 4, so that based on the search word input in step S202, step S203 searches among a large amount of attribute information generated in step S201, and outputs a search result of the attribute information generated based on the pair of search words in step S204.

The correct commodity attribute of the commodity is generated based on the generation method in the embodiment 4, and the search result is optimized in the embodiment, so that the user can obtain the correct search result when searching according to the search term, and the commodity which is actually irrelevant to the search term due to the wrong information added by the seller in the commodity title is prevented from appearing in the search result, and the user experience is improved. In addition, the writing of the seller to the commodity information can be standardized, so that effective commodity information can be obtained.

Example 8

The present embodiment provides an electronic device, which may be represented in the form of a computing device (for example, may be a server device), and includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the method for searching for an article provided in embodiment 7.

Example 9

The present embodiment provides a computer-readable storage medium on which a computer program is stored, which when executed by a processor, implements the steps of the search method for an article provided in embodiment 7.

Example 10

The embodiment provides a training system for a commodity attribute generation model, and fig. 6 shows a module diagram of the embodiment. Referring to fig. 6, the training system 1 of the present embodiment includes:

a first obtaining module 11, configured to obtain commodity information of a plurality of commodities;

the labeling module 12 is used for labeling the attribute information of a plurality of commodities;

a construction module 13 for constructing a plurality of sets of training data;

and the training module 14 is used for training the commodity attribute generation model according to the multiple groups of training data.

For example, the commodity is divided into five pants, the commodity title of the divided five pants acquired by the first acquisition module 11 is "the five pants in the fancy boy shorts male 2018 are trimmed in summer to display the thin leisure shorts male pants in the five pants card with the color of 31", and the commodity displayed by the acquired cover picture is divided into five pants. The attribute information marked by the marking module 12 is 'the color of the floral prince divided into five pants card'. The set of training data constructed by the construction module 13 may include: the style male pants man 2018 can be used for slimming leisure pants male pants in summer, namely the fifth pants card color 31, the cover picture and the style male pants card color.

It should be understood that when the construction module 13 constructs the training data, it may also include cleaning the above commodity information, for example, unifying case, converting the number words, model words, etc. into a preset format, setting the size of the pictures to a unified size, etc.

In this way, a plurality of sets of training data are constructed, and the training module 14 can train a product attribute generation model for generating attribute information from product information according to the plurality of sets of constructed data, that is, based on the product attribute generation module, it is possible to automatically generate correct product attribute information from a product title and a product picture.

In the present embodiment, the training module 14 may utilize, but is not limited to, an encoding-decoding model framework to train the above-constructed multiple sets of training data, and specifically, referring to fig. 7, the training module 14 may include:

a word segmentation unit 141, configured to perform word segmentation on the text for each set of training data to obtain a word segmentation sequence;

and a vector conversion unit 142, configured to convert the word segmentation sequence into a word segmentation vector.

Specifically, the vector conversion unit 142 may include an embedding layer for converting the word segmentation sequence into a word segmentation vector. More specifically, the vector conversion unit 142 may include, but is not limited to, a word2vec model for converting a sequence of participles into a participle vector. Due to the limitation of training data, unknown words are easy to encounter during word segmentation conversion, and furthermore, in the embodiment, the word2vec model can be trained in advance by further utilizing external data so as to optimize the processing of the unknown words.

And the text encoding unit 143 is configured to encode the word segmentation vector to obtain an encoded vector.

Specifically, the text encoding unit 143 may sequentially input the above-described word segmentation vectors to an RNN layer (recurrent neural Network) to acquire an encoding vector. More specifically, in the present embodiment, the text encoding unit 143 may sequentially input the above-described word segmentation vectors into 4-layer BI-LSTM (bidirectional LSTM, composed of Fwd LSTM (forward LSTM, features are input into the network from front to back in time series) and Bwd LSTM (backward LSTM, features are input into the network from back to front in time series), where LSTM (Long Short-Term Memory ) is a special RNN structure, to obtain the encoding vectors.

And a picture encoding unit 144, configured to extract a feature vector of the picture.

Specifically, the picture encoding unit 144 may extract a feature vector of the picture using, but not limited to, a resnet (depth residual network) model. In addition, because there are many resnet parameters and the training speed is slow, in this embodiment, the resnet model can be trained in advance by further using imagenet (the database with the largest image recognition in the world at present) data, so as to improve the training speed.

An attention unit 145, configured to receive the encoded vector and the feature vector, and calculate an attention distribution value of the encoded vector and the feature vector at the current time;

and a decoding unit 146, configured to decode according to the attention distribution value at the current time and the decoded participle obtained at the previous time, so as to obtain the decoded participle at the current time.

In this embodiment, the decoding unit 146 may perform decoding according to the encoding vector and the feature vector via, but not limited to, an LSTM layer and a softmax (generalization of logistic regression model to multi-classification problem) model, so as to obtain the attribute information. More specifically, the decoding unit 146 may automatically set different attention distribution values for the coding vector and the feature vector based on the attention unit 145, that is, may automatically allocate different attentions to the text and the picture in the commodity information, so that the text and the picture in the commodity information are better utilized, and better attribute information can be generated, where the decoded word obtained by the decoding unit 146 is an attribute word in the attribute information.

In this embodiment, a commodity attribute generation model is trained based on deep learning, and then based on the commodity attribute generation module, correct commodity attribute information can be automatically generated according to commodity information, more specifically, a commodity title and a commodity picture, so that the defect that the commodity title is written irregularly in a current website can be overcome.

Example 11

The present embodiment provides a system for generating a commodity attribute, and fig. 8 shows a module diagram of the present embodiment. Referring to fig. 8, the generation system 2 of the present embodiment includes:

the training system 1 of the product attribute generation model in embodiment 10 is used to train a product attribute generation model;

a second obtaining module 21, configured to obtain commodity information of a commodity;

a commodity information input module 22 for inputting commodity information into the commodity attribute generation model;

and an attribute information output module 23 for outputting the attribute information of the product.

Specifically, the training system 1 provided in embodiment 1 is configured to train a product attribute generation model, and after inputting the product information acquired by the second acquisition module 21 into the product attribute generation model, the attribute information output module 23 is capable of outputting the attribute information of the product, where in this embodiment, the product information includes a text and a picture.

In order to simplify the attribute information, in this embodiment, the attribute information output module 23 may select a preset number of attribute participles, and output the selected preset number of attribute participles. Specifically, the attribute information output module 23 may select a preset number of attribute participles having the highest degree of correlation with the commodity using a beam-search algorithm.

Based on the product attribute generation model obtained by the training system in embodiment 10, this embodiment can automatically generate correct product attribute information according to the text and the picture of each product in the website, so as to overcome the defect that the title of the product is written irregularly in the current website.

Example 12

The present embodiment provides a search system for goods, and fig. 9 shows a block diagram of the present embodiment. Referring to fig. 9, the search system 3 of the present embodiment includes:

the product attribute generation system 2 in embodiment 11 for generating attribute information;

a search term input module 31 for inputting a search term;

a search module 32 for searching the attribute information according to the search word;

and a search result output module 33, configured to output a search result.

Specifically, the generation system 2 provided in embodiment 11 generates attribute information of each commodity in a web site, so that the search module 32 searches a large amount of attribute information generated by the generation system 2 based on a search word input by the search word input module 31, and outputs a search result based on the attribute information generated by the search word pair by the search result output module 33.

The generation system in the embodiment 11 is used to generate the correct commodity attribute of the commodity, and the embodiment optimizes the search result, so that the user can obtain the correct search result when searching according to the search term, and the commodity which is actually irrelevant to the search term due to the wrong information added by the seller in the commodity title does not appear in the search result, thereby improving the user experience. In addition, the writing of the seller to the commodity information can be standardized, so that effective commodity information can be obtained.

While specific embodiments of the invention have been described above, it will be appreciated by those skilled in the art that this is by way of example only, and that the scope of the invention is defined by the appended claims. Various changes and modifications to these embodiments may be made by those skilled in the art without departing from the spirit and scope of the invention, and these changes and modifications are within the scope of the invention.

Claims

1. A training method for a commodity attribute generation model is characterized by comprising the following steps:

2. The method of training a product property generation model according to claim 1, wherein the step of training a product property generation model based on the plurality of sets of training data comprises:

3. The training method of the commodity attribute generation model according to claim 2, wherein the step of training the plurality of sets of training data using the encode-decode model framework includes:

converting the word segmentation sequence into a word segmentation vector;

encoding the word segmentation vector to obtain an encoded vector;

extracting a feature vector of the picture;

4. A training method of a commodity attribute generation model according to claim 3, wherein before the step of decoding based on the code vector and the feature vector, the training method further comprises:

inputting the encoding vector and the feature vector into an attention model;

wherein the decoding participles are attribute participles.

5. The training method of the commodity attribute generation model according to claim 3, wherein the step of converting the word segmentation sequence into a word segmentation vector comprises:

and extracting the feature vector of the picture by using a resnet model.

6. The training method of the commodity attribute generation model according to claim 5, wherein the step of converting the word segmentation sequence into word segmentation vectors by using a word2vec model comprises:

pre-training the word2vec model by using external data;

and pre-training the resnet model by utilizing imagenet data.

7. A training method for a commodity attribute generation model according to claim 1, wherein the plurality of commodities include:

8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the training method of the merchandise attribute generation model according to any one of claims 1-7 when executing the computer program.

9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the training method of the merchandise property generation model according to any one of claims 1-7.

10. A method for generating an attribute of a commodity, the method comprising:

training a commodity attribute generation model using a training method of the commodity attribute generation model according to any one of claims 1 to 7;

and outputting the attribute information of the commodity.

11. The method of generating an attribute of a commodity according to claim 10, wherein the attribute information includes at least one attribute word, and the step of outputting the attribute information includes:

selecting a preset number of attribute participles;

and outputting the selected attribute participles with the preset number.

12. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method of generating an attribute of a good according to claim 10 or 11 when executing the computer program.

13. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of generating an attribute of an item of claim 10 or 11.

14. A method for searching for a commodity, the method comprising:

generating attribute information by using the generation method of the article attribute according to claim 10 or 11;

inputting a search term;

searching the attribute information according to the search word;

and outputting the search result.

15. An electronic device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor implements the method of searching for an item of claim 14 when executing the computer program.

16. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of searching for an article of manufacture according to claim 14.

17. A training system for a commodity attribute generation model, the training system comprising:

18. The training system for the commodity attribute generation model of claim 17, wherein the training module is specifically configured to train the plurality of sets of training data using a code-decode model framework.

19. The training system for the commodity property generation model according to claim 17, wherein said training module comprises:

the picture coding unit is used for extracting a feature vector of a picture;

20. The training system for the commodity property generation model of claim 19, wherein said training module further comprises:

wherein the decoding participles are attribute participles.

21. The training system of the commodity attribute generation model according to claim 19, wherein the vector conversion unit converts the word segmentation sequence into a word segmentation vector using a word2vec model;

22. The training system of commodity attribute generation model according to claim 21, wherein the word2vec model is pre-trained using external data, and the resnet model is pre-trained using imagenet data.

23. The system for training a commodity attribute generation model according to claim 17, wherein the plurality of commodities includes:

24. A system for generating an attribute of a commodity, the system comprising:

a training system for a commodity attribute generation model according to any one of claims 17 to 23, for training a commodity attribute generation model;

25. The system for generating an attribute of a commodity according to claim 24, wherein the attribute information includes at least one attribute word, and the attribute information output module includes:

26. A search system for an article, the search system comprising:

a generation system of an attribute of a commodity according to claim 24 or 25, for generating attribute information;

the search word input module is used for inputting search words;

and the search result output module is used for outputting the search result.