CN109615058A - A kind of training method of neural network model - Google Patents

A kind of training method of neural network model Download PDF

Info

Publication number
CN109615058A
CN109615058A CN201811246828.XA CN201811246828A CN109615058A CN 109615058 A CN109615058 A CN 109615058A CN 201811246828 A CN201811246828 A CN 201811246828A CN 109615058 A CN109615058 A CN 109615058A
Authority
CN
China
Prior art keywords
neural network
network model
training
user terminal
cloud server
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811246828.XA
Other languages
Chinese (zh)
Inventor
景蔚亮
王海波
陈邦明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Xinchu Integrated Circuit Co Ltd
Original Assignee
Shanghai Xinchu Integrated Circuit Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Xinchu Integrated Circuit Co Ltd filed Critical Shanghai Xinchu Integrated Circuit Co Ltd
Priority to CN201811246828.XA priority Critical patent/CN109615058A/en
Publication of CN109615058A publication Critical patent/CN109615058A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to a kind of training methods of neural network model, belong to field of artificial intelligence.A kind of training method of neural network model, the user terminal that cloud server is set and is remotely connect with the cloud server;The training method specifically includes: step S1, and the cloud server is trained the neural network model using the first training dataset;Step S2, the cloud server judge the accuracy of the neural network model test: the neural network model is sent to the user terminal by step S3, the cloud server;Step S4, the user terminal continue to be trained the neural network model using the second training dataset;The neural network model is saved as available model by step S5, the user terminal, for subsequent use;In this way, neural network model can complete most training mission beyond the clouds, improve trained efficiency, also save the memory space of terminal device.

Description

A kind of training method of neural network model
Technical field
The present invention relates to a kind of field of artificial intelligence more particularly to a kind of training methods of neural network model.
Background technique
With artificial intelligence image recognition, in terms of application and its higher accuracy rate, by artificial intelligence Can technology be applied to terminal device and have become the direction being concerned, but artificial intelligence technology, especially neural network mould The operation of type has higher requirements for the performance and memory space of equipment, keeps it limited in the application of terminal device.Neural network Model is generally divided into two processes, training process and reasoning process, training process be using established neural network model with And the data set being collected into, model is trained, is made inferences later in reasoning process using trained model, it can Enough carry out the identification or classification of image or sound.Since training process needs to constantly update data, i.e., train every time for same One data need to be written and read at least once, therefore training process needs the higher processor of performance to read memory It takes and is written, and reasoning process is calculated using data and trained model, is obtained a result and is made reasoning, and number need to be only read According to, therefore reasoning process needs the higher processor of performance to be read out memory.Traditional application mode makes reasoning All in same processing unit and storage unit, this is often required that between storage unit and processing unit for process and training process Readwrite performance is all very high, brings challenges.The training process of the neural network model of terminal device is generally given at cloud at present simultaneously Device is managed, model is sent to terminal device again using the data set completion training process in cloud and made inferences by cloud processor, when again When needing training pattern, cloud processor is recycled to be trained, this often results in the loss of very big transmission data, to terminal Become a main limiting factor for equipment.Meanwhile data set used in cloud processor is if it is from other terminals It is collected in equipment, is trained then such data set is fully utilized, trained model is caused to be not necessarily suitable use Family, but if the cloud processor data only collected using the terminal device of user come the model of training user, if user Data set amount is insufficient, and the application precision that will also result in model is not high.
Summary of the invention
For the deficiency of technology, the problem to be solved in the present invention provides a kind of training method of neural network model.
Wherein, the user terminal that cloud server is set and is remotely connect with cloud server;
Training method specifically includes:
Step S1, cloud server are trained neural network model using the first training dataset;
Step S2, cloud server judge the accuracy of the test of neural network model:
If accuracy is greater than one first preset threshold, step S3 is turned to;
If accuracy is not more than the first preset threshold, return step S1, to continue to be trained neural network model;
Neural network model is sent to user terminal by step S3, cloud server;
Step S4, user terminal continues to be trained neural network model using the second training dataset, until nerve net Until the accuracy of the test of network model is greater than one second preset threshold;
Neural network model is saved as available model by step S5, user terminal, for subsequent use;
Second preset threshold is greater than the first preset threshold.
During later using model, user terminal persistently detects the errors number of the reasoning of available model, and Errors number re-execute the steps S4 to step S5 when being more than a third predetermined threshold value.
Cloud server is long-range respectively to connect a plurality of clients;
Cloud server collects the data that each user terminal uploads, as the first training dataset.
User terminal forms the second training dataset in such a way that data acquire.
The training system of the neural network model includes cloud server and user terminal, and cloud server remotely connects user End;
Cloud server includes:
First training unit, for being trained using the first training dataset to neural network model;
First judging unit connects the first training unit, for judging the accuracy of the test of neural network model and defeated First judging result out, when the first judging result indicates that the accuracy of the test of neural network model is not more than one first default threshold When value, the first training unit continues to be trained neural network model;
Transmission unit is separately connected the first judging unit and the first training unit, for indicating mind in the first judging result When the accuracy of test through network model is greater than first preset threshold, the neural network model that training is formed is sent to use Family end;
User terminal includes:
Receiving unit, for receiving neural network model;
Second training unit, connect receiving unit, for using the second training dataset continue to neural network model into Row training exports the mind that training is formed until the accuracy of the test of neural network model is greater than one second preset threshold Through network model;
Storage unit connects the second training unit, and neural network model is saved as available model by storage unit, for rear It is continuous to use;
Second preset threshold is greater than the first preset threshold.
User terminal further include:
Second judgment unit connects the second training unit, persistently detects available model during user is using model Reasoning errors number, and errors number be more than a third predetermined threshold value when neural network model is sent back to the second instruction again Practice and is trained in unit.
Cloud server is long-range respectively to connect a plurality of clients;
Cloud server collects the data that each user terminal uploads, as the first training dataset.
User terminal forms the second training dataset in such a way that data acquire.
Training method through the invention, neural network model can complete most training mission beyond the clouds, improve The efficiency of training, also saves the memory space of terminal device, simultaneously as only using user terminal collection in the trained later period Data set training neural network model rather than all using cloud collection data set, ensure that trained model more Suitable for user itself, during use later, once the number that inference errors occurs in model has been more than certain threshold value, It still then is trained model using the data set that terminal device and terminal device are collected, until the accuracy of model is higher than institute It is required that value improve the service efficiency of equipment without being transmitted to cloud again.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.
Fig. 1: the training method flow chart of neural network
Fig. 2: the system block diagram of neural network
Fig. 3: the accuracy threshold value of neural network
Fig. 4: the training and reasoning process of neural network model
Specific embodiment
A kind of training method of neural network model, specifically includes:
Step S1, cloud server 1 are trained neural network model using the first training dataset;
Step S2, cloud server 1 judge the accuracy of the test of neural network model:
If accuracy is greater than one first preset threshold, step S3 is turned to;
If accuracy is not more than the first preset threshold, return step S1, to continue to be trained neural network model;
Neural network model is sent to user terminal 2 by step S3, cloud server 1;
Step S4, user terminal 2 continues to be trained neural network model using the second training dataset, until nerve net Until the accuracy of the test of network model is greater than one second preset threshold;
Neural network model is saved as available model by step S5, user terminal 2, for subsequent use;
Second preset threshold is greater than the first preset threshold.
In the accuracy of the test of neural network model, test refers to output data and positive exact figures to neural network model According to being compared, correct data here refers to that the training data for being trained to neural network model concentrates corresponding mark Quasi- output data.
During later using model, user terminal 2 persistently detects the errors number of the reasoning of available model, and Errors number re-execute the steps S4 to step S5 when being more than a third predetermined threshold value.
Here reasoning refers to output prediction result, and whether mistake is judged by user feedback for reasoning, user Reasoning is wrong is just denoted as primary mistake for feedback.
As shown in Figure 1 and Figure 4, it by doing repetition training to neural network model with a large amount of training data, ultimately forms Neural network model the reasoning results meet result required by neural network model input data;
Cloud server 1 is long-range respectively to connect a plurality of clients 2;
Cloud server 1 collects the data that each user terminal 2 uploads, as the first training dataset.
The data set of cloud server 1 refers to the set for the data set that cloud server 1 is collected from a plurality of clients 2, such as The set of data set A shown in Fig. 4, data set B, data set C, data set D.
User terminal 2 forms the second training dataset in such a way that data acquire.
Data acquisition may include the data directly inputted by the different collected data of sensor or user Deng.
The training system of the neural network model includes cloud server 1 and user terminal 2, and cloud server 1 remotely connects User terminal 2;
Cloud server 1 includes:
First training unit 11, for being trained using the first training dataset to neural network model;
First judging unit 12 connects the first training unit 11, the accuracy of the test for judging neural network model And the first judging result is exported, when the first judging result indicates that the accuracy of the test of neural network model is pre- no more than one first If when threshold value, the first training unit 11 continues to be trained neural network model;
Transmission unit 13 is separately connected the first judging unit 12 and the first training unit 11, in the first judging result When indicating that the accuracy of the test of neural network model is greater than first preset threshold, the neural network model that training is formed is sent out It send to user terminal 2;
User terminal 2 includes:
Receiving unit 21, for receiving neural network model;
Second training unit 22 connects receiving unit 21, for being continued using the second training dataset to neural network mould Type is trained, and until the accuracy of the test of neural network model is greater than one second preset threshold, is exported training and is formed Neural network model;
Storage unit 23 connects the second training unit 22, and neural network model is saved as available model by storage unit 23, For subsequent use;
Second preset threshold is greater than the first preset threshold.
User terminal 2 trains accuracy threshold value, is determined according to the performance of user terminal 2 and the size of training dataset;
User terminal 2 can be mobile phone, tablet computer, computer;
User terminal 2 further include:
Second judgment unit 24 connects the second training unit 22, the mistake time of the reasoning for persistently detecting available model Number, and neural network model is sent back to again in the second training unit 22 when errors number is more than a third predetermined threshold value and is carried out Training.
As shown in Fig. 2,
Cloud server 1 includes the first training unit 11, the first judging unit 12, transmission unit 13;
User terminal 2 includes receiving unit 21, the second training unit 22, storage unit 23, second judgment unit 24;
In order to make it easy to understand, only showing a user terminal 2 in Fig. 2.
Cloud server 1 is long-range respectively to connect a plurality of clients 2;
Cloud server 1 collects the data that each user terminal 2 uploads, as the first training dataset.
User terminal 2 forms the second training dataset in such a way that data acquire.
The present invention is further illustrated in the following with reference to the drawings and specific embodiments, but not as the limitation of the invention.
Embodiment 1
For the training process of a user terminal neural network model, server 1 is trained model beyond the clouds first, institute The data set that the data set used is collected from each user terminal 2 from cloud server 1, data set A as shown in Figure 4, Data set B, data set C, data set D, when neural network model accuracy is more than the first preset threshold a% (as shown in Figure 3), The parameter of neural network model is passed into user terminal 2, user terminal 2 utilizes the data set (data as shown in Figure 4 of themselves capture Collection A or data set B or data set C or data set D) continue to train, until neural network model accuracy is higher than second When preset threshold b% (as shown in Figure 3), user (user A or user B as shown in Figure 4 or user C or user D) starts to make It is made inferences with neural network model, this process is as shown in Figure 4.Neural network model during use later, once When inference errors number is more than third predetermined threshold value (both reasoning failure threshold values), illustrate that the generalization ability of neural network model (pushes away Manage correct ability) decline, it needs to re-start training, until trained accuracy is more than user terminal training accuracy threshold value.
Training method through the invention, neural network model can complete most training mission beyond the clouds, improve The efficiency of training, also saves the memory space of terminal device, simultaneously because the training later period only uses the number of user terminal collection According to training neural network model is collected, so ensure that trained model is more applicable for user itself.
The above is only preferred embodiments of the present invention, are not intended to limit the implementation manners and the protection scope of the present invention, right For those skilled in the art, it should can appreciate that and all replace with being equal made by description of the invention and diagramatic content It changes and obviously changes obtained scheme, should all be included within the scope of the present invention.

Claims (8)

1. a kind of training method of neural network model, which is characterized in that setting cloud server and with the cloud service The user terminal that device remotely connects;
The training method specifically includes:
Step S1, the cloud server are trained the neural network model using the first training dataset;
Step S2, the cloud server judge the accuracy of the neural network model test:
If the accuracy is greater than one first preset threshold, step S3 is turned to;
If the accuracy is not more than first preset threshold, the step S1 is returned to, to continue to the neural network Model is trained;
The neural network model is sent to the user terminal by step S3, the cloud server;
Step S4, the user terminal continues to be trained the neural network model using the second training dataset, until institute The accuracy of the test of neural network model is stated greater than until one second preset threshold;
The neural network model is saved as available model by step S5, the user terminal, for subsequent use;
Second preset threshold is greater than first preset threshold.
2. training method as described in claim 1, which is characterized in that the user terminal is during using the available model The errors number of the reasoning of the available model is persistently detected, and again when the errors number is more than a third predetermined threshold value The step S4 is executed to the step S5.
3. training method as described in claim 1, which is characterized in that the cloud server remotely connects multiple described respectively User terminal;
The cloud server collects the data that each user terminal uploads, as first training dataset.
4. training method as described in claim 1, which is characterized in that the user terminal forms institute in such a way that data acquire State the second training dataset.
5. a kind of training system of neural network model, which is characterized in that including cloud server and user terminal, the cloud clothes Business device remotely connects the user terminal;
The cloud server includes:
First training unit, for being trained using the first training dataset to the neural network model;
First judging unit connects first training unit, the accuracy of the test for judging the neural network model And the first judging result is exported, when first judging result indicates that the accuracy of the test of the neural network model is not more than When one first preset threshold, first training unit continues to be trained the neural network model;
Transmission unit is separately connected first judging unit and first training unit, in the first judgement knot When fruit indicates that the accuracy of the test of the neural network model is greater than first preset threshold, by the nerve of training formation Network model is sent to the user terminal;
The user terminal includes:
Receiving unit, for receiving the neural network model;
Second training unit connects the receiving unit, for being continued using the second training dataset to the neural network mould Type is trained, until the accuracy of the test of the neural network model is greater than one second preset threshold, output training The neural network model formed;
Storage unit, connects second training unit, and the neural network model is saved as available mould by the storage unit Type, for subsequent use;
Second preset threshold is greater than first preset threshold.
6. training system as claimed in claim 5, which is characterized in that the user terminal further include:
Second judgment unit connects second training unit, the mistake time of the reasoning for persistently detecting the available model Number, and the neural network model is sent back into second training again when the errors number is more than a third predetermined threshold value It is trained in unit.
7. training system as claimed in claim 5, which is characterized in that the cloud server remotely connects multiple described respectively User terminal;
The cloud server collects the data that each user terminal uploads, as first training dataset.
8. training system as claimed in claim 5, which is characterized in that the user terminal forms institute in such a way that data acquire State the second training dataset.
CN201811246828.XA 2018-10-24 2018-10-24 A kind of training method of neural network model Pending CN109615058A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811246828.XA CN109615058A (en) 2018-10-24 2018-10-24 A kind of training method of neural network model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811246828.XA CN109615058A (en) 2018-10-24 2018-10-24 A kind of training method of neural network model

Publications (1)

Publication Number Publication Date
CN109615058A true CN109615058A (en) 2019-04-12

Family

ID=66002142

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811246828.XA Pending CN109615058A (en) 2018-10-24 2018-10-24 A kind of training method of neural network model

Country Status (1)

Country Link
CN (1) CN109615058A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110415678A (en) * 2019-06-13 2019-11-05 百度时代网络技术(北京)有限公司 Customized voice broadcast client, server, system and method
CN110532445A (en) * 2019-04-26 2019-12-03 长佳智能股份有限公司 The cloud transaction system and its method of neural network training pattern are provided
CN110659732A (en) * 2019-09-20 2020-01-07 上海新储集成电路有限公司 Method for intelligently adjusting neural network model
CN110703994A (en) * 2019-09-20 2020-01-17 上海新储集成电路有限公司 Data storage system and method of neural network model
CN111598139A (en) * 2020-04-24 2020-08-28 北京奇艺世纪科技有限公司 Data processing method and system
CN112085208A (en) * 2020-07-30 2020-12-15 北京聚云科技有限公司 Method and device for model training by using cloud

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9336483B1 (en) * 2015-04-03 2016-05-10 Pearson Education, Inc. Dynamically updated neural network structures for content distribution networks
CN106709917A (en) * 2017-01-03 2017-05-24 青岛海信医疗设备股份有限公司 Neural network model training method, device and system
CN107508866A (en) * 2017-08-08 2017-12-22 重庆大学 Reduce the method for the transmission consumption of mobile device end neural network model renewal
CN107871164A (en) * 2017-11-17 2018-04-03 济南浪潮高新科技投资发展有限公司 A kind of mist computing environment personalization deep learning method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9336483B1 (en) * 2015-04-03 2016-05-10 Pearson Education, Inc. Dynamically updated neural network structures for content distribution networks
CN106709917A (en) * 2017-01-03 2017-05-24 青岛海信医疗设备股份有限公司 Neural network model training method, device and system
CN107508866A (en) * 2017-08-08 2017-12-22 重庆大学 Reduce the method for the transmission consumption of mobile device end neural network model renewal
CN107871164A (en) * 2017-11-17 2018-04-03 济南浪潮高新科技投资发展有限公司 A kind of mist computing environment personalization deep learning method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘庆等: "基于非监督预训练的结构优化卷积神经网络", 《工程科学与技术》, 30 June 2017 (2017-06-30), pages 213 - 218 *
邵欣: "《物联网技术及应用》", 北京航空航天大学出版社, pages: 203 - 209 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110532445A (en) * 2019-04-26 2019-12-03 长佳智能股份有限公司 The cloud transaction system and its method of neural network training pattern are provided
CN110415678A (en) * 2019-06-13 2019-11-05 百度时代网络技术(北京)有限公司 Customized voice broadcast client, server, system and method
CN110659732A (en) * 2019-09-20 2020-01-07 上海新储集成电路有限公司 Method for intelligently adjusting neural network model
CN110703994A (en) * 2019-09-20 2020-01-17 上海新储集成电路有限公司 Data storage system and method of neural network model
CN110659732B (en) * 2019-09-20 2023-07-04 上海新储集成电路有限公司 Method for intelligently adjusting neural network model
CN111598139A (en) * 2020-04-24 2020-08-28 北京奇艺世纪科技有限公司 Data processing method and system
CN112085208A (en) * 2020-07-30 2020-12-15 北京聚云科技有限公司 Method and device for model training by using cloud

Similar Documents

Publication Publication Date Title
CN109615058A (en) A kind of training method of neural network model
CN110263921B (en) Method and device for training federated learning model
CN112181666B (en) Equipment assessment and federal learning importance aggregation method based on edge intelligence
CN109948669B (en) Abnormal data detection method and device
US11176418B2 (en) Model test methods and apparatuses
US11379723B2 (en) Method and apparatus for compressing neural network
CN110473083B (en) Tree risk account identification method, device, server and storage medium
CN112748941B (en) Method and device for updating target application program based on feedback information
Li et al. Online federated multitask learning
KR20200145827A (en) Facial feature extraction model learning method, facial feature extraction method, apparatus, device, and storage medium
CN112149733B (en) Model training method, model quality determining method, model training device, model quality determining device, electronic equipment and storage medium
CN112001274B (en) Crowd density determining method, device, storage medium and processor
CN111915020A (en) Method and device for updating detection model and storage medium
CN110414581B (en) Picture detection method and device, storage medium and electronic device
CN111414540B (en) Online learning recommendation method and device, online learning system and server
CN114861172B (en) Data processing method and system based on government affair service system
CN111526213A (en) Equipment control system based on block chain
CN112817563B (en) Target attribute configuration information determining method, computer device, and storage medium
CN112861659A (en) Image model training method and device, electronic equipment and storage medium
CN113052025A (en) Training method of image fusion model, image fusion method and electronic equipment
CN110610140B (en) Training method, device and equipment of face recognition model and readable storage medium
CN115690592B (en) Image processing method and model training method
CN110472537B (en) Self-adaptive identification method, device, equipment and medium
CN116435995A (en) Time series processing method, computer readable storage medium and electronic device
CN115830709A (en) Action recognition method based on federal edge learning, server and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination