CN101782976A - Automatic selection method for machine learning in cloud computing environment - Google Patents

Automatic selection method for machine learning in cloud computing environment Download PDF

Info

Publication number
CN101782976A
CN101782976A CN201010017918A CN201010017918A CN101782976A CN 101782976 A CN101782976 A CN 101782976A CN 201010017918 A CN201010017918 A CN 201010017918A CN 201010017918 A CN201010017918 A CN 201010017918A CN 101782976 A CN101782976 A CN 101782976A
Authority
CN
China
Prior art keywords
cloud
machine learning
algorithm
data
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201010017918A
Other languages
Chinese (zh)
Other versions
CN101782976B (en
Inventor
王汝传
孔强
任勋益
付雄
邓松
易侃
杨明慧
蒋凌云
邓勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Post and Telecommunication University
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing Post and Telecommunication University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Post and Telecommunication University filed Critical Nanjing Post and Telecommunication University
Priority to CN 201010017918 priority Critical patent/CN101782976B/en
Publication of CN101782976A publication Critical patent/CN101782976A/en
Application granted granted Critical
Publication of CN101782976B publication Critical patent/CN101782976B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention relates to an automatic selection method for machine learning in cloud computer environment. By using a cloud computing platform, a user can automatically and intelligently build a machine learning mathematic model which meets actual problems without building the operation environment of machine learning, selecting a machine to learn algorithm and even adjusting complicated machine learning functions and accompanying parameters but only using a Web method to upload sample data. Through the method, the use of machine learning is free from the environmental constraints and displays the advantages of the cloud computing platform, so that the machine learning model building is transparent to the user, so as to best reduce the use threshold of machine learning. The automatic selection method for machine learning in cloud computing environment solves the disadvantages of the unpredictability of model building selection, the manual experience of parameter adjustment, the difficulties of common users and the like when machine learning is applied in actual life.

Description

A kind of automatic selection method for machine learning in cloud computing environment
Technical field
The present invention is the autonomous system of selection of a kind of machine learning based on cloud computing environment.By using cloud computing platform, make the user need not to build the running environment of machine learning, also need not to learn machine learning algorithm, more need not adjust the machine learning function and the parameter thereof of numerous and complicated, only need under cloud computing platform, use the Web mode to upload training data and prediction test data, and determine to comprise information seldom such as usable range, expectation territory, just can obtain needed multiple machine learning model and specific descriptions, so that solve practical problems.
Background technology
Machine learning is the another important application after expert system application, artificial intelligence application, also is a kind of core research topic of artificial intelligence simultaneously.Its objective is to make computing machine can simulate or test human learning behavior, thereby acquire knowledge or technical ability can constantly be improved performance according to new information simultaneously.The ability of machine learning is very important feature, and H.A.Simon thinks that study is the adaptations that system does, and the system that makes finishes same next time or obtaining better to finish effect during similar task.R.s.Michalski thinks that study is the expression of constructing or revising for the experience things.The people that are engaged in DEVELOPMENT OF EXPERT SYSTEM think that then study is obtaining of knowledge.These viewpoints emphasize particularly on different fields, and first kind of viewpoint emphasized the external behavior effect learnt, then emphasize the internal procedure learnt for second kind, and the third mainly is from the point of view of practicability of knowledge engineering.
The Research of Machine Learning method is to use for reference the understanding to the self-teaching mechanism of the mankind own such as physiology, psychology, cognitive science, foundation is to the computation model or the cognitive model of human learning process, thereby form the various theories of learning and learning method, set up the learning system with application-specific of oriented mission.These goals in research influence each other and mutually promote.Since 1980 since Ka Neiji-Mei Long university holds first machine scientific seminar, machine learning development is very fast, has become one of central topic.
And the history development procedure of machine learning is divided into four-stage: (1) the mid-50 is to the ardent period of the mid-1960s; (2) the mid-1960s is to the calm period of the mid-1970s; (3) the mid-1970s is to the recovery period of the mid-80; Beginning in (4) 1986 years then is the latest stage of machine learning.And this period at present, outstanding feature is that machine learning has developed into an emerging frontier branch of science, has merged various learning method, and range of application is also increasing, and relevant academic activities are very active.
Machine learning develops into present stage, uses very extensively, and a lot of outstanding algorithms that have been born may be summarized to be basically based on the study of symbol with based on is-not symbol study, just connectionist learning.And the former is based on the study of symbol, generally comprise rote learning, instruct formula study, the study of example formula, matching test study, based on study of explaining or the like.
Wherein comparatively common algorithm has: decision Tree algorithms, genetic algorithm, Bayesian statistics algorithm, artificial neural network algorithm, algorithm of support vector machine, association rule algorithm or the like.Carry the MBM of these common algorithms in the method for this paper design, and used the EM algorithm to come parameter is carried out maximal possibility estimation.
But be to use machine learning techniques to handle specific tasks, mainly face three problems: (1) is at a certain specific tasks the time, setting up machine learning model wastes time and energy, because the distinctiveness of specific tasks details, be difficult to directly use for reference the system model that other have built, need select according to the personal experience.(2) even certain subtask, correctly selected relatively to meet the machine learning algorithm of objective fact essence, how its complicated parameter is provided with also is a problem that must solve, need rule of thumb or the long computing of subscriber computer obtains, single user's computing power (3) user that is difficult to deal with problems fast need learn and use concrete machine learning software, the machine learning algorithm numerous and complicated, autonomous learning need spend the plenty of time, simultaneously certain some algorithm of user's autonomous learning differ also that suitable user surely need solve each run into task.
And the cloud computing technology of emerging appearance but can solve above problem, makes convenient being applied in the reality of machine learning, the faster and better creation of value.
Cloud computing is a kind of novel computation model that proposes on the basis of development such as distributed system, grid computing, is a kind of method of emerging shared architecture, and what it was faced is ultra-large distributed environment, and core provides data storage and network service.This is a kind of payment and use pattern of serving of referring to, refer to by network with as required, the mode of easily expansion obtains required service.It is relevant with software, internet that this service can be IT, also can be other service arbitrarily.Cloud computing provides the most reliable, safest data storage center, the user again concern of data lose, trouble such as poisoning intrusion, cloud computing has simultaneously reached minimum to the equipment requirements of user side." cloud " mentioned in the cloud computing is that some can self and the virtual computational resource of management, is generally some large server clusters, comprises calculation server, storage server, broadband resource or the like.Cloud computing puts together all computational resources by various clouds are provided, and realizes management automatically by software, need not artificial participation.It is loaded down with trivial details details worry that this feasible supplier of application need not, and can be absorbed in the business of oneself more, helps innovating and reducing cost.Be applied in the machine learning, cloud computing can set up machine learning model and correlation module is selected for the user, and the achievement that makes the user enjoy machine learning techniques fast solves problem.
Existing cloud computing platform is based on Theoretical Calculation and stores service substantially, finds no the special cloud computing method of setting up for machine learning, and the present invention provides a kind of feasible implementation method in conjunction with the advantage and the characteristic of machine learning techniques and cloud computing technology.
Summary of the invention
Technical matters: the purpose of this invention is to provide is a kind of automatic selection method for machine learning in cloud computing environment.By using cloud computing platform, solved the problem of machine learning modeling inconvenience, provide a kind of and provide conveniently method in conjunction with cloud computing skill and machine learning techniques processing realistic problem.Thereby it is worried that the user be need not to loaded down with trivial details details, can be absorbed in the business of oneself more, helps innovating and reducing cost.
Technical scheme: the present invention makes the use of machine learning break away from the constraint of environment, the cloud computing platform computing power and the transparency have efficiently been given full play to, farthest reduced the use threshold of machine learning, make the user need not to comform in the multimachine device learning method and seek the suitable machine learning method by experiment repeatedly, shortcomings such as the artificial experience that is difficult to predictability, parameter adjustment that modeling is selected, domestic consumer's difficulty of learning have been solved when the practical application machine learning.
The present invention seeks to set up the method for the cloud computing platform that the machine learning service is provided.Under cloud computing platform, carry out system constructing by following three aspects: the various machine learning clouds of setting up a large amount of computing machines compositions that exist with the cloud form on the one hand, comprise decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud, make the cloud computing platform acquiescence carry common machine learning algorithm; Generally estimate cloud, method discovery cloud, EM algorithm support cloud, valuation functions cloud, calculate cloud, machine learning algorithm expansion cloud by initial modeling cloud, the search volume formed by computer cluster equally on the other hand, thereby embody the advantage of cloud, calculate the suitable parameters of the machine learning use that domestic consumer is difficult to or need calculate for a long time by a large amount of computational resources; Last aspect is that cloud computing platform and user carry out mutual necessary module, comprises the Web interactive interface, and machine learning input/output module and cloud administration module are in order to support the operation of cloud computing platform.
Step 1) is under the uniform dispatching of cloud administration module, at first by the Web interactive interface, obtain the required rough description of dealing with problems of user, comprise the problem kind, big class under promptly selecting, from expert system, cognitive simulation, planning and problem solving, data mining, the network information service, pattern recognition, fault diagnosis, natural language understanding, robot and game, other classification, select
Step 2) enable initial modeling cloud,, enter different subclass interfaces, fill in corresponding more detailed information by the big class that the user in the step 1) provides, comprise carry out that sample is uploaded, selected method for expressing, determines the interpretation of result method, usable range, expectation territory,
Step 3) startup method is found cloud, compares according to the historical typical example of information of same that the user provides, and determines because of taking the machine learning algorithm of which kind of or which kind; This cloud module is accompanied by the subsequent step operation, thereby adjusts constantly according to each stage result of calculation,
The information that step 4) is imported step 2 kind of user then, input machine learning input/output module, must unitize, after the datumization, carry out operations such as missing values processing, noise data processing, data scrubbing, data integration, data conversion, data reduction successively, so that obtain the intermediate result that general algorithm can use
Step 5) starts the valuation functions cloud, according to the user in step 2) information of input sets up valuation functions, the quality that machine learning is separated judges and prepares, thereby the specific algorithm performance is predicted,
Step 6) is called the EM algorithm simultaneously and is supported cloud, and solution space is carried out maximal possibility estimation, calculates optimum solution or more excellent approximate location in solution space of separating, and increases search efficiency,
After step 7) arrives this step, illustrate that preliminary work finishes, be about to carry out the training process of machine learning, automatic judgement by above step, call one or several concrete machine learning cloud modules respectively and learn, comprise decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud; As User Defined machine learning algorithm expand cloud, then preferentially call machine learning algorithm and expand cloud,
Step 8) is calculated through above step, selects one or several algorithm clouds, with its startup, passes through the Web interactive interface simultaneously to field feedback, comprises the step of calculating operation, the intermediate result that obtains, and current optimum solution changes,
Step 9) supports in the process that iterates of cloud at the EM algorithm, constantly turn back to step 6, step 7 is calculated, judge whether to reach end condition simultaneously, if reach end condition then jump procedure 10, otherwise the performance prediction algorithm that uses step 5 to formulate is judged the outstanding degree of separating, this step needs a large amount of computational resources, thereby need utilize the calculating advantage of cloud computing, must calculate outstanding separating as far as possible
When step 10) satisfies at end condition, as arriving computing time, there be not more excellent separating or the iteration end of algorithm own, result of calculation is converted to information by the machine learning input/output module with readability, return the client by the Web interactive interface again, and provide detailed data to download, preserve the machine learning result simultaneously, so that reuse, avoid double counting.
One, architecture
Whole proposal has comprised that cloud is generally estimated in decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud, initial modeling cloud, search volume, method finds that cloud, EM algorithm support cloud, valuation functions cloud, calculating cloud, machine learning algorithm expansion cloud and Web interactive interface module, machine learning input/output module, cloud administration module.Mutual relationship as shown in Figure 2.
Provide the explanation of concrete module below:
Decision Tree algorithms cloud: decision Tree algorithms modeling service that the major function of this cloud provides and prediction service.Decision tree be a kind ofly be used to classify, the method for the forecasting type modeling of cluster, prediction, the thought of employing is " dividing and rule ", thus it is divided into some subclass with the search volume and sets up decision tree.This algorithm is one of most widely used induction algorithm at present, is a kind of algorithm that approaches discrete function, based on example, is commonly used to as sorter.And basic decision Tree algorithms can be described as a kind of greedy algorithm, and the algorithm of existing large-scale application all is that the improvement and the function of basic decision tree algorithm strengthened.This algorithm cloud comprises following common improved decision Tree algorithms equally: C4.5 method, CART method, SLIQ method, SPRINT method etc.
The C4.5 algorithm has increased the processing to continuous type attribute, property value vacancy, has improved the ability of cutting out of decision tree equally.The technology that the CART algorithm adopts a kind of two minutes recurrence to cut apart is divided into two subclass with sample, makes each non-leaf node of decision tree that two branches all be arranged, thereby obtains binary decision tree simple for structure.The SLIQ algorithm has mainly adopted the breadth First algorithm to claim decision tree; The SPRINT algorithm has well solved the restriction of memory size, has handled the ultra-large training set that other algorithms can not be suitable for, and effectively deep layer decision tree.
The function of decision Tree algorithms cloud provides basic decision Tree algorithms modeling training service and prediction service on the one hand, and the characteristic according to the input data of intelligence selects concrete improved decision Tree algorithms to carry out calculation process on the other hand.
Genetic algorithm cloud: genetic algorithm modeling service that the major function of this cloud provides and prediction service.Genetic algorithm is to carry out evolutionary process by " survival of the fittest " rule in simulating nature circle and the algorithm that designs.Bagley and Rosengerg have at first proposed the notion of genetic algorithm in the PhD dissertation at them in 1967.The monograph of Holland publication in 1975 has been established the theoretical foundation of genetic algorithm.Nowadays genetic algorithm has not only provided arthmetic statement clearly, and Set upOne The result of a little quantitative test has obtained widely should in various fieldsWhen being similar to production task planning, exploited in communication, TSP problem, knapsack problem and Flame Image Process and signal Processing etc., can adopt this algorithm cloud with, the problem description that proposes as the user.
The fundamental purpose of genetic algorithm cloud is to use genetic algorithm thought, and the simulation of evolving is gradually tried to achieve finally and separated to user's target.Carry out iteration by initial solution, constantly from old separating, produce new explanation according to certain rule, and expect new separate than old separate outstanding.If new explanation is high more by the value that valuation functions calculates, the chance that it keeps is also just big more.Genetic algorithm is only used the coded representation problem, and the value that obtains with valuation functions is foundation, does not require clear and definite analytical expression, therefore can solve the non-linear optimizing problem of arbitrary height.And easily combine, get its strong point, obtain more excellent effect with other algorithms.
Bayesian statistics algorithm cloud: Bayes Modeling service that the major function of this cloud provides and prediction service.Bayes is that statistic algorithm can be predicted may concern between the class members, belongs to the probable value of certain class as given certain sample number.Bayesian algorithm belongs to the certain kinds probability by the calculating sample and classifies mainly based on Bayes law.Compare with other method, Bayes can be specially adapted to the situation that sample is difficult to obtain in conjunction with sample information and prior probability.Owing to need to calculate prior probability,, be fit to small scale machine study more simultaneously along with increasing of sample makes rise appreciably computing time.When being similar to aspect such as information recovery and diagnosis, economic field, classification automatically, production quality control when user's problem description, can adopt this algorithm cloud.
The fundamental purpose of Bayesian statistics algorithm cloud is to use Bayes statistical method, calculates the probability of the object that belongs to a certain class, has the class of maximum probability and is class under this object.The object of its processing can be that disperse, continuous, also can be mixed type.Based on bayes method, common have naive Bayesian method and a Bayesian network method.This algorithm cloud comprises common bayes method method for establishing model and the expansion of constantly being upgraded.
Artificial neural network algorithm cloud: artificial neural network modeling service that the major function of this cloud provides and prediction service.Artificial neural network utilizes the intelligency activity of computer technology simulation human brain, simulates the structure and the information conduction pattern of biological neural network, and expresses with mathematical form.Artificial neural network is a basic technology in the current intelligent science and technology, and the symbolic reasoning mechanism of the connection mechanism of employing and artificial intelligence becomes two big camps of intelligence science and technology side by side.The anatomical physiology feature of artificial Neural Network Simulation human brain is with many parallel simple neurons, under certain topological structure connects, accept external information, stimulate mutually simultaneously, thereby reach distributed store, associative memory, the feedback refinement, the black box mapping, the weights balance is dynamically approached, record is deposited in holography, the effect of fault-tolerant anti-mistake.Simultaneously because the imictron interconnection, when quantity reaches certain rank, can form powerful self study, self-adaptation, self-organization, self diagnosis, self-reparing capability, by constantly feeding back between node, can simulate the reasoning from logic of human brain to a certain extent, scope therefore has a wide range of applications.Particularly, when approximation of function and credit risk assessment, can preferentially adopt this algorithm cloud in pattern-recognition.
The fundamental purpose of artificial neural network algorithm cloud is used artificial neural network technology, adopts the method for simulation cerebral nerve network, manual construction a kind of neural network that can realize certain function.This algorithm cloud can produce human brain neural network's mathematical model, forms a kind of model of setting up based on imitating cerebral nerve network structure and function.This model is interconnected by a large amount of simple components and neuron, and a kind of complex network of formation has the non-linear of height, can carry out complicated logical operation and nonlinear relationship and realize.Usable range is very extensive.
Algorithm of support vector machine cloud: support vector machine modeling service that the major function of this cloud provides and prediction service.Support vector machine (Support Vector Machine) is a kind of new mode identification method that grows up on the basis of Statistical Learning Theory in recent years, shows many distinctive advantages in solving small sample, non-linear and higher-dimension pattern recognition problem.The main application fields of support vector machine has pattern-recognition, approximation of function and probability density to estimate or the like that this algorithm cloud can be preferentially adopted in these fields.
The fundamental purpose of algorithm of support vector machine cloud is, sets up a kind of model, and sample vector is mapped to high latitude space, and structure optimal classification face obtains linear optimal decision function in higher dimensional space.The over-fitting of inhibition function is measured in interval by the control lineoid, simultaneously by having used kernel function to solve problem of dimension cleverly, and the directly related sample dimension of complexity of having avoided learning method to calculate.Optimal classification lineoid wherein makes classifying distance big as far as possible guaranteeing that sample does not have under the misclassification situation, and guarantees in the empiric risk minimum, makes fiducial range minimum in the boundary of generalization, thereby guarantees the real risk minimum.
Association rule algorithm cloud: correlation rule modeling service that the major function of this cloud provides and prediction service.The association rule algorithm purpose is to solve a class problem of excavating the relevance between the collection on the big transaction data set (TDS).The correlation rule analysis is a big generic task in the machine learning.It originates from the analysis to dichotomic variable, expresses two relations between the dichotomic variable with the mode of rule, and the relation between a plurality of dichotomic variable.Certainly, later development also makes correlation rule not only be confined to dichotomic variable, also can analyze many classified variables and continuous variable.So correlation rule can be regarded as between the situational variables and concerns, and this relation table is reached the method for the rule of be very easy to explaining.The correlation rule analytical approach distributes to data and does not do any requirement, and the result of gained is complete in data, without any the subjectivity supposition, has reflected the essence of data objectively, and very strong cogency is arranged.The result that correlation rule obtains data analysis can be regarded as in the data between variable regular summary.Therefore correlation rule is after proposing, obtained a large amount of application in all trades and professions, particularly in the machine learning modeling in the extremely huge fields such as astronomical meteorological biology of market analysis, prestige assessment, commodity price analysis, intrusion detection and quantity of information, can preferentially adopt this algorithm cloud.
The fundamental purpose of association rule algorithm cloud is that the model of setting up model can solve following problem: the pattern of the incidence relation between different objects or the description of form; Improve related rate calculated and reduce storage space; Association analysis in mass data etc.
Initial modeling cloud: this cloud is uploaded sample, basic representation method, is determined that interpretation of result method, usable range, expectation territory etc. carry out initialization what the user provided, obtains initial model.
Cloud is generally estimated in the search volume: the major function of this cloud provides feasible solution and outstanding estimation position of separating, both obtained the hunting zone that is complementary with problem description in the possible space, get rid of the outstanding space of separating of to be born as much as possible, thereby improve search efficiency, reduce calculated amount.
Method is found cloud: the major function of this cloud is to select the suitable machine learning algorithm to set up model.By generally estimating and initial calculation, predictability is selected certain or certain several machine algorithm cloud.
The EM algorithm supports cloud: support vector machine modeling service that the major function of this cloud provides and prediction service.A lot of algorithms all will carry out the parameter estimation of model in the machine learning, just will carry out maximum likelihood estimation or maximum posteriori likelihood and estimate.When but the variable in the model was the Direct observation variable, maximum likelihood or maximum posteriori likelihood were obvious.But when some variable is hidden, carry out maximum likelihood and estimate that just very complicated being difficult to directly obtains.Exist under the situation of latent variable, model parameter is carried out estimation approach to be had a variety ofly, and a kind of popular maximum likelihood method of estimation is the Expectation-Maxi2mization algorithm, abbreviates the EM algorithm usually as.It is not directly the posteriority of complexity to be distributed to maximize or carry out analog computation, but adds some potential data on the basis of observed data, calculates and finish a series of simple maximizations or simulation thereby simplify.The EM algorithm be a kind of from non-complete data the maximum likelihood method of estimation of solving model parameter.Non-complete data generally is divided into two kinds of situations: a kind of is because the restriction or the mistake of observation process itself, as human error, the fragmentary data that obtains such as be difficult to measure; A kind of to be that the likelihood function of parameter is directly optimized very difficult, and introduce extra parameter, as parameter implicit or that lose.So to its optimization method is that definition raw observation data add excessive data composition " complete data ", the raw observation data just become " fragmentary data " naturally.
The valuation functions cloud: the major function of this cloud be reflection when setting up machine learning model with the degree that conforms to of target, and to the assessment of established model.According to the characteristic and the historical experience regulation valuation functions of each algorithm, whether good by inspection performance on training data on the other hand on the one hand, independently testing on the test data again.Test data wherein is to break away from the modelling algorithm, only participates in prediction and judges.
Calculate cloud: the major function of this cloud is to give full play to the advantage of cloud computing, under ultra-large distributed environment, utilizes the calculated performance, data storage and the network service that provide to come the magnanimity computing that machine learning needs is calculated.The calculating advantage of parallel computation, Distributed Calculation and grid computing has been given full play in cloud computing, and calculation services can well be provided.
Machine learning algorithm expands cloud: the major function of this cloud provide machine learning algorithm can't satisfy user's needs the time, be User Defined or platform itself interface from the upgrading reservation.Machine learning algorithm expands cloud according to the new learning algorithm of certain rule structure on the one hand, and this cloud is responsible for getting in touch other cloud and module on the other hand, thus the use that the new learning algorithm of feasible structure can be complete.
Web interactive interface module: the major function of this cloud provides interactive interface.Cloud computing support the user at an arbitrary position, use various terminals to obtain application service.Institute's requested resource is from " cloud ", rather than fixing tangible entity.Be applied in somewhere operation in " cloud ", but in fact the user need not to understand, also do not worry using the particular location of operation, that is to say at the user it is transparent.Only need a notebook or a mobile phone, just can realize all that we need, even comprise the task that supercomputing is such by the network service.Therefore, be best mode alternately by the Web interface, the user needn't be concerned about operation and the computing that carry out on the backstage, only needs to be concerned about the information of input and the result of output.This module and processing and user's interaction problems.
Machine learning input/output module: support vector machine modeling service that the major function of this cloud provides and prediction service.For the mathematical modeling of machine learning provides feasible input sample and parametric description, in fact also comprised the pre-service work of numerous and complicated.As much as possible that different recording form, different custom, different time weak point, diverse location, different data set is right, the data centralization of different ill-formalnesses, integration, cleaning.Usually to unitize, datumization, format conversion is carried out operations such as missing values processing, noise data processing, data scrubbing, data integration, data conversion, data reduction
The cloud administration module: the major function of this cloud is startup, execution and the monitor state of each module of management.Cloud computing is because its ultra-large property, generally have hundreds of thousands of station servers, large enterprise even have the hundreds of thousands station server, and at user transparent, this all needs a large amount of bookkeepings, control the ruly operation of each module, scheduling and allocating task are rationally utilized storage, calculating, bandwidth resources.
Two, method flow
1, builds and operational scheme
1. the user at first installs and starts the cloud administration module, increases successively by administration module that decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud, initial modeling cloud, search volume estimate generally that cloud, method find that cloud, EM algorithm support cloud, valuation functions cloud, calculate cloud, machine learning algorithm expands cloud then and Web interactive interface module, machine learning input/output module.
2. start Web interactive interface module, wait for user's use.When the user set up the machine learning model request by the submission of Web interactive interface, the cloud administration module will start and call other cloud module, the mathematical modeling that carries out machine learning.
3. call initial modeling cloud, upload sample, basic representation method, determine that interpretation of result method, usable range, expectation territory etc. carry out initialization, obtain initial model what the user provided.
4. operation method is found cloud, compares according to the historical typical example of information of same that the user provides, and determines because of taking the machine learning algorithm of which kind of or which kind.This cloud module is accompanied by the subsequent step operation, thereby adjusts constantly according to each stage result of calculation.
5. the cloud administration module is by input machine learning input/output module, with the data of user by Web interactive interface input unitize, after the datumization, carry out operations such as missing values processing, noise data processing, data scrubbing, data integration, data conversion, data reduction successively, so that obtain the intermediate result that general algorithm can use.
6. starting the valuation functions cloud prepares to judging the quality that machine learning is separated.This step mainly is to formulate the specific algorithm of estimated performance, whether uses verification methods such as cross validation and leaving-one method, bootstrap method.
7. call the EM algorithm and support cloud, solution space is carried out maximal possibility estimation, calculate optimum solution or more excellent approximate location in solution space of separating, increase search efficiency.
The EM algorithm basic principle can be expressed as follows: can observed data be y, complete data x=(y, z), z is a hidden variable, the expression missing data, θ is a model parameter.θ about the posteriority distribution p (θ | y) very complicated, be difficult to carry out various different statistical computations.If z is known, then may obtain a simple interpolation posteriority distribution p about θ (θ | y, z), utilize p (θ | y, simplicity z) can be carried out various statistical computations.Then, can the supposition of z be conducted a survey and improve again, thereby with the maximization of a complexity or the problem reduction of sampling.
As can be seen, the EM algorithm is a kind of alternative manner, is mainly used in to ask the posteriority mode of distribution.
The specific implementation step is as follows: suppose that y is the non-complete observation data collection of obeying a certain distribution, and there is a complete data collection x=(y, z), then the density function of x is: p (x| θ)=p (y, z| θ)=p (z|y, θ) p (y| θ) therefrom as can be seen, density function p (x| θ) be by marginal density function p (θ | y), hypothesis, parameter θ initial estimate and the hidden variable z of hidden variable z and the decision of the relation between the observational variable y.
8. after preliminary work is finished, carry out the modeling process of machine learning, automatic judgement by above step, call one or several concrete machine learning cloud modules respectively and learn, comprise decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud.As User Defined machine learning algorithm expand cloud, then preferentially call machine learning algorithm and expand cloud.
2, machine learning modeling flow process
1. decision Tree algorithms modeling
Decision tree can be regarded a tree-shaped forecast model as, and it comes classified instance by example is aligned to certain leaf node from root node, and leaf node is the classification under the example, shown in Fig. 3 decision tree basic configuration figure.The key problem of decision tree is to select the beta pruning of Split Attribute and decision tree.The algorithm of decision tree has a lot, and ID3, C4.5, CART or the like are arranged.These algorithms all adopt top-down greedy algorithm, and the best attribute of each node selection sort effect is 2 or a plurality of child node with node splitting, continue this process up to this tree classification based training collection exactly, or all properties are used all.As wherein classification regression tree (CART) is a kind of classification and regression algorithm in the machine learning.If training sample set L={X 1, X 2, X 3... X n, Y}, wherein, X i(i=1,2,3 ..., n) be called attribute vector; Y is called label vector or categorization vector.When Y is orderly quantitative value, be called regression tree; When Y is discrete value, be called classification tree.At the root node place of tree, search problem collection (data acquisition space) finds the optimum division variable and the corresponding division threshold value that make that the non-purity of data set descends maximum in the child node of future generation.
Non-here purity index is weighed with the Gini index, and it is defined as:
Figure G2010100179189D00081
Wherein, i (t) is the Gini index of node t, and p (i/t) is illustrated in the shared ratio of sample that belongs to the i class among the node t, and p (j/t) is the shared ratio of sample that belongs to the j class among the node t.Divide variable and divide threshold value with this root node t 1Split into t 2And t 3If, at certain node ti place, the remarkable reduction of further non-purity can not be arranged again, then this node ti becomes leaf node, otherwise continues to seek its optimum division variable and divide threshold value and divide.For classification problem, in leaf node, have only a class, this class is just as the class under the leaf node so, if there is the sample in a plurality of classes to exist in the node, determines classification under the node according to that maximum class of sample in the leaf node; For regression problem, then get the mean value of its quantitative value.Clearly, the very big undue fitting data of tree possibility, but less tree possibly can't catch important structure again.The best size of tree is the adjustment parameter of controlling models complicacy, and it should be by the selection of data adaptive.A kind of desirable strategy is to increase a bigger tree t 0, only when reaching minimum node size (such as 3), just stop fission process.The method of utilizing beta pruning strategy and 5 foldings or 10 folding cross validations to combine is then pruned this tree, thereby some noises and interfering data are got rid of, and obtains optimal tree.Thereby set up the mathematical model of decision tree.
2. genetic algorithm modeling
For little space, the classical method of exhaustion is just enough; And, then need to use special artificial intelligence technology to large space.Genetic algorithm (Genetic Algorithm) is a kind of in these technology, it be an analoglike biological evolution process and produce by the global optimizing algorithm of selecting operator, hybridization operator and three basic operators of mutation operator to form.It by selecting operator to select the good male parent of proterties, hybridizes computing by the hybridization operator from an initial family, and mutation operator carries out a little variation, the random search model space under certain rule of probability control.Evolution generation upon generation of reaches the requirement of setting up to the error functional value of finally separating correspondence.
The t time iteration, genetic algorithm is kept a potential colony that separates
Figure G2010100179189D00091
Each separates x 1 tThe evaluation function evaluation evaluation of using the valuation functions cloud to obtain.Then by selecting more suitable individuality (t+1 iteration) to form a new colony.The member of new colony carries out conversion by hybridization and variation, forms new separating.Cross combination the feature of two parent chromosomes (promptly waiting to ask the binary coding string of parameter), formed two similar offsprings by exchange parent corresponding segment.For example parent chromosome is (a 1, b 1, c 1, d 1, e 1) and (a 2, b 2, c 2, d 2, e 2), behind second gene, to hybridize, the offspring of generation is (a 1, b 1, c 2, d 2, e 2) and (a 2, b 2, c 1, d 1, e 1).The purpose of hybridization operator is to carry out message exchange between separating in that difference is potential.Variation is to change one or more genes (bit in the chromosome) on the selected chromosome randomly by the probability that equals aberration rate with.Some extra variabilities are introduced in being intended that to colony of mutation operator.Modeling process is shown in Fig. 4 genetic algorithm basic process figure.This process has been set up the genetic algorithm mathematical model thus.
3. Bayesian statistics modeling
Bayes is that the modeling of Bayes statistical method is a kind of method for classifying modes under the situation of known prior probability and class conditional probability.The classification results for the treatment of the branch sample of its processing depends on all of sample in each class field.If
Training sample set is divided into the M class, is designated as C={c 1, c 2..., c t..., c M, the prior probability of every class is P (c i), i=1,2 ..., M.When sample set is very big, can think P (c i)=c iSample number/total sample number.Treat branch sample X for one, it belongs to c iThe class conditional probability of class is P (X/c i), then according to the Bayes theorem,
Can obtain c iPosterior probability P (the c of class i/ X)=P (X/c i) P (c i)/P (X) P (ci/X).If P (c i/ X)=MaxjP (c j/ X), and i=1,2 ..., M, j=1,2 ..., M then has X ∈ c i, maximum a posteriori probability decision rule that Here it is.The Bayes sorting technique is proved relatively fully in theory, also is very widely on using.Overall probability distribution and the probability distribution function of all kinds of samples (or density function) usually are ignorant.In order to obtain them, just require sample enough big.In addition, when being used for text classification, the Bayes method requires the descriptor of expression text separate, and such condition general being difficult in actual text satisfied, so this method often is difficult to reach theoretic maximal value on effect.The prior probability by being based upon statistical and this method of class conditional probability can be set up Bayesian statistical model.
4. artificial neural network algorithm modeling
Artificial neural network (Artificial Neural Network. is called for short ANN) is the neural network that can realize certain function of manual construction on the basis that the mankind understand its cerebral nerve network understanding just.It is the human brain neural network's that theorizes a mathematical model, is based on imitation cerebral nerve network structure and function and a kind of information handling system of setting up.It is actually by a large amount of simple components and interconnects the complex network that forms, and has the non-linear of height, can carry out the system that complicated logical operation and nonlinear relationship realize.
Artificial neural network is organized by layer, and each layer is made up of a plurality of artificial neurons, does not have connecting line to connect between them, and layer with layer between be connected by connecting line.Artificial neural network can have individual layer, also multilayer can be arranged, and at present commonly used have an individual layer, two layers and three layers.
Artificial neural network is divided into two kinds of frame modes according to artificial neuron's data flow mode: forward direction type and feedback-type.If the artificial neuron metadata less than feedback, is not referred to as the forward direction type from being input to the output uniflux,, then be referred to as feedback-type if feedback (no matter feeding back to this neuron or other neuron of same layer) is arranged.
The artificial neuron is the base unit of artificial neural network, and the artificial neuron can have multiple model, but has a kind of horizontal substantially type the most common, and it is composed as follows:
I. input: an artificial neuron can have a plurality of inputs.
II. output: an artificial neuron can only have an output.
III. inner structure: will import addition with totalizer, and add deviate then, and calculate it with activation function then, the neuronic output of result calculated conduct
A. totalizer: will import linear, additive, exactly, be that the product with input and corresponding weight value sums up.
B. deviate: the value that totalizer produced is subjected to external disturbance and influence and produces deviation through regular meeting, therefore needs a deviation to adjust, and generally is used for θ k to represent k neuronic deviate.
C. activation function: be used for limiting the scope of neuron output value, generally-1~+ 1 or 0~1.
Activation function commonly used has Logistic, Simoid etc.
Link to each other with connecting line between the artificial neuron, every connecting line all has weights, and as mentioned above, the inner totalizer of the target nerve of connecting line unit can use these weights when summing up.Use ω IjRepresent the weights of i neuron to connecting line between j neuron.
Artificial neural network has learning functionality, and this study is trained it with real data sample exactly.A data sample has the input and output data, with of the input of input data as artificial neural network, the relatively output of artificial neural network and the output of sample then, by adjusting the parameter (being the weights and the neuronic deviation of connecting line) in the artificial neural network, make that both difference is 0 or within the acceptable scope.
Trained artificial neural network has certain judgement and inferential capability, and can carry out certain prediction and decision-making.Reflections propagate model (BP, Back Propagation) is the modal a kind of model of artificial neural network, has the application more than half of surpassing to adopt this model.It is a multilayer forward direction type structure, is made up of following three parts:
I. input layer: have only one deck, form by m neuron, receive extraneous m input xi (i=1,2 ..., m), each input links to each other with a neuron.The basic neuron of neuron right and wrong of this one deck does not have inner structure, and the value of its output is exactly the value of input.
II. hide layer: multilayer can be arranged, be made up of n neuron for every layer, these neurons are exactly the basic neuron of introducing previously.
III. output layer: having only one deck, be made up of p neuron, also is basic neuron.
It is the connection of multi-to-multi that the neuron that (comprises between a plurality of hiding layers) between above-mentioned each layer connects, and the input and output layer is man-to-man the connection with the external world, shown in Fig. 5 artificial neural network basic block diagram.
Substantially neuronic activation function adopts the Logistic function, and expression formula is:
O j = 1 1 + e - I j
Algorithm divides following step:
Calculate the input value of each the neuronic j that hides layer and output layer, thus the output valve of calculating:
A) input:
Figure G2010100179189D00112
I is that preceding one deck all and neuron j have the neuron that connects in the formula.
B) output: adopt Logistic function calculation output valve.
Calculate the error of output layer neuron j:
E rrj=O i(1-O j)(T j-O j)
T in the formula jBe sample class label.
Calculate the error of hiding each neuron j of layer:
E rrj = O j ( 1 - O j ) Σ k E rrk ω jk
In the formula k be the back one deck all with neuron j the neuron that is connected, E are arranged RrkThen be these neuronic errors.
Each connecting line weights ω in the computational grid IjModified value:
Δω ij=(l)E rriO j
(l) is the learning rate of algorithm in the formula, and this value is formulated voluntarily by the trainer.The selection of learning rate helps to seek the minimum weights of the overall situation, selects too for a short time, and learning process can be carried out very slowly, and is too big, may appear at swing between unsuitable the separating.Generally can select a constant between (0,1), empirical value commonly used is 1/t, and t is a number of iterations.
Calculate the new weights of this connecting line then, and revise it:
ω ij=ω ij+Δω ij
Hide the modified value of each neuron deviate in layer and the output layer in the computational grid:
Δθ j=(l)E rrj
Calculate the new deviate of this neuron then:
θ j=θ j+Δθ j
2) check end condition, several are generally arranged, as:
A) Δ ω iWith Δ θ jAll enough little, less than a certain designated value;
B) iterations has reached specified quantity.
This process has been set up the mathematical model of artificial neural network, by the neural network model that trains, can calculate the input sample, thereby obtain predicted value.
5. algorithm of support vector machine modeling
The initial thought of support vector machine is how to seek the optimal classification face for the linear separability problem, for feature space neutral line separable problem, the optimal classification face is exactly the interphase of interval γ maximum, according to the analysis of above-mentioned nuclear theory as can be known, it really under guaranteeing that sample is by the prerequisite of correctly classifying, have the interphase of best generalization ability.For the inseparable problem of feature space neutral line, can take all factors into consideration at interval and the influence of relaxation factor by a penalty factor.
The lineoid of a usefulness feature space of consideration is done the problem of two-value classification to given training dataset.For given sample point: (x 1, y 1) ..., (x l, y l), x i∈ R n, y i{ 1 ,+1} is vector x wherein for ∈ iMay be to extract the directly vector of structure of some feature from the object samples collection, also may be original vector be mapped to mapping vector in the nuclear space by certain kernel function.In feature space, construct segmentation plane:
(wx)+b=0 is feasible:
( w · x i ) + b ≥ 1 y i = 1 ( w · x i ) + b ≤ - 1 yi = - 1 ⇔ y i [ ( w · x i ) + b ] ≥ ( i = 1,2 , . . . , l )
Can calculate, the minor increment of the segmentation plane that training dataset to is given is:
p ( w , b ) = min { x i | y i = 1 } w · x i + b | w | - max { x i | y i = - 1 } w · x i + b | w | = 2 | w |
To optimizing the definition of segmentation plane, the problem of finding the solution to this plane can be reduced to as can be seen: under the situation of the formula of satisfying condition (3), calculating can maximize p (w, the normal vector w of segmentation plane b) and side-play amount b according to SVM.People such as Vapnik prove:
The normal vector w of cutting apart lineoid 0Be the linear combination of all training set vectors.Be w 0Can be described as:
Figure G2010100179189D00123
Definition discriminant function f (x)=w 0X+b 0Then the classification function of test set can be described as: label (x)=sgn (f (x))=sign (w 0X+b 0)
Under the situation of linear separability, all should satisfy all training samples | f (x) | 〉=1, hereinafter, we are satisfying | f (x) |<1 zone calls cuts apart the pairing borderline region of lineoid.
The finding the solution of optimum segmentation plane is equivalent under the former constraint below the maximization
Introduce Lagrange multiplier α i, i=1,2 ..., l, and definition
Figure G2010100179189D00125
Use the Wolfe antithesis fixed
MaxW ( α ) = Σ i α i - 1 2 w ( α ) · w ( α )
subject to α i ≥ 0 , Σ i a i y i = 0
Reason is converted into its dual problem to the problems referred to above:
For the inseparable training set of linearity, can introduce slack variable ξ i, be rewritten as following:
Min ( 1 2 | | w | | 2 + C Σ i ξ i )
Subject?to?y i(w·x i+b)≥1-ξ i,ξ i≥0
Similarly can obtain corresponding dual problem:
MaxW ( α ) = Σ i α i - 1 2 w ( α ) · w ( α )
subject to 0 ≤ α i ≤ C , Σ i a i y i = 0 ,
Finding the solution of this form is a typical constrained quadratic form optimization problem, the derivation algorithm that a lot of maturations have been arranged, in recent years, V.Vapnik, C.Burges, E.Osuna, T.Joachims, people's such as J.Platt a series of activities makes the algorithm of support vector machine of extensive training set is realized becoming possibility.
By the mathematical model that above description is set up, can seek out those automatically has the support vector of better separating capacity to classification, and the sorter that constructs thus can maximize the interval of class and class, thereby adaptive faculty and higher differentiation rate are preferably arranged.This method only need decide last classification results by the classification of the boundary sample of each class field, has finally set up the support vector machine mathematical model.
6. association rule algorithm modeling
Association rule mining is in a class problem of excavating the relevance between the collection on the big transaction data set (TDS).The correlation rule analysis is a big generic task in the machine learning.It originates from the analysis to dichotomic variable, expresses two relations between the dichotomic variable with the mode of rule, and the relation between a plurality of dichotomic variable.Certainly, later development also makes correlation rule not only be confined to dichotomic variable, also can analyze many classified variables and continuous variable.So correlation rule can be regarded as between the situational variables and concerns, and this relation table is reached the method for the rule of be very easy to explaining.
The correlation rule analytical approach distributes to data and does not do any requirement, and the result of gained is complete in data, without any the subjectivity supposition, has reflected the essence of data objectively, and very strong cogency is arranged.The result that correlation rule obtains data analysis can be regarded as in the data between variable regular summary.Therefore correlation rule has obtained a large amount of application in all trades and professions after proposing.
The algorithm of correlation rule is exactly by the solution procedure of input to output.If I={i 1, i 2..., i mBe m different item destination aggregation (mda), element wherein is called (Item).Note D is the set of item for the set of transaction T (Transaction), the T that concludes the business here, and
Figure G2010100179189D00133
Corresponding each transaction has unique sign, and as Transaction Identification Number, note is made TID.Correlation rule be shape as
Figure G2010100179189D00134
Implications, here,
Figure G2010100179189D00136
And X ∩ Y=θ.X is called the prerequisite of rule, and Y is the result.Rule Support in transaction set D (Support) is meant the ratio of the number of deals that comprises X and Y and All Activity number, is designated as
Figure G2010100179189D00138
Promptly
Figure G2010100179189D00139
Rule
Figure G2010100179189D001310
Confidence level in transaction set D (confidence) is meant the ratio of number of deals that comprises X and Y and the number of deals that comprises X, promptly
Figure G2010100179189D001311
A given transaction set D excavates the correlation rule problem and is exactly and produces support and confidence level respectively greater than the correlation rule of given minimum support of user (Minsupp) and minimum confidence level (Minconf), is called strong rule.
The task of association rule mining is exactly to excavate strong rules all among the data set D.Strong regular X] Item Sets (X ∪ Y) of Y correspondence must be collection frequently, collection (∪ Y) correlation rule of deriving frequently
Figure G2010100179189D001312
Degree of confidence can calculate with the support that frequently collect X and (X ∪ Y).
The mathematical model that contains description rule by above process obtains is the correlation rule modeling.
Beneficial effect: because networks development, information is explosive increase, how effectively to utilize these letters, and uses these information to boost productivity to become the problem that presses for solution.Present present situation is to have only few part can be by correct use in the information that can effectively obtain in a large number, the information that has consumed ample resources not only can not used effectively, and because Useful Information deeper is buried among the garbage, becoming more is difficult to utilize.Machine learning is to solve one of effective ways of this class problem.Along with going deep into and the specifically expansion of application of machine learning research, a large amount of machine learning modeling mission requirements have been brought.It is same because machine learning is of a great variety, the machine learning algorithm that need adapt at concrete problem description could be set up the mathematical model of compound preferably question essence feature, has often spent the machine learning model that the plenty of time seeks and can not well reflect objective reality.
Waste time and energy at the model that specific tasks are based upon on the machine learning basis,, need select according to the personal experience because the distinctiveness of specific tasks details is difficult to directly use for reference the machine learning model that other have built.Even correctly selected relatively to meet the machine learning algorithm of objective fact essence, how complicated parameter is set, also need rule of thumb or the long computing of subscriber computer obtains, single user's computing power is difficult to deal with problems fast.Simultaneously, the user need learn and use concrete machine learning software, the machine learning algorithm numerous and complicated, and user's autonomous learning need spend the plenty of time, each task that suitable user surely need solve and certain some algorithm of user's autonomous learning also differ.
The solution route that this programme provides makes full use of the strong cloud computing platform of computing power on the one hand, the computational problem of complexity when solving machine learning, utilize simple and easy usability, the transparency of cloud computing on the other hand at the user, solved the machine learning algorithm that domestic consumer is difficult to select to meet objective reality, thereby foundation can solve the machine learning model of practical problems fast, and finds suitable parameters as far as possible automatically.
Description of drawings
Fig. 1 machine learning modeling cloud computing flowchart,
Fig. 2 module relation diagram,
Fig. 3 decision tree basic configuration figure,
Fig. 4 genetic algorithm basic process figure,
Fig. 5 artificial neural network basic block diagram,
Embodiment
The present invention is the autonomous system of selection of a kind of machine learning based on cloud computing environment.By using cloud computing platform, the user need not to build the running environment of machine learning, also need not to select machine learning algorithm, more need not adjust the machine learning function of numerous and complicated and subsidiary parameter thereof, only need to use the Web mode to upload sample data, with regard to the machine learning mathematical model of setting up realistic problem of energy automated intelligent.The present invention makes the use of machine learning break away from the constraint of environment, has brought into play the advantage of cloud computing platform, makes the machine learning modeling at user transparent, has farthest reduced the use threshold of machine learning.Shortcomings such as the artificial experience that is difficult to predictability, parameter adjustment that modeling is selected, domestic consumer's difficulty have been solved when the practical application machine learning.The final platform of setting up can put together all computational resources fully in conjunction with the cloud computing advantage, realizes management automatically by software.In data analysis process, it integrates historical data and available data, makes the information of collecting more accurate, can provide the intelligence service for machine learning.The user no longer needs to be concerned about how to buy server, machine learning software according to the business demand of oneself, as long as the demand of basis oneself just can obtain the machine learning achievement by cloud computing platform, obtains the machine learning mathematical model, is used to solve practical problems.
Concrete steps are:
1. under the uniform dispatching of cloud administration module, at first by the Web interactive interface, obtain the required rough description of dealing with problems of user, comprise the problem kind, big class under promptly selecting is as selecting from expert system, cognitive simulation, planning and problem solving, data mining, the network information service, pattern recognition, fault diagnosis, natural language understanding, robot and game, other classification;
2. enable initial modeling cloud,, enter different subclass interfaces, fill in corresponding more detailed information by the big class that the user in the step 1 provides, comprise carry out that sample is uploaded, selected method for expressing, determines the interpretation of result method, usable range, expectation territory etc.
3. startup method is found cloud, compares according to the historical typical example of information of same that the user provides, and determines because of taking the machine learning algorithm of which kind of or which kind.This cloud module is accompanied by the subsequent step operation, thereby adjusts constantly according to each stage result of calculation.
4. then with the information of step 2 kind of user input, input machine learning input/output module, must unitize, after the datumization, carry out operations such as missing values processing, noise data processing, data scrubbing, data integration, data conversion, data reduction successively, so that obtain the intermediate result that general algorithm can use.
5. start the valuation functions cloud, set up valuation functions in the information of step 2 input, the quality judgement that machine learning is separated is prepared according to the user.This step mainly is to formulate the specific algorithm of estimated performance, whether uses verification methods such as cross validation and leaving-one method, bootstrap method.
6. call the EM algorithm simultaneously and support cloud, solution space is carried out maximal possibility estimation, calculate optimum solution or more excellent approximate location in solution space of separating, increase search efficiency.
7. after arriving this step, illustrate that preliminary work finishes, be about to carry out the training process of machine learning, automatic judgement by above step, call one or several concrete machine learning cloud modules respectively and learn, comprise decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud.As User Defined machine learning algorithm expand cloud, then preferentially call machine learning algorithm and expand cloud.
8. after decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud and machine learning algorithm expand the cloud startup, constantly as achievement in the middle of cloud administration module, calculating cloud, EM algorithm support cloud feedback result and the acquisition, thereby adjust self strategy automatically, approach outstanding separating.Pass through the Web interactive interface simultaneously to field feedback, comprise the step of calculating operation, the intermediate result that obtains, current optimum solution variation etc.
9. support in the process that iterates of cloud at the EM algorithm, constantly turn back to step 6, step 7 is calculated, the valuation functions of using step 5 to formulate obtains Performance Evaluation, thereby prediction algorithm is judged the outstanding degree of separating, this step needs a large amount of computational resources, thereby need utilize the calculating advantage of cloud computing, must calculate outstanding separating as far as possible.
10. when end condition satisfies, as computing time to, some generations iteration do not have more excellent separate or the iteration of algorithm own finishes, result of calculation is converted to information by the machine learning input/output module with readability, return the client by the Web interactive interface again, and provide detailed data to download, preserve the machine learning result simultaneously,, avoid double counting so that reuse.

Claims (1)

1. automatic selection method for machine learning in cloud computing environment is characterized in that the step that this method comprises is:
Step 1) is under the uniform dispatching of cloud administration module, at first by the Web interactive interface, obtain the required rough description of dealing with problems of user, comprise the problem kind, big class under promptly selecting, from expert system, cognitive simulation, planning and problem solving, data mining, the network information service, pattern recognition, fault diagnosis, natural language understanding, robot and game, other classification, select
Step 2) enable initial modeling cloud,, enter different subclass interfaces, fill in corresponding more detailed information by the big class that the user in the step 1) provides, comprise carry out that sample is uploaded, selected method for expressing, determines the interpretation of result method, usable range, expectation territory,
Step 3) startup method is found cloud, compares according to the historical typical example of information of same that the user provides, and determines because of taking the machine learning algorithm of which kind of or which kind; This cloud module is accompanied by the subsequent step operation, thereby adjusts constantly according to each stage result of calculation,
The information that step 4) is imported step 2 kind of user then, input machine learning input/output module, must unitize, after the datumization, carry out operations such as missing values processing, noise data processing, data scrubbing, data integration, data conversion, data reduction successively, so that obtain the intermediate result that general algorithm can use
Step 5) starts the valuation functions cloud, according to the user in step 2) information of input sets up valuation functions, the quality that machine learning is separated judges and prepares, thereby the specific algorithm performance is predicted,
Step 6) is called the EM algorithm simultaneously and is supported cloud, and solution space is carried out maximal possibility estimation, calculates optimum solution or more excellent approximate location in solution space of separating, and increases search efficiency,
After step 7) arrives this step, illustrate that preliminary work finishes, be about to carry out the training process of machine learning, automatic judgement by above step, call one or several concrete machine learning cloud modules respectively and learn, comprise decision Tree algorithms cloud, genetic algorithm cloud, Bayesian statistics algorithm cloud, artificial neural network algorithm cloud, algorithm of support vector machine cloud, association rule algorithm cloud; As User Defined machine learning algorithm expand cloud, then preferentially call machine learning algorithm and expand cloud,
Step 8) is calculated through above step, selects one or several algorithm clouds, with its startup, passes through the Web interactive interface simultaneously to field feedback, comprises the step of calculating operation, the intermediate result that obtains, and current optimum solution changes,
Step 9) supports in the process that iterates of cloud at the EM algorithm, constantly turn back to step 6, step 7 is calculated, judge whether to reach end condition simultaneously, if reach end condition then jump procedure 10, otherwise the performance prediction algorithm that uses step 5 to formulate is judged the outstanding degree of separating, this step needs a large amount of computational resources, thereby need utilize the calculating advantage of cloud computing, must calculate outstanding separating as far as possible
When step 10) satisfies at end condition, as arriving computing time, there be not more excellent separating or the iteration end of algorithm own, result of calculation is converted to information by the machine learning input/output module with readability, return the client by the Web interactive interface again, and provide detailed data to download, preserve the machine learning result simultaneously, so that reuse, avoid double counting.
CN 201010017918 2010-01-15 2010-01-15 Automatic selection method for machine learning in cloud computing environment Active CN101782976B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010017918 CN101782976B (en) 2010-01-15 2010-01-15 Automatic selection method for machine learning in cloud computing environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010017918 CN101782976B (en) 2010-01-15 2010-01-15 Automatic selection method for machine learning in cloud computing environment

Publications (2)

Publication Number Publication Date
CN101782976A true CN101782976A (en) 2010-07-21
CN101782976B CN101782976B (en) 2013-04-10

Family

ID=42522964

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010017918 Active CN101782976B (en) 2010-01-15 2010-01-15 Automatic selection method for machine learning in cloud computing environment

Country Status (1)

Country Link
CN (1) CN101782976B (en)

Cited By (85)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102468975A (en) * 2010-11-16 2012-05-23 苏州搜能信息科技有限公司 Resource management simulation cloud computing system for cloud computing mobile network and application system thereof
CN102509177A (en) * 2011-11-11 2012-06-20 国网信息通信有限公司 Locally weighted linear regression projection operation method based on cloud platform
CN102523246A (en) * 2011-11-23 2012-06-27 陈刚 Cloud computation treating system and method
CN102724298A (en) * 2012-05-25 2012-10-10 清华大学 Method for configuring storage parameter under cloud environment
CN102799512A (en) * 2012-07-04 2012-11-28 南京邮电大学 Virtual machine monitoring method in vector-autoregression-based cloud computing
CN103036745A (en) * 2012-12-21 2013-04-10 北京邮电大学 Anomaly detection system based on neural network in cloud computing
CN103139278A (en) * 2011-12-05 2013-06-05 北京网康科技有限公司 Network resource pre-fetching and cache accelerating method and device thereof
CN103605695A (en) * 2013-11-05 2014-02-26 佛山职业技术学院 Internet based artificial-intelligence knowledge logic system and method thereof
CN104951425A (en) * 2015-07-20 2015-09-30 东北大学 Cloud service performance adaptive action type selection method based on deep learning
CN105075180A (en) * 2013-02-05 2015-11-18 思科技术公司 Dynamically adjusting a set of monitored network properties using distributed learning machine feedback
WO2015184729A1 (en) * 2014-06-05 2015-12-10 Tsinghua University Method and system for hyper-parameter optimization and feature tuning of machine learning algorithms
CN105320835A (en) * 2014-07-15 2016-02-10 通用电气智能平台有限公司 Apparatus and method for time series data analysis method market
CN105808500A (en) * 2016-02-26 2016-07-27 山西牡丹深度智能科技有限公司 Realization method and device of deep learning
US9436507B2 (en) 2014-07-12 2016-09-06 Microsoft Technology Licensing, Llc Composing and executing workflows made up of functional pluggable building blocks
CN106067028A (en) * 2015-04-19 2016-11-02 北京典赞科技有限公司 The modeling method of automatic machinery based on GPU study
CN106250986A (en) * 2015-06-04 2016-12-21 波音公司 Advanced analysis base frame for machine learning
CN106416319A (en) * 2014-06-02 2017-02-15 高通股份有限公司 Deriving relationships from overlapping location data
CN106445988A (en) * 2016-06-01 2017-02-22 上海坤士合生信息科技有限公司 Intelligent big data processing method and system
CN106537423A (en) * 2014-07-12 2017-03-22 微软技术许可有限责任公司 Adaptive featurization as service
CN106528489A (en) * 2015-09-14 2017-03-22 上海羽视澄蓝信息科技有限公司 System for vehicle detection machine learning based on cloud computing
CN106779087A (en) * 2016-11-30 2017-05-31 福建亿榕信息技术有限公司 A kind of general-purpose machinery learning data analysis platform
CN107092962A (en) * 2016-02-17 2017-08-25 阿里巴巴集团控股有限公司 A kind of distributed machines learning method and platform
CN107255969A (en) * 2017-06-28 2017-10-17 重庆柚瓣家科技有限公司 Endowment robot supervisory systems
CN107291811A (en) * 2017-05-18 2017-10-24 浙江大学 A kind of sense cognition enhancing robot system based on high in the clouds knowledge fusion
CN107329445A (en) * 2017-06-28 2017-11-07 重庆柚瓣家科技有限公司 The method of robot behavior criterion intelligent supervision
CN107538492A (en) * 2017-09-07 2018-01-05 福物(上海)机器人科技有限公司 Intelligent control system, method and the intelligence learning method of mobile robot
CN107766940A (en) * 2017-11-20 2018-03-06 北京百度网讯科技有限公司 Method and apparatus for generation model
WO2018040561A1 (en) * 2016-08-31 2018-03-08 华为技术有限公司 Data processing method, device and system
CN107921262A (en) * 2015-08-26 2018-04-17 波士顿科学神经调制公司 For optimizing the machine learning of spinal cord stimulation
CN107977712A (en) * 2017-12-20 2018-05-01 四川九洲电器集团有限责任公司 Network type machine learning system
CN108009643A (en) * 2017-12-15 2018-05-08 清华大学 A kind of machine learning algorithm automatic selecting method and system
CN108228325A (en) * 2017-10-31 2018-06-29 深圳市商汤科技有限公司 Application management method and device, electronic equipment, computer storage media
US10026041B2 (en) 2014-07-12 2018-07-17 Microsoft Technology Licensing, Llc Interoperable machine learning platform
CN108343125A (en) * 2018-02-27 2018-07-31 浙江欧琳生活健康科技有限公司 The method and system of water under a kind of outlet device based on wireless control
CN108572610A (en) * 2017-03-13 2018-09-25 欧姆龙株式会社 Evaluation system, safety governor, medium capable of reading record and evaluation method
WO2018176215A1 (en) * 2017-03-28 2018-10-04 Oracle International Corporation Systems and methods for intelligently providing supporting information using machine-learning
CN108628669A (en) * 2018-04-25 2018-10-09 北京京东尚科信息技术有限公司 A kind of method and apparatus of scheduling machine learning algorithm task
CN108665072A (en) * 2018-05-23 2018-10-16 中国电力科学研究院有限公司 A kind of machine learning algorithm overall process training method and system based on cloud framework
CN108764267A (en) * 2018-04-02 2018-11-06 上海大学 A kind of Denial of Service attack detection method integrated based on confrontation type decision tree
CN108898229A (en) * 2018-06-26 2018-11-27 第四范式(北京)技术有限公司 For constructing the method and system of machine learning modeling process
CN108960433A (en) * 2018-06-26 2018-12-07 第四范式(北京)技术有限公司 For running the method and system of machine learning modeling process
CN109117266A (en) * 2018-07-13 2019-01-01 视云融聚(广州)科技有限公司 Video artefacts' intelligent training platform based on multi-layer framework
CN109240658A (en) * 2018-09-12 2019-01-18 郑州云海信息技术有限公司 A kind of method and device of software architecture selection neural network based
CN109325541A (en) * 2018-09-30 2019-02-12 北京字节跳动网络技术有限公司 Method and apparatus for training pattern
CN109376844A (en) * 2018-10-30 2019-02-22 银河水滴科技(北京)有限公司 The automatic training method of neural network and device recommended based on cloud platform and model
CN109409533A (en) * 2018-09-28 2019-03-01 深圳乐信软件技术有限公司 A kind of generation method of machine learning model, device, equipment and storage medium
JP2019046324A (en) * 2017-09-05 2019-03-22 ファナック株式会社 Controller
CN109635833A (en) * 2018-10-30 2019-04-16 银河水滴科技(北京)有限公司 A kind of image-recognizing method and system based on cloud platform and model intelligent recommendation
CN109635918A (en) * 2018-10-30 2019-04-16 银河水滴科技(北京)有限公司 The automatic training method of neural network and device based on cloud platform and preset model
WO2019080501A1 (en) * 2017-10-27 2019-05-02 平安科技(深圳)有限公司 Electronic apparatus, multi-model sample training method and system, and computer-readable storage medium
CN109711436A (en) * 2018-12-05 2019-05-03 量子云未来(北京)信息科技有限公司 A kind of artificial intelligence training pattern construction method, device and storage medium
CN110598777A (en) * 2019-09-03 2019-12-20 中国科学院深圳先进技术研究院 Data processing method and system based on end cloud cooperation
CN110653801A (en) * 2019-09-30 2020-01-07 哈尔滨工业大学 Guide control system of robot operating arm and flexible control and teaching learning method thereof
CN110679114A (en) * 2017-05-24 2020-01-10 国际商业机器公司 Method for estimating deletability of data object
CN110715953A (en) * 2019-09-18 2020-01-21 浙江大学 System and method for testing heat-conducting property of film material based on machine learning
CN110765163A (en) * 2019-10-17 2020-02-07 华普通用技术研究(广州)有限公司 Execution plan generation method for big data processing flow
TWI688838B (en) * 2017-10-06 2020-03-21 日商佳能股份有限公司 Control device, lithography device, measuring device, processing device, flattening device, and article manufacturing method
CN111079942A (en) * 2017-08-30 2020-04-28 第四范式(北京)技术有限公司 Distributed system for performing machine learning and method thereof
CN111177802A (en) * 2018-11-09 2020-05-19 安碁资讯股份有限公司 Behavior marker model training system and method
CN111210023A (en) * 2020-01-13 2020-05-29 哈尔滨工业大学 Automatic selection system and method for data set classification learning algorithm
CN111247539A (en) * 2017-10-19 2020-06-05 英国电讯有限公司 Algorithm integration
CN111338630A (en) * 2018-11-30 2020-06-26 上海寒武纪信息科技有限公司 Method and device for generating universal machine learning model file and storage medium
CN111461892A (en) * 2020-03-31 2020-07-28 支付宝(杭州)信息技术有限公司 Method and device for selecting derived variables of risk identification model
WO2020155594A1 (en) * 2019-02-02 2020-08-06 清华大学 Automated machine learning system and method for streaming data analysis for use in conceptual transfer
CN111523646A (en) * 2020-04-23 2020-08-11 国家开放大学 Remote education learning center intelligent perception network based on Internet of things and management method
CN111680717A (en) * 2020-05-12 2020-09-18 顺德职业技术学院 Product classification method and system on intelligent manufacturing production line based on deep learning
CN111684474A (en) * 2018-03-13 2020-09-18 欧姆龙株式会社 Arithmetic device, arithmetic method, and program thereof
CN111814864A (en) * 2020-07-03 2020-10-23 北京中计新科仪器有限公司 Artificial intelligent cloud platform system for mass spectrometry data and data analysis method
CN112288133A (en) * 2020-09-28 2021-01-29 珠海大横琴科技发展有限公司 Algorithm service processing method and device
CN112671757A (en) * 2020-12-22 2021-04-16 无锡江南计算技术研究所 Encrypted flow protocol identification method and device based on automatic machine learning
CN112698848A (en) * 2020-12-31 2021-04-23 Oppo广东移动通信有限公司 Downloading method and device of machine learning model, terminal and storage medium
CN113199483A (en) * 2015-07-31 2021-08-03 发那科株式会社 Robot system, robot control method, machine learning device, and machine learning method
US11238377B2 (en) 2019-09-14 2022-02-01 Oracle International Corporation Techniques for integrating segments of code into machine-learning model
US11307836B2 (en) 2018-06-08 2022-04-19 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
CN114579822A (en) * 2021-12-13 2022-06-03 北京市建筑设计研究院有限公司 Method and device for pushing modeling tool, electronic equipment and storage medium
US11386352B2 (en) 2018-10-29 2022-07-12 Acer Cyber Security Incorporated System and method of training behavior labeling model
US11392764B2 (en) * 2018-04-13 2022-07-19 International Business Machines Corporation Classifying text to determine a goal type used to select machine learning algorithm outcomes
CN114927164A (en) * 2022-07-18 2022-08-19 深圳市爱云信息科技有限公司 Sample compatibility detection method, device, equipment and storage medium based on AIOT platform
US11487967B2 (en) * 2020-05-15 2022-11-01 International Business Machines Corporation Finetune image feature extraction using environmental data
WO2022227355A1 (en) * 2021-04-29 2022-11-03 华为云计算技术有限公司 Method and apparatus for acquiring knowledge
US11562267B2 (en) 2019-09-14 2023-01-24 Oracle International Corporation Chatbot for defining a machine learning (ML) solution
US11568318B2 (en) 2020-10-07 2023-01-31 Panasonic Intellectual Property Management Co., Ltd. Method for developing machine-learning based tool
US11663523B2 (en) 2019-09-14 2023-05-30 Oracle International Corporation Machine learning (ML) infrastructure techniques
US11769075B2 (en) 2019-08-22 2023-09-26 Cisco Technology, Inc. Dynamic machine learning on premise model selection based on entity clustering and feedback
WO2024001344A1 (en) * 2022-07-01 2024-01-04 华为云计算技术有限公司 Target function solving method and apparatus based on cloud computing technology and computing device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101615265B (en) * 2009-08-11 2012-07-04 路军 Intelligent decision simulating experimental system based on multi-Agent technology

Cited By (130)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102468975B (en) * 2010-11-16 2014-09-10 苏州搜能信息科技有限公司 Resource management simulation cloud computing system for cloud computing mobile network and application system thereof
CN102468975A (en) * 2010-11-16 2012-05-23 苏州搜能信息科技有限公司 Resource management simulation cloud computing system for cloud computing mobile network and application system thereof
CN102509177A (en) * 2011-11-11 2012-06-20 国网信息通信有限公司 Locally weighted linear regression projection operation method based on cloud platform
CN102509177B (en) * 2011-11-11 2014-12-31 国家电网公司 Locally weighted linear regression projection operation method based on cloud platform
CN102523246A (en) * 2011-11-23 2012-06-27 陈刚 Cloud computation treating system and method
CN103139278A (en) * 2011-12-05 2013-06-05 北京网康科技有限公司 Network resource pre-fetching and cache accelerating method and device thereof
CN102724298A (en) * 2012-05-25 2012-10-10 清华大学 Method for configuring storage parameter under cloud environment
CN102799512A (en) * 2012-07-04 2012-11-28 南京邮电大学 Virtual machine monitoring method in vector-autoregression-based cloud computing
CN102799512B (en) * 2012-07-04 2015-06-03 南京邮电大学 Virtual machine monitoring method in vector-autoregression-based cloud computing
CN103036745A (en) * 2012-12-21 2013-04-10 北京邮电大学 Anomaly detection system based on neural network in cloud computing
CN105075180B (en) * 2013-02-05 2018-11-13 思科技术公司 Method, apparatus for adjusting network attribute and computer readable storage medium
CN105075180A (en) * 2013-02-05 2015-11-18 思科技术公司 Dynamically adjusting a set of monitored network properties using distributed learning machine feedback
CN103605695A (en) * 2013-11-05 2014-02-26 佛山职业技术学院 Internet based artificial-intelligence knowledge logic system and method thereof
CN106416319A (en) * 2014-06-02 2017-02-15 高通股份有限公司 Deriving relationships from overlapping location data
WO2015184729A1 (en) * 2014-06-05 2015-12-10 Tsinghua University Method and system for hyper-parameter optimization and feature tuning of machine learning algorithms
US9436507B2 (en) 2014-07-12 2016-09-06 Microsoft Technology Licensing, Llc Composing and executing workflows made up of functional pluggable building blocks
CN106537423A (en) * 2014-07-12 2017-03-22 微软技术许可有限责任公司 Adaptive featurization as service
US10026041B2 (en) 2014-07-12 2018-07-17 Microsoft Technology Licensing, Llc Interoperable machine learning platform
US10423445B2 (en) 2014-07-12 2019-09-24 Microsoft Technology Licensing, Llc Composing and executing workflows made up of functional pluggable building blocks
CN105320835A (en) * 2014-07-15 2016-02-10 通用电气智能平台有限公司 Apparatus and method for time series data analysis method market
CN106067028A (en) * 2015-04-19 2016-11-02 北京典赞科技有限公司 The modeling method of automatic machinery based on GPU study
CN106250986A (en) * 2015-06-04 2016-12-21 波音公司 Advanced analysis base frame for machine learning
CN104951425B (en) * 2015-07-20 2018-03-13 东北大学 A kind of cloud service performance self-adapting type of action system of selection based on deep learning
CN104951425A (en) * 2015-07-20 2015-09-30 东北大学 Cloud service performance adaptive action type selection method based on deep learning
CN113199483A (en) * 2015-07-31 2021-08-03 发那科株式会社 Robot system, robot control method, machine learning device, and machine learning method
CN107921262A (en) * 2015-08-26 2018-04-17 波士顿科学神经调制公司 For optimizing the machine learning of spinal cord stimulation
CN106528489A (en) * 2015-09-14 2017-03-22 上海羽视澄蓝信息科技有限公司 System for vehicle detection machine learning based on cloud computing
CN107092962B (en) * 2016-02-17 2021-01-26 创新先进技术有限公司 Distributed machine learning method and platform
CN107092962A (en) * 2016-02-17 2017-08-25 阿里巴巴集团控股有限公司 A kind of distributed machines learning method and platform
CN105808500A (en) * 2016-02-26 2016-07-27 山西牡丹深度智能科技有限公司 Realization method and device of deep learning
CN106445988A (en) * 2016-06-01 2017-02-22 上海坤士合生信息科技有限公司 Intelligent big data processing method and system
CN107784363B (en) * 2016-08-31 2021-02-09 华为技术有限公司 Data processing method, device and system
WO2018040561A1 (en) * 2016-08-31 2018-03-08 华为技术有限公司 Data processing method, device and system
CN107784363A (en) * 2016-08-31 2018-03-09 华为技术有限公司 Data processing method, apparatus and system
CN106779087A (en) * 2016-11-30 2017-05-31 福建亿榕信息技术有限公司 A kind of general-purpose machinery learning data analysis platform
CN106779087B (en) * 2016-11-30 2019-02-22 福建亿榕信息技术有限公司 A kind of general-purpose machinery learning data analysis platform
CN108572610A (en) * 2017-03-13 2018-09-25 欧姆龙株式会社 Evaluation system, safety governor, medium capable of reading record and evaluation method
CN108572610B (en) * 2017-03-13 2021-05-18 欧姆龙株式会社 Evaluation system, security controller, readable recording medium, and evaluation method
CN110741390A (en) * 2017-03-28 2020-01-31 甲骨文国际公司 System and method for intelligently providing supporting information using machine learning
US11443225B2 (en) 2017-03-28 2022-09-13 Oracle International Corporation Systems and methods for intelligently providing supporting information using machine-learning
WO2018176215A1 (en) * 2017-03-28 2018-10-04 Oracle International Corporation Systems and methods for intelligently providing supporting information using machine-learning
CN110741390B (en) * 2017-03-28 2023-12-22 甲骨文国际公司 System and method for intelligently providing support information using machine learning
CN107291811A (en) * 2017-05-18 2017-10-24 浙江大学 A kind of sense cognition enhancing robot system based on high in the clouds knowledge fusion
CN107291811B (en) * 2017-05-18 2019-11-29 浙江大学 A kind of sense cognition enhancing robot system based on cloud knowledge fusion
CN110679114B (en) * 2017-05-24 2021-08-06 国际商业机器公司 Method for estimating deletability of data object
CN110679114A (en) * 2017-05-24 2020-01-10 国际商业机器公司 Method for estimating deletability of data object
CN107255969B (en) * 2017-06-28 2019-10-18 重庆柚瓣家科技有限公司 Endowment robot supervisory systems
CN107255969A (en) * 2017-06-28 2017-10-17 重庆柚瓣家科技有限公司 Endowment robot supervisory systems
CN107329445A (en) * 2017-06-28 2017-11-07 重庆柚瓣家科技有限公司 The method of robot behavior criterion intelligent supervision
CN111079942A (en) * 2017-08-30 2020-04-28 第四范式(北京)技术有限公司 Distributed system for performing machine learning and method thereof
CN111079942B (en) * 2017-08-30 2023-03-24 第四范式(北京)技术有限公司 Distributed system for performing machine learning and method thereof
JP2019046324A (en) * 2017-09-05 2019-03-22 ファナック株式会社 Controller
US11048216B2 (en) 2017-09-05 2021-06-29 Fanuc Corporation Control device
CN107538492A (en) * 2017-09-07 2018-01-05 福物(上海)机器人科技有限公司 Intelligent control system, method and the intelligence learning method of mobile robot
TWI688838B (en) * 2017-10-06 2020-03-21 日商佳能股份有限公司 Control device, lithography device, measuring device, processing device, flattening device, and article manufacturing method
CN111247539A (en) * 2017-10-19 2020-06-05 英国电讯有限公司 Algorithm integration
WO2019080501A1 (en) * 2017-10-27 2019-05-02 平安科技(深圳)有限公司 Electronic apparatus, multi-model sample training method and system, and computer-readable storage medium
CN108228325B (en) * 2017-10-31 2020-12-29 深圳市商汤科技有限公司 Application management method and device, electronic equipment and computer storage medium
CN108228325A (en) * 2017-10-31 2018-06-29 深圳市商汤科技有限公司 Application management method and device, electronic equipment, computer storage media
CN107766940B (en) * 2017-11-20 2021-07-23 北京百度网讯科技有限公司 Method and apparatus for generating a model
CN107766940A (en) * 2017-11-20 2018-03-06 北京百度网讯科技有限公司 Method and apparatus for generation model
CN108009643A (en) * 2017-12-15 2018-05-08 清华大学 A kind of machine learning algorithm automatic selecting method and system
CN108009643B (en) * 2017-12-15 2018-10-30 清华大学 A kind of machine learning algorithm automatic selecting method and system
CN107977712A (en) * 2017-12-20 2018-05-01 四川九洲电器集团有限责任公司 Network type machine learning system
CN108343125A (en) * 2018-02-27 2018-07-31 浙江欧琳生活健康科技有限公司 The method and system of water under a kind of outlet device based on wireless control
CN108343125B (en) * 2018-02-27 2024-02-23 宁波欧琳科技股份有限公司 Method and system for draining water based on wireless control drainer
CN111684474B (en) * 2018-03-13 2024-04-16 欧姆龙株式会社 Arithmetic device, arithmetic method, and recording medium
CN111684474A (en) * 2018-03-13 2020-09-18 欧姆龙株式会社 Arithmetic device, arithmetic method, and program thereof
CN108764267A (en) * 2018-04-02 2018-11-06 上海大学 A kind of Denial of Service attack detection method integrated based on confrontation type decision tree
CN108764267B (en) * 2018-04-02 2021-08-10 上海大学 Denial of service attack detection method based on countermeasure decision tree integration
US11397851B2 (en) * 2018-04-13 2022-07-26 International Business Machines Corporation Classifying text to determine a goal type used to select machine learning algorithm outcomes
US11392764B2 (en) * 2018-04-13 2022-07-19 International Business Machines Corporation Classifying text to determine a goal type used to select machine learning algorithm outcomes
CN108628669A (en) * 2018-04-25 2018-10-09 北京京东尚科信息技术有限公司 A kind of method and apparatus of scheduling machine learning algorithm task
CN108665072A (en) * 2018-05-23 2018-10-16 中国电力科学研究院有限公司 A kind of machine learning algorithm overall process training method and system based on cloud framework
US11334330B2 (en) 2018-06-08 2022-05-17 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
US11334329B2 (en) 2018-06-08 2022-05-17 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
US11307836B2 (en) 2018-06-08 2022-04-19 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
US11726754B2 (en) 2018-06-08 2023-08-15 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
US11379199B2 (en) 2018-06-08 2022-07-05 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
US11403080B2 (en) 2018-06-08 2022-08-02 Shanghai Cambricon Information Technology Co., Ltd. General machine learning model, and model file generation and parsing method
CN108898229A (en) * 2018-06-26 2018-11-27 第四范式(北京)技术有限公司 For constructing the method and system of machine learning modeling process
CN108960433A (en) * 2018-06-26 2018-12-07 第四范式(北京)技术有限公司 For running the method and system of machine learning modeling process
CN109117266A (en) * 2018-07-13 2019-01-01 视云融聚(广州)科技有限公司 Video artefacts' intelligent training platform based on multi-layer framework
CN109240658A (en) * 2018-09-12 2019-01-18 郑州云海信息技术有限公司 A kind of method and device of software architecture selection neural network based
CN109409533A (en) * 2018-09-28 2019-03-01 深圳乐信软件技术有限公司 A kind of generation method of machine learning model, device, equipment and storage medium
CN109325541A (en) * 2018-09-30 2019-02-12 北京字节跳动网络技术有限公司 Method and apparatus for training pattern
US11386352B2 (en) 2018-10-29 2022-07-12 Acer Cyber Security Incorporated System and method of training behavior labeling model
CN109635918A (en) * 2018-10-30 2019-04-16 银河水滴科技(北京)有限公司 The automatic training method of neural network and device based on cloud platform and preset model
CN109376844A (en) * 2018-10-30 2019-02-22 银河水滴科技(北京)有限公司 The automatic training method of neural network and device recommended based on cloud platform and model
CN109635833A (en) * 2018-10-30 2019-04-16 银河水滴科技(北京)有限公司 A kind of image-recognizing method and system based on cloud platform and model intelligent recommendation
CN111177802B (en) * 2018-11-09 2022-09-13 安碁资讯股份有限公司 Behavior marker model training system and method
CN111177802A (en) * 2018-11-09 2020-05-19 安碁资讯股份有限公司 Behavior marker model training system and method
CN111338630A (en) * 2018-11-30 2020-06-26 上海寒武纪信息科技有限公司 Method and device for generating universal machine learning model file and storage medium
CN111338630B (en) * 2018-11-30 2022-02-08 上海寒武纪信息科技有限公司 Method and device for generating universal machine learning model file and storage medium
CN109711436A (en) * 2018-12-05 2019-05-03 量子云未来(北京)信息科技有限公司 A kind of artificial intelligence training pattern construction method, device and storage medium
WO2020155594A1 (en) * 2019-02-02 2020-08-06 清华大学 Automated machine learning system and method for streaming data analysis for use in conceptual transfer
US11769075B2 (en) 2019-08-22 2023-09-26 Cisco Technology, Inc. Dynamic machine learning on premise model selection based on entity clustering and feedback
CN110598777B (en) * 2019-09-03 2022-12-27 中国科学院深圳先进技术研究院 Data processing method and system based on end cloud cooperation
CN110598777A (en) * 2019-09-03 2019-12-20 中国科学院深圳先进技术研究院 Data processing method and system based on end cloud cooperation
US11562267B2 (en) 2019-09-14 2023-01-24 Oracle International Corporation Chatbot for defining a machine learning (ML) solution
US11475374B2 (en) 2019-09-14 2022-10-18 Oracle International Corporation Techniques for automated self-adjusting corporation-wide feature discovery and integration
US11921815B2 (en) 2019-09-14 2024-03-05 Oracle International Corporation Techniques for the automated customization and deployment of a machine learning application
US11238377B2 (en) 2019-09-14 2022-02-01 Oracle International Corporation Techniques for integrating segments of code into machine-learning model
US11556862B2 (en) 2019-09-14 2023-01-17 Oracle International Corporation Techniques for adaptive and context-aware automated service composition for machine learning (ML)
US11663523B2 (en) 2019-09-14 2023-05-30 Oracle International Corporation Machine learning (ML) infrastructure techniques
US11625648B2 (en) 2019-09-14 2023-04-11 Oracle International Corporation Techniques for adaptive pipelining composition for machine learning (ML)
US11847578B2 (en) 2019-09-14 2023-12-19 Oracle International Corporation Chatbot for defining a machine learning (ML) solution
US11811925B2 (en) 2019-09-14 2023-11-07 Oracle International Corporation Techniques for the safe serialization of the prediction pipeline
CN110715953A (en) * 2019-09-18 2020-01-21 浙江大学 System and method for testing heat-conducting property of film material based on machine learning
CN110715953B (en) * 2019-09-18 2020-07-21 浙江大学 System and method for testing heat-conducting property of film material based on machine learning
CN110653801A (en) * 2019-09-30 2020-01-07 哈尔滨工业大学 Guide control system of robot operating arm and flexible control and teaching learning method thereof
CN110653801B (en) * 2019-09-30 2022-06-17 哈尔滨工业大学 Guide control system of robot operating arm and flexible control and teaching learning method thereof
CN110765163A (en) * 2019-10-17 2020-02-07 华普通用技术研究(广州)有限公司 Execution plan generation method for big data processing flow
CN111210023A (en) * 2020-01-13 2020-05-29 哈尔滨工业大学 Automatic selection system and method for data set classification learning algorithm
CN111210023B (en) * 2020-01-13 2023-04-11 哈尔滨工业大学 Automatic selection system and method for data set classification learning algorithm
CN111461892A (en) * 2020-03-31 2020-07-28 支付宝(杭州)信息技术有限公司 Method and device for selecting derived variables of risk identification model
CN111461892B (en) * 2020-03-31 2021-07-06 支付宝(杭州)信息技术有限公司 Method and device for selecting derived variables of risk identification model
CN111523646A (en) * 2020-04-23 2020-08-11 国家开放大学 Remote education learning center intelligent perception network based on Internet of things and management method
CN111680717A (en) * 2020-05-12 2020-09-18 顺德职业技术学院 Product classification method and system on intelligent manufacturing production line based on deep learning
US11487967B2 (en) * 2020-05-15 2022-11-01 International Business Machines Corporation Finetune image feature extraction using environmental data
CN111814864A (en) * 2020-07-03 2020-10-23 北京中计新科仪器有限公司 Artificial intelligent cloud platform system for mass spectrometry data and data analysis method
CN112288133A (en) * 2020-09-28 2021-01-29 珠海大横琴科技发展有限公司 Algorithm service processing method and device
US11568318B2 (en) 2020-10-07 2023-01-31 Panasonic Intellectual Property Management Co., Ltd. Method for developing machine-learning based tool
CN112671757B (en) * 2020-12-22 2023-10-31 无锡江南计算技术研究所 Encryption flow protocol identification method and device based on automatic machine learning
CN112671757A (en) * 2020-12-22 2021-04-16 无锡江南计算技术研究所 Encrypted flow protocol identification method and device based on automatic machine learning
CN112698848A (en) * 2020-12-31 2021-04-23 Oppo广东移动通信有限公司 Downloading method and device of machine learning model, terminal and storage medium
WO2022227355A1 (en) * 2021-04-29 2022-11-03 华为云计算技术有限公司 Method and apparatus for acquiring knowledge
CN114579822A (en) * 2021-12-13 2022-06-03 北京市建筑设计研究院有限公司 Method and device for pushing modeling tool, electronic equipment and storage medium
WO2024001344A1 (en) * 2022-07-01 2024-01-04 华为云计算技术有限公司 Target function solving method and apparatus based on cloud computing technology and computing device
CN114927164A (en) * 2022-07-18 2022-08-19 深圳市爱云信息科技有限公司 Sample compatibility detection method, device, equipment and storage medium based on AIOT platform

Also Published As

Publication number Publication date
CN101782976B (en) 2013-04-10

Similar Documents

Publication Publication Date Title
CN101782976B (en) Automatic selection method for machine learning in cloud computing environment
Abbasi et al. Predicting solutions of large-scale optimization problems via machine learning: A case study in blood supply chain management
Wu et al. Evolving RBF neural networks for rainfall prediction using hybrid particle swarm optimization and genetic algorithm
Cicek et al. Optimizing the artificial neural network parameters using a biased random key genetic algorithm for time series forecasting
Soui et al. Rule-based credit risk assessment model using multi-objective evolutionary algorithms
Papageorgiou Review study on fuzzy cognitive maps and their applications during the last decade
Filip Decision support and control for large-scale complex systems
Kahraman et al. Intelligence decision systems in enterprise information management
CN106779087A (en) A kind of general-purpose machinery learning data analysis platform
Papageorgiou Review study on fuzzy cognitive maps and their applications during the last decade
Hassan et al. A hybrid of multiobjective Evolutionary Algorithm and HMM-Fuzzy model for time series prediction
Janković et al. Machine learning models for ecological footprint prediction based on energy parameters
Alzaeemi et al. Examining the forecasting movement of palm oil price using RBFNN-2SATRA metaheuristic algorithms for logic mining
Zheng et al. Evolutionary machine learning builds smart education big data platform: Data-driven higher education
Taherdoost Machine learning algorithms: features and applications
Gao et al. A multi-objective service composition method considering the interests of tri-stakeholders in cloud manufacturing based on an enhanced jellyfish search optimizer
Gong et al. Evolutionary computation in China: A literature survey
Wang et al. Research on intelligence analysis technology of financial industry data based on genetic algorithm
Asiri et al. Short-Term Load Forecasting in Smart Grids using Hybrid Deep Learning
Lu et al. Sales forecasting of IT products using a hybrid MARS and SVR model
Chu et al. A data-driven meta-learning recommendation model for multi-mode resource constrained project scheduling problem
Jiao et al. An integration model for generating and selecting product configuration plans
Jackson et al. Automl approach to classification of candidate solutions for simulation models of logistic systems
Nalabala et al. Financial predictions based on fusion models-a systematic review
Yan et al. A short-term forecasting model with inhibiting normal distribution noise of sale series

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
EE01 Entry into force of recordation of patent licensing contract

Assignee: Jiangsu Jiqun Information Industry Co., Ltd.

Assignor: Nanjing Post & Telecommunication Univ.

Contract record no.: 2012320000280

Denomination of invention: Automatic selection method for machine learning in cloud computing environment

License type: Exclusive License

Open date: 20100721

Record date: 20120322

C14 Grant of patent or utility model
GR01 Patent grant