CN106779755A - A kind of network electric business borrows or lends money methods of risk assessment and model - Google Patents
A kind of network electric business borrows or lends money methods of risk assessment and model Download PDFInfo
- Publication number
- CN106779755A CN106779755A CN201611265956.XA CN201611265956A CN106779755A CN 106779755 A CN106779755 A CN 106779755A CN 201611265956 A CN201611265956 A CN 201611265956A CN 106779755 A CN106779755 A CN 106779755A
- Authority
- CN
- China
- Prior art keywords
- prime
- sample
- debt
- credit
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/01—Customer relationship services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/03—Credit; Loans; Processing thereof
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- General Physics & Mathematics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Development Economics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Technology Law (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a kind of network electric business debt-credit methods of risk assessment and model, method is comprised the following steps:The historical trading data of collection network electric business debt-credit client is used as sample set;The twin support vector cassification model of least square is built, and it is trained, set up network electric business debt-credit Credit Model;Risk model is borrowed or lent money according to the network electric business set up, forecast sample concentrates the promise breaking label of each sample;For each sample, will predict that the promise breaking label of the promise breaking label and reality for obtaining is compared, determine the predicated error of sample, and the size of each sample weights is determined according to the size of predicated error;Based on each sample weights, the twin support vector cassification model of weighted least-squares is built, re-establish network electric business debt-credit risk model, debt-credit risk is estimated.The present invention assessment degree of accuracy is high.
Description
Technical field
The present invention relates to a kind of network electric business debt-credit methods of risk assessment and model.
Background technology
Internet finance is a kind of brand-new Finance Service, and traditional small amount debt-credit service is transferred to internet by it
, it is necessary to the crowd for borrowing money can be lent ability and be ready to be lent based on certain condition in network loan platform searching on platform
Crowd, due to network loan have it is efficient, simple to operate, towards middle low layer take in crowd (for borrower) and peace
The advantage of entirely, transparent, income (for lender) high, thus network loan platform once release just get the nod rapidly with
Development[1].But compared with traditional pattern of lending, network loan there is also shortcoming, such as network loan is unsecured loan,
Lender's major part is all general population, the Investment & Financing knowledge without specialty, and some borrowers are in order at the mesh swindled
Carrying out provide a loan, this just to lender bring network loan risk, i.e. borrower regulation cut-off refund the date in, due to
A variety of causes cannot on time be paid off a loan according to lending agreement, and the possibility of monetary losses is brought to lender[2]。
Recent years, many scholars propose many new assessing credit risks moulds for the debt-credit risk control of network electric business
Type.These models can generally be divided into two classes:The first kind is application level analytic approach[5], Field Using Fuzzy Comprehensive Assessment[6]Deng statistics
The credit indicator evaluation system that method is set up, and specify the subjective weight of each index.Equations of The Second Kind is to use linear data
There are some general character in the risk evaluation model that mining algorithm is set up, these models:Assessment accuracy rate is low, and tracing it to its cause is
There is nonlinear organization in credit data.And the two sorting algorithm applications that can process nonlinear organization data more widely have:
SVMs[7]And BP neural network[8].But be present the unbalanced feature of class in network electric business loan credit data, such as clap
Bat loan platform is normally refunded to record and is about 10 times that promise breaking is recorded, for such risk data, supporting vector machine model
Error type I False Rate is higher, i.e., normal refund client is judged to the client that breaks a contract.BP neural network there is also same asking
Topic, and BP neural network model is when only having the error of training set and forecast set close, and model is just with generalization ability.
Therefore, it is necessary to design a kind of accuracy rate network electric business debt-credit Risk Forecast Method and model higher.
The content of the invention
Technical problem solved by the invention is, in view of the shortcomings of the prior art, there is provided a kind network electric business borrows or lends money wind
Dangerous appraisal procedure and model, predictablity rate are high.
Technical scheme provided by the present invention is:
A kind of network electric business borrows or lends money methods of risk assessment, comprises the following steps:
Step 1, the data of collection history debt-credit client are used as sample set;The data of each debt-credit client include that n is tieed up and evaluate
Achievement data and promise breaking label target, target=1 represent loan defaults, and target=0 represents normal refund;
Step 2, the data in sample set are pre-processed;The pretreatment includes that missing values treatment, nonnumeric type are commented
Valency achievement data quantifies and data normalization;
Step 3, the sample in sample set is divided into according to the value of promise breaking label target by two classes, the first kind is target=
0 normal refund client, matrix is built by its corresponding evaluation index dataEquations of The Second Kind is target=1
Loan defaults client, build matrix by its corresponding evaluation index dataWherein m1 and m2 are respectively two
Sample size in class sample,It is n-dimensional vector, represents i-th evaluation index of sample in kth class;
Step 4, based on two class samples, build the twin support vector cassification model of least square, and it is trained,
Set up network electric business debt-credit risk evaluation model;
Step 5, the network electric business debt-credit risk evaluation models according to 4 foundation, forecast sample concentrate the promise breaking of each sample
Label target;
Step 6, for each sample, it is actual separated that the promise breaking label target and step 1 obtained according to 5 predictions are collected
About label target determines the predicated error of sample, and the size of each sample weights is determined according to the size of predicated error;Sample
The determination principle of this weight is:The larger sample weights of predicated error are smaller, and the less sample weights of predicated error are larger;
Step 7, based on each sample weights, build the twin support vector cassification model of weighted least-squares, build again
Vertical network electric business debt-credit risk evaluation model;
Step 8, the n dimension evaluation index data for gathering new debt-credit client, substitute into the network electricity that step 7 determines after pretreatment
Loan risk evaluation model is negotiated to borrow, the corresponding promise breaking label target of borrower is connected to, predicts whether debt-credit client can provide a loan separated
About, it is estimated with to this debt-credit risk.
The step 4 specifically includes following steps:
Step 4.1, the structure twin support vector cassification model of least square:
s.t.-(K(B,CT)w1+e2b1)=e2-η2
s.t.K(A,CT)w2+e1b2=e1-η1
Wherein, C1And C2It is punishment parameter, optimal value is selected (i.e. first randomly by according to ten folding cross validations
It is 10 mutually disjoint size identical subsets to data cutting, then using 9 data training patterns of subset (training mould
C in type1And C2Span be:C1∈[2-5,…,25], C2∈[2-5,…,25]), tested using remaining 1 subset
Model;This process is repeated to possible 10 kinds of selections, average test error in 10 evaluation and tests is finally selected minimum
Model); η1And η2It is loss variable (being also called error);e1And e21 column vector is all for element;K(X,
CT) it is gaussian kernel function, X=A or B (σ is the width parameter of function, σ>0, σ according to
Ten folding cross validations select optimal value);w1And w2It is weight vector, b1And b2Presentation class threshold value;w1、w2、b1And b2To wait to ask
Parameter;
Step 4.2, w is asked for by below equation1、w2、b1And b2:
Wherein, U=[K (A, CT),e1], V=[K (B, CT),e2];
Step 4.3, according to fixed parameter set up with lower network electric business borrow or lend money risk model:
Wherein, x represents the n dimension evaluation index data of pretreated new reception client, and target (x)=0 represents pre-
Surveying the new debt-credit client can normally refund, and target (x)=1 represents predicts that the new debt-credit client understands loan defaults.
In the step 6, the computing formula of sample weights is:
Wherein, | ηki| (k=1,2) is i-th predicated error of sample in sample set kth class sample, and δ is | ηki|(k
=standard deviation 1,2).
The step 7 specifically includes following steps:
Step 7.1, the structure twin support vector cassification model of weighted least-squares:
The twin support vector cassification model (WLS-TSVM1) of first weighted least-squares:
s.t.-(K(B,CT)w'1+e2b1')=e2-η2
The twin support vector cassification model (WLS-TSVM2) of second weighted least-squares:
s.t.K(A,CT)w'2+e1b'2=e1-η1
Wherein, C1' and C'2It is punishment parameter, optimal value is selected according to ten folding cross validations;w'1And w'2For weigh to
Amount, b'1And b'2Presentation class threshold value;w'1、w'2、b'1And b'2It is parameter to be asked;
Step 7.1, w' is asked for by below equation1、w'2、b'1、b'2:
Wherein, M=[K (A, CT),e1], N=[K (B, CT),e2];
P1 -1WithIt is respectively with ρ1iAnd ρ2iIt is the diagonal matrix of the elements in a main diagonal;
Step 7.1, according to fixed parameter re-establish with lower network electric business borrow or lend money risk model:
Wherein, K (xT,CT) it is gaussian kernel function;σ is the width parameter of function,
σ>0, σ selects optimal value according to ten folding cross validations;X represents that the n of debt-credit client new after pre-processing ties up evaluation index number
According to target'(x)=0 represent and predict that the new debt-credit client can normally refund, target'(x)=1 represent prediction this is new
Debt-credit client may loan defaults.
A kind of network electric business borrows or lends money risk evaluation model, it is characterised in that the model is:
Wherein, x represents the n dimension evaluation index data of pretreated new debt-credit client, target'(x)=0 expression is in advance
Surveying the new debt-credit client can normally refund, and this debt-credit risk is relatively low;Target'(x debt-credit visitor for predicting that this is new)=1 is represented
Family meeting loan defaults, this debt-credit risk is higher;Model parameter is solved by above step.
Principle of the invention is illustrated below:
The twin support vector cassification model of 1 least square
Twin SVMs (twins support vector machine, be abbreviated as TSVM) is in SVMs
(SVM) put forward on the basis of, the training speed its purpose is to improve SVM.The basic thought of TSVM is to positive and negative class
Training sample constructs two nonparallel Optimal Separating Hyperplanes, and each Optimal Separating Hyperplane is meeting the condition away from another kind of data point
Under be fitted this class data point as far as possible[9].Compared with SVM, there are two advantages in TSVM:First, it is equal in positive and negative class number of samples
In the case of, the time complexity of SVM is O (m3), the time complexity of TSVM is O (2* (m/2)3), wherein m is training set sample
This number, from time complexity it can be seen that the training speed of TSVM improves 4 times compared to SVM[10];Second, it is uneven for class
The data of weighing apparatus, the influence that corresponding punishment parameter elimination class imbalance is brought is set when hyperplane is sought to each hyperplane[11]。
Due to the classification advantage of the two uniquenesses of TSVM, TSVM is widely applied to Image detection in recent years[12], chemical analysis[13]And well
Lower diagnosis[14]Etc. industry.
The training sample of given m n dimensions, is classified as two classes, and the wherein first kind is designated as matrixThe
Two class samples are designated as matrixWherein m1 and m2 are respectively the sample size in two class samples,For n tie up to
Amount, represents j-th sample of the i-th class;Then TSVM models are described in detail below:
In above formula, C1And C2It is punishment parameter, optimal value is selected according to ten folding cross validations;η1And η2For loss becomes
The difference of amount (being also called error), i.e. predicted value and actual value;e1,e21 column vector is all for element;C=[AT,BT]T, K (X,
CT) it is gaussian kernel function, X=A or B;σ is the width parameter of function, σ>0, σ according to
Ten folding cross validations select optimal value;w1And w2It is weight vector, b1And b2Presentation class threshold value, w1、w2, b1And b2To wait to ask
Parameter.
The twin SVMs of least square (Least squares twins support vector machine, letter
It is written as LS-TSVM) the least square thought is introduced, the slack variable η in LS-TSVM1And η2C has been used respectively1/ 2, C2The 2 of/2
Normal form, can thus omit η1>=0 and η2>=0 constraints, while inequality constraints is revised as equality constraint, so
Quadratic programming problem conversion is just reduced the computation complexity of twin SVMs for Solving Linear problem.
Here is the specific descriptions to non-linear LS-SVM models:
s.t.-(K(B,CT)w1+e2b1)=e2-η2 (4)
s.t.K(A,CT)w2+e1b2=e1-η1 (6)
(4) formula and (6) formula are substituted into (3) formula respectively and (5) formula is obtained:
(7) formula is asked on w respectively1And b1Derivative, and make it be equal to zero, draw
K(A,CT)T[K(A,CT)w1+e1b1]+C1K(B,CT)T[K(B,CT)w1+e2b1+e2]=0 (9)
It is matrix form to merge (9) formula and (10) formula, is obtained
Can be solved by (11) formula
In above formula:
U=[K (A, CT),e1]
V=[K (B, CT),e2]
Can similarly obtain:
(12) formula and (13) formula are substituted into following (14) formula respectively, two Optimal Separating Hyperplanes just can be obtained.For new
Sample number strong point, according to sample number strong point to two range estimation data point generics of Optimal Separating Hyperplane, data point from
Which hyperplane is nearer, and which kind of the data point just belongs to.
K(xT,CT)w1+b1=0, K (xT,CT)w2+b2=0 (14)
The twin support vector cassification model of 2 weighted least-squares
The twin SVMs of 2.1 weighted least-squares
Although LS-TSVM algorithms solve the problems, such as that TSVM calculates complicated, but when training set has data exception point,
The punishment parameter of formed objects is assigned to each sample in similar, algorithm robustness is reduced.In order to reduce data exception point
Influence to Optimal Separating Hyperplane, proposes that a kind of twin SVMs of weighted least-squares is calculated on the basis of LS-TSVM herein
Method, improves the not good enough problem of LS-TSVM algorithm robustness.Weighting treatment has been made to error in (3) formula and (5) formula, if first
Class sample and the corresponding weights of Equations of The Second Kind sample are respectively ρ1iAnd ρ2i, then corresponding optimization problem can be described as
WLS-TSVM1:
WLS-TSVM2:
The Lagrangian Form of WLS-TSVM1 formulas can be expressed as:
Wherein, α is m2The Lagrange multiplier of dimension.
Obtained according to KTT conditions:
Obtained by (18) formula and (19) formula:
Order
M=[K (A, CT),e1] (23)
N=[K (B, CT),e2] (24)
Can then be drawn by (22) formula:
By (20) formula, (21) Shi Ke get:
Wherein, P1 -1It is with (ρ1)iiIt is the diagonal matrix of the elements in a main diagonal.
M is introduced to WLS-TSVM2 formulas1The β Lagrange multipliers of dimension, can similarly obtain:
Wherein P2 -1It is with (ρ2)iiIt is the diagonal matrix of the elements in a main diagonal.
S is determined according to (25) formula and (27) formula1And s2, thus just two hyperplane can be determined by (14) formula.Weighting is most
The classification function that a young waiter in a wineshop or an inn multiplies twin SVMs can be expressed as:
Above calculating process establishes the twin vector machine Credit Risk Model of weighted least-squares, for new client,
Then collection customer information calculates the value for determining target, target'(x as the x vector inputs in (29) formula)=0 representative
Sample belongs to the first kind, that is, predict that the client can normally refund;Target'(x)=1 representative sample Equations of The Second Kind, the i.e. client
May loan defaults.
2.2 weights are set
In order to reduce influence of the data exception point to disaggregated model so as to improve the generalization ability of disaggregated model, while again not
Ignore effect of the data exception point to disaggregated model, herein using normpdf be more than zero the characteristics of, according to
The larger sample weights of probability density function values calculating sample weights, i.e. predicated error are smaller, the less sample power of predicated error
Value is larger, and influence of the data exception point to disaggregated model is reduced with this.
Specific weights set as follows:
Wherein, | ηki| (k=1,2) is i-th predicated error of sample in sample set kth class sample, and δ is | ηki|(k
=standard deviation 1,2).
Specific weights solution procedure is as follows:
The first step:The twin support vector cassification model of least square is set up using training set data;
Second step:According to the first step set up the twin support vector cassification model of least square to training set data again
Classification, obtains classification results;
3rd step:The classification results obtained according to the actual classification of training set sample and second step calculate training set sample and miss
Difference;
4th step:Sample weights are calculated according to (30) formula.
The 3 network electric business debt-credit Credit Risk Assessment Models based on WLS-TSVM
Network electric business debt-credit Credit Risk Model estimation flow is as follows:Transaction data is borrowed or lent money according to existing electric business history first
Built one's credit risk evaluation model, and the credit standing of borrower is then assessed with Credit Risk Model.By to having trained
Data set sets up disaggregated model, and new borrower is divided into two classes by application model:Default risk is high and default risk is low, for
Default risk borrower's refusal loaning bill project high passes through, so as to reduce default risk.
The specific execution step of the network electric business debt-credit Credit Risk Model based on WLS-TSVM algorithms is as follows:
(data of history debt-credit client can be from existing network electric business loan platform for the data of collection history debt-credit client
Obtain) used as training sample, the data to be collected include that (target=1 is represented for evaluation index data and promise breaking label target
Loan defaults, target=0 represents normal refund), evaluation index data are specifically included:
Borrower's essential information field (name, sex, identification card number, cell-phone number, No. QQ, conventional mailbox, marital status,
Credit number, Employment, schooling, contact person etc.);
Network behavior field (accesses type, access times, average access duration of website etc.);
Academic student status field (academic type, academic level, graduation universities and colleges, graduation department, graduation time, major name,
Habit form, length of schooling etc.);
(borrower is in third party's financial platform history loaning bill number of times, history loaning bill success rate, credit for third-party platform field
Grade etc.);
Social networks field (friend quantity, friend species number, friend averaged historical of the borrower in social platforms such as Tengxuns
Loaning bill number of times, friend's history loaning bill success rate, friend's average credit grade scoring etc.);
Loaning bill information field (borrowing balance, the life of loan, borrowing rate, use of the loan, credit grade, be in debt income ratio,
It is engaged in the time of work, house now and possesses situation, annual income, history loaning bill promise breaking label etc.);
2. the treatment of data prediction, including missing values, the digitlization of nonnumeric field, data normalization:
2.1. missing values treatment, missing values are deleted more than the row of training set data total amount 60%;Missing values are less than instruction
Practice the row of collection data total amount 60%, missing values use -1 is filled up;
2.2. nonnumeric field digitlization:
Time map is year, month, day;
Place name is mapped as longitude and latitude according to national longitude and latitude table;
Ordinal data is mapped as ordinal number, and (such as schooling can be divided into illiteracy, primary school, junior middle school, senior middle school, university etc., text
Blind->0, primary school->1, junior middle school->2, senior middle school->3, university->4);
Nominal level variable is mapped as 0-1 dummy variables, and (it is 1 that such as sex sets female, and man is for 0);
2.3. it is standardized with zscore function pair data, eliminates the isomerism between data;
3. training set data is divided into two classes by the value according to promise breaking label target, and the first kind is normal for target=0's
Refund client, is represented with matrix A, and Equations of The Second Kind is the loan defaults client of target=1, is represented with matrix B;Pre- by data
Value of the training set data for the treatment of in addition to target according to target constitutes a line of matrix A or B, i.e., It is n-dimensional vector, represents i-th evaluation index of sample in kth class;
4. application training collection data, according to the twin support vector cassification model of least square, set up the debt-credit of network electric business
Credit Risk Model:
3.1. C matrixes, C=[A are calculatedT,BT]T;
3.2. K (A, C are determined according to gaussian kernel functionT) and K (B, CT);
3.3. calculating matrix U and V;
3.4. by the C of ten folding cross validation selection sort errors minimum1And C2, and determined according to (12) formula and (13) formula
Final w and the value of b;
3.5. two Optimal Separating Hyperplanes are determined according to (14);
5. according to the promise breaking label of the 4 twin support vector cassification model prediction training sets of least square set up, i.e. root
The value of the target of electric business client is determined according to (29) formula;
6. the target values according to the 5 target values for obtaining and training set electric business client's real trade determine sample error,
And sample weights ρ is determined according to (30) formula1i、ρ2i、P1 -1And P2 -1;
7. application training collection data re-establish network electricity according to the twin support vector cassification model of weighted least-squares
Negotiate to borrow loan Credit Risk Model:
7.1. calculating matrix M and N;
7.2. Lagrange multiplier α and β are determined according to (30) formula and (32) formula;
7.3. by the C of ten folding cross validation selection sort errors minimum1' and C2', and it is true according to (29) formula and (31) formula
The value of fixed final w and b;
7.4. two Optimal Separating Hyperplanes are determined according to (14);
8. the value of the target of new electric business client, target'(x are determined according to (29) formula)=0 represent and predict the visitor
Whether family can normally refund, target'(x)=1 represent predict the client may loan defaults.
Beneficial effect:
The present invention proposes a kind of twin based on weighted least-squares on the basis of the twin SVMs of least square
(Weighted least squares twins support vector machine, are abbreviated as WLS- to SVMs
TSVM network electric business debt-credit risk evaluation model).The twin supporting vector machine model of weighted least-squares takes full advantage of twin
The advantage that SVMs can quick and precisely classify to the unbalanced risk data of class, introduces the least square thought and simplifies
Calculating during twin model construction of SVM, while having done weighting treatment to the error in twin SVMs, reduces
Influence of the data exception point to Optimal Separating Hyperplane, so as to improve nicety of grading, largely improves risk profile accurate
True rate.Shown by the experiment of actual data analysis, WLS-TSVM algorithms reduce shadow of the noise data to disaggregated model really
Ring, model training speed and classification accuracy are improved.
Brief description of the drawings
Fig. 1 is the roc curve maps of BP neural network;
Fig. 2 is the roc curve maps of SVM;
The roc curve maps of Fig. 3 WLS-TSVM of the invention.
Specific embodiment
The present invention is described in more detail below in conjunction with the drawings and specific embodiments.
In the present embodiment, from the credit risk data for patting network loan industry in loan " witch mirror cup " air control algorithm contest
As experimental data.
(1) data set is constituted
Value according to the label target that broken a contract in experimental data may determine that the financial status of borrower, target=1 generations
Table loan defaults, target=0 represents normal refund.Debt-credit data record is randomly selected from experiment competition data to respectively constitute
The training set and test set of model, data set have six big field classifications, data set after each big field type subdivision
Totally 207 data dimensions, training set and test set data distribution are as shown in table 1.
The data set of table 1 is constituted
(2) data prediction
Due to there is character type data in risk data, and matlab is unable to the data of processing character type, so right herein
The numeral of character type is processed, and the address field of character type has been substituted for into the corresponding longitude and latitude in address.
The missing values number of each row of risk data table is counted with histc functions, 18000 have been more than for missing values number
The row of (data total amount 60%), are deleted, by treatment, 174 attributes of final residue.
It is standardized from zscore function pair data in testing herein, the difference between heterogeneity data is eliminated with this
It is different.
(3) interpretation
Experimental situation:Win7 operating systems, 6G internal memories, CPU2.5Hz, matlab2014a.To add with matlab instruments
The power twin algorithm of support vector machine of least square (WLS-TSVM), SVMs (SVM) and BP neural network are on forecast set
Prediction classification accuracy compare, the performance of model is measured according to following two indexs:(1) classification accuracy;(2)
The model training time.Table 2 is three kinds of time complexity contrasts of algorithm, and wherein m is sample total number, and m1, m2 are respectively positive and negative
Class number of samples, n is data dimension.Table 3 is the Comparative result of the prediction classification of SVM and WLS-TSVM on data set.
The Algorithms T-cbmplexity of table 2 is contrasted
The experimental result of table 3 is contrasted
From table 3 it can be seen that in terms of predictablity rate, the predictablity rate of WLS-TSVM models is higher than SVM and BP god
Through the predictablity rate of network.Time complexity analysis contrast according to table 2, it can be seen that the time complexity of WLS-TSVM is small
Take in the execution time major part of SVM and the time complexity of BP neural network, and algorithm and spend in the training of model.Pass through
Experimental data in table 3, we are really it can be seen that in terms of training time length, the WLS-TSVM training times are less than really
The training time of SVM and BP neural network model.
Fig. 1~Fig. 3 is respectively the roc curve maps and WLS-TSVM of the invention of the roc curve maps of BP neural network, SVM
Roc curve maps;The abscissa of roc curves is FPR (false positive rate, negative and positive class rate), and ordinate is TPR
(true positive rate, real class rate).With roc (Receiver Operating Characteristic) curves and
Auc values can be with the overall classification performance of classification device, and from Fig. 1, Fig. 2, the roc curve maps in Fig. 3 can be seen that WLS-
Auc ((Area Under roc Curve)) value (area under auc values correspondence blue curve) of TSVM is neural apparently higher than BP
The auc values of network and SVM algorithm.
Network electric business debt-credit Credit Risk Assessment Model knot of the present invention based on the twin SVMs of weighted least-squares
Having closed twin SVMs can process the advantage of the unbalanced nonlinear organization data of class, with weighted least-squares method letter
Change the calculating of twin SVMs, while having done weighting treatment to error term, reduce data exception point to disaggregated model
Influence, so not only increase classification predictablity rate, and reduce the model training time.Experimental section is patting loan
As a example by risk data in " witch mirror cup " air control contest, illustrate that the network electric business debt-credit risk model based on WLS-TSVM can be with
Satisfied nicety of grading is obtained with the less training time.Illustrate that this model has herein by theory and practice preferable
Data fitting effect, be with a wide range of applications.
Bibliography
[1] money gold leaf, current situation and prospect [J] finance forum, 2012 (1) of Yang Fei China P2P network loans:
45-51.
[2]Xinhui Chen,Xiangxiang Ji,Xinrui Zhang.A Contrastive Study of P2P
Network Lending in China and American[C].Asia-Pacific E-Business and E-
Government Conference(APEE2014),2014.
[3]Tang Guolei.P2P network lending in the credit riskrating of the
individual[J].International English Education Research,2015(9):23-26.
[4]Fengzhi Liu,MingyanWang.The credit risk of theborrower’s credit in
P2P network[J].International English Education Research,2015(8):72-75.
[5]Niejin,Hulinshuang.P2P network lending platform evaluationbased
onlink analysis[J].Research on Library Science,2015(13):37-45.
[6]Liqun You,Yudan Zhu.P2P network lending platform business model
evaluation[J].Journal of Commercial Economics,2015(32):86-88.
[7]JinYu,ZhuYudan.Data-Driven Approach to Predict Default Risk of
Loan for Online Peer-to-Peer(P2P)Lending[C].Communication Systems and Network
Technologies(CSNT),2015Fifth International Conference on,2015.
[8]DunGang Zang,MngYu Qi,YanMing Fu.The credit risk assessment of P2P
lending based on BP neural network[C].The 2014International Conference on
Industrial Engineering and Management Science(IEMS2014),2014.
[9]Xinjun Peng,A ν-twin support vector machine(ν-TSVM)classifier and
its geometric algorithms[J].Information Sciences,2010,180(20):3863-3875.
[10]Xijiong Xie,Shiliang Sun.Multitask centroid twins
supportvectormachines[J].Neurocomputing,2015,149:1085-1091.
[11]Yuan-Hai Shao,Xiang-Yu Hua,Li-Ming Liu,Zhi-Min Yang,Nai-Yang
Deng.Combined outputs framework for twins supportvectormachines[J]
.AppliedIntelligence,2015,43(2):424-438.
[12] Wei little Li, Liu Jinghua, yellow shake green grass or young crops applications Raman spectrum and imaging technique observe α particle irradiations to Rhizopus oryzae
Role and influence [J] University of Anhui's journal (natural science edition), 2014,38 (1):66-72.
[13] Qiu Mengchan, Su Chengli, Zhong Guocai, beam Jianping are based on KICA-TSVR method cracking of ethylene product and are distributed soft survey
Amount [J] control engineerings, 2015,25 (33):458-464.
[14] Dong Hongbo, Shen Zhongjie, Yao Ya peak are based on underground coal mine rig bearing failure diagnosis [J] the collieries of TWSVM
Machinery, 2015,36 (5):298-300.
Claims (5)
1. a kind of network electric business borrows or lends money methods of risk assessment, it is characterised in that comprise the following steps:
Step 1, the data of collection history debt-credit client are used as sample set;The data of each debt-credit client include that n ties up evaluation index
Data and promise breaking label target, target=1 represent loan defaults, and target=0 represents normal refund;
Step 2, the data in sample set are pre-processed;The pretreatment includes that missing values treatment, nonnumeric type evaluation refer to
Mark data quantify and data standardization;
Step 3, the sample in sample set is divided into according to the value of promise breaking label target by two classes, the first kind is target=0's
Normal refund client, matrix is built by its corresponding evaluation index dataEquations of The Second Kind is target=1's
Loan defaults client, matrix is built by its corresponding evaluation index dataWherein m1 and m2 are respectively two classes
Sample size in sample,It is n-dimensional vector, represents i-th evaluation index of sample in kth class;
Step 4, based on two class samples, build the twin support vector cassification model of least square, and it is trained, set up
Network electric business borrows or lends money risk evaluation model;
Step 5, the network electric business debt-credit risk evaluation models according to 4 foundation, forecast sample concentrate the promise breaking label of each sample
target;
Step 6, the actual promise breaking mark collected for each sample, the promise breaking label target and step 1 obtained according to 5 predictions
Sign target and determine the predicated error of sample, and the size of each sample weights is determined according to the size of predicated error;Sample is weighed
Weight determination principle be:The larger sample weights of predicated error are smaller, and the less sample weights of predicated error are larger;
Step 7, based on each sample weights, build the twin support vector cassification model of weighted least-squares, re-establish net
Network electric business borrows or lends money risk evaluation model;
Step 8, the n dimension evaluation index data for gathering new debt-credit client, the network electric business that step 7 determination is substituted into after pretreatment are borrowed
Borrow risk evaluation model, be connected to the corresponding promise breaking label target of borrower, predict debt-credit client whether can loan defaults, with
This debt-credit risk is estimated.
2. network electric business according to claim 1 borrows or lends money methods of risk assessment, it is characterised in that the step 4 is specifically wrapped
Include following steps:
Step 4.1, the structure twin support vector cassification model of least square:
s.t.-(K(B,CT)w1+e2b1)=e2-η2
s.t.K(A,CT)w2+e1b2=e1-η1
Wherein, C1And C2It is punishment parameter, optimal value is selected according to ten folding cross validations;η1And η2It is loss variable;e1With
e21 column vector is all for element;C=[AT,BT]T,K(X,CT) it is gaussian kernel function, X=A or B;w1And w2It is weight vector, b1
And b2Presentation class threshold value;w1、w2、b1And b2It is parameter to be asked;
Step 4.2, w is asked for by below equation1、w2、b1And b2:
Wherein, U=[K (A, CT),e1], V=[K (B, CT),e2];
Step 4.3, according to fixed parameter set up with lower network electric business borrow or lend money risk model:
Wherein, x represents the n dimension evaluation index data of pretreated new reception client, and target (x)=0 represents that prediction should
New debt-credit client can normally refund, and target (x)=1 represents predicts that the new debt-credit client understands loan defaults.
3. network electric business according to claim 2 borrows or lends money methods of risk assessment, it is characterised in that in the step 6, sample
The computing formula of weight is:
Wherein, | ηki| (k=1,2) is i-th predicated error of sample in sample set kth class sample, and δ is | ηki| (k=1,2)
Standard deviation.
4. network electric business according to claim 3 borrows or lends money methods of risk assessment, it is characterised in that the step 7 is specifically wrapped
Include following steps:
Step 7.1, the structure twin support vector cassification model of weighted least-squares:
The twin support vector cassification model (WLS-TSVM1) of first weighted least-squares:
s.t.-(K(B,CT)w'1+e2b′1)=e2-η2
The twin support vector cassification model (WLS-TSVM2) of second weighted least-squares:
s.t.K(A,CT)w'2+e1b'2=e1-η1
Wherein, C '1And C'2It is punishment parameter, optimal value is selected according to ten folding cross validations;w′1And w'2It is weight vector, b '1
And b'2Presentation class threshold value;w′1、w'2、b′1And b'2It is parameter to be asked;
Step 7.1, w ' is asked for by below equation1、w'2、b′1、b'2:
Wherein, M=[K (A, CT),e1], N=[K (B, CT),e2];
WithIt is respectively with ρ1iAnd ρ2iIt is the diagonal matrix of the elements in a main diagonal;
Step 7.1, according to fixed parameter re-establish with lower network electric business borrow or lend money risk model:
Wherein, K (xT,CT) it is gaussian kernel function;X represents that the n of debt-credit client new after pre-processing ties up evaluation index data,
Target'(x)=0 represent predict the new debt-credit client can normally refund, target'(x)=1 represent predict the new debt-credit
Client may loan defaults.
5. a kind of network electric business debt-credit risk evaluation model, it is characterised in that the model is:
Wherein, x represents the n dimension evaluation index data of pretreated new debt-credit client, target'(x)=0 expression prediction should
New debt-credit client can normally refund, and this debt-credit risk is relatively low;Target'(x)=1 represent and predict new debt-credit client's meeting
Loan defaults, this debt-credit risk is higher;
The step of model parameter is by any one of Claims 1 to 4 1~7 is solved.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611265956.XA CN106779755A (en) | 2016-12-31 | 2016-12-31 | A kind of network electric business borrows or lends money methods of risk assessment and model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611265956.XA CN106779755A (en) | 2016-12-31 | 2016-12-31 | A kind of network electric business borrows or lends money methods of risk assessment and model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106779755A true CN106779755A (en) | 2017-05-31 |
Family
ID=58952506
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611265956.XA Pending CN106779755A (en) | 2016-12-31 | 2016-12-31 | A kind of network electric business borrows or lends money methods of risk assessment and model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106779755A (en) |
Cited By (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107633030A (en) * | 2017-09-04 | 2018-01-26 | 深圳市华傲数据技术有限公司 | Credit estimation method and device based on data model |
CN107633455A (en) * | 2017-09-04 | 2018-01-26 | 深圳市华傲数据技术有限公司 | Credit estimation method and device based on data model |
CN107798390A (en) * | 2017-11-22 | 2018-03-13 | 阿里巴巴集团控股有限公司 | A kind of training method of machine learning model, device and electronic equipment |
CN108009911A (en) * | 2017-11-29 | 2018-05-08 | 上海出版印刷高等专科学校 | A kind of method of identification P2P network loan borrower's default risks |
CN108229542A (en) * | 2017-12-21 | 2018-06-29 | 中智诚征信有限公司 | A kind of cycle debt-credit credit risk monitoring method based on Time-Series analysis technology |
CN108229422A (en) * | 2018-01-25 | 2018-06-29 | 浙江中控技术股份有限公司 | A kind of method and apparatus of image identification |
CN108256990A (en) * | 2017-07-25 | 2018-07-06 | 平安普惠企业管理有限公司 | Server, the method for indicating risk and computer readable storage medium |
CN108648068A (en) * | 2018-05-16 | 2018-10-12 | 长沙农村商业银行股份有限公司 | A kind of assessing credit risks method and system |
CN109360105A (en) * | 2018-09-18 | 2019-02-19 | 平安科技(深圳)有限公司 | Product risks method for early warning, device, computer equipment and storage medium |
CN109409677A (en) * | 2018-09-27 | 2019-03-01 | 深圳壹账通智能科技有限公司 | Enterprise Credit Risk Evaluation method, apparatus, equipment and storage medium |
CN109670940A (en) * | 2018-11-12 | 2019-04-23 | 深圳壹账通智能科技有限公司 | Credit Risk Assessment Model generation method and relevant device based on machine learning |
CN109685653A (en) * | 2019-01-21 | 2019-04-26 | 北京工业大学 | A method of fusion deepness belief network and the monitoring of the credit risk of isolated forest algorithm |
CN109961362A (en) * | 2019-02-19 | 2019-07-02 | 合肥工业大学 | P2P platform credit risk dynamic evaluation method and system |
CN110135700A (en) * | 2019-04-23 | 2019-08-16 | 北京淇瑀信息科技有限公司 | Credit Risk Assessment method and device based on expandtabs data |
CN110310199A (en) * | 2019-06-27 | 2019-10-08 | 上海上湖信息技术有限公司 | Borrow or lend money construction method, system and the debt-credit Risk Forecast Method of risk forecast model |
CN110349009A (en) * | 2019-07-02 | 2019-10-18 | 北京淇瑀信息科技有限公司 | A kind of bull debt-credit violation correction method, apparatus and electronic equipment |
CN110349012A (en) * | 2019-07-12 | 2019-10-18 | 腾讯科技(深圳)有限公司 | Data predication method and computer readable storage medium |
CN110364257A (en) * | 2019-07-18 | 2019-10-22 | 泰康保险集团股份有限公司 | People's vehicle Risk Forecast Method, device, medium and electronic equipment |
CN110378786A (en) * | 2019-07-29 | 2019-10-25 | 中国工商银行股份有限公司 | Model training method, promise breaking conduction Risk Identification Method, device and storage medium |
CN110610213A (en) * | 2019-09-20 | 2019-12-24 | 苏州大学 | Mail classification method, device, equipment and computer readable storage medium |
CN110619434A (en) * | 2019-09-17 | 2019-12-27 | 北谷电子有限公司上海分公司 | Method and device for predicting repayment probability, electronic equipment and storage medium |
CN110619024A (en) * | 2019-09-20 | 2019-12-27 | 苏州大学 | Credit evaluation method, system and related device |
CN110633804A (en) * | 2019-09-12 | 2019-12-31 | 深圳众赢维融科技有限公司 | Financial product incoming item evaluation method, server and storage medium |
CN110910002A (en) * | 2019-11-15 | 2020-03-24 | 安徽海汇金融投资集团有限公司 | Account receivable default risk identification method and system |
CN111275541A (en) * | 2020-01-14 | 2020-06-12 | 中信百信银行股份有限公司 | Borrower quality evaluation method and system based on multi-dimensional information, electronic device and computer readable storage medium |
CN111353882A (en) * | 2020-04-17 | 2020-06-30 | 新分享科技服务(深圳)有限公司 | Privatized deployment retail asset wind control method and device and electronic equipment |
CN111461876A (en) * | 2020-05-07 | 2020-07-28 | 赵玉洁 | E-commerce credit system management system and method based on big data |
CN111652713A (en) * | 2020-07-01 | 2020-09-11 | 中国银行股份有限公司 | Equity wind control modeling method and device |
CN111814989A (en) * | 2020-06-02 | 2020-10-23 | 西安工程大学 | Transformer fault diagnosis method for optimizing twin support vector machine based on locust algorithm |
CN111985796A (en) * | 2020-08-07 | 2020-11-24 | 华中科技大学 | Method for predicting concrete structure durability based on random forest and intelligent algorithm |
CN112016244A (en) * | 2020-08-07 | 2020-12-01 | 中国交通建设股份有限公司吉林省分公司 | Durable concrete multi-target mix proportion optimization method based on SVM and intelligent algorithm |
CN112598422A (en) * | 2020-12-23 | 2021-04-02 | 平安银行股份有限公司 | Transaction risk assessment method, system, device and storage medium |
CN112734555A (en) * | 2020-12-31 | 2021-04-30 | 南京视察者智能科技有限公司 | Design method of personal credit risk assessment model based on big data |
CN112785414A (en) * | 2021-01-04 | 2021-05-11 | 上海海事大学 | Credit risk prediction method based on knowledge graph and ontology inference engine |
CN112785116A (en) * | 2020-12-08 | 2021-05-11 | 精英数智科技股份有限公司 | Coal mine enterprise loan assessment method and device |
CN113793210A (en) * | 2021-09-17 | 2021-12-14 | 吉林亿联银行股份有限公司 | Method for evaluating network loan credit, related device and computer storage medium |
CN115730254A (en) * | 2022-12-06 | 2023-03-03 | 中电金信软件有限公司 | Method and device for expanding modeling sample data label |
CN117426774A (en) * | 2023-12-21 | 2024-01-23 | 深圳腾信百纳科技有限公司 | User emotion assessment method and system based on intelligent bracelet |
-
2016
- 2016-12-31 CN CN201611265956.XA patent/CN106779755A/en active Pending
Cited By (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108256990A (en) * | 2017-07-25 | 2018-07-06 | 平安普惠企业管理有限公司 | Server, the method for indicating risk and computer readable storage medium |
CN107633030B (en) * | 2017-09-04 | 2020-11-27 | 深圳市华傲数据技术有限公司 | Credit evaluation method and device based on data model |
CN107633455A (en) * | 2017-09-04 | 2018-01-26 | 深圳市华傲数据技术有限公司 | Credit estimation method and device based on data model |
CN107633030A (en) * | 2017-09-04 | 2018-01-26 | 深圳市华傲数据技术有限公司 | Credit estimation method and device based on data model |
CN107798390A (en) * | 2017-11-22 | 2018-03-13 | 阿里巴巴集团控股有限公司 | A kind of training method of machine learning model, device and electronic equipment |
CN107798390B (en) * | 2017-11-22 | 2023-03-21 | 创新先进技术有限公司 | Training method and device of machine learning model and electronic equipment |
CN108009911A (en) * | 2017-11-29 | 2018-05-08 | 上海出版印刷高等专科学校 | A kind of method of identification P2P network loan borrower's default risks |
CN108229542A (en) * | 2017-12-21 | 2018-06-29 | 中智诚征信有限公司 | A kind of cycle debt-credit credit risk monitoring method based on Time-Series analysis technology |
CN108229422A (en) * | 2018-01-25 | 2018-06-29 | 浙江中控技术股份有限公司 | A kind of method and apparatus of image identification |
CN108648068A (en) * | 2018-05-16 | 2018-10-12 | 长沙农村商业银行股份有限公司 | A kind of assessing credit risks method and system |
CN109360105A (en) * | 2018-09-18 | 2019-02-19 | 平安科技(深圳)有限公司 | Product risks method for early warning, device, computer equipment and storage medium |
CN109409677A (en) * | 2018-09-27 | 2019-03-01 | 深圳壹账通智能科技有限公司 | Enterprise Credit Risk Evaluation method, apparatus, equipment and storage medium |
WO2020062660A1 (en) * | 2018-09-27 | 2020-04-02 | 深圳壹账通智能科技有限公司 | Enterprise credit risk evaluation method, apparatus and device, and storage medium |
CN109670940A (en) * | 2018-11-12 | 2019-04-23 | 深圳壹账通智能科技有限公司 | Credit Risk Assessment Model generation method and relevant device based on machine learning |
CN109685653A (en) * | 2019-01-21 | 2019-04-26 | 北京工业大学 | A method of fusion deepness belief network and the monitoring of the credit risk of isolated forest algorithm |
CN109961362A (en) * | 2019-02-19 | 2019-07-02 | 合肥工业大学 | P2P platform credit risk dynamic evaluation method and system |
CN110135700A (en) * | 2019-04-23 | 2019-08-16 | 北京淇瑀信息科技有限公司 | Credit Risk Assessment method and device based on expandtabs data |
CN110310199A (en) * | 2019-06-27 | 2019-10-08 | 上海上湖信息技术有限公司 | Borrow or lend money construction method, system and the debt-credit Risk Forecast Method of risk forecast model |
CN110310199B (en) * | 2019-06-27 | 2022-03-18 | 上海上湖信息技术有限公司 | Method and system for constructing loan risk prediction model and loan risk prediction method |
CN110349009B (en) * | 2019-07-02 | 2024-01-26 | 北京淇瑀信息科技有限公司 | Multi-head lending default prediction method and device and electronic equipment |
CN110349009A (en) * | 2019-07-02 | 2019-10-18 | 北京淇瑀信息科技有限公司 | A kind of bull debt-credit violation correction method, apparatus and electronic equipment |
CN110349012A (en) * | 2019-07-12 | 2019-10-18 | 腾讯科技(深圳)有限公司 | Data predication method and computer readable storage medium |
CN110364257A (en) * | 2019-07-18 | 2019-10-22 | 泰康保险集团股份有限公司 | People's vehicle Risk Forecast Method, device, medium and electronic equipment |
CN110378786A (en) * | 2019-07-29 | 2019-10-25 | 中国工商银行股份有限公司 | Model training method, promise breaking conduction Risk Identification Method, device and storage medium |
CN110633804A (en) * | 2019-09-12 | 2019-12-31 | 深圳众赢维融科技有限公司 | Financial product incoming item evaluation method, server and storage medium |
CN110633804B (en) * | 2019-09-12 | 2022-06-24 | 简链科技(广东)有限公司 | Financial product incoming evaluation method, server and storage medium |
CN110619434A (en) * | 2019-09-17 | 2019-12-27 | 北谷电子有限公司上海分公司 | Method and device for predicting repayment probability, electronic equipment and storage medium |
CN110610213A (en) * | 2019-09-20 | 2019-12-24 | 苏州大学 | Mail classification method, device, equipment and computer readable storage medium |
CN110619024A (en) * | 2019-09-20 | 2019-12-27 | 苏州大学 | Credit evaluation method, system and related device |
CN110910002A (en) * | 2019-11-15 | 2020-03-24 | 安徽海汇金融投资集团有限公司 | Account receivable default risk identification method and system |
CN110910002B (en) * | 2019-11-15 | 2023-07-28 | 安徽海汇金融投资集团有限公司 | Account receivables default risk identification method and system |
CN111275541A (en) * | 2020-01-14 | 2020-06-12 | 中信百信银行股份有限公司 | Borrower quality evaluation method and system based on multi-dimensional information, electronic device and computer readable storage medium |
CN111353882A (en) * | 2020-04-17 | 2020-06-30 | 新分享科技服务(深圳)有限公司 | Privatized deployment retail asset wind control method and device and electronic equipment |
CN111461876A (en) * | 2020-05-07 | 2020-07-28 | 赵玉洁 | E-commerce credit system management system and method based on big data |
CN111814989A (en) * | 2020-06-02 | 2020-10-23 | 西安工程大学 | Transformer fault diagnosis method for optimizing twin support vector machine based on locust algorithm |
CN111652713B (en) * | 2020-07-01 | 2024-02-27 | 中国银行股份有限公司 | Equity wind control modeling method and device |
CN111652713A (en) * | 2020-07-01 | 2020-09-11 | 中国银行股份有限公司 | Equity wind control modeling method and device |
CN112016244A (en) * | 2020-08-07 | 2020-12-01 | 中国交通建设股份有限公司吉林省分公司 | Durable concrete multi-target mix proportion optimization method based on SVM and intelligent algorithm |
CN111985796A (en) * | 2020-08-07 | 2020-11-24 | 华中科技大学 | Method for predicting concrete structure durability based on random forest and intelligent algorithm |
CN111985796B (en) * | 2020-08-07 | 2021-06-18 | 华中科技大学 | Method for predicting concrete structure durability based on random forest and intelligent algorithm |
CN112785116A (en) * | 2020-12-08 | 2021-05-11 | 精英数智科技股份有限公司 | Coal mine enterprise loan assessment method and device |
CN112598422A (en) * | 2020-12-23 | 2021-04-02 | 平安银行股份有限公司 | Transaction risk assessment method, system, device and storage medium |
CN112734555A (en) * | 2020-12-31 | 2021-04-30 | 南京视察者智能科技有限公司 | Design method of personal credit risk assessment model based on big data |
CN112785414A (en) * | 2021-01-04 | 2021-05-11 | 上海海事大学 | Credit risk prediction method based on knowledge graph and ontology inference engine |
CN112785414B (en) * | 2021-01-04 | 2024-01-26 | 上海海事大学 | Credit risk prediction method based on knowledge graph and ontology inference engine |
CN113793210A (en) * | 2021-09-17 | 2021-12-14 | 吉林亿联银行股份有限公司 | Method for evaluating network loan credit, related device and computer storage medium |
CN115730254B (en) * | 2022-12-06 | 2023-10-13 | 中电金信软件有限公司 | Method and device for expanding modeling sample data label |
CN115730254A (en) * | 2022-12-06 | 2023-03-03 | 中电金信软件有限公司 | Method and device for expanding modeling sample data label |
CN117426774A (en) * | 2023-12-21 | 2024-01-23 | 深圳腾信百纳科技有限公司 | User emotion assessment method and system based on intelligent bracelet |
CN117426774B (en) * | 2023-12-21 | 2024-04-09 | 深圳腾信百纳科技有限公司 | User emotion assessment method and system based on intelligent bracelet |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106779755A (en) | A kind of network electric business borrows or lends money methods of risk assessment and model | |
Huang et al. | Enterprise credit risk evaluation based on neural network algorithm | |
Li et al. | Reject inference in credit scoring using semi-supervised support vector machines | |
Pai et al. | A support vector machine-based model for detecting top management fraud | |
De Jongh et al. | The impact of pre-selected variance in ation factor thresholds on the stability and predictive power of logistic regression models in credit scoring | |
CN106611375A (en) | Text analysis-based credit risk assessment method and apparatus | |
Ayyildiz et al. | Individual credit ranking by an integrated interval type-2 trapezoidal fuzzy Electre methodology | |
Gao | The use of machine learning combined with data mining technology in financial risk prevention | |
Muslim et al. | New model combination meta-learner to improve accuracy prediction P2P lending with stacking ensemble learning | |
CN108492001A (en) | A method of being used for guaranteed loan network risk management | |
Lee et al. | Evaluating borrowers’ default risk with a spatial probit model reflecting the distance in their relational network | |
Kanaparthi | AI-based personalization and trust in digital finance | |
Peat et al. | Using neural nets to combine information sets in corporate bankruptcy prediction | |
CN107590733A (en) | Platform methods of risk assessment is borrowed based on the net of geographical economy and social networks | |
Liu et al. | Credit scoring prediction leveraging interpretable ensemble learning | |
Yang et al. | An evidential reasoning rule-based ensemble learning approach for evaluating credit risks with customer heterogeneity | |
Lee et al. | Application of machine learning in credit risk scorecard | |
Nicula | Some aspects concerning the measurement of credit risk | |
Sharma et al. | A Logistic Regression Based Credit Risk Assessment Using WoE Bining and Enhanced Feature Engineering Approach ANOVA and Chi-Square | |
Guo et al. | Credit Default Prediction on Time-Series Behavioral Data Using Ensemble Models | |
Kalayci et al. | Early NPL Warning for SME Credit Risk: An Experimental Study. | |
Liu et al. | Theory and application of logistic-SVM two-stage hybrid model | |
Balina et al. | Assessing individual credit risk on the basis of discriminant analysis by Poland's cooperative banks | |
KR102588210B1 (en) | Apparatus and method for alternatively rating credit through the application of statistical tests to open data | |
CN117094817B (en) | Credit risk control intelligent prediction method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20170531 |
|
WD01 | Invention patent application deemed withdrawn after publication |