CN106960455A

CN106960455A - Orient transaudient method and terminal

Info

Publication number: CN106960455A
Application number: CN201710161239.0A
Authority: CN
Inventors: 姜瑜
Original assignee: Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Current assignee: Yulong Computer Telecommunication Scientific Shenzhen Co Ltd
Priority date: 2017-03-17
Filing date: 2017-03-17
Publication date: 2017-07-18

Abstract

The invention provides the transaudient method of one kind orientation and terminal, wherein, this method includes：The pixel coordinate of user's face characteristic point is obtained by the camera device of mobile terminal；According to the outer parameter acquiring of the intrinsic parameter of camera device and camera device world coordinates corresponding with the pixel coordinate of the user's face characteristic point；Wherein, the intrinsic parameter of camera device includes the optics and geometrical characteristic parameter of the camera device；The outer parameter of camera device is obtained using the camera device scaling method；Mobile terminal is controlled to transmit sound to the direction of the user's face characteristic point according to world coordinates.The relative distance and orientation between mobile terminal and user's head can not be obtained exactly in the prior art by being solved by the present invention, cause to orient the problem of transaudient precision is poor, and then improve the transaudient degree of accuracy of orientation.

Description

Orient transaudient method and terminal

Technical field

The present invention relates to communication technical field, and in particular to the transaudient method of one kind orientation and terminal.

Background technology

The principle of audio frequency directional early in begun to before half a century research, all achieved in theory and practice it is many into Really.In recent years, with the progress of society, the portable mobile equipment using mobile phone as representative is more and more applied to each social angle Fall.People often use audio and set when various occasions are operated the activity such as commercial affairs, amusement and leisure using mobile device It is standby.But traditional audio frequency apparatus can not ensure privacy in public using often causing noise, can be with one using earphone Determine to solve problem in degree, but it is unfavorable to human ear using earphone for a long time, hearing permanent damage can be caused.In such premise Under, the micro audio directional research based on audio frequency directional technology turns into the focus of attention of association area naturally.

Presently, for mobile device micro audio directional system also in the trial stage, association area is according to Some audio frequency directional achievements in research, it is proposed that some are used for the orientation acoustic transmission technique scheme of mobile device, but right in the prior art Setting accuracy between user's head and mobile device is not high.

The content of the invention

In view of this, the embodiments of the invention provide the transaudient method of one kind orientation and terminal, to solve in the prior art not The relative distance and orientation between mobile terminal and user's head can be obtained exactly, and then cause to orient transaudient precision poor Problem.

Therefore, the embodiments of the invention provide following technical scheme：

First aspect present invention orients transaudient method there is provided one kind, including：Obtained by the camera device of mobile terminal The pixel coordinate of user's face characteristic point；According to the outer parameter acquiring of the intrinsic parameter of the camera device and the camera device with The corresponding world coordinates of pixel coordinate of the user's face characteristic point；Wherein, the intrinsic parameter of the camera device includes described The optics and geometrical characteristic parameter of camera device；The outer parameter of the camera device is obtained using the camera device scaling method Take；The mobile terminal is controlled to transmit sound to the direction of the user's face characteristic point according to the world coordinates.

With reference to first aspect present invention, in first aspect present invention first embodiment, according in the camera device Parameter and the outer parameter acquiring of the camera device world coordinates bag corresponding with the pixel coordinate of the user's face characteristic point Include：Camera device coordinate corresponding with the pixel coordinate is obtained according to the intrinsic parameter of the camera device；According to the shooting The outer parameter acquiring of the device world coordinates corresponding with the camera device coordinate.

In first aspect present invention first embodiment, first aspect present invention second embodiment, the camera device The corresponding Intrinsic Matrix of intrinsic parameter be expressed as：Wherein, k_xIt is expressed as X-axis amplification coefficient, k_yRepresent Y Axle amplification coefficient, (u_o, v_o) represent the origin of image coordinate system corresponding with the user's face characteristic point in pixel coordinate system institute Corresponding coordinate；The corresponding outer parameter matrix of outer parameter of the camera device is expressed as：Wherein, ComponentRepresent each reference axis of world coordinate system in shooting respectively Direction vector under device coordinate system,For motion vector, represent world coordinate system origin in camera device Position in coordinate system.

In first aspect present invention second embodiment, the embodiment of first aspect present invention the 3rd, pass through equation below Camera device coordinate corresponding with the pixel coordinate is obtained according to the intrinsic parameter of the camera device：

By equation below according to the outer parameter acquiring of the camera device is corresponding with the camera device coordinate World coordinates：

Wherein, world coordinate system：(x_W, y_W, z_W)；Camera device coordinate system：(x_C, y_C, z_C), figure As coordinate system：(x, y), pixel coordinate system：(u, v).

With reference to first aspect present invention, first aspect first embodiment, first aspect second embodiment or first It is described mobile whole according to world coordinates control in the embodiment of aspect the 3rd, the embodiment of first aspect present invention the 4th Hold to after the direction transmission sound of the user's face characteristic point, also include：Obtained by the sensor of the mobile terminal Running parameter of the user's face characteristic point relative to the mobile terminal；Obtained and the user in real time according to the running parameter The corresponding world coordinates of pixel coordinate of face feature point.

Second aspect present invention orients transaudient terminal there is provided one kind, including：First acquisition module, for passing through movement The camera device of terminal obtains the pixel coordinate of user's face characteristic point；Second acquisition module, for according to the camera device Intrinsic parameter and corresponding with the pixel coordinate of the user's face characteristic point world of outer parameter acquiring of the camera device sit Mark；Wherein, the intrinsic parameter of the camera device includes the optics and geometrical characteristic parameter of the camera device；The camera device Outer parameter using the camera device demarcation terminal obtain；Transport module, for being moved according to world coordinates control is described Dynamic terminal transmits sound to the direction of the user's face characteristic point.

With reference to second aspect present invention, in second aspect present invention first embodiment, second acquisition module includes： First acquisition unit, sits for obtaining camera device corresponding with the pixel coordinate according to the intrinsic parameter of the camera device Mark；Second acquisition unit, for the outer parameter acquiring according to the camera device it is corresponding with the camera device coordinate described in World coordinates.

With reference to second aspect present invention first embodiment, in second aspect present invention second embodiment, the shooting The corresponding Intrinsic Matrix of intrinsic parameter of device is expressed as：Wherein, k_xIt is expressed as X-axis amplification coefficient, k_yTable Show Y-axis amplification coefficient, (u_o, v_o) represent the origin of image coordinate system corresponding with the user's face characteristic point in pixel coordinate The corresponding coordinate of system；The corresponding outer parameter matrix of outer parameter of the camera device is expressed as：Its In, componentRepresent that each reference axis of world coordinate system is being taken the photograph respectively As the direction vector under device coordinate system,For motion vector, represent that world coordinate system origin is filled in shooting Put the position in coordinate system.

With reference to second aspect present invention second embodiment, in the embodiment of second aspect present invention the 3rd, described first Acquiring unit is additionally operable to obtain take the photograph corresponding with the pixel coordinate according to the intrinsic parameter of the camera device by equation below As device coordinate：

The second acquisition unit be additionally operable to by equation below according to the outer parameter acquiring of the camera device with it is described The corresponding world coordinates of camera device coordinate：

With reference to second aspect present invention, second aspect first embodiment, second aspect second embodiment or second In the embodiment of aspect the 3rd, the embodiment of second aspect present invention the 4th, the terminal also includes：3rd acquisition module, is used After being controlled the mobile terminal to transmit sound to the direction of the user's face characteristic point according to the world coordinates, pass through The sensor of the mobile terminal obtains running parameter of the user's face characteristic point relative to the mobile terminal；4th obtains mould Block, for obtaining world coordinates corresponding with the pixel coordinate of the user's face characteristic point in real time according to the running parameter.

Third aspect present invention provides a kind of electronic equipment, including：At least one processor；And with described at least one The memory of individual processor communication connection；Wherein, have can be by the instruction of one computing device, institute for the memory storage Instruction is stated by least one described computing device, so that at least one described computing device following steps：By mobile whole The camera device at end obtains the pixel coordinate of user's face characteristic point；Filled according to the intrinsic parameter of the camera device and the shooting The outer parameter acquiring put world coordinates corresponding with the pixel coordinate of the user's face characteristic point；Wherein, the camera device Intrinsic parameter include the optics and geometrical characteristic parameter of the camera device；The outer parameter of the camera device uses the shooting Device normalization method is obtained；The mobile terminal is controlled to be passed to the direction of the user's face characteristic point according to the world coordinates Defeated sound.

Embodiment of the present invention technical scheme, has the following advantages that：

The embodiments of the invention provide the transaudient method of one kind orientation and terminal, obtained and used by the camera device of mobile terminal The pixel coordinate of family face feature point；According to the outer parameter acquiring of the intrinsic parameter of camera device and camera device and the user's face The corresponding world coordinates of pixel coordinate of characteristic point；Wherein, the optics of the intrinsic parameter of camera device including the camera device and several What characteristic parameter, optics and geometric properties such as can be focal length, image center；The outer parameter use of camera device this take the photograph As device normalization method is obtained, the scaling method can be the scaling method based on two-dimensional target, can be with by the world coordinates Further be accurately positioned the distance between mobile terminal and user's head and orientation, according to world coordinates control mobile terminal to this The direction transmission sound of user's face characteristic point.Mobile terminal can not be obtained exactly in the prior art by being solved by the present invention Relative distance and orientation between user's head, cause to orient the problem of transaudient precision is poor, and then it is transaudient to improve orientation The degree of accuracy.

Brief description of the drawings

, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical scheme of the prior art The accompanying drawing to be used needed for embodiment or description of the prior art is briefly described, it should be apparent that, in describing below Accompanying drawing is some embodiments of the present invention, for those of ordinary skill in the art, before creative work is not paid Put, other accompanying drawings can also be obtained according to these accompanying drawings.

Fig. 1 shows the structure chart of mobile phone in the embodiment of the present invention；

Fig. 2 is the flow chart of the transaudient method of orientation according to embodiments of the present invention；

Fig. 3 is the transaudient schematic flow sheet of the adaptive directionality based on image ranging according to embodiments of the present invention；

Fig. 4 is national forest park in Xiaokeng schematic diagram according to embodiments of the present invention；

Fig. 5 is image coordinate system according to embodiments of the present invention and pixel coordinate system relation schematic diagram；

Fig. 6 is a structured flowchart of the transaudient terminal of orientation according to embodiments of the present invention；

Fig. 7 is the structured flowchart of the second acquisition module according to embodiments of the present invention；

Fig. 8 is another structured flowchart of the transaudient terminal of orientation according to embodiments of the present invention.

Embodiment

To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is A part of embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, those skilled in the art are not having There is the every other embodiment made and obtained under the premise of creative work, belong to the scope of protection of the invention.

As shown in figure 1, being the application scenarios schematic diagram of embodiments of the invention.Mobile terminal can be mobile phone or flat board electricity The mobile devices such as brain, mobile terminal is by taking mobile phone as an example, and the part-structure block diagram of mobile phone is as shown in figure 1, mobile phone includes radio circuit 210th, memory 220, input block 230, display unit 240, sensor 250, voicefrequency circuit 260, wireless module 270, processing Device 280 and the grade of power supply 290 part.It will be understood by those skilled in the art that the handset structure shown in Fig. 1 does not constitute opponent The restriction of machine, can be included than illustrating more or less parts, either combine some parts or different parts arrangement.

Wherein RF circuits 210 be used for receive and send messages or communication process in, the reception and transmission of signal.Memory 220 is used for Software program and module are stored, processor 280 is stored in the software program and module of memory 220 by operation, so that Perform various function application and the data processing of mobile phone.Input block 230 is used for the numeral or character information for receiving input, with And the key signals that generation is set with the user of mobile phone and function control is relevant are inputted.Input block 230 may include contact panel 231 and other input equipments 232.Other input equipments 232 can include but is not limited to physical keyboard, function key, mouse, behaviour Make the one or more in bar.Display unit 240 be used for show by user input information or be supplied to user information and The various menus of mobile phone.Display unit 240 can include display panel 241.Contact panel 231 can cover display panel 241, when Contact panel 231 is detected after the touch operation on or near it, sends processor 280 to determine the class of touch event Type, corresponding visual output is provided with preprocessor 280 according to the type of touch event on display panel 241.

Mobile phone may also include at least one sensor 250, such as optical sensor, motion sensor and other sensors.Light Sensor may include ambient light sensor and proximity transducer, and environmental sensor can adjust display according to the light and shade of ambient light The brightness of panel 241, proximity transducer can close display panel 241 and/or backlight when mobile phone is moved in one's ear.This implementation Optical sensor can be arranged on the housing of the front and back of mobile phone in example, blocked area during for detecting that user holds mobile phone Domain.Pressure sensor can also be included herein, be arranged in the front of mobile phone or back housing, for the side by detecting pressure Formula obtains occlusion area during user's grip mobile phone.In addition, mobile phone can also configure gyroscope, barometer, hygrometer, temperature The other sensors such as meter, infrared ray sensor, are repeated no more.

Voicefrequency circuit 260, loudspeaker 261, microphone 262 can provide the COBBAIF between user and mobile phone.Wireless mould Block 270 can be WIFI module, provide the user wireless the Internet access service.

Processor 280 is the control centre of mobile phone, using various interfaces and the various pieces of connection whole mobile phone, is led to Cross operation or perform and be stored in software program and/or module in memory 220, and call and be stored in memory 220 Data, perform the various functions and processing data of mobile phone, so as to carry out integral monitoring to mobile phone.Optionally, processor 280 can be with Including one or more processing units.In addition, mobile phone also includes the power supply 290 powered of each part, by power-supply management system with Processor 280 is logically contiguous, so as to realize the functions such as management charging, electric discharge and power managed by power-supply management system.

Although not shown, mobile phone can also include camera, bluetooth module etc., will not be repeated here.

A kind of transaudient method of orientation is provided in the present embodiment, available for above-mentioned mobile terminal, such as mobile phone, flat board electricity Brain etc., Fig. 2 is the flow chart of the transaudient method of orientation according to embodiments of the present invention, as shown in Fig. 2 the flow includes following step Suddenly：

Step S201, the pixel coordinate of user's face characteristic point is obtained by the camera device of mobile terminal.

Step S202, according to the outer parameter acquiring of the intrinsic parameter of camera device and camera device and the user's face characteristic point The corresponding world coordinates of pixel coordinate；Wherein, the intrinsic parameter of camera device includes the optics and geometric properties ginseng of camera device Number, optics and geometric properties such as can be focal length, image center；The outer parameter of the camera device uses the camera device Scaling method is obtained, and the scaling method can be the scaling method based on two-dimensional target.And then can be obtained by the world coordinates The distance between mobile terminal and user's head and orientation are got, control mobile terminal transmits sound according to the range-azimuth.

Step S203, controls mobile terminal to transmit sound to the direction of user's face characteristic point according to world coordinates.With The gradually maturation of ultrasonic directional acoustic transmission technique, and relevant device develop to miniaturization, cost degradation, specific to mobile terminal Directional loudspeaker module, can using ultrasonic directional acoustical generator as directional loudspeaker, used cooperatively with conventional speakers, It is ultimately oriented transaudient.

By above-mentioned steps, on the basis of the pixel coordinate of user's face characteristic point is got, according to camera device The outer parameter acquiring of intrinsic parameter and camera device world coordinates corresponding with the pixel coordinate of the user's face characteristic point, by obtaining The world coordinates got can be with the distance between more accurate positioning mobile terminal and user's head and orientation, and then can control Mobile terminal processed is according to the localized delivery sound, so as to ensure that the transaudient degree of accuracy of mobile terminal directional.

Above-mentioned steps S202 is related to the outer parameter acquiring and user plane of the intrinsic parameter and camera device according to camera device The corresponding world coordinates of pixel coordinate of portion's characteristic point, in one alternate embodiment, is obtained according to the intrinsic parameter of camera device Camera device coordinate corresponding with the pixel coordinate, it is corresponding with the camera device coordinate according to the outer parameter acquiring of camera device The world coordinates.

In one alternate embodiment, the corresponding Intrinsic Matrix of the intrinsic parameter of camera device is expressed as：Wherein, k_xIt is expressed as X-axis amplification coefficient, k_yRepresent Y-axis amplification coefficient, (u_o, v_o) represent special with the user's face The origin of a little corresponding image coordinate system is levied in the coordinate corresponding to pixel coordinate system；The corresponding outer parameter of outer parameter of camera device Matrix is expressed as：Wherein, component Direction vector of each reference axis of world coordinate system under camera device coordinate system is represented respectively,For displacement to Amount, represents position of the world coordinate system origin in camera device coordinate system.Specifically, filled by equation below according to the shooting The intrinsic parameter put obtains camera device coordinate corresponding with the pixel coordinate：

Sat by equation below according to outer parameter acquiring world corresponding with the camera device coordinate of the camera device Mark：

Because user is during using mobile terminal call, continuous mobile status is likely to be at, appearance is often converted Gesture, so as to cause the distance between mobile terminal and user's head and direction to change, therefore, in an alternative embodiment In, according to world coordinates control direction from the mobile terminal to user's face characteristic point transmit sound after, pass through mobile terminal Sensor obtain running parameter of the user's face characteristic point relative to mobile terminal, and the parameter that these have changed is accused in time Know the transaudient module of orientation, so that the latter restarts image ranging, reacquire the position of user's head, so as to change distance ginseng Number so that the directional loudspeaker of mobile terminal changes transaudient direction and the distance purpose transaudient to reach self adaptation.

In a complete orientation process of carrying sound, as shown in figure 3, using Identification of Images with being surveyed based on monocular image sequence Range-azimuth of the user's head with respect to mobile phone is determined away from the mode being combined.In the state of initialization, mobile phone is used Camera IMAQ is carried out to user's face and side face part, generate initial user's head feature database to calculate. Generate behind initial characteristicses storehouse, when user needs to use orientation transaudient, mobile phone opens the camera just to user, carries out quadratic diagram As collection, determine that user is now placed in the image range of collection using face recognition technology, ensure that user is in orientation with this In transaudient effective range.The image after collection and feature database are contrasted again, using recognition result differentiate the sex of active user with Substantially (because the people of different sexes and different age group, ear opposite faces position is different, therefore is oriented transaudient the age Module needs to provide the transaudient scheme for adaptation according to specifically used people) to determine transaudient scheme.

Process on obtaining the distance between mobile terminal and user's head and orientation by image ranging, can at one Select in embodiment, after carrying out secondary image collection and successfully identifying user, a crucial step is to utilize camera collection Image carry out ranging, calculate the distance between mobile phone and user's head and orientation.Mobile phone of today, generally equipped with high score The camera of resolution, this alternative embodiment uses monocular image sequence ranging technology, the high-definition image meter gathered using camera Calculate the relative distance and orientation between mobile phone and user's head.

Reconstructing three-dimensional scenic by two dimensional image needs to use four coordinate systems, is respectively：

World coordinate system：(X_W, Y_W, Z_W)；

Camera coordinate system：(X_C, Y_C, Z_C), wherein Z_CAxle and optical axis coincidence, whole coordinate system meet the right-hand rule；

Image coordinate system：(x, y), wherein coordinate origin are the intersection point of optical axis and imaging plane；

Pixel coordinate system：(u, v), wherein coordinate origin are the upper left corner of image, and u axles are computer screen x-axis (water It is flat), v axles are computer screen y-axis (vertical).

The monocular distance-finding method model of this alternative embodiment can be reduced to national forest park in Xiaokeng, as shown in Figure 4.O_CIt is to take the photograph As the photocentre of head, Π_iiIt is imaging plane.In national forest park in Xiaokeng, on imaging plane as being to turn upside down with actual object, But the photo of actual photographed, scaling has been carried out to image and has been adjusted with direction.Therefore can be by Π_iIt is considered equivalent imaging Plane.

O_CIt is the photocentre of camera, while being also the origin of camera coordinate system.Assuming that point P (the such as users of certain in scene Face some characteristic point) coordinate be P (x, y, z), P points are in imaging plane

Π_iOn subpoint be P_i(x_i, y_i, z_i), then it can obtain under camera coordinate system between target point and imaging point Relation (relation 1), and the internal reference exponential model of camera is then described in target point and pixel coordinate system in camera coordinate system Transformational relation between subpoint.

Image coordinate system xoy and pixel coordinate system uov are distributed in the plane of delineation, after point (x, the y) correspondence in the former (u, v) in person, and the position in the origin respective pixel coordinate system of image coordinate system is designated as (u_o, v_o), its relation such as Fig. 5 institutes Show.Being tied to pixel coordinate system progress coordinate transform from image coordinate needs the conversion in physical quantity, can make α_xAnd α_yRepresent from Imaging plane to pixel planes amplification coefficient, otherwise can also be interpreted as 1/ α_xWith 1/ α_yRepresent respectively each in X and Y-direction The actual physical size that pixel is represented.It can then respectively obtain between the point in point and the pixel coordinate system in image coordinate system Transformational relation (relation 2).

Relation 2 is combined with relation 1, Intrinsic Matrix only relevant with camera this body structure can be obtainedIt is designated as M_in, four parameters for constituting this matrix are respectively x-axis amplification coefficient k_x, y-axis amplification coefficient k_yAnd figure The origin of picture coordinate system is in the corresponding coordinate points of pixel coordinate system to (u_o, v_o).Also due in the presence of four parameters, therefore M_inAgain It is referred to as four parameter model.In the difference between not considering the respective amplification coefficient of xy axles, k can be approximately considered_x=k_y=k.

There is internal reference exponential model, accordingly also there is outer parameter model.It is used to describe world coordinate system and camera coordinate system Transformational relation.If certain coordinate of point in world coordinate system is (x_w, y_w, z_w), the coordinate in camera coordinate system is (x_c, y_c, z_c), there is transition matrix between the twoIt is designated as^cM_W, it is the outer parameter matrix of camera, represents world coordinates It is O_WX_WY_WZ_WWith camera coordinate system O_cX_cY_cZ_cRelative position.ComponentWorld coordinate system O is represented respectively_WX_WY_WZ_WEach reference axis exists Camera coordinate system O_cX_cY_cZ_cUnder direction vector,It is motion vector, represents that world coordinate system origin exists Position in camera coordinate system.

According to above-mentioned camera inside and outside parameter model, pixel can be obtained by forgoing relationship and intrinsic parameter model matrix and sat Relation between mark system and camera coordinate system：

And according to outer parameter model matrix, then can obtain the relation between world coordinate system and camera coordinate system：

The matrix constituted by the point of identical part, i.e. camera coordinate system in formula (1) and formula (2), can be by two Formula simultaneous, then can finally obtain the relation of brief introduction between world coordinate system and pixel coordinate system：

Formula (3) carries out coordinate transformation between providing the subpoint P ' in spatial scene in any point P and digital picture Function.(u, v) on the left of formula is the user's face characteristic point coordinate value on the image that camera is shot, and passes through inside and outside parameter The computing of participation, you can obtain user's face characteristic point (can also regard whole mobile phone as one point, together with mobile phone camera When, for convenience of understand, user's head is also reduced to a point here) for origin coordinate system in coordinate value (x_W, y_W, z_W)。 Further according to European geometric space range formulaThe sky between mobile phone and user's head can be finally given Between actual range d.

By setting up the model of image ranging, the image ranging gathered based on mobile phone camera can be completed, measure is based on Locus where mobile phone, user's head, and then range-azimuth between the two can be calculated.Distance parameter etc. is passed Pass directional loudspeaker module, you can realize that orientation is transaudient.

A kind of transaudient terminal of orientation is additionally provided in the present embodiment, and the device is used to realize above-described embodiment and preferred real Mode is applied, repeating no more for explanation had been carried out.As used below, term " module " can realize the soft of predetermined function The combination of part and/or hardware.Although the device described by following examples is preferably realized with software, hardware, or The realization of the combination of software and hardware is also that may and be contemplated.

Fig. 6 is a structured flowchart of the transaudient terminal of orientation according to embodiments of the present invention, as shown in fig. 6, including：First Acquisition module 61, the pixel coordinate of user's face characteristic point is obtained for the camera device by mobile terminal；Second obtains mould Block 62, the pixel coordinate for the outer parameter acquiring and user's face characteristic point of the intrinsic parameter according to camera device and camera device Corresponding world coordinates；Wherein, the intrinsic parameter of camera device includes the optics and geometrical characteristic parameter of camera device；Camera device Outer parameter using camera device demarcation terminal obtain；Transport module 63, for according to world coordinates control mobile terminal to this The direction transmission sound of user's face characteristic point.

By above-mentioned terminal, on the basis of the pixel coordinate of user's face characteristic point is got, according to camera device The outer parameter acquiring of intrinsic parameter and camera device world coordinates corresponding with the pixel coordinate of the user's face characteristic point, by obtaining The world coordinates got can be with the distance between more accurate positioning mobile terminal and user's head and orientation, and then can control Mobile terminal processed is according to the localized delivery sound, so as to ensure that the transaudient degree of accuracy of mobile terminal directional.

Fig. 7 is the structured flowchart of the second acquisition module according to embodiments of the present invention, as shown in fig. 7, the second acquisition module 62 include：First acquisition unit 621, sits for obtaining camera device corresponding with pixel coordinate according to the intrinsic parameter of camera device Mark；Second acquisition unit 622, sits for outer parameter acquiring world corresponding with camera device coordinate according to camera device Mark.

Alternatively, the corresponding Intrinsic Matrix of the intrinsic parameter of camera device is expressed as：Wherein, k_xTable It is shown as X-axis amplification coefficient, k_yRepresent Y-axis amplification coefficient, (u_o, v_o) represent image coordinate corresponding with the user's face characteristic point The origin of system is in the coordinate corresponding to pixel coordinate system；The corresponding outer parameter matrix of outer parameter of the camera device is expressed as：Wherein, componentThe world is represented respectively Direction vector of each reference axis of coordinate system under camera device coordinate system,For motion vector, the world is represented Position of the coordinate origin in camera device coordinate system.

Alternatively, the first acquisition unit is additionally operable to be obtained with being somebody's turn to do according to the intrinsic parameter of the camera device by equation below The corresponding camera device coordinate of pixel coordinate：

The second acquisition unit is additionally operable to be filled with the shooting according to the outer parameter acquiring of the camera device by equation below Put the corresponding world coordinates of coordinate：

Fig. 8 is another structured flowchart of the transaudient terminal of orientation according to embodiments of the present invention, as shown in figure 8, the terminal Also include：3rd acquisition module 81, for controlling the mobile terminal to the side of the user's face characteristic point according to the world coordinates To after transmission sound, change of the user's face characteristic point relative to the mobile terminal is obtained by the sensor of mobile terminal and joined Number；4th acquisition module 82, for obtaining corresponding with the pixel coordinate of user's face characteristic point in real time according to the running parameter World coordinates.

The transaudient terminal of orientation in the present embodiment is presented in the form of functional unit, and unit here refers to ASIC electricity Road, performs one or more softwares or the processor and memory of fixed routine, and/or other can provide the device of above-mentioned functions Part.

The further function description of above-mentioned modules is identical with above-mentioned correspondence embodiment, will not be repeated here.

It is to lead to it will be understood by those skilled in the art that realizing all or part of flow in above-described embodiment method Cross computer program to instruct the hardware of correlation to complete, described program can be stored in a computer read/write memory medium In, the program is upon execution, it may include such as the flow of the embodiment of above-mentioned each method.Wherein, described storage medium can be magnetic Dish, CD, read-only memory (ROM) or random access memory (RAM) etc..

Although being described in conjunction with the accompanying embodiments of the invention, those skilled in the art can not depart from the present invention Spirit and scope in the case of various modification can be adapted and modification, such modification and modification are each fallen within by appended claims institute Within the scope of restriction.

Claims

1. the transaudient method of one kind orientation, it is characterised in that including：

The pixel coordinate of user's face characteristic point is obtained by the camera device of mobile terminal；

According to the outer parameter acquiring of the intrinsic parameter of the camera device and the camera device and the user's face characteristic point The corresponding world coordinates of pixel coordinate；Wherein, the intrinsic parameter of the camera device includes the optics and geometry of the camera device Characteristic parameter；The outer parameter of the camera device is obtained using the camera device scaling method；

The mobile terminal is controlled to transmit sound to the direction of the user's face characteristic point according to the world coordinates.

2. according to the method described in claim 1, it is characterised in that filled according to the intrinsic parameter of the camera device and the shooting The outer parameter acquiring put world coordinates corresponding with the pixel coordinate of the user's face characteristic point includes：

Camera device coordinate corresponding with the pixel coordinate is obtained according to the intrinsic parameter of the camera device；

According to the outer parameter acquiring of the camera device world coordinates corresponding with the camera device coordinate.

3. method according to claim 2, it is characterised in that the corresponding Intrinsic Matrix of intrinsic parameter of the camera device It is expressed as：Wherein, k_xIt is expressed as X-axis amplification coefficient, k_yRepresent Y-axis amplification coefficient, (u_o, v_o) represent with The origin of the corresponding image coordinate system of the user's face characteristic point is in the coordinate corresponding to pixel coordinate system；

The corresponding outer parameter matrix of outer parameter of the camera device is expressed as：Wherein, componentRepresent each reference axis of world coordinate system in camera device respectively Direction vector under coordinate system,For motion vector, represent world coordinate system origin in camera device coordinate Position in system.

4. method according to claim 3, it is characterised in that pass through intrinsic parameter of the equation below according to the camera device Obtain camera device coordinate corresponding with the pixel coordinate：

[\begin{matrix} u \\ v \\ 1 \end{matrix}] = M_{i n} [\begin{matrix} x_{c} / z_{c} \\ y_{c} / z_{c} \\ 1 \\ 1 / z_{c} \end{matrix}];

Pass through outer parameter acquiring with the camera device coordinate corresponding world of the equation below according to the camera device Coordinate：

Wherein, world coordinate system：(x_W, y_W, z_W)；Camera device coordinate system：(x_C, y_C, z_C), image coordinate System：(x, y), pixel coordinate system：(u, v).

5. according to any described method in Claims 1-4, it is characterised in that moved according to world coordinates control is described After dynamic terminal transmits sound to the direction of the user's face characteristic point, also include：

Running parameter of the user's face characteristic point relative to the mobile terminal is obtained by the sensor of the mobile terminal；

World coordinates corresponding with the pixel coordinate of the user's face characteristic point is obtained according to the running parameter in real time.

6. the transaudient terminal of one kind orientation, it is characterised in that including：

First acquisition module, the pixel coordinate of user's face characteristic point is obtained for the camera device by mobile terminal；

Second acquisition module, for the intrinsic parameter according to the camera device and the camera device outer parameter acquiring with it is described The corresponding world coordinates of pixel coordinate of user's face characteristic point；Wherein, the intrinsic parameter of the camera device includes the shooting The optics and geometrical characteristic parameter of device；The outer parameter of the camera device is obtained using camera device demarcation terminal；

Transport module, for controlling the mobile terminal to be passed to the direction of the user's face characteristic point according to the world coordinates Defeated sound.

7. terminal according to claim 6, it is characterised in that second acquisition module includes：

First acquisition unit, for obtaining camera device corresponding with the pixel coordinate according to the intrinsic parameter of the camera device Coordinate；

Second acquisition unit, for the outer parameter acquiring according to the camera device it is corresponding with the camera device coordinate described in World coordinates.

8. terminal according to claim 7, it is characterised in that the corresponding Intrinsic Matrix of intrinsic parameter of the camera device It is expressed as：Wherein, k_xIt is expressed as X-axis amplification coefficient, k_yRepresent Y-axis amplification coefficient, (u_o, v_o) represent with The origin of the corresponding image coordinate system of the user's face characteristic point is in the coordinate corresponding to pixel coordinate system；

9. terminal according to claim 8, it is characterised in that the first acquisition unit is additionally operable to by equation below root Camera device coordinate corresponding with the pixel coordinate is obtained according to the intrinsic parameter of the camera device：

[\begin{matrix} u \\ v \\ 1 \end{matrix}] = M_{i n} [\begin{matrix} x_{c} / z_{c} \\ y_{c} / z_{c} \\ 1 \\ 1 / z_{c} \end{matrix}];

The second acquisition unit is additionally operable to the outer parameter acquiring according to the camera device and the shooting by equation below The corresponding world coordinates of device coordinate：

10. according to any described terminal in claim 6 to 9, it is characterised in that the terminal also includes：

3rd acquisition module, for controlling the mobile terminal to the side of the user's face characteristic point according to the world coordinates To after transmission sound, user's face characteristic point is obtained relative to the mobile terminal by the sensor of the mobile terminal Running parameter；

4th acquisition module, for obtaining the pixel coordinate pair with the user's face characteristic point in real time according to the running parameter The world coordinates answered.