CN108388399A - The method of state management and system of virtual idol - Google Patents

The method of state management and system of virtual idol Download PDF

Info

Publication number
CN108388399A
CN108388399A CN201810032045.5A CN201810032045A CN108388399A CN 108388399 A CN108388399 A CN 108388399A CN 201810032045 A CN201810032045 A CN 201810032045A CN 108388399 A CN108388399 A CN 108388399A
Authority
CN
China
Prior art keywords
state
virtual idol
virtual
idol
technical ability
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810032045.5A
Other languages
Chinese (zh)
Other versions
CN108388399B (en
Inventor
秦萌萌
贾志强
俞晓君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Virtual Point Technology Co Ltd
Original Assignee
Beijing Guangnian Wuxian Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Guangnian Wuxian Technology Co Ltd filed Critical Beijing Guangnian Wuxian Technology Co Ltd
Priority to CN201810032045.5A priority Critical patent/CN108388399B/en
Publication of CN108388399A publication Critical patent/CN108388399A/en
Application granted granted Critical
Publication of CN108388399B publication Critical patent/CN108388399B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present invention provides a kind of method of state management of virtual idol, and virtual idol has specific image characteristics, and is demonstrated out by hologram device, and method includes:Obtain multi-modal input;The intention in multi-modal input or operation are parsed, to obtain instructing for the conversion intention of condition conversion or conversion;Convert the current state of virtual idol to the new state for the virtual idol that conversion is intended to or conversion instruction indicates;New state includes:Open virtual idol required ability or technical ability module under new state.The method of state management and system of virtual idol provided by the invention provide a kind of virtual idol, can pass through the multi-modal interaction of holographic imaging completion and user.In addition, virtual idol provided by the invention also includes various states, for example, halted state, audio output state, waiting recording state, recording state, standby mode and technical ability open state, and the present invention can also be managed the state of virtual idol, improve the interactive experience of user.

Description

The method of state management and system of virtual idol
Technical field
The present invention relates to artificial intelligence fields, specifically, being related to a kind of method of state management and system of virtual idol.
Background technology
The exploitation of robot chat interactive system is dedicated to imitating human conversation.The relatively more extensive chat machine of early stage application People's application program includes the received input of the processing such as siri chat robots on small i chat robots or iPhone (including text or voice) and corresponding response is made according to input, to attempt to imitate the friendship between the mankind between context Mutually.
But at present for, for the relevant robot of virtual idol chat interactive system exploitation it is also less perfect, still Do not go out to be now able to carry out multi-modal interaction with user and the product of multi-modal interaction that virtual idol state can be managed.
Therefore, the present invention provides a kind of method of state management and system of virtual idol.
Invention content
To solve the above problems, the present invention provides a kind of method of state management of virtual idol, the virtual idol tool There are specific image characteristics, and be demonstrated out by hologram device, the method comprises the steps of:
Obtain multi-modal input;
The intention in the multi-modal input or operation are parsed, to obtain referring to for the conversion intention of condition conversion or conversion It enables;
Convert the current state of the virtual idol to the virtual idol that the conversion is intended to or conversion instruction indicates The new state of picture;
The new state includes:Open the virtual idol required ability or technical ability module under the new state.
According to one embodiment of present invention, the state of the virtual idol is divided into dormant state, active state and waiting State, wherein
Dormant state includes:Halted state and standby mode;
Active state includes:Recording state, audio output state and technical ability open state;
Under a halt condition, the virtual idol out of service;
In the standby state, the virtual idol described in running background;
Under recording state, multi-modal output before stopping starts to detect audio signal;
Under audio output state, the ability or technical ability mould language interactive module in the block is called to engage in the dialogue interaction;
Under technical ability open state, the ability or technical ability mould a song and dance module in the block is called to carry out a song and dance.
According to one embodiment of present invention, the wait state is to wait for recording state.
According to one embodiment of present invention, under the wait state, in conjunction with high in the clouds brain to the multi-modal input Analysis result come determine the state to be entered be audio output state or technical ability open state, and entrance audio output shape After state or technical ability open state, by the feedback in conjunction with the high in the clouds brain come executive capability or the multimode of technical ability module unlatching State exports.
According to one embodiment of present invention, under any type active state, if appointing under detecting current state When business processing terminates and any multi-modal input data is not detected, current state is converted into dormant state and is waited for Machine state or halted state.
According to one embodiment of present invention, the highest priority of the recording state in the active state, virtual even As being waited for waiting under recording state, acquisition user speech is so that virtual idol enters recording state.
According to another aspect of the present invention, a kind of program product is additionally provided, it includes as described above for executing The series of instructions of either method step.
According to another aspect of the present invention, a kind of virtual idol is additionally provided, which is characterized in that the virtual idol tool Standby specific virtual image and preset attribute, the condition conversion process of the virtual idol is executed using method as described above.
According to another aspect of the present invention, a kind of system for managing state of virtual idol, the system packet are additionally provided Contain:
Smart machine is mounted with the virtual idol thereon, for obtaining multi-modal input, and has natural language reason Solution, visual perception, the ability for touching perception, language voice output, emotional facial expressions action output;
Hologram device is used to obtain multi-modal input and converts the image of the virtual idol to hologram simultaneously Show the hologram;
High in the clouds brain is used in the wait state, will be into determine according to the analysis result to the multi-modal input The state entered is audio output state or technical ability open state, and is entering audio output state or technical ability open state Afterwards, the multi-modal output of virtual idol described in decision.
The method of state management and system of a kind of virtual idol provided by the invention provide a kind of virtual idol, Neng Goutong The mode for crossing holographic imaging completes multi-modal interaction with user.In addition, the condition managing system of virtual idol provided by the invention Virtual idol in system also includes various states, for example, halted state, audio output state, waiting recording state, recording shape State, standby mode and technical ability open state, and the present invention can also be managed the state of virtual idol, improve use The interactive experience at family.
Other features and advantages of the present invention will be illustrated in the following description, also, partly becomes from specification It obtains it is clear that understand through the implementation of the invention.The purpose of the present invention and other advantages can be by specification, rights Specifically noted structure is realized and is obtained in claim and attached drawing.
Description of the drawings
Attached drawing is used to provide further understanding of the present invention, and a part for constitution instruction, the reality with the present invention It applies example and is used together to explain the present invention, be not construed as limiting the invention.In the accompanying drawings:
Fig. 1 shows that the multi-modal interaction of the system for managing state of virtual idol according to an embodiment of the invention is shown It is intended to;
Fig. 2 shows the structure diagram of the system for managing state of virtual idol according to an embodiment of the invention;
Fig. 3 shows the state classification figure of the system for managing state of virtual idol according to an embodiment of the invention;
Fig. 4 shows the condition conversion signal of the system for managing state of virtual idol according to an embodiment of the invention Figure;
Fig. 5 shows the module frame chart of the system for managing state of virtual idol according to an embodiment of the invention;
Fig. 6 shows the flow chart of the method for state management of virtual idol according to an embodiment of the invention;
Fig. 7 shows another flow chart of the method for state management of virtual idol according to an embodiment of the invention; And
Fig. 8 shows according to an embodiment of the invention in user, smart machine, hologram device and high in the clouds brain The flow chart communicated between four directions.
Specific implementation mode
To make the object, technical solutions and advantages of the present invention clearer, the embodiment of the present invention is made below in conjunction with attached drawing Further it is described in detail.
It is clear to state, it needs to carry out before embodiment as described below:
The virtual idol that the present invention mentions has specific image characteristics using hologram device as main presentation interface;
By supporting the smart machine of input and output and control module to realize multi-modal human-computer interaction, has natural language reason Solution, visual perception touch the AI abilities such as perception, language voice output, emotional facial expressions action output;
Configurable social property, personality attribute, personage's technical ability etc., make user (Quadratic Finite Element enthusiast) enjoy amusement and individual character Change the virtual portrait of Flow Experience.
The high in the clouds brain being previously mentioned is to provide the virtual idol to carry out semantic understanding (language language to the interaction demand of user Reason and good sense solution, Action Semantic understanding, visual identity, affection computation, cognition calculate) processing capacity terminal, realize with user Interaction, to help user to carry out decision.
Each embodiment of the present invention is described in detail below in conjunction with the accompanying drawings.
Fig. 1 shows that the multi-modal interaction of the system for managing state of virtual idol according to an embodiment of the invention is shown It is intended to.As shown in Figure 1, carrying out multi-modal interaction needs user 101, smart machine 102, hologram device 103 and high in the clouds brain 104.Wherein, the user 101 interacted with virtual idol can be true people, the virtual idol of another virtual idol and entity The interaction of picture, another virtual idol and the virtual idol of entity and the interactive process and single people and virtual idol of virtual idol Process is similar.Therefore, only show the multi-modal interactive process of user (people) and virtual idol in Fig. 1.
The virtual process interacted between idol and user 101 is in Fig. 1:
Interaction required early-stage preparations or condition have, and virtual idol is carried and operated on smart machine 102, and empty Quasi- idol has specific image characteristics.Virtual idol have natural language understanding, visual perception, touch perception, language output, The AI abilities such as emotional facial expressions action output.In order to coordinate the touch perceptional function of virtual idol, it is also required to install on smart machine There is the component for having and touching perceptional function.According to one embodiment of present invention, in order to promote interactive experience, virtual idol exists It is indicated in the predeterminable area of hologram device after being activated, the overlong time for avoiding user from waiting for.
It should be noted that the image of virtual idol and dressing up and being not limited to one mode.Virtual idol can have For different images and dress up.The image of virtual idol is generally 3D high mould animating images.Virtual idol can have difference Appearance and decoration.Each virtual idol image can also correspond to it is a variety of different dress up, the classification dressed up can be according to season Section classification, can also classify according to occasion.These images and dresss up and can reside in high in the clouds brain 104, there may also be In smart machine 102, it can be called at any time when needing to call these images and dressing up.
Social property, personality attribute and the personage's technical ability of virtual idol are also not necessarily limited to a kind of or a kind of.Visual human can To have a variety of social properties, multiple personality attribute and a variety of personage's technical ability.These social properties, personality attribute and personage Technical ability can arrange in pairs or groups respectively, and be not secured to a kind of collocation mode, and user can select and arrange in pairs or groups as needed.
According to one embodiment of present invention, for show virtual idol hologram device 103 include communication interface, imaging Device and output device.Wherein, communication interface receives the vivid of the virtual idol that smart machine 102 transmits and interaction number According to.Imaging device is connect with communication interface, for converting the image of virtual idol to hologram, and hologram is shown In predeterminable area.Output device is connect with communication interface and imaging device, and hologram and virtual idol are current for rendering The display data of state.
It is that multi-modal interactive process obtains multi-modal input first below.Multi-modal input can be that user 101 sends out , can also be to be inputted by perceiving environment.Multi-modal input can include text, voice, vision and perception information etc. The information of multiple modalities.The reception device for obtaining multi-modal input is respectively mounted and is configured at smart machine or hologram device On, these reception devices include to receive the received text device of text, receive the pronunciation receiver of voice, receive taking the photograph for vision As head and the infrored equipment etc. of reception perception information.
Then, the intention in multi-modal input or operation are parsed, to obtain for the conversion intention of condition conversion or conversion Instruction.In multi-modal interactive process, virtual idol can interact under various states with user 101, each state is all Have the ability or technical ability module of different virtual idols.
In order to convert the state of virtual idol during virtual idol and user 101 interact, need to solve in real time The intention in multi-modal input or operation are analysed, analyzes in multi-modal input and whether comprising user 101 to convert virtual idol state Wish, with obtain for condition conversion conversion be intended to or conversion instruct.
After obtaining conversion intention or conversion instruction, it is intended to refer to next, converting the current state of virtual idol to conversion The new state of the virtual idol shown.According to one embodiment of present invention, the state of virtual idol includes dormant state, enlivens shape State and wait state.Wherein, dormant state includes:Halted state and standby mode;Active state includes:Recording state, audio Output state and technical ability open state.The operating condition of virtual idol is under each state, under a halt condition, void out of service Quasi- idol;In the standby state, in the virtual idol of running background;Under recording state, audio input signal is detected, starts ability Or technical ability mould recording module in the block records audio input data;Under audio output state, in call capability or technical ability module Language interactive module engage in the dialogue interaction;Under technical ability open state, call capability or technical ability mould a song and dance mould in the block Block carries out a song and dance.
Finally, virtual idol required ability or technical ability module under new state are opened.
In one embodiment of the invention, the screen cover of smart machine 102 is to hologram device 103, and shows on the screen Show the image of virtual idol, the image of virtual idol is the view of four angles, be respectively front view, rearview, left view with And right view.
According to another embodiment of the invention, a kind of virtual idol, has specific virtual image and preset attribute, The condition conversion process of virtual idol is executed using the method for state management of virtual idol provided by the invention.
Fig. 2 shows the structure diagram of the system for managing state of virtual idol according to an embodiment of the invention.Such as Shown in Fig. 2, multi-modal interactive needs are completed by system:User 101, smart machine 102 and high in the clouds brain 104.Wherein, intelligence Energy equipment 102 includes reception device 102A, processing unit 102B, output device 102C and attachment device 102D.High in the clouds brain 104 include communication device 1041.
It is needed in user 101, smart machine 102 and high in the clouds in the system for managing state of virtual idol provided by the invention Unobstructed communication port is established between brain 104, so as to complete the interaction of user 101 and virtual idol.In order to complete to hand over Mutual task, smart machine 102 and high in the clouds brain 104 are configured with the device and component for supporting completion interaction.With virtual idol As the object of interaction can be a side, or multi-party.
Smart machine 102 includes reception device 102A, processing unit 102B, output device 102C and attachment device 102D.Wherein, reception device 102A is for receiving multi-modal input.The example of reception device 102A includes keyboard, cursor control Equipment (mouse), for voice operating microphone, scanner, touch function (such as to detect the capacitive of physical touch Sensor), camera (action touched is not related to using the detection of visible or nonvisible wavelength) etc..Smart machine 102 can be with Multi-modal input is obtained by above-mentioned input equipment.Output device 102C is for exporting virtual idol and user 101 Interactive multi-modal output data, details are not described herein.
Processing unit 102B is for handling the interaction data transmitted by high in the clouds brain 104 in interactive process.Attachment device 102D is used for contacting between high in the clouds brain 104, and processing unit 102B processing reception devices 102A is pretreated multi-modal defeated The data for entering or being transmitted by high in the clouds brain.Attachment device 102D sends call instruction to call the robot on high in the clouds brain 104 Ability.
In the wait state, high in the clouds brain 104 can be entered according to analysis result to multi-modal input to determine State is audio output state or technical ability open state, and after entering audio output state or technical ability open state, certainly The multi-modal output of virtual idol described in plan.
The communication device 1041 that high in the clouds brain 104 includes is for completing writing to each other between smart machine 102.Communication It keeps in communication and contacts between attachment device 102D on device 1041 and smart machine 102, receive sending for smart machine 102 Request, and the handling result that high in the clouds brain 104 is sent out is sent, it is Jie linked up between smart machine 102 and high in the clouds brain 104 Matter.
Fig. 3 shows the state classification figure of the system for managing state of virtual idol according to an embodiment of the invention. As shown in figure 3, virtual idol state 300 includes dormant state 301, active state 302 and wait state 303.Wherein, suspend mode State 301 includes halted state 3011 and standby mode 3012.Active state 302 includes recording state 3021, audio output State 3022 and technical ability open state 3023.
According to one embodiment of present invention, the ability of virtual idol state or technical ability includes, under halted state 3011, Virtual idol out of service;Under standby mode 3012, in the virtual idol of running background;Under recording state 3021, sound is detected Frequency input signal, starts ability or technical ability mould recording module in the block records audio input data;In voice output state 3022 Under, call capability or technical ability mould language interactive module in the block engage in the dialogue interaction;Under technical ability open state 3023, energy is called Power or technical ability mould a song and dance module in the block carry out a song and dance.
In the system for managing state of virtual idol provided by the invention, wait state is important in virtual idol state Component part is the bridge between recording state and audio output state or technical ability open state.One according to the present invention Embodiment, wait state 303 can wait for recording state, as respond the state interrupted, in the case where waiting for recording state, knot High in the clouds brain 104 is closed to the analysis result of multi-modal input to determine that the state to be entered is audio output state 3022 or skill Energy open state 3023, and after entering audio output state 3022 or technical ability open state 3023, by combining high in the clouds big The feedback of brain 104 carrys out the multi-modal output that executive capability or technical ability module are opened.
According to one embodiment of present invention, under any type active state 302, if under detecting current state When task processing terminates and any multi-modal input data is not detected, current state is converted to dormant state 301 In standby mode 3012 or halted state 3011.In addition, the highest priority of the recording state 3021 in active state 302, In the case where virtual idol is waited for 303 i.e. waiting recording state, acquisition user speech is so that virtual idol enters recording State.
Fig. 4 shows the condition conversion signal of the system for managing state of virtual idol according to an embodiment of the invention Figure.
Virtual idol in the system for managing state of virtual idol provided by the invention has a variety of different states, each Virtual idol is but also with different ability or technical ability under state.Virtual idol can be when carrying out multi-modal interact with user The state of virtual idol is converted under the guidance of user.
After interactive program in smart machine 102 is activated, virtual idol enters halted state immediately.In pause shape Under state, virtual idol is out of service.When there is the generation of activation event, virtual idol enters waiting recording state.The present invention's In one embodiment, activation event can be that user 101 presses the button opened and wait for recording state, i.e., smart machine 102 can Record button or virtually wait for record button to be waited for comprising entity, user 101 press entity wait for record button or When virtual waiting record button, the state of virtual idol is converted into waiting recording state by halted state.In addition, it is necessary to explanation It is that activation event can also be that other forms, the present invention do not make limitation to the activation form for activating event.
After virtual idol is converted into waiting recording state by halted state, if virtual idol detects that user speaks, Then the state of virtual idol is converted into recording state by waiting recording state.Under recording state, virtual idol detection audio is defeated Enter signal, starts ability or technical ability mould recording module in the block records audio input data.When virtual idol is in recording state When, and when virtual idol detects words such as " goodbyes ", virtual idol is converted into standby mode by recording state.At this point, user 101 Show the wish for terminating this recording, conversion conditions wait for the next multimode of user 101 to virtual idol to standby mode immediately State inputs.
If virtual idol be in recording state, and detects that user speaks stopping, then virtual idol is by recording state turn Turn to waiting recording state.In addition, being switched to audio output state by recording state if necessary to virtual idol, virtual idol is first Waiting recording state is first converted by recording state, then audio output state is converted by waiting recording state.When virtual idol When in audio output state, user 101 can open a dialogue interaction with virtual idol, and virtual idol can play out and user The interactive audio of 101 interactions, at the end of interactive audio plays, the state of virtual idol is converted into pause by audio output state State.
In addition, when virtual idol is in standby, if user 101 sends out wake-up and is intended to or instructs, virtually Idol is converted into waiting recording state by standby mode.Wake-up herein is intended to can be the specific audio that sends out of virtual idol with And the particular organisms feature of specific limb action or user 101.
Technical ability open state is switched to by recording state if necessary to virtual idol, virtual idol is turned by recording state first Waiting recording state is turned to, then technical ability open state is converted by waiting recording state.When virtual idol is in technical ability opening state When state, call capability or technical ability mould a song and dance module in the block carry out a song and dance, and a song and dance is showed user 101。
When virtual idol is in technical ability open state, and virtual idol is sung and finishes or be interrupted, then virtual idol by Technical ability open state is converted into waiting recording state.When virtual idol is in technical ability open state, and virtual idol sings beginning, Then virtual idol is converted into standby mode by technical ability open state.
Fig. 5 shows the module frame chart of the system for managing state of virtual idol according to an embodiment of the invention.Such as Shown in Fig. 5, system includes acquisition module 501, is intended to module 502, block of state 503 and technical ability module 504.Wherein, it obtains Module 501 includes text collection unit 5011, audio collection unit 5012, vision collecting unit 5013 and perception collecting unit 5014。
Acquisition module 501 is for obtaining multi-modal input.Wherein, text collection unit 5011 is used for acquiring text message. Audio collection unit 5012 is used for acquiring audio-frequency information.Vision collecting unit 5013 is used for acquiring visual information.Perception acquisition is single Member 5014 is used for acquiring perception information.The example of acquisition module 501 includes keyboard, cursor control device (mouse), is used for voice The microphone of operation, scanner, touch function (such as to detect the capacitance type transducers of physical touch), camera, sensing control Equipment, such as use visible or nonvisible wavelength ray, signal, environmental data.Above-mentioned input equipment can be passed through To obtain multi-modal input data.Multi-modal input can include one kind in text, audio, vision and perception data, Can include a variety of, the present invention restricts not to this.
It is intended to module 502 to be used to parse intention or the operation in multi-modal input, to obtain the conversion for condition conversion It is intended to or conversion instructs.It includes resolution unit 5021 to be intended to module 502, and resolution unit 5021 is used to parse multi-modal input, with It obtains the conversion for including in multi-modal input and is intended to or converts instruction.Conversion is intended to or conversion instruction can be used in instructing virtual idol As the conversion between various states.
Block of state 503 is used to convert the current state of virtual idol to the new shape for the virtual idol that conversion is intended to refer to State.According to one embodiment of present invention, virtual idol includes various states, for example, dormant state, active state and waiting shape State.Wherein, dormant state includes halted state and standby mode.Active state include recording state, audio output state and Technical ability open state.Intermediate active state includes to wait for recording state.Block of state 503 includes conversion unit 5031, at one In embodiment, conversion unit 5031 can convert the state of virtual idol to active state by dormant state, also can will be empty The state of quasi- idol is converted into dormant state by active state.
Technical ability module 504 is for opening virtual idol required ability or technical ability module under new state.Technical ability module 504 include opening unit 5041, and after virtual idol is converted into new state, it is corresponding that opening unit 5041 opens new state immediately The ability or technical ability of virtual idol.
Fig. 6 shows the flow chart of the method for state management of virtual idol according to an embodiment of the invention.
As shown in fig. 6, in step s 601, obtaining multi-modal input.In this step, smart machine 102 or holography are set Standby 103 can obtain multi-modal input, and multi-modal input can be that user 101 inputs, and can also be its for having input function What his equipment inputted.Smart machine 102 and hologram device 103 can be configured with the related device for obtaining multi-modal input.Multimode State input can be the input of the forms such as text input, audio input and perception input.
Then, in step S602, the intention in multi-modal input or operation are parsed, to obtain turning for condition conversion Change and is intended to or converts instruction.It is needed comprising much information in order to know the interaction intent information of user 101 in multi-modal input The intention in multi-modal input or operation are parsed, obtains being intended to for the conversion of condition conversion according to intention or operation or conversion refers to It enables.
Then, in step S603, the current state of virtual idol is converted to the new shape for the virtual idol being intended to refer to State.According to one embodiment of present invention, virtual idol includes various states, for example, dormant state, active state and waiting shape State.Wherein, dormant state includes halted state and standby mode.Active state include recording state, audio output state and Technical ability open state.Intermediate active state includes to wait for recording state.Knowing for the conversion intention of condition conversion or conversion After instruction, in this step it converts the current state of virtual idol to the new state for the virtual idol being intended to refer to.
Finally, virtual idol enters new state, in step s 604, opens virtual idol required energy under new state Power or technical ability module.Each state of virtual idol includes ability or technical ability module under state.According to the present invention one A embodiment, under a halt condition, virtual idol out of service;In the standby state, in the virtual idol of running background;It is recording Under state, audio input signal is detected, starts ability or technical ability mould recording module in the block records audio input data;In audio Under output state, call capability or technical ability mould language interactive module in the block engage in the dialogue interaction;Under technical ability open state, adjust A song and dance is carried out with ability or technical ability mould a song and dance module in the block.
In addition, the system for managing state of virtual idol provided by the invention can also coordinate a kind of program product, it includes Series of instructions for executing the method for state management step for completing virtual idol.
Fig. 7 shows another flow chart of the method for state management of virtual idol according to an embodiment of the invention.
As shown in fig. 7, in step s 701, smart machine 102 sends out request to high in the clouds brain 104.Later, in step In S702, smart machine 102 is constantly in the state interacted with high in the clouds brain 104.In interactive process, 102 meeting of smart machine Clocked operation is carried out to returned data the time it takes.
In step S703, if the reply data not returned for a long time, for example, being more than scheduled time span 5S, then smart machine 102 can select to carry out local reply, generate local common reply data.Then, defeated in step S704 Go out the animation with local common response cooperation, and voice playing equipment is called to carry out speech play.
Fig. 8 shows according to an embodiment of the invention in user, smart machine, hologram device and high in the clouds brain The flow chart communicated between four directions.
In order to realize the multi-modal interaction between smart machine 102 and user 101, need user 101, smart machine 102, Communication connection is set up between hologram device 103 and high in the clouds brain 104.This communication connection should be it is real-time, unobstructed, It can ensure interactive impregnable.
In order to complete to interact, some conditions or premise are needed to have.These conditions or premise include smart machine Virtual idol is loaded and run in 102, and smart machine 102 has the hardware facility of perception and control function.In addition, complete Breath equipment 103 can receive the image of the virtual idol of the transmission of smart machine 102, and convert the image of virtual idol to holography Hologram is included in predeterminable area by image.
Complete early-stage preparations after, smart machine 102 start with user 101 be unfolded interact, first, smart machine 102 and/or Hologram device 103 obtains multi-modal input, and multi-modal input can be that user 101 sends out, and can also be that miscellaneous equipment is sent out 's.At this point, two sides that expanding data transmits are user 101 and smart machine 102 and/or hologram device 103.Then, it parses more Intention in mode input or operation, to obtain instructing for the conversion intention of condition conversion or conversion.
Then, when virtual idol is in wait state, smart machine 102 sends to high in the clouds brain 104 and asks, high in the clouds brain 104 according to the analysis result of multi-modal input come determine the state to be entered be audio output state or technical ability open state, And after entering audio output state or technical ability open state, high in the clouds brain 104 replys smart machine 102, empty described in decision The multi-modal output of quasi- idol.At this point, two sides of expansion communication are smart machine 102 and high in the clouds brain 104.
It is transported after smart machine 102 receives data and the instruction of the transmission of high in the clouds brain 104 or in smart machine 102 After the current state of capable virtual idol is converted into the new state for the virtual idol that conversion is intended to refer to, smart machine 102 can incite somebody to action The display data of the vivid and virtual idol current state of virtual idol is transmitted to hologram device 103.Hologram device 103 can incite somebody to action The image of virtual idol is converted to hologram, and the hologram by virtual idol includes in the preset areas of hologram device 103 Domain.At this point, two sides of expansion communication are smart machine 102 and hologram device 103.
Finally, hologram device 103 can be by the display data of the hologram of virtual idol and virtual idol current state Output, shows user 101.Two sides that communication is unfolded at this time are hologram device 103 and user 101.
The method of state management and system of a kind of virtual idol provided by the invention provide a kind of virtual idol, Neng Goutong The mode for crossing holographic imaging completes multi-modal interaction with user.In addition, the condition managing system of virtual idol provided by the invention Virtual idol in system also includes various states, for example, halted state, audio output state, waiting recording state, recording shape State, standby mode and technical ability open state, and the present invention can also be managed the state of virtual idol, improve use The interactive experience at family.
It should be understood that disclosed embodiment of this invention is not limited to specific structure disclosed herein, processing step Or material, and the equivalent substitute for these features that those of ordinary skill in the related art are understood should be extended to.It should also manage Solution, term as used herein is used only for the purpose of describing specific embodiments, and is not intended to limit.
" one embodiment " or " embodiment " mentioned in specification means the special characteristic described in conjunction with the embodiments, structure Or characteristic includes at least one embodiment of the present invention.Therefore, the phrase " reality that specification various places throughout occurs Apply example " or " embodiment " the same embodiment might not be referred both to.
While it is disclosed that embodiment content as above but described only to facilitate understanding the present invention and adopting Embodiment is not limited to the present invention.Any those skilled in the art to which this invention pertains are not departing from this Under the premise of the disclosed spirit and scope of invention, any modification and change can be made in the implementing form and in details, But the scope of patent protection of the present invention, still should be subject to the scope of the claims as defined in the appended claims.

Claims (9)

1. a kind of method of state management of virtual idol, which is characterized in that the virtual idol has specific image characteristics, and It is demonstrated out by hologram device, the method comprises the steps of:
Obtain multi-modal input;
The intention in the multi-modal input or operation are parsed, to obtain instructing for the conversion intention of condition conversion or conversion;
Convert the current state of the virtual idol to the virtual idol that the conversion is intended to or conversion instruction indicates New state;
The new state includes:Open the virtual idol required ability or technical ability module under the new state.
2. the method for state management of virtual idol as described in claim 1, which is characterized in that the state of the virtual idol point For dormant state, active state and wait state, wherein
Dormant state includes:Halted state and standby mode;
Active state includes:Recording state, audio output state and technical ability open state;
Under a halt condition, the virtual idol out of service;
In the standby state, the virtual idol described in running background;
Under recording state, multi-modal output before stopping starts to detect audio signal;
Under audio output state, the ability or technical ability mould language interactive module in the block is called to engage in the dialogue interaction;
Under technical ability open state, the ability or technical ability mould a song and dance module in the block is called to carry out a song and dance.
3. the method for state management of virtual idol as claimed in claim 2, which is characterized in that wait state shape for before Waiting recording state at the end of state.
4. the method for state management of the virtual idol as described in claim 1-3, which is characterized in that
Under the waiting recording state, the analysis result of the multi-modal input is entered to determine in conjunction with high in the clouds brain State is audio output state or technical ability open state, and after entering audio output state or technical ability open state, is led to The feedback crossed in conjunction with the high in the clouds brain carrys out the multi-modal output that executive capability or technical ability module are opened.
5. the method for state management of the virtual idol such as claim 2-3, which is characterized in that
Under any type active state, if the task processing under detecting current state terminates and is not detected to appoint When what multi-modal input data, current state is converted to standby mode or halted state into dormant state.
6. the method for state management of virtual idol as described in any of claims 5, which is characterized in that the active state In recording state highest priority, virtual idol be waited for i.e. wait for recording state under, acquire user speech So that virtual idol enters recording state.
7. a kind of program product, it includes for executing a series of of the method and step as described in any one of claim 1-6 Instruction.
8. a kind of virtual idol, which is characterized in that the virtual idol has specific virtual image and preset attribute, using such as Method described in claim 1-6 executes the condition conversion process of the virtual idol.
9. a kind of system for managing state of virtual idol, which is characterized in that the system includes:
Smart machine is mounted with virtual idol as claimed in claim 8 thereon, for obtaining multi-modal input, and has certainly Right language understanding, visual perception, the ability for touching perception, language voice output, emotional facial expressions action output;
Hologram device is used to obtain multi-modal input and converts the image of virtual idol as claimed in claim 8 to Hologram simultaneously shows the hologram;
High in the clouds brain, is used in the wait state, to be entered to determine according to analysis result to the multi-modal input State is audio output state or technical ability open state, and after entering audio output state or technical ability open state, certainly The multi-modal output of plan virtual idol as claimed in claim 8.
CN201810032045.5A 2018-01-12 2018-01-12 Virtual idol state management method and system Active CN108388399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810032045.5A CN108388399B (en) 2018-01-12 2018-01-12 Virtual idol state management method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810032045.5A CN108388399B (en) 2018-01-12 2018-01-12 Virtual idol state management method and system

Publications (2)

Publication Number Publication Date
CN108388399A true CN108388399A (en) 2018-08-10
CN108388399B CN108388399B (en) 2021-04-06

Family

ID=63076699

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810032045.5A Active CN108388399B (en) 2018-01-12 2018-01-12 Virtual idol state management method and system

Country Status (1)

Country Link
CN (1) CN108388399B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110850971A (en) * 2019-10-25 2020-02-28 智亮君 Handshake interaction method and system between hand model and intelligent mirror and storage medium
CN111290682A (en) * 2018-12-06 2020-06-16 阿里巴巴集团控股有限公司 Interaction method and device and computer equipment
CN113362263A (en) * 2021-05-27 2021-09-07 百度在线网络技术(北京)有限公司 Method, apparatus, medium, and program product for changing the image of a virtual idol

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016206643A1 (en) * 2015-06-26 2016-12-29 北京贝虎机器人技术有限公司 Method and device for controlling interactive behavior of robot and robot thereof
CN106863319A (en) * 2017-01-17 2017-06-20 北京光年无限科技有限公司 A kind of robot awakening method and device
CN107197384A (en) * 2017-05-27 2017-09-22 北京光年无限科技有限公司 The multi-modal exchange method of virtual robot and system applied to net cast platform
CN107294837A (en) * 2017-05-22 2017-10-24 北京光年无限科技有限公司 Engaged in the dialogue interactive method and system using virtual robot
CN107340859A (en) * 2017-06-14 2017-11-10 北京光年无限科技有限公司 The multi-modal exchange method and system of multi-modal virtual robot
CN107340865A (en) * 2017-06-29 2017-11-10 北京光年无限科技有限公司 Multi-modal virtual robot exchange method and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016206643A1 (en) * 2015-06-26 2016-12-29 北京贝虎机器人技术有限公司 Method and device for controlling interactive behavior of robot and robot thereof
CN106863319A (en) * 2017-01-17 2017-06-20 北京光年无限科技有限公司 A kind of robot awakening method and device
CN107294837A (en) * 2017-05-22 2017-10-24 北京光年无限科技有限公司 Engaged in the dialogue interactive method and system using virtual robot
CN107197384A (en) * 2017-05-27 2017-09-22 北京光年无限科技有限公司 The multi-modal exchange method of virtual robot and system applied to net cast platform
CN107340859A (en) * 2017-06-14 2017-11-10 北京光年无限科技有限公司 The multi-modal exchange method and system of multi-modal virtual robot
CN107340865A (en) * 2017-06-29 2017-11-10 北京光年无限科技有限公司 Multi-modal virtual robot exchange method and system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111290682A (en) * 2018-12-06 2020-06-16 阿里巴巴集团控股有限公司 Interaction method and device and computer equipment
CN110850971A (en) * 2019-10-25 2020-02-28 智亮君 Handshake interaction method and system between hand model and intelligent mirror and storage medium
CN113362263A (en) * 2021-05-27 2021-09-07 百度在线网络技术(北京)有限公司 Method, apparatus, medium, and program product for changing the image of a virtual idol
CN113362263B (en) * 2021-05-27 2023-09-15 百度在线网络技术(北京)有限公司 Method, apparatus, medium and program product for transforming an image of a virtual idol

Also Published As

Publication number Publication date
CN108388399B (en) 2021-04-06

Similar Documents

Publication Publication Date Title
CN107704169B (en) Virtual human state management method and system
CN109271018A (en) Exchange method and system based on visual human's behavioral standard
CN107632706B (en) Application data processing method and system of multi-modal virtual human
CN110400251A (en) Method for processing video frequency, device, terminal device and storage medium
CN107294837A (en) Engaged in the dialogue interactive method and system using virtual robot
CN107340865A (en) Multi-modal virtual robot exchange method and system
CN105141587B (en) Virtual doll interaction method and device
CN107329990A (en) A kind of mood output intent and dialogue interactive system for virtual robot
CN109324688A (en) Exchange method and system based on visual human's behavioral standard
CN107808191A (en) The output intent and system of the multi-modal interaction of visual human
CN106796789A (en) Interacted with the speech that cooperates with of speech reference point
WO2018230160A1 (en) Information processing system, information processing method, and program
CN109343695A (en) Exchange method and system based on visual human's behavioral standard
WO2018006374A1 (en) Function recommending method, system, and robot based on automatic wake-up
CN108416420A (en) Limbs exchange method based on visual human and system
CN108388399A (en) The method of state management and system of virtual idol
CN107888965A (en) Image present methods of exhibiting and device, terminal, system, storage medium
CN107784355A (en) The multi-modal interaction data processing method of visual human and system
CN108595012A (en) Visual interactive method and system based on visual human
CN113703585A (en) Interaction method, interaction device, electronic equipment and storage medium
CN108681398A (en) Visual interactive method and system based on visual human
CN108415561A (en) Gesture interaction method based on visual human and system
WO2022121592A1 (en) Livestreaming interaction method and apparatus
CN113409805B (en) Man-machine interaction method and device, storage medium and terminal equipment
CN108646918A (en) Visual interactive method and system based on visual human

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230927

Address after: 100000 6198, Floor 6, Building 4, Yard 49, Badachu Road, Shijingshan District, Beijing

Patentee after: Beijing Virtual Dynamic Technology Co.,Ltd.

Address before: 100000 Fourth Floor Ivy League Youth Venture Studio No. 193, Yuquan Building, No. 3 Shijingshan Road, Shijingshan District, Beijing

Patentee before: Beijing Guangnian Infinite Technology Co.,Ltd.

TR01 Transfer of patent right