CN111178348B

CN111178348B - Method for tracking target object and sound box equipment

Info

Publication number: CN111178348B
Application number: CN201911253455.3A
Authority: CN
Inventors: 张卓
Original assignee: Guangdong Genius Technology Co Ltd
Current assignee: Guangdong Genius Technology Co Ltd
Priority date: 2019-12-09
Filing date: 2019-12-09
Publication date: 2024-03-22
Anticipated expiration: 2039-12-09
Also published as: CN111178348A

Abstract

The embodiment of the invention discloses a method for tracking a target object and sound box equipment, which are used for automatically moving to a proper detection area for identifying a question if the written position changes when the question is identified, so that a simpler and more convenient searching interaction scheme is realized. The method of the embodiment of the invention comprises the following steps: acquiring a first opening instruction; opening the first camera according to the first opening instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Description

Method for tracking target object and sound box equipment

Technical Field

The present invention relates to the field of education technologies, and in particular, to a method for tracking a target object, a speaker device, and a storage medium.

Background

In the prior art, when the topic is identified, if the written position changes, the position of the electronic equipment can only be manually moved to identify the topic, so that the method is not convenient and intelligent.

Disclosure of Invention

The embodiment of the invention provides a method for tracking a target object, sound box equipment and a storage medium, which are used for automatically moving to a proper detection area for topic identification if the written position changes when topic identification is carried out, so that a simpler and more convenient search interaction scheme is realized.

In view of this, a first aspect of the present invention provides a method for tracking a target object, where the method is applied to a speaker apparatus, and the speaker apparatus includes a first camera with a pulley, and the method may include:

acquiring a first opening instruction;

opening the first camera according to the first opening instruction;

recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold;

and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Optionally, in some embodiments of the present invention, the method may further include:

matching the second writing information with preset reference writing information;

if the second writing information is matched with the reference writing information, determining that the second writing information is accurate, and prompting indication information of successful writing;

if the second writing information is not matched with the reference writing information, determining that the second writing information is inaccurate, and prompting indication information of writing errors.

transmitting the second writing information to terminal equipment, wherein the second writing information is used for viewing by a user of the terminal equipment;

receiving evaluation indication information of the second writing information sent by the terminal equipment;

and planning a learning plan of the target object according to the evaluation indication information.

Optionally, in some embodiments of the present invention, the speaker apparatus includes a second camera; the method may further comprise:

acquiring a second opening instruction;

opening the second camera according to the second opening instruction;

Identifying the current sitting posture of the target object through the second camera;

when the current sitting posture is not matched with a preset reference sitting posture, generating prompt information, wherein the prompt information is used for prompting the target object to adjust the information of the current sitting posture;

and reminding the prompt information.

Optionally, in some embodiments of the present invention, the reminding the prompt information may include:

reminding the prompt information in a text mode;

or,

reminding the prompt information in a text and voice mode;

or,

and reminding the prompt information in a voice mode.

A second aspect of the embodiment of the present invention provides a speaker apparatus, where the speaker apparatus includes a first camera with a pulley, and the speaker apparatus further includes:

the acquisition module is used for acquiring a first opening instruction;

the processing module is used for starting the first camera according to the first starting instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Alternatively, in some embodiments of the invention,

the processing module is further used for matching the second writing information with preset reference writing information; if the second writing information is matched with the reference writing information, determining that the second writing information is accurate, and prompting indication information of successful writing; if the second writing information is not matched with the reference writing information, determining that the second writing information is inaccurate, and prompting indication information of writing errors.

Optionally, in some embodiments of the present invention, the speaker apparatus further includes: a transmitting module;

the sending module is used for sending the second writing information to the terminal equipment, and the second writing information is used for being checked by a user of the terminal equipment;

the acquisition module is further used for receiving evaluation indication information of the second writing information sent by the terminal equipment;

and the processing module is also used for planning a learning plan of the target object according to the evaluation indication information.

Optionally, in some embodiments of the present invention, the speaker apparatus includes a second camera;

the acquisition module is also used for acquiring a second opening instruction;

The processing module is further used for starting the second camera according to the second starting instruction; identifying the current sitting posture of the target object through the second camera; when the current sitting posture is not matched with a preset reference sitting posture, generating prompt information, wherein the prompt information is used for prompting the target object to adjust the information of the current sitting posture; and reminding the prompt information.

Alternatively, in some embodiments of the invention,

the processing module is specifically used for reminding the prompt information in a text mode;

or,

the processing module is specifically used for reminding the prompt information in a text and voice mode;

or,

the processing module is specifically used for reminding the prompt information in a voice mode.

A third aspect of the present invention provides a sound box apparatus, comprising:

a memory storing executable program code;

a processor coupled to the memory;

the processor invokes the executable program code stored in the memory for performing the steps of the method of tracking a target object as described in the first aspect of the invention and any optional implementation of the first aspect.

A fourth aspect of the embodiments of the present invention provides a readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of a method of tracking a target object as described in the first aspect and any alternative implementation of the first aspect of the present invention.

A fifth aspect of the embodiments of the present invention discloses a computer program product which, when run on a computer, causes the computer to perform part or all of the steps of any one of the methods of tracking a target object disclosed in the first aspect of the embodiments of the present invention.

A sixth aspect of the embodiment of the present invention discloses an application publishing platform, which is configured to publish a computer program product, where the computer program product when run on a computer causes the computer to execute part or all of the steps of any one of the methods for tracking a target object disclosed in the first aspect of the embodiment of the present invention.

A sixth aspect of the embodiment of the present invention provides a wearable device, where the electronic device includes a speaker device as described in the second aspect or the third aspect.

From the above technical solutions, the embodiment of the present invention has the following advantages:

in an embodiment of the present invention, the method is applied to a speaker apparatus, where the speaker apparatus includes a first camera with a pulley, and the method may include: acquiring a first opening instruction; opening the first camera according to the first opening instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold. When the method is used for identifying the questions, if the written position changes, the method automatically moves to a proper detection area to identify the questions, so that a simpler and more convenient searching interaction scheme is realized.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments and the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings.

FIG. 1 is a schematic diagram of one embodiment of a method for tracking a target object according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of another embodiment of a method for tracking a target object according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of another embodiment of a method for tracking a target object according to an embodiment of the present invention;

FIG. 4A is a schematic diagram of an embodiment of a speaker unit according to an embodiment of the present invention;

FIG. 4B is a schematic diagram of another embodiment of a sound box apparatus according to an embodiment of the present invention;

FIG. 5A is a schematic view of another embodiment of a sound box apparatus according to an embodiment of the present invention;

fig. 5B is a schematic diagram of another embodiment of the sound box apparatus according to the embodiment of the present invention.

Detailed Description

In order that those skilled in the art will better understand the present invention, reference will now be made to the accompanying drawings in which embodiments of the invention are illustrated, it being apparent that the embodiments described are only some, but not all, of the embodiments of the invention. Based on the embodiments of the present invention, it should be understood that the present invention is within the scope of protection.

In the embodiment of the invention, the wearable device can be directly worn on the user or be integrated into the clothes or accessories of the user. The wearable device is not only a hardware device, but also can realize powerful intelligent functions through software support and data interaction and cloud interaction, such as: the mobile phone terminal has the advantages of calculating function, positioning function and alarming function, and can be connected with mobile phones and various terminals. Wearable devices may include, but are not limited to, wrist-supported watch types (e.g., watches, wrist products, etc.), foot-supported shoes (e.g., shoes, socks, or other leg wear products), head-supported Glass types (e.g., glasses, helmets, headbands, etc.), and smart apparel, school bags, crutches, accessories, etc. in various non-mainstream product forms. In the following embodiments, a wearable device is described as an example of a smart watch.

As shown in fig. 1, an embodiment of a method for tracking a target object in an embodiment of the present invention is shown, where the method is applied to a sound box device, where the sound box device includes a first camera with a pulley, and the method may include:

101. and acquiring a first opening instruction.

The sound box device obtains a first opening instruction, which may include: the sound box equipment responds to input operation of a user and generates a first opening instruction; or the sound box equipment receives a first opening instruction sent by other electronic equipment.

The user can perform input operations such as clicking, double clicking, sliding, pressing and the like on a preset control in the sound box device; or, the user performs a preset input operation on the sound box device, for example, rotating the sound box device up and down or left and right, shaking the sound box device, and the like; the speaker device generates a first opening instruction in response to these input operations by the user.

Or the user sends out the sound of the preset information, and the sound box equipment detects the sound of the preset information to generate a first opening instruction.

Or, other electronic devices, such as a remote controller, a terminal device, a wearable device, and the like, respond to the operation of the user, generate a first opening instruction, send the first opening instruction to the sound box device, and the sound box device receives the first opening instruction.

It can be understood that, the manner in which the other electronic devices send the first opening instruction to the speaker device may be a wired or wireless manner, which is not limited in particular. Wireless means may include wireless fidelity (Wireless Fidelity, WIFI) or bluetooth, infrared, etc.

102. And starting the first camera according to the first starting instruction.

The sound box equipment can start the first camera according to the first starting instruction. I.e. after the first camera is turned on, the first camera may start to work.

Under the working state, the loudspeaker box device can scan the working (working, reading and the like) area of the child and identify the operation area and the operation edge. For example 1m ² Is provided.

103. And recognizing a first gesture of the target object at a first moment and first writing information on the desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold value.

Wherein, it is understood that the first written information on the desktop refers to the first written information in the book placed on the desktop by the user. The first writing information may be information printed in a book, or may be information manually written by a user in a dictation process.

It should be noted that, the first gesture of the target object at the first moment may refer to a gesture of the user pointing to the first written information on the desktop.

The first camera of the sound box device is in a normally open state in the whole accompanying child operation and reading process, and if the gesture of the child is in a preset recognition area, the definition of the first writing information on the recognized desktop is within a preset threshold value, so that the first writing information can be clearly recognized.

104. And moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Wherein, it is understood that the second written information on the desktop refers to the second written information in the book placed on the desktop by the user. The second writing information may be information printed in a book, or may be information manually written by a user in the process of dictation.

It should be noted that, the second gesture of the target object at the second moment may refer to a gesture of the user pointing to the second written information on the desktop.

For example, when a child's gesture is found to move to a non-preset recognition area, the machine may automatically track movement to the preset recognition area without requiring the child to manually move the speaker device.

For example, the nodding interaction area of a general non-wide-angle camera is A4, and when a user opens a book to align the left side of the book with the sound box device, the nodding interaction area is just the optimal A4 interaction area; however, when a child asks about information on the right side of the book, the speaker device may not be able to recognize the information of the book well, and at this time, the speaker device may be moved to the right side of the book to implement interaction within an optimal range.

Alternatively, in some embodiments of the present invention, the speaker device may play the second written information and the user may listen and write in a book placed on the desktop. The second writing information may be writing information of chinese, english, or other writing information, which is not specifically limited in the embodiment of the present invention.

105. And matching the second writing information with preset reference writing information.

Optionally, the speaker device may match the second writing information with preset reference writing information to determine whether the second writing information is accurate, and further, may determine a learning effect of the user.

106. If the second writing information is matched with the reference writing information, determining that the second writing information is accurate, and prompting indication information of successful writing.

If the sound box equipment judges that the second written information result is accurate, the indication information of successful writing can be popped up. For example, the text indicating information such as "writing correctly, very excellent", or "continue to refuel"; optionally, the text indication information can be popped up again and simultaneously with the sound reminding, and the sound can be the playing sound of the text indication information or other prompting sounds; for example: is a winning alert tone or an alert tone of "ou, hello, ja", "write pair", "complete correct", etc.

Further, the manner of ejecting the indication information of successful writing by the sound box device may be a rotary ejecting manner, or ejecting a bubble, where the indication information of successful writing is displayed, or ejecting the indication information of successful writing by a shaking manner, or other ejecting manners, which is not limited in the embodiment of the present invention.

Furthermore, when the sound box device ejects a bubble and the indication information of successful writing is displayed in the bubble, the sound box device can detect whether the bubble is clicked by a user within a specified duration, if so, the sound box device can add corresponding virtual resources (such as learning points, game coins, virtual props and the like) to the corresponding personal account number of the user, so that the user can be better stimulated to practice writing, and man-machine interaction is improved.

107. If the second writing information is not matched with the reference writing information, determining that the second writing information is inaccurate, and prompting indication information of writing errors.

If the sound box equipment judges that the current dictation result is wrong, the indication information of the writing error can be popped up. Optionally, the text indication information can be popped up again and simultaneously with the sound reminding, and the sound can be the playing sound of the text indication information or other prompting sounds; for example: the prompt tone is wrong, or the prompt tone such as ' bad meaning, writing error, continuous effort, good effort required by classmates ', no relation, continuous oiling next time ' and the like.

It should be noted that, in the embodiment of the present invention, steps 105 to 108 are optional steps.

In an embodiment of the present invention, the method is applied to a speaker apparatus, where the speaker apparatus includes a first camera with a pulley, and the method may include: acquiring a first opening instruction; opening the first camera according to the first opening instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold. When the method is used for identifying the questions, if the written position changes, the method automatically moves to a proper detection area to identify the questions, so that a simpler and more convenient searching interaction scheme is realized. Furthermore, the sound box equipment can also judge whether the second writing information is accurate, if so, the correct writing indicating information can be prompted, and if not, the wrong writing indicating information can be prompted.

As shown in fig. 2, another embodiment of a method for tracking a target object in an embodiment of the present invention is shown, where the method is applied to a speaker apparatus, where the speaker apparatus includes a first camera with a pulley, and the method may include:

201. and acquiring a first opening instruction.

202. And starting the first camera according to the first starting instruction.

203. And recognizing a first gesture of the target object at a first moment and first writing information on the desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold value.

204. And moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

It should be noted that, steps 201 to 204 may refer to steps 101 to 104 in the embodiment shown in fig. 1, and will not be described herein.

Optionally, in some embodiments of the present invention, the speaker device may further obtain an illumination intensity of the current environment; if the illumination intensity of the current environment is smaller than a preset threshold value, adjusting the distance between the first camera and the desktop to be smaller; and then moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on the desktop in a preset range.

Optionally, in some embodiments of the present invention, the speaker device may further obtain an illumination intensity of the current environment; if the illumination intensity of the current environment is larger than a preset threshold value, adjusting the distance between the first camera and the desktop to be larger; and then moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on the desktop in a preset range.

205. And sending the second writing information to terminal equipment, wherein the second writing information is used for viewing by a user of the terminal equipment.

The sound box device sends the second writing information to the terminal device in a wired or wireless mode. It is understood that the terminal devices mentioned in the embodiments of the present invention may include general handheld electronic terminals, such as smart phones, portable terminals, personal digital assistants (Personal Digital Assistant, PDA), portable multimedia player (Personal Media Player, PMP) devices, notebook computers, notebooks (Note Pad), wireless broadband (Wireless Broadband, wibro) terminals, tablet computers (personal computer, PC) and smart PCs.

The terminal device can be a terminal device used by parents, teachers or other guardians bound with the sound box device, and can send learning information of children to the terminal device in real time or periodically. Parents or teachers or other guardians can better master the learning profile of the child.

206. And receiving evaluation indication information which is sent by the terminal equipment and is used for the second writing information.

After the sound box equipment sends the second writing information to the terminal equipment, a user corresponding to the terminal equipment can evaluate the second writing information, or the terminal equipment automatically evaluates the second writing information to generate evaluation indication information. And the terminal equipment sends the evaluation indication information to the sound box equipment, and the sound box equipment receives the evaluation indication information of the second writing information sent by the terminal equipment. Wherein the evaluation indication information may include: the indication information of whether the writing is correct or not may also include how the correct writing information is, may also include some small skills related to learning the knowledge point (for example, the same words may be learned together, and learning is distinguished by means of word-forming, sentence-making, etc.), and the like, and is not limited in this particular.

Optionally, the terminal device may plan a learning plan of the target object according to the evaluation indication information, and then send the learning plan to the speaker device, where the speaker device may display or play the learning plan.

207. And planning a learning plan of the target object according to the evaluation indication information.

Optionally, the speaker device plans a learning plan of the target object according to the evaluation indication information. For example, if the second written information is written correctly, then fewer points may be planned in the learning plan for the target object with respect to the learning duration of the knowledge point; if the second written information is erroneous, then a plurality of points may be planned in the learning plan for the target object with respect to the learning duration for the knowledge point. The second writing information may be learning information accumulated for a period of time, or may be current writing information, which is not limited herein.

Optionally, in some embodiments of the present invention, the sound box device may play entertainment information when detecting that the learning duration of the target object is greater than a preset duration. The entertainment information can enable the user to relax the brain in the learning process, and achieves the effect of labor and escape combination.

Furthermore, the sound box device can interact with the terminal device used by parents, teachers or other guardians, namely, the second writing information is sent to the terminal device, the terminal device can learn and plan the learning of the child according to the second writing information, and the sound box device can learn and plan the learning of the child according to the second writing information, so that the child can learn better.

As shown in fig. 3, another embodiment of a method for tracking a target object in an embodiment of the present invention is shown, where the method is applied to a speaker apparatus, where the speaker apparatus includes a first camera with a pulley, and the method may include:

301. and acquiring a first opening instruction.

302. And starting the first camera according to the first starting instruction.

303. And recognizing a first gesture of the target object at a first moment and first writing information on the desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold value.

304. And moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

It should be noted that, steps 301 to 304 may refer to steps 101 to 104 in the embodiment shown in fig. 1, and will not be described herein.

305. And acquiring a second opening instruction.

The speaker device obtaining the second opening instruction may include: the sound box equipment responds to input operation of a user and generates a second opening instruction; or the sound box equipment receives a second opening instruction sent by other electronic equipment.

The user can perform input operations such as clicking, double clicking, sliding, pressing and the like on a preset control in the sound box device; or, the user performs a preset input operation on the sound box device, for example, rotating the sound box device up and down or left and right, shaking the sound box device, and the like; the speaker device generates a second opening instruction in response to these input operations by the user.

Or the user sends out the sound of the preset information, and the sound box equipment detects the sound of the preset information to generate a second opening instruction.

Or, other electronic devices (for example, remote controllers, terminal devices, wearable devices, etc.), respond to the operation of the user, generate a second opening instruction, send the second opening instruction to the sound box device, and the sound box device receives the second opening instruction.

It can be understood that the manner in which the other electronic devices send the second opening instruction to the speaker device may be a wired manner or a wireless manner, which is not limited in particular. Wireless means may include wireless fidelity (Wireless Fidelity, WIFI) or bluetooth, infrared, etc.

306. And starting the second camera according to the second starting instruction.

The sound box equipment can start the second camera according to the second starting instruction. I.e. after the second camera is turned on, the second camera may start to operate.

It will be appreciated that the loudspeaker device further comprises a second camera. The first camera is used for recognizing the desktop condition and gesture interaction of the child, the first camera looks down at the desktop, and the sound box equipment can interact with a user based on the first camera and voice; the second camera is used for identifying sitting postures and facial expressions of the children; further, the speaker apparatus is provided with a pulley, which can be used for autonomous movement.

307. And identifying the current sitting posture of the target object through the second camera.

The sound box device can recognize the current sitting posture of the target object through the second camera. Further, the facial expression of the target object can be recognized through the second camera. Facial expressions may include, among other things, happiness, laughing, difficulty, thinking, crying, etc.

The sound box device can play different music, or play different videos, or play different pictures, or play different types of learning content, etc. through different facial expressions of the target object.

308. And when the current sitting posture is not matched with the preset reference sitting posture, generating prompt information, wherein the prompt information is used for prompting the target object to adjust the information of the current sitting posture.

When the current sitting posture is not matched with the preset reference sitting posture, the sound box equipment generates prompt information which is used for prompting the target object to adjust the information of the current sitting posture.

Optionally, when the current sitting posture matches with a preset reference sitting posture, the sound box device generates the information of the appearance, and the information of the appearance is used for prompting the target object to keep the information of the current sitting posture.

The current sitting posture of the user can be lying on a table, can be straight, or can be other sitting postures.

309. And reminding the prompt information.

The sound box device reminds the prompt information, which can include but is not limited to the following implementation modes:

the sound box equipment reminds the prompt information in a text mode; or,

the sound box equipment reminds the prompt information in a text and voice mode; or,

the sound box equipment reminds the prompt information in a voice mode.

Optionally, the speaker device may wake up the information, which may include, but is not limited to, the following implementations:

the sound box equipment reminds the information of the surface through a text mode; or,

the sound box equipment reminds the information of the surface through words and voices; or,

The sound box equipment reminds the information of the surface through a voice mode.

It should be noted that the timing of steps 301-304 and steps 305-309 are not limited.

Further, the other camera of the sound box device can recognize the current sitting posture of the target object through the second camera; when the current sitting posture is not matched with a preset reference sitting posture, generating prompt information, wherein the prompt information is used for prompting the target object to adjust the information of the current sitting posture; and reminding the prompt information. The sitting posture-keeping device is used for the user to form a good sitting posture and pay attention to the health.

As shown in fig. 4A, an embodiment of a sound box apparatus according to an embodiment of the present invention is shown, where the sound box apparatus includes a first camera with a pulley, and the sound box apparatus may further include:

an obtaining module 401, configured to obtain a first opening instruction;

a processing module 402, configured to turn on the first camera according to the first turn-on instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Alternatively, in some embodiments of the invention,

the processing module 402 is further configured to match the second writing information with preset reference writing information; if the second writing information is matched with the reference writing information, determining that the second writing information is accurate, and prompting indication information of successful writing; if the second writing information is not matched with the reference writing information, determining that the second writing information is inaccurate, and prompting indication information of writing errors.

Optionally, in some embodiments of the present invention, as shown in fig. 4B, another embodiment of the sound box apparatus in the embodiment of the present invention is shown. The sound box device further includes: a transmission module 403;

a sending module 403, configured to send the second writing information to a terminal device, where the second writing information is used for a user of the terminal device to view;

the obtaining module 401 is further configured to receive evaluation indication information sent by the terminal device for the second writing information;

the processing module 402 is further configured to plan a learning plan of the target object according to the evaluation indication information.

The obtaining module 401 is further configured to obtain a second opening instruction;

the processing module 402 is further configured to turn on the second camera according to the second turn-on instruction; identifying the current sitting posture of the target object through the second camera; when the current sitting posture is not matched with a preset reference sitting posture, generating prompt information, wherein the prompt information is used for prompting the target object to adjust the information of the current sitting posture; and reminding the prompt information.

Alternatively, in some embodiments of the invention,

the processing module 402 is specifically configured to remind the prompt information by text;

or,

the processing module 402 is specifically configured to remind the prompt information by text and voice;

or,

the processing module 402 is specifically configured to remind the prompt information by means of voice.

Alternatively, in some embodiments of the invention,

the processing module 402 is further configured to identify, by the second camera, a facial expression of the target object; different music, or different videos, or different pictures are played through different facial expressions of the target object.

As shown in fig. 5A, which is a schematic diagram of another embodiment of the soundbox apparatus according to an embodiment of the present invention, may include:

A memory 501 in which executable program codes are stored;

a processor 502 coupled to the memory;

a transceiver 503, the transceiver 503 being connected to the processor 502 via a bus;

a first camera 504 provided with a pulley, the first camera 504 provided with a pulley being connected to the processor 502;

the transceiver 503 is configured to perform the following steps:

acquiring a first opening instruction;

the processor 502 invokes the executable program code stored in the memory 501 for performing the steps of:

opening the first camera 504 according to the first opening instruction;

recognizing a first gesture of a target object at a first moment and first written information on a desktop in a preset range through the first camera 504, wherein the definition of the first written information on the desktop is not in a preset threshold;

and moving the pulley according to the position indicated by the first gesture, so that the first camera 504 recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

Alternatively, in some embodiments of the invention,

The processor 502 invokes the executable program code stored in the memory 501, and is further configured to perform the steps of:

Alternatively, in some embodiments of the invention,

the transceiver 503 is configured to perform the following steps:

Optionally, in some embodiments of the present invention, as shown in fig. 5B, another embodiment of the sound box apparatus in the embodiment of the present invention is shown. The sound box device comprises a second camera 505;

The transceiver 503 is configured to perform the following steps:

acquiring a second opening instruction;

opening the second camera 505 according to the second opening instruction;

identifying a current sitting position of the target object by the second camera 505;

and reminding the prompt information.

Alternatively, in some embodiments of the invention,

reminding the prompt information in a text mode;

or,

reminding the prompt information in a text and voice mode;

or,

and reminding the prompt information in a voice mode.

Alternatively, in some embodiments of the invention,

identifying facial expressions of the target object through the second camera; different music, or different videos, or different pictures are played through different facial expressions of the target object.

In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product.

The computer program product includes one or more computer instructions. When loaded and executed on a computer, produces a flow or function in accordance with embodiments of the present invention, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by a wired (e.g., coaxial cable, fiber optic, digital subscriber line (Digital Subscriber Line, DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer readable storage medium may be any available medium that can be stored by a computer or a data storage device such as a server, data center, etc. that contains an integration of one or more available media. The usable medium may be a magnetic medium (e.g., a floppy Disk, a hard Disk, a magnetic tape), an optical medium (e.g., DVD (Digital Video Disc)), or a semiconductor medium (e.g., a Solid State Disk (SSD)), or the like.

It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.

In the several embodiments provided in the present invention, it should be understood that the disclosed systems, devices, and methods may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.

The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.

The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied essentially or in part or all of the technical solution or in part in the form of a software product stored in a storage medium, including instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.

The above embodiments are only for illustrating the technical solution of the present invention, and not for limiting the same; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims

1. A method of tracking a target object, the method being applied to a loudspeaker device including a first camera having a pulley, the method comprising:

acquiring a first opening instruction;

opening the first camera according to the first opening instruction;

recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold value, and the first gesture is a gesture of the target object pointing to the first writing information on the desktop;

2. The method according to claim 1, wherein the method further comprises:

3. The method according to claim 1 or 2, characterized in that the method further comprises:

4. A method according to claim 1 or 2, wherein the sound box device comprises a second camera; the method further comprises the steps of:

acquiring a second opening instruction;

opening the second camera according to the second opening instruction;

and reminding the prompt information.

5. The method of claim 4, wherein the prompting the reminder information comprises:

reminding the prompt information in a text mode;

or,

reminding the prompt information in a text and voice mode;

or,

and reminding the prompt information in a voice mode.

6. The utility model provides a sound box equipment, its characterized in that, sound box equipment includes the first camera that possesses the pulley, sound box equipment still includes:

the acquisition module is used for acquiring a first opening instruction;

the processing module is used for starting the first camera according to the first starting instruction; recognizing a first gesture of a target object at a first moment and first writing information on a desktop in a preset range through the first camera, wherein the definition of the first writing information on the desktop is not in a preset threshold value, and the first gesture is a gesture of the target object pointing to the first writing information on the desktop; and moving the pulley according to the position indicated by the first gesture, so that the first camera recognizes a second gesture of the target object at a second moment and second writing information on a desktop within a preset range, and the definition of the second writing information on the desktop is within the preset threshold.

7. The sound box apparatus of claim 6, wherein,

8. The loudspeaker apparatus of claim 6 or 7, wherein the loudspeaker apparatus further comprises: a transmitting module;

9. The loudspeaker device of claim 6 or 7, wherein the loudspeaker device comprises a second camera;

The acquisition module is also used for acquiring a second opening instruction;

10. The loudspeaker apparatus of claim 9 wherein the speaker unit is configured to receive the sound signal,

or,