WO2006031731A2 - Apparatus and method for capturing the expression of a performer - Google Patents

Apparatus and method for capturing the expression of a performer

Info

Publication number
WO2006031731A2
Authority
WO
WIPO (PCT)
Prior art keywords
color
curves
motion
data
motion capture
Prior art date
Application number
PCT/US2005/032418
Other languages
French (fr)
Other versions
WO2006031731A3 (en
Inventor
Stephen G. Perlman
Kenneth A. Pearce
Tim S. Cotter
Greg Lasalle
John Speck
Original Assignee
Rearden, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/942,413 (US8194093B2)
Priority claimed from US10/942,609 (US20060055706A1)
Application filed by Rearden, Inc.
Publication of WO2006031731A2
Publication of WO2006031731A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G06T7/246 Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161 Detection; Localisation; Normalisation
    • G06V40/166 Detection; Localisation; Normalisation using acquisition arrangements
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30196 Human being; Person
    • G06T2207/30201 Face
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/24 Aligning, centring, orientation detection or correction of the image
    • G06V10/245 Aligning, centring, orientation detection or correction of the image by locating a pattern; Special marks for positioning

Definitions

  • This invention relates generally to the field of motion capture.
  • the invention relates to an improved apparatus and method for tracking and capturing the motion and/or expression of a performer.
  • Motion capture refers generally to the tracking and recording of human motion. Motion capture systems are used for a variety of applications including, for example, video games and computer-generated movies. In a typical motion capture session, the motion of a "performer" is captured and translated to a computer-generated character.
  • a plurality of motion tracking markers 101-116 are attached at various points on a performer's body.
  • the points are selected based on the known limitations of the human skeleton.
  • markers 107 and 114 attached to the performer's knees, represent pivot points for markers 115 and 116, attached to the performer's feet.
  • markers 104 and 111 attached to the performer's elbows, represent pivot points for sensors 105 and 112, attached to the performer's hands.
  • the motion markers attached to the performer are active devices that measure their position in a magnetic field enveloping the performer.
  • the motion markers 101-116 are comprised of retro-reflective material, i.e., a material which reflects light back in the direction from which it came, ideally over a wide range of angles of incidence.
  • Two or more cameras 120, 121, 122 are positioned to capture the light reflected off of the retro-reflective markers 101-116.
  • a motion tracking unit 150 coupled to the cameras is programmed with the relative position of each of the markers 101-116 and the known limitations of the performer's body. For example, if the relationship between motion sensors 107 and 115 is programmed into the motion tracking unit 150, the motion tracking unit 150 will understand that sensors 107 and 115 are always a fixed distance apart, and that sensor 115 may move relative to sensor 107 within a specified range. These constraints usually allow the motion capture system to distinguish each marker from the others and thereby know which part of the body each marker's position represents. The markers themselves do not identify any body parts, only their own position and identity.
  • the motion capture system is able to determine the position of the markers 101-116 via triangulation between multiple cameras (at least 2) that see the same marker. Using this information and the visual data provided from the cameras 120-122, the motion tracking unit 150 generates artificial motion data representing the movement of the performer during the motion capture session.
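The triangulation step can be illustrated with a minimal sketch. It assumes each camera has already been calibrated so that a marker seen in its image corresponds to a viewing ray (a camera centre plus a direction); the function name `triangulate_midpoint` and the midpoint-of-closest-approach method are illustrative choices, not something the patent specifies.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]


def _sub(a: Vec3, b: Vec3) -> Vec3:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a: Vec3, b: Vec3) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _along(origin: Vec3, direction: Vec3, t: float) -> Vec3:
    return tuple(o + t * d for o, d in zip(origin, direction))


def triangulate_midpoint(o1: Vec3, d1: Vec3, o2: Vec3, d2: Vec3) -> Vec3:
    """Estimate a marker position from two viewing rays (hypothetical helper).

    Each ray is origin + t * direction. Because of noise the rays rarely
    intersect exactly, so the midpoint of their closest approach is returned.
    """
    w = _sub(o1, o2)
    a, b, c = _dot(d1, d1), _dot(d1, d2), _dot(d2, d2)
    d, e = _dot(d1, w), _dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("viewing rays are parallel; cannot triangulate")
    t1 = (b * e - c * d) / denom  # parameter of the closest point on ray 1
    t2 = (a * e - b * d) / denom  # parameter of the closest point on ray 2
    p1, p2 = _along(o1, d1, t1), _along(o2, d2, t2)
    return tuple((u + v) / 2.0 for u, v in zip(p1, p2))


# Two cameras one unit either side of the origin, both seeing a marker at (0, 0, 5).
marker = triangulate_midpoint((-1.0, 0.0, 0.0), (1.0, 0.0, 5.0),
                              (1.0, 0.0, 0.0), (-1.0, 0.0, 5.0))
print(marker)
```

With more than two cameras, the same idea extends to a least-squares fit over all rays that see the marker.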
  • a graphics processing unit 152 renders an animated representation of the performer on a computer display 160 (or similar display device) using the motion data. For example, the graphics processing unit 152 may apply the captured motion of the performer to different animated characters and/or to include the animated characters in different computer-generated scenes.
  • the motion tracking unit 150 and the graphics processing unit 152 are programmable cards coupled to the bus of a computer (e.g., such as the PCI and AGP buses found in many personal computers).
  • Motion Analysis Corporation (see, e.g., www.motionanalysis.com).
  • the motion tracking unit 150 may lose track of the markers. For example, if a performer lies down on the floor on his/her stomach (thereby covering a number of markers), moves around on the floor and then stands back up, the motion tracking unit 150 may not be capable of re-identifying all of the markers.
  • a method comprising: applying a series of curves on specified regions of a performer's face; tracking the movement of the series of curves during a motion capture session; and generating motion data representing the movement of the performer's face using the tracked movement of the series of curves.
  • FIG. 1 illustrates a prior art motion tracking system for tracking the motion of a performer using retro-reflective markers and cameras.
  • FIG. 2 illustrates one embodiment of the invention which employs color coded retro-reflective markers to improve tracking performance.
  • FIG. 3 illustrates a portion of a color-coded database employed in one embodiment of the invention.
  • FIG. 4 illustrates a method for tracking a performer's facial expressions according to one embodiment of the invention.
  • FIGS. 5a-b illustrate an exemplary curve pattern employed in one embodiment of the invention.
  • FIG. 6 illustrates a connectivity map employed in one embodiment of the invention.
  • FIG. 7 illustrates a camera arrangement in which a plurality of cameras are focused on a specified volume of space.
  • FIG. 8 illustrates extrapolation of points within a surface patch used in one embodiment of the invention.
  • FIG. 9 illustrates an exemplary series of curves captured and analyzed by the embodiments of the invention described herein.
  • Figure 2 illustrates one embodiment of the invention which tracks the motion of a performer more precisely than prior motion capture systems.
  • a plurality of retro-reflective markers 201-216 are positioned at various points of the performer's body.
  • color coding is applied to the retro-reflective markers 201-216 to enable more effective tracking of the markers.
  • each element 201-216 reflects light of different colors (i.e., different frequencies). The different colors may then be used to uniquely identify each individual retro-reflective element.
  • the motion capture system comprises at least one camera controller 250, a motion capture controller
  • each camera 220-222 may itself include a camera controller (i.e., in lieu, or in addition to the camera controller 250 included within the motion capture system 200). In another embodiment, the camera controller may be included within the motion capture controller 252.
  • Each camera controller 250 is provided with color coding data
  • the color coding data 253 may be stored within a database on the motion capture system 200 (along with the position of each of the markers 201-216 on the performer's body and/or the physical relationship between each of the markers).
  • An exemplary portion of the database is illustrated in Figure 3 which shows how a different color may be associated with the position of each retro-reflective element 201-216 on the performer's body (e.g., the color blue is associated with the element on the performer's left knee).
  • the colors may be represented by different levels of red (“R"), green (“G”) and blue (“B").
  • the camera controller 250 uniquely identifies each individual retro-reflective element. As such, when a group of markers 201-216 move out of range of the cameras, the camera controller 250 no longer needs to rely on the physical relationship between the markers to identify the markers when they move back in range (as in current motion capture systems). Rather, if a particular color is reflected from an element, the camera controller 250 immediately knows which element the light emanated from based on the color coding scheme. The end result is that the "clean up" process is significantly reduced, or eliminated altogether, resulting in significantly reduced production costs.
  • the number of colors used is less than the total number of retro-reflective markers 201-216. That is, the same color (or similar colors) may be used for two or more retro-reflective markers 201-216. Accordingly, to distinguish between markers of the same (or similar) colors, the camera controller 250 may also factor in the physical relationship between each of the markers to improve accuracy as in prior systems. This information may be useful, for example, if a significant number of retro-reflective markers are used, resulting in colors which are too similar to accurately differentiate. In addition, from a practical standpoint, it may be easier to work with retro-reflective markers of a limited number of colors. Given that the camera controller 250 may be programmed with the relationship between each of the retro-reflective markers 201-216, a color-coding scheme of even a few colors will improve accuracy significantly.
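The color-coding scheme, including the fallback to physical relationships when several markers share a color, can be sketched as follows. Only the association of blue with the left knee comes from the text; the other table entries, the tolerance value, the distances, and all function names are hypothetical and chosen purely for illustration.

```python
from math import dist  # Python 3.8+

# Hypothetical excerpt of a color-coding table (RGB levels per marker).
MARKER_COLORS = {
    "left_knee": (0, 0, 255),    # blue, as in the Figure 3 example
    "left_foot": (0, 0, 255),    # assumed to share blue with the knee
    "right_elbow": (0, 255, 0),  # assumed
}


def candidates_by_color(observed_rgb, table, tolerance=60.0):
    """Return every marker whose coded color lies within `tolerance` of the observation."""
    return sorted(name for name, rgb in table.items()
                  if dist(observed_rgb, rgb) <= tolerance)


def identify(observed_rgb, position, table, known_positions, expected_distance,
             tolerance=60.0):
    """Identify a marker by color, using known inter-marker distances to break ties.

    `expected_distance` maps (marker, neighbour) pairs to the programmed distance
    between them; `known_positions` holds neighbours that are already identified.
    """
    names = candidates_by_color(observed_rgb, table, tolerance)
    if not names:
        return None
    if len(names) == 1:
        return names[0]

    def mismatch(name):
        errors = [abs(dist(position, known_positions[other]) - d)
                  for (marker, other), d in expected_distance.items()
                  if marker == name and other in known_positions]
        return min(errors) if errors else float("inf")

    return min(names, key=mismatch)


known = {"left_hip": (0.0, 1.0, 0.0)}
expected = {("left_knee", "left_hip"): 0.5, ("left_foot", "left_hip"): 1.0}
print(identify((10, 5, 240), (0.0, 0.5, 0.0), MARKER_COLORS, known, expected))
```

A unique color resolves immediately; only shared or near-identical colors need the distance check, which is why even a few colors can reduce ambiguity considerably.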
  • each of the plurality of cameras 220-222 supports a resolution of 640X480 pixels at 100 frames per second and video is captured in the form of a stream of bitmap images.
  • any video format may be employed while still complying with the underlying principles of the invention.
  • the cameras are coupled to the camera controller 250 via an IEEE-1394 ("FireWire") port such as an IEEE-1394A ("FireWire A”) port.
  • the cameras may be coupled via IEEE-1394B (“FireWire B”), Universal Serial Bus 2.0 (“USB 2.0”), or an IEEE-802.11 wireless channel. It should be noted, however, that the underlying principles of the present invention are not limited to any particular communication standard.
  • An exemplary architecture of the camera controller 250 includes a FireWire A bus for each controlled camera 220-222, a processor sufficient to record the video stream from each controlled camera 220-222, Random Access Memory ("RAM") sufficient to capture the video stream from the cameras 220-222, and storage sufficient to store several (e.g., two) hours of captured video per camera 220-222.
  • the camera controller 250 may include a 2.4 GHz Intel Pentium® processor, 1 GB of RAM, 3 Serial ATA 200 GB hard drives, and Microsoft Windows XP®.
  • the camera controller 250 and the motion capture controller 252 are programmable cards coupled to the bus of a computer (e.g., such as a PCI/AGP bus).
  • the camera controller 250 may also compress the video using one or more digital video compression formats (e.g., MPEG-4, Real Video 8, AVI, etc.).
  • the cameras 220-222 are frame-synchronized for capturing video. Synchronization may be performed by a separate synchronization unit (not shown) communicatively connected to each camera 220-222. Alternatively, synchronization may be performed through FireWire (e.g., with each FireWire bus providing a synchronization signal to each camera).
  • the camera controller 250 is communicatively connected to a motion capture controller 252 through a Category 6 Ethernet cable.
  • Other embodiments of the connection include, but are not limited to, FireWire, USB 2.0, and IEEE 802.11 wireless connection.
  • An exemplary architecture of a motion capture controller comprises a processor and volatile memory sufficient to process collected data from the camera controller 250 and sufficient storage to store the processed data.
  • One specific example of an architecture is a Dual two gigahertz G5 Power Macintosh®, two gigabytes of Random Access Memory (“RAM”) and a two hundred gigabyte hard drive.
  • the camera controller 250 and the motion capture controller 252 are programmable cards coupled to the bus of a computer (e.g., such as a PCI/AGP bus), or may be implemented as software executed on a single computer.
  • the underlying principles of the invention are not limited to any particular hardware or software architecture.
  • the motion capture controller 252 uses the motion data captured by the camera controller to generate 3-D motion data representing the motion of the performer during a performance.
  • the 3-D representation may be used, for example, to render a graphical animation of a character on a computer display 260 (or similar display device).
  • the motion capture controller 252 may include the animated character in different computer-generated scenes.
  • the motion capture controller 252 may store the 3-D motion data in a file (e.g., a .obj file) which may subsequently be used to reconstruct the motion of the performer.
  • the "point cloud” may be comprised of color-coded retro-reflective markers, each of which may be uniquely identified by a motion tracking unit 250 based on color and/or relative position.
  • Another problem with current motion capture systems is that the number of markers on the face is limited. Thus, not enough points for sensitive and critical movements (e.g., movement around the mouth and eyes) exist in order to make a faithful recreation of the performer's face.
  • markers on the face can interfere with the performer's performance or with its capture. For example, markers on the lips may get in the way of natural lip motion in speech, or if an expression results in a lip being curled into the mouth, a marker may become completely obscured from all the motion capture cameras.
  • a series of reflective curves are painted on the performer's face and the displacement of the series of curves is tracked over time.
  • the system is able to generate significantly more surface data than traditional marker-based tracking systems.
  • Although a series of reflective "curves" are painted on the performer's face in the embodiments of the invention described below, the underlying principles of the invention may also be implemented using a variety of other types of facial markings (e.g., using a grid of horizontal and vertical lines deformed over the performer's face).
  • Figure 4 illustrates one embodiment of a motion tracking system for performing the foregoing operations.
  • a predefined facial curve pattern 401 is adjusted to fit the topology of each performer's face 402.
  • the three-dimensional (3-D) curve pattern is adjusted based on a 3-D map of the topology of the performer's face captured using a 3-D scanning system.
  • the scan may be performed, for example, using a 3-D scanning system such as those available from Cyberware® (e.g., using the Cyberware® Color 3-D Scanner, Model 3030RGB/PS).
  • a unique facial curve pattern 401 may then be created using the scanned 3-D facial topology.
  • the performer will be asked to provide a "neutral" expression during the scanning process.
  • the curves defined by the curve pattern 401 are painted on the face of the performer using retro-reflective, non-toxic paint or theatrical makeup with colors corresponding to the colors shown in Figures 5a-b.
  • the performer's face is first painted with a solid contrasting color (e.g. black) to the lines that are subsequently painted.
  • paints that glow under special illumination (e.g., so-called "black lights") are used so as to be distinctly delineated when so illuminated.
  • a physical 3-D mask is created with slits/holes corresponding to the curves defined by the curve pattern. The 3-D mask may then be placed over the face of the performer to apply the paint.
  • the 3-D mask is generated by providing the scanned topology of the user's face to a 3-D printer.
  • a preexisting mask may be used.
  • Features of the mask may be aligned and stretched to features of the performer (e.g., the nose holes of the mask fit over the nose holes of the performer, the mouth area of the mask fits over the mouth of the performer, the eye holes of the mask fit over the eye sockets of the performer, etc).
  • a projection (e.g., a projection of light) onto the performer's face may serve as a guide for painting the curve pattern.
  • the 3-D curve pattern may be manually adjusted to the face of the performer (e.g., by a makeup artist). Once a particular curve pattern is selected, curves may be placed on a given performer in the same locations each time they are applied using, for example, a projector or a stencil.
  • Figure 5a illustrates an exemplary curve pattern, flattened into a 2D image
  • Figure 5b illustrates the curve pattern applied to an exemplary performer's face in 3D.
  • the curve pattern is designed to meet the visual requirements of the optical capture system while still representing a configuration of surface patches and/or polygons that lends itself to good quality facial deformation. In areas of high deformation, short lines with many intersections help achieve higher resolution.
  • each curve has a unique identifying name and/or number (to support systematic data processing) and a color that can be easily identified by the optical capture system.
  • Three different curve colors are associated with three different possible facial curve types:
  • Contours generally form concentric loops around the mouth and eyes. Contours are colored red in Figures 5a-b (e.g., lines 100-107; 300-301; 400-402; and 1400-1402).
  • Transition curves are neither clearly contours nor radials. Transition curves are colored blue in Figures 5a-b (e.g., lines 700-701; 900; 1700-1701; 1900; and 3002-3004).
  • no curve can intersect another curve of the same color (or type).
  • Another defined property of the curve pattern is that each polygon and/or surface patch created by the curves must be a quadrilateral. The above list of properties is not necessarily exhaustive, and all of the above listed properties do not need to be followed in generating the curve pattern 401.
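The two pattern properties stated above (curves of the same color never intersect, and every patch is a quadrilateral) lend themselves to a simple consistency check. The sketch below is a rough illustration only: the data layout, the function name `check_pattern`, and the specific patch are assumptions, and it treats a quadrilateral patch as one bounded by four distinct curves. The curve numbers and colors reuse the contour (red) and transition (blue) examples from the text.

```python
def check_pattern(curve_colors, intersections, patches):
    """Return a list of human-readable violations of the two pattern properties.

    curve_colors:  {curve_id: color}
    intersections: iterable of (curve_id, curve_id) pairs that cross
    patches:       {patch_name: [bounding curve ids]}
    """
    problems = []
    for a, b in intersections:
        if curve_colors[a] == curve_colors[b]:
            problems.append(
                f"curves {a} and {b} are both {curve_colors[a]} but intersect")
    for name, bounds in patches.items():
        if len(set(bounds)) != 4:
            problems.append(
                f"patch {name} has {len(set(bounds))} bounding curves, expected 4")
    return problems


# Hypothetical fragment: two red contours crossed by two blue transition curves.
colors = {100: "red", 101: "red", 700: "blue", 701: "blue"}
crossings = [(100, 700), (700, 101), (101, 701), (701, 100)]
patches = {"patch_a": [100, 700, 101, 701]}

print(check_pattern(colors, crossings, patches))               # no violations
print(check_pattern(colors, crossings + [(100, 101)], patches))  # one violation
```

Because the text notes that not every property must be followed, a check like this would more naturally report warnings than reject a pattern outright.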
  • the curve pattern is tracked by a motion capture processing system 410 comprised of one or more camera controllers 405 and a central motion capture controller 406 during the course of a performance.
  • each of the camera controllers 405 and central motion capture controller 406 is implemented using a separate computer system.
  • the camera controllers and motion capture controller may be implemented as software executed on a single computer system or as any combination of hardware and software.
  • each of the camera controllers 405 and/or the motion capture controller 406 is programmed with data 403 representing the curve pattern 401.
  • the motion capture system 410 uses this information to trace the movement of each curve within the curve pattern during a performance. For example, the performer's facial expressions provided by each of the cameras 404 (e.g., as bitmap images) are analyzed and the curves identified using the defined curve pattern.
  • the curve data 403 is provided to the motion capture system in the form of a "connectivity map," an example of which is illustrated in Figure 6.
  • the connectivity map is a text file representation of the curve pattern 401 which includes a list of all curves in the pattern and a list of all surface patches in the pattern, with each patch defined by its bounding curves. It is used by the camera controllers 405 and/or the central motion capture controller 406 to identify curves and intersections in the optically captured data. This, in turn, allows point data from the curves to be organized into surface patches and ultimately the triangulated mesh of a final 3-D geometry 407.
  • the connectivity map includes the following four sections: (1) A single command to set the level of subdivision for all curves (identified as "Section 0" in Figure 6). This determines how many polygonal faces will be created between intersections along each curve.
  • the connectivity map is stored as an extended .obj file (such as the .obj files supported by certain 3D modeling software packages, such as Maya, by Alias Systems Corp.), with the section data described above appearing as comments.
  • the connectivity map may be stored as an .obj file without the extensions referred to in the previous sentence.
  • the motion capture system 410 performs multiple levels of motion capture processing.
  • Each camera controller is responsible for capturing video provided from one or more cameras 404, storing it to disk, and performing the first portion of the motion capture processing under the control of the motion capture controller 406.
  • a single command from the motion capture controller 406 may be generated to instruct all camera controllers to start or stop a capture session, thereby allowing for frame-synchronized captures when combined with an external synchronization trigger.
  • each camera controller 405 captures video streams and stores the streams to a storage device (e.g., a hard drive) for subsequent processing.
  • the streams are stored in an Audio Video Interleave ("AVI") format, although various other formats may be used.
  • each camera controller performs the following operations for each frame of captured AVI video.
  • each of the images is visually optimized and cleaned so that curves may be easily identified apart from background noise.
  • the contrast is increased between any background images/noise and the curve pattern.
  • color balance adjustments may be applied so that the relative balances of red, green and blue are accurate.
  • Various other image processing techniques may be applied to the image prior to identifying each of the curves.
  • the curves are mathematically located from within the images.
  • the intersection points of each of the curves are also located.
  • the mesh definition in the connectivity map is then used to identify the curves in each of the images. In one embodiment, this is accomplished by correlating the captured images with the curve data provided in the connectivity map. Once the curves and intersection points are identified, curve data is quantized into line segments to support the final desired polygonal resolution. The resulting intersection points of the lines are then used as the vertices of planar triangles that make up the output geometric mesh.
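Quantizing a located curve into line segments can be pictured as resampling the curve at evenly spaced arc-length positions between two intersection points. The sketch below is one possible way to do that, not the patent's stated method; `resample_polyline` and its `segments` parameter (playing the role of a subdivision level) are hypothetical names.

```python
from math import dist  # Python 3.8+


def resample_polyline(points, segments):
    """Resample a polyline into `segments` pieces of equal arc length.

    `points` is the densely traced curve between two intersections; the
    result has segments + 1 points, including both end points.
    """
    if segments < 1 or len(points) < 2:
        raise ValueError("need at least two points and one segment")
    # Cumulative arc length at each input point.
    cumulative = [0.0]
    for p, q in zip(points, points[1:]):
        cumulative.append(cumulative[-1] + dist(p, q))
    total = cumulative[-1]

    resampled = []
    j = 0  # index of the input span currently being walked
    for k in range(segments + 1):
        target = total * k / segments
        while j < len(points) - 2 and cumulative[j + 1] < target:
            j += 1
        span = cumulative[j + 1] - cumulative[j]
        t = 0.0 if span == 0 else (target - cumulative[j]) / span
        p, q = points[j], points[j + 1]
        resampled.append(tuple(a + t * (b - a) for a, b in zip(p, q)))
    return resampled


# An L-shaped trace of total length 4, cut into four unit-length segments.
print(resample_polyline([(0, 0), (2, 0), (2, 2)], 4))
```

A higher segment count yields more vertices per curve and therefore a finer polygonal mesh, at the cost of more data per frame.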
  • Figure 8 illustrates a surface patch defined by four intersection points 801-804.
  • a series of points are identified along each of the curves, such as point 810 on the curve defined by intersection points 801 and 803; point 811 on the curve defined by intersection points 802 and 804; point 812 on the curve defined by intersection points 801 and 802; and point 813 on the curve defined by intersection points 803 and 804.
  • three points are identified on each of the curves. It should be noted, however, that more or fewer points may be identified on each curve while still complying with the underlying principles of the invention (e.g., depending on the desired resolution of the system).
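One way to derive interior points of a surface patch from the points sampled on its four bounding curves is a bilinearly blended Coons patch. The text does not name the scheme it uses, so the sketch below is a stand-in technique under that assumption; the function name and the edge naming (`bottom` for the 801-802 curve, `top` for 803-804, `left` for 801-803, `right` for 802-804) are illustrative.

```python
def coons_patch(bottom, top, left, right):
    """Fill a quadrilateral patch from its four sampled boundary curves.

    bottom/top run from the left corner to the right corner; left/right run
    from the bottom corner to the top corner. Each edge includes its corner
    points, so three interior samples per curve give five points per edge.
    Returns a grid of points indexed as grid[row][column].
    """
    n, m = len(bottom) - 1, len(left) - 1
    if len(top) != n + 1 or len(right) != m + 1:
        raise ValueError("opposite edges must have the same number of points")
    dims = range(len(bottom[0]))
    grid = []
    for j in range(m + 1):
        v = j / m
        row = []
        for i in range(n + 1):
            u = i / n
            row.append(tuple(
                (1 - v) * bottom[i][k] + v * top[i][k]
                + (1 - u) * left[j][k] + u * right[j][k]
                # subtract the bilinear blend of the corners, counted twice above
                - ((1 - u) * (1 - v) * bottom[0][k] + u * (1 - v) * bottom[n][k]
                   + (1 - u) * v * top[0][k] + u * v * top[n][k])
                for k in dims))
        grid.append(row)
    return grid


# A flat unit square with three interior samples on every edge.
ticks = [0.0, 0.25, 0.5, 0.75, 1.0]
grid = coons_patch(bottom=[(t, 0.0) for t in ticks], top=[(t, 1.0) for t in ticks],
                   left=[(0.0, t) for t in ticks], right=[(1.0, t) for t in ticks])
print(grid[2][2])  # centre of the patch
```

The grid reproduces the boundary samples exactly along its outer rows and columns, and each grid cell can then be split into two triangles for the output mesh.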
  • the data collected in the foregoing manner is stored in a 2-D curve file.
  • Each camera controller generates a separate 2-D curve file containing 2-D data collected from the unique perspective of its camera.
  • the 2-D curve file is an .obj file (e.g., with all Z coordinates set to zero).
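As a rough illustration of a 2-D curve file stored in .obj form, the sketch below emits standard Wavefront OBJ `v` (vertex) and `f` (face) records with every Z coordinate set to zero. The function name `format_obj_2d` and the six-decimal formatting are assumptions; the extended sections and comments described for the connectivity map are not reproduced here.

```python
def format_obj_2d(vertices_2d, faces):
    """Render 2-D points and triangles as OBJ text with all Z coordinates zero.

    vertices_2d: iterable of (x, y) pairs in image coordinates
    faces:       iterable of zero-based vertex index triples
    """
    lines = [f"v {x:.6f} {y:.6f} {0.0:.6f}" for x, y in vertices_2d]
    # OBJ face records refer to vertices by one-based index.
    lines += ["f " + " ".join(str(i + 1) for i in face) for face in faces]
    return "\n".join(lines) + "\n"


print(format_obj_2d([(0, 0), (1, 0), (0, 1)], [(0, 1, 2)]), end="")
```

Keeping the 2-D data in the same format as the later 3-D output means the same reader can load both; only the Z column differs.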
  • the 2-D curve files are provided to the central motion capture controller 406 which uses the data within the 2-D curve files to generate a 3-D representation of each of the curves and vertices.
  • Using the location of the 2-D curves and vertices provided from different perspectives, the central motion capture controller generates full 3-D data (i.e., including Z values) for each of the curves/vertices.
  • the central motion capture controller stores the 3-D data within a single .obj file. Once again, however, various alternate file formats may be used.
  • the end result is a single geometric mesh definition per frame of capture.
  • This geometric mesh is a close approximation of the surface of the face at each frame of capture, and when viewed in succession, the sequence of meshes provides a close approximation of the motion of the face.
  • only a single reference frame is used to generate the 3D mesh. All subsequent motion frames will then use the location information of the points of each curve to reposition the vertices of the face model.
  • An exemplary curve pattern captured in an AVI frame is illustrated in Figure 9. A 2-D .obj representation of the curve pattern and a 3-D .obj representation of the curve pattern, collected using the techniques described above, are provided in the appendix at the end of this detailed description.
  • the "Nodes” section identifies the 12 primary vertices 901-912 where the various curves shown in Figure 9 intersect.
  • the "Segments” section identifies points on the line segments connecting each of the 12 primary vertices. In the example, three points on each line segment are identified.
  • the "Patches” section identifies the extrapolated points within each patch (i.e., extrapolated from the three points on each line segment as described above) followed by "face” data (f) which identifies the 3 vertices for each triangle within the patch.
  • the 3-D data (which follows the 2-D data in the appendix) provides the 3-D coordinates for each point (v), and "face" data (f) identifying three vertices for each triangle in the 3-D mesh.
  • the following is an exemplary hardware platform which may be used for each camera controller:
  • a FireWire Rev. A port to couple each camera controller to each camera it controls.
  • a processor sufficient to record the video stream (e.g., a 2.4 GHz Pentium processor).
  • Random access memory or other high-speed memory sufficient to capture each video stream (e.g., 1 GB Double-Data Rate Synchronous Dynamic RAM).
  • An OS that maximizes the performance characteristics of the system (e.g., Windows XP).
  • each camera controller may be equipped with 120GB of storage space per camera.
  • a SCSI or ATA RAID controller may be used to keep up with the demands of capturing from one or more cameras.
  • 3x 200GB Serial ATA drives are used.
  • each of the camera controllers may be implemented as software executed within a single computer system.
  • the motion capture controller 406 is implemented on a dual 2 GHz G5 Macintosh with 2 GB of RAM and a 200 GB mass storage device.
  • the motion capture controller 406 is not limited to any particular hardware configuration.
  • each camera 404 supports a resolution of 640x480 at 100 frames per second, global shutter, and five cameras are used to provide complete coverage of the face and head of the performer.
  • FireWire-based color cameras utilizing C-mount lenses are employed in one embodiment of the invention.
  • the FireWire connection provides both a data interface and power to each camera.
  • the cameras are running at 100 fps or faster. Resolution may vary, but initial cameras will provide 640x480 sub-pixel resolution, utilizing a 2x2 RGGB mosaic image sensor.
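A quick back-of-the-envelope calculation shows what these camera figures imply for the per-camera data rate. The sketch assumes one byte (8 bits) per mosaic sample and no compression, neither of which is stated in the text; the 120 GB figure is the per-camera storage mentioned for the exemplary camera controller, and 400 Mbit/s is the nominal FireWire A signalling rate.

```python
WIDTH, HEIGHT = 640, 480
FRAMES_PER_SECOND = 100
BYTES_PER_SAMPLE = 1  # assumption: 8-bit raw RGGB mosaic data, one sample per pixel


def raw_bytes_per_second(width=WIDTH, height=HEIGHT, fps=FRAMES_PER_SECOND,
                         bytes_per_sample=BYTES_PER_SAMPLE):
    """Uncompressed data rate of one camera stream, in bytes per second."""
    return width * height * bytes_per_sample * fps


def recording_minutes(storage_bytes, rate_bytes_per_second):
    """How long a stream at the given rate fits into the given storage."""
    return storage_bytes / rate_bytes_per_second / 60


rate = raw_bytes_per_second()
print(f"{rate / 1e6:.2f} MB/s per camera")           # 30.72 MB/s
print(f"{rate * 8 / 1e6:.2f} Mbit/s per camera")      # 245.76 Mbit/s
print(f"{recording_minutes(120e9, rate):.1f} minutes of raw video in 120 GB")
```

Under these assumptions a single raw stream stays below the nominal 400 Mbit/s of a FireWire A bus, which is consistent with dedicating one bus to each camera, and compression would extend the recording time accordingly.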
  • the focus of the camera lenses extends to a 4' cube volume of space to allow the actor some freedom of movement while the capture takes place. Currently, the minimum focus distance used is 5'; the maximum is 9'; and the target distance is 7'.
  • a 16mm lens with a 2/3" image sensor provides an approximately 30 degree angle of view and sufficient depth of field to cover the target area.
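The quoted angle of view can be checked with the usual thin-lens relation, angle = 2·atan(sensor dimension / (2·focal length)). The sketch assumes the nominal 8.8 mm active width commonly quoted for the 2/3" sensor format, which the text does not state, and uses hypothetical function names.

```python
from math import atan, degrees

SENSOR_WIDTH_MM = 8.8   # assumption: nominal active width of a 2/3" format sensor
FOCAL_LENGTH_MM = 16.0  # lens focal length given in the text


def angle_of_view_deg(sensor_dim_mm, focal_length_mm):
    """Angle of view across one sensor dimension, for a lens focused at infinity."""
    return degrees(2 * atan(sensor_dim_mm / (2 * focal_length_mm)))


def coverage_at(distance, sensor_dim_mm, focal_length_mm):
    """Width of the scene covered at `distance` (result in the same unit as distance)."""
    return distance * sensor_dim_mm / focal_length_mm


print(f"{angle_of_view_deg(SENSOR_WIDTH_MM, FOCAL_LENGTH_MM):.2f} degrees")
print(f"{coverage_at(7.0, SENSOR_WIDTH_MM, FOCAL_LENGTH_MM):.2f} ft wide at 7 ft")
```

Under that assumption the horizontal angle comes out a little under 31 degrees, in line with the "approximately 30 degree" figure, and the coverage at a 7' distance is just under 4' wide.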
  • each camera captures video at the same time.
  • Each 1394 bus has its own synchronization signal and all cameras on that bus will sync to it automatically. However, given that there will likely be variance in timing among 1394 buses, the 1394 buses may be synced with each other. An external synchronization device may also be used to synchronize and trigger the cameras.
  • Direct source lighting is sometimes problematic because lines that don't directly face the source are significantly darker. Thus, one embodiment of the invention will utilize dispersed ambient lighting to equalize the return of light between all lines.
  • Figure 7 illustrates one embodiment of a system layout in which five cameras 404 are focused on a 4' cube volume of space 700.
  • the cameras of this embodiment are positioned approximately 7' from the target area of the capture.
  • the cameras are varied along the Z-axis to provide maximum coverage of the target area (where the Z-axis points out of the performer's face towards the camera).
  • Indirect ambient lighting surrounds the target area and produces an even contrast level around the entire capture surface.
  • Embodiments of the invention may include various steps as set forth above. The steps may be embodied in machine-executable instructions which cause a general-purpose or special-purpose processor to perform certain steps.
  • Various elements which are not relevant to the underlying principles of the invention such as computer memory, hard drive, input devices, have been left out of the figures to avoid obscuring the pertinent aspects of the invention.
  • the various functional modules illustrated herein and the associated steps may be performed by specific hardware components that contain hardwired logic for performing the steps, such as an application-specific integrated circuit ("ASIC") or by any combination of programmed computer components and custom hardware components.
  • Elements of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions.
  • the machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, propagation media or other type of machine-readable media suitable for storing electronic instructions.
  • the present invention may be downloaded as a computer program which may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Studio Devices (AREA)

Abstract

A method is described comprising: applying a series of curves on specified regions of a performer’s face; tracking the movement of the series of curves during a motion capture session; and generating motion data representing the movement of the performer’s face using the tracked movement of the series of curves.

Description

APPARATUS AND METHOD FOR CAPTURING THE EXPRESSION OF A PERFORMER
BACKGROUND OF THE INVENTION
Field of the Invention
[0001] This invention relates generally to the field of motion capture. More particularly, the invention relates to an improved apparatus and method for tracking and capturing the motion and/or expression of a performer.
Description of the Related Art
[0002] "Motion capture" refers generally to the tracking and recording of human motion. Motion capture systems are used for a variety of applications including, for example, video games and computer-generated movies. In a typical motion capture session, the motion of a "performer" is captured and translated to a computer-generated character.
[0003] As illustrated in Figure 1, in a motion capture system, a plurality of motion tracking markers 101-116 are attached at various points on a performer's body. The points are selected based on the known limitations of the human skeleton. For example, markers 107 and 114, attached to the performer's knees, represent pivot points for markers 115 and 116, attached to the performer's feet. Similarly, markers 104 and 111, attached to the performer's elbows, represent pivot points for markers 105 and 112, attached to the performer's hands.
[0004] Different types of motion capture systems have been developed over the years. For example, in a "magnetic" motion capture system, the motion markers attached to the performer are active devices that measure their position in a magnetic field enveloping the performer. By contrast, in an optical motion capture system, such as that illustrated in Figure 1, the motion markers 101-116 are comprised of retro-reflective material, i.e., a material which reflects light back in the direction from which it came, ideally over a wide range of angles of incidence. Two or more cameras 120, 121, 122 are positioned to capture the light reflected off of the retro-reflective markers 101-116.
[0005] A motion tracking unit 150 coupled to the cameras is programmed with the relative position of each of the markers 101-116 and the known limitations of the performer's body. For example, if the relationship between markers 107 and 115 is programmed into the motion tracking unit 150, the motion tracking unit 150 will understand that markers 107 and 115 are always a fixed distance apart, and that marker 115 may move relative to marker 107 only within a specified range. These constraints usually allow the motion capture system to distinguish each marker from the others and thereby determine which part of the body each marker's position corresponds to. The markers themselves do not identify any body parts, only their own position and identity. Once the markers are identified individually, the motion capture system is able to determine the position of the markers 101-116 via triangulation between multiple cameras (at least two) that see the same marker. Using this information and the visual data provided from the cameras 120-122, the motion tracking unit 150 generates artificial motion data representing the movement of the performer during the motion capture session.
[0006] A graphics processing unit 152 renders an animated representation of the performer on a computer display 160 (or similar display device) using the motion data. For example, the graphics processing unit 152 may apply the captured motion of the performer to different animated characters and/or include the animated characters in different computer-generated scenes. In one implementation, the motion tracking unit 150 and the graphics processing unit 152 are programmable cards coupled to the bus of a computer (e.g., the PCI and AGP buses found in many personal computers). One well known company which produces motion capture systems is Motion Analysis Corporation (see, e.g., www.motionanalysis.com).
[0007] One problem which exists with current motion capture systems, however, is that when the markers move out of range of the cameras, the motion tracking unit 150 may lose track of the markers. For example, if a performer lies down on the floor on his/her stomach (thereby covering a number of markers), moves around on the floor and then stands back up, the motion tracking unit 150 may not be capable of re-identifying all of the markers.
[0008] As such, after a performance, a significant amount of "clean up" is typically required during which computer programmers or animators manually identify each of the "lost" markers to the motion tracking unit 150, resulting in significant additional production costs.
[0009] In addition, while current motion capture systems are well suited for tracking full body motion, current systems are ill-equipped for tracking the more detailed, expressive movement of a human face. For example, the size of the markers used in current systems allows for only a limited number of markers to be placed on a performer's face, and movement around the performer's lips and eyes, which is small but critical to expression, may be lost by the use of a limited number of markers.
[0010] Accordingly, what is needed is an improved apparatus and method for tracking and capturing the motion and/or expression of a performer.
SUMMARY
[0011] A method is described comprising: applying a series of curves on specified regions of a performer's face; tracking the movement of the series of curves during a motion capture session; and generating motion data representing the movement of the performer's face using the tracked movement of the series of curves.
BRIEF DESCRIPTION OF THE DRAWINGS
[0012] A better understanding of the present invention can be obtained from the following detailed description in conjunction with the drawings, in which:
[0013] FIG. 1 illustrates a prior art motion tracking system for tracking the motion of a performer using retro-reflective markers and cameras.
[0014] FIG. 2 illustrates one embodiment of the invention which employs color coded retro-reflective markers to improve tracking performance.
[0015] FIG. 3 illustrates a portion of a color-coded database employed in one embodiment of the invention.
[0016] FIG. 4 illustrates a method for tracking a performer's facial expressions according to one embodiment of the invention.
[0017] FIGS. 5a-b illustrate an exemplary curve pattern employed in one embodiment of the invention.
[0018] FIG. 6 illustrates a connectivity map employed in one embodiment of the invention.
[0019] FIG. 7 illustrates a camera arrangement in which a plurality of cameras are focused on a specified volume of space.
[0020] FIG. 8 illustrates extrapolation of points within a surface patch used in one embodiment of the invention.
[0021] FIG. 9 illustrates an exemplary series of curves captured and analyzed by the embodiments of the invention described herein.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
[0022] Described below is an improved apparatus and method for capturing still images and video on a data processing device. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. In other instances, well-known structures and devices are shown in block diagram form to avoid obscuring the underlying principles of the invention.
EMBODIMENTS OF THE INVENTION
Color-Coded Motion Capture
[0023] Figure 2 illustrates one embodiment of the invention which tracks the motion of a performer more precisely than prior motion capture systems. As in prior systems, a plurality of retro-reflective markers 201-216 are positioned at various points of the performer's body. Unlike prior systems, however, color coding is applied to the retro-reflective markers 201-216 to enable more effective tracking of the markers. Specifically, as a result of the color coding, each element 201-216 reflects light of a different color (i.e., a different frequency). The different colors may then be used to uniquely identify each individual retro-reflective element.
[0024] In the exemplary embodiment, the motion capture system comprises at least one camera controller 250, a motion capture controller 252 and color coding data 253 of the retro-reflective markers 201-216. In one embodiment, each camera 220-222 may itself include a camera controller (i.e., in lieu of, or in addition to, the camera controller 250 included within the motion capture system 200). In another embodiment, the camera controller may be included within the motion capture controller 252.
[0025] Each camera controller 250 is provided with color coding data 253 identifying the respective colors of each of the retro-reflective markers 201-216. The color coding data 253 may be stored within a database on the motion capture system 200 (along with the position of each of the markers 201-216 on the performer's body and/or the physical relationship between each of the markers). An exemplary portion of the database is illustrated in Figure 3 which shows how a different color may be associated with the position of each retro-reflective element 201-216 on the performer's body (e.g., the color blue is associated with the element on the performer's left knee). As indicated in Figure 3, the colors may be represented by different levels of red ("R"), green ("G") and blue ("B"). However, various different color coding schemes may be employed while still complying with the underlying principles of the invention.
[0026] Using the designated color coding scheme, the camera controller 250 uniquely identifies each individual retro-reflective element. As such, when a group of markers 201-216 moves out of range of the cameras, the camera controller 250 no longer needs to rely on the physical relationship between the markers to identify the markers when they move back in range (as in current motion capture systems). Rather, if a particular color is reflected from an element, the camera controller 250 immediately knows which element the light emanated from based on the color coding scheme. The end result is that the "clean up" process is significantly reduced, or eliminated altogether, resulting in significantly reduced production costs.
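The color lookup described in paragraphs [0025] and [0026] can be sketched as a nearest-color search over the coding table. This is a minimal illustration only: the table entries, the RGB values, the `identify_marker` name and the `max_distance` threshold are all assumptions and are not taken from Figure 3.

```python
# Hypothetical color-coding table. The body positions and RGB levels are
# illustrative assumptions, not the contents of the database in Figure 3.
COLOR_CODING = {
    "left_knee": (0, 0, 255),
    "right_knee": (255, 0, 0),
    "left_elbow": (0, 255, 0),
    "right_elbow": (255, 255, 0),
}


def identify_marker(observed_rgb, table=COLOR_CODING, max_distance=60.0):
    """Return the table entry whose color is nearest to observed_rgb.

    Distance is Euclidean in RGB space. None is returned when no entry
    lies within max_distance, so an ambiguous reflection is not forced
    onto a marker.
    """
    best_name, best_dist = None, float("inf")
    for name, rgb in table.items():
        dist = sum((a - b) ** 2 for a, b in zip(observed_rgb, rgb)) ** 0.5
        if dist < best_dist:
            best_name, best_dist = name, dist
    return best_name if best_dist <= max_distance else None
```

A reflection that is close to pure blue resolves to the left-knee entry, while a neutral gray falls outside the threshold and is left unidentified.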
[0027] In one embodiment, the number of colors used is less than the total number of retro-reflective markers 201-216. That is, the same color (or similar colors) may be used for two or more retro-reflective markers 201-216. Accordingly, to distinguish between markers of the same (or similar) colors, the camera controller 250 may also factor in the physical relationship between each of the markers to improve accuracy as in prior systems. This information may be useful, for example, if a significant number of retro-reflective markers are used, resulting in colors which are too similar to accurately differentiate. In addition, from a practical standpoint, it may be easier to work with retro-reflective markers of a limited number of colors. Given that the camera controller 250 may be programmed with the relationship between each of the retro-reflective markers 201-216, a color-coding scheme of even a few colors will improve accuracy significantly.
[0028] In one embodiment, each of the plurality of cameras 220-222 supports a resolution of 640X480 pixels at 100 frames per second and video is captured in the form of a stream of bitmap images. However, any video format may be employed while still complying with the underlying principles of the invention. In one embodiment, the cameras are coupled to the camera controller 250 via an IEEE-1394 ("FireWire") port such as an IEEE-1394A ("FireWire A") port. Alternatively, the cameras may be coupled via IEEE-1394B ("FireWire B"), Universal Serial Bus 2.0 ("USB 2.0"), or an IEEE-802.11 wireless channel. It should be noted, however, that the underlying principles of the present invention are not limited to any particular communication standard.
[0029] An exemplary architecture of the camera controller 250 includes a FireWire A bus for each controlled camera 220-222, a processor sufficient to record the video stream from each controlled camera 220-222, Random Access Memory ("RAM") sufficient to capture the video stream from the cameras 220-222, and storage sufficient to store several (e.g., two) hours of captured video per camera 220-222. By way of example, the camera controller 250 may include a 2.4 GHz Intel Pentium® processor, 1 GB of RAM, 3 Serial ATA 200 GB hard drives, and Microsoft Windows XP®. In another embodiment, the camera controller 250 and the motion capture controller 252 are programmable cards coupled to the bus of a computer (e.g., a PCI/AGP bus). However, as described below, the underlying principles of the invention are not limited to any particular hardware or software architecture. The camera controller 250 may also compress the video using one or more digital video compression formats (e.g., MPEG-4, Real Video 8, AVI, etc.).
[0030] In one embodiment, the cameras 220-222 are frame-synchronized for capturing video. Synchronization may be performed by a separate synchronization unit (not shown) communicatively connected to each camera 220-222. Alternatively, synchronization may be performed through FireWire (e.g., with each FireWire bus providing a synchronization signal to each camera). By frame-synchronizing the cameras, the data captured by each camera will be at roughly the same moment in time. So, if the performer (and the markers attached to the performer) is in the process of a rapid motion, there will be less discrepancy between the measurements of each marker made by each camera in a given frame time, and a more accurate position in space will be measured when the captured marker positions are triangulated.
[0031] In one embodiment, the camera controller 250 is communicatively connected to a motion capture controller 252 through a Category 6 Ethernet cable. Other embodiments of the connection include, but are not limited to, FireWire, USB 2.0, and IEEE 802.11 wireless connection. An exemplary architecture of a motion capture controller comprises a processor and volatile memory sufficient to process collected data from the camera controller 250 and sufficient storage to store the processed data. One specific example of an architecture is a dual two-gigahertz G5 Power Macintosh®, two gigabytes of Random Access Memory ("RAM") and a two hundred gigabyte hard drive. In another embodiment, the camera controller 250 and the motion capture controller 252 are programmable cards coupled to the bus of a computer (e.g., a PCI/AGP bus), or may be implemented as software executed on a single computer. However, as described below, the underlying principles of the invention are not limited to any particular hardware or software architecture.
[0032] In one embodiment, the motion capture controller 252 uses the motion data captured by the camera controller to generate 3-D motion data representing the motion of the performer during a performance. The 3-D representation may be used, for example, to render a graphical animation of a character on a computer display 260 (or similar display device). By way of example, the motion capture controller 252 may include the animated character in different computer-generated scenes. The motion capture controller 252 may store the 3-D motion data in a file (e.g., a .obj file) which may subsequently be used to reconstruct the motion of the performer.
High-Precision Motion Capture
[0033] As mentioned above, current motion capture systems lack the precision necessary for capturing low-level, detailed movement. For example, to capture the facial expressions of a performer, current systems rely on the same general techniques as those described above for full body motion, resulting in a "point cloud" (i.e. a locus of points in 3D space) of markers positioned close together on the face of the performer. Because they are positioned so close together, however, it is difficult for current motion capture systems to differentiate each of the markers during a performance, particularly during a dramatic change in the performer's expression (e.g., when the performer suddenly laughs or sneezes).
[0034] To improve accuracy, the same general type of color-coding techniques described above may be employed. For example, the "point cloud" may be comprised of color-coded retro-reflective markers, each of which may be uniquely identified by a motion tracking unit 250 based on color and/or relative position.
[0035] Another problem with current motion capture systems is that the number of markers on the face is limited. Thus, not enough points exist for sensitive and critical movements (e.g., movement around the mouth and eyes) to make a faithful recreation of the performer's face.
[0036] A further problem is that markers on the face can interfere with the performer's performance or with its capture. For example, markers on the lips may get in the way of natural lip motion in speech, or if an expression results in a lip being curled into the mouth, a marker may become completely obscured from all the motion capture cameras.
[0037] To solve the foregoing problems, in one embodiment of the invention, a series of reflective curves are painted on the performer's face and the displacement of the series of curves is tracked over time. By analyzing curves instead of discrete data points, the system is able to generate significantly more surface data than traditional marker-based tracking systems. Although a series of reflective "curves" are painted on the performer's face in the embodiments of the invention described below, the underlying principles of the invention may also be implemented using a variety of other types of facial markings (e.g., using a grid of horizontal and vertical lines deformed over the performer's face).
[0038] Figure 4 illustrates one embodiment of a motion tracking system for performing the foregoing operations. In this embodiment, a predefined facial curve pattern 401 is adjusted to fit the topology of each performer's face 402. In one embodiment, the three-dimensional (3-D) curve pattern is adjusted based on a 3-D map of the topology of the performer's face captured using a 3-D scanning system. The scan may be performed, for example, using a 3-D scanning system such as those available from Cyberware® (e.g., using the Cyberware® Color 3-D Scanner, Model 3030RGB/PS). A unique facial curve pattern 401 may then be created using the scanned 3-D facial topology. In one embodiment, the performer will be asked to provide a "neutral" expression during the scanning process.
[0039] In one embodiment, the curves defined by the curve pattern 401 are painted on the face of the performer using retro-reflective, non-toxic paint or theatrical makeup with colors corresponding to the colors shown in Figures 5a-b. In another embodiment the performer's face is first painted with a solid contrasting color (e.g. black) to the lines that are subsequently painted. In yet another embodiment, paints that glow under special illumination (e.g. so-called "black lights") are used so as to be distinctly delineated when so illuminated. In one embodiment, to accurately apply the curve pattern, a physical 3-D mask is created with slits/holes corresponding to the curves defined by the curve pattern. The 3-D mask may then be placed over the face of the performer to apply the paint. In one embodiment, the 3-D mask is generated by providing the scanned topology of the user's face to a 3-D printer.
[0040] Rather than printing a custom mask to apply the set of curves, a preexisting mask may be used. Features of the mask may be aligned and stretched to features of the performer (e.g., the nose holes of the mask fit over the nose holes of the performer, the mouth area of the mask fits over the mouth of the performer, the eye holes of the mask fit over the eye sockets of the performer, etc). In an alternate embodiment, a projection (e.g., a projection of light) onto the performer's face may serve as a guide for painting the curve pattern.
[0041] In an alternate embodiment, the 3-D curve pattern may be manually adjusted to the face of the performer (e.g., by a makeup artist). Once a particular curve pattern is selected, curves may be placed on a given performer in the same locations each time they are applied using, for example, a projector or a stencil.
[0042] Figure 5a illustrates an exemplary curve pattern, flattened into a 2D image, and Figure 5b illustrates the curve pattern applied to an exemplary performer's face in 3D. The curve pattern is designed to meet the visual requirements of the optical capture system while still representing a configuration of surface patches and/or polygons that lends itself to good quality facial deformation. In areas of high deformation, short lines with many intersections help achieve higher resolution. In areas of low deformation, long lines with few intersections may suffice.
[0043] As indicated in Figure 5a, in one embodiment, each curve has a unique identifying name and/or number (to support systematic data processing) and a color that can be easily identified by the optical capture system. Three different curve colors are associated with three different possible facial curve types:
(1) "Contours" generally form concentric loops around the mouth and eyes. Contours are colored red in Figures 5a-b (e.g., lines 100-107; 300-301; 400-402; and 1400-1402).
(2) "Radials" generally issue outward from the mouth and eyes in spoke-like patterns. Radials are colored green in Figures 5a-b (e.g., lines 500-508; 600-604; 1000-1001; 1500-1507; 1600-1604; and 2000-2001).
(3) "Transition" curves are neither clearly contours nor radials. Transition curves are colored blue in Figures 5a-b (e.g., lines 700-701; 900; 1700-1701; 1900; and 3002-3004).
[0044] In one embodiment, no curve can intersect another curve of the same color (or type). Another defined property of the curve pattern is that each polygon and/or surface patch created by the curves must be a quadrilateral. The above list of properties is not necessarily exhaustive, and not all of the above-listed properties need to be followed in generating the curve pattern 401.
[0045] Once the curve pattern is applied, in one embodiment, the curve pattern is tracked by a motion capture processing system 410 comprised of one or more camera controllers 405 and a central motion capture controller 406 during the course of a performance. In one embodiment, each of the camera controllers 405 and central motion capture controller 406 is implemented using a separate computer system. Alternatively, the camera controllers and motion capture controller may be implemented as software executed on a single computer system or as any combination of hardware and software.
[0046] In one embodiment, each of the camera controllers 405 and/or the motion capture controller 406 is programmed with data 403 representing the curve pattern 401. The motion capture system 410 uses this information to trace the movement of each curve within the curve pattern during a performance. For example, the performer's facial expressions provided by each of the cameras 404 (e.g., as bitmap images) are analyzed and the curves identified using the defined curve pattern.
[0047] In one embodiment, the curve data 403 is provided to the motion capture system in the form of a "connectivity map," an example of which is illustrated in Figure 6. The connectivity map is a text file representation of the curve pattern 401 which includes a list of all curves in the pattern and a list of all surface patches in the pattern, with each patch defined by its bounding curves. It is used by the camera controllers 405 and/or the central motion capture controller 406 to identify curves and intersections in the optically captured data. This, in turn, allows point data from the curves to be organized into surface patches and ultimately the triangulated mesh of a final 3-D geometry 407.
[0048] In one embodiment, the connectivity map includes the following four sections:
(1) A single command to set the level of subdivision for all curves (identified as "Section 0" in Figure 6). This determines how many polygonal faces will be created between intersections along each curve.
(2) A list of all curves organized by type (contour, radial or transition), with each curve having a unique name and/or number and a color that match the curve type (identified as "Section 1" in Figure 6).
(3) For each curve, an ordered list of other curves that it intersects along its length (identified as "Section 2" in Figure 6).
(4) A list of all surface patches, each defined by the curves that make up its sides (identified as "Section 3" in Figure 6).
[0049] In one embodiment, the connectivity map is stored as an extended .obj file (such as the .obj files supported by certain 3D modeling software packages, such as Maya, by Alias Systems Corp.), with the section data described above appearing as comments. Alternatively, the connectivity map may be stored as an .obj file without the extensions referred to in the previous sentence.
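The four sections of the connectivity map, together with the two pattern properties stated in paragraph [0044], can be sketched as a small in-memory structure with a validity check. This is an illustration under assumptions: the `ConnectivityMap` and `validate` names, the field layout, and the specific intersections in the usage note are hypothetical, and the sketch does not reproduce the text file format shown in Figure 6.

```python
from dataclasses import dataclass, field


@dataclass
class ConnectivityMap:
    """Hypothetical in-memory form of the four connectivity map sections."""
    subdivision: int                                    # "Section 0"
    curves: dict = field(default_factory=dict)          # "Section 1": curve id -> type
    intersections: dict = field(default_factory=dict)   # "Section 2": curve id -> ordered ids
    patches: list = field(default_factory=list)         # "Section 3": bounding curve ids


def validate(cmap):
    """Return a list of rule violations (an empty list means none found).

    Checks the two properties named in the text: no curve intersects a
    curve of the same type, and every surface patch is a quadrilateral
    (taken here to mean it is bounded by exactly four curves).
    """
    problems = []
    for curve, crossed in cmap.intersections.items():
        for other in crossed:
            if curve not in cmap.curves or other not in cmap.curves:
                problems.append(f"unknown curve in intersection {curve}/{other}")
            elif cmap.curves[curve] == cmap.curves[other]:
                problems.append(f"curve {curve} intersects same-type curve {other}")
    for index, patch in enumerate(cmap.patches):
        if len(patch) != 4:
            problems.append(f"patch {index} has {len(patch)} sides, expected 4")
    return problems


# Illustrative data: two contours crossed by two radials, forming one patch.
example = ConnectivityMap(
    subdivision=4,
    curves={100: "contour", 101: "contour", 500: "radial", 501: "radial"},
    intersections={100: [500, 501], 101: [500, 501],
                   500: [100, 101], 501: [100, 101]},
    patches=[(100, 501, 101, 500)],
)
```

Calling `validate(example)` yields no violations; adding a three-sided patch or a contour-to-contour intersection produces one message per broken rule.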
[0050] In one embodiment, the motion capture system 410 performs multiple levels of motion capture processing. Each camera controller is responsible for capturing video provided from one or more cameras 404, storing it to disk, and performing the first portion of the motion capture processing under the control of the motion capture controller 406. In one embodiment, a single command from the motion capture controller 406 may be generated to instruct all camera controllers to start or stop a capture session, thereby allowing for frame-synchronized captures when combined with an external synchronization trigger.
[0051] Once a capture is initiated, each camera controller 405 captures video streams and stores the streams to a storage device (e.g., a hard drive) for subsequent processing. In one embodiment, the streams are stored in an Audio Video Interleave ("AVI") format, although various other formats may be used.
[0052] In one embodiment, each camera controller performs the following operations for each frame of captured AVI video. First, each of the images is visually optimized and cleaned so that curves may be easily identified apart from background noise. In one embodiment, the contrast is increased between any background images/noise and the curve pattern. In addition, color balance adjustments may be applied so that the relative balances of red, green and blue are accurate. Various other image processing techniques may be applied to the image prior to identifying each of the curves.
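As a rough illustration of this clean-up step, the sketch below stretches contrast linearly and then labels a pixel by its dominant color channel, using the red/green/blue curve types defined earlier. The function names and the `min_level` and `dominance` thresholds are assumptions chosen for the example; the text does not specify which image processing techniques are actually used.

```python
def stretch_contrast(values):
    """Linearly rescale a list of intensities to the full 0-255 range."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    return [round(255 * (v - lo) / (hi - lo)) for v in values]


def classify_pixel(rgb, min_level=96, dominance=1.5):
    """Label a pixel by curve type, or return None for background.

    A pixel counts as part of a curve only if its strongest channel is
    at least min_level and at least `dominance` times the next-strongest
    channel. Red maps to contour, green to radial, blue to transition.
    """
    names = ("contour", "radial", "transition")
    top = max(range(3), key=lambda i: rgb[i])
    others = [rgb[i] for i in range(3) if i != top]
    if rgb[top] < min_level or rgb[top] < dominance * max(others):
        return None
    return names[top]
```

A strongly red pixel such as `(200, 40, 30)` is labeled a contour, while dark or weakly saturated pixels are treated as background.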
[0053] After the images are processed, the curves are mathematically located from within the images. The intersection points of each of the curves are also located. The mesh definition in the connectivity map is then used to identify the curves in each of the images. In one embodiment, this is accomplished by correlating the captured images with the curve data provided in the connectivity map. Once the curves and intersection points are identified, curve data is quantized into line segments to support the final desired polygonal resolution. The resulting intersection points of the lines are then used as the vertices of planar triangles that make up the output geometric mesh.
[0054] By way of example, Figure 8 illustrates a surface patch defined by four intersection points 801-804. In one embodiment, to quantize the curve data into line segments, a series of points are identified along each of the curves, such as point 810 on the curve defined by intersection points 801 and 803; point 811 on the curve defined by intersection points 802 and 804; point 812 on the curve defined by intersection points 801 and 802; and point 813 on the curve defined by intersection points 803 and 804. In the example shown in Figure 8, three points are identified on each of the curves. It should be noted, however, that more or fewer points may be identified on each curve while still complying with the underlying principles of the invention (e.g., depending on the desired resolution of the system).
[0055] In one embodiment, to extrapolate points within the surface patch, once the points on each of the curves are identified, they are logically interconnected to form lines which intersect one another, as illustrated in Figure 8. The intersection points of each of the lines are identified (e.g., point 820) and all of the points are used to define the vertices of a series of adjacent triangles within the surface patch (a technique referred to as "tessellation"). Two such triangles, 830 and 831, are identified in Figure 8.
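The patch tessellation described for Figure 8 can be sketched in 2-D as follows. This is a simplified illustration: it assumes the four bounding curves are straight segments sampled at evenly spaced points, whereas the captured curves would be sampled from the image. The `tessellate` name, the corner ordering and the triangle winding are likewise assumptions.

```python
def lerp(p, q, t):
    """Point at fraction t of the way from p to q."""
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))


def intersect(a, b, c, d):
    """Intersection of line a-b with line c-d (2-D, assumed non-parallel)."""
    r = (b[0] - a[0], b[1] - a[1])
    s = (d[0] - c[0], d[1] - c[1])
    denom = r[0] * s[1] - r[1] * s[0]
    t = ((c[0] - a[0]) * s[1] - (c[1] - a[1]) * s[0]) / denom
    return (a[0] + t * r[0], a[1] + t * r[1])


def tessellate(p801, p802, p803, p804, n=3):
    """Return (vertices, triangles) for one patch with n points per edge.

    Edges follow Figure 8: 801-802 (top), 803-804 (bottom),
    801-803 (left) and 802-804 (right). Interior vertices are the
    intersections of lines joining opposite edge points.
    """
    ts = [k / (n + 1) for k in range(n + 2)]      # includes both corners
    top = [lerp(p801, p802, t) for t in ts]
    bottom = [lerp(p803, p804, t) for t in ts]
    left = [lerp(p801, p803, t) for t in ts]
    right = [lerp(p802, p804, t) for t in ts]
    size = n + 2
    vertices = []
    for j in range(size):
        for i in range(size):
            if j == 0:
                vertices.append(top[i])
            elif j == size - 1:
                vertices.append(bottom[i])
            elif i == 0:
                vertices.append(left[j])
            elif i == size - 1:
                vertices.append(right[j])
            else:
                vertices.append(intersect(top[i], bottom[i], left[j], right[j]))
    triangles = []
    for j in range(size - 1):
        for i in range(size - 1):
            v = j * size + i                       # upper-left vertex of the cell
            triangles.append((v, v + 1, v + size))
            triangles.append((v + 1, v + size + 1, v + size))
    return vertices, triangles
```

With three points per edge, as in Figure 8, the patch yields a 5 x 5 grid of 25 vertices and 32 triangles.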
[0056] The data collected in the foregoing manner is stored in a 2-D curve file. Each camera controller generates a separate 2-D curve file containing 2-D data collected from the unique perspective of its camera. In one embodiment, the 2-D curve file is an .obj file (e.g., with all Z coordinates set to zero). However, the underlying principles of the invention are not limited to any particular file format.
[0057] The 2-D curve files are provided to the central motion capture controller 406 which uses the data within the 2-D curve files to generate a 3-D representation of each of the curves and vertices. That is, using the location of the 2-D curves and vertices provided from different perspectives, the central motion capture controller generates full 3-D data (i.e., including Z values), for each of the curves/vertices. In one embodiment, the central motion capture controller stores the 3-D data within a single .obj file. Once again, however, various alternate file formats may be used.
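One common way to recover a 3-D point from two views is to intersect the viewing rays, taking the midpoint of their closest approach when noise keeps them from meeting exactly. The sketch below shows that technique only as an illustration. The text does not say which triangulation method the central motion capture controller uses, and the sketch assumes each 2-D observation has already been converted into a ray (origin and direction) through camera calibration, which is not described here.

```python
def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _add(a, b):
    return tuple(x + y for x, y in zip(a, b))


def _scale(a, k):
    return tuple(x * k for x in a)


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def triangulate(o1, d1, o2, d2):
    """Midpoint of the shortest segment between two viewing rays.

    o1 and o2 are camera centers; d1 and d2 are ray directions toward
    the observed vertex. Raises ValueError for (near-)parallel rays.
    """
    w = _sub(o1, o2)
    a, b, c = _dot(d1, d1), _dot(d1, d2), _dot(d2, d2)
    d, e = _dot(d1, w), _dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are parallel")
    s = (b * e - c * d) / denom      # parameter along ray 1
    t = (a * e - b * d) / denom      # parameter along ray 2
    p1 = _add(o1, _scale(d1, s))
    p2 = _add(o2, _scale(d2, t))
    return _scale(_add(p1, p2), 0.5)
```

For two cameras one unit to either side of the origin, both looking at a point seven units away on the Z-axis, `triangulate((-1, 0, 0), (1, 0, 7), (1, 0, 0), (-1, 0, 7))` recovers that point.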
[0058] The end result is a single geometric mesh definition per frame of capture. This geometric mesh is a close approximation of the surface of the face at each frame of capture, and when viewed in succession, the sequence of meshes provides a close approximation of the motion of the face. In one embodiment, in order to maintain texture coordinates on face geometry throughout an animation sequence, only a single reference frame is used to generate the 3D mesh. All subsequent motion frames will then use the location information of the points of each curve to reposition the vertices of the face model.
[0059] An exemplary curve pattern captured in an AVI frame is illustrated in Figure 9. A 2-D .obj representation of the curve pattern and a 3-D .obj representation of the curve pattern, collected using the techniques described above, are provided in the appendix at the end of this detailed description.
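For readers unfamiliar with the .obj layout, the following sketch reads the vertex (`v`) and face (`f`) records of such a file. It is a minimal reader written for illustration: the sample text is invented and is not the appendix data, and only positive face indices are handled (the .obj format also permits negative, relative indices).

```python
def parse_obj(text):
    """Return (vertices, faces) from .obj text.

    Vertices are (x, y, z) float tuples. Faces are tuples of 0-based
    vertex indices; .obj files store them 1-based, optionally with
    /texture/normal suffixes, which are discarded here.
    """
    vertices, faces = [], []
    for line in text.splitlines():
        parts = line.split()
        if not parts or parts[0].startswith("#"):
            continue                              # blank line or comment
        if parts[0] == "v":
            vertices.append(tuple(float(x) for x in parts[1:4]))
        elif parts[0] == "f":
            faces.append(tuple(int(p.split("/")[0]) - 1 for p in parts[1:]))
    return vertices, faces


# Invented sample: one triangle in a 2-D curve file (all Z values zero).
SAMPLE = """# illustrative data only
v 0.0 0.0 0.0
v 1.0 0.0 0.0
v 0.0 1.0 0.0
f 1 2 3
"""
```

Parsing `SAMPLE` gives three vertices and a single face referring to them by index.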
[0060] Those of ordinary skill in the art will readily understand the data contained within each of the sections of the 2-D and 3-D .obj files. Briefly, starting with the 2-D curve data, the "Nodes" section identifies the 12 primary vertices 901-912 where the various curves shown in Figure 9 intersect. The "Segments" section identifies points on the line segments connecting each of the 12 primary vertices. In the example, three points on each line segment are identified. The "Patches" section identifies the extrapolated points within each patch (i.e., extrapolated from the three points on each line segment as described above) followed by "face" data (f) which identifies the 3 vertices for each triangle within the patch. [0061] The 3-D data (which follows the 2-D data in the appendix) provides the 3-D coordinates for each point (v), and "face" data (f) identifying three vertices for each triangle in the 3-D mesh. [0062] The following is an exemplary hardware platform which may be used for each camera controller:
• A FireWire Rev. A port to couple each camera controller to each camera it controls.
• An RJ45 1000Base-T Gigabit Ethernet port for communication with the central motion capture controller.
• A processor sufficient to record the video stream (e.g., a 2.4 GHz Pentium processor)
• Random access memory or other high-speed memory sufficient to capture each video stream (e.g., 1GB Double-Data Rate Synchronous Dynamic RAM)
• An OS that maximizes the performance characteristics of the system (e.g., Windows XP).
• Permanent storage sufficient to store two or more hours of captured video per camera controlled. At a rate of 30MB/sec, each camera controller may be equipped with 120GB of storage space per camera. A SCSI or ATA RAID controller may be used to keep up with the demands of capturing from one or more cameras. In another embodiment, 3x 200GB Serial ATA drives are used.
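The 30MB/sec figure in the list above is consistent with uncompressed 640x480 video at 100 frames per second if one byte is stored per pixel. The bytes-per-pixel value is an assumption made here for illustration (the text does not state the pixel depth), and the helper names below are hypothetical. The sketch shows the arithmetic for the data rate and for sizing storage to a given recording length.

```python
def raw_video_rate(width: int, height: int, fps: int,
                   bytes_per_pixel: int = 1) -> int:
    """Uncompressed video data rate in bytes per second.

    bytes_per_pixel=1 is an assumption (e.g., raw mosaic sensor data).
    """
    return width * height * bytes_per_pixel * fps


def storage_bytes(rate_bytes_per_sec: int, seconds: int) -> int:
    """Storage needed to record at the given rate for the given duration."""
    return rate_bytes_per_sec * seconds


rate = raw_video_rate(640, 480, 100)           # bytes per second, one camera
one_hour = storage_bytes(30_000_000, 60 * 60)  # at the quoted 30MB/sec
```

The same helpers can be used to size the drives for whatever session length and camera count are planned.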
[0063] The foregoing details are provided merely for the purpose of illustration. The underlying principles of the invention are not limited to any particular hardware or software platform. For example, as mentioned above, each of the camera controllers may be implemented as software executed within a single computer system.
[0064] In one embodiment, the motion capture controller 406 is implemented on a dual 2 GHz G5 Macintosh with 2 GB of RAM and a 200GB mass storage device. However, the motion capture controller 406 is not limited to any particular hardware configuration.
[0065] As mentioned above, in one embodiment, each camera 404 supports a resolution of 640x480 at 100 frames per second with a global shutter, and five cameras are used to provide complete coverage of the face and head of the performer. FireWire-based color cameras utilizing C-mount lenses are employed in one embodiment of the invention. The FireWire connection provides both a data interface and power to each camera. In one embodiment, the cameras are running at 100 fps or faster. Resolution may vary, but initial cameras will provide 640x480 sub-pixel resolution, utilizing a 2x2 RGGB mosaic image sensor.
[0066] In one embodiment, the focus of the camera lenses extends to a 4' cube volume of space to allow the actor some freedom of movement while the capture takes place. Currently, the minimum focus distance used is 5'; the maximum is 9'; and the target distance is 7'. A 16mm lens with a 2/3" image sensor provides an approximately 30 degree angle of view and sufficient depth of field to cover the target area.
[0067] In one embodiment, each camera captures video at the same time. Each 1394 bus has its own synchronization signal and all cameras on that bus will sync to it automatically. However, given that there will likely be variance in timing among the 1394 buses, the 1394 buses may be synced with each other. An external synchronization device may also be used to synchronize and trigger the cameras.
[0068] Direct source lighting is sometimes problematic because lines that don't directly face the source are significantly darker. Thus, one embodiment of the invention will utilize dispersed ambient lighting to equalize the return of light between all lines.
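The roughly 30 degree angle of view quoted in paragraph [0066] can be checked with the usual thin-lens relation, angle = 2 * atan(sensor dimension / (2 * focal length)). The sketch below assumes a nominal 8.8 mm active width for a 2/3" sensor; that width is a commonly quoted figure and is not taken from the text, and the function names are hypothetical.

```python
import math


def angle_of_view_deg(sensor_dim_mm: float, focal_length_mm: float) -> float:
    """Angle of view, in degrees, across one sensor dimension."""
    return math.degrees(2.0 * math.atan(sensor_dim_mm / (2.0 * focal_length_mm)))


def field_width(distance: float, sensor_dim_mm: float,
                focal_length_mm: float) -> float:
    """Width of the scene covered at a given distance (same unit as distance)."""
    return distance * sensor_dim_mm / focal_length_mm


SENSOR_WIDTH_MM = 8.8   # assumed nominal width of a 2/3" sensor
FOCAL_LENGTH_MM = 16.0  # lens focal length given in the text

horizontal_angle = angle_of_view_deg(SENSOR_WIDTH_MM, FOCAL_LENGTH_MM)
width_at_target = field_width(7.0, SENSOR_WIDTH_MM, FOCAL_LENGTH_MM)  # feet, at 7'
```

Under these assumptions the horizontal angle comes out slightly under 31 degrees, and the covered width at the 7' target distance is a little under 4', which is in line with the 4' capture volume described above.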
[0069] Figure 7 illustrates one embodiment of a system layout in which five cameras 404 are focused on a 4' cube volume of space 700. The cameras of this embodiment are positioned approximately 7' from the target area of the capture. The cameras are varied along the Z-axis to provide maximum coverage of the target area (where the Z-axis points out of the performer's face towards the camera). Indirect ambient lighting surrounds the target area and produces an even contrast level around the entire capture surface.
[0070] Embodiments of the invention may include various steps as set forth above. The steps may be embodied in machine-executable instructions which cause a general-purpose or special-purpose processor to perform certain steps. Various elements which are not relevant to the underlying principles of the invention, such as computer memory, hard drives and input devices, have been left out of the figures to avoid obscuring the pertinent aspects of the invention.
[0071] Alternatively, in one embodiment, the various functional modules illustrated herein and the associated steps may be performed by specific hardware components that contain hardwired logic for performing the steps, such as an application-specific integrated circuit ("ASIC") or by any combination of programmed computer components and custom hardware components.
[0072] Elements of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions. The machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, propagation media or other type of machine-readable media suitable for storing electronic instructions. For example, the present invention may be downloaded as a computer program which may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
[0073] Throughout the foregoing description, for the purposes of explanation, numerous specific details were set forth in order to provide a thorough understanding of the present system and method. It will be apparent, however, to one skilled in the art that the system and method may be practiced without some of these specific details. For example, while the embodiments of the invention set forth above employ an .obj representation of the 2-D and 3-D data, various other file types may be used while still complying with the underlying principles of the invention.
[0074] Accordingly, the scope and spirit of the present invention should be judged in terms of the claims which follow.
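As a reading aid for the appendix that follows: the face lists in each patch appear to follow a regular pattern. Each patch can be viewed as a 5x5 grid of vertex indices (four corner nodes, three points on each bounding segment, and nine interior points), and every grid cell is split into two triangles. The sketch below reproduces that pattern for the first patch. The grid layout is inferred from the appendix listing rather than stated in the description, and the function name is hypothetical.

```python
from typing import List, Sequence, Tuple


def triangulate_patch(grid: Sequence[Sequence[int]]) -> List[Tuple[int, int, int]]:
    """Split every cell of a rectangular index grid into two triangles."""
    faces = []
    for r in range(len(grid) - 1):
        for c in range(len(grid[r]) - 1):
            a = grid[r][c]              # top-left
            b = grid[r][c + 1]          # top-right
            cc = grid[r + 1][c + 1]     # bottom-right
            d = grid[r + 1][c]          # bottom-left
            faces.append((a, b, cc))
            faces.append((cc, d, a))
    return faces


# Vertex indices of the first patch (ref 1 2 4 3), arranged as a grid:
# top row is the segment from node 1 to node 2, bottom row from node 3 to node 4.
PATCH_1 = [
    [1, 13, 14, 15, 2],
    [16, 67, 68, 69, 22],
    [17, 70, 71, 72, 23],
    [18, 73, 74, 75, 24],
    [3, 28, 29, 30, 4],
]

patch_1_faces = triangulate_patch(PATCH_1)
```

A 5x5 grid has 16 cells, so each patch contributes 32 triangles, and the six patches together account for the 192 face records in the 3-D file.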
APPENDIX
2-D Curve .OBJ File:
# QCap 2d intermediate file
# configuration 6PatchTest/Configs/Capture3anim.cfg
# camera config 6PatchTest/Configs/Camera3.cfg
# qbj reference 6PatchTest/test6Patches.qbj
# capture avi: 6PatchTest/727Capture/anim/cam3/anim3.avi
# graph obj 6PatchTest/Outputs/graph2dcam3anim
# num divs 3
# frame number 1 #
# CamPos -0.07879126 +5.26676273 +28.06798553
# Nodes
v -1.97392690 -1.07431972 +0.00000000 # 1
v -1.38112295 -1.10586524 +0.00000000 # 2
v -1.94250190 -0.04210759 +0.00000000 # 3
v -1.46626675 -0.32032418 +0.00000000 # 4
v -1.01778662 +0.78737569 +0.00000000 # 5
v -0.90682340 +0.16821600 +0.00000000 # 6
v +0.09229846 +0.20500717 +0.00000000 # 7
v -0.46925020 -0.26685110 +0.00000000 # 8
v +0.12524356 -1.21420789 +0.00000000 # 9
v -0.50753808 -0.98335826 +0.00000000 # 10
v -0.90752012 -2.08183002 +0.00000000 # 11
v -0.93476307 -1.40090668 +0.00000000 # 12
# Segments
# ref node 1 to 2
v -1.88817441 -1.07677734 +0.00000000 # 13
v -1.73365057 -1.08120596 +0.00000000 # 14
v -1.57832527 -1.09207094 +0.00000000 # 15
# ref node 1 to 3
v -2.02279949 -0.91511810 +0.00000000 # 16
v -2.01127958 -0.56780803 +0.00000000 # 17
v -2.01818895 -0.18221754 +0.00000000 # 18
# ref node 1 to 11
v -1.89252400 -1.33948851 +0.00000000 # 19
v -1.53722715 -1.72880912 +0.00000000 # 20
v -1.14227486 -2.06838703 +0.00000000 # 21
# ref node 2 to 4
v -1.46942806 -0.88874239 +0.00000000 # 22
v -1.47398293 -0.64232862 +0.00000000 # 23
v -1.48249972 -0.40496421 +0.00000000 # 24
# ref node 2 to 12
v -1.35785246 -1.16308212 +0.00000000 # 25
v -1.19748890 -1.28624499 +0.00000000 # 26
v -1.05077696 -1.41390395 +0.00000000 # 27
# ref node 3 to 4
v -1.87016106 -0.08351994 +0.00000000 # 28
v -1.73302794 -0.16202335 +0.00000000 # 29
v -1.60396945 -0.23860896 +0.00000000 # 30
# ref node 3 to 5
v -1.82844138 +0.16903846 +0.00000000 # 31
v -1.54463696 +0.48444188 +0.00000000 # 32
v -1.22927189 +0.76298791 +0.00000000 # 33
# ref node 4 to 6
v -1.43346179 -0.14927626 +0.00000000 # 34
v -1.26640391 +0.02475371 +0.00000000 # 35
v -1.10597372 +0.18744487 +0.00000000 # 36
# ref node 5 to 6
v -0.99989671 +0.69490540 +0.00000000 # 37
v -0.96908879 +0.53566402 +0.00000000 # 38
v -0.94003153 +0.36418778 +0.00000000 # 39
# ref node 5 to 7
v -0.79045194 +0.81359112 +0.00000000 # 40
v -0.38208273 +0.60620642 +0.00000000 # 41
v +0.01468797 +0.35845342 +0.00000000 # 42
# ref node 6 to 8
v -0.86345023 +0.16402812 +0.00000000 # 43
v -0.69971013 +0.04638975 +0.00000000 # 44
v -0.52908832 -0.08177856 +0.00000000 # 45
# ref node 7 to 8
v -0.03782166 +0.10093655 +0.00000000 # 46
v -0.18275049 -0.01497812 +0.00000000 # 47
v -0.31062376 -0.12739645 +0.00000000 # 48
# ref node 7 to 9
v +0.19888037 -0.00571947 +0.00000000 # 49
v +0.19490257 -0.44645983 +0.00000000 # 50
v +0.20095029 -0.96389931 +0.00000000 # 51
# ref node 8 to 10
v -0.44282863 -0.34857011 +0.00000000 # 52
v -0.44731554 -0.58253217 +0.00000000 # 53
v -0.45390007 -0.81122655 +0.00000000 # 54
# ref node 9 to 10
v +0.02093080 -1.16445100 +0.00000000 # 55
v -0.14491892 -1.08534122 +0.00000000 # 56
v -0.31961635 -1.03620934 +0.00000000 # 57
# ref node 9 to 11
v +0.05893474 -1.43344414 +0.00000000 # 58
v -0.29023099 -1.77611458 +0.00000000 # 59
v -0.68079162 -2.09481359 +0.00000000 # 60
# ref node 10 to 12
v -0.53929967 -1.08528543 +0.00000000 # 61
v -0.66158485 -1.22676969 +0.00000000 # 62
v -0.80080003 -1.38589847 +0.00000000 # 63
# ref node 11 to 12
v -0.90017086 -1.98392475 +0.00000000 # 64
v -0.88597983 -1.79487479 +0.00000000 # 65
v -0.90703517 -1.62483418 +0.00000000 # 66
# Patches
# patch (ref 1 2 4 3) (found 9 8 6 4)
v -1.94201326 -0.90960985 +0.00000000 # 67
v -1.79448068 -0.89957356 +0.00000000 # 68
v -1.64789939 -0.89466411 +0.00000000 # 69
v -1.93168771 -0.57751322 +0.00000000 # 70
v -1.78445733 -0.59549576 +0.00000000 # 71
v -1.64114118 -0.61708033 +0.00000000 # 72
v -1.93770528 -0.21450090 +0.00000000 # 73
v -1.78696704 -0.27498978 +0.00000000 # 74
v -1.64308548 -0.33638209 +0.00000000 # 75
f 1 13 67
f 67 16 1
f 13 14 68
f 68 67 13
f 14 15 69
f 69 68 14
f 15 2 22
f 22 69 15
f 16 67 70
f 70 17 16
f 67 68 71
f 71 70 67
f 68 69 72
f 72 71 68
f 69 22 23
f 23 72 69
f 17 70 73
f 73 18 17
f 70 71 74
f 74 73 70
f 71 72 75
f 75 74 71
f 72 23 24
f 24 75 72
f 18 73 28
f 28 3 18
f 73 74 29
f 29 28 73
f 74 75 30
f 30 29 74
f 75 24 4
f 4 30 75
# end of patch
# patch (ref 3 4 6 5) (found 4 6 2 1)
v -1.76840758 +0.12164260 +0.00000000 # 76
v -1.65719903 +0.03382918 +0.00000000 # 77
v -1.54991988 -0.05393972 +0.00000000 # 78
v -1.50200534 +0.41576684 +0.00000000 # 79
v -1.42488790 +0.29150587 +0.00000000 # 80
v -1.34960341 +0.16479778 +0.00000000 # 81
v -1.20968723 +0.67684817 +0.00000000 # 82
v -1.17511904 +0.52474141 +0.00000000 # 83
v -1.14255370 +0.36594769 +0.00000000 # 84
f 3 28 76
f 76 31 3
f 28 29 77
f 77 76 28
f 29 30 78
f 78 77 29
f 30 4 34
f 34 78 30
f 31 76 79
f 79 32 31
f 76 77 80
f 80 79 76
f 77 78 81
f 81 80 77
f 78 34 35
f 35 81 78
f 32 79 82
f 82 33 32
f 79 80 83
f 83 82 79
f 80 81 84
f 84 83 80
f 81 35 36
f 36 84 81
f 33 82 37
f 37 5 33
f 82 83 38
f 38 37 82
f 83 84 39
f 39 38 83
f 84 36 6
f 6 39 84
# end of patch
# patch (ref 5 6 8 7) (found 1 2 5 3)
v -0.80268133 +0.70376182 +0.00000000 # 85
v -0.82067740 +0.53699791 +0.00000000 # 86
v -0.84048349 +0.36466652 +0.00000000 # 87
v -0.44288766 +0.50127840 +0.00000000 # 88
v -0.52472460 +0.35871845 +0.00000000 # 89
v -0.60477138 +0.21596235 +0.00000000 # 90
v -0.09987094 +0.26895592 +0.00000000 # 91
v -0.23970987 +0.15907945 +0.00000000 # 92
v -0.37051862 +0.05026680 +0.00000000 # 93
f 5 37 85
f 85 40 5
f 37 38 86
f 86 85 37
f 38 39 87
f 87 86 38
f 39 6 43
f 43 87 39
f 40 85 88
f 88 41 40
f 85 86 89
f 89 88 85
f 86 87 90
f 90 89 86
f 87 43 44
f 44 90 87
f 41 88 91
f 91 42 41
f 88 89 92
f 92 91 88
f 89 90 93
f 93 92 89
f 90 44 45
f 45 93 90
f 42 91 46
f 46 7 42
f 91 92 47
f 47 46 91
f 92 93 48
f 48 47 92
f 93 45 8
f 8 48 93
# end of patch
# patch (ref 7 8 10 9) (found 3 5 7 10)
v +0.05936889 -0.07349060 +0.00000000 # 94
v -0.10973242 -0.15469839 +0.00000000 # 95
v -0.26144481 -0.24316306 +0.00000000 # 96
v +0.06564914 -0.46652544 +0.00000000 # 97
v -0.10495630 -0.49191234 +0.00000000 # 98
v -0.26400471 -0.53418404 +0.00000000 # 99
v +0.08196294 -0.92675823 +0.00000000 # 100
v -0.08961675 -0.87226200 +0.00000000 # 101
v -0.26208374 -0.84349310 +0.00000000 # 102
f 7 46 94
f 94 49 7
f 46 47 95
f 95 94 46
f 47 48 96
f 96 95 47
f 48 8 52
f 52 96 48
f 49 94 97
f 97 50 49
f 94 95 98
f 98 97 94
f 95 96 99
f 99 98 95
f 96 52 53
f 53 99 96
f 50 97 100
f 100 51 50
f 97 98 101
f 101 100 97
f 98 99 102
f 102 101 98
f 99 53 54
f 54 102 99
f 51 100 55
f 55 9 51
f 100 101 56
f 56 55 100
f 101 102 57
f 57 56 101
f 102 54 10
f 10 57 102
# end of patch
# patch (ref 9 10 12 11) (found 10 7 12 11)
v -0.03340572 -1.36659396 +0.00000000 # 103
v -0.1870. .959 -1.25521636 +0.00000000 # 104
v -0.35217524 -1.17552054 +0.00000000 # 105
v -0.33945929 -1.68379068 +0.00000000 # 106
v -0.42534745 -1.52255034 +0.00000000 # 107
v -0.53308535 -1.38759983 +0.00000000 # 108
v -0.68712240 -1.98646998 +0.00000000 # 109
v -0.69858575 -1.78775406 +0.00000000 # 110
v -0.74391729 -1.60941589 +0.00000000 # 111
f 9 55 103
f 103 58 9
f 55 56 104
f 104 103 55
f 56 57 105
f 105 104 56
f 57 10 61
f 61 105 57
f 58 103 106
f 106 59 58
f 103 104 107
f 107 106 103
f 104 105 108
f 108 107 104
f 105 61 62
f 62 108 105
f 59 106 109
f 109 60 59
f 106 107 110
f 110 109 106
f 107 108 111
f 111 110 107
f 108 62 63
f 63 111 108
f 60 109 64
f 64 11 60
f 109 110 65
f 65 64 109
f 110 111 66
f 66 65 110
f 111 63 12
f 12 66 111
# end of patch
# patch (ref 11 12 2 1) (found 11 12 8 9)
v -1.12161958 -1.97554851 +0.00000000 # 112
v -1.08208680 -1.79934120 +0.00000000 # 113
v -1.06843603 -1.63210869 +0.00000000 # 114
v -1.48550296 -1.66737521 +0.00000000 # 115
v -1.38869011 -1.55295002 +0.00000000 # 116
v -1.30523658 -1.43666792 +0.00000000 # 117
v -1.81520140 -1.31485069 +0.00000000 # 118
v -1.67330658 -1.26992464 +0.00000000 # 119
v -1.53499472 -1.22313774 +0.00000000 # 120
f 11 64 112
f 112 21 11
f 64 65 113
f 113 112 64
f 65 66 114
f 114 113 65
f 66 12 27
f 27 114 66
f 21 112 115
f 115 20 21
f 112 113 116
f 116 115 112
f 113 114 117
f 117 116 113
f 114 27 26
f 26 117 114
f 20 115 118
f 118 19 20
f 115 116 119
f 119 118 115
f 116 117 120
f 120 119 116
f 117 26 25
f 25 120 117
f 19 118 13
f 13 1 19
f 118 119 14
f 14 13 118
f 119 120 15
f 15 14 119
f 120 25 2
f 2 15 120
# end of patch
3-D Curve .OBJ File:
# QCap 3d output file
v -2.85527713 -0.03221592 -8419.06712138 # 1
v -0.14224246 -0.01534742 -486.89139080 # 2
v -2.62345009 -0.01762889 -8530.84375664 # 3
v -0.23255046 -0.00190729 -922.55685018 # 4
v -0.01866798 +0.00069954 -61.34703938 # 5
v -0.00982721 -0.00007092 -34.32393390 # 6
v -0.00000000 +0.00000003 -0.00040929 # 7
v -0.00012924 -0.00000162 -0.69155313 # 8
v +0.00003162 -0.02699671 -0.25688340 # 9
v -0.00016476 -0.00598800 -0.95836709 # 10
v -0.00842451 -2.63002279 -20.88180203 # 11
v -0.01001579 -0.07330902 -15.53092465 # 12
v -2.01115896 -0.02628853 -5672.71661209 # 13
v -1.07313199 -0.02219785 -3924.29190026 # 14
v -0.44514342 -0.01782408 -1753.86040551 # 15
v -3.41517693 -0.02466901 -9609.40554189 # 16
v -3.37326454 -0.02043514 -9845.39485197 # 17
v -3.43472897 -0.02099859 -10161.44458106 # 18
v -2.03236442 -0.07463509 -6502.32132962 # 19
v -0.36363304 -0.46971348 -1427.70800497 # 20
v -0.03501660 -2.49761670 -98.21258141 # 21
v -0.23503214 -0.00438823 -788.44786431 # 22
v -0.24203240 -0.00218083 -962.18015297 # 23
v -0.25496605 -0.00217421 -1049.38634279 # 24
v -0.12413873 -0.02155250 -366.76451410 # 25
v -0.05056833 -0.04119963 -126.99065078 # 26
v -0.02145381 -0.08052446 -51.73002238 # 27
v -1.88039672 -0.01307629 -6327.78123433 # 28
v -1.12155807 -0.00909434 -4400.85608913 # 29
v -0.56634649 -0.00536210 -2594.71590092 # 30
v -1.62107081 -0.01191306 -5764.88409138 # 31
v -0.41079149 -0.00469387 -2280.82000873 # 32
v -0.06726505 +0.00002837 -266.89035889 # 33
v -0.19865805 -0.00145849 -705.77779978 # 34
v -0.07815112 -0.00058524 -283.20306573 # 35
v -0.03080266 -0.00021141 -102.30732482 # 36
v -0.01682194 +0.00016390 -56.73622862 # 37
v -0.01441297 -0.00005423 -47.73177889 # 38
v -0.01215826 -0.00008078 -39.86332712 # 39
v -0.00434614 +0.00105931 -22.90726567 # 40
v -0.00006641 +0.00011584 -0.65094993 # 41
v -0.00000005 +0.00000179 -0.00207036 # 42
v -0.00674721 -0.00005568 -26.94709063 # 43
v -0.00144630 -0.00002574 -12.45557619 # 44
v -0.00024749 -0.00000367 -1.77636902 # 45
v -0.00000012 -0.00000001 -0.00345901 # 46
v -0.00000251 -0.00000011 -0.05193291 # 47
v -0.00002016 -0.00000038 -0.18241833 # 48
v +0.00000000 -0.00000000 -0.00002159 # 49
v +0.00000010 -0.00001891 -0.00078312 # 50
v +0.00001753 -0.00614668 -0.14239711 # 51
v -0.00008403 -0.00000239 -0.55814146 # 52
v -0.00008506 -0.00007836 -0.54528040 # 53
v -0.00009877 -0.00091320 -0.60508061 # 54
v +0.00002711 -0.02114960 -0.22083773 # 55
v +0.00001004 -0.01318929 -0.10110221 # 56
v -0.00000932 -0.00976820 -0.21216153 # 57
v +0.00009276 -0.09562358 -0.75384385 # 58
v +0.00024612 -0.62347514 -2.22389916 # 59
v -0.00090188 -2.73897631 -7.06516040 # 60
v -0.00023790 -0.01312871 -1.35721522 # 61
v -0.00085836 -0.02856311 -4.51073016 # 62
v -0.00381511 -0.06670572 -14.02032951 # 63
v -0.00828948 -1.85179529 -20.34411406 # 64
v -0.00783671 -0.66381689 -20.80194305 # 65
v -0.00880770 -0.24749125 -17.43733671 # 66
v -2.53419271 -0.01904568 -7083.31425689 # 67
v -1.39365898 -0.01317829 -4600.26935242 # 68
v -0.68013621 -0.00888908 -2758.99655253 # 69
v -2.49214466 -0.01547562 -7440.51375781 # 70
v -1.37477979 -0.01006933 -4813.86728792 # 71
v -0.68687306 -0.00630708 -2978.84966637 # 72
v -2.54307764 -0.01609799 -7789.94723278 # 73
v -1.39249405 -0.01025520 -4962.39409348 # 74
v -0.70667953 -0.00642809 -3109.89677588 # 75
v -1.28677020 -0.01020176 -4936.76106444 # 76
v -0.77834018 -0.00725734 -3511.92005176 # 77
v -0.40130071 -0.00371621 -1798.32295641 # 78
v -0.32988980 -0.00312686 -1515.17077348 # 79
v -0.20263166 -0.00190769 -923.28330745 # 80
v -0.12915929 -0.00110602 -535.21923286 # 81
v -0.05878617 -0.00025971 -230.52838388 # 82
v -0.04910164 -0.00031992 -172.85586576 # 83
v -0.03923099 -0.00029169 -141.91551275 # 84
v -0.00467838 +0.00027407 -23.80184244 # 85
v -0.00521873 -0.00000271 -25.14510237 # 86
v -0.00584621 -0.00005237 -26.17773299 # 87
v -0.00012404 +0.00003166 -1.13154307 # 88
v -0.00028037 -0.00000355 -2.64744933 # 89
v -0.00060445 -0.00001128 -5.48150737 # 90
v -0.00000054 +0.00000024 -0.01571355 # 91
v -0.00000999 -0.00000024 -0.11885048 # 92
v -0.00004294 -0.00000088 -0.42698580 # 93
v -0.00000001 -0.00000000 -0.00050427 # 94
v -0.00000050 -0.00000003 -0.01193154 # 95
v -0.00001100 -0.00000035 -0.10635566 # 96
v +0.00000011 -0.00002340 -0.00119929 # 97
v -0.00000033 -0.00003169 -0.00844615 # 98
v -0.00001055 -0.00004896 -0.09674927 # 99
v +0.00001133 -0.00420525 -0.09220026 # 100
v +0.00000443 -0.00215703 -0.04402449 # 101
v -0.00000793 -0.00144571 -0.11328230 # 102
v +0.00005516 -0.06620950 -0.44985113 # 103
v +0.00002237 -0.03543796 -0.22793116 # 104
v -0.00002037 -0.02281117 -0.27080849 # 105
v +0.00003692 -0.38805329 -0.69049081 # 106
v -0.00005338 -0.15727912 -0.62637565 # 107
v -0.00022221 -0.07233456 -1.36582291 # 108
v -0.00124661 -1.90076302 -5.98913556 # 109
v -0.00142655 -0.65008611 -8.91167313 # 110
v -0.00232268 -0.23364924 -11.28525589 # 111
v -0.03132206 -1.74892739 -89.16884195 # 112
v -0.02696359 -0.66409792 -75.33651175 # 113
v -0.02477265 -0.25774329 -63.07236653 # 114
v -0.26394852 -0.33458667 -1041.68660713 # 115
v -0.15294564 -0.17435603 -535.92246871 # 116
v -0.09509394 -0.09239214 -276.07382663 # 117
v -1.51148809 -0.06159783 -4851.07674779 # 118
v -0.79879554 -0.04579098 -3169.24692727 # 119
v -0.35061080 -0.03157385 -1267.84849781 # 120
f 1 13 67
f 67 16 1
f 13 14 68
f 68 67 13
f 14 15 69
f 69 68 14
f 15 2 22
f 22 69 15
f 16 67 70
f 70 17 16
f 67 68 71
f 71 70 67
f 68 69 72
f 72 71 68
f 69 22 23
f 23 72 69
f 17 70 73
f 73 18 17
f 70 71 74
f 74 73 70
f 71 72 75
f 75 74 71
f 72 23 24
f 24 75 72
f 18 73 28
f 28 3 18
f 73 74 29
f 29 28 73
f 74 75 30
f 30 29 74
f 75 24 4
f 4 30 75
f 3 28 76
f 76 31 3
f 28 29 77
f 77 76 28
f 29 30 78
f 78 77 29
f 30 4 34
f 34 78 30
f 31 76 79
f 79 32 31
f 76 77 80
f 80 79 76
f 77 78 81
f 81 80 77
f 78 34 35
f 35 81 78
f 32 79 82
f 82 33 32
f 79 80 83
f 83 82 79
f 80 81 84
f 84 83 80
f 81 35 36
f 36 84 81
f 33 82 37
f 37 5 33
f 82 83 38
f 38 37 82
f 83 84 39
f 39 38 83
f 84 36 6
f 6 39 84
f 5 37 85
f 85 40 5
f 37 38 86
f 86 85 37
f 38 39 87
f 87 86 38
f 39 6 43
f 43 87 39
f 40 85 88
f 88 41 40
f 85 86 89
f 89 88 85
f 86 87 90
f 90 89 86
f 87 43 44
f 44 90 87
f 41 88 91
f 91 42 41
f 88 89 92
f 92 91 88
f 89 90 93
f 93 92 89
f 90 44 45
f 45 93 90
f 42 91 46
f 46 7 42
f 91 92 47
f 47 46 91
f 92 93 48
f 48 47 92
f 93 45 8
f 8 48 93
f 7 46 94
f 94 49 7
f 46 47 95
f 95 94 46
f 47 48 96
f 96 95 47
f 48 8 52
f 52 96 48
f 49 94 97
f 97 50 49
f 94 95 98
f 98 97 94
f 95 96 99
f 99 98 95
f 96 52 53
f 53 99 96
f 50 97 100
f 100 51 50
f 97 98 101
f 101 100 97
f 98 99 102
f 102 101 98
f 99 53 54
f 54 102 99
f 51 100 55
f 55 9 51
f 100 101 56
f 56 55 100
f 101 102 57
f 57 56 101
f 102 54 10
f 10 57 102
f 9 55 103
f 103 58 9
f 55 56 104
f 104 103 55
f 56 57 105
f 105 104 56
f 57 10 61
f 61 105 57
f 58 103 106
f 106 59 58
f 103 104 107
f 107 106 103
f 104 105 108
f 108 107 104
f 105 61 62
f 62 108 105
f 59 106 109
f 109 60 59
f 106 107 110
f 110 109 106
f 107 108 111
f 111 110 107
f 108 62 63
f 63 111 108
f 60 109 64
f 64 11 60
f 109 110 65
f 65 64 109
f 110 111 66
f 66 65 110
f 111 63 12
f 12 66 111
f 11 64 112
f 112 21 11
f 64 65 113
f 113 112 64
f 65 66 114
f 114 113 65
f 66 12 27
f 27 114 66
f 21 112 115
f 115 20 21
f 112 113 116
f 116 115 112
f 113 114 117
f 117 116 113
f 114 27 26
f 26 117 114
f 20 115 118
f 118 19 20
f 115 116 119
f 119 118 115
f 116 117 120
f 120 119 116
f 117 26 25
f 25 120 117
f 19 118 13
f 13 1 19
f 118 119 14
f 14 13 118
f 119 120 15
f 15 14 119
f 120 25 2
f 2 15 120

Claims

What is claimed is:
1. A method comprising: applying a series of curves on specified regions of a performer's face; tracking the movement of the series of curves during a motion capture session; and generating motion data representing the movement of the performer's face using the tracked movement of the series of curves.
2. The method as in claim 1 wherein the curves are comprised of a retro-reflective material.
3. The method as in claim 1 wherein two or more different colors are used for different curves applied to different portions of the performer's face.
4. The method as in claim 3 further comprising: identifying one or more of the curves based on the color of the curves.
5. The method as in claim 1 wherein applying further comprises: creating a mask having slits corresponding to the curves; placing the mask over the performer's face; and applying the curves through the slits in the mask.
6. The method as in claim 1 wherein tracking comprises capturing a video of the curves from two or more different angles and wherein generating motion data comprises: generating two-dimensional ("2-D") data representing the motion of the curves in two dimensions from each of the two or more different angles; and using the 2-D data from the two or more different angles to generate three-dimensional ("3-D") data representing the motion of the curves in three dimensions.
7. The method as in claim 6 further comprising: storing the 2-D data and 3-D data in a .OBJ file format.
8. A method comprising: capturing video of a plurality of curves painted on a performer's face during a motion capture session; identifying each of the curves and intersection points of the curves within frames of the captured video; and generating motion data describing the motion of each of the curves and/or intersection points over time during the motion capture session.
9. The method as in claim 8 wherein capturing further comprises: generating two or more video streams of the plurality of curves, the two or more video streams captured from two or more different angles; and storing the two or more streams to a mass storage device.
10. The method as in claim 9 wherein the video streams are stored in the Audio Video Interleaved ("AVI") format.
11. The method as in claim 8 wherein identifying further comprises: cleaning the video frames to increase contrast between the curves and other image data within the video frames.
12. The method as in claim 8 wherein identifying further comprises: correlating the curve images from the captured frames with curve data provided in a curve data file; and identifying curves having the highest correlation to corresponding curves in the curve data file.
13. The method as in claim 12 wherein correlating further comprises: comparing a color of each of the curves in the captured video frames to a known color of each of the curves stored within the curve data file.
14. The method as in claim 8 further comprising: quantizing the identified curves into a plurality of line segments based on a specified resolution; using the intersection points of the line segments as the vertices of planar triangles to create a geometric mesh; and generating motion data describing the geometric mesh.
15. The method as in claim 9 further comprising: storing two-dimensional ("2-D") data for each of the two or more video streams; and analyzing the 2-D data from each of the two or more video streams to generate 3-D data describing the motion of the curves in three dimensions during the performance.
16. The method as in claim 15 wherein the 2-D data and 3-D data are stored in a .OBJ file format.
17. A machine-readable medium having program code stored thereon which, when executed by a machine, causes the machine to perform the operations of: capturing video of a plurality of curves painted on a performer's face during a motion capture session; identifying each of the curves and intersection points of the curves within frames of the captured video; and generating motion data describing the motion of each of the curves and/or intersection points over time during the motion capture session.
18. The machine-readable medium as in claim 8 wherein capturing further comprises: generating two or more video streams of the plurality of curves, the two or more video streams captured from two or more different angles; and storing the two or more streams to a mass storage device.
19. The machine-readable medium as in claim 9 wherein the video streams are stored in the Audio Video Interleaved ("AVI") format.
20. The machine-readable medium as in claim 8 wherein identifying further comprises: cleaning the video frames to increase contrast between the curves and other image data within the video frames.
21. The machine-readable medium as in claim 8 wherein identifying further comprises: correlating the curve images from the captured frames with curve data provided in a curve data file; and identifying curves having the highest correlation to corresponding curves in the curve data file.
22. The machine-readable medium as in claim 21 wherein correlating further comprises: comparing a color of each of the curves in the captured video frames to a known color of each of the curves stored within the curve data file.
23. The machine-readable medium as in claim 8 further comprising: quantizing the identified curves into a plurality of line segments based on a specified resolution; using the intersection points of the line segments as the vertices of planar triangles to create a geometric mesh; and generating motion data describing the geometric mesh.
24. The machine-readable medium as in claim 9 further comprising: storing two-dimensional ("2-D") data for each of the two or more video streams; and analyzing the 2-D data from each of the two or more video streams to generate 3-D data describing the motion of the curves in three dimensions during the performance.
25. The machine-readable medium as in claim 15 wherein the 2-D data and 3-D data are stored in a .OBJ file format.
26. A system comprising: a plurality of camera controllers to capture video of a plurality of curves painted on a performer's face during a motion capture session; curve identification logic identifying each of the curves and intersection points of the curves within frames of the captured video; and motion capture logic generating motion data describing the motion of each of the curves and/or intersection points over time during the motion capture session.
27. A method comprising: positioning a plurality of color-coded motion capture markers at a plurality of points on a performer's body, wherein the color-coded motion capture markers are colored with at least two or more different colors; and tracking the color-coded motion capture markers during a motion capture session using two or more color cameras and a color-coded motion capture subsystem, the color-coded motion capture subsystem identifying each individual color-coded motion capture element based on its color and/or its relationship to the other color-coded motion capture markers.
28. The method as in claim 27 wherein tracking further comprises: performing a lookup within a database or table comprising color-coding data, the color coding data associating a particular color with each individual color-coded motion tracking element.
29. The method as in claim 27 wherein the color-coded motion capture markers are comprised of a retro-reflective material.
30. The method as in claim 28 further comprising: tracking the color-coded motion capture markers based on a predefined spatial relationship between each of the color-coded motion capture markers.
31. The method as in claim 27 further comprising: generating motion data describing the movement of the color-coded motion capture markers.
32. The method as in claim 31 further comprising: providing the motion data to a graphics processing subsystem, the graphics processing subsystem using the motion data to generate a graphical representation of the movement of the performer during the motion capture session.
33. A system comprising: a plurality of color cameras positioned to capture light reflected off of a plurality of color-coded motion capture markers, each of the color-coded motion capture markers positioned at a different point on a performer's body; and a color-coded motion tracking subsystem to identify each of the color-coded motion capture markers based on the color of the light reflected off of each of the color-coded motion capture markers, and to generate motion data describing the motion of each of the color-coded motion capture markers.
34. The system as in claim 33 further comprising: a graphics processing subsystem to interpret the motion data and responsively generate a graphical representation of the performer's motion.
35. The system as in claim 33 wherein the number of colors used to color the color-coded motion capture markers is less than the total number of color-coded motion capture markers.
36. The system as in claim 35 wherein the color-coded motion tracking subsystem identifies color-coded motion capture markers of the same color using a predefined spatial relationship between each of the color-coded motion tracking markers.
37. The system as in claim 34 further comprising: a color display to display the graphical representation of the performer's motion.
38. A system comprising: camera means positioned to capture light reflected off of a plurality of color-coded motion capture markers, each of the color-coded motion capture markers positioned at a different point on a performer's body; and color-coded motion tracking means to identify each of the color-coded motion capture markers based on the color of the light reflected off of each of the color-coded motion capture markers, and to generate motion data describing the motion of each of the color-coded motion capture markers.
39. The system as in claim 38 further comprising: graphics processing means to interpret the motion data and responsively generate a graphical representation of the performer's motion.
40. The system as in claim 38 wherein the number of colors used to color the color-coded motion capture markers is less than the total number of color-coded motion capture markers.
41. The system as in claim 40 wherein the color-coded motion tracking means identifies color-coded motion capture markers of the same color using a predefined spatial relationship between each of the color-coded motion tracking markers.
42. The system as in claim 38 further comprising: a color display to display the graphical representation of the performer's motion.
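
The identification recited in claims 33, 35 and 36 (and their means-plus-function counterparts 38, 40 and 41) can be illustrated with a minimal sketch. It is not the applicants' implementation; it only assumes that each detected marker arrives as an RGB value plus a 2D image position, and that the "predefined spatial relationship" is a stored reference layout. The palette, the marker names, the layout coordinates and the functions classify_color, identify_markers and displacement are all hypothetical and chosen for illustration.

```python
from itertools import permutations
from math import dist

# Hypothetical palette: color name -> nominal RGB value.
PALETTE = {"red": (255, 0, 0), "green": (0, 255, 0), "blue": (0, 0, 255)}

# Hypothetical reference layout: marker id -> (color, reference (x, y)).
# Three colors cover five markers, so color alone is ambiguous (claim 35).
LAYOUT = {
    "left_brow": ("red", (30.0, 20.0)),
    "right_brow": ("red", (70.0, 20.0)),
    "nose": ("green", (50.0, 50.0)),
    "left_mouth": ("blue", (35.0, 80.0)),
    "right_mouth": ("blue", (65.0, 80.0)),
}


def classify_color(rgb):
    """Return the palette color nearest to the observed RGB value."""
    return min(PALETTE, key=lambda name: dist(PALETTE[name], rgb))


def identify_markers(detections):
    """Map detections [(rgb, (x, y)), ...] to {marker_id: (x, y)}.

    Detections are first grouped by color; markers that share a color are
    then told apart by the assignment that best fits the reference layout.
    """
    by_color = {}
    for rgb, pos in detections:
        by_color.setdefault(classify_color(rgb), []).append(pos)

    identified = {}
    for color, positions in by_color.items():
        ids = [m for m, (c, _) in LAYOUT.items() if c == color]
        if len(positions) > len(ids):
            raise ValueError(f"more {color} detections than {color} markers")
        # Brute force is acceptable here because each color group is small.
        best = min(
            permutations(ids, len(positions)),
            key=lambda perm: sum(
                dist(LAYOUT[m][1], p) for m, p in zip(perm, positions)
            ),
        )
        identified.update(zip(best, positions))
    return identified


def displacement(prev, curr):
    """Per-marker (dx, dy) between two identified frames: simple motion data."""
    return {
        m: (curr[m][0] - prev[m][0], curr[m][1] - prev[m][1])
        for m in curr
        if m in prev
    }
```

Calling identify_markers on each captured frame and passing consecutive results to displacement yields a per-marker motion record of the kind a graphics processing subsystem (claims 34 and 39) could consume. A production tracker would work in 3D across multiple cameras and would need to handle occluded markers, which this sketch does not attempt.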
PCT/US2005/032418 2004-09-15 2005-09-12 Apparatus and method for capturing the expression of a performer WO2006031731A2 (en)

Applications Claiming Priority (4)

Application Number Publication Number Priority Date Filing Date Title
US10/942,413 US8194093B2 (en) 2004-09-15 2004-09-15 Apparatus and method for capturing the expression of a performer
US10/942,609 US20060055706A1 (en) 2004-09-15 2004-09-15 Apparatus and method for capturing the motion of a performer

Publications (2)

Publication Number Publication Date
WO2006031731A2 (en) 2006-03-23
WO2006031731A3 (en) 2009-04-02

Family

ID=36060608

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/032418 WO2006031731A2 (en) 2004-09-15 2005-09-12 Apparatus and method for capturing the expression of a performer

Country Status (1)

Country Link
WO (1) WO2006031731A2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6850872B1 (en) * 2000-08-30 2005-02-01 Microsoft Corporation Facial image processing methods and systems

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
GRAHAM, M. THE POWER OF TEXTURE: A NEW APPROACH FOR SURFACE CAPTURE OF THE HUMAN HAND 30 April 2004, pages 1 - 23 *
GUSKOV, I. ET AL.: 'Trackable Surfaces' SIGGRAPH 2003 July 2003, pages 251 - 257, 379 *
PARKE, F. ET AL.: 'Computer Generated Animation of Faces' SIGGRAPH 1972, pages 451 - 457 *
SCOTT, R. SPARKING LIFE NOTES ON THE PERFORMANCE CAPTURE SESSIONS FOR THE LORD OF THE RINGS: THE TWO TOWERS vol. 37, no. 4, November 2003, page 17 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9996162B2 (en) 2015-12-21 2018-06-12 Intel Corporation Wearable sensor system for providing a personal magnetic field and techniques for horizontal localization utilizing the same
US10289208B2 (en) 2015-12-21 2019-05-14 Intel Corporation Wearable sensor system for providing a personal magnetic field and techniques for horizontal localization utilizing the same

Also Published As

Publication number Publication date
WO2006031731A3 (en) 2009-04-02

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 05796513

Country of ref document: EP

Kind code of ref document: A2