pose-capture
v0.4.0
Published
A set of components to perform Tensorflow.js pose detection on hands, face, and body. Provides a player to perform visualization overlays while source video or webcam is playing.
Downloads
9
Maintainers
Readme
Releases
- 0.4.0 Added "isimage" attribute to pose detection components to allow for image sources
Pose-Capture
Pose Capture is a package which contains a number of custom elements related to capturing and recording poses in various ways, all enabled by Tensorflow.js.
Each of these components act like a normal video player, but allow pose tracking, eventing, and overlay visualization of the points.
These pose capture components can record keyframes to a JSON file, and offer the option of recording audio from the video source to allow playback synced with the keyframe points.
Video sources can be a video file supported by your browser or your webcam. As video data needs
to be analyzed by Tensorflow.js, they are subject to the same security restrictions
as any video being captured to a <canvas>
element. This typically means that your video
can't come from a different domain and must be done over HTTPS.
Most of the components DO deliver a depth coordinate to compliment the X/Y coordinates, however as the first goal of this package is to overlay points accurately over video, there hasn't been much thought put into how to normalize/use this 3rd dimension. Additionally, some of these solutions can deliver full 3D keypoints as well, but these are in meters, and don't map well to pixels on your screen.
Hopefully the 3D story will be fleshed out in future updates.
Main Components
The main components in this library serve to show video playback from a source, whether a browser supported video file or a user's webcam.
Without the helper components listed below, these components will not show visualization or a playback controls UI.
<posedetection-video>
<posedetection-video>
is to capture full-body poses. The currently configured
Pose Detection solution uses BlazePose.
MediaPipe BlazePose can detect 33 keypoints, in addition to the 17 COCO keypoints, it provides additional keypoints for face, hands and feet.
<bodypix-video>
<bodypix-video>
is to capture full-body poses. The currently configured
BodyPix solution tracks similar points as the <posedetection-video>
above, but seems
a bit less accurate, at least the way it's configured in this package. BodyPix seems more appropriate for
live segmentation, so the use-case presented here might not be appropriate for how it was
intended. More exploration is needed, so unless you know what you're doing, you
may want to rely on <posedetection-video>
instead.
<facelandmark-video>
<facelandmark-video>
is to capture many points to create something that can
be used as a 3D mesh of a person's face.
This component uses MediaPipe Facemesh as the engine.
<handpose-video>
<handpose-video>
is to capture points along a person's hand including along each of their fingers.
This component uses MediaPipe Hands as the engine.
<pose-player>
<pose-player>
is to playback keyframes and audio captured by the above components
Helper Components
Helper components are designed to live as children inside the main pose capture components. They adhere to a specific API and live as slotted elements, where the parent component passes playback state or keyframes to them.
These components have a specific look/style, that can be customized a small bit, but the intention
is for consumers of this package to create their own using either the PlayerState
interface for
creating UI controls, or the AbstractVisualizer
class for visualization.
When adding both of these as children, be careful about order. A visualizer that takes up the entire size of the component can block mouse input to the player controls.
<pose-playback-controls>
<pose-playback-controls>
is a component designed to be a child inside an above
pose capture, and offers playback and recording controls for a video.
<visualization-canvas>
<visualization-canvas>
is a component designed to be a child inside an above
pose capture, and offers an overlay layer to see points captured on a given keyframe
Setup
Install dependencies:
npm i
Build
This project uses the TypeScript compiler to produce JavaScript that runs in modern browsers.
To build the JavaScript version of your component:
npm run build
To watch files and rebuild when the files are modified, run the following command in a separate shell:
npm run build:watch
Both the TypeScript compiler and lit-analyzer are configured to be very strict. You may want to change tsconfig.json
to make them less strict.
Testing
Note: There are no actual tests to run in this project yet
This project uses modern-web.dev's @web/test-runner for testing. See the modern-web.dev testing documentation for more information.
Tests can be run with the test
script, which will run your tests against Lit's development mode (with more verbose errors) as well as against Lit's production mode:
npm test
For local testing during development, the test:dev:watch
command will run your tests in Lit's development mode (with verbose errors) on every change to your source files:
npm test:watch
Alternatively the test:prod
and test:prod:watch
commands will run your tests in Lit's production mode.
Dev Server
This sample uses modern-web.dev's @web/dev-server for previewing the project without additional build steps. Web Dev Server handles resolving Node-style "bare" import specifiers, which aren't supported in browsers. It also automatically transpiles JavaScript and adds polyfills to support older browsers. See modern-web.dev's Web Dev Server documentation for more information.
To run the dev server and open the project in a new browser tab:
npm run serve
There is a development HTML file located at /dev/index.html
that you can view at http://localhost:8000/dev/index.html. Note that this command will serve your code using Lit's development mode (with more verbose errors). To serve your code against Lit's production mode, use npm run serve:prod
.
Linting
Linting of TypeScript files is provided by ESLint and TypeScript ESLint. In addition, lit-analyzer is used to type-check and lint lit-html templates with the same engine and rules as lit-plugin.
The rules are mostly the recommended rules from each project, but some have been turned off to make LitElement usage easier. The recommended rules are pretty strict, so you may want to relax them by editing .eslintrc.json
and tsconfig.json
.
To lint the project run:
npm run lint
Formatting
Prettier is used for code formatting. It has been pre-configured according to the Lit's style. You can change this in .prettierrc.json
.
Prettier has not been configured to run when committing files, but this can be added with Husky and and pretty-quick
. See the prettier.io site for instructions.
Static Site
This project includes a simple website generated with the eleventy static site generator and the templates and pages in /docs-src
. The site is generated to /docs
and intended to be checked in so that GitHub pages can serve the site from /docs
on the master branch.
To enable the site go to the GitHub settings and change the GitHub Pages "Source" setting to "master branch /docs folder".
To build the site, run:
npm run docs
To serve the site locally, run:
npm run docs:serve
To watch the site files, and re-build automatically, run:
npm run docs:watch
The site will usually be served at http://localhost:8000.