Sapiens2-1B-Pose

308-keypoint top-down pose estimation including detailed face (274 keypoints), hand, and foot keypoints. Predictions follow the Sociopticon keypoint format.

This repository contains the 1B Pose Estimation checkpoint, finetuned from the Sapiens2-1B pretrained backbone.

Pose is top-down — it requires bounding boxes from a person detector. We use RTMDet.

📄 Paper: arXiv:2604.21681
🌐 Project Page: rawalkhirodkar.github.io/sapiens2
💻 Code: github.com/facebookresearch/sapiens2

Model Details

Developed by: Meta
Model type: Vision Transformer
License: Sapiens2 License
Task: pose
Base model: facebook/sapiens2-pretrain-1b
Format: safetensors
File: sapiens2_1b_pose.safetensors

Quick Start

Install the Sapiens2 repo (pip install -e .), download the checkpoint, and run the demo:

# 1. Download the checkpoint to $SAPIENS_CHECKPOINT_ROOT/pose/
hf download facebook/sapiens2-pose-1b sapiens2_1b_pose.safetensors \
    --local-dir ~/sapiens2_host/pose

# 2. Run the demo (edit INPUT, OUTPUT, and MODEL_NAME inside the script)
cd $SAPIENS_ROOT/sapiens/pose
./scripts/demo/keypoints308.sh

See the Pose Estimation guide for details on inputs, outputs, and visualization options.

Model Card

Field	Value
Architecture	Sapiens2 ViT backbone + Pose Estimation head
Backbone parameters	1.462 B
Backbone FLOPs	4.715 T
Embedding dim	1536
Layers	40
Attention heads	24
Inference resolution	1024 × 768 (H × W)
Patch size	16

Sapiens2-Pose Family

Model	Params	FLOPs	Embed dim	Layers	Heads
Sapiens2-0.4B	0.398 B	1.260 T	1024	24	16
Sapiens2-0.8B	0.818 B	2.592 T	1280	32	16
Sapiens2-1B (this)	1.462 B	4.715 T	1536	40	24
Sapiens2-1B-4K	1.607 B	—	1536	40	24
Sapiens2-5B	5.071 B	15.722 T	2432	56	32

See the Sapiens2 Collection for all variants and other downstream task checkpoints.

Intended Use

Pose Estimation on human-centric imagery
Research on human-centric vision

License

Released under the Sapiens2 License.

Citation

@article{khirodkarsapiens2,
  title={Sapiens2},
  author={Khirodkar, Rawal and Wen, He and Martinez, Julieta and Dong, Yuan and Su, Zhaoen and Saito, Shunsuke},
  journal={arXiv preprint arXiv:2604.21681},
  year={2026}
}

facebook/sapiens2-pose-1b

Sapiens2-1B-Pose

Model Details

Quick Start

Model Card

Sapiens2-Pose Family

Intended Use

License

Citation

No reviews yet

Model Info

Community

Rating Guidelines