PRIMA / README.md

add example for loading the models

19121a2 verified about 1 month ago

4.49 kB

	---
	tags:
	- computer_vision
	- animal_pose_and_shape_estimation
	- DeepLabCut
	pipeline_tag: image-to-3d
	---
	# MODEL CARD:

	## Model Details

	• PRIMA model(s) developed by the [M.W.Mathis Lab](http://www.mackenziemathislab.org/) in 2026, trained to predict quadruped shape and pose from images.
	Please see paper link for details.

	• There are two main models:
	- `s1ckpt.ckpt` is the stage-1 model trained with Animal3D, CtrlAni3D, and Quadruped2D datasets.
	- `s3ckpt.ckpt` is the stage-3 model trained with Animal3D, CtrlAni3D, and Quadruped3D datasets.

	```python
	from pathlib import Path
	from huggingface_hub import hf_hub_download

	repo_id = "MLAdaptiveIntelligence/PRIMA"

	model_dir = Path("./prima_model")
	model_dir.mkdir(parents=True, exist_ok=True)

	# download stage-1 checkpoint
	s1_path = hf_hub_download(
	repo_id=repo_id,
	filename="s1ckpt.ckpt",
	local_dir=model_dir
	)

	# donwload stage-3 checkpoint
	s3_path = hf_hub_download(
	repo_id=repo_id,
	filename="s3ckpt.ckpt",
	local_dir=model_dir
	)
	```
	## Intended Use
	• Intended to be used for shape and pose estimation of quadruped images taken from a single view.

	• Intended for academic and research professionals working in fields related to animal behavior, such as neuroscience
	and ecology.

	• Not suitable as a zero-shot model for applications that require high shape and pose precision, but can be further optimized with 2D keypoint
	annotations or from SuperAnimal to improve accuracy. Also, it is not suitable for videos that look dramatically different from those
	we show in the paper.

	## Metrics
	• PA-MPJPE (Procrustes-aligned mean per-joint position error), computed over 3D joints.

	• PA-MPVPE (Procrustes-aligned mean per-vertex position error), computed over the SMAL mesh vertices.

	• PCK (Percentage of Correct Keypoints) measures the proportion of predicted keypoints within a specified threshold of the ground-truth keypoints.

	• AUC (Area Under the Curve), computed by integrating the PCK values as the threshold varies from 0 to 1.


	## Evaluation Data
	• In the paper we benchmark on Animal3d, CtrlAni3D, Quadruped2D, and AnimalKingdom.

	## Training Data:
	It consists of being trained together on the following datasets:
	- Animal3D see full details at (1).
	- CtrlAni3D See full details at (2).
	- Quadruped2D See full details at (3).
	- Quadruped3D See full details at paper link.


	## Ethical Considerations
	• No experimental data were collected for this model; all datasets used are cited.

	## License
	Modified MIT.

	Copyright 2026 by Mackenzie Mathis, Xiaohang Yu, and contributors.

	Permission is hereby granted to you (hereafter "LICENSEE") a fully-paid, non-exclusive,
	and non-transferable license for academic, non-commercial purposes only (hereafter “LICENSE”)
	to use the "MODEL" weights (hereafter "MODEL"), subject to the following conditions:

	The above copyright notice and this permission notice shall be included in all copies or substantial
	portions of the Software:

	This software may not be used to harm any animal deliberately.

	LICENSEE acknowledges that the MODEL is a research tool.
	THE MODEL IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING
	BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.
	IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
	WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE MODEL
	OR THE USE OR OTHER DEALINGS IN THE MODEL.

	If this license is not appropriate for your application, please contact Prof. Mackenzie W. Mathis
	(mackenzie@post.harvard.edu) and/or the TTO office at EPFL (tto@epfl.ch) for a commercial use license.

	Please cite paper link if you use this model in your work.

	## References
	1. Xu, J., Zhang, Y., Peng, J., Ma, W., Jesslen, A., Ji, P., Hu, Q., Zhang, J., Liu, Q.,
	Wang, J., et al.: Animal3d: A comprehensive dataset of 3d animal pose and shape.
	In: ICCV. pp. 9099–9109 (2023)
	2. Lyu, J., Zhu, T., Gu, Y., Lin, L., Cheng, P., Liu, Y., Tang, X., An, L.: Animer:
	Animal pose and shape estimation using a family-aware transformer. In: CVPR. pp.
	17486–17496 (2025)
	3. Ye, S., Filippova, A., Lauer, J., Schneider, S., Vidal, M., Qiu, T., Mathis, A.,
	Mathis, M.W.: Superanimal pretrained pose estimation models for behavioral analysis. Nature communications 15(1), 5165 (2024)