CVPR Tutorial June 2024: Deep Learning for Camera Physiological Measurement
Event: CVPR 2024 Tutorial · Duration: 59 min · ▶ Watch on YouTube
Abstract
This tutorial introduces deep learning techniques for camera-based physiological measurement. It covers the principles behind extracting vital signs like heart rate, breathing, and blood pressure from video, discussing the underlying optical models and photoplethysmography (PPG) waveforms. The presentation highlights the challenges of limited real-world datasets and explores solutions through synthetic data generation and data augmentation techniques. It also delves into various deep learning models, including convolutional neural networks and self-supervised learning approaches, for robust and accurate vital sign estimation.
Speakers
- Daniel McDuff — Affiliate Prof. University of Washington, Google
Talks (1)
- 00:00:00 — Daniel McDuff: Deep Learning for Camera Physiological Measurement
- An overview of deep learning techniques for camera-based physiological measurement, covering principles, data, models, optimization, and evaluation.
Key Takeaways
- Camera-based physiological measurement leverages subtle changes in skin color due to blood flow, offering a non-contact method for vital sign monitoring.
- Deep learning models, particularly CNNs and RNNs, are crucial for extracting accurate physiological signals from video, often employing motion and appearance branches.
- Data scarcity is a major challenge, addressed by synthetic data generation (e.g., SCAMPS dataset) and various data augmentation techniques (motion, speed, spatial, color space).
- Self-supervised learning methods like SimPer show promise in improving representation learning for periodic targets, enhancing model generalization and robustness.
- Labels matter significantly; models trained on finger-based PPG may not generalize well to face-based PPG due to morphological differences.
Methods / Models / Datasets Mentioned
Shafer's Dichromatic Reflection Model (DRM)Independent Component Analysis (ICA)Chrominance (CHROM)Local Grouping Index (LGI)Plane Orthogonal to Skin (POS)Principal Component Analysis (PCA)Green Channel (GREEN)Pulse-Blood Volume (PBV)DeepPhysPhysNetTS-CANEfficientPhysHybrid-CANSimPerMoCo v2SimCLRCVRLBYOLSCAMPS DatasetUBFC-rPPG DatasetPURE DatasetAFRL DatasetMMSE-HR DatasetMR-NIRP DatasetMAHNOB-HCI DatasetBP4D DatasetVIPL-HR DatasetCOHFACE DatasetUBFC-PHYS DatasetRICE CameraHRV DatasetOBF DatasetPFF DatasetVicarrPPG DatasetVicarrPPG-2/CleanerPPG DatasetCMU DatasetMMPD DatasetrPPG-ToolboxiPhys Toolbox
Topics
Camera Physiological Measurement · Deep Learning · Photoplethysmography (PPG) · Vital Signs · Synthetic Data Generation · Data Augmentation · Self-Supervised Learning · Model Architectures
Notes
Open for commentary — connections to other work, critiques, follow-up reading.