Bridging the Gap Between Computational Photography and Visual Recognition: 7th UG2+ Workshop & Challenge
Event: CVPR 2024 Workshop · Duration: 136 min · ▶ Watch on YouTube
Abstract
The 7th UG2+ Workshop & Challenge focuses on bridging the gap between computational photography and visual recognition. The workshop features invited talks from leading researchers in computational imaging and computer vision, covering topics such as health equity, low-light imaging, and face recognition in low-quality imagery. It also presents results from three challenge tracks: atmospheric turbulence mitigation, all-weather semantic segmentation, and UAV tracking/pose-estimation. A key theme is the development of robust and trustworthy algorithms that can perform well under non-ideal visual conditions and provide reliable uncertainty quantification for scientific applications.
Speakers
- Achuta Kadambi — Associate Prof. @ UCLA
- Kristina Monakhova — Postdoctoral Fellow, MIT (Incoming Assistant Professor @ Cornell CS)
- Minchul Kim — Ph.D. student @ MSU
Talks (3)
- 00:25:59 — Achuta Kadambi: On Health Equity and Computational Imaging
- This talk discusses how computational imaging can address issues algorithms alone cannot solve, using blood volume estimation as a running example, and explores biases in light-based sensors and solutions using radar.
- 00:33:02 — Kristina Monakhova: Learning noise models and error bars for low light imaging
- This talk presents methods for learning noise models for extremely low-light videography and quantifying uncertainty in denoising for multiphoton microscopy, emphasizing the importance of trustworthy algorithms for scientific imaging.
- 01:32:05 — Minchul Kim: Face Recognition in Low-Quality Imagery
- This talk explores methods for robust face recognition in low-quality imagery, focusing on adaptive margin loss functions, feature fusion techniques for video, and keypoint relative position encoding to improve performance and robustness.
Key Takeaways
- Computational imaging can address biases in light-based sensors and improve health equity by combining different modalities like radar and light.
- Developing physics-informed GANs for noise modeling allows for the creation of realistic synthetic noisy data, crucial for training robust denoisers for extreme low-light conditions.
- Quantifying uncertainty in deep learning reconstructions is vital for scientific imaging, enabling the detection of model hallucinations and driving adaptive acquisition strategies in microscopy.
- Face recognition models can be made more robust to low-quality images by using adaptive margin loss functions that emphasize hard but identifiable samples and de-emphasize unidentifiable ones.
- Feature fusion techniques, particularly two-stage paradigms involving clustering and aggregation, are effective for handling variable-length video inputs in face recognition, improving performance and efficiency.
Methods / Models / Datasets Mentioned
AdaFaceCAFaceKP-RPEV-BM4DFastDVDnetL2SIDPhysics-based noise GANQuantile regressionConformal risk controlBayesian DropoutEnsemble techniquesBlood Decomposition
Topics
Computational Photography · Visual Recognition · Atmospheric Turbulence Mitigation · Semantic Segmentation · UAV Tracking and Pose Estimation · Low Light Imaging · Noise Modeling · Uncertainty Quantification · Face Recognition · Feature Fusion
Notes
Open for commentary — connections to other work, critiques, follow-up reading.