Methods / Models / Datasets — Cross-Reference
7110 unique named entities across 243 videos
CLIP — 57 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- Multimodal AI for Edge AI (2024)
- The 3rd Monocular Depth Estimation Challenge (2024)
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- 7th Multi-modal Learning Workshop (2024)
- 7th Multi-modal Learning Workshop (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- … and 49 more
GPT-4 — 32 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- 7th Multi-modal Learning Workshop (2024)
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- Towards the 3D Human Foundation Agent (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- … and 24 more
ImageNet — 29 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Towards the 3D Human Foundation Agent (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Diffusion-based Video Generative Models (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- … and 21 more
NeRF — 23 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D/4D Generation and Modeling with Generative Priors (2024)
- Virtual Try-On Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- … and 15 more
Stable Diffusion — 22 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Diffusion-based Video Generative Models (2024)
- Anti-DreamBooth: Protecting Users from Personalized Text-to-Image Synthesis (2024)
- … and 14 more
SAM — 22 videos
- InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360° Neural Radiance Fields (2024)
- 7th Multi-modal Learning Workshop (2024)
- 7th Multi-modal Learning Workshop (2024)
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
- … and 14 more
DINO — 19 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CVPR 2024 Workshop (2024)
- CVPR MetaFood Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- ViLMa Visual Localization and Mapping (2024)
- … and 11 more
ResNet — 18 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- CVPR 2024 Workshop (2024)
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- … and 10 more
Transformer — 18 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- CVPR MetaFood Workshop (2024)
- CVPR 2024 Workshop (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- … and 10 more
DINOv2 — 17 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Workshop on Graphic Design Understanding and Generation (2024)
- CVPR 2024 Workshop (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- ViLMa Visual Localization and Mapping (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- … and 9 more
ResNet50 — 16 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CVPR MetaFood Workshop (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- Coarse-to-Fine Amodal Segmentation with Shape Prior (2024)
- CVPR 2024 Workshop (2024)
- … and 8 more
GPT-4V — 16 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- … and 8 more
BERT — 16 videos
- 7th Multi-modal Learning Workshop (2024)
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- … and 8 more
GPT-4o — 16 videos
- Towards the 3D Human Foundation Agent (2024)
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- … and 8 more
GPT-3 — 15 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- … and 7 more
Sora — 15 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Diffusion-based Video Generative Models (2024)
- … and 7 more
ControlNet — 15 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Virtual Try-On Workshop (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Diffusion-based Video Generative Models (2024)
- The First Workshop on AI for 3D Generation (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- … and 7 more
LoRA — 14 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR MetaFood Workshop (2024)
- Diffusion-based Video Generative Models (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Mobile Intelligent Photography and Imaging (2024)
- … and 6 more
ResNet-50 — 14 videos
- Image Matching: Local Features and Beyond (2024)
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- AI4Space 2024 Workshop (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- … and 6 more
ChatGPT — 13 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- … and 5 more
AlexNet — 13 videos
- Image Matching: Local Features and Beyond (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- Virtual Try-On Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- … and 5 more
LLaVA — 13 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- Towards the 3D Human Foundation Agent (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- … and 5 more
CNN — 13 videos
- Virtual Try-On Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- AI4Space 2024 Workshop (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- … and 5 more
MLP — 12 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- … and 4 more
GPT-2 — 11 videos
- Workshop on Graphic Design Understanding and Generation (2024)
- 7th Multi-modal Learning Workshop (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- … and 3 more
U-Net — 10 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- Virtual Try-On Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
- … and 2 more
COCO — 10 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- Synthetic Data for CV (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
- … and 2 more
DALL-E — 10 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- Towards the 3D Human Foundation Agent (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- … and 2 more
ViT — 10 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- … and 2 more
Gaussian Splatting — 10 videos
- Image Matching: Local Features and Beyond (2024)
- Towards the 3D Human Foundation Agent (2024)
- CVPR MetaFood Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- … and 2 more
Gemini — 10 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- CV4MS @ CVPR 2024 (2024)
- Synthetic Data for CV (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
- … and 2 more
DreamFusion — 9 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- … and 1 more
RANSAC — 9 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- CVPR 2024 Workshop (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- CVPR 2024 Tutorial (2024)
- Event-based Feature Tracking and Visual Inertial Odometry (2025)
- … and 1 more
BLIP-2 — 9 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- 7th Multi-modal Learning Workshop (2024)
- 7th Multi-modal Learning Workshop (2024)
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- CVPR 2024 Workshop (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- From Multimodal LLM to Human-level AI (2024)
- … and 1 more
PointNet — 9 videos
- CVPR 2024 Workshop (2024)
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- Virtual Try-On Workshop (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- … and 1 more
DALL-E 3 — 9 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- Panel Discussion on AI, Art, and Creativity (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
- … and 1 more
Flamingo — 9 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- CV4MS @ CVPR 2024 (2024)
- From Multimodal LLM to Human-level AI (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
- … and 1 more
Midjourney — 9 videos
- Panel Discussion on AI, Art, and Creativity (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- CVPR 2024 Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- … and 1 more
ResNet18 — 9 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- CVPR 2024 Workshop (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- … and 1 more
MNIST — 9 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- CV4MS @ CVPR 2024 (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Neuromorphic Vision Applications: From Robotic Foosball to Tracking Space Junk (2025)
- … and 1 more
CARLA — 9 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Synthetic Data for CV (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- X-WORLD: Accessibility, Vision, and Autonomy Meet (2025)
- CVPR 2025 Workshop on Autonomous Driving (2025)
- … and 1 more
GPT-3.5 — 8 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- The missing rungs on the ladder to general AI (2025)
- Foundation Models in Radiology (2025)
- Foundation Models for Vision: From Vision to Clinical Reality (2025)
VQ-VAE — 8 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- Towards the 3D Human Foundation Agent (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
VQ-GAN — 8 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
COLMAP — 8 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- CVPR MetaFood Workshop (2024)
- ViLMa Visual Localization and Mapping (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
nuScenes — 8 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Argoverse Competitions 2025 (2025)
- Scalable Autonomous Driving via Fully Data-driven Simulation (2025)
DDPM — 8 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- Computer Vision Foundation Talk/Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Diffusion-based Video Generative Models (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Diffusion Models — 8 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Scalable Neural Simulation for Autonomy (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Embodied Intelligence for Autonomous Systems on the Horizon (2025)
SIFT — 8 videos
- Image Matching: Local Features and Beyond (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- ViLMa Visual Localization and Mapping (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- Self-supervised Learning for Dynamic 3D Scene Understanding (2025)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
StyleGAN — 8 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- Panel Discussion on AI, Art, and Creativity (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- The First Workshop on AI for 3D Generation (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- 3D Foundation Models for Physical Intelligence (2024)
InstructBLIP — 8 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- From Multimodal LLM to Human-level AI (2024)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
SimCLR — 8 videos
- CVPR Tutorial June 2024: Deep Learning for Camera Physiological Measurement (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- CV4MS @ CVPR 2024 (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Foundation Models in Radiology (2025)
YOLO — 8 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Asynchronous Convolutional Networks for Object Detection in Neuromorphic Cameras (2025)
UniAD — 7 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
- Foundation models For autonomous driving (2025)
Waymo — 7 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- Scalable Autonomous Driving via Fully Data-driven Simulation (2025)
Imagen — 7 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- 3D Foundation Models for Physical Intelligence (2024)
VQGAN — 7 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- CVPR 2024 Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
Transformers — 7 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Solving Real-World Challenges of Large-Scale AV Deployment (2025)
- Unsolved problems in video understanding (2025)
- The missing rungs on the ladder to general AI (2025)
Flow Matching — 7 videos
- Computer Vision Foundation Talk/Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
ResNet-18 — 7 videos
- Image Matching: Local Features and Beyond (2024)
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- AI agents in cancer research and oncology (2025)
VGG16 — 7 videos
- Image Matching: Local Features and Beyond (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- AI4Space 2024 Workshop (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- CV4MS @ CVPR 2024 (2024)
Instant3D — 7 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- CVPR 2024 Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
LLaMA — 7 videos
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- From Multimodal LLM to Human-level AI (2024)
BLIP — 7 videos
- Panel Discussion on AI, Art, and Creativity (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
- Cross-Modal 3D Scene Understanding (2025)
Grad-CAM — 7 videos
- 5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
- CVPR 2024 Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- CV4MS @ CVPR 2024 (2024)
SLAM — 7 videos
- Computer Vision Foundation Workshop (2024)
- CVPR 2024 Workshop (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Event-based vision and processing for tiny drones (2025)
- REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
GANs — 7 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- The First Workshop on AI for 3D Generation (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Ego4D — 7 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- Generalization via Scaling Robotics (2025)
YOLOv8 — 7 videos
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
DVS — 7 videos
- All you need to know about self-driving: Intro to Self-Driving (2024)
- From Event-Based Visions to Real Systems (2025)
- iniVation Neuromorphic Vision Systems: Core Technology, Software, and Applications (2025)
- SCAMP-5: Vision Sensor with Pixel Parallel SIMD Processor Array (2025)
- Event Computer Vision 10 years Assessment: Where We Came From, Where We Are and Where We Are Heading To (2025)
- Event-Driven Sensing for a Humanoid Robot (2025)
- The development of the DVS and DAVIS sensors (2025)
LSTM — 6 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- CVPR 2024 Workshop (2024)
- Virtual Try-On Workshop (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks (2025)
PRISM-1 — 6 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
DriveDreamer — 6 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
nuPlan — 6 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
WayveScenes101 — 6 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
DALL-E 2 — 6 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- SIGGRAPH 2025 Workshop on 3D Generative AI (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
MAE — 6 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
TensorFlow — 6 videos
- Multimodal AI for Edge AI (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Applications, Software and Hardware for Event-Based Vision (2025)
- REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
- Mobile AI Workshop 2025: Introductory Talk (2025)
ShapeNet — 6 videos
- InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360° Neural Radiance Fields (2024)
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Synthetic Data for CV (2024)
VGG — 6 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- CV4MS @ CVPR 2024 (2024)
- Self-supervised Learning for Dynamic 3D Scene Understanding (2025)
CIFAR-10 — 6 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Diffusion-based Video Generative Models (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- CV4MS @ CVPR 2024 (2024)
VLM — 6 videos
- ReGenAI Workshop CVPR 2024 (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
DreamBooth — 6 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Virtual Try-On Workshop (2024)
- Diffusion-based Video Generative Models (2024)
- Anti-DreamBooth: Protecting Users from Personalized Text-to-Image Synthesis (2024)
- The First Workshop on AI for 3D Generation (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
Deep Learning — 6 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- Unsupervised Learning of Optical Flow and Camera Motion from Event Data (2025)
- Neuromorphic Vision Applications: From Robotic Foosball to Tracking Space Junk (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
DVS (Dynamic Vision Sensor) — 6 videos
- O-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event Camera (2025)
- Applications, Software and Hardware for Event-Based Vision (2025)
- Event-based Visual Odometry: A Short Tutorial (2025)
- Event-Driven Convolution-Based Processing (2025)
- Object Motion Segmentation: Advantages from Event Data (2025)
- EVPropNet: Detecting Drones By Finding Propellers For Mid-Air Landing And Following (2025)
Point-E — 5 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Ghost Gym — 5 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Lingo-2 — 5 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
VIDAR — 5 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
LDM — 5 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
PCA — 5 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- Virtual Try-On Workshop (2024)
- ViLMa Visual Localization and Mapping (2024)
- CV4MS @ CVPR 2024 (2024)
- Neuromorphic computing hardware and event-based vision: a perfect match? (2025)
SDXL — 5 videos
- Computer Vision Foundation Talk/Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
GAN — 5 videos
- Multimodal AI for Edge AI (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- AI4Space 2024 Workshop (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
ConvNeXt — 5 videos
- Multimodal AI for Edge AI (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
SuperPoint — 5 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- Image Matching: Local Features and Beyond (2024)
- CVPR MetaFood Workshop (2024)
- ViLMa Visual Localization and Mapping (2024)
- AI4Space 2024 Workshop (2024)
StyleGAN2 — 5 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- Virtual Try-On Workshop (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
EG3D — 5 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Objaverse-XL — 5 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Synthetic Data for CV (2024)
UNet — 5 videos
- CVPR 2024 Workshop (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- CV4MS @ CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
CycleGAN — 5 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- Panel Discussion on AI, Art, and Creativity (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
DenseNet — 5 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
Diffusion Model — 5 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- Generative AI by Getty Images: Addressing Concerns and Building Better Models (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Generalization via Scaling Robotics (2025)
ViperGPT — 5 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- From Multimodal LLM to Human-level AI (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
- Concept Learning Across Domains and Modalities (2025)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
InternVideo — 5 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Unsolved problems in video understanding (2025)
Segment Anything — 5 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- ViLMa Visual Localization and Mapping (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
ALIGN — 5 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- CV4MS @ CVPR 2024 (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Welcome to the workshop on Computer Vision in the Wild (CVinW) (2025)
InstructPix2Pix — 5 videos
- Panel Discussion on AI, Art, and Creativity (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- Synthetic Data for CV (2024)
OpenPose — 5 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
GCN — 5 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- CVPR 24’ Tutorial: Unifying Spectral and Spatial Graph Neural Networks (2024)
MaskGIT — 5 videos
- CVPR 2024 Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Coarse-to-Fine Amodal Segmentation with Shape Prior (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
LAION-5B — 5 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Synthetic Data for CV (2024)
Faster R-CNN — 5 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- CV4MS @ CVPR 2024 (2024)
- Synthetic Data for CV (2024)
- Concept Learning Across Domains and Modalities (2025)
Mask R-CNN — 5 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Synthetic Data for CV (2024)
- X-WORLD: Accessibility, Vision, and Autonomy Meet (2025)
UMAP — 5 videos
- CVPR MetaFood Workshop (2024)
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- CV4MS @ CVPR 2024 (2024)
IP-Adapter — 5 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Gato — 5 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
Open X-Embodiment — 5 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Workshop on Autonomous Driving (2025)
PyTorch — 5 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
- Event-Driven Convolution-Based Processing (2025)
- Mobile AI Workshop 2025: Introductory Talk (2025)
Diffusion Policy — 5 videos
- The First Workshop on AI for 3D Generation (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Diffusion models — 5 videos
- The First Workshop on AI for 3D Generation (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Synthetic Data for CV (2024)
3D Gaussian Splatting — 5 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- ViLMa Visual Localization and Mapping (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Synthetic Data for CV (2024)
LAION — 5 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Random Forest — 5 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- AI4Space 2024 Workshop (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Learning Spatiotemporal Filters to Track Visual Saliency (2025)
Instant-NGP — 5 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- CVPR 2025 - 2nd Workshop on Neural Fields Beyond Conventional Cameras (2025)
DAVIS — 5 videos
- From Event-Based Visions to Real Systems (2025)
- iniVation Neuromorphic Vision Systems: Core Technology, Software, and Applications (2025)
- Introduction of Celex Family Sensor and Event/Frame/Optical-flow Hybrid Processing (2025)
- SCAMP-5: Vision Sensor with Pixel Parallel SIMD Processor Array (2025)
- The development of the DVS and DAVIS sensors (2025)
GAIA-1 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
VISTA — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
Lingo-1 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Generative AI: Efficient, high-def & controllable (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
MCTS — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
GNN — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
GenAD — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
DriveGPT4 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
V-JEPA — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
Drive-WM — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
OccWorld — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
WoVoGen — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Llama — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 3D Foundation Models for Physical Intelligence (2024)
PID controller — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Perception and simulation for self-driving vehicles (2025)
- Neuromorphic computing hardware and event-based vision: a perfect match? (2025)
Cross-attention — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop (2024)
AMASS — 4 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- From Sim2Real 1.0 to 4.0 for Humanoid Whole-Body Control and Loco-Manipulation (2025)
- Estimating human motion in world coordinates (2025)
GigaGAN — 4 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
CNNs — 4 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- What can biological systems teach us about embodied learning? (2025)
VILA — 4 videos
- Computer Vision Foundation Talk/Workshop (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
AWQ — 4 videos
- Computer Vision Foundation Talk/Workshop (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
DPO — 4 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- THE BITTER LESSON FOR RL: VERIFICATION AS THE KEY TO REASONING LLMS (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
MobileNetV2 — 4 videos
- Multimodal AI for Edge AI (2024)
- The 20th Embedded Vision Workshop (EVW2024) (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
SuperGlue — 4 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- Image Matching: Local Features and Beyond (2024)
- CVPR MetaFood Workshop (2024)
- AI4Space 2024 Workshop (2024)
Cityscapes — 4 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- X-WORLD: Accessibility, Vision, and Autonomy Meet (2025)
- IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme (2025)
ResNet-34 — 4 videos
- Image Matching: Local Features and Beyond (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- CV4MS @ CVPR 2024 (2024)
- Lifting Monocular Events to 3D Human Poses (2025)
ProlificDreamer — 4 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Magic3D — 4 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Segment Anything Model (SAM) — 4 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- CVPR 2024 Workshop (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
Visual Genome — 4 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Synthetic Data for CV (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
ViT (Vision Transformer) — 4 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Mobile AI Workshop 2025: Introductory Talk (2025)
PointNet++ — 4 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Concept Learning Across Domains and Modalities (2025)
HMR — 4 videos
- Towards the 3D Human Foundation Agent (2024)
- Virtual Try-On Workshop (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Estimating human motion in world coordinates (2025)
SMPL — 4 videos
- Towards the 3D Human Foundation Agent (2024)
- Virtual Try-On Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
PaLM — 4 videos
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- CV4MS @ CVPR 2024 (2024)
- From Multimodal LLM to Human-level AI (2024)
BLIP2 — 4 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
Shap-E — 4 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 3D Foundation Models for Physical Intelligence (2024)
ByteTrack — 4 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- The 20th Embedded Vision Workshop (EVW2024) (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
TensorRT — 4 videos
- The 20th Embedded Vision Workshop (EVW2024) (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
LiDAR — 4 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
Mixup — 4 videos
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
Muse — 4 videos
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
StyleDrop — 4 videos
- CVPR 2024 Workshop (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
VideoPoet — 4 videos
- CVPR 2024 Workshop (2024)
- Diffusion-based Video Generative Models (2024)
- From Multimodal LLM to Human-level AI (2024)
- Generalization via Scaling Robotics (2025)
LLM — 4 videos
- ReGenAI Workshop CVPR 2024 (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
CutMix — 4 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
CLIPScore — 4 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Diffusion-based Video Generative Models (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
VideoMAE — 4 videos
- CVPR 2024 Workshop (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Unsolved problems in video understanding (2025)
- What can biological systems teach us about embodied learning? (2025)
Transformer Encoder — 4 videos
- CVPR 2024 Workshop (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
Objaverse — 4 videos
- CVPR 2024 Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
AtlasNet — 4 videos
- Virtual Try-On Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 3D Foundation Models for Physical Intelligence (2024)
LINGO-1 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
LINGO-2 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
COMPASS — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
LM-Nav — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
RT-1 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
BLIP-2 Q-Former — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Vicuna-7B — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
ST-P3 — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Waymo Open Dataset — 4 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Solving Real-World Challenges of Large-Scale AV Deployment (2025)
- CVPR 2025 Workshop on Autonomous Driving (2025)
Llama-2 — 4 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Vision and Language Algorithmic Reasoning Work & SMART-101 Challenge Awards (2025)
- Unsolved problems in video understanding (2025)
GET3D — 4 videos
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- 3D Foundation Models for Physical Intelligence (2024)
SDEdit — 4 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
ScanNet — 4 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Cross-Modal 3D Scene Understanding (2025)
Co-DETR — 4 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
- Second Egocentric Vision (EgoVis) Workshop (2025)
DinoV2 — 4 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- CV4MS @ CVPR 2024 (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
Foundation Models — 4 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
VAE — 4 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- Mobile Intelligent Photography and Imaging (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- 3D Foundation Models for Physical Intelligence (2024)
RT-2 — 4 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
DriveVLM — 4 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
- Foundation models For autonomous driving (2025)
SigLIP — 4 videos
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- From Multimodal LLM to Human-level AI (2024)
IBM TrueNorth — 4 videos
- Object and Action Recognition on the Event-Based IBM TrueNorth Processor (2025)
- Event Computer Vision 10 years Assessment: Where We Came From, Where We Are and Where We Are Heading To (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
- Reconstruction, Motion Estimation and SLAM from Events (2025)
ATIS — 4 videos
- Introduction of Celex Family Sensor and Event/Frame/Optical-flow Hybrid Processing (2025)
- SCAMP-5: Vision Sensor with Pixel Parallel SIMD Processor Array (2025)
- Event Computer Vision 10 years Assessment: Where We Came From, Where We Are and Where We Are Heading To (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
SpiNNaker — 4 videos
- Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
- Event-Driven Convolution-Based Processing (2025)
- Novel Hardware for Spatial AI (2025)
- Event-Driven Sensing for a Humanoid Robot (2025)
GLIDE — 3 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- SIGGRAPH 2025 Workshop on 3D Generative AI (2025)
PPO — 3 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Scalable Autonomous Driving via Fully Data-driven Simulation (2025)
RAG-driver — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
LMdrive — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Nuro — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Drive Anywhere — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
IRIS — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Welcome to the Workshop on Responsible Data! (2024)
SEM2 — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
DriveWorld — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
TrafficBots — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
SubjectDrive — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
UniWorld — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
MUVO — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Think2Drive — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
DriveLM — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
MP3 — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Q-Former — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Flan-T5 — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- From Multimodal LLM to Human-level AI (2024)
VIVIT — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
Nerfstudio — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- How to Train Your Humanoid: From Human Mesh Recovery to VideoMimic (2025)
HyperNeRF — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
Nerfies — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
DriveSim — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Waabi World — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Waymo's Waymax — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
KITTI-360 — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
NMP — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Large Language Model — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CarLLaVA — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Embodied Intelligence for Autonomous Systems on the Horizon (2025)
Copilot4D — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
NeuRAD — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
Chinchilla — 3 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- From Multimodal LLM to Human-level AI (2024)
LRM — 3 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Zero-1-to-3 — 3 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
IM-3D — 3 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The First Workshop on AI for 3D Generation (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Contrastive Learning — 3 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Multimodal AI for Edge AI (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
SkySense — 3 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
SmoothQuant — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
TinyChat — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
TensorRT-LLM — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
DIT — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Diffusion-based Video Generative Models (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Rectified Flow — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
EDM — 3 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
CLEVR — 3 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Concept Learning Across Domains and Modalities (2025)
ADE20K — 3 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- The 3rd Monocular Depth Estimation Challenge (2024)
- From Multimodal LLM to Human-level AI (2024)
TensorFlow Lite — 3 videos
- Multimodal AI for Edge AI (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
OpenCV — 3 videos
- Multimodal AI for Edge AI (2024)
- Applications, Software and Hardware for Event-Based Vision (2025)
- Neuromorphic vision for humanoid robots (2025)
Depth Anything — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
Marigold — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
LoFTR — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- Image Matching: Local Features and Beyond (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
ZoeDepth — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Metric3D — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- 3D Foundation Models for Physical Intelligence (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
MiDaS — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Deep Stereo Matching in the Twenties (2024)
KITTI — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- X-WORLD: Accessibility, Vision, and Autonomy Meet (2025)
Kinect — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Synthetic Data for CV (2024)
MIM — 3 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion (2025)
DINO-ViT — 3 videos
- Image Matching: Local Features and Beyond (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
LightGlue — 3 videos
- Image Matching: Local Features and Beyond (2024)
- ViLMa Visual Localization and Mapping (2024)
- Synthetic Data for CV (2024)
NetVLAD — 3 videos
- Image Matching: Local Features and Beyond (2024)
- ViLMa Visual Localization and Mapping (2024)
- Workshop on Autonomous Driving (2025)
ResNet-101 — 3 videos
- Image Matching: Local Features and Beyond (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Generalization via Scaling Robotics (2025)
DBSCAN — 3 videos
- Image Matching: Local Features and Beyond (2024)
- 7th Multi-modal Learning Workshop (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
MeshGPT — 3 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 3D Foundation Models for Physical Intelligence (2024)
SyncDreamer — 3 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Ego-Exo4D — 3 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
ImageNet-1K — 3 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Synthetic Data for CV (2024)
MiniGPT-4 — 3 videos
- 7th Multi-modal Learning Workshop (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
Graph Neural Networks (GNNs) — 3 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- CVPR 24’ Tutorial: Unifying Spectral and Spatial Graph Neural Networks (2024)
OpenScene — 3 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- ViLMa Visual Localization and Mapping (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
NeRFs — 3 videos
- Towards the 3D Human Foundation Agent (2024)
- The First Workshop on AI for 3D Generation (2024)
- ViLMa Visual Localization and Mapping (2024)
Mask2Former — 3 videos
- Towards the 3D Human Foundation Agent (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
LaMDA — 3 videos
- The 13th Women in Computer Vision (WiCV) Workshop (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
- From Multimodal LLM to Human-level AI (2024)
ActivityNet-QA — 3 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
EgoSchema — 3 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
NEXT-QA — 3 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
MVCNN — 3 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
Magic123 — 3 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
MVDream — 3 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GPT — 3 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- CV4MS @ CVPR 2024 (2024)
Mask3D — 3 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- All You Need to Know about Point Cloud Understanding (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
Inverse Kinematics — 3 videos
- Human Motion Generation (HuMoGen) Workshop (2024)
- From Sim2Real 1.0 to 4.0 for Humanoid Whole-Body Control and Loco-Manipulation (2025)
- Humanoid Policy ~ Human Policy (2025)
BigGAN — 3 videos
- Panel Discussion on AI, Art, and Creativity (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Swin Transformer — 3 videos
- OmniCV 2024 Workshop (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
L1 loss — 3 videos
- PBVS 2024 Workshop: Challenges and Results (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
AUROC — 3 videos
- PBVS 2024 Workshop: Challenges and Results (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
t-SNE — 3 videos
- 5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
Emu — 3 videos
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- ReGenAI Workshop CVPR 2024 (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
LLMs — 3 videos
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
MedSAM — 3 videos
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- CV4MS @ CVPR 2024 (2024)
- AI agents in cancer research and oncology (2025)
Latent Diffusion — 3 videos
- CVPR 2024 Workshop (2024)
- Diffusion-based Video Generative Models (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
SR3 — 3 videos
- ReGenAI Workshop CVPR 2024 (2024)
- Mobile Intelligent Photography and Imaging (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Stable Diffusion XL — 3 videos
- ReGenAI Workshop CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
RAG — 3 videos
- ReGenAI Workshop CVPR 2024 (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Learning the Language of Patients (2025)
DataComp — 3 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Synthetic Data for CV (2024)
Swin-L — 3 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Synthetic Data for CV (2024)
OpenCLIP — 3 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
DETR — 3 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- CV4MS @ CVPR 2024 (2024)
CLIP Image Encoder — 3 videos
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
DDIM Inversion — 3 videos
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
ZeroScope — 3 videos
- CVPR 2024 Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
I3D — 3 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Dataset Distillation: A Comprehensive Review (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
HyperDreamBooth — 3 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
ZipLoRA — 3 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Textual Inversion — 3 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Anti-DreamBooth: Protecting Users from Personalized Text-to-Image Synthesis (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
MoCo — 3 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Welcome to the Workshop on Responsible Data! (2024)
- CV4MS @ CVPR 2024 (2024)
VQA — 3 videos
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- From Multimodal LLM to Human-level AI (2024)
ImageDream — 3 videos
- Virtual Try-On Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
LORA finetuning — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Vista — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- Embodied Intelligence for Autonomous Systems on the Horizon (2025)
Alexnet — 3 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
AlphaGo — 3 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- AI4Space 2024 Workshop (2024)
- THE BITTER LESSON FOR RL: VERIFICATION AS THE KEY TO REASONING LLMS (2025)
Inception-v3 — 3 videos
- Diffusion-based Video Generative Models (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
MinkUNet — 3 videos
- All You Need to Know about Point Cloud Understanding (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
LLaVA-1.5 — 3 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- From Multimodal LLM to Human-level AI (2024)
TextVQA — 3 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
- From Multimodal LLM to Human-level AI (2024)
GradCAM — 3 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- ICCVW 2023 VLAR Session 3 (2025)
GradCAM++ — 3 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
ODIN — 3 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
- 3D Foundation Models for Physical Intelligence (2024)
SHAP — 3 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- CVPR 2024 Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
CAT3D — 3 videos
- The First Workshop on AI for 3D Generation (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DeepSDF — 3 videos
- The First Workshop on AI for 3D Generation (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Genie — 3 videos
- The First Workshop on AI for 3D Generation (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
SDF — 3 videos
- The First Workshop on AI for 3D Generation (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
KNN — 3 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Machine Learning for Geometric Shape Analysis (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
ScanNet++ — 3 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- ViLMa Visual Localization and Mapping (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GroundingDINO — 3 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
- Foundational Few-Shot Object Detection Challenge (2025)
ImageNetV2 — 3 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
LAION-2B — 3 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
LAION-400M — 3 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
Bard — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- VISION-AND-LANGUAGE ALGORITHMIC REASONING (VLAR) (2025)
- The missing rungs on the ladder to general AI (2025)
Kinetics — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- Generalization via Scaling Robotics (2025)
MPC — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- What does Embodied Intelligence mean? Lessons Learned from Drone Racing (2025)
Octo — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
PIVOT — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- CVPR 2024 Workshop (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
RT-2-X — 3 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
Vision Transformer — 3 videos
- CVPR 2024 Workshop (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
Block-NeRF — 3 videos
- ViLMa Visual Localization and Mapping (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
SUDS — 3 videos
- ViLMa Visual Localization and Mapping (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
SGD — 3 videos
- Dataset Distillation: A Comprehensive Review (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- Neuromorphic computing hardware and event-based vision: a perfect match? (2025)
Grad-CAM++ — 3 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
ImageBind — 3 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- From Multimodal LLM to Human-level AI (2024)
- Cross-Modal 3D Scene Understanding (2025)
YOLOv7 — 3 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Vision and Language Algorithmic Reasoning Work & SMART-101 Challenge Awards (2025)
Common Crawl — 3 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
RLHF — 3 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- The missing rungs on the ladder to general AI (2025)
OCR — 3 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- From Multimodal LLM to Human-level AI (2024)
- Vision and Language Algorithmic Reasoning Work & SMART-101 Challenge Awards (2025)
PaLI — 3 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
RedCaps — 3 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Synthetic Data for CV (2024)
ResNet34 — 3 videos
- Machine Learning for Geometric Shape Analysis (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- v2e: From Video Frames to Realistic DVS Events (2025)
GAIA — 3 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Transfuser — 3 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario (2025)
Reinforcement Learning — 3 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Towards intelligent robots (2025)
- What does Embodied Intelligence mean? Lessons Learned from Drone Racing (2025)
DrivingGaussian — 3 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Scalable Neural Simulation for Autonomy (2025)
MotionLM — 3 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation models For autonomous driving (2025)
- Workshop on Embodied Intelligence for Autonomous Systems on the Horizon (2025)
UniSim — 3 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
VAD — 3 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation models For autonomous driving (2025)
- Embodied Intelligence for Autonomous Systems on the Horizon (2025)
Waymax — 3 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Perception and simulation for self-driving vehicles (2025)
- Scalable Autonomous Driving via Fully Data-driven Simulation (2025)
MS-COCO — 3 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Synthetic Data for CV (2024)
- Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
RNN — 3 videos
- AI4Space 2024 Workshop (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- Towards intelligent robots (2025)
HRNet — 3 videos
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
OpenVINO — 3 videos
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
CLIP (Contrastive Language-Image Pre-training) — 3 videos
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
SegFormer — 3 videos
- 4th Workshop on Computer Vision in the Built Environment (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
ObjectNet — 3 videos
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
- Synthetic Data for CV (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
RAFT — 3 videos
- CV4MS @ CVPR 2024 (2024)
- Synthetic Data for CV (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
MegaDepth — 3 videos
- 3D Foundation Models for Physical Intelligence (2024)
- Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
- Event-based Cameras: Challenges and Opportunities (2025)
BEVFormer — 3 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Scalable Neural Simulation for Autonomy (2025)
NAVSIM — 3 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- CVPR 2025 Workshop on Autonomous Driving (2025)
DART — 3 videos
- All you need to know about self-driving: Intro to Self-Driving (2024)
- Spiking Neural Networks for Event-based Vision (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
mPLUG-Owl — 3 videos
- From Multimodal LLM to Human-level AI (2024)
- Unsolved problems in video understanding (2025)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
Intel Loihi — 3 videos
- Event-based attention and tracking on neuromorphic hardware (2025)
- Event-based Cameras: Challenges and Opportunities (2025)
- Event Computer Vision 10 years Assessment: Where We Came From, Where We Are and Where We Are Heading To (2025)
YOLOv3 — 3 videos
- v2e: From Video Frames to Realistic DVS Events (2025)
- Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
- Event-based Cameras: Challenges and Opportunities (2025)
ROS — 3 videos
- iniVation Neuromorphic Vision Systems: Core Technology, Software, and Applications (2025)
- Event-based vision and processing for tiny drones (2025)
- REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
SNN — 3 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
- Hardware and Algorithm Co-design with Event Sensors (2025)
SCAMP — 3 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
- SCAMP-5: Vision Sensor with Pixel Parallel SIMD Processor Array (2025)
Spinnaker — 3 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
- Neuromorphic vision for humanoid robots (2025)
TrueNorth — 3 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Spiking Neural Networks for Event-based Vision (2025)
- Event-Driven Sensing for a Humanoid Robot (2025)
NAVSIM v2 — 3 videos
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
- NAVSIM v2: Pseudo-Simulation for Autonomous Driving & ICCV 2025 Challenge Winner Presentation (2025)
- Workshop on Embodied Intelligence for Autonomous Systems on the Horizon (2025)
Denoising Diffusion GANs (DDG) — 3 videos
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
f-Distill — 3 videos
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
DiffPD — 2 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
MuJoCo — 2 videos
- Computational Design of Diverse Morphologies and Sensors for Vision & Robotics (2024)
- From Sim2Real 1.0 to 4.0 for Humanoid Whole-Body Control and Loco-Manipulation (2025)
DOROTHIE — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Dreamer v1 — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Dreamer v2 — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Dreamer v3 — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Phenaki — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
MILE — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Panacea — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
LidarDM — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Iso-Dream — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
DriveAGI — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
ELM — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
DriveAdapter — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Embodied Intelligence for Autonomous Systems on the Horizon (2025)
Model Predictive Control — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
NSFF — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
D-NeRF — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Carla — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
CNN E2E — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
CILRS — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
SafeDagger — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
BDD-X — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
PlanT — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Transformer Block — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Repurposing generative models for 3D data: Towards a generative model-powered neural simulator (2025)
Lingo-Judge — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Vision Encoder — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
DiT — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
PixArt-α — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
T5 — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- From Multimodal LLM to Human-level AI (2024)
Instruct-NeRF2NeRF — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
Neus — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- Image Matching: Local Features and Beyond (2024)
ConvNext — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
TextMesh — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D Foundation Models for Physical Intelligence (2024)
MeshLRM — 2 videos
DreamScene4D — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
MV-Dream — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Splatter Image — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The First Workshop on AI for 3D Generation (2024)
Free3D — 2 videos
- 3D Generative AI: Efficient, high-def & controllable (2024)
- The First Workshop on AI for 3D Generation (2024)
Sentinel-2 — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- AI4Space 2024 Workshop (2024)
Supervised Learning — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Towards intelligent robots (2025)
SatMAE — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
Scale-MAE — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
USat — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
K-Means — 2 videos
- Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
MCUNet — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
TinyNAS — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
TinyEngine — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
StyleGAN-T — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
CrossDIT — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
LADD — 2 videos
- Computer Vision Foundation Talk/Workshop (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
NuScenes — 2 videos
- Scenic: An Open-Source Probabilistic Programming System for Data Generation and Safety in AI-Based Autonomy (2024)
- ViLMa Visual Localization and Mapping (2024)
Kubric — 2 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
TokenCut — 2 videos
- CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
- Image Matching: Local Features and Beyond (2024)
ONNX Runtime — 2 videos
- Multimodal AI for Edge AI (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
Pixel Shuffle — 2 videos
- Multimodal AI for Edge AI (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
NFNet — 2 videos
MNASNet — 2 videos
Knowledge Distillation — 2 videos
DistilBERT — 2 videos
Keras — 2 videos
PackNet — 2 videos
ArgoVerse — 2 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
MLPs — 2 videos
- The 3rd Monocular Depth Estimation Challenge (2024)
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
InternImage — 2 videos
FlexDM — 2 videos
- Workshop on Graphic Design Understanding and Generation (2024)
- Workshop on Graphic Design Understanding and Generation (2024)
DreamSim — 2 videos
- Workshop on Graphic Design Understanding and Generation (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Nerf — 2 videos
- Image Matching: Local Features and Beyond (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
VGGSfM — 2 videos
- Image Matching: Local Features and Beyond (2024)
- 3D Foundation Models for Physical Intelligence (2024)
PatchNetVLAD — 2 videos
MixVPR — 2 videos
SGM — 2 videos
ORB-SLAM3 — 2 videos
- Image Matching: Local Features and Beyond (2024)
- CVPR 2024 Workshop (2024)
PoseDiffusion — 2 videos
- Image Matching: Local Features and Beyond (2024)
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
StyleGAN2-ADA — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
StyleGAN3 — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- Machine Learning for Geometric Shape Analysis (2024)
DeepCAD — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
HiFA — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Wonder3D — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 3D Foundation Models for Physical Intelligence (2024)
LucidDreamer — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- 3D Foundation Models for Physical Intelligence (2024)
PhysDreamer — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Dream-in-4D — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- Machine Learning for Geometric Shape Analysis (2024)
4Real — 2 videos
- 3D/4D Generation and Modeling with Generative Priors (2024)
- The First Workshop on AI for 3D Generation (2024)
LeNet — 2 videos
DGCNN — 2 videos
Aurora — 2 videos
InfoNCE loss — 2 videos
JAX library — 2 videos
- CVPR 2024 Workshop (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
GoogleNet — 2 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
Inception — 2 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
AutoAugment — 2 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
DARTS — 2 videos
SNIP — 2 videos
- CVPRW-NAS 2024 - Day 1 Session 1 (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
CIFAR-100 — 2 videos
Tiny-ImageNet — 2 videos
SORA — 2 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
GIT — 2 videos
- Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
- CV4MS @ CVPR 2024 (2024)
T-SNE — 2 videos
- 7th Multi-modal Learning Workshop (2024)
- ICCVW 2023 VLAR Session 3 (2025)
LLaVA 1.5 — 2 videos
DeepWalk — 2 videos
- Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
- CVPR 24’ Tutorial: Unifying Spectral and Spatial Graph Neural Networks (2024)
TokenHMR — 2 videos
BEDLAM — 2 videos
HMR 2.0 — 2 videos
DexCap — 2 videos
EGO-EXO4D — 2 videos
- Towards the 3D Human Foundation Agent (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
Med-PaLM — 2 videos
Med-PaLM 2 — 2 videos
Med-Gemini — 2 videos
UNet++ — 2 videos
TVQA — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
MovieChat — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- From Multimodal LLM to Human-level AI (2024)
MoRevQA — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
JCEF — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
EPIC-KITCHENS — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Omnivore — 2 videos
- Multimodal Foundational Models: MM Video Understanding & Vision-Language Guided Robotics (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
VIT — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
DUST3R — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Ego-Exo 4D — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- ViLMa Visual Localization and Mapping (2024)
BlockNeRF — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
ZipNeRF — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- The First Workshop on AI for 3D Generation (2024)
Total-Recon — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
L4GM — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
Vicuna — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- From Multimodal LLM to Human-level AI (2024)
LGM — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 3D Foundation Models for Physical Intelligence (2024)
LATTE3D — 2 videos
- 2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Quest 3 — 2 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- ViLMa Visual Localization and Mapping (2024)
Gaussian Splatting (3DGS) — 2 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
OpenNeRF — 2 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- ViLMa Visual Localization and Mapping (2024)
DeepMetaHandles — 2 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
MOTR — 2 videos
- CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
CLIP text encoder — 2 videos
- Human Motion Generation (HuMoGen) Workshop (2024)
- CVPR 2024 Workshop (2024)
PhysicsVAE [Won et al. SIGGRAPH 2022] — 2 videos
EgoGen — 2 videos
- Human Motion Generation (HuMoGen) Workshop (2024)
- Estimating human motion in world coordinates (2025)
Emu (Meta) — 2 videos
Imagen (Google) — 2 videos
Firefly (Adobe) — 2 videos
SDXL (Stability.ai) — 2 videos
Variational Score Distillation — 2 videos
- Panel Discussion on AI, Art, and Creativity (2024)
- 3D Foundation Models for Physical Intelligence (2024)
SCAMP-7 — 2 videos
- The 20th Embedded Vision Workshop (EVW2024) (2024)
- SCAMP-5: Vision Sensor with Pixel Parallel SIMD Processor Array (2025)
YOLOv8m — 2 videos
- The 20th Embedded Vision Workshop (EVW2024) (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
YOLOv8x — 2 videos
DeepStream — 2 videos
Mapillary — 2 videos
- OmniCV 2024 Workshop (2024)
- ViLMa Visual Localization and Mapping (2024)
Neural Radiance Fields (NeRFs) — 2 videos
L2 loss — 2 videos
LSTMs — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
FISTA — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
FCN — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
Self-Attention — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
RandAugment — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- Dataset Distillation: A Comprehensive Review (2024)
AugMix — 2 videos
YOLO V8 — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
VGG19 — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
MobileNet — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
NASNet — 2 videos
- LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
PGD — 2 videos
- 5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Swin-Transformer — 2 videos
- 5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
SimMIM — 2 videos
- 5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
Firefly — 2 videos
- 7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
- The First Workshop on AI for 3D Generation (2024)
X-ray — 2 videos
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
Grounded-SAM — 2 videos
- CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
DepthFM — 2 videos
- ReGenAI Workshop CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
LCM-LoRA — 2 videos
- ReGenAI Workshop CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
CFM — 2 videos
- ReGenAI Workshop CVPR 2024 (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Claude — 2 videos
- ReGenAI Workshop CVPR 2024 (2024)
- CV4MS @ CVPR 2024 (2024)
OWLv2 — 2 videos
ARC — 2 videos
SIGMA — 2 videos
GPT-1 — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- From Multimodal LLM to Human-level AI (2024)
D-RFCN + SNIP — 2 videos
NAS-FPN — 2 videos
DyHead — 2 videos
FocalNet-H (DINO) — 2 videos
T-MARS — 2 videos
CLIPA-v2 — 2 videos
Qwen-VL-Plus — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
GLIGEN — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Diffusion-based Video Generative Models (2024)
ImageReward — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
PickScore — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
HPSv2 — 2 videos
- CVPR 2024 Workshop: Multimodal Foundation Models (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
MMD (Maximum Mean Discrepancy) — 2 videos
Structure from Motion (SfM) — 2 videos
- CVPR 2024 Workshop (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
S-GAN — 2 videos
- CVPR 2024 Workshop (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Transformer Decoder — 2 videos
Self-attention — 2 videos
FFN — 2 videos
- CVPR 2024 Workshop (2024)
- CVPR 2025 - 2nd Workshop on Neural Fields Beyond Conventional Cameras (2025)
PixSFM — 2 videos
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
XMem — 2 videos
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
FoodLearner — 2 videos
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
Image-Informed Text Encoder — 2 videos
- CVPR MetaFood Workshop (2024)
- CVPR MetaFood Workshop (2024)
Graph Neural Network — 2 videos
- CVPR MetaFood Workshop (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
Intrinsic-LoRA (I-LoRA) — 2 videos
DINO-v2 — 2 videos
- CVPR 2024 Workshop (2024)
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
StyleGAN v2 — 2 videos
StyleGAN-XL — 2 videos
Pix2Video — 2 videos
- CVPR 2024 Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
Control-A-Video — 2 videos
- CVPR 2024 Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
Layered Neural Atlases — 2 videos
- CVPR 2024 Workshop (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
MVImgNet — 2 videos
LRM (Large Reconstruction Model) — 2 videos
- CVPR 2024 Workshop (2024)
- The First Workshop on AI for 3D Generation (2024)
EmuEdit — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
Parti — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
CIFAR — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Welcome to the Workshop on Responsible Data! (2024)
VDM++ — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
RIN — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
CTM — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Diff-Instruct — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
PerFlow — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
UFO-Gen — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
InstaFlow-1.7B — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Lumiere — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- Diffusion-based Video Generative Models (2024)
DreamBooth3D — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- The First Workshop on AI for 3D Generation (2024)
Alchemist — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- The First Workshop on AI for 3D Generation (2024)
Custom Diffusion — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Dreambooth — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
BLIPDiffusion — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
SUTI — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
E4T-Diffusion — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
AnyDoor — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
FastComposer — 2 videos
- GenAI Media Generation Challenge Workshop @ CVPR (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
MTGS — 2 videos
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Panoptic Segmentation — 2 videos
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- ViLMa Visual Localization and Mapping (2024)
Blender — 2 videos
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- Synthetic Data for CV (2024)
Unreal Engine — 2 videos
- The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
- Synthetic Data for CV (2024)
Gaussian Splats — 2 videos
CLIP score — 2 videos
- Generative AI by Getty Images: Addressing Concerns and Building Better Models (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
BYOL — 2 videos
- CVPR Tutorial June 2024: Deep Learning for Camera Physiological Measurement (2024)
- Foundation Models in Radiology (2025)
LingoQA — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
TransFuser — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
ADriver-I — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Argoverse 2 — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- Argoverse Competitions 2025 (2025)
OpenPilot — 2 videos
- CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
- CVPR 2024 Workshop on Autonomous Driving (2024)
Variational Autoencoders (VAEs) — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Dataset Distillation: A Comprehensive Review (2024)
Energy-based models — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
Contrastive Energy Model — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
Score Denoising — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
Product of Expert — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- Disentanglement and Compositionality in Artificial Intelligence (2024)
HuggingGPT — 2 videos
- Disentanglement and Compositionality in Artificial Intelligence (2024)
- From Multimodal LLM to Human-level AI (2024)
DDIM — 2 videos
- Diffusion-based Video Generative Models (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Snap Video — 2 videos
FVD — 2 videos
- Diffusion-based Video Generative Models (2024)
- Multi-stage reasoning for video understanding & scene generation (2025)
Safe Diffusion — 2 videos
- Diffusion-based Video Generative Models (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
MSR-VTT — 2 videos
ViT-B — 2 videos
- CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
NegGrad — 2 videos
- Machine Unlearning in Computer Vision: Foundations and Applications (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
KPConv — 2 videos
- All You Need to Know about Point Cloud Understanding (2024)
- Machine Learning for Geometric Shape Analysis (2024)
Transformer-XL — 2 videos
SAN — 2 videos
- All You Need to Know about Point Cloud Understanding (2024)
- Concept Learning Across Domains and Modalities (2025)
PointNext — 2 videos
- All You Need to Know about Point Cloud Understanding (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
Gemini 1.5 Pro — 2 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
Gemini 1.5 Flash — 2 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
Qwen-VL-Chat — 2 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- From Multimodal LLM to Human-level AI (2024)
POPE — 2 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- From Multimodal LLM to Human-level AI (2024)
CUDA — 2 videos
- CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
- Video Foundation Models: From Black Boxes to Controllable Representations (2024)
cuBLAS — 2 videos
RISE — 2 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
Mahalanobis — 2 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- Scalable Real-Time Abnormal Event Detection (2024)
LRP — 2 videos
- Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
- CVPR 2024 Workshop (2024)
CelebA-HQ — 2 videos
- Anti-DreamBooth: Protecting Users from Personalized Text-to-Image Synthesis (2024)
- Welcome to the Workshop on Responsible Data! (2024)
Masked Transformer — 2 videos
- Coarse-to-Fine Amodal Segmentation with Shape Prior (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
4D-fy — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
Co-Tracker — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DIBR — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DMTet — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DatasetGAN — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DefGrid — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
DefTet — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Dr. Robot — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Dream Machine — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
DreamCraft3D — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Dreamitate — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Fantasia3D — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
FlexiCubes — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GPT4 — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GRAF — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GaussianDreamer — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
MipNeRF360 — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 23675 First Workshop on Efficient and On Device Generation EDGE (2024)
SV3D — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Text2Tex — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
UniDepth — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Zero123 — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Zero123-XL — 2 videos
- The First Workshop on AI for 3D Generation (2024)
- 3D Foundation Models for Physical Intelligence (2024)
BPNet — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Machine Learning for Geometric Shape Analysis (2024)
BundleFusion — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 3D Foundation Models for Physical Intelligence (2024)
CoTracker — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
GenZI — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 3D Foundation Models for Physical Intelligence (2024)
GeoNet — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
LVIS — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
Lidar — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Synthetic Data for CV (2024)
Mip-NeRF — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
Neural Feature Fusion Fields (N3F) — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
Neural Radiance Fields (NeRF) — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- Synthetic Data for CV (2024)
Panoptic Lifting — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- ViLMa Visual Localization and Mapping (2024)
SAM (Segment Anything Model) — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
SAPIEN — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 23598 The 5th Annual Embodied AI Workshop (2024)
SMAL — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
SceneScript — 2 videos
- 2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
- ViLMa Visual Localization and Mapping (2024)
ConsistDreamer — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- ViLMa Visual Localization and Mapping (2024)
FocalNet-Huge — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
GCC-PHAT — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
GPT-4V(ision) — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
GelSight — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
IID — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
LLaVA-NeXT — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- From Multimodal LLM to Human-level AI (2024)
MQ-GLIP — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
MSVD-QA — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
MixPL — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
MonoCLR — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
OPT — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- From Multimodal LLM to Human-level AI (2024)
Ours-L2R — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
PSG — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- From Multimodal LLM to Human-level AI (2024)
PartCLIPSeg — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
REACT — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
RichSem-DINO-FocalNet — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
SD-XL — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
StereoCRW — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
Superglue — 2 videos
- VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
- Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
DROID dataset — 2 videos
GENIE — 2 videos
Grounding DINO — 2 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
HMD2 — 2 videos
HoloDeck — 2 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Nymeria — 2 videos
Octo 55B — 2 videos
Octo 93M — 2 videos
PartNet-Mobility — 2 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
Project Aria — 2 videos
Python — 2 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- iniVation Neuromorphic Vision Systems: Core Technology, Software, and Applications (2025)
RT-1-X — 2 videos
- 23598 The 5th Annual Embodied AI Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
RoboNet — 2 videos
ACE — 2 videos
- CVPR 2024 Workshop (2024)
- ViLMa Visual Localization and Mapping (2024)
Active learning — 2 videos
CAF — 2 videos
EWC — 2 videos
- CVPR 2024 Workshop (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
EfficientNet-B0 — 2 videos
- CVPR 2024 Workshop (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
Fine-tuning — 2 videos
FixMatch — 2 videos
- CVPR 2024 Workshop (2024)
- HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation (2025)
Joint Training — 2 videos
- CVPR 2024 Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
LIME — 2 videos
- CVPR 2024 Workshop (2024)
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
Large Language Models — 2 videos
SID — 2 videos
- CVPR 2024 Workshop (2024)
- BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models (2025)
WordNet — 2 videos
- CVPR 2024 Workshop (2024)
- Synthetic Data for CV (2024)
FPN — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- CV4MS @ CVPR 2024 (2024)
HaMeR — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
LLaMA-2 — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Foundation Models for Vision: From Vision to Clinical Reality (2025)
Mamba — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Second Egocentric Vision (EgoVis) Workshop (2025)
RoBERTa — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- VizWiz Grand Challenge: Opening Remarks (2024)
SlowFast — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Unsolved problems in video understanding (2025)
TSN — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- Dataset Distillation: A Comprehensive Review (2024)
TimeSformer — 2 videos
- First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
Contrastive Loss — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
DSO — 2 videos
OpenMask3D — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
PoseNet — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- Real-Time 6DOF Pose Relocalization for Event Cameras with Stacked Spatial LSTM Networks (2025)
RealEstate10K — 2 videos
SceneFun3D — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
VIO — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- What does Embodied Intelligence mean? Lessons Learned from Drone Racing (2025)
VLAD — 2 videos
- ViLMa Visual Localization and Mapping (2024)
- CV4MS @ CVPR 2024 (2024)
AutoDAN — 2 videos
- Dataset Distillation: A Comprehensive Review (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
Generative Adversarial Networks (GANs) — 2 videos
- Dataset Distillation: A Comprehensive Review (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Llama 2 — 2 videos
- Dataset Distillation: A Comprehensive Review (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Llama 3 — 2 videos
MiniGPT4 — 2 videos
S3D — 2 videos
Stable Signature — 2 videos
- Dataset Distillation: A Comprehensive Review (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
StegaStamp — 2 videos
- Dataset Distillation: A Comprehensive Review (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
CFLOW — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Cross-entropy loss — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
CutPaste — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
DMR — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- Hardware and Algorithm Co-design with Event Sensors (2025)
Entropy — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- N-ROD: a Neuromorphic Dataset for Synthetic-to-Real Domain Adaptation (2025)
MVtec AD — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Masked Auto-Encoder (MAE) — 2 videos
PaDiM — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
PatchCore — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
SPADE — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
ST-GCN — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
STG-NF — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
VisA — 2 videos
- Scalable Real-Time Abnormal Event Detection (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Ablation-CAM — 2 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
Backpropagation — 2 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- Neuromorphic computing hardware and event-based vision: a perfect match? (2025)
Clustering — 2 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
SEEM — 2 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- CV4MS @ CVPR 2024 (2024)
Score-CAM — 2 videos
- The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
FullGrad — 2 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
SFT — 2 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- THE BITTER LESSON FOR RL: VERIFICATION AS THE KEY TO REASONING LLMS (2025)
Safe Latent Diffusion — 2 videos
- 23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
EfficientNet — 2 videos
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
HuBERT — 2 videos
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- From Multimodal LLM to Human-level AI (2024)
TCN — 2 videos
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- CVsports Workshop at CVPR 2024, Seattle (2024)
UniformerV2 — 2 videos
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Whisper — 2 videos
- 23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
- From Multimodal LLM to Human-level AI (2024)
CC12M — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Synthetic Data for CV (2024)
CC3M — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Synthetic Data for CV (2024)
CLIPSeg — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Foundational Few-Shot Object Detection Challenge (2025)
CogAgent — 2 videos
Conceptual Captions — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
DeepSeek-VL — 2 videos
GQA — 2 videos
PaLI-X — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
ViLT — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
VizWiz-VQA — 2 videos
- VizWiz Grand Challenge: Opening Remarks (2024)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
DeepLabV3 — 2 videos
- Machine Learning for Geometric Shape Analysis (2024)
- Visual-Inertial Odometry for Small-sized Robots (2024)
Occupancy Network — 2 videos
- Machine Learning for Geometric Shape Analysis (2024)
- 3D Foundation Models for Physical Intelligence (2024)
SIREN — 2 videos
- Machine Learning for Geometric Shape Analysis (2024)
- CVPR 2025 - 2nd Workshop on Neural Fields Beyond Conventional Cameras (2025)
4D-Occ — 2 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Bucket Normalized EPE — 2 videos
Ego-MLP — 2 videos
FocalFormer3D — 2 videos
IoU — 2 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
Lite-QCNet — 2 videos
PDM Score — 2 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
QCNet — 2 videos
TrackFlow — 2 videos
- CVPR 2024 Workshop on Autonomous Driving (2024)
- Perception and simulation for self-driving vehicles (2025)
ALOHA — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
Chamfer Distance — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
Graph Visual Question Answering — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
Masked Autoencoder — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models in Radiology (2025)
PaLM-E — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- Foundation Models for Autonomous Systems Workshop (2024)
SpatialVLM — 2 videos
- 23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
- From Multimodal LLM to Human-level AI (2024)
EDSR — 2 videos
- Mobile Intelligent Photography and Imaging (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
SRGAN — 2 videos
StableSR — 2 videos
SwinIR — 2 videos
- Mobile Intelligent Photography and Imaging (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
VDSR — 2 videos
- Mobile Intelligent Photography and Imaging (2024)
- 8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)
3D Gaussian Splatting (3DGS) — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- NAVSIM v2: Pseudo-Simulation for Autonomous Driving & ICCV 2025 Challenge Winner Presentation (2025)
AdaLN — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- World Modeling Challenge (2025)
Ctrl-Sim — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Scalable Autonomous Driving via Fully Data-driven Simulation (2025)
DriveVLM-Dual — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- Foundation models For autonomous driving (2025)
MARS — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
Nerfacto — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- 3D Foundation Models for Physical Intelligence (2024)
NeuRas — 2 videos
- Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
JFT-300M — 2 videos
- Welcome to the Workshop on Responsible Data! (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
APS — 2 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- The development of the DVS and DAVIS sensors (2025)
BIM — 2 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
MaskGit — 2 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Visual Generative Modeling: What’s After Diffusion? (2025)
Vision Transformers (ViT) — 2 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
YFCC100M — 2 videos
- IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
- Synthetic Data for CV (2024)
EV-IMO — 2 videos
- Visual-Inertial Odometry for Small-sized Robots (2024)
- O-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event Camera (2025)
EVDodge — 2 videos
- Visual-Inertial Odometry for Small-sized Robots (2024)
- Object Motion Segmentation: Advantages from Event Data (2025)
ICP — 2 videos
- Visual-Inertial Odometry for Small-sized Robots (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
Kalman filter — 2 videos
- Visual-Inertial Odometry for Small-sized Robots (2024)
- What does Embodied Intelligence mean? Lessons Learned from Drone Racing (2025)
ORB-SLAM — 2 videos
ReLU — 2 videos
- Visual-Inertial Odometry for Small-sized Robots (2024)
- The development of the DVS and DAVIS sensors (2025)
Autoencoder — 2 videos
Monte Carlo Tree Search (MCTS) — 2 videos
- AI4Space 2024 Workshop (2024)
- THE BITTER LESSON FOR RL: VERIFICATION AS THE KEY TO REASONING LLMS (2025)
Neural-Fly — 2 videos
- AI4Space 2024 Workshop (2024)
- From Sim2Real 1.0 to 4.0 for Humanoid Whole-Body Control and Loco-Manipulation (2025)
DreamGaussian — 2 videos
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 3D Foundation Models for Physical Intelligence (2024)
One-2-3-45 — 2 videos
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- 3D Foundation Models for Physical Intelligence (2024)
VITPose — 2 videos
- 4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
- How to Train Your Humanoid: From Human Mesh Recovery to VideoMimic (2025)
F1 Score — 2 videos
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
IM-Net — 2 videos
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- 3D Foundation Models for Physical Intelligence (2024)
Optimal Transport — 2 videos
- 1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
- Deep Stereo Matching in the Twenties (2024)
Data augmentation — 2 videos
DeepSORT — 2 videos
LLaMA2 — 2 videos
MLP Projector — 2 videos
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
MOTA — 2 videos
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
NAFNet — 2 videos
Photogrammetry — 2 videos
- THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
- 4th Workshop on Computer Vision in the Built Environment (2024)
Pseudo-labeling — 2 videos
Qwen-VL — 2 videos
Test-time augmentation — 2 videos
Video Swin Transformer — 2 videos
Weighted Box Fusion (WBF) — 2 videos
YOLOv5 — 2 videos
Active Learning — 2 videos
FPN (Feature Pyramid Network) — 2 videos
Image Captioning — 2 videos
- DEF-AI-MIA Workshop at CVPR 2024 (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
GPR — 2 videos
BioGPT — 2 videos
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- From Multimodal LLM to Human-level AI (2024)
CONCH — 2 videos
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Multimodal, Generative, and Agentic AI for Pathology (2025)
EfficientNet-B7 — 2 videos
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Black-box Adversarial Attacks on Vision Foundation Models (2024)
Vision Transformer (ViT) — 2 videos
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Welcome to the workshop on Computer Vision in the Wild (CVinW) (2025)
iBOT — 2 videos
- Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
- Synthetic Data for CV (2024)
DCLM-Baseline — 2 videos
DataComp-1B — 2 videos
DataComp-LM — 2 videos
Mistral-7B-v0.3 — 2 videos
WIT — 2 videos
BiomedParse — 2 videos
- CV4MS @ CVPR 2024 (2024)
- Learning the Language of Patients (2025)
Convolutional Neural Network — 2 videos
- CV4MS @ CVPR 2024 (2024)
- Image Reconstruction from Neuromorphic Event Cameras using Laplacian- Prediction and Poisson Integration with Spiking and Artificial Neural Networks (2025)
DeepLabV3+ — 2 videos
- CV4MS @ CVPR 2024 (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
PSPNet — 2 videos
Plenoxels — 2 videos
- 3D Foundation Models for Physical Intelligence (2024)
- CVPR 2025 - 2nd Workshop on Neural Fields Beyond Conventional Cameras (2025)
Stable Video Diffusion — 2 videos
Computer Vision — 2 videos
Generative AI — 2 videos
Transformer decoder — 2 videos
3D-GPT — 2 videos
- Synthetic Data for CV (2024)
- From Multimodal LLM to Human-level AI (2024)
DeepSeek — 2 videos
Phi-3 — 2 videos
- Synthetic Data for CV (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
RAFT-Stereo — 2 videos
- Synthetic Data for CV (2024)
- Deep Stereo Matching in the Twenties (2024)
NNCF — 2 videos
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
ONNX — 2 videos
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- Mobile AI Workshop 2025: Introductory Talk (2025)
Self-supervised learning — 2 videos
- CVPR 2024 - Invited Speakers - Chris Padwick (2024)
- AI agents in cancer research and oncology (2025)
CLIP encoder — 2 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- What does Embodied Intelligence mean? Lessons Learned from Drone Racing (2025)
DriveGAN — 2 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- All you need to know about self-driving: Intro to Self-Driving (2024)
OpenVLA — 2 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
S2Net — 2 videos
- Foundation Models for Autonomous Systems Workshop (2024)
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
BehaviorNet — 2 videos
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- Perception and simulation for self-driving vehicles (2025)
IDM — 2 videos
- 23713 Towards Building AGI in Autonomy and Robotics (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
P3 — 2 videos
- All you need to know about self-driving: Intro to Self-Driving (2024)
- End-to-end Autonomous Driving: Past, Current and Onwards (2025)
Pascal VOC — 2 videos
- Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
- IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme (2025)
AutoGPT — 2 videos
- From Multimodal LLM to Human-level AI (2024)
- MineDojo: Framework for Generally Capable Agents & Voyager: An Open-Ended Embodied Agent with Large Language Models (2025)
Point-Bind — 2 videos
Video-ChatGPT — 2 videos
Voyager — 2 videos
- From Multimodal LLM to Human-level AI (2024)
- MineDojo: Framework for Generally Capable Agents & Voyager: An Open-Ended Embodied Agent with Large Language Models (2025)
Dynamic Vision Sensor (DVS) — 2 videos
- Event-based, 6-DOF Pose Tracking for High-Speed Maneuvers (2025)
- Event-based Cameras: Challenges and Opportunities (2025)
MVSEC dataset — 2 videos
- Learning Event-Based Height From Plane and Parallax (2025)
- v2e: From Video Frames to Realistic DVS Events (2025)
EVO — 2 videos
- Comparing Representations in Tracking for Event Camera-based SLAM (2025)
- Event-based Algorithms for Robust and High-speed Robotics (2025)
DHP19 — 2 videos
- DHP19: Dynamic Vision Sensor 3D Human Pose Dataset (2025)
- Lifting Monocular Events to 3D Human Poses (2025)
Harris score — 2 videos
- Detecting Stable Keypoints from Events through Image Gradient Prediction (2025)
- Neuromorphic vision for humanoid robots (2025)
Spiking Neural Network — 2 videos
- Image Reconstruction from Neuromorphic Event Cameras using Laplacian- Prediction and Poisson Integration with Spiking and Artificial Neural Networks (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
N-MNIST dataset — 2 videos
- Image Reconstruction from Neuromorphic Event Cameras using Laplacian- Prediction and Poisson Integration with Spiking and Artificial Neural Networks (2025)
- Event-Driven Convolution-Based Processing (2025)
CPS — 2 videos
- HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation (2025)
- IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme (2025)
PS-MT — 2 videos
- HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation (2025)
- IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme (2025)
CCVC — 2 videos
- HDC: Hierarchical Distillation for Multi-level Noisy Consistency in Semi-Supervised Fetal Ultrasound Segmentation (2025)
- IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme (2025)
FlowNet — 2 videos
- Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy (2025)
- Self-supervised Learning for Dynamic 3D Scene Understanding (2025)
ESIM (Event Camera Simulator) — 2 videos
- Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
- Event-based Cameras: Challenges and Opportunities (2025)
Convolutional Neural Networks (CNNs) — 2 videos
- Welcome to the workshop on Computer Vision in the Wild (CVinW) (2025)
- Unsupervised Learning of Optical Flow and Camera Motion from Event Data (2025)
FPGA — 2 videos
- From Event-Based Visions to Real Systems (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
IMU — 2 videos
libcaer — 2 videos
- iniVation Neuromorphic Vision Systems: Core Technology, Software, and Applications (2025)
- Applications, Software and Hardware for Event-Based Vision (2025)
Loihi — 2 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
N-MNIST — 2 videos
- LEARNING FROM EVENTS: ON THE FUTURE OF MACHINE LEARNING FOR EVENT-BASED CAMERAS (2025)
- Neuromorphic Vision Applications: From Robotic Foosball to Tracking Space Junk (2025)
MonoSLAM — 2 videos
PTAM — 2 videos
- Event-Based SLAM at Slamcore (2025)
- Event-based Algorithms for Robust and High-speed Robotics (2025)
iniLabs DVS128 — 2 videos
- Event-Based SLAM at Slamcore (2025)
- Object and Action Recognition on the Event-Based IBM TrueNorth Processor (2025)
Uni-NLX — 2 videos
- Vision and Language Algorithmic Reasoning Work & SMART-101 Challenge Awards (2025)
- ICCVW 2023 VLAR Session 3 (2025)
SelfGraphVQA — 2 videos
- Vision and Language Algorithmic Reasoning Work & SMART-101 Challenge Awards (2025)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
Neural Networks — 2 videos
- Unsupervised Learning of Optical Flow and Camera Motion from Event Data (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
HMAX — 2 videos
- Spiking Neural Networks for Event-based Vision (2025)
- Bio-Inspired Embedded Event-based Visual Processing (2025)
DNN — 2 videos
- Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
- Hardware and Algorithm Co-design with Event Sensors (2025)
FastFlow3D — 2 videos
Gabor filters — 2 videos
TRAM — 2 videos
- How to Train Your Humanoid: From Human Mesh Recovery to VideoMimic (2025)
- Estimating human motion in world coordinates (2025)
VisProg — 2 videos
- Multi-stage reasoning for video understanding & scene generation (2025)
- ICCV 2023 Workshop on Vision and Language Algorithmic Reasoning (VLAR) (2025)
DVS128 — 2 videos
SemanticFusion — 2 videos
Brainchip — 2 videos
Particle Filter — 2 videos
- Event-Driven Sensing for a Humanoid Robot (2025)
- Reconstruction, Motion Estimation and SLAM from Events (2025)
Hough Transform — 2 videos
- Event-Driven Sensing for a Humanoid Robot (2025)
- Neuromorphic computing hardware and event-based vision: a perfect match? (2025)
Gaussian blur — 2 videos
VideoMimic — 2 videos
- From Sim2Real 1.0 to 4.0 for Humanoid Whole-Body Control and Loco-Manipulation (2025)
- Towards intelligent robots (2025)
Normalizing Flows — 2 videos
- Visual Generative Modeling: What’s After Diffusion? (2025)
- Visual Generative Modeling: What’s After Diffusion? (2025)
EMMA — 2 videos
- Foundation models For autonomous driving (2025)
- Workshop on Embodied Intelligence for Autonomous Systems on the Horizon (2025)
BiomedCLIP — 2 videos
- Learning the Language of Patients (2025)
- Multimodal, Generative, and Agentic AI for Pathology (2025)