Methods / Models / Datasets — Cross-Reference

7110 unique named entities across 243 videos

`GPT-3` — 15 videos

`Sora` — 15 videos

CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
3D Generative AI: Efficient, high-def & controllable (2024)
3D/4D Generation and Modeling with Generative Priors (2024)
7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
Diffusion-based Video Generative Models (2024)
… and 7 more

`ControlNet` — 15 videos

Geospatial Computer Vision and Machine Learning for Large-Scale Earth Observation Data (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Virtual Try-On Workshop (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
Diffusion-based Video Generative Models (2024)
The First Workshop on AI for 3D Generation (2024)
VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
Video Foundation Models: From Black Boxes to Controllable Representations (2024)
… and 7 more

`LoRA` — 14 videos

CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
CVPR MetaFood Workshop (2024)
Diffusion-based Video Generative Models (2024)
Video Foundation Models: From Black Boxes to Controllable Representations (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
Mobile Intelligent Photography and Imaging (2024)
… and 6 more

`ResNet-50` — 14 videos

Image Matching: Local Features and Beyond (2024)
CVPR 2024 Workshop (2024)
CVPR 2024 Workshop (2024)
Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
Scalable Real-Time Abnormal Event Detection (2024)
IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
AI4Space 2024 Workshop (2024)
4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
… and 6 more

`ChatGPT` — 13 videos

`AlexNet` — 13 videos

Image Matching: Local Features and Beyond (2024)
2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
Virtual Try-On Workshop (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Dataset Distillation: A Comprehensive Review (2024)
IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
… and 5 more

`LLaVA` — 13 videos

Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
Towards the 3D Human Foundation Agent (2024)
2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
… and 5 more

`CNN` — 13 videos

Virtual Try-On Workshop (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)
AI4Space 2024 Workshop (2024)
4th Workshop on CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)
Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
All you need to know about self-driving: Intro to Self-Driving (2024)
… and 5 more

`MLP` — 12 videos

`GPT-2` — 11 videos

Workshop on Graphic Design Understanding and Generation (2024)
7th Multi-modal Learning Workshop (2024)
The 13th Women in Computer Vision (WiCV) Workshop (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
23641 6th Workshop and Competition on Affective Behavior Analysis in the wild (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
… and 3 more

`U-Net` — 10 videos

3D Generative AI: Efficient, high-def & controllable (2024)
LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
Virtual Try-On Workshop (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)
Events-to-Video: Bringing Modern Computer Vision to Event Cameras (2025)
… and 2 more

`COCO` — 10 videos

CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Machine Learning for Geometric Shape Analysis (2024)
Welcome to the Workshop on Responsible Data! (2024)
Synthetic Data for CV (2024)
All you need to know about self-driving: Intro to Self-Driving (2024)
Edge-Optimized Deep Learning: Harnessing Generative AI and Computer Vision with Open-Source Libraries (2024)
… and 2 more

`DALL-E` — 10 videos

CVPR 2024 Object-Centric Representation For Computer Vision Tutorial (2024)
Towards the 3D Human Foundation Agent (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Video Foundation Models: From Black Boxes to Controllable Representations (2024)
23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
… and 2 more

`ViT` — 10 videos

The 3rd Monocular Depth Estimation Challenge (2024)
All You Need to Know about Point Cloud Understanding (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Dataset Distillation: A Comprehensive Review (2024)
Machine Learning for Geometric Shape Analysis (2024)
Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
… and 2 more

`Gaussian Splatting` — 10 videos

Image Matching: Local Features and Beyond (2024)
Towards the 3D Human Foundation Agent (2024)
CVPR MetaFood Workshop (2024)
The First Workshop on AI for 3D Generation (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
4th Workshop on Computer Vision in the Built Environment (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)
3D Foundation Models for Physical Intelligence (2024)
… and 2 more

`Gemini` — 10 videos

Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
The 13th Women in Computer Vision (WiCV) Workshop (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
CV4MS @ CVPR 2024 (2024)
Synthetic Data for CV (2024)
Multi-stage reasoning for video understanding & scene generation (2025)
… and 2 more

`DreamFusion` — 9 videos

3D Generative AI: Efficient, high-def & controllable (2024)
3D/4D Generation and Modeling with Generative Priors (2024)
2’nd Workshop for Learning 3D with Multi-View Supervision (3DMV) at CVPR 2024 (2024)
CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)
The First Workshop on AI for 3D Generation (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
… and 1 more

`RANSAC` — 9 videos

The 3rd Monocular Depth Estimation Challenge (2024)
CVPR 2024 Workshop (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
1st Workshop on Urban Scene Modeling: Where Vision Meets Photogrammetry and Graphics (2024)
4th Workshop on Computer Vision in the Built Environment (2024)
All you need to know about self-driving: Intro to Self-Driving (2024)
CVPR 2024 Tutorial (2024)
Event-based Feature Tracking and Visual Inertial Odometry (2025)
… and 1 more

`BLIP-2` — 9 videos

The 3rd Monocular Depth Estimation Challenge (2024)
7th Multi-modal Learning Workshop (2024)
7th Multi-modal Learning Workshop (2024)
The 6th International Workshop on Gaze Estimation and Prediction in the Wild (2024)
CVPR 2024 Workshop (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
From Multimodal LLM to Human-level AI (2024)
… and 1 more

`PointNet` — 9 videos

CVPR 2024 Workshop (2024)
Workshop on Scene Graphs and Graph Representation Learning (SG2RL 2024) (2024)
Virtual Try-On Workshop (2024)
All You Need to Know about Point Cloud Understanding (2024)
The First Workshop on AI for 3D Generation (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
Machine Learning for Geometric Shape Analysis (2024)
… and 1 more

`DALL-E 3` — 9 videos

Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
Panel Discussion on AI, Art, and Creativity (2024)
7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
ReGenAI Workshop CVPR 2024 (2024)
CVPR 2024 Workshop (2024)
The First Workshop on AI for 3D Generation (2024)
VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
Visual Perception and Learning in an Open World (VPLOW) Workshop Session (2025)
… and 1 more

`Flamingo` — 9 videos

Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
CV4MS @ CVPR 2024 (2024)
From Multimodal LLM to Human-level AI (2024)
Multi-stage reasoning for video understanding & scene generation (2025)
… and 1 more

`Midjourney` — 9 videos

Panel Discussion on AI, Art, and Creativity (2024)
7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
CVPR 2024 Workshop (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
… and 1 more

`ResNet18` — 9 videos

LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
CVPR 2024 Tutorial: Learning Deep Low-Dimensional Models from High-Dimensional Data: Theory to Practice (2024)
CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
CVPR 2024 Workshop (2024)
Dataset Distillation: A Comprehensive Review (2024)
Scalable Real-Time Abnormal Event Detection (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
Machine Learning for Geometric Shape Analysis (2024)
… and 1 more

`MNIST` — 9 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
VPLOW@CVPR’24: The 4th Workshop of Visual Perception and Learning in an Open World (2024)
Welcome to the Workshop on Responsible Data! (2024)
CV4MS @ CVPR 2024 (2024)
3D Foundation Models for Physical Intelligence (2024)
Neuromorphic Vision Applications: From Robotic Foosball to Tracking Space Junk (2025)
… and 1 more

`CARLA` — 9 videos

CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
Generating The Invisible: Capturing and Generating Edge-cases in Autonomous Driving (2024)
Synthetic Data for CV (2024)
Foundation Models for Autonomous Systems Workshop (2024)
All you need to know about self-driving: Intro to Self-Driving (2024)
X-WORLD: Accessibility, Vision, and Autonomy Meet (2025)
CVPR 2025 Workshop on Autonomous Driving (2025)
… and 1 more

`GPT-3.5` — 8 videos

`VQ-VAE` — 8 videos

`VQ-GAN` — 8 videos

`COLMAP` — 8 videos

`nuScenes` — 8 videos

`DDPM` — 8 videos

`Diffusion Models` — 8 videos

`SIFT` — 8 videos

`StyleGAN` — 8 videos

`InstructBLIP` — 8 videos

`SimCLR` — 8 videos

`YOLO` — 8 videos

`UniAD` — 7 videos

`Waymo` — 7 videos

`Imagen` — 7 videos

`VQGAN` — 7 videos

`Transformers` — 7 videos

`Flow Matching` — 7 videos

`ResNet-18` — 7 videos

`VGG16` — 7 videos

`Instant3D` — 7 videos

`LLaMA` — 7 videos

`BLIP` — 7 videos

Panel Discussion on AI, Art, and Creativity (2024)
ReGenAI Workshop CVPR 2024 (2024)
Welcome to the Workshop on Responsible Data! (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)
Multi-stage reasoning for video understanding & scene generation (2025)
Cross-Modal 3D Scene Understanding (2025)

`Grad-CAM` — 7 videos

5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
CVPR 2024 Workshop (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
23642 2nd Workshop on Multimodal Content Moderation mp4 (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)
Microscopy, foundation models, and the scaling hypothesis (CVMI @ CVPR 2024) (2024)
CV4MS @ CVPR 2024 (2024)

`SLAM` — 7 videos

Computer Vision Foundation Workshop (2024)
CVPR 2024 Workshop (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
4th Workshop on Computer Vision in the Built Environment (2024)
Event-based vision and processing for tiny drones (2025)
REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)

`GANs` — 7 videos

`Ego4D` — 7 videos

`YOLOv8` — 7 videos

`DVS` — 7 videos

`LSTM` — 6 videos

`PRISM-1` — 6 videos

`DriveDreamer` — 6 videos

`nuPlan` — 6 videos

`WayveScenes101` — 6 videos

`DALL-E 2` — 6 videos

`MAE` — 6 videos

`TensorFlow` — 6 videos

Multimodal AI for Edge AI (2024)
CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)
Applications, Software and Hardware for Event-Based Vision (2025)
REALIZING THE PROMISE OF SPIKING NEUROMORPHIC HARDWARE (2025)
Mobile AI Workshop 2025: Introductory Talk (2025)

`ShapeNet` — 6 videos

`VGG` — 6 videos

`CIFAR-10` — 6 videos

`VLM` — 6 videos

ReGenAI Workshop CVPR 2024 (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
4th Workshop on Computer Vision in the Built Environment (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`DreamBooth` — 6 videos

`Deep Learning` — 6 videos

`DVS (Dynamic Vision Sensor)` — 6 videos

`Point-E` — 5 videos

`Ghost Gym` — 5 videos

`Lingo-2` — 5 videos

`VIDAR` — 5 videos

`LDM` — 5 videos

3D Generative AI: Efficient, high-def & controllable (2024)
ReGenAI Workshop CVPR 2024 (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)
Visual Generative Modeling: What’s After Diffusion? (2025)

`PCA` — 5 videos

3D Generative AI: Efficient, high-def & controllable (2024)
Virtual Try-On Workshop (2024)
ViLMa Visual Localization and Mapping (2024)
CV4MS @ CVPR 2024 (2024)
Neuromorphic computing hardware and event-based vision: a perfect match? (2025)

`SDXL` — 5 videos

`GAN` — 5 videos

Multimodal AI for Edge AI (2024)
Scalable Real-Time Abnormal Event Detection (2024)
AI4Space 2024 Workshop (2024)
3D Foundation Models for Physical Intelligence (2024)
8th New Trends in Image Restoration and Enhancement Workshop and 8 Associated Challenges (2025)

`ConvNeXt` — 5 videos

`SuperPoint` — 5 videos

The 3rd Monocular Depth Estimation Challenge (2024)
Image Matching: Local Features and Beyond (2024)
CVPR MetaFood Workshop (2024)
ViLMa Visual Localization and Mapping (2024)
AI4Space 2024 Workshop (2024)

`StyleGAN2` — 5 videos

`EG3D` — 5 videos

`Objaverse-XL` — 5 videos

`UNet` — 5 videos

CVPR 2024 Workshop (2024)
The 13th Women in Computer Vision (WiCV) Workshop (2024)
First Joint Egocentric Vision (EgoVis) Workshop Held in Conjunction with CVPR 2024 (2024)
CV4MS @ CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`CycleGAN` — 5 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
The 13th Women in Computer Vision (WiCV) Workshop (2024)
Panel Discussion on AI, Art, and Creativity (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Visual Generative Modeling: What’s After Diffusion? (2025)

`DenseNet` — 5 videos

`Diffusion Model` — 5 videos

`ViperGPT` — 5 videos

`InternVideo` — 5 videos

`Segment Anything` — 5 videos

`ALIGN` — 5 videos

CV4MR 2024: 2nd Workshop on Computer Vision for Mixed Reality (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
CV4MS @ CVPR 2024 (2024)
Foundation Models for Autonomous Systems Workshop (2024)
Welcome to the workshop on Computer Vision in the Wild (CVinW) (2025)

`InstructPix2Pix` — 5 videos

`OpenPose` — 5 videos

`GCN` — 5 videos

`MaskGIT` — 5 videos

CVPR 2024 Workshop (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)
Coarse-to-Fine Amodal Segmentation with Shape Prior (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
Visual Generative Modeling: What’s After Diffusion? (2025)

`LAION-5B` — 5 videos

`Faster R-CNN` — 5 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
CV4MS @ CVPR 2024 (2024)
Synthetic Data for CV (2024)
Concept Learning Across Domains and Modalities (2025)

`Mask R-CNN` — 5 videos

`UMAP` — 5 videos

`IP-Adapter` — 5 videos

`Gato` — 5 videos

`Open X-Embodiment` — 5 videos

`PyTorch` — 5 videos

`Diffusion Policy` — 5 videos

`Diffusion models` — 5 videos

`3D Gaussian Splatting` — 5 videos

2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
ViLMa Visual Localization and Mapping (2024)
Scalable Real-Time Abnormal Event Detection (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)
Synthetic Data for CV (2024)

`LAION` — 5 videos

`Random Forest` — 5 videos

`Instant-NGP` — 5 videos

`DAVIS` — 5 videos

`GAIA-1` — 4 videos

`VISTA` — 4 videos

`Lingo-1` — 4 videos

`MCTS` — 4 videos

`GNN` — 4 videos

`GenAD` — 4 videos

`DriveGPT4` — 4 videos

`V-JEPA` — 4 videos

`Drive-WM` — 4 videos

`OccWorld` — 4 videos

`WoVoGen` — 4 videos

`Llama` — 4 videos

`PID controller` — 4 videos

`Cross-attention` — 4 videos

CVPR 2024 Tutorial: End-to-End Autonomy: A New Era of Self-Driving (2024)
CVPR 2024 Workshop (2024)
CVPR 2024 Workshop (2024)
CVPR 2024 Workshop (2024)

`AMASS` — 4 videos

`GigaGAN` — 4 videos

`CNNs` — 4 videos

`VILA` — 4 videos

`AWQ` — 4 videos

`DPO` — 4 videos

`MobileNetV2` — 4 videos

Multimodal AI for Edge AI (2024)
The 20th Embedded Vision Workshop (EVW2024) (2024)
CVPR 2024 Workshop on Autonomous Driving (2024)
IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)

`SuperGlue` — 4 videos

The 3rd Monocular Depth Estimation Challenge (2024)
Image Matching: Local Features and Beyond (2024)
CVPR MetaFood Workshop (2024)
AI4Space 2024 Workshop (2024)

`Cityscapes` — 4 videos

`ResNet-34` — 4 videos

Image Matching: Local Features and Beyond (2024)
Robustness at Inference: Towards Explainability, Uncertainty, and Intervenability (2024)
CV4MS @ CVPR 2024 (2024)
Lifting Monocular Events to 3D Human Poses (2025)

`ProlificDreamer` — 4 videos

3D/4D Generation and Modeling with Generative Priors (2024)
ReGenAI Workshop CVPR 2024 (2024)
The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Magic3D` — 4 videos

`Segment Anything Model (SAM)` — 4 videos

`Visual Genome` — 4 videos

`ViT (Vision Transformer)` — 4 videos

`PointNet++` — 4 videos

`HMR` — 4 videos

Towards the 3D Human Foundation Agent (2024)
Virtual Try-On Workshop (2024)
CVPR 2024 Workshop on Autonomous Driving (2024)
Estimating human motion in world coordinates (2025)

`SMPL` — 4 videos

Towards the 3D Human Foundation Agent (2024)
Virtual Try-On Workshop (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`PaLM` — 4 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
CV4MS @ CVPR 2024 (2024)
From Multimodal LLM to Human-level AI (2024)

`BLIP2` — 4 videos

`Shap-E` — 4 videos

`ByteTrack` — 4 videos

`TensorRT` — 4 videos

The 20th Embedded Vision Workshop (EVW2024) (2024)
CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`LiDAR` — 4 videos

`Mixup` — 4 videos

`Muse` — 4 videos

CVPR 2024 Workshop (2024)
CVPR 2024 Workshop: Multimodal Foundation Models (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)

`StyleDrop` — 4 videos

`VideoPoet` — 4 videos

CVPR 2024 Workshop (2024)
Diffusion-based Video Generative Models (2024)
From Multimodal LLM to Human-level AI (2024)
Generalization via Scaling Robotics (2025)

`LLM` — 4 videos

ReGenAI Workshop CVPR 2024 (2024)
Disentanglement and Compositionality in Artificial Intelligence (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`CutMix` — 4 videos

`CLIPScore` — 4 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Diffusion-based Video Generative Models (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
Visual Generative Modeling: What’s After Diffusion? (2025)

`VideoMAE` — 4 videos

CVPR 2024 Workshop (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Unsolved problems in video understanding (2025)
What can biological systems teach us about embodied learning? (2025)

`Transformer Encoder` — 4 videos

`Objaverse` — 4 videos

`AtlasNet` — 4 videos

`LINGO-1` — 4 videos

`LINGO-2` — 4 videos

`COMPASS` — 4 videos

`LM-Nav` — 4 videos

`RT-1` — 4 videos

`BLIP-2 Q-Former` — 4 videos

`Vicuna-7B` — 4 videos

`ST-P3` — 4 videos

`Waymo Open Dataset` — 4 videos

`Llama-2` — 4 videos

`GET3D` — 4 videos

`SDEdit` — 4 videos

`ScanNet` — 4 videos

`Co-DETR` — 4 videos

`DinoV2` — 4 videos

23598 The 5th Annual Embodied AI Workshop (2024)
CV4MS @ CVPR 2024 (2024)
Foundation Models for Autonomous Systems Workshop (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)

`Foundation Models` — 4 videos

`VAE` — 4 videos

`RT-2` — 4 videos

`DriveVLM` — 4 videos

`SigLIP` — 4 videos

`IBM TrueNorth` — 4 videos

`ATIS` — 4 videos

`SpiNNaker` — 4 videos

Neuromorphic Computing: towards event-based (cognitive) sensing and control (2025)
Event-Driven Convolution-Based Processing (2025)
Novel Hardware for Spatial AI (2025)
Event-Driven Sensing for a Humanoid Robot (2025)

`GLIDE` — 3 videos

`PPO` — 3 videos

`RAG-driver` — 3 videos

`LMdrive` — 3 videos

`Nuro` — 3 videos

`Drive Anywhere` — 3 videos

`IRIS` — 3 videos

`SEM2` — 3 videos

`DriveWorld` — 3 videos

`TrafficBots` — 3 videos

`SubjectDrive` — 3 videos

`UniWorld` — 3 videos

`MUVO` — 3 videos

`Think2Drive` — 3 videos

`DriveLM` — 3 videos

`MP3` — 3 videos

`Q-Former` — 3 videos

`Flan-T5` — 3 videos

`VIVIT` — 3 videos

`Nerfstudio` — 3 videos

`HyperNeRF` — 3 videos

`Nerfies` — 3 videos

`DriveSim` — 3 videos

`Waabi World` — 3 videos

`Waymo's Waymax` — 3 videos

`KITTI-360` — 3 videos

`NMP` — 3 videos

`Large Language Model` — 3 videos

`CarLLaVA` — 3 videos

`Copilot4D` — 3 videos

`NeuRAD` — 3 videos

`Chinchilla` — 3 videos

`LRM` — 3 videos

`Zero-1-to-3` — 3 videos

3D Generative AI: Efficient, high-def & controllable (2024)
The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`IM-3D` — 3 videos

`Contrastive Learning` — 3 videos

`SkySense` — 3 videos

`SmoothQuant` — 3 videos

`TinyChat` — 3 videos

`TensorRT-LLM` — 3 videos

`DIT` — 3 videos

Computer Vision Foundation Talk/Workshop (2024)
Diffusion-based Video Generative Models (2024)
3D Foundation Models for Physical Intelligence (2024)

`Rectified Flow` — 3 videos

`EDM` — 3 videos

`CLEVR` — 3 videos

`ADE20K` — 3 videos

`TensorFlow Lite` — 3 videos

Multimodal AI for Edge AI (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`OpenCV` — 3 videos

Multimodal AI for Edge AI (2024)
Applications, Software and Hardware for Event-Based Vision (2025)
Neuromorphic vision for humanoid robots (2025)

`Depth Anything` — 3 videos

The 3rd Monocular Depth Estimation Challenge (2024)
3D Foundation Models for Physical Intelligence (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)

`Marigold` — 3 videos

The 3rd Monocular Depth Estimation Challenge (2024)
ReGenAI Workshop CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`LoFTR` — 3 videos

`ZoeDepth` — 3 videos

The 3rd Monocular Depth Estimation Challenge (2024)
ReGenAI Workshop CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`Metric3D` — 3 videos

The 3rd Monocular Depth Estimation Challenge (2024)
3D Foundation Models for Physical Intelligence (2024)
Second Egocentric Vision (EgoVis) Workshop (2025)

`MiDaS` — 3 videos

`KITTI` — 3 videos

`Kinect` — 3 videos

The 3rd Monocular Depth Estimation Challenge (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Synthetic Data for CV (2024)

`MIM` — 3 videos

`DINO-ViT` — 3 videos

Image Matching: Local Features and Beyond (2024)
Scalable Real-Time Abnormal Event Detection (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`LightGlue` — 3 videos

Image Matching: Local Features and Beyond (2024)
ViLMa Visual Localization and Mapping (2024)
Synthetic Data for CV (2024)

`NetVLAD` — 3 videos

Image Matching: Local Features and Beyond (2024)
ViLMa Visual Localization and Mapping (2024)
Workshop on Autonomous Driving (2025)

`ResNet-101` — 3 videos

`DBSCAN` — 3 videos

Image Matching: Local Features and Beyond (2024)
7th Multi-modal Learning Workshop (2024)
4th Workshop on Computer Vision in the Built Environment (2024)

`MeshGPT` — 3 videos

`SyncDreamer` — 3 videos

3D/4D Generation and Modeling with Generative Priors (2024)
The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Ego-Exo4D` — 3 videos

`ImageNet-1K` — 3 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
Dataset Distillation: A Comprehensive Review (2024)
Synthetic Data for CV (2024)

`MiniGPT-4` — 3 videos

`Graph Neural Networks (GNNs)` — 3 videos

`OpenScene` — 3 videos

`NeRFs` — 3 videos

Towards the 3D Human Foundation Agent (2024)
The First Workshop on AI for 3D Generation (2024)
ViLMa Visual Localization and Mapping (2024)

`Mask2Former` — 3 videos

Towards the 3D Human Foundation Agent (2024)
23598 The 5th Annual Embodied AI Workshop (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`LaMDA` — 3 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)
From Multimodal LLM to Human-level AI (2024)

`ActivityNet-QA` — 3 videos

`EgoSchema` — 3 videos

`NEXT-QA` — 3 videos

`MVCNN` — 3 videos

`Magic123` — 3 videos

`MVDream` — 3 videos

`GPT` — 3 videos

`Mask3D` — 3 videos

`Inverse Kinematics` — 3 videos

`BigGAN` — 3 videos

`Swin Transformer` — 3 videos

OmniCV 2024 Workshop (2024)
Dataset Distillation: A Comprehensive Review (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`L1 loss` — 3 videos

PBVS 2024 Workshop: Challenges and Results (2024)
Machine Learning for Geometric Shape Analysis (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`AUROC` — 3 videos

`t-SNE` — 3 videos

5th Face Anti-spoofing Workshop @ CVPR2024 (2024)
Scalable Real-Time Abnormal Event Detection (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)

`Emu` — 3 videos

7th Workshop on Computer Vision for Fashion, Art, and Design (2024)
ReGenAI Workshop CVPR 2024 (2024)
GenAI Media Generation Challenge Workshop @ CVPR (2024)

`LLMs` — 3 videos

`MedSAM` — 3 videos

CVPR 2024 Workshop on Data Curation and Augmentation in Medical Imaging (2024)
CV4MS @ CVPR 2024 (2024)
AI agents in cancer research and oncology (2025)

`Latent Diffusion` — 3 videos

CVPR 2024 Workshop (2024)
Diffusion-based Video Generative Models (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)

`SR3` — 3 videos

ReGenAI Workshop CVPR 2024 (2024)
Mobile Intelligent Photography and Imaging (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`Stable Diffusion XL` — 3 videos

ReGenAI Workshop CVPR 2024 (2024)
The First Workshop on AI for 3D Generation (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)

`RAG` — 3 videos

ReGenAI Workshop CVPR 2024 (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)
Learning the Language of Patients (2025)

`DataComp` — 3 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`Swin-L` — 3 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Synthetic Data for CV (2024)

`OpenCLIP` — 3 videos

`DETR` — 3 videos

`CLIP Image Encoder` — 3 videos

CVPR MetaFood Workshop (2024)
CVPR MetaFood Workshop (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`DDIM Inversion` — 3 videos

CVPR 2024 Workshop (2024)
CVPR 2024 Workshop (2024)
Video Foundation Models: From Black Boxes to Controllable Representations (2024)

`ZeroScope` — 3 videos

`I3D` — 3 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
Dataset Distillation: A Comprehensive Review (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`HyperDreamBooth` — 3 videos

`ZipLoRA` — 3 videos

`Textual Inversion` — 3 videos

`MoCo` — 3 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
Welcome to the Workshop on Responsible Data! (2024)
CV4MS @ CVPR 2024 (2024)

`VQA` — 3 videos

`ImageDream` — 3 videos

Virtual Try-On Workshop (2024)
The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`LORA finetuning` — 3 videos

`Vista` — 3 videos

`Alexnet` — 3 videos

`AlphaGo` — 3 videos

`Inception-v3` — 3 videos

`MinkUNet` — 3 videos

`LLaVA-1.5` — 3 videos

`TextVQA` — 3 videos

CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
VizWiz Grand Challenge: Opening Remarks (2024)
From Multimodal LLM to Human-level AI (2024)

`GradCAM` — 3 videos

`GradCAM++` — 3 videos

`ODIN` — 3 videos

`SHAP` — 3 videos

`CAT3D` — 3 videos

`DeepSDF` — 3 videos

`Genie` — 3 videos

`SDF` — 3 videos

`KNN` — 3 videos

`ScanNet++` — 3 videos

`GroundingDINO` — 3 videos

`ImageNetV2` — 3 videos

`LAION-2B` — 3 videos

`LAION-400M` — 3 videos

`Bard` — 3 videos

23598 The 5th Annual Embodied AI Workshop (2024)
VISION-AND-LANGUAGE ALGORITHMIC REASONING (VLAR) (2025)
The missing rungs on the ladder to general AI (2025)

`Kinetics` — 3 videos

23598 The 5th Annual Embodied AI Workshop (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)
Generalization via Scaling Robotics (2025)

`MPC` — 3 videos

`Octo` — 3 videos

23598 The 5th Annual Embodied AI Workshop (2024)
Foundation Models for Autonomous Systems Workshop (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)

`PIVOT` — 3 videos

23598 The 5th Annual Embodied AI Workshop (2024)
CVPR 2024 Workshop (2024)
23650 Vision and Language for Autonomous Driving and Robotics VLADR (2024)

`RT-2-X` — 3 videos

23598 The 5th Annual Embodied AI Workshop (2024)
Foundation Models for Autonomous Systems Workshop (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)

`Vision Transformer` — 3 videos

`Block-NeRF` — 3 videos

`SUDS` — 3 videos

`SGD` — 3 videos

`Grad-CAM++` — 3 videos

`ImageBind` — 3 videos

`YOLOv7` — 3 videos

`Common Crawl` — 3 videos

`RLHF` — 3 videos

`OCR` — 3 videos

`PaLI` — 3 videos

`RedCaps` — 3 videos

VizWiz Grand Challenge: Opening Remarks (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`ResNet34` — 3 videos

`GAIA` — 3 videos

`Transfuser` — 3 videos

`Reinforcement Learning` — 3 videos

`DrivingGaussian` — 3 videos

`MotionLM` — 3 videos

`UniSim` — 3 videos

`VAD` — 3 videos

`Waymax` — 3 videos

`MS-COCO` — 3 videos

`RNN` — 3 videos

`HRNet` — 3 videos

`OpenVINO` — 3 videos

`CLIP (Contrastive Language-Image Pre-training)` — 3 videos

`SegFormer` — 3 videos

4th Workshop on Computer Vision in the Built Environment (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`ObjectNet` — 3 videos

`RAFT` — 3 videos

CV4MS @ CVPR 2024 (2024)
Synthetic Data for CV (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`MegaDepth` — 3 videos

`BEVFormer` — 3 videos

`NAVSIM` — 3 videos

Foundation Models for Autonomous Systems Workshop (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)
CVPR 2025 Workshop on Autonomous Driving (2025)

`DART` — 3 videos

`mPLUG-Owl` — 3 videos

`Intel Loihi` — 3 videos

`YOLOv3` — 3 videos

`ROS` — 3 videos

`SNN` — 3 videos

`SCAMP` — 3 videos

`Spinnaker` — 3 videos

`TrueNorth` — 3 videos

`NAVSIM v2` — 3 videos

`Denoising Diffusion GANs (DDG)` — 3 videos

`f-Distill` — 3 videos

`DiffPD` — 2 videos

`MuJoCo` — 2 videos

`DOROTHIE` — 2 videos

`Dreamer v1` — 2 videos

`Dreamer v2` — 2 videos

`Dreamer v3` — 2 videos

`Phenaki` — 2 videos

`MILE` — 2 videos

`Panacea` — 2 videos

`LidarDM` — 2 videos

`Iso-Dream` — 2 videos

`DriveAGI` — 2 videos

`ELM` — 2 videos

`DriveAdapter` — 2 videos

`Model Predictive Control` — 2 videos

`NSFF` — 2 videos

`D-NeRF` — 2 videos

`Carla` — 2 videos

`CNN E2E` — 2 videos

`CILRS` — 2 videos

`SafeDagger` — 2 videos

`BDD-X` — 2 videos

`PlanT` — 2 videos

`Transformer Block` — 2 videos

`Lingo-Judge` — 2 videos

`Vision Encoder` — 2 videos

`DiT` — 2 videos

`PixArt-α` — 2 videos

`T5` — 2 videos

3D Generative AI: Efficient, high-def & controllable (2024)
From Multimodal LLM to Human-level AI (2024)

`Instruct-NeRF2NeRF` — 2 videos

`Neus` — 2 videos

`ConvNext` — 2 videos

`TextMesh` — 2 videos

`MeshLRM` — 2 videos

3D Generative AI: Efficient, high-def & controllable (2024)
CVPR 2024 Workshop (2024)

`DreamScene4D` — 2 videos

`MV-Dream` — 2 videos

`Splatter Image` — 2 videos

`Free3D` — 2 videos

`Sentinel-2` — 2 videos

`Supervised Learning` — 2 videos

`SatMAE` — 2 videos

`Scale-MAE` — 2 videos

`USat` — 2 videos

`K-Means` — 2 videos

`MCUNet` — 2 videos

`TinyNAS` — 2 videos

`TinyEngine` — 2 videos

`StyleGAN-T` — 2 videos

`CrossDIT` — 2 videos

`LADD` — 2 videos

`NuScenes` — 2 videos

`Kubric` — 2 videos

`TokenCut` — 2 videos

`ONNX Runtime` — 2 videos

`Pixel Shuffle` — 2 videos

`NFNet` — 2 videos

Multimodal AI for Edge AI (2024)
Dataset Distillation: A Comprehensive Review (2024)

`MNASNet` — 2 videos

Multimodal AI for Edge AI (2024)
LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)

`Knowledge Distillation` — 2 videos

Multimodal AI for Edge AI (2024)
Dataset Distillation: A Comprehensive Review (2024)

`DistilBERT` — 2 videos

Multimodal AI for Edge AI (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`Keras` — 2 videos

Multimodal AI for Edge AI (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`PackNet` — 2 videos

The 3rd Monocular Depth Estimation Challenge (2024)
ViLMa Visual Localization and Mapping (2024)

`ArgoVerse` — 2 videos

`MLPs` — 2 videos

`InternImage` — 2 videos

The 3rd Monocular Depth Estimation Challenge (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`FlexDM` — 2 videos

`DreamSim` — 2 videos

`Nerf` — 2 videos

`VGGSfM` — 2 videos

Image Matching: Local Features and Beyond (2024)
3D Foundation Models for Physical Intelligence (2024)

`PatchNetVLAD` — 2 videos

Image Matching: Local Features and Beyond (2024)
Workshop on Autonomous Driving (2025)

`MixVPR` — 2 videos

Image Matching: Local Features and Beyond (2024)
ViLMa Visual Localization and Mapping (2024)

`SGM` — 2 videos

Image Matching: Local Features and Beyond (2024)
Deep Stereo Matching in the Twenties (2024)

`ORB-SLAM3` — 2 videos

Image Matching: Local Features and Beyond (2024)
CVPR 2024 Workshop (2024)

`PoseDiffusion` — 2 videos

`StyleGAN2-ADA` — 2 videos

`StyleGAN3` — 2 videos

`DeepCAD` — 2 videos

`HiFA` — 2 videos

`Wonder3D` — 2 videos

`LucidDreamer` — 2 videos

`PhysDreamer` — 2 videos

`Dream-in-4D` — 2 videos

`4Real` — 2 videos

`LeNet` — 2 videos

CVPR 2024 Workshop (2024)
The development of the DVS and DAVIS sensors (2025)

`DGCNN` — 2 videos

CVPR 2024 Workshop (2024)
All You Need to Know about Point Cloud Understanding (2024)

`Aurora` — 2 videos

CVPR 2024 Workshop (2024)
Visual Generative Modeling: What’s After Diffusion? (2025)

`InfoNCE loss` — 2 videos

CVPR 2024 Workshop (2024)
3D Foundation Models for Physical Intelligence (2024)

`JAX library` — 2 videos

CVPR 2024 Workshop (2024)
CVPR 2024 Workshop on Autonomous Driving (2024)

`GoogleNet` — 2 videos

`Inception` — 2 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
Black-box Adversarial Attacks on Vision Foundation Models (2024)

`AutoAugment` — 2 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)

`DARTS` — 2 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
Deep Stereo Matching in the Twenties (2024)

`SNIP` — 2 videos

`CIFAR-100` — 2 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
Scalable Real-Time Abnormal Event Detection (2024)

`Tiny-ImageNet` — 2 videos

CVPRW-NAS 2024 - Day 1 Session 1 (2024)
Scalable Real-Time Abnormal Event Detection (2024)

`SORA` — 2 videos

`GIT` — 2 videos

Multimodal Algorithmic Reasoning Workshop & SMART-101 Challenge Awards (2024)
CV4MS @ CVPR 2024 (2024)

`T-SNE` — 2 videos

7th Multi-modal Learning Workshop (2024)
ICCVW 2023 VLAR Session 3 (2025)

`LLaVA 1.5` — 2 videos

7th Multi-modal Learning Workshop (2024)
From Multimodal LLM to Human-level AI (2024)

`DeepWalk` — 2 videos

`TokenHMR` — 2 videos

Towards the 3D Human Foundation Agent (2024)
Estimating human motion in world coordinates (2025)

`BEDLAM` — 2 videos

Towards the 3D Human Foundation Agent (2024)
Estimating human motion in world coordinates (2025)

`HMR 2.0` — 2 videos

Towards the 3D Human Foundation Agent (2024)
CVPR 2024 Workshop on Autonomous Driving (2024)

`DexCap` — 2 videos

Towards the 3D Human Foundation Agent (2024)
Human Motion Generation (HuMoGen) Workshop (2024)

`EGO-EXO4D` — 2 videos

`Med-PaLM` — 2 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
Foundation Models in Radiology (2025)

`Med-PaLM 2` — 2 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
Foundation Models in Radiology (2025)

`Med-Gemini` — 2 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
Foundation Models in Radiology (2025)

`UNet++` — 2 videos

The 13th Women in Computer Vision (WiCV) Workshop (2024)
CV4MS @ CVPR 2024 (2024)

`TVQA` — 2 videos

`MovieChat` — 2 videos

`MoRevQA` — 2 videos

`JCEF` — 2 videos

`EPIC-KITCHENS` — 2 videos

`Omnivore` — 2 videos

`VIT` — 2 videos

`DUST3R` — 2 videos

`Ego-Exo 4D` — 2 videos

`BlockNeRF` — 2 videos

`ZipNeRF` — 2 videos

`Total-Recon` — 2 videos

`L4GM` — 2 videos

`Vicuna` — 2 videos

`LGM` — 2 videos

`LATTE3D` — 2 videos

`Quest 3` — 2 videos

`Gaussian Splatting (3DGS)` — 2 videos

`OpenNeRF` — 2 videos

`DeepMetaHandles` — 2 videos

`MOTR` — 2 videos

`CLIP text encoder` — 2 videos

Human Motion Generation (HuMoGen) Workshop (2024)
CVPR 2024 Workshop (2024)

`PhysicsVAE [Won et al. SIGGRAPH 2022]` — 2 videos

Human Motion Generation (HuMoGen) Workshop (2024)
Human Motion Generation (HuMoGen) Workshop (2024)

`EgoGen` — 2 videos

Human Motion Generation (HuMoGen) Workshop (2024)
Estimating human motion in world coordinates (2025)

`Emu (Meta)` — 2 videos

Panel Discussion on AI, Art, and Creativity (2024)
CVPR 2024 Workshop (2024)

`Imagen (Google)` — 2 videos

Panel Discussion on AI, Art, and Creativity (2024)
CVPR 2024 Workshop (2024)

`Firefly (Adobe)` — 2 videos

Panel Discussion on AI, Art, and Creativity (2024)
CVPR 2024 Workshop (2024)

`SDXL (Stability.ai)` — 2 videos

Panel Discussion on AI, Art, and Creativity (2024)
CVPR 2024 Workshop (2024)

`Variational Score Distillation` — 2 videos

Panel Discussion on AI, Art, and Creativity (2024)
3D Foundation Models for Physical Intelligence (2024)

`SCAMP-7` — 2 videos

`YOLOv8m` — 2 videos

The 20th Embedded Vision Workshop (EVW2024) (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`YOLOv8x` — 2 videos

The 20th Embedded Vision Workshop (EVW2024) (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`DeepStream` — 2 videos

The 20th Embedded Vision Workshop (EVW2024) (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`Mapillary` — 2 videos

OmniCV 2024 Workshop (2024)
ViLMa Visual Localization and Mapping (2024)

`Neural Radiance Fields (NeRFs)` — 2 videos

OmniCV 2024 Workshop (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`L2 loss` — 2 videos

PBVS 2024 Workshop: Challenges and Results (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`LSTMs` — 2 videos

`FISTA` — 2 videos

`FCN` — 2 videos

`Self-Attention` — 2 videos

`RandAugment` — 2 videos

`AugMix` — 2 videos

LatinX in Computer Vision (LXCV) at CVPR 2024 Workshop (2024)
Synthetic Data for CV (2024)

`YOLO V8` — 2 videos

`VGG19` — 2 videos

`MobileNet` — 2 videos

`NASNet` — 2 videos

`PGD` — 2 videos

`Swin-Transformer` — 2 videos

`SimMIM` — 2 videos

`Firefly` — 2 videos

`X-ray` — 2 videos

`Grounded-SAM` — 2 videos

`DepthFM` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`LCM-LoRA` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`CFM` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
23675 First Workshop on Efficient and On Device Generation EDGE (2024)

`Claude` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
CV4MS @ CVPR 2024 (2024)

`OWLv2` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
3D scene understanding for interactive agents (2025)

`ARC` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
Neuromorphic vision for humanoid robots (2025)

`SIGMA` — 2 videos

ReGenAI Workshop CVPR 2024 (2024)
Second Egocentric Vision (EgoVis) Workshop (2025)

`GPT-1` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
From Multimodal LLM to Human-level AI (2024)

`D-RFCN + SNIP` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`NAS-FPN` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`DyHead` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`FocalNet-H (DINO)` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`T-MARS` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`CLIPA-v2` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Synthetic Data for CV (2024)

`Qwen-VL-Plus` — 2 videos

`GLIGEN` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
Diffusion-based Video Generative Models (2024)

`ImageReward` — 2 videos

`PickScore` — 2 videos

`HPSv2` — 2 videos

`MMD (Maximum Mean Discrepancy)` — 2 videos

CVPR 2024 Workshop: Multimodal Foundation Models (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)

`Structure from Motion (SfM)` — 2 videos

`S-GAN` — 2 videos

CVPR 2024 Workshop (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`Transformer Decoder` — 2 videos

CVPR 2024 Workshop (2024)
Machine Learning for Geometric Shape Analysis (2024)

`Self-attention` — 2 videos

CVPR 2024 Workshop (2024)
The missing rungs on the ladder to general AI (2025)

`FFN` — 2 videos

CVPR 2024 Workshop (2024)
CVPR 2025 - 2nd Workshop on Neural Fields Beyond Conventional Cameras (2025)

`PixSFM` — 2 videos

CVPR MetaFood Workshop (2024)
CVPR MetaFood Workshop (2024)

`XMem` — 2 videos

CVPR MetaFood Workshop (2024)
CVPR MetaFood Workshop (2024)

`FoodLearner` — 2 videos

CVPR MetaFood Workshop (2024)
CVPR MetaFood Workshop (2024)

`Image-Informed Text Encoder` — 2 videos

CVPR MetaFood Workshop (2024)
CVPR MetaFood Workshop (2024)

`Graph Neural Network` — 2 videos

CVPR MetaFood Workshop (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)

`Intrinsic-LoRA (I-LoRA)` — 2 videos

CVPR 2024 Workshop (2024)
Dataset Distillation: A Comprehensive Review (2024)

`DINO-v2` — 2 videos

CVPR 2024 Workshop (2024)
2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)

`StyleGAN v2` — 2 videos

CVPR 2024 Workshop (2024)
Dataset Distillation: A Comprehensive Review (2024)

`StyleGAN-XL` — 2 videos

CVPR 2024 Workshop (2024)
Dataset Distillation: A Comprehensive Review (2024)

`Pix2Video` — 2 videos

CVPR 2024 Workshop (2024)
The First Workshop on AI for 3D Generation (2024)

`Control-A-Video` — 2 videos

`Layered Neural Atlases` — 2 videos

`MVImgNet` — 2 videos

CVPR 2024 Workshop (2024)
3D Foundation Models for Physical Intelligence (2024)

`LRM (Large Reconstruction Model)` — 2 videos

CVPR 2024 Workshop (2024)
The First Workshop on AI for 3D Generation (2024)

`EmuEdit` — 2 videos

`Parti` — 2 videos

`CIFAR` — 2 videos

`VDM++` — 2 videos

`RIN` — 2 videos

`CTM` — 2 videos

`Diff-Instruct` — 2 videos

`PerFlow` — 2 videos

`UFO-Gen` — 2 videos

`InstaFlow-1.7B` — 2 videos

`Lumiere` — 2 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
Diffusion-based Video Generative Models (2024)

`DreamBooth3D` — 2 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
The First Workshop on AI for 3D Generation (2024)

`Alchemist` — 2 videos

GenAI Media Generation Challenge Workshop @ CVPR (2024)
The First Workshop on AI for 3D Generation (2024)

`Custom Diffusion` — 2 videos

`Dreambooth` — 2 videos

`BLIPDiffusion` — 2 videos

`SUTI` — 2 videos

`E4T-Diffusion` — 2 videos

`AnyDoor` — 2 videos

`FastComposer` — 2 videos

`MTGS` — 2 videos

`Panoptic Segmentation` — 2 videos

`Blender` — 2 videos

`Unreal Engine` — 2 videos

`Gaussian Splats` — 2 videos

Virtual Try-On Workshop (2024)
The First Workshop on AI for 3D Generation (2024)

`CLIP score` — 2 videos

`BYOL` — 2 videos

`LingoQA` — 2 videos

`TransFuser` — 2 videos

`ADriver-I` — 2 videos

`Argoverse 2` — 2 videos

`OpenPilot` — 2 videos

`Variational Autoencoders (VAEs)` — 2 videos

`Energy-based models` — 2 videos

`Contrastive Energy Model` — 2 videos

`Score Denoising` — 2 videos

`Product of Expert` — 2 videos

`HuggingGPT` — 2 videos

`DDIM` — 2 videos

`Snap Video` — 2 videos

Diffusion-based Video Generative Models (2024)
The First Workshop on AI for 3D Generation (2024)

`FVD` — 2 videos

`Safe Diffusion` — 2 videos

`MSR-VTT` — 2 videos

Diffusion-based Video Generative Models (2024)
From Multimodal LLM to Human-level AI (2024)

`ViT-B` — 2 videos

`NegGrad` — 2 videos

`KPConv` — 2 videos

`Transformer-XL` — 2 videos

All You Need to Know about Point Cloud Understanding (2024)
CVPR 2024 Workshop (2024)

`SAN` — 2 videos

`PointNext` — 2 videos

`Gemini 1.5 Pro` — 2 videos

`Gemini 1.5 Flash` — 2 videos

`Qwen-VL-Chat` — 2 videos

`POPE` — 2 videos

`CUDA` — 2 videos

`cuBLAS` — 2 videos

CVPR 2024 Tutorial on Full-stack Acceleration of Deep Learning (2024)
CVPR 2024 Tutorial (2024)

`RISE` — 2 videos

`Mahalanobis` — 2 videos

`LRP` — 2 videos

`CelebA-HQ` — 2 videos

`Masked Transformer` — 2 videos

`4D-fy` — 2 videos

`Co-Tracker` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`DIBR` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`DMTet` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`DatasetGAN` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`DefGrid` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`DefTet` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Dr. Robot` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Dream Machine` — 2 videos

`DreamCraft3D` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Dreamitate` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Fantasia3D` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`FlexiCubes` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`GPT4` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`GRAF` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`GaussianDreamer` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`MipNeRF360` — 2 videos

`SV3D` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Text2Tex` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`UniDepth` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Zero123` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`Zero123-XL` — 2 videos

The First Workshop on AI for 3D Generation (2024)
3D Foundation Models for Physical Intelligence (2024)

`BPNet` — 2 videos

`BundleFusion` — 2 videos

`CoTracker` — 2 videos

`GenZI` — 2 videos

`GeoNet` — 2 videos

`LVIS` — 2 videos

`Lidar` — 2 videos

2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Synthetic Data for CV (2024)

`Mip-NeRF` — 2 videos

`Neural Feature Fusion Fields (N3F)` — 2 videos

`Neural Radiance Fields (NeRF)` — 2 videos

2nd Workshop on Compositional 3D Vision (C3DV) and 3DCoMPaT challenge (2024)
Synthetic Data for CV (2024)

`Panoptic Lifting` — 2 videos

`SAM (Segment Anything Model)` — 2 videos

`SAPIEN` — 2 videos

`SMAL` — 2 videos

`SceneScript` — 2 videos

`ConsistDreamer` — 2 videos

`FocalNet-Huge` — 2 videos

`GCC-PHAT` — 2 videos

`GPT-4V(ision)` — 2 videos

`GelSight` — 2 videos

`IID` — 2 videos

`LLaVA-NeXT` — 2 videos

`MQ-GLIP` — 2 videos

`MSVD-QA` — 2 videos

`MixPL` — 2 videos

`MonoCLR` — 2 videos

`OPT` — 2 videos

`Ours-L2R` — 2 videos

`PSG` — 2 videos

`PartCLIPSeg` — 2 videos

`REACT` — 2 videos

`RichSem-DINO-FocalNet` — 2 videos

`SD-XL` — 2 videos

`StereoCRW` — 2 videos

`Superglue` — 2 videos

`DROID dataset` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
World Modeling Challenge (2025)

`GENIE` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
World Modeling Challenge (2025)

`Grounding DINO` — 2 videos

`HMD2` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
ViLMa Visual Localization and Mapping (2024)

`HoloDeck` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
3D Foundation Models for Physical Intelligence (2024)

`Nymeria` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
ViLMa Visual Localization and Mapping (2024)

`Octo 55B` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
World Modeling Challenge (2025)

`Octo 93M` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
World Modeling Challenge (2025)

`PartNet-Mobility` — 2 videos

`Project Aria` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
ViLMa Visual Localization and Mapping (2024)

`Python` — 2 videos

`RT-1-X` — 2 videos

`RoboNet` — 2 videos

23598 The 5th Annual Embodied AI Workshop (2024)
Generalization via Scaling Robotics (2025)

`ACE` — 2 videos

CVPR 2024 Workshop (2024)
ViLMa Visual Localization and Mapping (2024)

`Active learning` — 2 videos

CVPR 2024 Workshop (2024)
Solving Real-World Challenges of Large-Scale AV Deployment (2025)

`CAF` — 2 videos

CVPR 2024 Workshop (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`EWC` — 2 videos

CVPR 2024 Workshop (2024)
IEEE CVPR workshop on Fair, Data Efficient and Trusted Computer Vision (2024)

`EfficientNet-B0` — 2 videos

`Fine-tuning` — 2 videos

CVPR 2024 Workshop (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`FixMatch` — 2 videos

`Joint Training` — 2 videos

CVPR 2024 Workshop (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)

`LIME` — 2 videos

CVPR 2024 Workshop (2024)
The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)

`Large Language Models` — 2 videos

CVPR 2024 Workshop (2024)
3D Foundation Models for Physical Intelligence (2024)

`SID` — 2 videos

`WordNet` — 2 videos

CVPR 2024 Workshop (2024)
Synthetic Data for CV (2024)

`FPN` — 2 videos

`HaMeR` — 2 videos

`LLaMA-2` — 2 videos

`Mamba` — 2 videos

`RoBERTa` — 2 videos

`SlowFast` — 2 videos

`TSN` — 2 videos

`TimeSformer` — 2 videos

`Contrastive Loss` — 2 videos

`DSO` — 2 videos

ViLMa Visual Localization and Mapping (2024)
Visual-Inertial Odometry for Small-sized Robots (2024)

`OpenMask3D` — 2 videos

`PoseNet` — 2 videos

`RealEstate10K` — 2 videos

ViLMa Visual Localization and Mapping (2024)
3D Foundation Models for Physical Intelligence (2024)

`SceneFun3D` — 2 videos

`VIO` — 2 videos

`VLAD` — 2 videos

ViLMa Visual Localization and Mapping (2024)
CV4MS @ CVPR 2024 (2024)

`AutoDAN` — 2 videos

`Generative Adversarial Networks (GANs)` — 2 videos

`Llama 2` — 2 videos

`Llama 3` — 2 videos

Dataset Distillation: A Comprehensive Review (2024)
Foundation Models in Radiology (2025)

`MiniGPT4` — 2 videos

Dataset Distillation: A Comprehensive Review (2024)
From Multimodal LLM to Human-level AI (2024)

`S3D` — 2 videos

Dataset Distillation: A Comprehensive Review (2024)
Synthetic Data for CV (2024)

`Stable Signature` — 2 videos

`StegaStamp` — 2 videos

`CFLOW` — 2 videos

`Cross-entropy loss` — 2 videos

`CutPaste` — 2 videos

`DMR` — 2 videos

`Entropy` — 2 videos

`MVtec AD` — 2 videos

`Masked Auto-Encoder (MAE)` — 2 videos

Scalable Real-Time Abnormal Event Detection (2024)
Generalization via Scaling Robotics (2025)

`PaDiM` — 2 videos

`PatchCore` — 2 videos

`SPADE` — 2 videos

`ST-GCN` — 2 videos

`STG-NF` — 2 videos

`VisA` — 2 videos

`Ablation-CAM` — 2 videos

`Backpropagation` — 2 videos

`Clustering` — 2 videos

`SEEM` — 2 videos

The 3rd Explainable AI for Computer Vision (XAI4CV) Workshop @ CVPR 2024 (2024)
CV4MS @ CVPR 2024 (2024)

`Score-CAM` — 2 videos

`FullGrad` — 2 videos

`SFT` — 2 videos

`Safe Latent Diffusion` — 2 videos

`EfficientNet` — 2 videos

`HuBERT` — 2 videos

`TCN` — 2 videos

`UniformerV2` — 2 videos

`Whisper` — 2 videos

`CC12M` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
Synthetic Data for CV (2024)

`CC3M` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
Synthetic Data for CV (2024)

`CLIPSeg` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
Foundational Few-Shot Object Detection Challenge (2025)

`CogAgent` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
From Multimodal LLM to Human-level AI (2024)

`Conceptual Captions` — 2 videos

`DeepSeek-VL` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
From Multimodal LLM to Human-level AI (2024)

`GQA` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
From Multimodal LLM to Human-level AI (2024)

`PaLI-X` — 2 videos

VizWiz Grand Challenge: Opening Remarks (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`ViLT` — 2 videos

`VizWiz-VQA` — 2 videos

`DeepLabV3` — 2 videos

`Occupancy Network` — 2 videos

`SIREN` — 2 videos

`4D-Occ` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`Bucket Normalized EPE` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Argoverse Competitions 2025 (2025)

`Ego-MLP` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Foundation models For autonomous driving (2025)

`FocalFormer3D` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Argoverse Competitions 2025 (2025)

`IoU` — 2 videos

`Lite-QCNet` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Argoverse Competitions 2025 (2025)

`PDM Score` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Foundation Models for Autonomous Systems Workshop (2024)

`QCNet` — 2 videos

CVPR 2024 Workshop on Autonomous Driving (2024)
Argoverse Competitions 2025 (2025)

`TrackFlow` — 2 videos

`ALOHA` — 2 videos

`Chamfer Distance` — 2 videos

`Graph Visual Question Answering` — 2 videos

`Masked Autoencoder` — 2 videos

`PaLM-E` — 2 videos

`SpatialVLM` — 2 videos

`EDSR` — 2 videos

`SRGAN` — 2 videos

Mobile Intelligent Photography and Imaging (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`StableSR` — 2 videos

Mobile Intelligent Photography and Imaging (2024)
THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)

`SwinIR` — 2 videos

`VDSR` — 2 videos

`3D Gaussian Splatting (3DGS)` — 2 videos

`AdaLN` — 2 videos

`Ctrl-Sim` — 2 videos

`DriveVLM-Dual` — 2 videos

`MARS` — 2 videos

`Nerfacto` — 2 videos

`NeuRas` — 2 videos

`JFT-300M` — 2 videos

`APS` — 2 videos

`BIM` — 2 videos

`MaskGit` — 2 videos

`Vision Transformers (ViT)` — 2 videos

`YFCC100M` — 2 videos

`EV-IMO` — 2 videos

`EVDodge` — 2 videos

`ICP` — 2 videos

`Kalman filter` — 2 videos

`ORB-SLAM` — 2 videos

Visual-Inertial Odometry for Small-sized Robots (2024)
Event-Based SLAM at Slamcore (2025)

`ReLU` — 2 videos

`Autoencoder` — 2 videos

AI4Space 2024 Workshop (2024)
Scalable Autonomous Driving via Fully Data-driven Simulation (2025)

`Monte Carlo Tree Search (MCTS)` — 2 videos

AI4Space 2024 Workshop (2024)
THE BITTER LESSON FOR RL: VERIFICATION AS THE KEY TO REASONING LLMS (2025)

`Neural-Fly` — 2 videos

`DreamGaussian` — 2 videos

`One-2-3-45` — 2 videos

`VITPose` — 2 videos

`F1 Score` — 2 videos

`IM-Net` — 2 videos

`Optimal Transport` — 2 videos

`Data augmentation` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`DeepSORT` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`LLaMA2` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
From Multimodal LLM to Human-level AI (2024)

`MLP Projector` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
23713 Towards Building AGI in Autonomy and Robotics (2024)

`MOTA` — 2 videos

`NAFNet` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`Photogrammetry` — 2 videos

`Pseudo-labeling` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`Qwen-VL` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
From Multimodal LLM to Human-level AI (2024)

`Test-time augmentation` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`Video Swin Transformer` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
DEF-AI-MIA Workshop at CVPR 2024 (2024)

`Weighted Box Fusion (WBF)` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
Argoverse Competitions 2025 (2025)

`YOLOv5` — 2 videos

THE 8TH AI CITY CHALLENGE @ CVPR 2024 (2024)
CVsports Workshop at CVPR 2024, Seattle (2024)

`Active Learning` — 2 videos

DEF-AI-MIA Workshop at CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`FPN (Feature Pyramid Network)` — 2 videos

DEF-AI-MIA Workshop at CVPR 2024 (2024)
Industrial DVS Design; Key Features and Applications (2025)

`Image Captioning` — 2 videos

`GPR` — 2 videos

4th Workshop on Computer Vision in the Built Environment (2024)
CV4MS @ CVPR 2024 (2024)

`BioGPT` — 2 videos

`CONCH` — 2 videos

`EfficientNet-B7` — 2 videos

`Vision Transformer (ViT)` — 2 videos

`iBOT` — 2 videos

`DCLM-Baseline` — 2 videos

Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`DataComp-1B` — 2 videos

Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`DataComp-LM` — 2 videos

Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`Mistral-7B-v0.3` — 2 videos

Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`WIT` — 2 videos

Black-box Adversarial Attacks on Vision Foundation Models (2024)
Synthetic Data for CV (2024)

`BiomedParse` — 2 videos

CV4MS @ CVPR 2024 (2024)
Learning the Language of Patients (2025)

`Convolutional Neural Network` — 2 videos

`DeepLabV3+` — 2 videos

CV4MS @ CVPR 2024 (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`PSPNet` — 2 videos

CV4MS @ CVPR 2024 (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`Plenoxels` — 2 videos

`Stable Video Diffusion` — 2 videos

3D Foundation Models for Physical Intelligence (2024)
World Modeling Challenge (2025)

`Computer Vision` — 2 videos

CVsports Workshop at CVPR 2024, Seattle (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`Generative AI` — 2 videos

CVsports Workshop at CVPR 2024, Seattle (2024)
CVPR 2024 - Invited Speakers - Chris Padwick (2024)

`Transformer decoder` — 2 videos

CVsports Workshop at CVPR 2024, Seattle (2024)
CVPR 2025 Workshop on Autonomous Driving (2025)

`3D-GPT` — 2 videos

Synthetic Data for CV (2024)
From Multimodal LLM to Human-level AI (2024)

`DeepSeek` — 2 videos

Synthetic Data for CV (2024)
AI agents in cancer research and oncology (2025)

`Phi-3` — 2 videos

`RAFT-Stereo` — 2 videos

Synthetic Data for CV (2024)
Deep Stereo Matching in the Twenties (2024)

`NNCF` — 2 videos

`ONNX` — 2 videos

CVPR 2024 - Invited Speakers - Chris Padwick (2024)
Mobile AI Workshop 2025: Introductory Talk (2025)

`Self-supervised learning` — 2 videos

CVPR 2024 - Invited Speakers - Chris Padwick (2024)
AI agents in cancer research and oncology (2025)

`CLIP encoder` — 2 videos

`DriveGAN` — 2 videos

`OpenVLA` — 2 videos

`S2Net` — 2 videos

`BehaviorNet` — 2 videos

`IDM` — 2 videos

`P3` — 2 videos

`Pascal VOC` — 2 videos

`AutoGPT` — 2 videos

`Point-Bind` — 2 videos

From Multimodal LLM to Human-level AI (2024)
Cross-Modal 3D Scene Understanding (2025)

`Video-ChatGPT` — 2 videos

From Multimodal LLM to Human-level AI (2024)
Second Egocentric Vision (EgoVis) Workshop (2025)

`Voyager` — 2 videos

`Dynamic Vision Sensor (DVS)` — 2 videos

`MVSEC dataset` — 2 videos

`EVO` — 2 videos

`DHP19` — 2 videos

`Harris score` — 2 videos

`Spiking Neural Network` — 2 videos

`N-MNIST dataset` — 2 videos

`CPS` — 2 videos

`PS-MT` — 2 videos

`CCVC` — 2 videos

`FlowNet` — 2 videos

`ESIM (Event Camera Simulator)` — 2 videos

`Convolutional Neural Networks (CNNs)` — 2 videos

`FPGA` — 2 videos

`IMU` — 2 videos

From Event-Based Visions to Real Systems (2025)
The development of the DVS and DAVIS sensors (2025)

`libcaer` — 2 videos

`Loihi` — 2 videos

`N-MNIST` — 2 videos

`MonoSLAM` — 2 videos

Event-Based SLAM at Slamcore (2025)
Reconstruction, Motion Estimation and SLAM from Events (2025)

`PTAM` — 2 videos

Event-Based SLAM at Slamcore (2025)
Event-based Algorithms for Robust and High-speed Robotics (2025)

`iniLabs DVS128` — 2 videos

`Uni-NLX` — 2 videos

`SelfGraphVQA` — 2 videos

`Neural Networks` — 2 videos

`HMAX` — 2 videos

`DNN` — 2 videos

`FastFlow3D` — 2 videos

Argoverse Competitions 2025 (2025)
Perception and simulation for self-driving vehicles (2025)

`Gabor filters` — 2 videos

Event-Driven Convolution-Based Processing (2025)
Event-Driven Sensing for a Humanoid Robot (2025)

`TRAM` — 2 videos

`VisProg` — 2 videos

`DVS128` — 2 videos

Bio-Inspired Embedded Event-based Visual Processing (2025)
Novel Hardware for Spatial AI (2025)

`SemanticFusion` — 2 videos

Novel Hardware for Spatial AI (2025)
Reconstruction, Motion Estimation and SLAM from Events (2025)

`Brainchip` — 2 videos

Novel Hardware for Spatial AI (2025)
Reconstruction, Motion Estimation and SLAM from Events (2025)

`Particle Filter` — 2 videos

`Hough Transform` — 2 videos

`Gaussian blur` — 2 videos

Fusing Frame and Event data for High Dynamic Range Video (2025)
World Modeling Challenge (2025)

`VideoMimic` — 2 videos

`Normalizing Flows` — 2 videos

`EMMA` — 2 videos

`BiomedCLIP` — 2 videos

Learning the Language of Patients (2025)
Multimodal, Generative, and Agentic AI for Pathology (2025)

Methods / Models / Datasets — Cross-Reference

CLIP — 57 videos

GPT-4 — 32 videos

ImageNet — 29 videos

NeRF — 23 videos

Stable Diffusion — 22 videos

SAM — 22 videos

DINO — 19 videos

ResNet — 18 videos

Transformer — 18 videos

DINOv2 — 17 videos

ResNet50 — 16 videos

GPT-4V — 16 videos

BERT — 16 videos

GPT-4o — 16 videos

GPT-3 — 15 videos

Sora — 15 videos

ControlNet — 15 videos

LoRA — 14 videos

ResNet-50 — 14 videos

ChatGPT — 13 videos

AlexNet — 13 videos

LLaVA — 13 videos

CNN — 13 videos

MLP — 12 videos

GPT-2 — 11 videos

U-Net — 10 videos

COCO — 10 videos

DALL-E — 10 videos

ViT — 10 videos

Gaussian Splatting — 10 videos

Gemini — 10 videos

DreamFusion — 9 videos

RANSAC — 9 videos

BLIP-2 — 9 videos

PointNet — 9 videos

DALL-E 3 — 9 videos

Flamingo — 9 videos

Midjourney — 9 videos

ResNet18 — 9 videos

MNIST — 9 videos

CARLA — 9 videos

GPT-3.5 — 8 videos

VQ-VAE — 8 videos

VQ-GAN — 8 videos

COLMAP — 8 videos

nuScenes — 8 videos

DDPM — 8 videos

Diffusion Models — 8 videos

SIFT — 8 videos

StyleGAN — 8 videos

InstructBLIP — 8 videos

SimCLR — 8 videos

YOLO — 8 videos

UniAD — 7 videos

Waymo — 7 videos

Imagen — 7 videos

VQGAN — 7 videos

Transformers — 7 videos

Flow Matching — 7 videos

ResNet-18 — 7 videos

VGG16 — 7 videos

Instant3D — 7 videos

LLaMA — 7 videos

BLIP — 7 videos

Grad-CAM — 7 videos

SLAM — 7 videos

GANs — 7 videos

Ego4D — 7 videos

YOLOv8 — 7 videos

DVS — 7 videos

LSTM — 6 videos

PRISM-1 — 6 videos

DriveDreamer — 6 videos

nuPlan — 6 videos

WayveScenes101 — 6 videos

DALL-E 2 — 6 videos

MAE — 6 videos

TensorFlow — 6 videos

ShapeNet — 6 videos

`CLIP` — 57 videos

`GPT-4` — 32 videos

`ImageNet` — 29 videos

`NeRF` — 23 videos

`Stable Diffusion` — 22 videos

`SAM` — 22 videos

`DINO` — 19 videos

`ResNet` — 18 videos

`Transformer` — 18 videos

`DINOv2` — 17 videos

`ResNet50` — 16 videos

`GPT-4V` — 16 videos

`BERT` — 16 videos

`GPT-4o` — 16 videos

`GPT-3` — 15 videos

`Sora` — 15 videos

`ControlNet` — 15 videos

`LoRA` — 14 videos

`ResNet-50` — 14 videos

`ChatGPT` — 13 videos

`AlexNet` — 13 videos

`LLaVA` — 13 videos

`CNN` — 13 videos

`MLP` — 12 videos

`GPT-2` — 11 videos

`U-Net` — 10 videos

`COCO` — 10 videos

`DALL-E` — 10 videos

`ViT` — 10 videos

`Gaussian Splatting` — 10 videos

`Gemini` — 10 videos

`DreamFusion` — 9 videos

`RANSAC` — 9 videos

`BLIP-2` — 9 videos

`PointNet` — 9 videos

`DALL-E 3` — 9 videos

`Flamingo` — 9 videos

`Midjourney` — 9 videos

`ResNet18` — 9 videos

`MNIST` — 9 videos

`CARLA` — 9 videos

`GPT-3.5` — 8 videos

`VQ-VAE` — 8 videos

`VQ-GAN` — 8 videos

`COLMAP` — 8 videos

`nuScenes` — 8 videos

`DDPM` — 8 videos

`Diffusion Models` — 8 videos

`SIFT` — 8 videos

`StyleGAN` — 8 videos

`InstructBLIP` — 8 videos

`SimCLR` — 8 videos

`YOLO` — 8 videos

`UniAD` — 7 videos

`Waymo` — 7 videos

`Imagen` — 7 videos

`VQGAN` — 7 videos

`Transformers` — 7 videos

`Flow Matching` — 7 videos

`ResNet-18` — 7 videos

`VGG16` — 7 videos

`Instant3D` — 7 videos

`LLaMA` — 7 videos

`BLIP` — 7 videos

`Grad-CAM` — 7 videos

`SLAM` — 7 videos

`GANs` — 7 videos

`Ego4D` — 7 videos

`YOLOv8` — 7 videos

`DVS` — 7 videos

`LSTM` — 6 videos

`PRISM-1` — 6 videos

`DriveDreamer` — 6 videos

`nuPlan` — 6 videos

`WayveScenes101` — 6 videos

`DALL-E 2` — 6 videos

`MAE` — 6 videos

`TensorFlow` — 6 videos

`ShapeNet` — 6 videos

`VGG` — 6 videos