IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme

Event: CVPR Nashville 2025 · Duration: 8 min · ▶ Watch on YouTube

Abstract

Semi-supervised semantic segmentation aims to improve model performance using a small set of labeled images and a large pool of unlabeled data. Current methods often rely on a single model backbone or generate noisy pseudo-labels, struggling to effectively learn both high-level context and local details. This paper proposes IGL-DT, an iterative global-local feature learning framework with a dual-teacher strategy. IGL-DT addresses these issues by combining a Transformer and a CNN to extract complementary global and local knowledge, guided by Global Context Learning, Local Regional Learning, and Discrepancy Learning.

Speakers

Dinh Dai Quan Tran — National Chung Cheng University, AI VIET NAM
Hoang-Thien Nguyen — National Chung Cheng University, AI VIET NAM
Thanh-Huy Nguyen — National Chung Cheng University, AI VIET NAM
Gia-Van To — National Chung Cheng University, AI VIET NAM
Tien-Huy Nguyen — National Chung Cheng University, AI VIET NAM
Quan Nguyen — National Chung Cheng University, AI VIET NAM

Talks (1)

00:00:00 — Dinh Dai Quan Tran: IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework under Limited Annotation Scheme
- Presents IGL-DT, a semi-supervised semantic segmentation framework that iteratively learns global and local features using a dual-teacher strategy to improve performance under limited annotation.

Key Takeaways

IGL-DT is a novel semi-supervised semantic segmentation framework that leverages a dual-teacher strategy with complementary Transformer (global) and CNN (local) backbones.
The framework incorporates Global Context Learning, Local Regional Learning, and Discrepancy Learning to effectively guide the student model and prevent overfitting.
IGL-DT achieves state-of-the-art quantitative results on Pascal VOC and Cityscapes datasets, outperforming strong baselines.
Visual comparisons demonstrate improved segmentation accuracy, especially for challenging cases like small objects, boundaries, and occlusions.
Ablation studies confirm the unique contribution of each loss component and the synergistic benefits of combining global and local feature learning with discrepancy learning.

Methods / Models / Datasets Mentioned

IGL-DT
SwinUnet
ResNet
CPS
PS-MT
U2PL
ST++
CCVC
CorrMatch
Dual-Teacher
Pascal VOC
Cityscapes

Topics

Semi-supervised semantic segmentation · Dual-teacher framework · Global context learning · Local regional learning · Discrepancy learning · Transformer · CNN · Limited annotation · Pseudo-labeling · Iterative learning

Notes

Open for commentary — connections to other work, critiques, follow-up reading.