Reasoning Segmentation for Images and Videos: A Survey
Submitted to IJCV, 2025
Reasoning Segmentation (RS) segments objects from natural-language queries that require reasoning and knowledge, moving beyond fixed categories or explicit prompts. This survey synthesizes 26 methods, evaluation metrics, and 29 datasets/benchmarks, reviews applications across domains, and outlines current gaps and future research directions.
Recommended citation: Yiqing Shen, Chenjia Li, et al. (2025). "RVTBench: A Benchmark for Visual Reasoning Tasks." arXiv preprint arXiv:2505.18816.
Download Paper | Download Bibtex
