1 paper across 1 session
We present a prompt-based multimodal semantic segmentation method built on a pretrained single-modality RGB model.