Model | RES | GRES | ORES | ||
---|---|---|---|---|---|
w/o <mask-ref> | w/ <mask-ref> | Overall | |||
Prev. SOTA | 77.1 (PSALM) |
67.8 (SAM4MLLM) |
49.6 (GSVA) |
N/A* | N/A* |
Ours | 77.8 | 71.8 | 74.6 | 68.8 | 73.1 |
* No previous methods could understand visual reference prompts