End-to-End Referring Video Object Segmentation with Multimodal Transformers Github: https://github.com/mttr2021/MTTR Paper: https://arxiv.org/abs/2111.14821v1 Dataset: https://kgavrilyuk.github.io/publication/actor_action/ @ai_machinelearning_big_data 3.0K views13:21