πΉ End-to-End Referring Video Object Segmentation with Multimodal Transformers Github: https://github.com/mttr2021/MTTR Paper: https://arxiv.org/abs/2111.14821v1 Dataset: kgavrilyuk.github.io/publicaβ¦r_action @ai_machinelearning_big_data