Get Mystery Box with random crypto!

ImageBind: One Embedding Space To Bind Them All. PyTorch impl | Artificial Intelligence && Deep Learning

ImageBind: One Embedding Space To Bind Them All.

PyTorch implementation and pretrained models for ImageBind. For details, see the paper: ImageBind: One Embedding Space To Bind Them All.

ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.

Source code: https://github.com/facebookresearch/imagebind

Paper: https://arxiv.org/pdf/2305.05665v1.pdf

@deeplearning_ai