Search images with composite query of image and text instruction. The multimodal embedding takes image and text as input to better express the search intention of users. Combine the powerful embedding like MagicLens and Visualized BGE with Milvus vector database to build smart image search applications!