Vision-Language Models: Towards Multi-Modal Deep Learning
By hi@aiweekly.co.in
A review of state-of-the-art vision-language models such as CLIP, DALLE, ALIGN and SimVL