Started research on Visual Language Models