In response to the intensifying competition for technological supremacy, Alibaba has unveiled a new artificial intelligence model that it claims is capable of understanding photos and having dialogues that are more complicated than those of the company’s prior products.
The Chinese technology giant reportedly told CNBC that its two new models, Qwen-VL and Qwen-VL-Chat, will be open source. This means that researchers, academics, and businesses from all over the world can use them to develop their own AI apps without having to invest time and money in training their own systems.
Alibaba claimed that Qwen-VL could produce captions for photographs and react to open-ended questions about various images.
In contrast, Qwen-VL-Chat, according to Alibaba, supports more “complex interaction,” such as comparing several image inputs and responding to various rounds of questions.
Alibaba claimed that Qwen-VL-Chat is capable of composing stories, producing images from user-inputted photos, and solving mathematical problems represented as pictures, among other things.
Alibaba used the input of a Chinese-language hospital sign as an example. By analyzing the sign’s image, the AI can respond to queries regarding the whereabouts of specific medical departments.
The majority of generative AI, which is a technology that creates responses depending on user input, has so far concentrated on text response. The most recent iteration of OpenAI’s ChatGPT can also comprehend graphics and respond in text, much like Qwen-VL-Chat.
The two most recent models from Alibaba are based on the Tongyi Qianwen large language model that was released earlier this year. An LLM is an AI model that powers chatbot apps and has been trained on massive amounts of data.
This month, the startup with its headquarters in Hangzhou released two additional AI models. Despite not receiving any licensing fees, Alibaba will benefit from the open-source distribution by gaining additional customers for its AI model at a time when the company’s cloud division is trying to restart growth as it gets ready to go public.