xAI releases Grok-1.5V, a multimodal AI model with vision support
xAI, an artificial intelligence company under Musk, announced the launch of its first multimodal AI model, Grok-1.5V. In addition to its powerful text processing capabilities, Grok can also process various visual information, including documents, charts, screenshots, and photos. In benchmark tests in multiple fields, Grok-1.5V's performance is comparable to existing cutting-edge multimodal models. Especially in xAI's newly launched RealWorldQA benchmark test, Grok surpassed similar models in its ability to understand the real-world space. The RealWorldQA dataset contains more than 700 images and aims to evaluate the basic understanding ability of multimodal models in the physical world. Grok-1.5 will soon be open to early testers and existing users.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
CandyBomb x ROBO: Trade futures to share 340,000 ROBO!
OPNUSDT now launched for pre-market futures trading
Join the BGB holders group—unlock Spring Festival Mystery Boxes to win up to 8888 USDT and merch from Morph
Trading Club Championship (Margin)—Trade to share 58,000 USDT, with up to 3000 USDT per user!
