Elon Musk’s xAI debuts its first multimodal model ‘Grok-1.5 Vision’ in race to better compete with OpenAI

Grok-1.5V is competitive with existing frontier multimodal models in a number of domains

Musk’s AI company continues to upgrade itself and works to keep up with OpenAI and other market leaders

Elon Musk’s xAI has showcased its first multimodal model. In addition to understanding text, the new model is believed to be capable of processing visual information in documents, diagrams, charts, screenshots and photographs. Experts explained that Grok-1.5 Vision, or Grok-1.5V, will soon be available only to early testers and existing Grok users.

A look at Grok-1.5 Vision

The company shared seven examples showcasing Grok-1.5V’s potential. Grok-1.5V can transform a whiteboard sketch of a flowchart into Python code. It can also generate a bedtime story from a kid’s drawing and explain a meme. In addition, the new model can convert a table into CSV file format and identify whether a deck has rotten wood and needs replacing.

“Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs,” Musk explained.

The future ahead

From what is understood, Musk’s AI company continues to upgrade itself and works to keep up with OpenAI and other market leaders. It has been doing so since its chatbot first hit the scene in November 2023. Experts highlighted that Grok-1.5V came less than a month after xAI made its Grok AI open source.

Critics argue that the Grok chatbot has faced a lot of controversy. In early April, researchers reported that the Grok chatbot could instruct users on criminal activities. Moreover, xAI plans to build “beneficial [artificial general intelligence]”. The new model is expected to be capable of understanding the universe. Furthermore, the company said that “significant” updates will be made to Grok AI’s multimodal understanding and generation capabilities in the coming months.

This article was first uploaded on April 14, 2024, at 11:13 am.