xAI’s Grok chatbot has been updated to include a new feature that allows it to interpret and respond to questions about objects in view of a smartphone’s camera, similar to functionalities offered by Google’s Gemini and ChatGPT with real-time vision capabilities.
On Tuesday, xAI announced the introduction of Grok Vision, a feature that allows users to point their smartphone camera at various objects, including products, signs, and documents, and then inquire about them. Currently, Grok Vision is available on the Grok app for iOS users; however, it has not yet been launched on the Grok Android app.
Grok’s voice mode now integrates camera access, enabling users to direct their camera toward an object and ask questions such as, “What am I looking at?” This Vision feature on iOS provides the chatbot with the ability to analyze real-world objects, text, and environments interactively.
Alongside the Vision feature, Grok has introduced additional capabilities such as multilingual audio support and real-time search functions within its voice mode. While these features are available to Android users, they require a subscription to xAI’s SuperGrok plan, priced at $30 per month.
Recently, Grok has been steadily expanding its feature set. Earlier this month, xAI introduced a “memory” feature, enabling Grok to access information from prior conversations. Furthermore, a canvas-like tool was added, allowing for the creation of documents and applications.