Amazon is working on an AI model that can process video and images

Amazon is stepping up its AI game with a new generative AI model capable of processing images and videos alongside text.

According to a report by The Information, this development may allow the e-commerce giant to reduce its dependence on external AI technologies, including Anthropic’s Claude chatbot, which is integrated into Amazon Web Services (AWS).

The new large language model (LLM), code-named Olympus, is designed to interpret visual content and enable intuitive search capabilities. For instance, users could find specific moments, such as a game-winning basketball shot, by inputting simple text prompts. This advancement would enhance Amazon’s AI offerings for both consumers and enterprise clients.

The timing of Olympus’ announcement may coincide with next week’s AWS re:Invent conference, a major annual event for Amazon’s cloud computing arm.

Amazon has been ramping up its generative AI efforts to stay competitive with rivals like Google, Microsoft, and OpenAI, which are perceived as frontrunners in the space. Last week, Amazon deepened its ties with OpenAI competitor Anthropic, investing an additional US$4 billion. This mirrors a similar investment made last year to expand AWS’s generative AI capabilities.

By developing in-house tools like Olympus, Amazon appears to be aiming to solidify its position as a key player in the AI landscape, minimizing reliance on external partners while enhancing its AI-powered offerings across cloud services and e-commerce.

Amazon has yet to comment on the reports.

Share this Post:

Accessibility Toolbar