Ovis1.6-Gemma2-9B: A Breakthrough in Multimodal AI Technology by Aidc-AI

By Horay AI Team|

AIDC-AI is the AI team at Alibaba International Digital Commerce Group. Released by Aidc-AI in mid-September 2024, Ovis1.6-gemma2-9b is a groundbreaking Multimodal Large Language Model (MLLM) that has already made great waves among the AI community. With its open-source nature, this model has drawn attention to its versatility and advanced capabilities. Ovis1.6-gemma2-9b scored exceptionally high for its performance on OpenCompass, a respected benchmark for LLMs. It racks up ahead of a number of mainstream open-source models such as MiniCPM-V-2.6, Qwen2VL-7B and InternVL2-26B, and ranks first among open-source models with less than 30 billion parameters. Thus, Ovis1.6-gemma2-9b has consequently stood out as a robust AI tool designed to cater to various industries and applications.

Main Functions of Ovis1.6-gemma2-9b

Some cases listed

1. Maths: When given a picture of a math problem, Ovis 1.6-Gemma2-9B efficiently extracts the text from the image and provides a detailed, accurate solution within a short time.Apart from that, all the mathematical expressions are well and coherently shown through the solutions.

ovis1.6-gemma

2. Food: The model easily identified the type of food in the picture and then provided detailed steps to prepare it, based on the prompt entered.

ovis1.6-gemma

Advantages of Ovis1.6-gemma2-9b

Application of Ovis in Cross-Border E-Commerce

Aidge (Alibaba International AI Platform) is recognized as the e-commerce platform for Alibaba, technologically supported by Aidc-AI. However, international e-commerce faces many challenges: navigating complex overseas markets, high operational costs, competitive pressure, and a lot more. To make this much easier, AIGC technologies like MLLMs can be particularly helpful to provide effective solutions in order to reduce costs and improve efficiency.

For example, one major issue in overseas e-commerce is to manage returns and refunds, which greatly impacts both users and merchants experience. Refunds and return audits were previously done manually, requiring significant labour force, time, and sometimes even leading to inconsistent decisions due to subjective judgment.

Now with Ovis, Aidc-AI has developed an intelligent refund system that leverages the company’s vast e-commerce knowledge. Ovis is able to process user-submitted images and videos related to refund claims, providing fast and consistent assessments. This consequently results in a cost-effective, efficient solution that ensures fair treatment for both consumers and merchants.

In addition to Ovis, Aidc-AI has also developed other advanced tools like the multi-language model Marco and the e-commerce-focused MLLM MarcoVL, offering a lot more MaaS (Model-as-a-Service) capabilities, such as:

Therefore, AI has fundamentally transformed how merchants operate and how customers purchase, greatly boosting productivity and reducing costs. For platforms like Aidge, these AI-driven capabilities have become a key competitive advantage.

Conclusion

As AI continues to evolve, MLLMs like Ovis1.6-gemma2-9b are likely to become more integrated into our daily lives, providing seamless assistance across industries. There are a wide range of application scenarios for MLLMs, including automated driving, medical diagnosis, video content understanding, image description generation, and visual Q&A. These rounded applications have made Ovis1.6-gemma2-9b a bright future. From mathematical reasoning to complex task processing, its range of capabilities makes it an invaluable asset in various industries.

Whether you’re a content creator, business leader, or developer, staying updated on what this model can do will allow you to harness its full potential. Please keep an eye on Ovis1.6-gemma2-9b as it continues to push the boundaries of what AI can achieve.

Get Start Now