Artist's perspective illustrating what LLM can do

News New Tech

Alibaba Cloud open-sources its 7-billion parameter LLM models, commits to open-source community

By Ralph Fajardo

calendar_today August 14, 2023

schedule 3 min read

visibility 190 views

Alibaba Cloud, the digital technology and intelligence backbone of Alibaba Group, today announced its latest contribution to the open-source community by open-sourcing its 7-billion-parameter Large Language Models (LLM), Qwen-7B and Qwen-7B-Chat, through its AI model community ModelScope, and the collaborative AI platform Hugging Face.

IMAGE CREDIT: www.wisecube.ai

Alibaba Cloud introduced its proprietary LLM, Tongyi Qianwen, earlier this year in April. This cutting-edge model, capable of generating human-like content in both Chinese and English, has different model sizes, including seven billion and above parameters. This time, the open-source release includes the pre-trained 7-billion-parameter model, Qwen-7B, and its conversationally fine-tuned version, Qwen-7B-Chat.

In an effort to democratize AI technologies, the models’ code, model weights, and documentation will be freely accessible to academics, researchers, and commercial institutions worldwide. For commercial uses, the models will be free to use for companies with fewer than 100 million monthly active users. Programs with more users can request a license from Alibaba Cloud.

Jingren Zhou, CTO of Alibaba Cloud Intelligence

“By open-sourcing our proprietary large language models, we aim to promote inclusive technologies and enable more developers and SMEs to reap the benefits of generative AI,” said Jingren Zhou, CTO of Alibaba Cloud Intelligence.

“As a determined long-term champion of open-source initiatives, we hope that this open approach can also bring collective wisdom to further help open-source communities thrive,” he added.

The Qwen-7B was pre-trained on over 2 trillion tokens, including Chinese, English, and other multilingual materials, code, and mathematics, covering general and professional fields. Its context length reaches 8K. In training, the Qwen-7B-Chat model was aligned with human instructions.

Both Qwen-7B and Qwen-7B-Chat models can be deployed on cloud and on-premises infrastructures. This enables users to fine-tune the models and build their own high-quality generative models effectively and cost-efficiently.

The pre-trained Qwen-7B model distinguished itself in the Massive Multi-task Language Understanding (MMLU) benchmark, scoring a notable 56.7, outperforming other major pre-trained open-source models with similar scales or even some larger-size models.

This benchmark assesses a text model’s multitask accuracy across 57 varied tasks, encompassing fields such as elementary mathematics, computer science, and law. Moreover, Qwen-7B achieved the highest score among models with equivalent parameters in the leaderboard of C-Eval, a comprehensive Chinese evaluation suite for foundational models. It covers 52 subjects in four major specialties including humanities, social sciences, STEM, and others. Additionally, Qwen-7B reached outstanding performance on benchmarks of mathematics and code generation, such as GSM8K and HumanEval.

Alibaba Cloud’s Qwen-7B model distinguished itself in several benchmarks

In July, Alibaba Cloud also introduced its AI image generator, Tongyi Wanxiang, which was designed to support developers and SMEs in their creative image expression.

The cloud pioneer also unveiled ModelScopeGPT, a versatile framework designed to assist users in performing complex and specialized AI tasks across language, vision, and speech domains by leveraging various AI models on ModelScope.

Launched by Alibaba Cloud last year, ModelScope is an open-source AI model community currently featuring over 1,000 AI models contributed by 20 leading AI institutes.

For more information, please check out the details of Qwen-7B and Qwen-7B-Chat on ModelScope, Hugging Face and GitHub pages.

Interested in this article? You can share this through:

sell Alibaba Cloud, collaborative AI, digital technology, intelligence backbone, large language models, Open-source, Tongyi Qianwen

Ralph Fajardo

Ralph, the Editor-in-Chief of FintechNewsPH.com, brings over 15 years of writing and editorial experience that make him a strong fit to lead the publication’s mission of delivering credible and compelling fintech stories. Before joining FintechNewsPH.com, he served as editor of Hello Philippines, a UK-based news magazine for the Filipino community abroad, where he covered stories on culture, business, and the global Filipino experience. He also contributed as a writer for The International Filipino, profiling Filipinos making an impact worldwide, and later worked as copy editor for Malaya Business Insight, one of the country’s respected business newspapers, where he refined his eye for accuracy, clarity, and style. Ralph’s editorial journey began at the University of the Philippines Diliman, where he was Editor-in-Chief of Kampus Dyornal. There, he developed a keen sense for storytelling that informs and connects — a passion that continues to define his work today. Through the years, Ralph has written across diverse subjects, from finance and technology to culture and communication, consistently weaving insight with narrative depth. His solid newsroom background and commitment to quality journalism position him to guide FintechNewsPH.com in highlighting the stories that shape the country’s rapidly evolving fintech landscape. Discover more about Ralph's professional journey on his LinkedIn profile (https://www.linkedin.com/in/raphael-fajardo-17155491/).

Alibaba Cloud open-sources its 7-billion parameter LLM models, commits to open-source community

Interested in this article? You can share this through:

Ralph Fajardo

Related Articles

PSA e-certificates mark new era for civil registry services in PH

Logitech G brings G321 LIGHTSPEED wireless gaming headset to Filipino gamers

AWS bets on Filipino talent and public impact as AI enters a more autonomous era