Kunlun Wanwei joins hands with Nanyang Technological University to rush to issue Q* algorithm: 100-fold improvement of 7B model inference ability

author：Heart of the Machine Pro 2024-06-25 18:22:00

AIxiv is a column that publishes academic and technical content in the heart of the machine. In the past few years, the AI xiv column has received more than 2,000 reports, covering the top laboratories of major universities and enterprises around the world, effectively promoting academic exchanges and dissemination. If you have a great job to share, please feel free to submit or contact us. Submission mailbox:[email protected];[email protected]

Since OpenAI's Q* project was exposed, it has sparked a lot of discussion in the industry. According to the available information, the Q* project is regarded as a major attempt by OpenAI to explore artificial general intelligence (AGI), which is expected to bring revolutionary breakthroughs to AI technology in multiple aspects, including mathematical problem-solving ability, self-learning and self-improvement.

Kunlun Wanwei joins hands with Nanyang Technological University to rush to issue Q* algorithm: 100-fold improvement of 7B model inference ability

Nvidia scientist Jim Fan, Turing Award winner Yann LeCun, and others discuss OpenAI's Q* implementation

Meta scientist Tian Yuandong believes that Q* is a combination of Q-learning and A*, and is naturally suitable for reasoning tasks, especially in mathematical reasoning

However, OpenAI has not disclosed specific details about the Q* algorithm so far, and we don't know exactly how effective it will be.

Kunlun Wanwei has been paying close attention to the trends of Q* since the Q* project was exposed, and set up a research team to try to develop its own Q* algorithm at the first time, hoping to break the blockade of OpenAI and improve the inference ability of existing open-source models.

After months of experimentation, Kunlun and Nanyang Technological University in Singapore have successfully developed an algorithm called Q*, which can significantly improve the inference ability of existing large models. On the GSM8K dataset, Q* helped Llama-2-7b achieve an accuracy rate of 80.8%, surpassing ChatGPT. On the MATH dataset, Q* helped DeepSeek-Math-7b achieve an accuracy rate of 55.4%, surpassing Gemini Ultra. On the MBPP dataset, Q* helped CodeQwen1.5-7b-Chat achieve an accuracy rate of 77.0%, narrowing the programming gap with GPT-4.

论文：Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning
Link to paper: https://arxiv.org/abs/2406.14283

Q* can help small models achieve the inference ability of models with tens or even hundreds of times the number of parameters larger than them, which not only greatly improves the performance of small models, but also significantly reduces the demand for computing resources, bringing new possibilities for the wide application of artificial intelligence and creating a new era of efficient intelligence.

Research has proved that Q* can help a small model with only 7b parameters to achieve the inference ability of a model with a parameter number that is tens or even hundreds of times larger than it, greatly improving the performance of the model and significantly reducing the demand for computing resources. At present, the research of Q* is still in its infancy, and there is still room for further improvement in all aspects of the algorithm. In the future, Kunlun Wanwei will continue to deepen this research, continuously improve the inference ability of domestic open-source models, break the closed-source blockade of OpenAI, and bring new possibilities for the development of cutting-edge artificial intelligence technologies.

Kunlun Wanwei joins hands with Nanyang Technological University to rush to issue Q* algorithm: 100-fold improvement of 7B model inference ability

Read on

South Sea Gold Bead Ring 🌻14.2mm Very Slightly Flawed Intense Gold Color Deep Sea Pearl 18K Gold Diamond Fanta Stone Inlay Gradient Side Stone No. 13 Shank

Nanyang Technological University began to restrict study tours to the campus

18k white gold, gold weight 9g, natural diamond 💎1.72ct, natural emerald 2.46ct, South Sea gold pearl & Australian white, keshi, seedless, wild, whole

Senior Visiting Scholar at Nanyang Technological University, Singapore

CICGF | "Hairun Blue" will make its debut at the 4th CICGF, bringing a clear island South Sea breeze

The finale of "Nanyang Daughter's Love": Three weddings, two reversals, his ending is too tearful

Good guys, CCTV's "Nanyang Daughter's Love" has been broadcast with bad reviews, and the reasons for the audience's bad reviews are surprisingly consistent

Zhang Baozai, a big pirate who was kicked by Huang Feihong, used to be such a great existence in Nanyang|Literary and Historical Banquet

Design double bead bracelet, Australian white and South Sea gold beads, double bead staggered design, pearlescent like a torch, 12-13mm extremely strong light, almost flawless, delicate skin, perfect cold white light with strong gold green 18K gold inlay

Gorgeous Gold Bead Floral Elements Heavy Industry Inlaid Gold Bead Flower Full Diamond Bracelet 15-16mm Thick Gold South Sea Gold Beads Quality Comparable to Tea Gold Zhengyuan Almost Flawless Super Concentrated Gold Mirror Light 18K White Gold 💎4

The cultural activity of "Loyalty and Righteousness to the South Seas" was held in Guan Gong's hometown

Nanyang Na 🌴

130㎡ retro Nanyang style, home of humanities and arts

Lu Xuelin, who is merciful everywhere in "Nanyang Daughter's Love", has harmed three women with his romantic debts

140㎡ Shanghai-style retro Nanyang-style two-child home

What makes a good large model? Jiazi light year

Baidu released Wenxin Model 4.0 Turbo, which is officially open to users on multiple terminals

Baidu released Wenxin large model 4.0 Turbo: faster and better results

WAIC Observations | AI models are no longer outstanding, and humanoid robots have become "fragrant and sweet"

Special report on large models: from the technical path, look at the road of counterattack of domestic large models

Research on small-scale LID layout optimization model based on GWO-PSO algorithm

The baseline qHBsAg level model predicts the functional cure rate of patients treated with CHB with PEG-IFN

If the large model wants to make money, it must first pass these seven difficulties

What are the difficulties in the large-scale model landing test? 2024 WAIC

To create a benchmark for the landing and application of vertical large models, the Honeynest Government Affairs Large Model 3.0 was unveiled at WAIC 2024

Use the Pangu model! China's first open-source Hongmeng humanoid robot Kuafu debuted at WAIC

Overview of large models: A 10,000-word long article that explains in detail the principles, applications, and future trends of AI large models

SenseTime released a controllable portrait video model Vimi丨Gates: White-collar workers were replaced by AI earlier than blue-collar workers

Marketing Director Work Model PPT (PDF Version)

An open-source project that introduces the basics of large models: so-large-lm

Robin Li debates the AI circle: model feast or application is king?