FX.co ★ Meta Develops Efficient Language Models For Smartphones

2024-07-09 20:31

Meta Develops Efficient Language Models For Smartphones

Meta (META) has introduced a new, compact artificial intelligence model called MobileLLM, designed specifically for smartphones and devices with limited computational capabilities.

Developed collaboratively by Meta Reality Labs, Meta AI Research (FAIR), and PyTorch, MobileLLM features fewer than one billion parameters.

Yann LeCun, Meta's Chief AI Scientist, highlighted crucial aspects of the research in a post on X/Twitter, stating, "Our findings indicate that, for smaller models, prioritizing depth over width enhances model performance. Furthermore, by leveraging advanced weight-sharing techniques, including embedding sharing, grouped query attention, and block-wise weight sharing, we achieve significant enhancements in weight utilization within storage-constrained scenarios."
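To make the storage argument concrete, here is a rough parameter-count sketch of two of the techniques named in the quote: embedding sharing and grouped query attention. It is not Meta's implementation. The `Config` fields, the three-matrix gated feed-forward layout, and every number used are illustrative assumptions, and normalization weights, biases, and block-wise weight sharing are left out.

```python
from dataclasses import dataclass


@dataclass
class Config:
    """Hypothetical decoder-only transformer shape (illustrative only)."""
    vocab_size: int
    dim: int              # model width
    n_layers: int         # model depth
    n_heads: int          # number of query heads
    n_kv_heads: int       # key/value heads; fewer than n_heads under grouped query attention
    ffn_dim: int          # hidden size of the feed-forward block
    tie_embeddings: bool  # embedding sharing: reuse the input embedding as the output projection


def count_params(cfg: Config) -> int:
    """Approximate weight count, ignoring norms and biases."""
    head_dim = cfg.dim // cfg.n_heads
    # Query and output projections are dim x dim; key and value
    # projections shrink in proportion to the number of KV heads.
    attn = 2 * cfg.dim * cfg.dim + 2 * cfg.dim * (cfg.n_kv_heads * head_dim)
    # Assumption: a gated feed-forward block with three weight matrices.
    ffn = 3 * cfg.dim * cfg.ffn_dim
    embed = cfg.vocab_size * cfg.dim
    # With embedding sharing, the output layer adds no new weights.
    output = 0 if cfg.tie_embeddings else cfg.vocab_size * cfg.dim
    return embed + output + cfg.n_layers * (attn + ffn)


# Made-up shapes: same depth and width, with and without the two techniques.
baseline = Config(32000, 512, 30, 8, 8, 1536, tie_embeddings=False)
compact = Config(32000, 512, 30, 8, 2, 1536, tie_embeddings=True)

print(f"baseline: {count_params(baseline):,} parameters")
print(f"compact:  {count_params(compact):,} parameters")
```

In a model this small, the embedding table is a large share of the total, which is why reusing it for the output layer frees up a noticeable part of the parameter budget that can then go toward extra layers.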

These design choices enabled MobileLLM to outperform previous state-of-the-art models of similar size in benchmark tests, by 2.7 percent at the 125-million-parameter scale and 4.3 percent at the 350-million-parameter scale.

LeCun's post also noted that MobileLLM demonstrates "substantial advancements in zero-shot commonsense reasoning, question answering, and reading comprehension tasks compared to state-of-the-art (SoTA) methods."

Additionally, the researchers demonstrated MobileLLM's effectiveness in chat and API-calling applications.
