magnum-72b-v1

Anonymous user, 2024-07-31

Technical information

Source URL
https://modelscope.cn/models/AI-ModelScope/magnum-72b-v1
License
other

Details

This is the first in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. This model is fine-tuned on top of Qwen-2 72B Instruct.

Prompting

The model has been instruct-tuned with ChatML formatting. A typical input looks like this:

"""<|im_start|>user
Hi there!<|im_end|>
<|im_start|>assistant
Nice to meet you!<|im_end|>
<|im_start|>user
Can I ask a question?<|im_end|>
<|im_start|>assistant
"""
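The ChatML layout described above can be produced programmatically. The following is a minimal sketch, not the authors' tooling: the helper name `format_chatml` and its `add_generation_prompt` parameter are hypothetical, chosen here for illustration. It wraps each turn in `<|im_start|>` and `<|im_end|>` and, if requested, leaves an open assistant turn for the model to complete.

```python
def format_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string.

    Hypothetical helper for illustration; not part of any library.
    """
    parts = []
    for message in messages:
        # Each turn: start marker, role, newline, content, end marker, newline.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model writes the next reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "user", "content": "Hi there!"},
    {"role": "assistant", "content": "Nice to meet you!"},
    {"role": "user", "content": "Can I ask a question?"},
]

prompt = format_chatml(messages)
print(prompt)
```

In practice, a tokenizer that ships with a chat template may build this string for you; the manual version is shown only to make the format explicit.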

Credits

This model has been a team effort, and credit goes to:

  • Sao10K for help with (and cleaning up!) the dataset.
  • alpindale for the training.
  • kalomaze for helping with the hyperparameter tuning.
  • Various other people for their continued help as we tuned the parameters and restarted failed runs. In no particular order: Doctor Shotgun, Lucy, Nopm, Mango, and the rest of the Silly Tilly.

And last but not least, we'd like to thank Kearm for sponsoring the compute needed to train this model.

Training

The training was done with 55 million tokens of high-quality RP data, over 1.5 epochs. We used 8x AMD Instinct™ MI300X Accelerators for the full-parameter fine-tuning of the model.
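As a rough back-of-the-envelope check on those figures, the snippet below multiplies the dataset size by the epoch count to estimate how many tokens the model processed in total. It assumes every epoch covers the whole dataset and that the half epoch covers exactly half of it. The variable names are illustrative only.

```python
# Figures taken from the training description above.
DATASET_TOKENS = 55_000_000  # tokens in the fine-tuning dataset
EPOCHS = 1.5                 # passes over the dataset

# Assumption: a fractional epoch covers that fraction of the dataset.
tokens_processed = DATASET_TOKENS * EPOCHS

print(f"{tokens_processed:,.0f} tokens processed")  # 82,500,000 tokens processed
```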

Built with Axolotl

Safety

