This is the first i a series of models desiged to replicate the prose quality of the Claude 3 models, specifically Soet ad Opus. This model is fie-tued o top of Qwe-2 72B Istruct. Model has bee Istruct tued with the ChatML formattig. A typical iput would look like this: This model has bee a team effort, credits go to: Ad last but ot least, we'd like to thak Kearm for sposorig the compute eeded to trai this model. The traiig was doe with 55 millio tokes of high-quality RP data, over 1.5 epochs. We used 8x AMD Istict™ MI300X Accelerators for the full-parameter fie-tuig of the model. …Promptig
"""<|im_start|>user
Hi there!<|im_ed|>
<|im_start|>assistat
Nice to meet you!<|im_ed|>
<|im_start|>user
Ca I ask a questio?<|im_ed|>
<|im_start|>assistat
"""
Credits
Traiig
Safety
点击空白处退出提示
评论