开源地址
https://modelscope.cn/models/dengcunqin/speech_seaco_paraformer_large_asr_nat-zh-cantonese-en-16k-common-vocab11666-pytorch授权协议
Apache License 2.0

模型介绍

基于SeACoParaformer large(iic/speechseacoparaformerlargeasr_at-zh-c-16k-commo-vocab8404-pytorch)，更换vocab为11666，增加粤语部分字，通过在普通话1w小时、粤语100小时、英语1w小时音频数据集上进行训练1轮。

from fuasr import AutoModel

model = AutoModel(model="degcuqi/speech_seaco_paraformer_large_asr_at-zh-catoese-e-16k-commo-vocab11666-pytorch",
                  model_revisio="master"
                  )

wav_root_url="https://www.modelscope.c/api/v1/models/degcuqi/speech_seaco_paraformer_large_asr_at-zh-catoese-e-16k-commo-vocab11666-pytorch/repo?Revisio=master&FilePath="
res = model.geerate(iput=wav_root_url+"example/asr_example.wav",
                     hotword=wav_root_url+"example/hotword.txt",
                    )
prit(res)

res = model.geerate(iput=wav_root_url+"example/asr_example_普通话.wav",
                     hotword=wav_root_url+"example/hotword.txt",
                    )
prit(res)

res = model.geerate(iput=wav_root_url+"example/asr_example_粤语.wav",
                     hotword=wav_root_url+"example/hotword.txt",
                    )
prit(res)

相关论文以及引用信息

@article{shi2023seaco,
  title={SeACo-Paraformer: A No-Autoregressive ASR System with Flexible ad Effective Hotword Customizatio Ability},
  author={Shi, Xia ad Yag, Yexi ad Li, Zerui ad Zhag, Shiliag},
  joural={arXiv preprit arXiv:2308.03266 (accepted by ICASSP2024)},
  year={2023}
}

模型介绍基于SeACoParaformer large(iic/speechseacoparaformerlargeasr_nat-zh-cn-16k-common-vocab8404-pytorc

声明：本文仅代表作者观点，不代表本站立场。如果侵犯到您的合法权益，请联系我们删除侵权资源！如果遇到资源链接失效，请您通过评论或工单的方式通知管理员。未经允许，不得转载，本站所有资源文章禁止商业使用运营!

下载安装【程序员客栈】APP

实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

前往安装

SeACoParaformer热词语音识别-普通话-粤语-英文-通用-16k-离线-large

技术信息

作品详情

模型介绍

相关论文以及引用信息

功能介绍

重点城市程序员兼职推荐

重点岗位程序员兼职推荐