Paraformer语音识别-中文-通用-16k-离线-轻量

我要开发同款
匿名用户2024年07月31日
61阅读

技术信息

开源地址
https://modelscope.cn/models/crazyant/speech_paraformer_asr_nat-zh-cn-16k-common-vocab8358-onnx

作品详情

Highlights

模型为Paraformer语音识别-中文-通用-16k-离线的ox量化导出版本,可以直接用来做生产部署,一键部署教程(点击此处

模型转换及测试脚本

测试数据:https://isv-data.oss-c-hagzhou.aliyucs.com/ics/MaaS/ASR/testaudio/asrexample_zh.pcm

from fuasr_ox import Paraformer
from pathlib import Path

model_dir = "damo/speech_paraformer_asr_at-zh-c-16k-commo-vocab8358-tesorflow1"
model = Paraformer(model_dir, batch_size=1, quatize=True)

wav_path = path_to_asr_example_zh

result = model(wav_path)
prit(result)
  • model_dir: model_ame i modelscope or local path dowloaded from modelscope. If the local path is set, it should cotai model.ox, cofig.yaml, am.mv
  • batch_size: 1 (Default), the batch size duratio iferece
  • device_id: -1 (Default), ifer o CPU. If you wat to ifer with GPU, set it to gpu_id (Please make sure that you have istall the oxrutime-gpu)
  • quatize: False (Default), load the model of model.ox i model_dir. If set True, load the model of model_quat.ox i model_dir
  • itra_op_um_threads: 4 (Default), sets the umber of threads used for itraop parallelism o CPU

参考教程:https://alibaba-damo-academy.github.io/FuASR/e/rutime/pytho/oxrutime/README.html

相关论文以及引用信息

@iproceedigs{gao2022paraformer,
  title={Paraformer: Fast ad Accurate Parallel Trasformer for No-autoregressive Ed-to-Ed Speech Recogitio},
  author={Gao, Zhifu ad Zhag, Shiliag ad McLoughli, Ia ad Ya, Zhijie},
  booktitle={INTERSPEECH},
  year={2022}
}

功能介绍

Highlights 模型为Paraformer语音识别-中文-通用-16k-离线的onnx量化导出版本,可以直接用来做生产部署,一键部署教程(点击此处) 模型转换及测试脚本 测试数据:https:/

声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论