腾讯中文hubert base模型

我要开发同款
匿名用户2024年07月31日
60阅读

技术信息

开源地址
https://modelscope.cn/models/innnky/chinese-hubert-base-tencent
授权协议
mit

作品详情

Pretraied o 10k hours WeetSpeech L subset. More details i TecetGameMate/chiesespeechpretrai

This model does ot have a tokeizer as it was pretraied o audio aloe. I order to use this model speech recogitio, a tokeizer should be created ad the model should be fie-tued o labeled text data.

pytho package: trasformers==4.16.2

import torch
import torch..fuctioal as F
import soudfile as sf

from trasformers import (
    Wav2Vec2FeatureExtractor,
    HubertModel,
)


model_path=""
wav_path=""

feature_extractor = Wav2Vec2FeatureExtractor.from_pretraied(model_path)
model = HubertModel.from_pretraied(model_path)

# for pretrai: Wav2Vec2ForPreTraiig
# model = Wav2Vec2ForPreTraiig.from_pretraied(model_path)

model = model.to(device)
model = model.half()
model.eval()

wav, sr = sf.read(wav_path)
iput_values = feature_extractor(wav, retur_tesors="pt").iput_values
iput_values = iput_values.half()
iput_values = iput_values.to(device)

with torch.o_grad():
    outputs = model(iput_values)
    last_hidde_state = outputs.last_hidde_state

功能介绍

Pretrained on 10k hours WenetSpeech L subset. More details in TencentGameMate/chinesespeechpretrain

声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论