找回密碼
 To register

QQ登錄

只需一步,快速開(kāi)始

掃一掃,訪問(wèn)微社區(qū)

打印 上一主題 下一主題

Titlebook: Advances in Speech and Language Technologies for Iberian Languages; IberSPEECH 2014 Conf Juan Luis Navarro Mesa,Alfonso Ortega,Doroteo T. T

[復(fù)制鏈接]
樓主: Causalgia
41#
發(fā)表于 2025-3-28 17:04:50 | 只看該作者
https://doi.org/10.1007/978-3-319-48354-2across the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
42#
發(fā)表于 2025-3-28 19:47:46 | 只看該作者
43#
發(fā)表于 2025-3-29 00:12:10 | 只看該作者
Xiaobin Qiu,Hongqian Chen,Nan Zhoud. All processing is done locally on the phone, which is able to react in real-time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
44#
發(fā)表于 2025-3-29 06:08:03 | 只看該作者
Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarizationacross the AHC iterations is done using reference speaker ground-truth labels to select the purer clustering as input for the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
45#
發(fā)表于 2025-3-29 11:14:38 | 只看該作者
CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shapedium-vocabulary German database for microphone array made of embedded clean signals contaminated with real room impulsive responses and mixed in a ‘natural’ way with real noises. We show that the proposed enhancement framework performs better than other related systems on the presented database.
46#
發(fā)表于 2025-3-29 13:35:42 | 只看該作者
Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warpingd. All processing is done locally on the phone, which is able to react in real-time to incoming keywords. In this paper we describe the application, review the matching algorithm we used and show experimentally that it successfully reacts to voice commands in a variety of acoustic conditions.
47#
發(fā)表于 2025-3-29 19:37:59 | 只看該作者
Statistical Text-to-Speech Synthesis of Spanish Subtitlesthe best of our knowledge, this is the first time that a DNN-based TTS system has been implemented for the synthesis of Spanish. A comparative objective evaluation between both models has been carried out. Our results show that DNN-based systems can reconstruct speech waveforms more accurately.
48#
發(fā)表于 2025-3-29 20:50:04 | 只看該作者
Unsupervised Training of PLDA with Variational Bayesre latent variables. We experimented on unlabeled NIST SRE data. The trained models were evaluated on NIST SRE10. Compared to cosine distance, unsupervised PLDA improved EER by 28% and minimum DCF by 36%.
49#
發(fā)表于 2025-3-30 02:32:30 | 只看該作者
50#
發(fā)表于 2025-3-30 06:08:38 | 只看該作者
 關(guān)于派博傳思  派博傳思旗下網(wǎng)站  友情鏈接
派博傳思介紹 公司地理位置 論文服務(wù)流程 影響因子官網(wǎng) 吾愛(ài)論文網(wǎng) 大講堂 北京大學(xué) Oxford Uni. Harvard Uni.
發(fā)展歷史沿革 期刊點(diǎn)評(píng) 投稿經(jīng)驗(yàn)總結(jié) SCIENCEGARD IMPACTFACTOR 派博系數(shù) 清華大學(xué) Yale Uni. Stanford Uni.
QQ|Archiver|手機(jī)版|小黑屋| 派博傳思國(guó)際 ( 京公網(wǎng)安備110108008328) GMT+8, 2025-10-11 17:14
Copyright © 2001-2015 派博傳思   京公網(wǎng)安備110108008328 版權(quán)所有 All rights reserved
快速回復(fù) 返回頂部 返回列表
丰县| 鄱阳县| 错那县| 沙坪坝区| 洛扎县| 水富县| 平乡县| 梨树县| 沁水县| 陇西县| 蒲城县| 寿宁县| 麦盖提县| 信阳市| 黄陵县| 临澧县| 盐山县| 神农架林区| 扶余县| 鹰潭市| 治多县| 克东县| 桂林市| 伊通| 仙居县| 洪雅县| 迁西县| 高安市| 西畴县| 旬邑县| 苏州市| 平乡县| 中西区| 界首市| 德州市| 河池市| 华蓥市| 工布江达县| 板桥市| 忻州市| 句容市|