Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization

Nagata, Yoshifumi; Mitsubori, Kenji; Kagi, Takahiko; Fujioka, Toyota; Abe, Masato

doi:10.1109/TASL.2006.872622

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスリンク

インデックスツリー

アイテム

{"_buckets": {"deposit": "d9e66ff2-7930-40fc-a64c-fbf55d861d2f"}, "_deposit": {"created_by": 3, "id": "9674", "owners": [3], "pid": {"revision_id": 0, "type": "depid", "value": "9674"}, "status": "published"}, "_oai": {"id": "oai:iwate-u.repo.nii.ac.jp:00009674", "sets": ["1519"]}, "author_link": ["74007", "74008", "74006", "74004", "74005"], "item_16_biblio_info_7": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2006-01-01", "bibliographicIssueDateType": "Issued"}, "bibliographicIssueNumber": "6", "bibliographicPageEnd": "2097", "bibliographicPageStart": "2086", "bibliographicVolumeNumber": "14", "bibliographic_titles": [{"bibliographic_title": "IEEE Transactions on Audio Speech and Language Processing"}]}]}, "item_16_date_6": {"attribute_name": "登録日", "attribute_value_mlt": [{"subitem_date_issued_datetime": "2008-12-16"}]}, "item_16_description_12": {"attribute_name": "Abstract", "attribute_value_mlt": [{"subitem_description": "We propose a new method for implementing Karhunen–Loeve transform (KLT)-based speech enhancement to exploit vector quantization (VQ). The method is suitable for real-time processing. The proposed method consists of a VQ learning stage and a filtering stage. In the VQ learning stage, the autocorrelation vectors comprising the first$K$elements of the autocorrelation function are extracted from learning data. The autocorrelation vectors are used as codewords in the VQ codebook. Next, the KLT bases that correspond to all the codeword vectors are estimated through eigendecomposition (ED) of the empirical Toeplitz covariance matrices constructed from the codeword vectors. In the filtering stage, the autocorrelation vectors that are estimated from the input signal are compared to the codewords. The nearest one is chosen in each frame. The precomputed KLT bases corresponding to the chosen codeword are used for filtering instead of performing ED, which is computationally intensive. Speech quality evaluation using objective measures shows that the proposed method is comparable to a conventional KLT-based method that performs ED in the filtering process. Results of subjective tests also support this result. In addition, processing time is reduced to about 1/66 that of the conventional method in the case where a frame length of 120 points is used. This complexity reduction is attained after the computational cost in the learning stage and a corresponding increase in the associated memory requirement. Nevertheless, these results demonstrate that the proposed method reduces computational complexity while maintaining the speech quality of the KLT-based speech enhancement.", "subitem_description_type": "Other"}]}, "item_16_publisher_14": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "IEEE"}]}, "item_16_relation_26": {"attribute_name": "DOI", "attribute_value_mlt": [{"subitem_relation_type": "isIdenticalTo", "subitem_relation_type_id": {"subitem_relation_type_id_text": "10.1109/TASL.2006.872622", "subitem_relation_type_select": "DOI"}}]}, "item_16_rights_18": {"attribute_name": "権利", "attribute_value_mlt": [{"subitem_rights": "© 2006 IEEE"}]}, "item_16_source_id_9": {"attribute_name": "ISSN", "attribute_value_mlt": [{"subitem_source_identifier": "1558-7916", "subitem_source_identifier_type": "ISSN"}]}, "item_16_text_4": {"attribute_name": "著者(機関)", "attribute_value_mlt": [{"subitem_text_value": "Department of Computer and Information Sciences, Iwate University"}]}, "item_16_version_type_27": {"attribute_name": "著者版フラグ", "attribute_value_mlt": [{"subitem_version_resource": "http://purl.org/coar/version/c_970fb48d4fbd8a85", "subitem_version_type": "VoR"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "Nagata, Yoshifumi"}], "nameIdentifiers": [{"nameIdentifier": "74004", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Mitsubori, Kenji"}], "nameIdentifiers": [{"nameIdentifier": "74005", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Kagi, Takahiko"}], "nameIdentifiers": [{"nameIdentifier": "74006", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Fujioka, Toyota"}], "nameIdentifiers": [{"nameIdentifier": "74007", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Abe, Masato"}], "nameIdentifiers": [{"nameIdentifier": "74008", "nameIdentifierScheme": "WEKO"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2016-11-14"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "itaslp-v14n6p2086-2097.pdf", "filesize": [{"value": "935.6 kB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 935600.0, "url": {"label": "itaslp-v14n6p2086-2097.pdf", "url": "https://iwate-u.repo.nii.ac.jp/record/9674/files/itaslp-v14n6p2086-2097.pdf"}, "version_id": "4accb943-215a-471a-aa1a-1cbbe6beb224"}]}, "item_keyword": {"attribute_name": "キーワード", "attribute_value_mlt": [{"subitem_subject": "Complexity", "subitem_subject_scheme": "Other"}, {"subitem_subject": "Karhunen-Loeve transform(KLT)", "subitem_subject_scheme": "Other"}, {"subitem_subject": "speech enhancement", "subitem_subject_scheme": "Other"}, {"subitem_subject": "subspace", "subitem_subject_scheme": "Other"}, {"subitem_subject": "vector quantization", "subitem_subject_scheme": "Other"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "journal article", "resourceuri": "http://purl.org/coar/resource_type/c_6501"}]}, "item_title": "Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization"}]}, "item_type_id": "16", "owner": "3", "path": ["1519"], "permalink_uri": "https://iwate-u.repo.nii.ac.jp/records/9674", "pubdate": {"attribute_name": "公開日", "attribute_value": "2008-12-16"}, "publish_date": "2008-12-16", "publish_status": "0", "recid": "9674", "relation": {}, "relation_version_is_last": true, "title": ["Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization"], "weko_shared_id": -1}

Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization

https://iwate-u.repo.nii.ac.jp/records/9674

名前 / ファイル	ライセンス	アクション
itaslp-v14n6p2086-2097.pdf (935.6 kB)

Item type

学術雑誌論文 / Journal Article(1)

公開日

2008-12-16

タイトル

Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization

キーワード

主題Scheme

Other

主題

Complexity

キーワード

主題Scheme

Other

主題

Karhunen-Loeve transform(KLT)

キーワード

主題Scheme

Other

主題

speech enhancement

キーワード

主題Scheme

Other

主題

subspace

キーワード

主題Scheme

Other

主題

vector quantization

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者

Abe, Masato

著者(機関)

Department of Computer and Information Sciences, Iwate University

登録日

日付

2008-12-16

書誌情報

IEEE Transactions on Audio Speech and Language Processing

巻 14, 号 6, p. 2086-2097, 発行日 2006-01-01

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1558-7916

Abstract

内容記述タイプ

Other

内容記述

We propose a new method for implementing Karhunen–Loeve transform (KLT)-based speech enhancement to exploit vector quantization (VQ). The method is suitable for real-time processing. The proposed method consists of a VQ learning stage and a filtering stage. In the VQ learning stage, the autocorrelation vectors comprising the first$K$elements of the autocorrelation function are extracted from learning data. The autocorrelation vectors are used as codewords in the VQ codebook. Next, the KLT bases that correspond to all the codeword vectors are estimated through eigendecomposition (ED) of the empirical Toeplitz covariance matrices constructed from the codeword vectors. In the filtering stage, the autocorrelation vectors that are estimated from the input signal are compared to the codewords. The nearest one is chosen in each frame. The precomputed KLT bases corresponding to the chosen codeword are used for filtering instead of performing ED, which is computationally intensive. Speech quality evaluation using objective measures shows that the proposed method is comparable to a conventional KLT-based method that performs ED in the filtering process. Results of subjective tests also support this result. In addition, processing time is reduced to about 1/66 that of the conventional method in the case where a frame length of 120 points is used. This complexity reduction is attained after the computational cost in the learning stage and a corresponding increase in the associated memory requirement. Nevertheless, these results demonstrate that the proposed method reduces computational complexity while maintaining the speech quality of the KLT-based speech enhancement.

出版者

IEEE

権利

権利情報

DOI

Versions

Ver.1

2023-05-15 14:24:54.131377

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Fast Implementation of KLT-Based Speech Enhancement Using Vector Quantization

× Nagata, Yoshifumi

× Mitsubori, Kenji

× Kagi, Takahiko

× Fujioka, Toyota

× Abe, Masato

Versions

Share

Cite as

エクスポート