Skip navigation
  • 中文
  • English

DSpace CRIS

  • DSpace logo
  • Home
  • Organizations
  • Researchers
  • Research Outputs
  • Projects
  • Explore by
    • Organizations
    • Researchers
    • Research Outputs
    • Projects
  • Academic & Publications
  • Sign in
  • 中文
  • English
  1. Scholars Hub of the Academia Sinica
Academia Sinica / Division of Mathematics and Physical Sciences / Institute of Information Science

Wang, Hsin-Min

Network Lab View Statistics Email Alert RSS Feed

  • Resume
  • Publications 79
  • Project

Publications
  • All
  • Articles
  • Conference Papers

By Author

  • 4 ryandhimas e. zezario
  • 4 ryandhimas edo zezario
  • 4 shang-bao luo
  • 4 tassadaq hussain
  • 4 yao-fei cheng
  • 3 chao-chun liang
  • 3 chen-chou lo
  • 3 chia-hua wu
  • 3 fei chen
  • 3 hung-yi lee
  • . < previous next >

By Issue Date

  • 50 2020 - 2024
  • 28 2010 - 2019
  • 1 2000 - 2009

By Type

  • 79 學術會議(研討會)論文/conference paper

Fulltext

  • 79 no fulltext


Results 1-79 of 79 (Search time: 0.001 seconds).

Issue DateTitleAuthor(s)RelationscopusWOSFulltext/Archive link
12017Wavelet Speech Enhancement Based on Robust Principal Component AnalysisChia-Lung Wu; Hsiang-Ping Hsu; Syu-Siang Wang; Jeih-Weih Hung; Ying-Hui Lai; Hsin-Min Wang ; Yu Tsao 
22017Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial NetworksChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
32016Voice Conversion from Non-parallel Corpora Using Variational Auto-encoderChin-Cheng Hsu; Hsin-Te Hwang; Yi-Chiao Wu; Yu Tsao ; Hsin-Min Wang 
42020Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech RecognitionPin-Yuan Chen; Chia-Hua Wu; Hung-Shin Lee; Shao-Kang Tsao; Ming-Tat Ko ; Hsin-Min Wang 
52021Time Alignment Using Lip Images for Frame-Based Electrolaryngeal Voice ConversionYi-Syuan Liou; Wen-Chin Huang; Ming-Chi Yen; Shu-Wei Tsai; Yu-Huai Peng; Tomoki Toda; Yu Tsao ; Hsin-Min Wang 
62023The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple DomainsErica Cooper; Wen-Chin Huang; Yu Tsao; Hsin-Min Wang ; Tomoki Toda; Junichi Yamagishi
72022The VoiceMOS Challenge 2022Wen Chin Huang; Erica Cooper; Yu Tsao; Hsin-Min Wang ; Tomoki Toda; Junichi Yamagishi
82020The Academia Sinica Systems of Voice Conversion for VCC2020Yu-Huai Peng; Cheng-Hung Hu; Alexander Kang; Hung-Shin Lee; Pin-Yuan Chen; Yu Tsao; Hsin-Min Wang 
92020The Academia Sinica Systems of Speech Recognition and Speaker Diarization for the CHiME-6 ChallengeHung-Shin Lee; Yu-Huai Peng; Pin-Tuan Huang; Ying-Chun Tseng; Chia-Hua Wu; Yu Tsao; Hsin-Min Wang 
102021SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise ContoursYi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
112020STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment ModelRyandhimas E. Zezario; Szu-Wei Fu; Chiou-Shann Fuh; Yu Tsao; Hsin-Min Wang 
122020Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker VerificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
132019Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural NetworksShang-Bao Luo; Hung-Shin Lee; Kuan-Yu Chen; Hsin-Min Wang 
142024SpeechCLIP : Self-supervised multi-task representation learning for speech via CLIP and speech-image dataHsuan-Fu Wang; Yi-Jen Shih; Heng-Jui Chang; Layne Berry; Puyuan Peng; Hung-yi Lee; Hsin-Min Wang ; David Harwath
152022Speech-enhanced and Noise-aware Networks for Robust Speech RecognitionHung-Shin Lee; Pin-Yuan Chen; Yao-Fei Cheng; Yu Tsao ; Hsin-Min Wang 
162021Speech Recognition by Simply Fine-Tuning BERTWen-Chin Huang; Chia-Hua Wu; Shang-Bao Luo; Kuan-Yu Chen; Hsin-Min Wang ; Tomoki Toda
172022Speech Enhancement-Assisted Voice Conversion in Noisy EnvironmentsYun-Ju Chan; Chiang-Jen Peng; Syu-Siang Wang; Hsin-Min Wang ; Yu Tsao; Tai-Shih Chi
182021Speech Enhancement with Zero-Shot Model SelectionRyandhimas E. Zezario; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao 
192019Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment MetricRyandhimas Edo Zezario; Szu-wei Fu; Xugang Lu; Hsin-Min Wang ; Yu Tsao
202020SERIL: Noise Adaptive Speech Enhancement using Regularization-based Incremental LearningChi-Chang Lee; Yu-Chen Lin; Hsuan-Tien Lin; Hsin-Min Wang ; Yu Tsao
212019Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Ming-Hsiang Su; Hsin-Min Wang 
222021Sequence to General Tree: Knowledge-Guided Geometry Word Problem SolvingShih-hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
232020Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech EnhancementRyandhimas Edo Zezario; Tassadaq Hussain; Xugang Lu; Hsin-Min Wang ; Yu Tsao
242018SeeTheVoice: Learning from Music to Visual Storytelling of ShotsWen-Li Wei; Jen-Chun Lin; Tyng-Luh Liu ; Yi-Hsuan Yang ; Hsin-Min Wang ; Hsiao-Rong Tyan; Hong-Yuan Mark Liao 
252021Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN VocoderYi-Chiao Wu; Cheng-Hung Hu; Hung-Shin Lee; Yu-Huai Peng; Wen-Chin Huang; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
262019Reinforcement Learning based Speech Enhancement for Robust Speech RecognitionYih-Liang Shen; Chao-Yuan Huang; Syu-Siang Wang; Yu Tsao; Hsin-Min Wang ; Tai-Shih Chi
272022Partially Fake Audio Detection by Self-attention-based Fake Span DiscoveryHaibin Wu; Heng-Cheng Kuo; Naijun Zheng; Kuo-Hsuan Hung; Hung-yi Lee; Yu Tsao ; Hsin-Min Wang ; Helen Meng
282019Noise Adaptive Speech Enhancement using Domain Adversarial TrainingChien-Feng Liao; Yu Tsao; Hung-Yi Lee; Hsin-Min Wang 
292022Multimodal Forgery Detection Using Ensemble LearningAmmarah Hashmi; Sahibzada Adil Shahzad; Chia-Wen Lin; Yu Tsao; Hsin-Min Wang 
302024Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment ModelRyandhimas Zezario; Bo-Ren Brian Bai; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao
312019Multi-task Learning for Mandarin Acoustic Modeling Using Articulatory AttributesYueh-Ting Lee; Xuan-Bo Chen; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
322022MTI-Net: A Multi-Target Speech Intelligibility Prediction ModelRyandhimas Edo Zezario; Szu-wei Fu; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao
332019MOSNet: Deep Learning based Objective Assessment for Voice ConversionChen-Chou Lo; Szu-Wei Fu; Wen-Chin Huang; Xin Wang; Junichi Yamagishi; Yu Tsao; Hsin-Min Wang 
342021MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation AccelerationY.-T. Chang; Y.-H. Yang; Y.-H. Peng; S.-S. Wang; T.-S. Chi; Y. Tsao ; H.-M. Wang 
352021Mining Commonsense and Domain Knowledge from Math Word ProblemsShih-Hung Tsai; Chao-Chun Liang; Hsin-Min Wang ; Keh-Yih Su 
362016Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity EvaluationHung-Shin Lee; Yu Tsao ; Chi-Chun Lee; Hsin-Min Wang ; Wei-Cheng Lin; Wei-Chen Chen; Shan-Wen Hsiao; Shyh-Kang Jeng
372021Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs SamplingChung-En Sun; Yi-Wei Chen; Hung-Shin Lee; Yen-Hsing Chen; Hsin-Min Wang 
382022MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing AidsRyandhimas Edo Zezario; Fei Chen; Chiou-Shann Fuh; Hsin-Min Wang ; Yu Tsao
392022Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GANYin-Ping Cho; Yu Tsao; Hsin-Min Wang ; Yi-Wen Liu
402019Mandarin Mispronunciation Detection and Diagnosis Feedback Using Articulatory Attributes Based Multi-task LearningXuan-Bo Chen; Yueh-Ting Lee; Hung-Shin Lee; Jyh-Shing Roger Jang; Hsin-Min Wang 
412021Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence ModelingMing-Chi Yen; Wen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Shu-Wei Tsai; Yu Tsao ; Tomoki Toda; Jyh-Shing Jang; Hsin-Min Wang 
422016Locally Linear Embedding for Exemplar-Based Spectral ConversionYi-Chiao Wu; Hsin-Te Hwang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
432020Lite Audio-Visual Speech EnhancementShang-Yi Chuang; Yu Tsao; Chen-Chou Lo; Hsin-Min Wang 
442022Lip Sync Matters: A Novel Multimodal Forgery DetectorSahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang 
452023LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification ModelsChi-Chang Lee; Hong-Wei Chen; Chu-Song Chen ; Hsin-Min Wang ; Tsung-Te Liu; Yu Tsao
462020Joint Training of Guided Learning and Mean Teacher Models for Sound Event DetectionHao Yen; Pin-Jui Ku; Ming-Chi Yen; Hung-Shin Lee; Hsin-Min Wang 
472019Investigation of Neural Network Approaches for Unified Spectral and Prosodic Feature EnhancementWei-Cheng Lin; Yu Tsao; Hsin-Min Wang ; Fei Chen
482019Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Chen-Chou Lo; Patrick Lumban Tobing; Tomoki Hayashi; Kazuhiro Kobayashi; Tomoki Toda; Yu Tsao; Hsin-Min Wang 
492021Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy ConditionsMd Mahbub E Noor|; Yen-Ju Lu; Syu-Siang Wang; Supratip Ghose; Chia-Yu Chang; Ryandhimas E. Zezario; Shafique Ahmed; Wei-Ho Chung; Yu Tsao ; Hsin-Min Wang 
502019Influences of Prosodic Feature Replacement on the Perceived Singing Voice IdentityKuan-Yi Kang; Yi-Wen Liu; Hsin-Min Wang 
512019Improving Automatic Jazz Melody Generation by Transfer Learning TechniquesHsiao-Tzu Hung; Chung-Yang Wang; Yi-Hsuan Yang ; Hsin-Min Wang 
522021Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel AttentionQian-Bei Hong; Chung-Hsien Wu; Thanh Binh Nguyen; Hsin-Min Wang 
532021HASA-NET: A Non-Intrusive Hearing-Aid Speech Assessment NetworkHsin-Tien Chiang; Yi-Chiao Wu; Cheng Yu; Tomoki Toda; Hsin-Min Wang ; Yih-Chun Hu; Yu Tsao 
542021Generation of Speaker Representations Using Heterogeneous Training Batch AssemblyYu-Huai Peng; Hung-Shin Lee; Pin-Tuan Huang; Hsin-Min Wang 
552019Generalization of Spectrum Differential based Direct Waveform Modification for Voice ConversionWen-Chin Huang; Yi-Chiao Wu; Kazuhiro Kobayashi; Yu-Huai Peng; Hsin-Te Hwang; Patrick Lumban Tobing; Yu Tsao; Hsin-Min Wang ; Tomoki Toda
562022Filter-based Discriminative Autoencoders for Children Speech RecognitionChiang-Lin Tai; Hung-Shin Lee; Yu Tsao ; Hsin-Min Wang 
572019Exploring the Encoder Layers of Discriminative Autoencoders for LVCSRPin-Tuan Huang; Hung-Shin Lee; Syu-Siang Wang; Kuan-Yu Chen; Yu Tsao; Hsin-Min Wang 
582022EMGSE: Acoustic/EMG Fusion for Multimodal Speech EnhancementKuan-Chen Wang; Kai-Chun Liu; Hsin-Min Wang ; Yu Tsao 
592021Dual-Path Filter Network: Speaker-Aware Modeling for Speech SeparationFan-Lin Wang; Yu-Huai Peng; Hung-Shin Lee; Hsin-Min Wang 
602022Disentangling the Impacts of Language and Channel Variability on Speech Separation NetworksFan-Lin Wang; Hung-Shin Lee; Yu Tsao; Hsin-Min Wang 
612017Discriminative Autoencoders for Speaker VerificationHung-Shin Lee; Yu-Ding Lu; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang ; Shyh-Kang Jeng
622017Discriminative Autoencoders for Acoustic ModelingMing-Han Yang; Hung-Shin Lee; Yu-Ding Lu; Kuan-Yu Chen; Yu Tsao ; Berlin Chen; Hsin-Min Wang 
632022Detecting Replay Attacks Using Single-Channel Audio: The Temporal Autocorrelation of SpeechShih-Kuang Lee; Yu Tsao; Hsin-Min Wang 
642019Compressed Multimodel Hierarchical Extreme Learning Machine for Speech EnhancementTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
652001Comparative Analysis for Data-Driven Temporal Filters Obtained via Principal Component AnalysisHung, Jeih-weih; Wang, Hsin-min ; Lee, Lin-shan
662020Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker IdentificationQian-Bei Hong; Chung-Hsien Wu; Hsin-Min Wang ; Chien-Lin Huang
672022Chinese Movie Dialogue Question Answering DatasetShang-Bao Luo; Hsin-Min Wang ; Kuan-Yu Chen; Keh-Yih Su ; Yu Tsao ; Cheng-Chung Fan
682022Chain-based Discriminative Autoencoders for Speech RecognitionHung-Shin Lee; Pin-Tuan Huang; Yao-Fei Cheng; Hsin-Min Wang 
692019Bone-conducted Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Sabato Marco Siniscalchi; Jia-Ching Wang; Hsin-Min Wang ; Wen-Hung Liao
702019Audio-Visual Speech Enhancement using Hierarchical Extreme Learning MachineTassadaq Hussain; Yu Tsao; Hsin-Min Wang ; Jia-Ching Wang; Sabato Marco Siniscalchi; Wen-Hung Liao
712016Audio-Visual Speech Enhancement using Deep Neural NetworksJen-Cheng Hou; Syu-Siang Wang; Ying-Hui Lai; Jen-Chun Lin; Yu Tsao ; Hsiu-Wen Chang; Hsin-Min Wang 
722023Audio-Visual Mandarin Electrolaryngeal Speech Voice ConversionYung-Lun Chien; Hsin-Hao Chen; Ming-Chi Yen; Shu-Wei Tsai; Hsin-Min Wang ; Yu Tsao; Tai-Shih Chi
732021AlloST: Low-resource Speech Translation without Source TranscriptionYao-Fei Cheng; Hung-Shin Lee; Hsin-Min Wang 
742023A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean SpeechLi-Wei Chen; Yao-Fei Cheng; Hung-Shin Lee; Yu Tsao; Hsin-Min Wang 
752024A Study on Incorporating Whisper for Robust Speech AssessmentRyandhimas E. Zezario; Yu-Wen Chen; Szu-Wei Fu; Yu Tsao; Hsin-Min Wang ; Chiou-Shann Fuh
762021A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice ConversionWen-Chin Huang; Kazuhiro Kobayashi; Yu-Huai Peng; Ching-Feng Liu; Yu Tsao ; Hsin-Min Wang ; Tomoki Toda
772017A Post-filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Yu Tsao ; Hsin-Min Wang 
782017A Locally Linear Embbeding Based Postfiltering Approach for Speech EnhancementYi-Chiao Wu; Hsin-Te Hwang; Syu-Siang Wang; Chin-Cheng Hsu; Ying-Hui Lai; Yu Tsao ; Hsin-Min Wang 
792021A Flexible and Extensible Framework for Multiple Answer Modes Question AnsweringCheng-Chung Fan; Keh-Yih Su ; Kuan-Yu Chen; Yu Tsao ; Jia-Zhi Guo; Shang-Bao Luo; Pei-Jun Liao; Kuang-Yu Chang; Chiao-Wei Hsu; Meng-Tse Wu; Shih-Hong Tsai; Tzu-Man Wu; Aleksandra Smolka; Chao-Chun Liang; Hsin-Min Wang 

 

Claim Researcher Page

Contact via feedback form

If you want contact administrator site clicking the follow button 
Explore by
  • Academic & Publications
  • Organizations
  • Researchers
  • Research Outputs
  • Projects
Build with DSpace-CRIS - Extension maintained and optimized by Logo 4SCIENCE Feedback