International Conferences

Yuseon Choi, Hyeonseung Kim, Jewoo Jun, and Jong Won Shin, “FUN-SSL: Full-band Layer Followed by U-Net with Narrow-band Layers for Multiple Moving Sound Source Localization”, in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 14837-14841, May. 2026. [download]
Jisoo Myoung, Sangwook Han, Kihyuk Kim, and Jong Won Shin, “Short-Segment Speaker Verification with Pre-trained Models and Multi-Resolution Encoder”, in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 18872-18876, May. 2026. [download]
Seonggyu Lee, Sein Cheong, Sangwook Han, Kihyuk Kim and Jong Won Shin, “Speech Enhancement based on cascaded two flows” in Proceedings of Interspeech, pp. 4863-4867, Aug. 2025. [download] [code] [video in english] [video in korean] [post]
Seonggyu Lee, Sein Cheong, Sangwook Han, and Jong Won Shin, “FlowSE: Flow Matching-based Speech Enhancement,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, April. 2025. [video in english] [video in korean] [download] [demo] [code] [link]
Minseung Kim, Sein Cheong, and Jong Won Shin, “DNN-based Parameter Estimation for MVDR Beamforming and Post-filtering,” in Proceedings of Interspeech, pp 3879-3883, Aug. 2023. [download]
Youngdo Ahn, Chengyi Wang, Yu Wu, Jong Won Shin, Shujie Liu, “GRAVO: Learning to Generate Relevant Audio from Visual Features with Noisy Online Videos,” in Proceedings of Interspeech, pp 2743-2747, Aug. 2023. [demo] [download]
Sangwook Han, Youngdo Ahn, Kyeongmuk Kang and Jong Won Shin, “A Study of Joint Framework for Robustness Against Noise on Speaker Verification System” The 14th International Conference on Ubiquitous and Future Networks (ICUFN) WorkShop, pp 179-181, July. 2023. [download]
Sangwook Han, Youngdo Ahn, Kyeongmuk Kang and Jong Won Shin, “Short-segment speaker verification using ECAPA-TDNN with multi-resolution encoder,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp 1-5, June. 2023. [download]
Hyungchan Song, Sanyuan Chen, Zhuo Chen, Yu Wu, Takuya Yoshioka, Min Tang, Jong Won Shin, and Shujie Liu, “Exploring WavLM on Speech Enhancement,” accepted by 2022 IEEE Spoken Language Technology Workshop, pp 451-457, Jan. 2023. [download]
Minseung Kim, Hyungchan Song, Sein Cheong, and Jong Won Shin, “iDeepMMSE: An improved deep learning approach to MMSE speech and noise power spectrum estimation for speech enhancement,” in Proceedings of Interspeech, pp 181-185, Sep. 2022. [download]
Youngdo Ahn, Sung Joo Lee, and Jong Won Shin, “Multi-Corpus Speech Emotion Recognition for Unseen Corpus Using Corpus-Wise Weights in Classification Loss” in Proceedings of Interspeech, pp 131-135, Sep. 2022. [download]
Youngju Cheon, Soojoong Hwang, Sangwook Han, Inseon Jang, and Jong Won Shin, “Coded Speech Enhancement Using Neural Network-Based Vector-Quantized Residual Features,” in Proceedings of Interspeech, pp 1664-1668, Sep. 2021. [download]
Hyungchan Song and Jong Won Shin, “Multiple Sound Source Localization Based on Interchannel Phase Differences in All Frequencies with Spectral Masks,” in Proceedings of Interspeech, pp 671-675, Sep. 2021. [download]
Sangwook Han, Jaeuk Byun, and Jong Won Shin, “Time-Domain Speaker Verification Using Temporal Convolutional Networks,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 6688-6692, Jun. 2021. [download]
Eesung Kim and Jong Won Shin, “DNN-based Emotion Recognition based on Bottleneck Acoustic Features and Lexical Features,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 6720-6724, May 2019. [download]
Kisoo Kwon, Jong Won Shin, Inkyu Choi, Hyung Yong Kim, and Nam Soo Kim, “Incremental Approach to NMF Basis Estimation for Audio Source Separation,” in Proceedings of Asia Pacific Signal and Information Processing Association, Apr. 2016. [download]
Kisoo Kwon, Jong Won Shin, and Nam Soo Kim, “NMF-based source separation utilizing prior knowledge on encoding vector,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 479-483, Mar. 2016. [download]
Kisoo Kwon, Jong Won Shin, Hyung Yong Kim, and Nam Soo Kim, “Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation,” in Proceedings of Interspeech, pp. 1513-1516, Sep. 2015. [download]
Chul Min Lee, Jong Won Shin, and Nam Soo Kim, “DNN-based residual echo suppression,” in Proceedings of Interspeech, pp. 1775-1778, Sep. 2015. [download]
Sukanya Sonowal, Kisoo Kwon, Nam Soo Kim, and Jong Won Shin “A data-driven approach to speech enhancement using Gaussian process,” in Proceedings of Interspeech, pp. 2847-2851, Sep. 2014. [download]
Tae Gyoon Kang, Kisoo Kwon, Jong Won Shin, and Nam Soo Kim, “NMF-based speech enhancement incorporating deep neural network,” in Proceedings of Interspeech, pp. 2843-2846, Sep. 2014. [download]
Kisoo Kwon, Jong Won Shin, Sukanya Sonowal, Inkyu Choi, and Nam Soo Kim, “Speech enhancement combining statistical models and NMF with update of speech and noise bases,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 7103-7107, May. 2014. [download]
Yu Gwang Jin, Jong Won Shin, Chul Min Lee, Soo Hyun Bae, and Nam Soo Kim, “Parametric multichannel noise reduction algorithm utilizing temporal correlations in reverberant environment,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 7099-7102, May. 2014. [download]
Chul Min Lee, Jong Won Shin, Yu Gwang Jin, Jeoung Hun Kim, and Nam Soo Kim, “Crossband filtering for stereophonic acoustic echo suppression,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 1359-1363, May. 2014. [download]
Jong Won Shin, Yu Gwang Jin, Seung Seop Park, and Nam Soo Kim, “Speech reinforcement based on partial masking effect,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 4401-4404, Apr. 2009. [download]
Yu Liu, Kiho Cho, Hwan Sik Yun, Jong Won Shin, and Nam Soo Kim, “DCT based multiple hashing technique for robust audio fingerprinting,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 61-64, Apr. 2009. [download]
Woohyung Lim, Chang Woo Han, Jong Won Shin, and Nam Soo Kim, “Cepstral domain feature compensation based on diagonal approximation,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 4401-4404, Mar. 2008. [download]
Jong Won Shin, Woohyung Lim, Junesig Sung, and Nam Soo Kim, “Speech reinforcement based on partial specific loudness,” in Proceedings of Interspeech, pp. 978-981, Aug. 2007. [download]
Seung Yeol Lee, Jong Won Shin, Hwan Sik Yun, and Nam Soo Kim, “A statistical model based post-filtering algorithm for residual echo suppression,” in Proceedings of Interspeech, pp. 858-861, Aug. 2007. [download]
Seung Seop Park, Jong Won Shin, Jong Kyu Kim, and Nam Soo Kim, “A multiple-model based framework for automatic speech segmentation,” in Proceedings of Interspeech, pp. 82-85, Aug. 2007. [download]
Jong Won Shin, Seung Yeol Lee, Hwan Sik Yun, and Nam Soo Kim, “Speech enhancement based on residual noise shaping, ” in Proceedings of Interspeech, pp. 1415-1418, Sep. 2006. [download]
Seung Seop Park, Jong Won Shin, and Nam Soo Kim, “Automatic speech segmentation with multiple statistical models,” in Proceedings of Interspeech, pp. 2066-2069, Sep. 2006. [download]
Joon-Hyuk Chang, Jong Won Shin, Seung Yeol Lee, and Nam Soo Kim, “A new structural preprocessor for low-bit rate speech coding, ” in Proceedings of Interspeech, pp. 2829-2832, Sep. 2005. [download]
Jong Won Shin, Joon-Hyuk Chang, Hwan Sik Yun, and Nam Soo Kim, “Voice activity detection based on generalized gamma distribution,” in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, pp. 781-784, Mar. 2005. [download]
Jong Won Shin, Joon-Hyuk Chang, and Nam Soo Kim, “Speech probability distribution based on generalized gamma distribution,” in Proceedings of Interspeech, pp. 2477-2480, Oct. 2004. [download]
Joon-Hyuk Chang, Jong Won Shin, and Nam Soo Kim, “Likelihood ratio test with complex Laplacian model for voice activity detection,” In Proceedings of European Conference on Speech Communication and Technology, pp. 1065-1068, Sep. 2003. [download]

Domestic Conferences

나영진, 이성규, 신종원. “멀티모달 대화형 감정인식을 위한 마트료시카 표현 학습과 모달리티별 지식 증류 기법.” 한국통신학회 추계종합학술발표회, 2025년 11월.
명지수, 한상욱, 신종원, “자기 지도 학습 모델의 다계층 특징을 활용한 짧은 발화 화자인식 연구”, 한국통신학회 하계종합학술발표회, 2025년 6월.
최유선, 김현승, 신종원, “실시간 다중이동 음원 정위를 위한 직접 경로 채널 간 위상 차이 추정 모델 경량화 연구”, 한국통신학회 하계종합학술발표회, 2025년 6월.
이상윤, 김현승, 신종원 “소형 장치에 탑재할 수 있는 딥러닝 기반 실시간 음성 향상 모델 개발”, 한국방송⋅미디어공학회 추계학술대회, 2024년 11월.
채종욱, 이은균, 신종원 “Streaming SEANet을 이용한 실시간 음성 코덱 후처리 모델 연구” 한국방송⋅미디어공학회 추계학술대회, 2024년 11월.
안영도, 이성규, 신종원, “라벨스무딩 및 사전학습모델을 이용하여 라벨링 신뢰도를 고려한 음성감정인식,” 한국뇌공학회 하계 워크샵, 2024년 6월.
안영도, 이성규, 신종원, “컨텍스트 감정 레이블을 이용한 대화상황 감정인식”, 한국통신학회 추계종합학술발표회, 2023년 11월.
손주혜, 김현승, 신종원, “실시간 음악 분리를 위한 일대일 추출 모델로부터 통합된 일대다 분리 모델로의 확장 연구”, 한국통신학회 추계종합학술발표회, 2023년 11월.
권영후, 김현승, 정세인, 신종원, “동적 합성곱을 적용한 실시간 음성 향상 모델”, 한국통신학회 추계종합학술발표회, 2023년 11월.
김기혁, 한상욱, 신종원, “지식 증류 기법을 활용한 자기지도 특징 추출 네트워크 기반의 핵심어 검출”, 한국통신학회 추계종합학술발표회, 2023년 11월.
안영도, 이성규, 신종원, “클립 및 제로샷 오디오분리를 이용한 이미지 기반의 오디오 생성,” 한국뇌공학회 하계 워크샵, 2023년 6월.
박정원, 송형찬, 김민승, 신종원, “단채널 위상 인지 음성 향상을 위한 SNR 정보를 활용한 가중 손실 함수 기법”, 한국통신학회 하계종합학술발표회, 2022년 6월.
한상욱, 천영주, 김민승, 신종원, “화자정보 활용을 위한 듀얼 어텐션 기반 화자 검증”, 한국통신학회 동계종합학술발표회, 2022년 2월.
안영도, 한상욱, 이성주, 신종원, “Wav2vec 특징 기반의 한국어 음성감정인식”, 한국통신학회 추계종합학술발표회, 2021년 11월.
한상욱, 변재욱, 최영원, 신종원, “시간 영역 멀티 스케일 인코더를 이용한 화자 검증”, 한국통신학회 추계종합학술발표회, 2021년 11월.
김민승, 김한솔, 박정원, 신종원, “다채널 음성 향상을 위한 FASNET-TAC 알고리즘의 개선 방향 고찰”, 한국통신학회 추계종합학술발표회, 2021년 11월.
이지운, 정세인, 신종원, “손실 함수와 후처리 기법을 이용한 실시간 음성 향상 모델의 성능 개선 연구”, 한국통신학회 추계종합학술발표회, 2021년 11월.
강경묵, 송형찬, 이은균, 신종원, “심층 신경망 기반 단채널 음성 향상을 위한 SNR에 따른 손실 함수 강조 기법”, 한국통신학회 추계종합학술발표회, 2021년 11월.
송형찬, 김현승, 오진우, 변재욱, 신종원, “End-to-End 기반 다채널 음성분리를 위한 SI-SNR 손실 함수 변조 연구”, 한국통신학회 추계종합학술발표회, 2021년 11월.
윤상휴, 변재욱, 김현승, 신종원, “시간 영역 다채널 음성 분리를 위한 향상된 마스크 추정”, 한국통신학회 추계종합학술발표회, 2020년 11월.
유정화, 김한솔, 윤상휴, 신종원, “캡슐 네트워크를 이용한 음성 스펙트로그램에서의 감정 인식”, 한국통신학회 하계종합학술발표회, 2019년 6월.
안영도, 송형찬, 한상욱, 천영주, 신종원, “CycleGAN 기반의 데이터 생성을 이용한 음성감정인식”, 한국통신학회 하계종합학술발표회, 2019년 6월.
김민승, 변재욱, 김현승, 신종원, “음향학적 반향 제거를 위한 IP-INLMS 알고리즘”, 한국통신학회 하계종합학술발표회, 2019년 6월.
정세인, 박준형, 신종원, “반향 추정치를 이용한 Soft Decision 기반 음향학적 잔여 반향 제거 알고리즘”, 한국통신학회 하계종합학술발표회, 2019년 6월.
송형찬, 신종원, “음성존재확률을 고려한 GSC 빔포머 기반 다채널 음성향상 연구”, 한국통신학회 하계종합학술발표회, 2019년 6월.
김민승, 신종원, “주파수 영역 칼만 필터 기반의 음향학적 반향 제거기를 위한 잡음 공분산 추정 기법”, 한국통신학회 추계종합학술발표회, 2018년 11월.
김한솔, 김의성, 김현승, 신종원, “살인사건의 평균 형량을 고려한 심층신경망 기반 형량 예측 연구”, 한국통신학회 하계종합학술발표회, 2018년 6월.
송형찬, 김민승, 이은균, 유정화, 신종원, “심층 신경망 기반 음성향상을 위한 SNR 정보를 이용한 하모닉 기반 후처리 기법”, 한국통신학회 하계종합학술발표회, 2018년 6월.
박준형, 정세인, 안영도, 신종원, “채널간 위상차 재조정을 통한 방향각 추정 기법”, 한국통신학회 하계종합학술발표회, 2018년 6월.
김의성, 김한솔, 변재욱, 신종원, “가우시안 혼합 모델 기반 심층신경망 병목특징을 이용한 음성감정인식”, 한국통신학회 하계종합학술발표회, 2018년 6월.
변재욱, 김한솔, 김의성, 신종원, “투영 경사법을 이용한 희소 비음수 행렬 인수분해 알고리즘”, 한국통신학회 하계종합학술발표회, pp. 938-939, 2017년 6월.
김한솔, 송형찬, 고석갑, 이병탁, 신종원, “RNN-LSTM 기반 공휴일 정보를 고려한 단기 전력수요예측”, 대한전자공학회 추계학술대회, pp. 552-555, 2016년 11월.
박준형, 황수중, 변재욱, 신종원, “장기 평균 음성 스펙트럼을 활용한 잡음제거 기법,” 한국통신학회 하계종합학술발표회, pp. 1255-1256, 2016년 6월.
신종원, 진유광, 한창우, 장준혁, 김남수, “일반화된 감마 분포 모델을 이용한 음성 검출기”, 한국통신학회 검출추정이론연구회 하계워크샵 논문집, pp. 41-42, 2008년 8월.
신종원, 권혁진, 진석호, 김남수, “최대조건부사후확률을 이용한 음성 검출기,” 한국음향학회 학술발표대회 논문집, 제26권, 2호. 2007년 11월.
신종원, 성준식, 조기호, 김남수, “부분 마스킹 효과를 도입한 음성 강화 방법,” 음성 통신 및 신호처리 학술대회 논문집, 제24권, 1호, 2007년 8월.
이승열, 신종원, 윤환식, 김남수, “통계적 모델기반의 잔여 반향 제거 기법 연구,” 음성 통신 및 신호처리 학술대회 논문집, 제24권, 1호, 2007년 8월.
박승섭, 김종규, 신종원, 김남수, “다수의 모델들을 이용한 자동음소분할,” 음성 통신 및 신호처리 학술대회 논문집, 제24권, 1호, 2007년 8월.
신종원, 한창우, 이승열, 장준혁, 김남수, “지각적으로 편안한 잔여 잡음의 형성을 기반으로 한 음성 향상,” 음성 통신 및 신호처리 학술대회 논문집, 제23권, 1호, 2006년 8월.
장준혁, 신종원, 김남수, “복소 라플라시안 모델을 이용한 음성검출기,” 한국음향학회 학술발표대회 논문집, 제22권, 1호, 2003년 7월.