cv | Shota Horiguchi

Education

2022.4 - 2023.3
Ph.D., Computer Science

University of Tsukuba, Ibaraki, Japan
- Supervisor: Prof. Takeshi Yamada
- Thesis title: Study on overlap-aware speaker diarization and its applications
  - Thesis
  - Slides
2015.4 - 2017.3
M.E., Information Science and Technology

The University of Tokyo, Tokyo, Japan
- Supervisor: Prof. Kiyoharu Aizawa
- Thesis title: Personalized Object Recognition
2011.4 - 2015.3
B.E., Information and Communication Engineering

The University of Tokyo, Tokyo, Japan
- Supervisor: Prof. Kiyoharu Aizawa

Work Experience

2024.2 - Present
Research Specialist

NTT, Inc. (formerly NTT Corporation), Human Informatics Laboratories
- Research topic
  - Speech technology
2021.10 - 2024.1
Senior Researcher

Hitachi, Ltd. Research & Development Group
- Team leader (2021.10-2022.9, 2023.4-2024.1)
- Tech lead (2022.10-2023.3)
- Research topic
  - Speaker diarization (2021-2023)
  - Streaming active learning (2022-2024)
2017.4 - 2021.9
Researcher

Hitachi, Ltd. Research & Development Group
- Research topic
  - Multimodal environmental recognition for human-robot interaction (2017-2019)
  - Meeting transcription using distributed microphones (2019-2021)
  - Speaker diarization (2019-2021)

Honors and Awards

2025
IEEE SPS Young Author Best Paper Award
- For the paper entitled "Encoder-Decoder Based Attractors for End-to-End Neural Diarization" published in TASLP.
2025
IEEE SPS Japan Young Author Best Paper Award
- For the paper entitled "Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors" published in TASLP.
2024
Honorable Mention Award at IEEE Spoken Language Technology Workshop (SLT) 2024
- For the paper entitled "Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings" presented at SLT 2024.
- Award certificate
2024
2nd prize in The 8th CHiME Speech Separation and Recognition Challenge (CHiME-8) Task 1
- As the NTT team
- Technical report / Slides / Poster
- In charge of preparation of simulated mixtures and pretraining of EEND-VC
2023
Itakura Prize Innovative Young Researcher Award, The Acoustical Society of Japan
- For the research on overlap-aware speaker diarization for unknown numbers of speakers
2021
2nd prize in The Third DIHARD Speech Diarization Challenge (DIHARD III)
- As the Hitachi-JHU team
- Technical report / Slides
- In charge of EEND-EDA, EEND as post-processing, and system ensemble
2018
2nd prize in The 5th CHiME Speech Separation and Recognition Challenge (CHiME-5)
- As the Hitachi-JHU team
- Technical report / Slides
- In charge of speech separation and server cooling 🌬️
2017
Outstanding Research Presentation Award, The Institute of Image Information and Television Engineers
- For the presentation at PRMU, Feb. 2017

Invited Talks

2023.8
Speaker Diarization: A Key to Solving Cocktail Party Problem
- Speech-based Communication for Robots and Systems (IEEE RO-MAN 2023 Workshop)
2019.7
Face-Voice Matching Using Cross-Modal Embeddings (In Japanese)
- The 22nd Meeting on Image Recognition and Understanding (MIRU)

Membership

Institute of Electrical and Electronics Engineers (IEEE)
IEEE Signal Processing Society (SPS)
Acoustical Society of Japan (ASJ): No. 22704
Information Processing Society of Japan (IPSJ)

Academic Services

Session chair / vice chair
- IEEE ICASSP (2022, 2025)
- ISCA Interspeech (2025)
- ASJ Annual Meeting (2024-)
Board member
- IPSJ SIG-SLP (2025.4-)
- ASJ-SP (2025.4-)
- IEICE Technical Committee on Speech (2025.6-)
Organizer
- The Joint Workshop on HSCMA and CHiME 2026 (a satellite workshop of ICASSP 2026)
Meta Reviewer
- ICASSP (2026)
Reviewer

Internship Supervision

Natsuo Yamashita (The University of Tokyo, 2021.8 - 2022.2) @Hitachi, Ltd.
Aoi Ito (Hosei University, 2022.11 - 2024.1) @Hitachi, Ltd.
