Publications

Recent Journal articles

2022
 DVOŘÁKOVÁ Martina, HRADIŠ Michal, ŽABIČKA Petr, KOHÚT Jan, KIŠŠ Martin a BENEŠ Karel. Využití PERO OCR při přepisu rukopisů. Archivní časopis. Praha: Ministerstvo vnitra České republiky, 2022, roč. 72, č. 1, s. 14-27. ISSN 0004-0398.
 EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Spelling-Aware Word-Based End-to-End ASR. IEEE Signal Processing Letters. Piscataway, NJ 08854 USA: IEEE Signal Processing Society, 2022, roč. 29, č. 29, s. 1729-1733. ISSN 1558-2361.
 LANDINI Federico Nicolás, PROFANT Ján, DIEZ Sánchez Mireia a BURGET Lukáš. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks. Computer Speech and Language. Amsterdam: Elsevier Science, 2022, roč. 71, č. 101254, s. 1-16. ISSN 0885-2308.
 ONDEL Lucas Antoine Francois, YUSUF Bolaji, BURGET Lukáš a SARAÇLAR Murat. Non-Parametric Bayesian Subspace Models for Acoustic Unit Discovery. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2022, roč. 30, č. 5, s. 1902-1917. ISSN 2329-9290.
2020
 DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás a ČERNOCKÝ Jan. Analysis of Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2020, roč. 28, č. 1, s. 355-368. ISSN 2329-9290.
 KESIRAJU Santosh, PLCHOT Oldřich, BURGET Lukáš a GANGASHETTY Suryakanth V. Learning Document Embeddings Along With Their Uncertainties. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2020, roč. 2020, č. 28, s. 2319-2332. ISSN 2329-9290.
 KOSIBA Matěj a BURGET Lukáš a kol. Multiwavelength classification of X-ray selected galaxy cluster candidates using convolutional neural networks. Monthly Notices of the Royal Astronomical Society. Oxford: Royal Astronomical Society (RAS), 2020, roč. 496, č. 4, s. 4141-4153. ISSN 1365-2966.
 MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš, ROHDIN Johan A., ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia a ČERNOCKÝ Jan. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. Computer Speech and Language. Amsterdam: Elsevier Science, 2020, roč. 2020, č. 63, s. 1-15. ISSN 0885-2308.
 ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATĚJKA Pavel, BURGET Lukáš a GLEMBEK Ondřej. End-to-end DNN based text-independent speaker recognition for long and short utterances. Computer Speech and Language. Amsterdam: Elsevier Science, 2020, roč. 2020, č. 59, s. 22-35. ISSN 0885-2308.
 SCHARENBORG Odette, BESACIER Laurent, BLACK Alan, HASEGAWA-JOHNSON Mark, METZE Florian, NEUBIG Graham, STÜKER Sebastian, GODARD Pierre, MÜLLER Markus, ONDEL Lucas Antoine Francois, PALASKAR Shruti, ARTHUR Philip, CIANNELLA Francesco, DU Mingxing, LARSEN Elin, MERKX Danny, RIAD Rachid, WANG Liming a DUPOUX Emmanuel. Speech Technology for Unwritten Languages. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2020, roč. 2020, č. 28, s. 964-975. ISSN 2329-9290.
2019
 DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko a NAKATANI Tomohiro. Evaluation of SpeakerBeam target speech extraction in real noisy and reverberant conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN. Tokyo: Acoustical Society of Japan, 2019, roč. 2019, č. 2, s. 1-2. ISSN 0369-4232.
 MAGHSOODI Nooshin, SAMETI Hossein, ZEINALI Hossein a STAFYLAKIS Themos. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2019, roč. 2019, č. 11, s. 1815-1825. ISSN 2329-9290.
 NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, ČERNOCKÝ Jan a BURGET Lukáš. Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition. Computer Speech and Language. Amsterdam: Elsevier Science, 2019, roč. 2019, č. 58, s. 403-421. ISSN 0885-2308.
 SZŐKE Igor, SKÁCEL Miroslav, MOŠNER Ladislav, PALIESEK Jakub a ČERNOCKÝ Jan. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE Journal of Selected Topics in Signal Processing. 2019, roč. 13, č. 4, s. 863-876. ISSN 1932-4553.
 ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, NAKATANI Tomohiro, BURGET Lukáš a ČERNOCKÝ Jan. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE Journal of Selected Topics in Signal Processing. 2019, roč. 13, č. 4, s. 800-814. ISSN 1932-4553.

Recent Conference papers

2022
 ALAM Jahangir, BURGET Lukáš, GLEMBEK Ondřej, MATĚJKA Pavel, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna a STAFYLAKIS Themos a kol. Development of ABC systems for the 2021 edition of NIST Speaker Recognition evaluation. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 346-353.
 BASKAR Murali K., HERZIG Tim, NGUYEN Diana, DIEZ Sánchez Mireia, POLZEHL Tim, BURGET Lukáš a ČERNOCKÝ Jan. Speaker adaptation for Wav2vec2 based dysarthric ASR. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 3403-3407. ISSN 1990-9772.
 BASKAR Murali K., ROSENBERG Andrew, RAMABHADRAN Bhuvana a ZHANG Yu. Reducing Domain mismatch in Self-supervised speech pre-training. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 3028-3032. ISSN 1990-9772.
 BLATT Alexander, KOCOUR Martin, VESELÝ Karel, SZŐKE Igor a KLAKOW Dietrich. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8357-8361. ISBN 978-1-6654-0540-9.
 BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, MOŠNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, STAFYLAKIS Themos a BURGET Lukáš. Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 1446-1450. ISSN 1990-9772.
 DE Benito Gorron Diego, ŽMOLÍKOVÁ Kateřina a TORRE Toledano Doroteo. Source Separation for Sound Event Detection in domestic environments using jointly trained models. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, s. 1-5. ISBN 978-1-6654-6867-1.
 DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, SATO Hiroshi a NAKATANI Tomohiro. Listen only to me! How well can target speech extraction handle false alarms? In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 216-220. ISSN 1990-9772.
 HAN Jiangyu, LONG Yanhua, BURGET Lukáš a ČERNOCKÝ Jan. DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7292-7296. ISBN 978-1-6654-0540-9.
 KIŠŠ Martin, KOHÚT Jan, BENEŠ Karel a HRADIŠ Michal. Importance of Textlines in Historical Document Classification. In: Uchida, S., Barney, E., Eglin, V. (eds) Document Analysis Systems. La Rochelle: Springer Nature Switzerland AG, 2022, s. 158-170. ISBN 978-3-031-06554-5.
 KOCOUR Martin, UMESH Jahnavi, KARAFIÁT Martin, ŠVEC Ján, LOPEZ Fernando, BENEŠ Karel, DIEZ Sánchez Mireia, SZŐKE Igor, LUQUE Jordi, VESELÝ Karel, BURGET Lukáš a ČERNOCKÝ Jan. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. In: Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022, s. 276-280.
 KOCOUR Martin, ŽMOLÍKOVÁ Kateřina, ONDEL Lucas Antoine Francois, ŠVEC Ján, DELCROIX Marc, OCHIAI Tsubasa, BURGET Lukáš a ČERNOCKÝ Jan. Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 4955-4959. ISSN 1990-9772.
 LANDINI Federico Nicolás, LOZANO Díez Alicia, DIEZ Sánchez Mireia a BURGET Lukáš. From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 5095-5099. ISSN 1990-9772.
 MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7982-7986. ISBN 978-1-6654-0540-9.
 MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Multisv: Dataset for Far-Field Multi-Channel Speaker Verification. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7977-7981. ISBN 978-1-6654-0540-9.
 NIGMATULINA Iuliia, ZULUAGA-GOMEZ Juan, PRASAD Amrutha, SARFJOO Saeed a MOTLÍČEK Petr. A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 6282-6286. ISBN 978-1-6654-0540-9.
 ONDEL Lucas Antoine Francois, LAM-YEE-MUI Léa-Marie, KOCOUR Martin, CORRO Caio Filippo a BURGET Lukáš. GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8417-8421. ISBN 978-1-6654-0540-9.
 PENG Junyi, GU Rongzhi, MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Learnable Sparse Filterbank for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 5110-5114. ISSN 1990-9772.
 PENG Junyi, ZHANG Chunlei, ČERNOCKÝ Jan a YU Dong. Progressive contrastive learning for self-supervised text-independent speaker verification. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 17-24.
 SILNOVA Anna, STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej a BRUMMER Johan Nikolaas Langenhoven. Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 9-16.
 SOLEWICZ Yosef, COHEN Noa, ROHDIN Johan A., MADIKERI Srikanth a ČERNOCKÝ Jan. Speaker recognition on mono-channel telephony recordings. In: Proceedings of Odyssey 2022. Beijing: International Speech Communication Association, 2022, s. 193-199.
 STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, BURGET Lukáš a ČERNOCKÝ Jan. Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 605-609. ISSN 1990-9772.
 YUSUF Bolaji, GANDHE Ankur a SOKOLOV Alex. Usted: Improving ASR with a Unified Speech and Text Encoder-Decoder. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8297-8301. ISBN 978-1-6654-0540-9.
 ŠVEC Ján, ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, DELCROIX Marc, OCHIAI Tsubasa, MOŠNER Ladislav a ČERNOCKÝ Jan. Analysis of impact of emotions on target speech extraction and speech separation. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, s. 1-5. ISBN 978-1-6654-6867-1.
2021
 BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, ASTUDILLO Ramon a ČERNOCKÝ Jan. Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 6753-6757. ISBN 978-1-7281-7605-5.
 BENEŠ Karel a BURGET Lukáš. Text Augmentation for Language Models in High Error Recognition Scenario. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 1872-1876. ISSN 1990-9772.
 DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke a NAKATANI Tomohiro. Speaker activity driven neural speech extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Toronto: IEEE Signal Processing Society, 2021, s. 6099-6103. ISBN 978-1-7281-7605-5.
 EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 2901-2905. ISSN 1990-9772.
 HELMKE Hartmut, KLEINERT Matthias, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLÍČEK Petr, VESELÝ Karel, ONDŘEJ Karel, SMRŽ Pavel, HARFMANN Julia a WINDISCH Christian a kol. Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety. In: Proceedings of ATM 2021. on-line: Federal Aviation Administration, 2021, s. 1-10.
 HELMKE Hartmut, SHETTY Shruthi, KLEINERT Matthias, OHNEISER Oliver, EHR Heiko, MOTLÍČEK Petr, PRASAD Amrutha a WINDISCH Christian a kol. Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates. In: Proceedings of 11th SESAR Innovation Days 2021. Belgium, 2021, s. 1-8.
 KARAFIÁT Martin, VESELÝ Karel, ČERNOCKÝ Jan, PROFANT Ján, NYTRA Jiří, HLAVÁČEK Miroslav a PAVLÍČEK Tomáš. Analysis of X-Vectors for Low-Resource Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 6998-7002. ISBN 978-1-7281-7605-5.
 KIŠŠ Martin, BENEŠ Karel a HRADIŠ Michal. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions. In: Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lausanne: Springer Nature Switzerland AG, 2021, s. 463-477. ISBN 978-3-030-86336-4.
 KLEINERT Matthias, HELMKE Hartmut, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLÍČEK Petr a HARFMANN Julia. Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning. In: Proceedings of DASC 2021. San Antonio, Texas: Institute of Electrical and Electronics Engineers, 2021, s. 1-9. ISBN 978-1-6654-3420-1.
 KOCOUR Martin, CÁMBARA Guillermo, LUQUE Jordi, BONET David, FARRÚS Mireia, KARAFIÁT Martin, VESELÝ Karel a ČERNOCKÝ Jan. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. In: Proceedings of IberSPEECH 2021. Valladolid: International Speech Communication Association, 2021, s. 113-117.
 KOCOUR Martin, VESELÝ Karel, BLATT Alexander, ZULUAGA-GOMEZ Juan, SZŐKE Igor, ČERNOCKÝ Jan, KLAKOW Dietrich a MOTLÍČEK Petr. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3301-3305. ISSN 1990-9772.
 KOCOUR Martin, VESELÝ Karel, SZŐKE Igor, KESIRAJU Santosh, ZULUAGA-GOMEZ Juan, BLATT Alexander, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, KOLČÁREK Pavel, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael, LANDIS Fabian a SARFJOO Saeed a kol. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In: Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Brussels: MDPI, 2021, s. 1-10. ISSN 2504-3900.
 LANDINI Federico Nicolás, GLEMBEK Ondřej, MATĚJKA Pavel, ROHDIN Johan A., BURGET Lukáš, DIEZ Sánchez Mireia a SILNOVA Anna. Analysis of the BUT Diarization System for Voxconverse Challenge. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 5819-5823. ISBN 978-1-7281-7605-5.
 LANDINI Federico Nicolás, LOZANO Díez Alicia, BURGET Lukáš, DIEZ Sánchez Mireia, SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, GLEMBEK Ondřej, MATĚJKA Pavel, STAFYLAKIS Themos a BRUMMER Johan Nikolaas Langenhoven. BUT System Description for The Third DIHARD Speech Diarization Challenge. In: Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania, 2021, s. 1-5.
 PENG Junyi, QU Xiaoyang, WANG Jianzong, GU Rongzhi, XIAO Jing, BURGET Lukáš a ČERNOCKÝ Jan. ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 511-515. ISSN 1990-9772.
 STAFYLAKIS Themos, ROHDIN Johan A. a BURGET Lukáš. Speaker embeddings by modeling channel-wise correlations. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 501-505. ISSN 1990-9772.
 SZŐKE Igor, KESIRAJU Santosh, NOVOTNÝ Ondřej, KOCOUR Martin, VESELÝ Karel a ČERNOCKÝ Jan. Detecting English Speech in the Air Traffic Control Voice Communication. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3286-3290. ISSN 1990-9772.
 VYDANA Hari K., KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. The IWSLT 2021 BUT Speech Translation Systems. In: 18th International Conference on Spoken Language Translation (IWSLT). Bangkok, on-line: Association for Computational Linguistics, 2021, s. 75-83. ISBN 978-1-7138-3378-9.
 VYDANA Hari K., KARAFIÁT Martin, ŽMOLÍKOVÁ Kateřina, BURGET Lukáš a ČERNOCKÝ Jan. Jointly Trained Transformers Models for Spoken Language Translation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 7513-7517. ISBN 978-1-7281-7605-5.
 WANNER Leo, KLUSCH Matthias, MAVROPOULOS Athanasios, JAMIN Emmanuel, MARIN Puchades Victor, CASAMAYOR Gerard, ČERNOCKÝ Jan a EGOROVA Ekaterina a kol. Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants. In: The PAAMS Collection. PAAMS 2021: Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. Salamanca: Springer International Publishing, 2021, s. 316-327. ISBN 978-3-030-85738-7.
 YUSUF Bolaji, GOK Alican, GUNDOGDU Batuhan a SARAÇLAR Murat. End-to-End Open Vocabulary Keyword Search. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 4388-4392. ISSN 1990-9772.
 YUSUF Bolaji, ONDEL Lucas Antoine Francois, BURGET Lukáš, ČERNOCKÝ Jan a SARAÇLAR Murat. A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 3710-3714. ISBN 978-1-7281-7605-5.
 ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLÍČEK Petr, VESELÝ Karel, KOCOUR Martin a SZŐKE Igor. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3296-3300. ISSN 1990-9772.
 ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, BURGET Lukáš, NAKATANI Tomohiro a ČERNOCKÝ Jan. Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings. Shenzhen - virtual: IEEE Signal Processing Society, 2021, s. 889-896. ISBN 978-1-7281-7066-4.
 ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, RAJ Desh, WATANABE Shinji a ČERNOCKÝ Jan. Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics. In: Proceedings of 2021 Interspeech. Brno: International Speech Communication Association, 2021, s. 1464-1468. ISSN 1990-9772.
2020
 ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, DAHMANE Mohamed, DIEZ Sánchez Mireia, GLEMBEK Ondřej, LALONDE Marc, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOISEUX Cédric, MONTEIRO Joao, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, ST-CHARLES Pierre-Luc, WANG Shuai a ZEINALI Hossein. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 289-295. ISSN 2312-2846.
 BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, PULUGUNDLA Bhargav, ROHDIN Johan A., SILNOVA Anna a VESELÝ Karel. BUT System Description to SdSV Challenge 2020. In: Proceedings of Short-duration Speaker Verification Challenge 2020 Workshop. Shanghai, on-line event of Interspeech 2020 Conference, 2020, s. 1-5.
 DELCROIX Marc, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, TAWARA Naohiro, NAKATANI Tomohiro a ARAKI Shoko. Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 691-695. ISBN 978-1-5090-6631-5.
 DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás, WANG Shuai a ČERNOCKÝ Jan. Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 6519-6523. ISBN 978-1-5090-6631-5.
 DUNBAR Ewan, KARADAYI Julien, BERNARD Mathieu, CAO Xuan-Nga, ALGAYRES Robin, ONDEL Lucas Antoine Francois, BESACIER Laurent, SAKTI Sakriani a DUPOUX Emmanuel. The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 4831-4835. ISSN 1990-9772.
 LANDINI Federico Nicolás, WANG Shuai, DIEZ Sánchez Mireia, BURGET Lukáš, MATĚJKA Pavel, ŽMOLÍKOVÁ Kateřina, MOŠNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, NOVOTNÝ Ondřej, ZEINALI Hossein a ROHDIN Johan A. But System for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 6529-6533. ISBN 978-1-5090-6631-5.
 LOZANO Díez Alicia, SILNOVA Anna, PULUGUNDLA Bhargav, ROHDIN Johan A., VESELÝ Karel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej, NOVOTNÝ Ondřej a MATĚJKA Pavel. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 761-765. ISSN 1990-9772.
 MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A. a ČERNOCKÝ Jan. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 187-193. ISSN 2312-2846.
 SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, ROHDIN Johan A., STAFYLAKIS Themos a BURGET Lukáš. Probabilistic embeddings for speaker diarization. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 24-31. ISSN 2312-2846.
 WANG Shuai, ROHDIN Johan A., PLCHOT Oldřich, BURGET Lukáš, YU Kai a ČERNOCKÝ Jan. Investigation of Specaugment for Deep Speaker Embedding Learning. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 7139-7143. ISBN 978-1-5090-6631-5.
 ZEINALI Hossein, LEE Kong Aik, ALAM Jahangir a BURGET Lukáš. SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 731-735. ISSN 1990-9772.
 ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, ZHAN Qingran, VESELÝ Karel a BRAUN Rudolf. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In: Proceedings of Interspeech 2020. Shanghai: International Speech Communication Association, 2020, s. 2297-2301. ISSN 1990-9772.
 ZULUAGA-GOMEZ Juan, VESELÝ Karel, BLATT Alexander, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, SZŐKE Igor, PRASAD Amrutha, SARFJOO Saeed, KOLČÁREK Pavel, KOCOUR Martin, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael a LANDIS Fabian. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. In: Proceedings of the 8th OpenSky Symposium 2020. Brussels: MDPI, 2020, s. 1-10. ISSN 2504-3900.
 ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, LANDINI Federico Nicolás, BENEŠ Karel, KARAFIÁT Martin, VYDANA Hari K., LOZANO Díez Alicia, PLCHOT Oldřich, BASKAR Murali K., ŠVEC Ján, MOŠNER Ladislav, MALENOVSKÝ Vladimír, BURGET Lukáš, YUSUF Bolaji, NOVOTNÝ Ondřej, GRÉZL František, SZŐKE Igor a ČERNOCKÝ Jan. BUT System for CHiME-6 Challenge. In: Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020, s. 1-3.
2019
 ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai, ZEINALI Hossein, DAHMANE Mohamed, ST-CHARLES Pierre-Luc, LALONDE Marc, NOISEUX Cédric a MONTEIRO Joao. ABC System Description for NIST Multimedia Speaker Recognition Evaluation 2019. In: Proceedings of NIST 2019 SRE Workshop. Sentosa, Singapore: National Institute of Standards and Technology, 2019, s. 1-7.
 ALAM Jahangir, BOULIANNE Gilles, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MONTEIRO Joao, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai a ZEINALI Hossein. ABC NIST SRE 2019 CTS System Description. In: Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019, s. 1-6.
 BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, KARAFIÁT Martin, HORI Takaaki a ČERNOCKÝ Jan. Promising Accurate Prefix Boosting For Sequence-to-sequence ASR. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 5646-5650. ISBN 978-1-5386-4658-8.
 BASKAR Murali K., WATANABE Shinji, ASTUDILLO Ramon, HORI Takaaki, BURGET Lukáš a ČERNOCKÝ Jan. Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 3790-3794. ISSN 1990-9772.
 BENEŠ Karel, IRIE Kazuki, BECK Eugen, SCHLÜTER Ralf a NEY Hermann. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources. In: Proceedings of DAGA 2019. Rostock: Deutsche Gesellschaft für Akustik (DEGA), DEGA Head office, 2019, s. 954-957. ISBN 978-3-939296-14-0.
 CARTAS Alejandro, KOCOUR Martin, RAMAN Aravindh, LEONTIADIS Ilias, LUQUE Jordi, SASTRY Nishanth, NUNEZ-MARTINEZ Leon, PERINO Diego a PERALES Carlos Segura. A Reality Check on Inference at Mobile Networks Edge. In: Proceedings of the 2nd ACM International Workshop on Edge Systems, Analytics and Networking (EDGESYS '19). Dresden: Association for Computing Machinery, 2019, s. 54-59. ISBN 978-1-4503-6275-7.
 CHO Jaejin, WATANABE Shinji, HORI Takaaki, BASKAR Murali K., INAGUMA Hirofumi, VILLALBA Lopez Jesus Antonio a DEHAK Najim. Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6191-6195. ISBN 978-1-5386-4658-8.
 DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko a NAKATANI Tomohiro. Compact Network for Speakerbeam Target Speaker Extraction. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6965-6969. ISBN 978-1-5386-4658-8.
 DIEZ Sánchez Mireia, BURGET Lukáš, WANG Shuai, ROHDIN Johan A. a ČERNOCKÝ Jan. Bayesian HMM based x-vector clustering for Speaker Diarization. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 346-350. ISSN 1990-9772.
 INAGUMA Hirofumi, CHO Jaejin, BASKAR Murali K., KAWAHARA Tatsuya a WATANABE Shinji. Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6096-6100. ISBN 978-1-5386-4658-8.
 KARAFIÁT Martin, BASKAR Murali K., WATANABE Shinji, HORI Takaaki, WIESNER Matthew a ČERNOCKÝ Jan. Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2220-2224. ISSN 1990-9772.
 MATĚJKA Pavel, PLCHOT Oldřich, ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, BURGET Lukáš, NOVOTNÝ Ondřej a GLEMBEK Ondřej. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2448-2452. ISSN 1990-9772.
 MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., BURGET Lukáš a ČERNOCKÝ Jan. Speaker Verification with Application-Aware Beamforming. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, s. 411-418. ISBN 978-1-7281-0306-8.
 MOŠNER Ladislav, WU Minhua, RAJU Anirudh, PARTHASARATHI Sree Hari Krishnan, KUMATANI Kenichi, SUNDARAM Shiva, MAAS Roland a HOFFMEISTER Björn. Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6475-6479. ISBN 978-1-5386-4658-8.
 NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej a BURGET Lukáš. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 4330-4334. ISSN 1990-9772.
 NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš a MATĚJKA Pavel. Discriminatively Re-trained i-Vector Extractor For Speaker Recognition. In: Proceedings of 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6031-6035. ISBN 978-1-5386-4658-8.
 ONDEL Lucas Antoine Francois, LI Ruizhi, SELL Gregory a HEŘMANSKÝ Hynek. Deriving Spectro-temporal Properties of Hearing from Speech Data. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 411-415. ISBN 978-1-5386-4658-8.
 ONDEL Lucas Antoine Francois, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery. In: Proceedings of Interspeech 2019. Graz: International Speech Communication Association, 2019, s. 261-265. ISSN 1990-9772.
 ROHDIN Johan A., STAFYLAKIS Themos, SILNOVA Anna, ZEINALI Hossein, BURGET Lukáš a PLCHOT Oldřich. Speaker Verification Using End-To-End Adversarial Language Adaptation. In: Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019, s. 6006-6010. ISBN 978-1-5386-4658-8.
 STAFYLAKIS Themos, ROHDIN Johan A., PLCHOT Oldřich, MIZERA Petr a BURGET Lukáš. Self-supervised speaker embeddings. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2863-2867. ISSN 1990-9772.
 SUBRAMANIAN Aswin S., WANG Xiaofei, BASKAR Murali K., WATANABE Shinji, TANIGUCHI Toru, TRAN Dung a FUJITA Yuya. Speech Enhancement Using End-to-End Speech Recognition Objectives. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY: IEEE Signal Processing Society, 2019, s. 234-238. ISBN 978-1-7281-1123-0.
 WANG Shuai, ROHDIN Johan A., BURGET Lukáš, PLCHOT Oldřich, QIAN Yanmin, YU Kai a ČERNOCKÝ Jan. On the Usage of Phonetic Information for Text-independent Speaker Embedding Extraction. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 1148-1152. ISSN 1990-9772.
 YANG Jinyi, ONDEL Lucas Antoine Francois, MANOHAR Vimal a HEŘMANSKÝ Hynek. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 3747-3751. ISBN 978-1-5386-4658-8.
 ZEINALI Hossein, BURGET Lukáš, ROHDIN Johan A., STAFYLAKIS Themos a ČERNOCKÝ Jan. How To Improve Your Speaker Embeddings Extractor in Generic Toolkits. In: Proceedings of 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6141-6145. ISBN 978-1-5386-4658-8.
 ZEINALI Hossein, STAFYLAKIS Themos, ATHANASOPOULOU Georgia, ROHDIN Johan A., GKINIS Ioanis, BURGET Lukáš a ČERNOCKÝ Jan. Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 1073-1077. ISSN 1990-9772.
 ZEINALI Hossein, WANG Shuai, SILNOVA Anna, MATĚJKA Pavel a PLCHOT Oldřich. BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. In: Proceedings of The VoxCeleb Challenge Workshop 2019. Graz, 2019, s. 1-4.
 ZEINALI Hossein, ČERNOCKÝ Jan a BURGET Lukáš. A multi purpose and large scale speech corpus in Persian and English for speaker and speech recognition: the DeepMine database. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, s. 397-402. ISBN 978-1-7281-0306-8.

PhD Dissertations

2020 KESIRAJU Santosh. Generative models for learning document representations along with their uncertainties. PhD thesis, IIIT Hyderabad, 2020.
2017 VESELÝ Karel. Semi-Supervised Training of Deep Neural Networks for Speech Recognition. PhD thesis, Brno University of Technology, FIT, 2017.
2016 HANNEMANN Mirko. Finite-state based recognition networks for forward-backward speech decoding. PhD thesis, Brno University of Technology, FIT, 2016.
2014 PLCHOT Oldřich. Extensions to Probabilistic Linear Discriminant Analysis for Speaker Recognition. PhD thesis, Brno University of Technology, FIT, 2014.
FAPŠO Michal. Query-by-Example Spoken Term Detection. PhD thesis, Brno University of Technology, FIT, 2014.
SOUFIFAR Mehdi. Subspace Modeling of Discrete Features for Language Recognition. PhD thesis, Norwegian University of Science and Technology, 2014.
2012 GLEMBEK Ondřej. Optimization of Gaussian Mixture Subspace Models and Related Scoring Algorithms in Speaker Verification. PhD thesis, Brno University of Technology, FIT, 2012.
MIKOLOV Tomáš. Statistical Language Models Based on Neural Networks. PhD thesis, Brno University of Technology, FIT, 2012.
2011 KOCKMANN Marcel. Subspace Modeling of Prosodic Features for Speaker Verification. PhD thesis, Brno University of Technology, FIT, 2011.
2010 SZŐKE Igor. Hybrid word-subword spoken term detection. PhD thesis, Brno University of Technology, FIT, 2010.
2008 SCHWARZ Petr. Phoneme recognition based on long temporal context. PhD thesis, Brno University of Technology, FIT, 2008.
MATĚJKA Pavel. Phonotactic and acoustic language recognition. PhD thesis, Brno University of Technology, FEKT, 2008.
OPARIN Ilya. Language models for automatic speech recognition of inflectional languages. PhD thesis, University of West Bohemia, FAS, 2008.
KARAFIÁT Martin. Study of Linear Transformations Applied to Training of Cross-Domain Adapted Large Vocabulary Continuous Speech Recognition Systems. PhD thesis, Brno University of Technology, FIT, 2008.
2007 GRÉZL František. TRAP-based probabilistic features for automatic speech recognition. PhD thesis, Brno University of Technology, FIT, 2007.
2004 BURGET Lukáš. Complementarity of Speech Recognition Systems and System Combination. PhD thesis, Brno University of Technology, FIT, 2004.
2003 MOTLÍČEK Petr. Modeling of Spectra and Temporal Trajectories in Speech Processing. PhD thesis, Brno University of Technology, FIT, 2003.