Publikationen

2025

  • Laurynas Zavistanavicius, Frank Zalkow, Christian Dittmar, and Robert L. Stevenson. Adapting the fréchet audio distance as an objective metric for text-to-speech quality evaluation. In Proceedings of the ITG Conference on Speech Communication. Berlin, Germany, 2025. accepted. doi:.
    PDF BibTeX▼
  • Frank Zalkow, Benedikt Schäfer, Thomas Moissl, Jonas Bücherl, Kerstin Markl, Sebastian Bothe, Francois Duchateau, Julia Dollase, Patric Kabus, Daniel Steinigen, Oliver Schmitt, and Fabian Küch. Generating search-engine-optimized headlines for sports news. In Proceedings of the Conference on Natural Language Processing (KONVENS), 59–65. Hildesheim, Germany, 2025.
    PDF BibTeX▼
  • Judith Bauer, Frank Zalkow, Meinard Müller, and Christian Dittmar. Explicit emphasis control in text-to-speech synthesis. In Proceedings of the ISCA Speech Synthesis Workshop (SSW), 21–27. Leeuwarden, The Netherlands, 2025. doi:10.21437/SSW.2025-4.
    PDF BibTeX▼
  • Zahra Kolagar, Frank Zalkow, and Alessandra Zarcone. Investigating methods for mapping learning objectives to bloom's revised taxonomy in course descriptions for higher education. In Proceedings of the Workshop on Innovative Use of NLP for Building Educational Applications (BEA), 415–445. Vienna, Austria, 2025. doi:10.18653/v1/2025.bea-1.32.
    PDF BibTeX▼
  • Subhayu Ghosh, Frank Zalkow, and Nanda Dulal Jana. Enhanced audio-visual speech synthesis via multi-discriminative learning. IEEE Transactions on Multimedia, ():, 2025. accepted. doi:.
    PDF BibTeX▼
  • Frank Zalkow, Paolo Sani, Kishor Kayyar Lakshminarayana, Emanuël A. P. Habets, Nicola Pia, and Christian Dittmar. Bridging the training–inference gap in TTS: Training strategies for robust generative postprocessing for low-resource speakers. In Proceedings of the Conference of the International Speech Communication Association (INTERSPEECH), 2470–2474. Rotterdam, The Netherlands, 2025. doi:10.21437/Interspeech.2025-854.
    PDF Details BibTeX▼
  • Kishor Kayyar Lakshminarayana, Frank Zalkow, Christian Dittmar, Nicola Pia, and Emanuël A. P. Habets. Low-resource text-to-speech synthesis using noise-augmented training of ForwardTacotron. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Hyderabad, India, 2025. doi:10.1109/ICASSP49660.2025.10890686.
    PDF BibTeX▼

2024

  • Arunava Kr. Kalita, Christian Dittmar, Paolo Sani, Frank Zalkow, Emanuël A. P. Habets, and Rusha Patra. PAD-VC: A prosody-aware decoder for any-to-few voice conversion. In Proceedings of the International Workshop on Acoustic Signal Enhancement (IWAENC), 389–393. Aalborg, Denmark, 2024. doi:10.1109/IWAENC61483.2024.10694576.
    PDF Details BibTeX▼
  • Florian Lux, Sarina Meyer, Lyonel Behringer, Frank Zalkow, Phat Do, Matt Coler, Emanuël A. P. Habets, and Ngoc Thang Vu. Meta learning text-to-speech synthesis in over 7000 languages. In Proceedings of the Conference of the International Speech Communication Association (INTERSPEECH), 4958–4962. Kos, Greece, 2024. doi:10.21437/Interspeech.2024-1335.
    PDF Details BibTeX▼
  • Subhayu Ghosh, Snehashis Sarkar, Sovan Ghosh, Frank Zalkow, and Nanda Dulal Jana. Audio-visual speech synthesis using vision transformer–enhanced autoencoders with ensemble of loss functions. Applied Intelligence, 54(6):4507–4524, 2024. doi:10.1007/s10489-024-05380-7.
    PDF Details BibTeX▼
  • Judith Bauer, Frank Zalkow, Meinard Müller, and Christian Dittmar. Evaluating the impact of prosody feature normalization on the controllability of pitch in speech synthesis. In Elektronische Sprachsignalverarbeitung (ESSV), 188–195. Regensburg, Germany, 2024. doi:10.35096/othr/pub-7097.
    PDF BibTeX▼

2023

  • Christof Weiß, Vlora Arifi-Müller, Michael Krause, Frank Zalkow, Stephanie Klauk, Rainer Kleinertz, and Meinard Müller. Wagner Ring Dataset: A complex opera scenario for music processing and computational musicology. Transactions of the International Society for Music Information Retrieval (TISMIR), 6(1):135–149, 2023. doi:10.5334/tismir.161.
    PDF Details BibTeX▼
  • Frank Zalkow, Paolo Sani, Michael Fast, Judith Bauer, Mohammad Joshaghani, Kishor Kayyar Lakshminarayana, Emanuël A. P. Habets, and Christian Dittmar. The AudioLabs system for the Blizzard Challenge 2023. In Proceedings of the Blizzard Challenge Workshop, 63–68. Grenoble, France, 2023. doi:10.21437/Blizzard.2023-8.
    PDF BibTeX▼
  • Paolo Sani, Judith Bauer, Frank Zalkow, Emanuël A. P. Habets, and Christian Dittmar. Improving the naturalness of synthesized spectograms for TTS using GAN-based post-processing. In Proceedings of the ITG Conference on Speech Communication, 270–274. Aachen, Germany, 2023. doi:10.30420/456164053.
    PDF Details BibTeX▼
  • Meinard Müller and Frank Zalkow. FMP notebooks. In Peter Moormann and Nicolas Ruth, editors, Musik und Internet: Aktuelle Phänomene populärer Kulturen, Musik und Medien, pages 237–247. Springer VS, Wiesbaden, Germany, 2023. doi:10.1007/978-3-658-39145-4.
    BibTeX▼
  • Frank Zalkow, Prachi Govalkar, Meinard Müller, Emanuël A. P. Habets, and Christian Dittmar. Evaluating speech–phoneme alignment and its impact on neural text-to-speech synthesis. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Rhodes Island, Greece, 2023. doi:10.1109/ICASSP49357.2023.10097248.
    PDF Details BibTeX▼

2022

  • Yi-Jen Shih, Shih-Lun Wu, Frank Zalkow, Meinard Müller, and Yi-Hsuan Yang. Theme transformer: Symbolic music generation with theme-conditioned transformer. IEEE Transactions on Multimedia, 25:3495–3508, 2022. doi:10.1109/TMM.2022.3161851.
    PDF Details BibTeX▼

2021

  • Frank Zalkow and Meinard Müller. CTC-based learning of chroma features for score–audio music retrieval. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29:2957–2971, 2021. doi:10.1109/TASLP.2021.3110137.
    PDF Details BibTeX▼
  • Meinard Müller and Frank Zalkow. libfmp: A Python package for fundamentals of music processing. Journal of Open Source Software (JOSS), 2021. doi:10.21105/joss.03326.
    PDF Details BibTeX▼
  • Frank Zalkow. Learning Audio Representations for Cross-Version Retrieval of Western Classical Music. PhD thesis, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany, 2021.
    PDF BibTeX▼
  • Frank Zalkow, Julian Brandner, and Meinard Müller. Efficient retrieval of music recordings using graph-based index structures. Signals, 2(2):336–352, 2021. doi:10.3390/signals2020021.
    PDF Details BibTeX▼
  • Christof Weiß, Frank Zalkow, Vlora Arifi-Müller, Meinard Müller, Hendrik Vincent Koops, Anja Volk, and Harald G. Grohganz. Schubert Winterreise dataset: A multimodal scenario for music analysis. ACM Journal on Computing and Cultural Heritage (JOCCH), 2021. doi:10.1145/3429743.
    PDF Details BibTeX▼

2020

  • Frank Zalkow, Stefan Balke, Vlora Arifi-Müller, and Meinard Müller. MTD: a multimodal dataset of musical themes for MIR research. Transactions of the International Society for Music Information Retrieval (TISMIR), 3(1):180–192, 2020. doi:10.5334/tismir.68.
    PDF Details BibTeX▼
  • Stephanie Klauk and Frank Zalkow. Methoden computergestützter melodischer Analyse am Beispiel italienischer Streichquartette. In Stephanie Klauk, editor, Instrumentalmusik neben Haydn und Mozart. Analyse, Aufführungspraxis und Edition, pages 151–168. Saarbrücker Studien zur Musikwissenschaft 20, Königshausen & Neumann, Saarbrücken, Germany, 2020.
    BibTeX▼
  • Michael Krause, Frank Zalkow, Julia Zalkow, Christof Weiß, and Meinard Müller. Classifying leitmotifs in recordings of operas by Richard Wagner. In Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 473–480. Montréal, Canada, 2020.
    PDF Details BibTeX▼
  • Hendrik Schreiber, Frank Zalkow, and Meinard Müller. Modeling and estimating local tempo: A case study on Chopin's mazurkas. In Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 773–779. Montréal, Canada, 2020.
    PDF BibTeX▼
  • Frank Zalkow and Meinard Müller. Using weakly aligned score–audio pairs to train deep chroma models for cross-modal music retrieval. In Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 184–191. Montréal, Canada, 2020.
    PDF Details BibTeX▼
  • Frank Zalkow and Meinard Müller. Learning low-dimensional embeddings of audio shingles for cross-version retrieval of classical music. Applied Sciences, 2020. doi:10.3390/app10010019.
    PDF BibTeX▼

2019

  • Frank Zalkow, Angel Villar Corrales, TJ Tsai, Vlora Arifi-Müller, and Meinard Müller. Tools for semi-automatic bounding box annotation of musical measures in sheet music. In Demos and Late Breaking News of the International Society for Music Information Retrieval Conference (ISMIR). Delft, The Netherlands, 2019.
    PDF Details BibTeX▼
  • Prachi Govalkar, Johannes Fischer, Frank Zalkow, and Christian Dittmar. A comparison of recent neural vocoders for speech signal reconstruction. In Proceedings of the ISCA Speech Synthesis Workshop (SSW), 7–12. Vienna, Austria, September 2019. doi:10.21437/SSW.2019-2.
    PDF BibTeX▼
  • Meinard Müller and Frank Zalkow. FMP notebooks: educational material for teaching and learning fundamentals of music processing. In Proceedings of the International Conference on Music Information Retrieval (ISMIR), 573–580. Delft, The Netherlands, November 2019.
    PDF Details BibTeX▼
  • Frank Zalkow, Stefan Balke, and Meinard Müller. Evaluating salience representations for cross-modal retrieval of western classical music recordings. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 331–335. Brighton, United Kingdom, 2019. doi:10.1109/ICASSP.2019.8683609.
    PDF Details BibTeX▼

2018

  • Frank Zalkow, Sebastian Rosenzweig, Johannes Graulich, Lukas Dietz, El Mehdi Lemnaouar, and Meinard Müller. A web-based interface for score following and track switching in choral music. In Demos and Late Breaking News of the International Society for Music Information Retrieval Conference (ISMIR). Paris, Fance, 2018.
    PDF Details BibTeX▼
  • Frank Zalkow and Meinard Müller. Vergleich von PCA- und Autoencoder-basierter Dimensionsreduktion von Merkmalssequenzen für die effiziente Musiksuche. In Proceedings of the Deutsche Jahrestagung für Akustik (DAGA), 1526–1529. München, Germany, 2018.
    PDF BibTeX▼
  • Meinard Müller, Helmut Hedwig, Frank Zalkow, and Stefan Popescu. Constraint-based time-scale modification of music recordings for noise beautification. Applied Sciences, March 2018. doi:10.3390/app8030436.
    PDF Details BibTeX▼

2017

  • Frank Zalkow, Christof Weiß, and Meinard Müller. Exploring tonal-dramatic relationships in richard Wagner’s ring cycle. In Proceedings of the International Conference on Music Information Retrieval (ISMIR), 642–648. Suzhou, China, 2017.
    PDF BibTeX▼
  • Christof Weiß, Frank Zalkow, Meinard Müller, Stephanie Klauk, and Rainer Kleinertz. Versionsübergreifende Visualisierung harmonischer Verläufe: Eine Fallstudie zu Wagners Ring-Zyklus. In Proceedings of the Jahrestagung der Gesellschaft für Informatik (GI), 205–217. Chemnitz, Germany, 2017. doi:10.18420/in2017_14.
    PDF BibTeX▼
  • Frank Zalkow, Christof Weiß, Thomas Prätzlich, Vlora Arifi-Müller, and Meinard Müller. A multi-version approach for transferring measure annotations between music recordings. In Proceedings of the AES International Conference on Semantic Audio, 148–155. Erlangen, Germany, 2017. doi:10.17743/aesconf.2017.978-1-942220-15-2.
    PDF BibTeX▼

2016

  • Stephanie Klauk and Frank Zalkow. Das italienische Streichquartett im 18. Jahrhundert. Möglichkeiten der semiautomatisierten Stilanalyse. In Wolfgang Auhagen and Wolfgang Hirschmann, editors, Bericht zur Jahrestagung der Gesellschaft für Musikforschung (GfM) 2015 in Halle/Saale. Mainz, Germany, 2016. Schott Campus.
    PDF BibTeX▼
  • Frank Zalkow, Stephan Brand, and Bejamin Graf. Musical Style Modification as an Optimization Problem. In Hans Timmermans, editor, Proceedings of the International Computer Music Conference Utrecht 2016, 206–211. Utrecht, Netherlands, 2016. HKU University of the Arts Utrecht, HKU Music and Technology.
    PDF BibTeX▼