Ingo Siegert

Jun.-Prof. Dr.-Ing. Ingo Siegert

Fakultät für Elektrotechnik und Informationstechnik (FEIT)
Institut für Informations- und Kommunikationstechnik (IIKT)

Universitätsplatz 2, G03-325

Vitae

since 11/2018	Assistant Professor for Mobile Dialogsystems at the Institute for Information Technology and Communications University Magdeburg
04/2015 - 10/2018	Post-doctoral researcher at the Cognitive Systems Group
03/2015	Graduation (Dr.-Ing.) Title of PhD-Thesis: Emotional and User-Specific Cues for Improved Analysis of Naturalistic Interactions Otto-von-Guericke-Universität Magdeburg
07/2009-03/2015	Research asistant at the Cognitive Systems Group within the project SFB/TRR 62
05/2009	Diploma in Engineering Sciences title of Diploma thesis: Implementierung einer Sprecherverifikation für ein generisches Telefon-Dialogsystem
9/2006 - 05/2009 mit Unterbrechungen	Student Assistant at the Cognitive Systems Group
10/2007 - 03/2008	Internship at IBM Deutschland Entwicklung GmbH Department WebSphere VoiceServer Language Development
10/2003-05/2009	Study of information technology at the Otto-von-Guericke-University Magdeburg Beginn des Studiums der Informationstechnologie an der Otto-von-Guericke-Universität Magdeburg
06/2003	Acquisition of the general higher education entrance qualification at the Gymnasium Stadtfeld Wernigerode

Dissertation

Emotional and User-Specific Cues for Improved Analysis of Naturalistic Interactions. Otto-von-Guericke-Universität Magdeburg, 2015

Research Interests

Speech-Processing, Addressee-detection, Dialog-design, Speech-signal processing, Speech coding, Human-Machine-Interaction, Modelling of User's mood or intention, speakergroups and emotions

Memberships

Institute of Electrical and Electronics Engineers (IEEE)

Usability in Germany (UiG)

2026

Article in conference proceedings

Creating documents with voice - maybe it is not about transcription but reflection?

Busch, Matthias; Schewior, Jonas; Wendemuth, Andreas; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2026 / Konferenz Elektronische Sprachsignalverarbeitung , 2026 - Dresden : TUDpress ; Wirsching, Günther *1960-*, S. 143-150 - (Studientexte zur Sprachkommunikation; Bd. 113) [Konferenz: 37. Konferenz Elektronische Sprachsignalverarbeitung, Eichstätt, 4.-6. März 2026]

2025

Peer-reviewed journal article

Robot System Assistant (RoSA) - evaluation of touch and speech input modalities for on-site HRI and telerobotics

Strazdas, Dominykas; Busch, Matthias; Shaji, Rijin; Siegert, Ingo; Hamadi, al- Ayoub

In: Frontiers in robotics and AI - Lausanne : [Verlag nicht ermittelbar], Bd. 12 (2025), Artikel 1561188, insges. 15 S.

Publication link

Music time-out with digital voice assistant - design of a music intervention to complement psychotherapeutic/psychosomatic treatment

Metzner, Susanne; Siegert, Ingo; Busch, Matthias; Krüger, Julia

In: Approaches: An Interdisciplinary Journal of Music Therapy - [Erscheinungsort nicht ermittelbar] : Approaches, Bd. 17 (2025), Heft 4, insges. 13 S.

Publication link

Book chapter

Cloning dialects - recreating and localizing dialectal voices

Fischer, Hanna; Lameli, Alfred; Schubert, Martha; Siegert, Ingo

In: 2025 IEEE International Professional Communication Conference , 2025 - Piscataway, NJ : IEEE, S. 358-367 [Konferenz: 2025 IEEE International Professional Communication Conference (ProComm), Sønderborg, Denmark, 20-23 July 2025]

Publication link

Article in conference proceedings

Speech technology in psychotherapy - exploring transcription tools and their potential impact

Schubert, Martha; Busch, Matthias; Krüger, Julia; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2025 / Konferenz Elektronische Sprachsignalverarbeitung , 2025 - Dresden : TUDpress ; Grawunder, Sven, S. 289-296 [Konferenz: 36. Konferenz zur Elektronischen Sprachsignalverarbeitung, Halle/Saale, 5.-7. März 2025]

Gender spectrum data from podcasts - a proof of concept

Marquenie, Jan; Leonhardt, Mareile; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2025 - Dresden : TUDpress ; Grawunder, Sven, S. 239-246 [Konferenz: 36. Konferenz zur Elektronischen Sprachsignalverarbeitung, Halle/Saale, 5.-7. März 2025]

Pitch strength for normal-hearing listeners and cochlear-implant users

Verhey, Jesko L.; Leyhausen, Hilmar; Beyer, Benjamin; Siegert, Ingo; Böckmann-Barthel, Martin

In: Proceedings of DAS/DAGA 2025 , 2025 - Berlin : Deutsche Gesellschaft für Akustik e.V. (DEGA) ; Dau, Torsten, S. 1120-1121

Publication link

Queer waves - a German speech dataset capturing gender and sexual diversity from podcasts and YouTube

Siegert, Ingo; Marquenie, Jan; Grawunder, Sven

In: Interspeech 2024 - International Speech and Communication Association . - 2025, S. 679-683 [Koferenz: Interspeech 2025, Rotterdam, The Netherlands, 17-21 August 2025]

Publication link

Abstract

Relevanz und Möglichkeiten automatischer Anonymisierung von sensiblen Sprachdaten in der Psychosomatischen Medizin und Psychotherapie

Sinha, Yamini; Siegert, Ingo; Krüger, Julia

In: Zeitschrift für psychosomatische Medizin und Psychotherapie - Göttingen : Vandenhoeck & Ruprecht, Bd. 71 (2025), Heft 1, S. 96-97, Artikel (#558)

Publication link

Automatische Transkription von Psychotherapiegesprächen - Vergleich gängiger Systeme hinsichtlich ihrer Eignung für die automatisierte Sprachinhaltsanalyse in der Psychotherapieforschung

Schubert, Martha; Krüger, Julia; Siegert, Ingo

In: Zeitschrift für psychosomatische Medizin und Psychotherapie - Göttingen : Vandenhoeck & Ruprecht, Bd. 71 (2025), Heft 1, S. 75-76, Artikel (#409)

Publication link

2024

Book chapter

AnonEmoFace - emotion preserving facial anonymization

Hintz, Jan; Rühe, Jacob; Siegert, Ingo

In: Proceedings of the 10th International Conference on Information Systems Security and Privacy, Volume 1 - Setúbal : SciTePress - Science and Technology Publications, Lda. ; Lenzini, Gabriele . - 2024, S. 785-788 [Konferenz: 10th International Conference on Information Systems Security and Privacy, Rome, Italy, February 26-28, 2024]

Publication link

Speech recognition errors in ASR engines and their impact on linguistic analysis in psychotherapies

Schubert, Martha; Sinha, Yamini; Krüger, Julia; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2024 / Rue , Mitch - Dresden : TUDpress ; Rue, Mitch, S. 203-210 - (Studientexte zur Sprachkommunikation; 107) [Konferenz: 35. Konferenz „Elektronische Sprachsignalverarbeitung”, Regensburg, 6.-8. März 2024]

Evaluation of audio deepfakes - systematic review

Sinha, Yamini; Hintz, Jan; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2024 / Rue , Mitch - Dresden : TUDpress ; Rue, Mitch, S. 181-187 - (Studientexte zur Sprachkommunikation; 107) [Konferenz: 35. Konferenz „Elektronische Sprachsignalverarbeitung”, Regensburg, 6.-8. März 2024]

Review of usage and potentials of conversational interfaces at universities and in students' daily lives

Kisser, Lea; Busch, Matthias; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2024 / Rue , Mitch - Dresden : TUDpress ; Rue, Mitch, S. 38-45 - (Studientexte zur Sprachkommunikation; 107) [Konferenz: 35. Konferenz „Elektronische Sprachsignalverarbeitung”, Regensburg, 6.-8. März 2024]

Embarking on inclusive voice user interfaces - initial steps in exploring technology integration within the Seminar ‘AI and Educational Sciences’

Busch, Matthias; Ibs, Robin; Siegert, Ingo

In: Universal Access in Human-Computer Interaction , 1st ed. 2024. - Cham : Springer Nature Switzerland, S. 35-50 - (Lecture notes in computer science; volume 14696) [Konferenz: 18th International Conference on Universal Access in Human-Computer Interaction, UAHCI 2024, Washington, DC, USA, June 29 – July 4, 2024]

Publication link

Development of an automated, rule-based measurement method for easy language and its application to aI-generated texts

Siegert, Ingo; Al-Hamad, Ahmad; Pongratz, Katharina Maria; Busch, Matthias

In: HCI International 2024 Posters , 1st ed. 2024. - Cham : Springer Nature Switzerland ; Stephanidis, Constantine, S. 234-244 - (Communications in computer and information science; volume 2120) [Konferenz: 26th International Conference on Human-Computer Interaction, HCII 2024, Washington, DC, USA, June 29 – July 4, 2024]

Publication link

Challenges of German speech recognition - a study on multi-ethnolectal speech among adolescents

Schubert, Martha; Duran, Daniel; Siegert, Ingo

In: Interspeech 2024 - International Speech and Communication Association, S. 3045-3049 [Konferenz: Interspeech 2024, Kos, Greece, 1-5 September 2024]

Publication link

Anonymising elderly and pathological speech - voice conversion using DDSP and query-by-example

Ghosh, Suhita; Jouaiti, Melanie; Das, Arnab; Sinha, Yamini; Polzehl, Tim; Siegert, Ingo; Stober, Sebastian

In: Interspeech 2024 - International Speech and Communication Association, S. 4438-4442 [Konferenz: Interspeech 2024, Kos, Greece, 1-5 September 2024]

Publication link

CommonBench - a larger scale speaker verification benchmark

Hintz, Jan; Siegert, Ingo

In: 4th Symposium on Security and Privacy in Speech Communication - Kos, Greece, 6 September 2024 - International Speech Communication Association ; Siegert, Ingo, S. 17-20 [Symposium: 4th Symposium on Security and Privacy in Speech Communication, Kos, Greece, 6 September 2024]

Publication link

Safeguarding speech content style - enhancing privacy beyond speaker identity

Sinha, Yamini; Raivakhovskyi, Mykola; Schubert, Martha; Siegert, Ingo

In: 4th Symposium on Security and Privacy in Speech Communication - Kos, Greece, 6 September 2024 - International Speech Communication Association ; Siegert, Ingo, S. 92-101 [Symposium: 4th Symposium on Security and Privacy in Speech Communication, Kos, Greece, 6 September 2024]

Publication link

User perspective on anonymity in voice assistants - a comparison between Germany and Finland

Siegert, Ingo; Rech, Silas; Bäckström, Tom; Haase, Matthias

In: LREC-Coling 2024 - proceedings of the Workshop on Legal and Ethical Issues in Human Language Technologies : Turin, Italy - ELRA Language Resource Associatio ; Siegert, Ingo, S. 73-78 [Workshop: Workshop on Legal and Ethical Issues in Human Language Technologies, Turin, 20. May 2024]

Publication link

Abstract

First steps into aspire - a pilot study on automated speech analysis regarding psychotherapeutic alliance in psychotherapies

Schubert, Martha; Schenk, Michael; Krüger, Julia; Elgner, Melanie; Junne, Florian; Siegert, Ingo

In: 2. Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : 5. März, 2024, Regensburg - Regensburg : OTH ; Baumann, Timo, S. 22-23 [Workshop: 2. ITG-Workshop Sprachassistenten – Anwendungen, Implikationen, Entwicklungen, Regensburg, 5. März 2024]

Publication link

Voice interaction in motion - eaasy vui and physical exertion

Busch, Matthias; Long, Nguyen; Siegert, Ingo

In: 2. Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : 5. März, 2024, Regensburg - Regensburg : OTH ; Baumann, Timo, S. 20-21 [Workshop: 2. ITG-Workshop Sprachassistenten – Anwendungen, Implikationen, Entwicklungen, Regensburg, 5. März 2024]

Publication link

Students’ readiness to adopt GenAI in their learning and ethical considerations – An international comparative study

Dettmer, Sandra; Eisenbardt, Monika; Eisenbardt, Tomasz; Gafni, Ruti; Gal, Eran; Kurtz, Gila; Leiba, Moshe; Mullins, Roisin; Siegert, Ingo

In: KM Conference 2024 - International Institute for Applied Knowledge Management, S. 28 [Konferenz: KM Conference 2024, Warsaw, Poland, 03.-06. July 2024]

Publication link

Predicting therapeutic alliance by automated AI-supported speech analysis - preliminary results on acoustic prosodic speech markers from the ASPIRE pilot project

Schenk, Michael; Elgner, Melanie; Schubert, Martha; Lassoued, Amina; Siegert, Ingo; Junne, Florian; Krüger, Julia

In: Psychotherapy and psychosomatics - Basel : Karger, Bd. 93 (2024), Heft Suppl. 1, S. 116, Artikel ST12-03

Publication link

Editor

2. Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : 5. März, 2024, Regensburg

Baumann, Timo; Siegert, Ingo

In: Regensburg: OTH, 2024, 1 Online Ressource (IV, 28 Seiten) Kongress: ITG-Workshop "Sprachassistenten : Anwendungen, Implikationen, Entwicklungen" 2 Regensburg 2024.04.11

Publication link

4th Symposium on Security and Privacy in Speech Communication - Kos, Greece, 6 September 2024

Siegert, Ingo; Williams, Jennifer; Das, Sneha; Tomashenko, Natalia

In: International Speech Communication Association, 2024, 1 Online-Ressource Kongress: ISCA Symposium on Security and Privacy in Speech Communication 4 Kos, Greece 2024.09.06

Publication link

LREC-Coling 2024 - proceedings of the Workshop on Legal and Ethical Issues in Human Language Technologies : Turin, Italy

Siegert, Ingo; Choukri, Khalid

In: ELRA Language Resource Associatio, 2024, 1 Online-Ressource, ISBN: 978-2-493814-21-0 Kongress: LREC-Coling 2024 Turin, Italy 2024.05.20

Publication link

2023

Peer-reviewed journal article

A digital ”flat affect”? - popular speech compression codecs and their effects on emotional prosody

Siegert, Ingo; Niebuhr, Oliver

In: Frontiers in communication - Lausanne : Frontiers Media, Bd. 8 (2023), Artikel 972182

Publication link

Emo-StarGAN - a semi-supervised any-to-many non-parallel emotion-preserving voice conversion

Ghosh, Suhita; Das, Arnab; Sinha, Yamini; Siegert, Ingo; Polzehl, Tim; Stober, Sebastian

In: Interspeech 2023 - International Speech and Communication Association ; Harte, Naomi, S. 2093-2097 [Konferenz: INTERSPEECH 2023, Dublin, Ireland, 20-24 August 2023]

Publication link

Book chapter

Radlogistik als Anwendungsgebiet für Digitale Sprachassistenten - ein Diskussionsbeitrag

Busch, Matthias; Kania, Malte; Assmann, Tom; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2023 / Konferenz Elektronische Sprachsignalverarbeitung , 2023 - Dresden : TUDpress ; Draxler, Christoph *1960-*, S. 223-230 - (Studientexte zur Sprachkommunikation; 105)

Cross-reliability benchmark test for preserving emotional content in speech-synthesis related datasets

Hintz, Jan; Wendemuth, Andreas; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2023 / Konferenz Elektronische Sprachsignalverarbeitung , 2023 - Dresden : TUDpress ; Draxler, Christoph *1960-*, S. 64-72 - (Studientexte zur Sprachkommunikation; 105)

Presenting a German dataset of wake words - first analyses and comparison of different solutions for speech-based activation techniques

Busch, Matthias; Sinha, Yamini; Hintz, Jan; Wendemuth, Andreas; Siegert, Ingo

In: DAGA 2023 - Berlin : Deutsche Gesellschaft für Akustik e.V., S. 1478-1481

Publication link

Impact of pathological speech on speaker anonymization - a proof of concept

Hintz, Jan; Sinha, Yamini; Bayerl, Sebastian P.; Riedhammer, Korbinian; Siegert, Ingo

In: DAGA 2023 - Berlin : Deutsche Gesellschaft für Akustik e.V., S. 1470-1473

Publication link

Improving voice conversion for dissimilar speakers using perceptual losses

Gosh, Suhita; Sinha, Yamini; Siegert, Ingo; Stober, Sebastian

In: DAGA 2023 , 2023 - Berlin : Deutsche Gesellschaft für Akustik e.V., S. 1358-1361 [Tagung: 49. Jahrestagung für Akustik, DAGA 2023, Hamburg, 06. - 09. März 2023]

Publication link

“What can I study at OVGU?” - an analysis of the applicability of conversational voice assistants in student advisory service

Busch, Matthias; Böhm, Felix; Siegert, Ingo

In: Design, Operation and Evaluation of Mobile Communications , 1st ed. 2023. - Cham : Springer Nature Switzerland ; Salvendy, Gavriel, S. 144-155 - (Lecture notes in computer science; volume 14052)

Publication link

User perspective on anonymity in voice assistants

Haase, Matthias; Krüger, Julia; Siegert, Ingo

In: Design, Operation and Evaluation of Mobile Communications , 1st ed. 2023. - Cham : Springer Nature Switzerland ; Salvendy, Gavriel *1938-*, S. 156-166 - (Lecture notes in computer science; volume 14052) [Konferenz: 4th International Conference on Design, Operation and Evaluation of Mobile Communications, MOBILE 2023, Copenhagen, Denmark, July 23-28, 2022]

Publication link

Voice assistants for therapeutic support - a literature review

Siegert, Ingo; Busch, Matthias; Metzner, Susanne; Krüger, Julia

In: Design, Operation and Evaluation of Mobile Communications , 1st ed. 2023. - Cham : Springer Nature Switzerland ; Salvendy, Gavriel, S. 221-239 - (Lecture notes in computer science; volume 14052)

Publication link

Anonymization of stuttered speech - removing speaker information while preserving the utterance

Hintz, Jan; Bayerl, Sebastian; Sinha, Yamini; Ghosh, Suhita; Schubert, Martha; Stober, Sebastian; Riedhammer, Korbinian; Siegert, Ingo

In: 3rd Symposium on Security and Privacy in Speech Communication - Internatinal Speech Communication Association ; Siegert, Ingo . - 2023, S. 41-45 [Symposium: 3rd Symposium on Security and Privacy in Speech Communication, Dublin, Ireland, 19 August 2023]

Publication link

AI Engineering als interdisziplinäres Einführungsmodul zwischen Künstlicher Intelligenz und Ingenieurwesen

Lang, Sebastian; Siegert, Ingo; Artiushenko, Viktor; Schleiss, Johannes

In: Informatik 2023 - Berlin : Gesellschaft für Informatik e.V. ; Klein, Maike, S. 381-384 - (GI-Edition. Proceedings; volume P-337) [Tagung: Informatik 2023, Berlin, 26. - 29. September 2023]

Publication link

Die Chatbot-Challenge - spielend mit KI von der Idee zum Dialogsystem

Siegert, Ingo; Hillmann, Stefan; Kowol, Philline T.; Busch, Matthias; Nehring, Jan; Klinge, Xenia

In: Informatik 2023 - Berlin : Gesellschaft für Informatik e.V. ; Klein, Maike, S. 377-380 - (GI-Edition. Proceedings; volume P-337)

Publication link

Abstract

Emotionswahrnehmung sprachkodierter Sätze bei Nutzern von Cochlea-Implantaten

Koyutürk, Ece; Siegert, Ingo; Verhey, Jesko L.; Böckmann-Barthel, Martin

In: 25. Jahrestagung der Deutschen Gesellschaft für Audiologie / Deutsche Gesellschaft für Audiologie , 2023 - German Medical Science, GMS, Artikel 163, insges. 2 S. [Tagung: 25. Jahrestagung der Deutschen Gesellschaft für Audiologie, Köln, 01.03. - 03.03.2023]

Publication link

Editor

3rd Symposium on Security and Privacy in Speech Communication - Dublin, Ireland, 19 August 2023

Siegert, Ingo; Williams, Jennifer L.; Das, Sneha

In: International Speech Communication Association, 2023, 1 Online Ressource Kongress: ISCA Symposium on Security and Privacy in Speech Communication 3 Dublin, Ireland 2023.08.19

Publication link

Other materials

Evaluating state-of-the-art speech recognition systems with focus on low resource languages

Sinha, Yamini; Silber-Varod, Yered; Siegert, Ingo

In: KM Conference 2023 - International Institute for Applied Knowledge Management , 2023, S. 41

Publication link

2022

Peer-reviewed journal article

Acoustic-based automatic addressee detection for technical systems - a review

Siegert, Ingo; Weißkirchen, Norman; Wendemuth, Andreas

In: Frontiers in computer science - Lausanne : Frontiers Media, Bd. 4 (2022), Artikel 831784, insges. 20 S.

Publication link

Handling of unknown unknowns - classification of 3D geometries from CAD open set datasets using Convolutional Neural Networks

Schmidt, Georg; Stüring, Stefan; Richnow, Norman; Siegert, Ingo

In: The Online Journal of Applied Knowledge Management - [S.l.]: [s.n.], Bd. 10 (2022), 1, S. 62-76

Publication link

Künstliche Intelligenz für die Sprachanalyse in der Psychotherapie - Chancen und Risiken - Artificial intelligence for speech analysis in psychotherapy - chances and risks

Krüger, Julia; Siegert, Ingo; Junne, Florian

In: Psychotherapie, Psychosomatik, medizinische Psychologie - Stuttgart [u.a.]: Thieme, Bd. 72 (2022), 9/10, S. 395-396

Publication link

Book chapter

Improving the accuracy for voice-assistant conversations in German by combining different online ASR-API outputs

Sinha, Yamini; Siegert, Ingo

In: Konferenz: Human Perspectives on Spoken Human-Machine Interaction, Freiburg im Breisgau (online), 15.-17. November 2021, Proceedings of the conference Human Perspectives on Spoken Human-Machine Interaction - Freiburg: FRIAS, Freiburg Institute for Advanced Studies, Albert-Ludwigs-Universität; Warchhold, Sarah *1994-* . - 2022, S. 11-16

Publication link

Erroneous reactions of voice assistants "In the Wild" - first analyses

Kisser, Lea; Siegert, Ingo

In: Konferenz: 33. Konferenz "Elektronische Sprachsignalverarbeitung", Sonderborg, 2.-4. März 2022, Elektronische Sprachsignalverarbeitung 2022 - Dresden: TUDpress; Weston, Heather . - 2022, S. 113-120 - (Studientexte zur Sprachkommunikation; 103)

"High on emotion"? - how audio codecs interfere with the perceived charisma and emotional states of men and women

Niebuhr, Oliver; Siegert, Ingo

In: Konferenz: 33. Konferenz "Elektronische Sprachsignalverarbeitung", Sonderborg, 2.-4. März 2022, Elektronische Sprachsignalverarbeitung 2022/ Konferenz Elektronische Sprachsignalverarbeitung - Dresden: TUDpress; Weston, Heather . - 2022, S. 243-252 - (Studientexte zur Sprachkommunikation; 103)

Emotion preservation for one-shot speaker anonymization using McAdams

Sinha, Yamini; Wendemuth, Andreas; Siegert, Ingo

In: Konferenz: 33. Konferenz "Elektronische Sprachsignalverarbeitung", Sonderborg, 2.-4. März 2022, Elektronische Sprachsignalverarbeitung 2022 - Dresden: TUDpress; Weston, Heather . - 2022, S. 235-242 - (Studientexte zur Sprachkommunikation; 103)

The effect of room acoustics and channel coding on affective computing in far field speech interaction

Siegert, Ingo; Niebuhr, Oliver; Gottschalk, Martin; Jokisch, Oliver

In: DAGA 2022 - Berlin : Deutsche Gesellschaft für Akustik e.V., S. 74-77

Performance and quality evaluation of a McAdams speaker anonymization for spontaneous German speech

Sinha, Yamini; Siegert, Ingo

In: Fortschritte der Akustik - DAGA 2022 - Berlin: Deutsche Gesellschaft für Akustik e.V. (DEGA) . - 2022, S. 1185-1188

The influence of different room acoustics and microphone distances on charismatic prosodic parameters

Siegert, Ingo; Niebuhr, Oliver

In: Fortschritte der Akustik - DAGA 2022 - Berlin: Deutsche Gesellschaft für Akustik e.V. (DEGA) . - 2022, S. 1193-1196

Music-guided imagination and digital voice assistant - study design and first results on the application of voice assistants for music-guided stress reduction

Siegert, Ingo; Busch, Matthias; Metzner, Susanne; Junne, Florian; Krüger, Julia

In: Konferenz: 24th International Conference on Human-Computer Interaction, HCII 2022, Virtual Event, June 26 July 1, 2022, Design, Operation and Evaluation of Mobile Communications - Cham: Springer International Publishing; Salvendy, Gavriel . - 2022, S. 347-362 - (Lecture notes in computer science; volume 13337)

Publication link

Why Eli Roth should not use TTS-Systems for anonymization

Sinha, Yamini; Hintz, Jan; Busch, Matthias; Polzehl, Tim; Haase, Matthias; Wendemuth, Andreas; Siegert, Ingo

In: 2nd Symposium on Security and Privacy in Speech Communication - Incheon, Korea, 23-24 September 2022 - Internatinal Speech Communication Association ; Siegert, Ingo, S. 17-22

Publication link

Voice Privacy - leveraging multi-scale blocks with ECAPA-TDNN SE-Res2NeXt extension for speaker anonymization

Khamsehashari, Razieh; Sinha, Yamini; Hintz, Jan; Ghosh, Suhita; Polzehl, Tim; Franzreb, Carlos; Stober, Sebastian; Siegert, Ingo

In: 2nd Symposium on Security and Privacy in Speech Communication - Incheon, Korea, 23-24 September 2022 - Internatinal Speech Communication Association ; Siegert, Ingo, S. 43-48 [Symposium: 2nd Symposium on Security and Privacy in Speech Communication, Incheon, Korea, 23-24 September 2022]

Publication link

DyCoDa - a multi-modal data collection of multi-user remote survival game recordings

Dresvyanskiy, Denis; Sinha, Yamini; Busch, Matthias; Siegert, Ingo; Karpov, Alexey; Minker, Wolfgang

In: Konferenz: 24th International Conference on Speech and Computer, SPECOM 2022, Gurugram, India, November 14-16, 2022, Speech and Computer - Cham: Springer International Publishing; Prasanna, S. R. Mahadeva . - 2022, S. 163-177 - (Lecture notes in computer science; volume 13721)

Publication link

Article in conference proceedings

Public interactions with voice assistant - discussion of different one-shot solutions to preserve speaker privacy

Siegert, Ingo; Sinha, Yamini; Winkelmann, Gino; Jokisch, Oliver; Wendemuth, Andreas

In: Proceedings of the LREC 2022 Joint Workshop on Legal and Ethical Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Language Resources (LEGAL - MDLR 2022) - Paris : European Language Resources Association (ELRA) ; Rigault, Mickaël, S. 44-47

Publication link

Pseudonymisation of speech data as an alternative approach to GDPR aompliance

Kamocki, Pawel; Siegert, Ingo

In: Proceedings of the LREC 2022 Joint Workshop on Legal and Ethical Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Language Resources (LEGAL - MDLR 2022) - Paris: European Language Resources Association (ELRA); Rigault, Mickaël . - 2022, S. 17-21

Publication link

Abstract

A preliminary study on voice-assisted interfaces in the German public administration

Jokisch, Oliver; Brauner, Kurt; Siegert, Ingo

In: Konferenz: KM Conference 2022, Ljubljana, Slovenia, 29 June - 2 July 2022, KM Conference 2022 - International Institute for Applied Knowledge Management, S. 42

Publication link

Der sprachliche Emotionsausdruck von Patient*innen mit Anorexia nervosa - eine systematische Literaturrecherche

Korbanka, Tatjana A.; Siegert, Ingo; Junne, Florian; Krüger, Julia

In: Zeitschrift für psychosomatische Medizin und Psychotherapie - Göttingen: Vandenhoeck & Ruprecht, 1999, Bd. 68 (2022), 2, S. 180-181

Publication link

Editor

Proceedings of the LREC 2022 Joint Workshop on Legal and Ethical Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Language Resources (LEGAL - MDLR 2022)

Rigault, Mickael; Arranz, Victoria; Siegert, Ingo

In: Paris: European Language Resources Association (ELRA), 2022, 1 Online-RessourceKongress: Joint Workshop on Legal and Ethical Issues in Human Language Technologies and Multilingual De-Identification of Sensitive Language Resources (Marseille : 2022.06.20)

Publication link

2nd Symposium on Security and Privacy in Speech Communication - Incheon, Korea, 23-24 September 2022

Siegert, Ingo; Tomashenko, Natalia; Williams, Jennifer

In: Internatinal Speech Communication Association, 2022, 1 Online-RessourceKongress: ISCA Symposium on Security and Privacy in Speech Communication 2 (virtual : 2022.09.23-24)

Publication link

2021

Peer-reviewed journal article

Case report: women, be aware that your vocal charisma can dwindle in remote meetings

Siegert, Ingo; Niebuhr, Oliver

In: Frontiers in communication - Lausanne : Frontiers Media - Volume 5(2021), article 611555, 7 Seiten

Publication link

A cross-language study of speech recognition systems for English, German, and Hebrew

Silber Varod, Vered; Siegert, Ingo; Jokisch, Oliver; Sinha, Yamini; Geri, Nitza

In: The Online Journal of Applied Knowledge Management - [Erscheinungsort nicht ermittelbar] : [Verlag nicht ermittelbar], Bd. 9 (2021), Heft 1, insges. 15 S.

Publication link

Admitting the addressee detection faultiness of voice assistants to improve the activation performance using a continuous learning framework

Siegert, Ingo; Weißkirchen, Norman; Krüger, Julia; Akhtiamov, Oleg; Wendemuth, Andreas

In: Cognitive systems research - Amsterdam [u.a.] : Elsevier Science, Bd. 70 (2021), S. 65-79

Publication link

Book chapter

Speech melody and speech content didnt fit together - differences in speech behavior for device directed and human directed interactions

Siegert, Ingo; Krüger, Julia

In: Advances in Data Science: Methodologies and Applications - Cham: Springer International Publishing; Phillips-Wren, Gloria . - 2021, S. 65-95 - (Intelligent Systems Reference Library; volume 189)

Publication link

Speech Signal Compression Deteriorates Acoustic Cues to Perceived Speaker Charisma

Siegert, Ingo; Niebuhr, Oliver

In: Elektronische Sprachsignalverarbeitung 2021 / Konferenz Elektronische Sprachsignalverarbeitung , 2021 - Dresden : TUDpress, S. 1-10 [32. Konferenz Elektrische Sprachsignalverarbeitung 2021, Berlin, 3. - 5. März 2021]

Audio and Video Processing of UAV-Based Signals in the Harmonic Project

Jokisch, Oliver; Strutz, Tilo; Leipnitz, Alexander; Siegert, Ingo; Ronzhin, Abdrey

In: Elektronische Sprachsignalverarbeitung 2021 / Konferenz Elektronische Sprachsignalverarbeitung , 2021 - Dresden : TUDpress, S. 77-86 [32. Konferenz Elektrische Sprachsignalverarbeitung 2021, Berlin, 3. - 5. März 2021]

Studie zur Lösbarkeit des Problems starker Pegelschwankungen im Home-Entertainment

Schmidt, Georg; Siegert, Ingo

In: Elektronische Sprachsignalverarbeitung 2021 / Konferenz Elektronische Sprachsignalverarbeitung , 2021 - Dresden : TUDpress, S. 303-310 [32. Konferenz Elektrische Sprachsignalverarbeitung 2021, Berlin, 3. - 5. März 2021]

Effects of prosodic variations on accidental triggers of a commercial voice assistant

Siegert, Ingo

In: Interspeech 2021: Brno, Czechia, 30 August - 3 September 2021$dGeneral chairs: Hynek Heřmanský, Honza Černocký : Technical chairs: Lukáš Burget, Lori Lamel, Odette Scharenborg, Petr Motlicek - International Speech and Communication Association; Heřmanský, Hynek . - 2021, S. 1674-1678

Publication link

Engagement recognition using audio channel only

Dresvyanskiy, Denis; Siegert, Ingo; Karpov, Alexei; Minker, Wolfgang

In: 1st AI-DEbate Workshop: workshop establishing An InterDisciplinary pErspective on speech-BAsed TEchnology : Magdeburg, September, 27 2021/ AI-Debate Workshop - Magdeburg: Universitätsbibliothek; Carolus, Astrid *1982-* . - 2021, S. 19-22

Publication link

Introduction to the workshop

Carolus, Astrid; Wienrich, Carolin; Siegert, Ingo

Publication link

Article in conference proceedings

How to collect speech data with human rights in mind - workshop at the SPSC

Backstrom, Tom; Nautsch, Andreas; Markert, Karla; Siegert, Ingo

In: Proceedings 2021 ISCA Symposium on Security and Privacy in Speech Communication - Internatinal Speech Communication Association; Siegert, Ingo . - 2021, S. 80-82

Publication link

Speaker anonymization solution for public voice-assistant interactions - presentation of a work in progress development

Siegert, Ingo

In: Proceedings 2021 ISCA Symposium on Security and Privacy in Speech Communication - Internatinal Speech Communication Association . - 2021, S. 80-82

Publication link

Experience with an online assessment in a lecture about fundamentals of electrical engineering

Magdowski, Mathias; Siegert, Ingo

In: Higher Education 2021 - Bari: Higher Education . - 2021, insges. 5 S.

Publication link

Editor

1st AI-DEbate Workshop - workshop establishing An InterDisciplinary pErspective on speech-BAsed TEchnology : Magdeburg, September, 27 2021

Carolus, Astrid; Wienrich, Carolin; Siegert, Ingo

In: Magdeburg: Universitätsbibliothek, 2021, 1 Online-Ressource (42 Seiten, 1,03 MB)Kongress: AI-Debate Workshop 1 (Magdeburg : 2021.09.27)

Publication link

Proceedings 2021 ISCA Symposium on Security and Privacy in Speech Communication

Siegert, Ingo; Markert, Karla

In: Internatinal Speech Communication Association, 2021, 1 Online-Ressource (88 Seiten)Kongress: ISCA Symposium on Security and Privacy in Speech Communication 1 (virtual : 2021.11.10-12)

Publication link

2020

Peer-reviewed journal article

Using complexity-identical human- and machine-directed utterances to investigate addressee detection for spoken dialogue systems

Akhtiamov, Oleg; Siegert, Ingo; Karpov, Alexey; Minker, Wolfgang

In: Sensors - Basel: MDPI, Volume 20(2020), issue 9, article 2740, 15 Seiten

Publication link

Personal data protection and academia: GDPR issues and multi-modal data-collections "in the wild"

Siegert, Ingo; Silber-Varod, Vered; Carmi, Nehoray; Kamocki, Pawel

In: The Online Journal of Applied Knowledge Management: OJAKM - [S.l.], Bd. 8 (2020), 1, S. 16-31

Publication link

Book chapter

Reduction of aircraft noise in UAV-based speech signal recordings by quantile based noise estimation

Lösch, Enrico; Jokisch, Oliver; Leipnitz, Alexander; Siegert, Ingo

Publication link

Emergency Service - Sprachbasierte Klassifikation eingehender Anrufe in Ausnahmesituationen

Petersen, Marcus; Niedrist, Karl-Heinz; Busch, Matthias; Marquardt, Florian; Siegert, Ingo

In: Konferenz: 31. Konferenz "Elektronische Sprachsignalverarbeitung", Magdeburg, 4.-6. März 2020, Elektronische Sprachsignalverarbeitung 2020 - Tagungsband der 31. Konferenz Magdeburg : Magdeburg, 4.-6. März 2020/ Konferenz "Elektronische Sprachsignalverarbeitung" - Dresden: TUDpress; Wendemuth, Andreas . - 2020, S. 206-213 - (Studientexte zur Sprachkommunikation; 95)

Publication link

Does users' system evaluation influence speech behavior in HCI? - first insights from the engineering and psychological perspective

Siegert, Ingo; Busch, Matthias; Krüger, Julia

In: Konferenz: 31. Konferenz "Elektronische Sprachsignalverarbeitung", Magdeburg, 4.-6. März 2020, Elektronische Sprachsignalverarbeitung 2020 - Tagungsband der 31. Konferenz Magdeburg : Magdeburg, 4.-6. März 2020/ Konferenz "Elektronische Sprachsignalverarbeitung" - Dresden: TUDpress . - 2020, S. 241-248 - (Studientexte zur Sprachkommunikation; 95)

Publication link

Filtering-based analysis of spectral and temporal effects of room modes on low-level descriptors of emotionally coloured speech

Gottschalk, Martin; Höbel-Müller, Juliane; Siegert, Ingo; Verhey, Jesko L.; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2020 - Tagungsband der 31. Konferenz Magdeburg : Magdeburg, 4.-6. März 2020 / Konferenz "Elektronische Sprachsignalverarbeitung" , 2020 - Dresden : TUDpress ; Wendemuth, Andreas, S. 219-226 - (Studientexte zur Sprachkommunikation; 95) [Konferenz: 31. Konferenz "Elektronische Sprachsignalverarbeitung", Magdeburg, 4.-6. März 2020]

Publication link

Investigation of the influence of standing waves on distant speech emotion recognition

Höbel-Müller, Juliane; Siegert, Ingo; Gottschalk, Martin; Heinemann, Ralph; Wendemuth, Andreas

In: Fortschritte der Akustik - DAGA 2020 - Berlin : Deutsche Gesellschaft für Akustik e.V. (DEGA), S. 822-825 [Konferenz: DAGA 2020, Hannover, 16.-19. März 2020]

Speech communication at the presence of unmanned aerial vehicles

Jokisch, Oliver; Lösch, Enrico; Siegert, Ingo

In: Fortschritte der Akustik - DAGA 2020 - Berlin : Deutsche Gesellschaft für Akustik e.V. (DEGA), S. 952-955

Alexa in the wild - collecting unconstrained conversations with a modern voice assistant in a public environment

Siegert, Ingo

In: LREC 2020 / International Conference on Language Resources and Evaluation , 2020 - Paris : European Language Resources Association, ELRA ; Calzolari, Nicoletta, S. 608-612

Publication link

Prosodic addressee-detection - ensuring privacy in always-on spoken dialog systems

Baumann, Timo; Siegert, Ingo

In: Mensch und Computer 2020 - Tagungsband - New York, New York: The Association for Computing Machinery, Inc. . - 2020, S. 195-198

Publication link

GDPR - a game changer for acoustic interaction analyses

Siegert, Ingo; Silber-Varod, Vered; Kamocki, Pawel

In: Proceedings of the LREC 2020 Workshop on Legal and Ethical Issues in Human Language Technologies (LEGAL2020) - proceedings: proceedings - Paris: European Language Resources Association (ELRA); Choukri, Khalid . - 2020, S. 1-3

Publication link

Utilizing computer vision algorithms to detect and describe local features in images for emotion recognition from speech

Weißkirchen, Norman; Reddy, Mainampati Vasudeva; Wendemuth, Andreas; Siegert, Ingo

In: Proceedings of the 2020 IEEE International Conference on Human-Machine Systems (ICHMS): Sept 7-9, 2020, Rome, Italy/ IEEE International Conference on Human-Machine Systems - [Piscataway, NJ]: IEEE; Weibkirchen, Norman . - 2020, insges. 6 S.

Publication link

Recognition performance of selected speech recognition APIs - a longitudinal study

Siegert, Ingo; Sinha, Yamini; Jokisch, Oliver; Wendemuth, Andreas

In: Speech and Computer - Cham : Springer ; Karpov, Alexey . - 2020, S. 520-529 - ( Lecture notes in computer science; 12335)

Publication link

Advances in sound and speech signal processing at the presence of drones

Jokisch, Oliver; Siegert, Ingo

In: Quiet Drones , 2020 - INCE Europe ; Quiet Drones (Veranstaltung:2020), insges. 17 S.

An analysis of the applicability of VoiceXML as basis for a dialog control flow in industrial interaction management

Böhm, Felix; Siegert, Ingo; Belyaev, Alexander; Diedrich, Christian

In: 2020 25th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA): proceedings : Vienna, Austria - hybrid, 08-11 September, 2020/ IEEE International Conference on Emerging Technologies and Factory Automation - Piscataway, NJ: IEEE; IEEE International Conference on Emerging Technologies and Factory Automation (25.:2020) . - 2020, S. 30-37

Publication link

Improving automatic speech recognition utilizing audio-codecs for data augmentation

Hailu, Nirayo; Siegert, Ingo; Nürnberger, Andreas

In: IEEE 22nd International Workshop on Multimedia Signal Processing / IEEE International Workshop on Multimedia Signal Processing , 2020 - [Piscataway, NJ] : IEEE [Workshop: IEEE 22nd International Workshop on Multimedia Signal Processing, MMSP, Tampere, Finland, 21-24 Sept. 2020]

Publication link

Abstract

das ist schon gruselig so dieses Belauschtwerden - subjektives Erleben von Interaktionen mit Sprachassistenzsystemen zum Zwecke der Individualisierung

Krüger, Julia; Siegert, Ingo

In: Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook]/ Workshop Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook] - Anwendungen, Implikationen, Entwicklungen - Magdeburg: Otto-von-Guericke-Universität Magdeburg, 2020; Siegert, Ingo . - 2020, S. 29

Publication link

Intelligent LSF-answering system - an Alexa Skill

Kuzhipathalil, Adarsh; Thomas, Anto; Chand, Keerthana; Siegert, Ingo

In: Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook]: Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook]/ Workshop Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook] - Anwendungen, Implikationen, Entwicklungen - Magdeburg: Otto-von-Guericke-Universität Magdeburg, 2020; Siegert, Ingo . - 2020, S. 39

Publication link

Editor

Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020 : [Abstractbook]

Siegert, Ingo; Möller, Sebastian

In: Magdeburg: Otto-von-Guericke-Universität Magdeburg, 2020, 1 Online-Ressource (39 Seiten, 0,3 MB)Kongress: ITG-Workshop "Sprachassistenten : Anwendungen, Implikationen, Entwicklungen" (Magdeburg : 2020.03.03)

Publication link

Elektronische Sprachsignalverarbeitung 2020 - Tagungsband der 31. Konferenz Magdeburg : Magdeburg, 4.-6. März 2020

Wendemuth, Andreas; Böck, Ronald; Siegert, Ingo

In: Dresden: TUDpress, 2020, XI, 288 Seiten, Illustrationen, Diagramme, 24 cm x 17 cm - (Studientexte zur Sprachkommunikation; Band 95)

Publication link

Proceedings of the LREC 2020 Workshop on Legal and Ethical Issues in Human Language Technologies (LEGAL2020) - proceedings

Choukri, Khalid; Linden, Kirster; Rigault, Mickael; Siegert, Ingo

In: Paris: European Language Resources Association (ELRA), 2020, 1 Elektronische RessourceKongress: Workshop on Legal and Ethical Issues in Human Language Technologies 12 (Marseille : 2020.05.11)

Publication link

2019

Book chapter

The Restaurant Booking Corpus - content-identical comparative human-human and human-computer simulated telephone conversations

Siegert, Ingo; Nietzold, Jannik; Heinemann, Ralph; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2019 - Dresden: TUDpress, S. 126-133 - (Studientexte zur Sprachkommunikation; 93)[Konferenz: 30. Konferenz Elektronische Sprachsignalverarbeitung 2019, Dresden, 6.-8. März 2019]

Comparing phonetic changes in computer-directed and human-directed speech

Raveh, Eran; Steiner, Ingmar; Siegert, Ingo; Gessinger, Iona; Möbius, Bernd

In: Elektronische Sprachsignalverarbeitung 2019 - Dresden: TUDpress, S. 42-49 - (Studientexte zur Sprachkommunikation; 93)[Konferenz: 30. Konferenz Elektronische Sprachsignalverarbeitung 2019, Dresden, 6.-8. März 2019]

Analysis of the influence of different room acoustics on acoustic emotion features

Höbel-Müller, Juliane; Siegert, Ingo; Heinemann, Ralph; Requardt, Alicia Flores; Tornow, Michael; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2019: Tagungsband der 30. Konferenz, Dresden, 6.-8. März 2019 / Peter Birkholz und Simon Stone (Hrsg.): Tagungsband der 30. Konferenz, Dresden, 6.-8. März 2019/ Konferenz "Elektronische Sprachsignalverarbeitung" - Dresden: TUDpress, 2019 . - 2019, S. 156-163 - (Studientexte zur Sprachkommunikation; 93)[Konferenz: 30. Konferenz Elektronische Sprachsignalverarbeitung 2019: Tagungsband der 30. Konferenz, Dresden, 6.-8. März 2019 / Peter Birkholz und Simon Stone (Hrsg.), Dresden, 6.-8. März 2019]

Analysis of the influence of different room acoustics on acoustic emotion features and emotion recognition performance

Höbel-Müller, Juliane; Siegert, Ingo; Heinemann, Ralph; Requardt, Alicia Flores; Tornow, Michael; Wendemuth, Andreas

In: Tagungsband - DAGA 2019 - Berlin: Deutsche Gesellschaft für Akustik e.V. (DEGA), 2019 . - 2019, S. 886-889[Tagung: 45. Jahrestagung für Akustik, DAGA 2019, 18.-21. März 2019, Rostock]

Publication link

Anticipating the user - acoustic disposition recognition in intelligent interactions

Böck, Ronald; Egorow, Olga; Höbel-Müller, Juliane; Requardt, Alicia Flores; Siegert, Ingo; Wendemuth, Andreas

In: Innovations in big data mining and embedded knowledge - Cham, Switzerland: Springer, 2019; Esposito, Anna . - 2019, S. 203-233 - (Intelligent systems reference library; volume 159)

Publication link

Don’t talk to noisy drones - acoustic interaction with unmanned aerial vehicles

Jokisch, Oliver; Siegert, Ingo; Maruschke, Michael; Strutz, Tilo; Ronzhin, Andrey

In: Speech and Computer / SPECOM , 2019 - Cham : Springer, S. 180-190 - (Lecture notes in artiﬁcial intelligence; 11658)

Publication link

Cross-corpus data augmentation for acoustic addressee detection

Akhtiamov, Oleg; Siegert, Ingo; Karpov, Alexey; Minker, Wolfgang

In: 20th Annual Meeting of the Special Interest Group on Discourse and Dialogue/ Association for Computational Linguistics - Stroudsburg, PA: Association for Computational Linguistics (ACL); Nakamura, Satoshi . - 2019, S. 274-283

Publication link

Threes a crowd? - effects of a second human on vocal accommodation with a voice assistant

Raveh, Eran; Siegert, Ingo; Steiner, Ingmar; Gessinger, Iona; Möbius, Bernd

In: Interspeech 2019 - International Speech and Communication Association; Kubin, Gernot . - 2019, S. 4005-4009

Publication link

Abstract

Admitting the addressee-detection faultiness to improve the performance using a continous learning framework

Siegert, Ingo; Weißkirchen, Norman; Wendemuth, Andreas

In: 8. Interdisziplinärer Workshop Kognitive Systeme: Verstehen, Beschreiben und Gestalten Kognitiver (Technischer) Systeme - Duisburg: Universität Duisburg-Essen, S. 38-39, 2019

Publication link

2018

Peer-reviewed journal article

Using a PCA-based dataset similarity measure to improve cross-corpus emotion recogniton

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Computer speech and language - London: Academic Press, 1986 . - 2018, insges. 31 S.

Publication link

Using category theory to structure the OCC theory of emotions

Trujillo, Michael Olmos; Adamatti, Diana F.; Siegert, Ingo

In: Congreso Argentino de Ciencias de la Informática y Desarrollos de Investigación (CACIDI): 28 de noviembre al 30 de noviembre de 2018 - Piscataway, NJ: IEEE[Kongress: 2018 Congreso Argentino de Ciencias de la Informática y Desarrollos de Investigación, CACIDI, Ciudad Autónoma de Buenos Aires, Argentina, 28-30 November 2018]

Publication link

How do we speak with ALEXA - subjective and objective assessments of changes in speaking style between HC and HH conversations

Siegert, Ingo; Krüger, Julia

In: Kognitive Systeme - Duisburg: DuEPublico, Duisburg-Essen Publication Online, Universität Duisburg-Essen, 2013 . - 2018, 1, insges. 11 S.

Publication link

An experimental paradigm for inducing emotions in a real world driving scenario evidence from self-report, annotation of speech data and peripheral physiology

Requardt, Alicia Flores; Wilbrink, Marc; Siegert, Ingo; Jipp, Meike; Wendemuth, Andreas; Ihme, Klas

In: Kognitive Systeme - Duisburg : DuEPublico, Duisburg-Essen Publication Online, Universität Duisburg-Essen . - 2018, Heft 1, insges. 12 S.

Publication link

Book chapter

Emotion recognition from disturbed speech - towards affective computing in real-world in-car environments

Lotz, Alicia Flores; Faller, Fabian; Siegert, Ingo; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2018: Tagungsband der 29. Konferenz, Ulm, 7.-9. März 2018/ Konferenz "Elektronische Sprachsignalverarbeitung"- Dresden: TUDpress, 2018, S. 208-215[Konferenz: 29. Elektronische Sprachsignalverarbeitung 2018, Ulm, 7. - 10. März; Literaturverzeichnis: Seite 214-215]

Acoustic addressee-detection - analysing the impact of age, sex and technical knowledge

Siegert, Ingo; Tang, Shuran; Lotz, Alicia Flores

In: Elektronische Sprachsignalverarbeitung 2018: Tagungsband der 29. Konferenz, Ulm, 7.-9. März 2018 / André Berton, Udo Haiber, Wolfgang Minker (Hrsg.) : Tagungsband der 29. Konferenz, Ulm, 7.-9. März 2018 / Konferenz "Elektronische Sprachsignalverarbeitung" , 2018 - Dresden : TUDpress , 2018, S. 113-120, Illustrations [Literaturverzeichnis: Seite 118-120; Konferenz: 29. Elektronische Sprachsignalverarbeitung 2018, Ulm, 7. - 10. März]

Voice Assistant Conversation Corpus (VACC) - a multi-scenario dataset for addressee detection in human-computer-interaction using Amazon's ALEXA

Siegert, Ingo; Krüger, Julia; Egorow, Olga; Nietzold, Jannik; Heinemann, Ralph; Lotz, Alicia Flores

In: Proceedings of the LREC 2018 Workshop LB-ILR2018 and MMC2018 Joint Workshop, 7 May 2018, Miyazaki, Japan - Paris: European Language Resources Association, ELRA, 2018; Koiso, Hanae . - 2018, S. 51-54[Workshop: LREC 2018 Workshop LB-ILR2018 and MMC2018 Joint Workshop, Miyazaki, Japan, 7 May 2018]

Publication link

Improving emotion recognition performance by random-forest-based feature selection

Egorow, Olga; Siegert, Ingo; Wendemuth, Andreas

In: Speech and computer: 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018 : proceedings/ SPECOM - Cham: Springer, 2018 . - 2018, S. 134-144 - (Lecture notes in computer science; 11096; Lecture notes in artificial intelligence)[Konferenz: 20th International Conference Speech and Computer, SPECOM 2018, Leipzig, Germany, September 18-22, 2018]

Publication link

Utilizing psychoacoustic modeling to improve speech-based emotion recognition

Siegert, Ingo; Lotz, Alicia Flores; Egorow, Olga; Wolff, Susann

In: Speech and computer: 20th International Conference, SPECOM 2018, Leipzig, Germany, September 18-22, 2018 : proceedings/ SPECOM - Cham: Springer, 2018 . - 2018, S. 625-635 - (Lecture notes in computer science; 11096; Lecture notes in artificial intelligence)[Konferenz: 20th International Conference Speech and Computer, SPECOM 2018, Leipzig, Germany, September 18-22, 2018]

2017

Peer-reviewed journal article

Prediction of user satisfaction in naturalistic human-computer interaction

Egorow, Olga; Siegert, Ingo; Wendemuth, Andreas

In: Kognitive Systeme - Duisburg: DuEPublico, Duisburg-Essen Publication Online, Universität Duisburg-Essen, 2013 . - 2017, 1, insges. 9 S.

Publication link

Book chapter

ikannotate2 - a tool supporting annotation of emotions in audio-visual data

Siegert, Ingo; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2017: Tagungsband der 28. Konferenz Saarbrücken, 15. - 17. März 2017 / Jürgen Trouvain ; Ingmar Steiner und Bern Möbius (Hrsg.): Tagungsband der 28. Konferenz Saarbrücken, 15. - 17. März 2017 - Dresden: TUDpress Verlag der Wissenschaften GmbH, 2017 . - 2017, S. 17-24[Kongress: 28. Konferenz Elektronische Sprachsignalverarbeitung, Saarbrücken, 15. - 17. März, 2017]

Audio compression and its impact on emotion recognition in affective computing

Lotz, Alicia Flores; Siegert, Ingo; Maruschke, Michael; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2017: Tagungsband der 28. Konferenz Saarbrücken, 15. - 17. März 2017 / Jürgen Trouvain ; Ingmar Steiner und Bern Möbius (Hrsg.): Tagungsband der 28. Konferenz Saarbrücken, 15. - 17. März 2017 - Dresden: TUDpress Verlag der Wissenschaften GmbH, 2017 . - 2017, S. 1-8[Kongress: 28. Konferenz Elektronische Sprachsignalverarbeitung, Saarbrücken, 15. - 17. März, 2017]

Improving speech-based emotion recognition by using psychoacoustic modeling and analysis-by-synthesis

Siegert, Ingo; Lotz, Alicia Flores; Egorow, Olga; Wendemuth, Andreas

In: Speech and Computer: 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings - Cham: Springer, 2017; Potapova, Rodmonga . - 2017, S. 445-455 - (Lecture Notes in Computer Science; 10458)[Konferenz: 19th International Conference Speech and Computer, SPECOM 2017, Hatfield, UK, September 12-16, 2017]

Publication link

Acoustic cues for the perceptual assessment of surround sound

Siegert, Ingo; Jokisch, Oliver; Lotz, Alicia Flores; Trojahn, Franziska; Meszaros, Martin; Maruschke, Michael

In: Speech and Computer - Cham : Springer ; Potapova, Rodmonga . - 2017, S. 65-75 - (Lecture Notes in Computer Science; 10458)

Publication link

Multimodal affect recognition in the context of human-computer interaction for companion-systems

Schwenker, Friedhelm; Böck, Ronald; Schels, Martin; Meudt, Sascha; Siegert, Ingo; Glodek, Michael; Kächele, Markus; Schmidt-Wack, Miriam; Thiam, Patrick; Wendemuth, Andreas; Krell, Gerald

In: Companion technology - Cham : Springer ; Biundo-Stephan, Susanne *1955-* . - 2017, S. 387-408

Publication link

Emotion recognition from speech

Wendemuth, Andreas; Vlasenko, Bogdan; Siegert, Ingo; Böck, Ronald; Schwenker, Friedhelm; Palm, Günther

In: Companion technology - Cham : Springer ; Biundo-Stephan, Susanne *1955-* . - 2017, S. 409-428

Publication link

Modeling aspects in human-computer interaction - adaptivity, user characteristics and evaluation

Gossen, Tatiana; Siegert, Ingo; Nürnberger, Andreas; Hartmann, Kim; Kotzyba, Michael; Wendemuth, Andreas

In: Companion technology - Cham : Springer ; Biundo-Stephan, Susanne *1955-* . - 2017, S. 57-58

Publication link

Multi-modal information processing in companion-systems - a ticket purchase system

Siegert, Ingo; Schüssel, Felix; Schmidt, Miriam; Reuter, Stephan; Meudt, Sascha; Layher, Georg; Krell, Gerald; Hörnle, Thilo; Handrich, Sebastian; Al-Hamadi, Ayoub; Dietmayer, Klaus; Neumann, Heiko; Palm, Günther; Schwenker, Friedhelm; Wendemuth, Andreas

In: Companion technology - Cham : Springer ; Biundo-Stephan, Susanne *1955-* . - 2017, S. 493-500

Publication link

The last minute corpus as a research resource - from signal processing to behavioral analyses in user-companion interactions

Rösner, Dietmar; Frommer, Jörg; Wendemuth, Andreas; Bauer, Thomas; Günther, Stephan; Haase, Matthias; Siegert, Ingo

In: Companion technology - Cham : Springer ; Biundo-Stephan, Susanne *1955-* . - 2017, S. 277-299

Publication link

Comparative study on normalisation in emotion recognition from speech

Böck, Ronald; Egorow, Olga; Siegert, Ingo; Wendemuth, Andreas

In: Intelligent human computer interaction / IHCI , 2017 - Cham : Springer ; Horain, Patrick, S. 189-201 - (Lecture Notes in Computer Science; 10688)

Publication link

Accelerating manual annotation of filled pauses by automatic pre-selection

Egorow, Olga; Lotz, Alicia Flores; Siegert, Ingo; Böck, Ronald; Krüger, Julia; Wendemuth, Andreas

In: 2017 International Conference on Companion Technology (ICCT): 11-13 Sept. 2017/ International Conference on Companion Technology - [Piscataway, NJ]: IEEE, 2017; International Conference on Companion Technology (2.:2017) . - 2017, insges. 6 S.[Konferenz: 2017 International Conference on Companion Technology (ICCT), Ulm, Germany, 11.-13. September 2017]

Publication link

2016

Peer-reviewed journal article

Emotional and user-specific acoustic cues for improved analysis of naturalistic interactions

Siegert, Ingo

In: Künstliche Intelligenz: KI : Forschung, Entwicklung, Erfahrungen : Organ des Fachbereichs 1 Künstliche Intelligenz der Gesellschaft für Informatik e.V., GI - Berlin: Springer, Bd. 30.2016, 1, S. 93-94

Publication link

Comparison of different modeling techniques for robust prototype matching of speech pitch-contours

Lotz, Alicia Flores; Siegert, Ingo; Wendemuth, Andreas

In: Kognitive Systeme - Duisburg: DuEPublico, 1, insges. 10 S., 2016

Publication link

Book chapter

Measuring the impact of audio compression on the spectral quality of speech data

Siegert, Ingo; Lotz, Alicia Flores; Doung, Linh Linda; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2016 / Konferenz "Elektronische Sprachsignalverarbeitung" , 2016 - Dresden : TUDpress, S. 229-236 - (Studientexte zur Sprachkommunikation; Band 81)

Multimodal information processing - the ticket purchase : a demonstration scenario of the SFB/TRR-62

Siegert, Ingo; Reuter, Stephan; Schüssel, Felix; Layer, Georg; Hörnle, Thilo; Meudt, Sascha; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2016: Tagungsband der 27. Konferenz, Leipzig, 2.-4. März 2016 / Oliver Jokisch (Hrsg.) ; Tagungsorganisation: Hochschule für Telekommunikation Leipzig, Institut für Kommunikationstechnik, Prof. Dr.-Ing. Oliver Jokisch: Tagungsband der 27. Konferenz, Leipzig, 2.-4. März 2016/ Konferenz "Elektronische Sprachsignalverarbeitung" - Dresden: TUDpress, 2016; Jokisch, Oliver . - 2016, S. 111-118 - (Studientexte zur Sprachkommunikation; Band 81)[Kongress: 27. Konferenz Elektronische Sprachsignalverarbeitung 2016: Tagungsband der 27. Konferenz, Leipzig, 2.-4. März 2016 / Oliver Jokisch (Hrsg.) ; Tagungsorganisation: Hochschule für Telekommunikation Leipzig, Institut für Kommunikationstechnik, Prof. Dr.-Ing. Oliver Jokisch, Leipzig, 2. - 4. März 2016]

Classification of functional-meanings of non-isolated discourse particles in human-human-interaction

Lotz, Alicia Flores; Siegert, Ingo; Wendemuth, Andreas

In: Human-Computer Interaction. Theory, Design, Development and Practice - Cham : Springer International Publishing ; Kurosu, Masaaki . - 2016, S. 53-64 - (Lecture Notes in Computer Science; 9731)

Publication link

Discourse particles in human-human and human-computer interaction - Analysis and evaluation

Siegert, Ingo; Krüger, Julia; Haase, Matthias; Lotz, Alicia Flores; Günther, Stephan; Frommer, Jörg; Rösner, Dietmar; Wendemuth, Andreas

In: Human-Computer Interaction. Theory, Design, Development and Practice - Cham : Springer International Publishing ; Kurosu, Masaaki . - 2016, S. 105-117

Publication link

Emotion intelligibility within codec-compressed and reduced bandwidth speech

Siegert, Ingo; Lotz, Alicia Flores; Maruschke, Michael; Jokisch, Oliver; Wendemuth, Andreas

In: Speech communication / ITG-Fachtagung Sprachkommunikation , 2016 - Berlin : VDE Verlag, S. 215-219

ERM4CT 2016: 2nd international workshop on emotion representations and modelling for companion systems (workshop summary)

Hartmann, Kim; Siegert, Ingo; Salah, Ali Albert; Truong, Khiet P.

In: Proceedings of the 18th ACM International Conference on Multimodal Interaction: November 12 - 16, 2016, Tokyo, Japan - New York, NY: ACM, S. 593-595[Kongress: 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan, 12. - 16. November , 2016]

Publication link

Kennzeichnung von Nutzerprofilen zur Interaktionssteuerung beim Gehen

Thiers, Angelina; Hamacher, Dennis; Tornow, Michael; Heinemann, Ralph; Siegert, Ingo; Wendemuth, Andreas; Schega, Lutz

In: Technische Unterstützungssysteme, die die Menschen wirklich wollen / Transdisziplinäre Konferenz Technische Unterstützungssysteme, die die Menschen Wirklich Wollen , 2016 - Hamburg : Laboratorium Fertigungstechnik, smartASSIST, Helmut Schmidt Universität, S. 475-484

Akustische Marker für eine verbesserte Situations- und Intentionserkennung von technischen Assistenzsystemen

Siegert, Ingo; Lotz, Alicia Flores; Egorow, Olga; Böck, Ronald; Schega, Lutz; Tornow, Michael; Thiers, Angelina; Wendemuth, Andreas

In: Technische Unterstützungssysteme, die die Menschen wirklich wollen: Zweite Transdisziplinäre Konferenz : Hamburg 2016 - Hamburg: Laboratorium Fertigungstechnik, smartASSIST, Helmut Schmidt Universität$, S. 465-474[Kongress: 2. Transdisziplinäre Konferenz "Technische Unterstützungssysteme, die die Menschen wirklich wollen", Hamburg, 2016]

Article in conference proceedings

Integrated health and fitness (iGF)-corpus - ten-modal highly synchronized subject-dispositional and emotional human machine interactions

Tornow, Michael; Krippl, Martin; Bade, Svea; Thiers, Angelina; Siegert, Ingo; Handrich, Sebastian; Krüger, Julia; Schega, Lutz; Wendemuth, Andreas

In: Kongress: MMC 2016, Portorož, 2016.05.24, Multimodal Corpora: Computer vision and language processing (MMC 2016) - ELRA, S. 21-24

Publication link

2015

Peer-reviewed journal article

Exploratory voice-controlled search for young users - Challenges & Potential Benefits

Kotzyba, Michael; Siegert, Ingo; Gossen, Tatiana; Wendemuth, Andreas; Nürnberger, Andreas

In: Kognitive Systeme - Duisburg : DuEPublico, Duisburg-Essen Publication Online, Universität Duisburg-Essen . - 2015, Heft 1, insges. 10 S.

Publication link

Probabilistic breadth as an evaluation measure of gaussian mixture models used for acoustic emotion states

Böck, Ronald; Siegert, Ingo; Wendemuth, Andreas

In: Kognitive Systeme - Duisburg: DuEPublico, Duisburg-Essen Publication Online, Universität Duisburg-Essen, 2013 . - 2015, 2, insges. 8 S.

Publication link

Dissertation

Emotional and user-specific cues for improved analysis of naturalistic interactions

Siegert, Ingo; Wendemuth, Andreas; Diedrich, Christian

In: Magdeburg, Magdeburg, Univ., Fak. für Elektrotechnik und Informationstechnik, Diss., 2015, XIX, 266 S.

Book chapter

Overlapping speech, utterance duration and affective content in HHI and HCI - an comparison

Siegert, Ingo; Böck, Ronald; Vlasenko, Bogdan; Ohnemus, Kerstin; Wendemuth, Andreas

In: CogInfoCom 2015 , 2015 - [Piscataway, NJ] : IEEE ; CogInfoCom (6.:2015), S. 83-88

Publication link

Emotion and disposition detection in medical machines - chances and challenges

Hartmann, Kim; Siegert, Ingo; Prylipko, Dmytro

In: Machine Medical Ethics - Cham: Springer, 2015; van Rysewyk, Simon Peter . - 2015, S. 317-339

Publication link

ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems

Hartmann, Kim; Siegert, Ingo; Schuller, Björn; Morency, Louis-Philippe; Salah, Albert Ali; Böck, Ronald

In: Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies - New York, NY : ACM ; Hartmann, Kim . - 2015, S. 1-2 Kongress: ERM4CT'15 17 Seattle, USA 2015.11.09-13

Publication link

Recognising emotional evolution from speech

Böck, Ronald; Siegert, Ingo

In: Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies - New York, NY: ACM, 2015; Hartmann, Kim . - 2015, S. 13-18Kongress: ERM4CT'15 17 (Seattle, USA : 2015.11.09-13)

Publication link

Ein Datenset zur Untersuchung emotionaler Sprache in Kundenbindungsdialogen

Siegert, Ingo; Philippou-Hübner, David; Tornow, Michael; Heinemann, Ralph; Wendemuth, Andreas; Ohnemus, Kerstin; Fischer, Sarah; Schreiber, Gerald

In: Elektronische Sprachsignalverarbeitung 2015: Tagungsband der 26. Konferenz, Eichstätt, 25. - 27. März 2015 - Dresden: TUDpress, S. 180-187 - (Studientexte zur Sprachkommunikation; 78)Kongress: Konferenz "Elektronische Sprachsignalverarbeitung 26 (Eichstätt : 2015)

Automatic differentiation of form-function-relations of the discourse particle "hm" in a naturalistic human-computer interaction

Lotz, Alicia Flores; Siegert, Ingo; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2015: Tagungsband der 26. Konferenz, Eichstätt, 25. - 27. März 2015 / [26. Konferenz "Elektronische Sprachsignalverarbeitung"]. Günther Wirsching (Hrsg.). [Mitw. Förderverein Elektronische Sprachsignalverarbeitung e.V. Tagungsort Katholische Universität Eichstätt-Ingolstadt. Tagungsorganisation Katholische Universität Eichstätt-Ingolstadt, Lehrstuhl für Mathematik-Statistik]: Tagungsband der 26. Konferenz, Eichstätt, 25. - 27. März 2015 - Dresden: TUDpress, 2015 . - 2015, S. 172-179 - (Studientexte zur Sprachkommunikation; 78)Kongress: Konferenz "Elektronische Sprachsignalverarbeitung 26 (Eichstätt : 2015)

ERM4CT chairs' welcome

Hartmann, Kim; Siegert, Ingo; Schuller, Björn; Morency, Louis-Philippe; Salah, Albert Ali; Böck, Ronald

In: Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies - New York, NY : ACM ; Hartmann, Kim . - 2015, S. III

Publication link

Article in conference proceedings

Exploring dataset similarities using PCA-based feature selection

Siegert, Ingo; Böck, Ronald; Vlasenko, Bogdan; Wendemuth, Andreas

In: 2015 International Conference on Affective Computing and Intelligent Interaction (ACII): 21 - 24 Sept. 2015, Xi'an - Piscataway, NJ: IEEE, 2015 . - 2015, S. 387-393

Publication link

A new dataset of telephone-based human-human call-center interaction with emotional evaluation

Siegert, Ingo; Ohnemus, Kerstin

In: Proceedings of the 1st International Symposium on Companion-Technology (ISCT 2015): September 23rd - 25th, Ulm University, Germany, S. 143-148Kongress: International Symposium on Companion-Technology, ISCT 1 (Ulm : 2015.09.23-25)

Publication link

Probabilistic breadth used in evaluation of resulting gaussian mixture models

Böck, Ronald; Siegert, Ingo; Wendemuth, Andreas

In: 4. Interdisziplinärer Workshop Kognitive Systeme 2015: Mensch, Teams, Systeme und Automaten: proceedings - Bielefeld, 2015 . - 2015, insges. 8 S.Kongress: Interdisziplinärer Workshop Kognitive Systeme 4 (Bielefeld : 2015.03.23-25)

Editor

Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies

Hartmann, Kim; Siegert, Ingo; Schuller, Björn; Morency, Louis-Philippe; Sala, Albert Ali; Böck, Ronald

In: New York, NY: ACM, 2015, Online Ressource (PDF-Datei)Kongress: International Workshop on Emotion Representations and Modelling for Companion Technologies 17 (Seattle, USA : 2015.11.09-13)

Publication link

2014

Peer-reviewed journal article

Analysis of significant dialog events in realistic human-computer interaction

Prylipko, Dmytro; Rösner, Dietmar; Siegert, Ingo; Günther, Stephan; Friesen, Rafael; Haase, Matthias; Vlasenko, Bogdan; Wendemuth, Andreas

In: Journal on multimodal user interfaces - Berlin: Springer, 2007, Bd. 8.2014, 1, S. 75-86

Publication link

Investigation of speaker group-dependent modelling for recognition of affective states from speech

Siegert, Ingo; Philippou-Hübner, David; Hartmann, Kim; Böck, Ronald; Wendemuth, Andreas

In: Cognitive Computation - New York, NY: Springer, 2009, Bd. 6.2014, 4, S. 892-913

Publication link

Book chapter

Investigating the form-function-relation of the discourse particle “hm” in a naturalistic human-computer interaction

Siegert, Ingo; Prylipko, Dmytro; Hartmann, Kim; Böck, Ronald; Wendemuth, Andreas

In: Recent Advances of Neural Network Models and Applications / Bassis , Simone - Cham : Springer International Publishing ; Bassis, Simone . - 2014, S. 387-394 - (Smart Innovation, Systems and Technologies; 26)

Publication link

Discourse particles and user characteristics in naturalistic human-computer interaction

Siegert, Ingo; Haase, Matthias; Prylipko, Dmytro; Wendemuth, Andreas

In: Human-Computer Interaction. Advanced Interaction Modalities and Techniques / Kurosu , Masaaki - Cham [u.a.] : Springer ; Kurosu, Masaaki . - 2014, S. 492-501 - (Lecture notes in computer science; 8511) Kongress: HCI International 16 Heraklion, Crete 2014.06.22-27

Publication link

Article in conference proceedings

Application of image processing methods to filled pauses detection from spontaneous speech

Prylipko, Dmytro; Egorow, O.; Siegert, Ingo; Wendemuth, Andreas

In: 15th annual conference of the International Speech Communication Association, INTERSPEECH: Singapore, 14 - 18 September 2014 - International Speech and Communication Association, S. 1816-1820Kongress: INTERSPEECH 15 (Singapore : 2014.09.14-18)

2013

Peer-reviewed journal article

Inter-rater reliability for emotion annotation in human-computer interaction - comparison and methodological improvements

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Journal on multimodal user interfaces - Berlin: Springer, 2007 . - 2013

Publication link

Modelling of emotional development within human-computer-interaction

Siegert, Ingo; Hartmann, Kim; Glüge, Stefan; Wendemuth, Andreas

In: Kognitive Systeme. - Duisburg : DuEPublico, 1, insges. 8 S., 2013

Publication link

Book chapter

The influence of context knowledge for multi-modal affective annotation

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Human-computer interaction ; Pt. 5:Towards intelligent and implicit interaction - Berlin [u.a.]: Springer, 2013 . - 2013, S. 381-390 - (Lecture notes in computer science; 8008)Kongress: HCI International 15 (Las Vegas, Nev. : 2013.07.21-26)

Publication link

Human behaviour in HCI - complex emotion detection through sparse speech features

Siegert, Ingo; Hartmann, Kim; Philippou-Hübner, David; Wendemuth, Andreas

In: Human Behavior Understanding / Salah , Albert Ali - Cham [u.a.] : Springer ; Salah, Albert Ali . - 2013, S. 246-257 - (Lecture notes in computer science; 8212) Kongress: HBU 4 Barcelona 2013.10.22

Publication link

Using speaker group dependent modelling to improve fusion of fragmentary classifier decisions

Siegert, Ingo; Glodek, Michael; Panning, Axel; Krell, Gerald; Schwenker, Friedhelm; Al-Hamadi, Ayoub; Wendemuth, Andreas

In: Proceedings of the 2013 IEEE International Conference on Cybernetics (CYBCONF 2013) : Lausanne, Switzerland, 13-15 June, 2013. - IEEE, S. 132-137Kongress: CYBCONF; (Lausanne, Switzerland) : 2013.06.13-15

Publication link

Annotation and classification of changes of involvement in group conversation

Böck, Ronald; Glüge, Stefan; Siegert, Ingo; Wendemuth, Andreas

In: Humaine Association Conference on Affective Computing and Intelligent Interaction (ACII), 2013 - Piscataway, NJ : IEEE, S. 803-808

Publication link

Audio-based pre-classification for semi-automatic facial expression coding

Böck, Ronald; Limbrecht-Ecklundt, Kerstin; Siegert, Ingo; Walter, Steffen; Wendemuth, Andreas

In: Human-computer interaction ; Pt. 5:Towards intelligent and implicit interaction - Berlin [u.a.]: Springer, 2013 . - 2013, S. 301-309 - (Lecture notes in computer science; 8008)Kongress: HCI International 15 (Las Vegas, Nev. : 2013.07.21-26)

Publication link

Characterization of Lamb wave attenuation mechanisms

Schmidt, Daniel; Sadri, Hossein; Szewieczek, Artur; Sinapius, Michael; Wierach, Peter; Siegert, Ingo; Wendemuth, Andreas

In: Health monitoring of structural and biological systems 2013 : 11 - 14 March 2013, San Diego, California, United States ; [part of SPIE smart structures/NDE]. - Bellingham, Wash. : SPIE - (Proceedings of SPIE; 8695)Kongress: Conference on Health Monitoring of Structural and Biological Systems; (San Diego, Calif.) : 2013.03.11-14

Publication link

Fusion of fragmentary classifier decisions for affective state recognition

Krell, Gerald; Glodek, Michael; Panning, Axel; Siegert, Ingo; Michaelis, Bernd; Wendemuth, Andreas; Schwenker, Friedhelm

In: Multimodal pattern recognition of social signals in human-computer-interaction : first IAPR TC3 workshop, MPRSS 2012, Tsukuba, Japan, November 11, 2012 ; revised selected papers. - Berlin [u.a.] : Springer, S. 116-130, 2013 - (Lecture notes in computer science; 7742)Kongress: MPRSS; 1 (Tsukuba) : 2012.11.11

Publication link

Emotion detection in HCI - from speech features to emotion space?

Hartmann, Kim; Siegert, Ingo; Philippou-Hübner, David; Wendemuth, Andreas

In: 7th IFAC Conference on Manufacturing Modelling, Management, and Control, 2013. - IFAC, S. 288-295Kongress: IFAC Conference on Manufacturing Modelling, Management, and Control; 7 (Saint Petersburg) : 2013.06.19-21

Publication link

Editor

Joint proceedings of the 2013th T2CT and CCGL workshops

Böck, Ronald; Degens, Nick; Heylen, Dirk; Louchart, Sandy; Minker, Wolfgang; Morency, Louis-Philippe; Nazir, Asad; Schwenker, Friedhelm; Siegert, Ingo

In: Magdeburg: Otto von Guericke University Magdeburg, 2013, 1 CD-R, 12 cmKongress: Workshop "Techniques Towards Companion Technologies", T2CT 13 (Edinburgh, UK : 2013.08.28)

2012

Book chapter

Modeling users' mood state to improve human-machine-interaction

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Cognitive behavioural systems - Berlin [u.a.]: Springer; Esposito, Anna . - 2012, S. 273-279 - (Lecture Notes in Computer Science; 7403)Kongress: COST Dresden : 2011.02.21-26

Publication link

Combining mimic and prosodic analyses for user disposition classification

Böck, Ronald; Limbrecht, Kerstin; Siegert, Ingo; Glüge, Stefan; Walter, Steffen; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2012 - Dresden: TUDpress Verl. der Wiss.; Wolff, Matthias . - 2012, S. 220 - (Studientexte zur Sprachkommunikation; 64)Kongress: Konferenz Elektronische Sprachsignalverarbeitung 23 (Cottbus : 2012.08.29-31)

The Influence of Context Knowledge for Multimodal Annotation

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Joint proceedings of the IVA 2012 workshops - Santa Cruz, California, September 15, 2012 - Magdeburg: Univ. . - 2012, S. 25-31Kongress: IVA 12 (Santa Cruz, Calif. : 2012.09.12-15)

Multimodal affect recognition in spontaneous HCI environment

Panning, Axel; Siegert, Ingo; Al-Hamadi, Ayoub; Wendemuth, Andreas; Rösner, Dietmar; Frommer, Jörg; Krell, Gerald; Michaelis, Bernd

In: 2012 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC 2012) : Hong Kong, China, 12-15 August 2012 ; proceedings. - Piscataway, NJ : IEEE, insges. 6 S.Kongress: ICSPCC; (Hong Kong) : 2012.08.12-15

Publication link

Describing human emotions through mathematical modelling

Hartmann, Kim; Siegert, Ingo; Glüge, Stefan; Wendemuth, Andreas; Kotzyba, Michael; Deml, Barbara

In: Preprints MATHMOD 2012 Vienna : abstract volume. - Vienna : ARGESIM, ARGE Simulation News, Vienna Univ. of Technology, insges. 6 S. - (ARGESIM report; 38)Kongress: MATHMOD; 7 (Vienna) : 2012.02.15-17

Publication link

Investigation of hierarchical classification for simultaneous gender and age recognition

Siegert, Ingo; Böck, Ronald; Philippou-Hübner, David; Wendemuth, Andreas

In: Elektronische Sprachsignalverarbeitung 2012 - Dresden: TUDpress Verl. der Wiss.; Wolff, Matthias . - 2012, S. 58 - (Studientexte zur Sprachkommunikation; 64)Kongress: Konferenz Elektronische Sprachsignalverarbeitung 23 (Cottbus : 2012.08.29-31)

Towards emotion and affect detection in the multimodal LAST MINUTE corpus

Frommer, Jörg; Michaelis, Bernd; Rösner, Dietmar; Wendemuth, Andreas; Friesen, Rafael; Haase, Matthias; Kunze, Manuela; Andrich, Rico; Krüger, Julia; Panning, Axel; Siegert, Ingo

In: Proceedings of the 8th International Conference on Language Resources and Evaluation: May 23-25, 2012 / eds. Nicoletta Calzolari: May 23-25, 2012 - ELRA, 2012; Calzolari, Nicoletta . - 2012, S. 3064-3069Kongress: LREC 2010 8 (Istanbul, Turkey : 2012.05.23-25)

Abstract

Emotion detection by event evaluation using fuzzy sets as appraisal variables

Kotzyba, Michael; Deml, Barbara; Neumann, Hendrik; Glüge, Stefan; Hartmann, Kim; Siegert, Ingo; Wendemuth, Andreas; Traue, Harald; Walter, Steffen

In: Proceedings of ICCM 2012 : 11th International Conference on Cognitive Modeling. - Berlin : Universitätsverl. der TU Berlin, S. 123-124

Publication link

2011

Book chapter

Vowels formants analysis allows straightforward detection of high arousal emotions

Vlasenko, Bogdan; Philippou-Hübner, David; Prylipko, Dmytro; Böck, Ronald; Siegert, Ingo; Wendemuth, Andreas

In: 2011 IEEE International Conference on Multimedia and Expo: ICME 2011 ; electronic proceedings - Piscataway, NJ: IEEE, 2011; Chen, Irene, 2011, paper 631, insgesamt 6 S.Kongress: ICME (Barcelona, Spain : 2011.07.11-15)[Beitrag auf USB-Stick]

Publication link

A processing tool for emotionally coloured speech

Böck, Ronald; Siegert, Ingo; Vlasenko, Bogdan; Wendemuth, Andreas; Haase, Matthias; Lange, Julia

In: 2011 IEEE International Conference on Multimedia and Expo: ICME 2011 ; electronic proceedings - Piscataway, NJ: IEEE, 2011; Chen, Irene, 2011, paper 895, insgesamt 1 S.Kongress: ICME (Barcelona, Spain : 2011.07.11-15)[Beitrag auf USB-Stick]

Appropriate emotional labelling of non-acted speech using basic emotions, geneva emotion wheel and self assessment manikins

Siegert, Ingo; Böck, Ronald; Philippou-Hübner, David; Vlasenko, Bogdan; Wendemuth, Andreas

In: 2011 IEEE International Conference on Multimedia and Expo: ICME 2011 ; electronic proceedings - Piscataway, NJ: IEEE, 2011; Chen, Irene, 2011, paper 419, insgesamt 6 S.Kongress: ICME (Barcelona, Spain : 2011.07.11-15)[Beitrag auf USB-Stick]

Publication link

Ikannotate - a tool for labelling, transcription, and annotation of emotionally coloured speech

Böck, Ronald; Siegert, Ingo; Haase, Matthias; Lange, Julia; Wendemuth, Andreas

In: Affective computing and intelligent interaction ; Pt. 1 - Heidelberg [u.a.]: Springer, 2011; Pt. 1 . - 2011, S. 25-34 - (Lecture notes in computer science; 6974)Kongress: ACII 4 (Memphis, TN : 2011.10.09-12)

Publication link

Abstract

Incorporation of a mood-model to improve user-disposition prediction from emotion recognition

Siegert, Ingo; Böck, Ronald; Wendemuth, Andreas

In: Program and abstracts of the COST 2102 Final Conference: held in conjunction with the 4th COST 2102 International Training School on Cognitive Behavioural Systems ; February 21 - 25, 2011, Dresden, Germany / Technische Universität Dresden, Institut für Akustik und Sprachkommunikation. [Ed. by Anna Esposito ...]: held in conjunction with the 4th COST 2102 International Training School on Cognitive Behavioural Systems ; February 21 - 25, 2011, Dresden, Germany - Dresden: Techn. Univ., Inst. für Akustik und Sprachkommunikation, 2011; Esposito, Anna . - 2011, S. 34[Kongress: COST 2102 Final Conference, Dresden, Germany, February 21 - 25, 2011]

2010

Book chapter

Developing an expressive speech labeling tool incorporating the temporal characteristics of emotion

Scherer, Stefan; Siegert, Ingo; Bigalke, Lutz; Meudt, Sascha

In: Proceedings of the 7th International Conference on Language Resources and Evaluation - Paris : ELRA . - 2010, S. 1172-1175 Kongress: LREC 2010 7 Vallette, Malta 2010.05.17-23

Publication link

the following publications are accepted but not yet published

Ingo Siegert & Julia Krüger: “Speech melody and speech content didn’t fit together” – Differences in Speech Behavior for Device Directed and Human Directed Interactions.
Advances in Data Science: Methodologies and Applications, in print.

Ingo Siegert. Alicia Flores Lotz, Andreas Wendemuth: Emotionserkennung für eine nutzerzentrierte Fahrerassistenz – Affective Computing im realem Fahrzeugkontext
11. Symposium "Motor- und Aggregateakustik" (accepted)

Oliver Jokisch, Enrico Lösch and Ingo Siegert Advances in sound and speech signal processing at the presence of drones
Accepted for Quiet Drones - A Symposium on Noise from UASs/UAVs

Ingo Siegert “Alexa in the wild” – Collecting unconstrained conversations with a modern voice assistant in a public environment
Accepted for LREC 2020

Norman Weißkirchen, Mainampati Vasudeva Reddy, Andreas Wendemuth and Ingo Siegert. Utilizing Computer Vision Algorithms to Detect and Describe Local Features in Images for Emotion Recognition from Speech. Accepted for ICHMS 2020.

Organiser

Conferences

ESSV 2020, 31. Konferenz Elektronische Sprachverarbeitung, 4-6 März, 2020, Magdeburg, Co-Organisator

SPECOM 2018, 20th International Conference on Speech and Computer, 18-22 September, 2018, Leipzig, Germany, Local Organizing Committee – Special Session Chair

Summer Schools

International Summer School on Companion Technology (ISSCT 2017) - Theory and Application
In conjunction with the IEEE International Conference on Companion Technology, Ulm, Germany
September 9-13, 2017

Workshops

LREC Workshop Legal and Ethical Issues Workshop, Mai 2020, Co-Organisator

ITG-Workshop Sprachassistenten: Anwendungen, Implikationen, Entwicklungen, 3 März 2020, Magdeburg, Co-Organisator
For further information: click here

2nd International Workshop on Emotion Representations and Modelling for Companion Technologies (ERM4CT 2016),
Workshop at ICMI 2016 (18th ACM International Conference on Multimodal Interaction),
Seattle, USA, November 16th, 2016
For further information: click here

International Workshop on Emotion Representations and Modelling for Companion Technologies (ERM4CT 2015),
Workshop at ICMI 2015 (17th ACM International Conference on Multimodal Interaction),
Seattle, USA, November 13th, 2015
For further information: click here

1st International Workshop on Techniques Towards Companion Technologies (T2CT 2013)
Workshop at IVA 2013 (International Conference on Intelligent Virtual Agents), Edinburgh, UK
August 28, 2013
For further information: click here

Editorships

Elektronische Sprachsignalverarbeitung 2020 - Tagungsband der 31. Konferenz Magdeburg, 4. - 6. März 2020.
Eds.: Andreas Wendemuth, Ronald Böck, Ingo Siegert.
Dresden: TUDpress, 2020

Sprachassistenten - Anwendungen, Implikationen, Entwicklungen : ITG-Workshop : Magdeburg, 3. März, 2020. Abstractbook.
Eds.: Ingo Siegert, Sebastian Möller
Unibibliothek Magdeburg

ERM4CT '16: Proceedings of the 2nd International Workshop on Emotion Representations and Modelling for Companion Technologies.
Eds.: Kim Hartmann, Ingo Siegert, Ali Albert Salah and Khiet P. Truong.
ACM, New York, NY, USA, 2016

Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies.
Eds: Kim Hartmann, Ingo Siegert, Björn Schuller, Louis-Philippe Morency, Albert Ali Salah and Ronald Böck
ACM, New York, NY, USA, 2015

Joint Proceedings of the 2013th T2CT and CCGL Workshops,
Eds: Ronald Böck, Nick Degens, Dirk Heylen, Sandy Louchart, Wolfgang Minker, Louis-Philippe Morency, Asad Nazir, Friedhelm Schwenker, Ingo Siegert
Edinburgh, UK, August 28, 2013
Publisher: Otto von Guericke University Magdeburg,
ISBN: 978-3-940961-99-0.

invited Talks

2020 -- Siri, Alexa & Co: Wie können Dialoge mit Sprachassistenten natürlicher werden und warum können diese mir nicht einfach mal meine Fragen beantworten?
Vortrag im Rahmen der Vortragsreihe "Wissenschaft im Rathaus" Magdeburg

2020 -- Differences in Speech Behavior for Human-Directed and Device Directed Speech for the application of Addressee-Detection
Kolloqium an der Universität des Saarlandes Fachrichtung Sprachwissenschaft und Sprachtechnologie, Saarbrücken

2019 -- Wie finden wir es, wenn Maschinen uns Persönliches fragen?
Mitmachwerkstatt auf der KI&Wir Convention Magdeburg

2019 -- Rendezvous mit Mr(s) Robot - Liebe auf das erste BYTE? Ein Blick auf die aktuelle Forschung im Bericht der Mensch-Maschine-Interaktion.
Filmgespräch im Rahmen des SILBERSALZ Science & Media Festival 2019

2019 -- Speech Technology for Human-Machine-Interaction
Vortrag auf dem KI@OVGU Symposium in Magdeburg

2019 -- Meet the Scientist Magdeburg "Der (in-)kompetente Helfer" im Rahmen der Wissenschaftsausstellung auf der MS Wissenschaft
Wie Menschen mit einem digitalen Sprachassistenten sprechen müssen, damit er sie versteht.

2018 -- Smarte Systeme vermitteln (Bachelor) - Smarte Ingenieure für die Industrie
Co-Speaker, NI Technologie- und Anwenderkongress VIP 2018

2017 – Freud und Leid am Ticketautomaten – Situations- und Dispositionserkennende Companiontechnologie,
Gastvortrag an der Hochschule für Telekommunikation, Leipzig

2015 – Situations- und Dispositionserkennende Companiontechnologie – Vortrag im Rahmen der Auszeichnung „Deutschland Land der Ideen“ an den SFB/TRR-62

2013 – Companion-Technology – The Future of Cognitive Technical Systems, Introduction talk at the 1st International Workshop on Techniques Towards Companion
Technologies

Awards

Ronald Böck, Olga Egorow, Ingo Siegert und Andreas Wendemuth. “Comparative Study on Normalisation in Emotion Recognition from Speech”. In: Proceedings
of the 9th International Conference on Intelligent Human Computer Interaction (IHCI 2017). Hrsg. von Patrick Horain, Catherine Achard und Malik Mallem. Cham: Springer International Publishing, 2017, S. 189–201, Best Paper Award.

Current projects

AI-supported speech analysis for precision psychotherapy
Duration: 01.09.2025 to 31.08.2030

With the help of artificial intelligence (AI), it is now possible not only to recognize language, but also to better understand its emotional and contextual content. This can open up new possibilities in psychotherapy: If linguistic signals are automatically analyzed during a therapy session, therapists can use additional information to assess how the therapeutic alliance or other important process factors are developing over the course of the session.
Building on the ASPIRE pilot study, this project will further develop AI models that capture the acoustic characteristics of the voice (e.g. pitch, speech tempo, intonation) as well as linguistic content and derive indications of relevant psychological constructs such as the therapeutic alliance. The project will also investigate how robust these models are in relation to methods of speech anonymization - i.e. procedures that protect sensitive patient data without distorting important information patterns.
The long-term aim is to develop a prototype that provides therapists with objective additional information about the course of therapy. In a further step, it will also be investigated how voice data can be combined with other information channels - such as video material - in order to understand and support the therapeutic process even more comprehensively.
This text was translated with DeepL on 05/02/2026

View project in the research portal

SAVER: Language analysis of psychotherapeutic treatment in a transdiagnostic context
Duration: 01.09.2025 to 31.08.2028

Mental disorders are among the greatest burdens on health worldwide. Language plays a central role in diagnosis and treatment, both as a medium for the expression of mental experience and for its change. The SAVER project uses current developments in the field of artificial intelligence to systematically record these linguistic dimensions. The aim is to create a multi-center database of psychotherapeutic sessions in the course of treatment and to analyze them using machine learning methods. Four main areas are being investigated: (1) the identification of diagnostic markers, (2) the automated recording of active therapeutic elements and change processes, (3) the robustness of such analyses using modern language anonymization and (4) the simulation of therapeutic conversation components using large language models. In the long term, the project should contribute to more precise diagnoses, a better understanding of therapeutic mechanisms of action and an evidence-based further development of psychotherapeutic procedures.
The overall study is being led by the Central Institute of Mental Health Mannheim. In addition to the KPSM of the Medical Faculty of Otto von Guericke University, Ruhr University Bochum, Ludwig Maximilian University Munich, Friedrich Schiller University Jena and the University of Ulm are also involved in the project.
This text was translated with DeepL on 05/02/2026

View project in the research portal

Completed projects

Medinym - AI-based anonymization of personal patient data in clinical text and voice databases
Duration: 15.12.2022 to 14.12.2025

Motivation
The ongoing scientific development of technologies based on artificial intelligence (AI) is promoting potential medical applications. The real use of these technologies by a large number of users such as citizens, public authorities, healthcare professionals and small and medium-sized enterprises faces the difficulty of handling data in a secure and data-protected manner. Innovative technologies often cannot be used in the automated processing of medical data in particular, as the protection of identity is rightly a high priority due to the sensitive content. The need to protect clinical data and the resulting difficulty in accessing it also means that machine learning (ML) methods, for example for clinical diagnoses, prognoses and therapy or decision support, cannot be developed without major hurdles.

Aims and approach
The project "AI-based anonymization of personal patient data in clinical text and speech datasets" (Medinym) investigates the possibility of reusing sensitive data by removing sensitive information through anonymization. Two medical use cases, text-based data from electronic patient records and voice data from diagnostic doctor-patient consultations, are being implemented as examples in the project. To this end, open technologies for anonymization are being investigated, further developed and applied to real data. The researchers are also investigating how the informative value of such anonymized data can be preserved for further use. Methods that prevent or hinder misuse of the technology outside of the intended use case will also be considered.

Innovations and perspectives
Information-preserving anonymization should make it possible to further process clinical data, as de-anonymization is no longer possible. These data sets can then be used to train AI models on clinical data in compliance with data protection regulations or be extended to other cohorts. This would make it possible for small and medium-sized companies to collect corresponding amounts of data cumulatively. This would allow sensitive data to be pooled across multiple applications and used for AI training routines, provided it is always anonymized accordingly. The desired anonymization should also increase the willingness of patients to consent to participation in studies, data analyses and general donations of health data. Ultimately, information-preserving anonymization allows the technology to be integrated into current development methods and diagnostic systems, thereby strengthening Germany as a location for science and business in the fields of diagnostics, treatment and therefore healthcare in general.

Funding
Funded by the European Union - NextGenerationEU
This text was translated with DeepL

View project in the research portal

AI Engineering - An interdisciplinary, project-oriented degree program with an educational focus on artificial intelligence and engineering sciences
Duration: 01.12.2021 to 30.11.2025

AI Engineering (AiEng) encompasses the systematic design, development, integration and operation of solutions based on artificial intelligence (AI) using engineering methods as a model. At the same time, AiEng builds a bridge between basic research on AI methods and the engineering sciences and makes the use of AI systematically accessible and available there. The project focuses on the nationwide development of a Bachelor's degree program in AI Engineering, which combines the training of AI methods, models and technologies with those of engineering sciences. AiEng is to be designed as a cooperative study program between Otto von Guericke University (OVGU) Magdeburg and the four universities in Saxony-Anhalt: Anhalt University of Applied Sciences, Harz University of Applied Sciences, Magdeburg-Stendal University of Applied Sciences and Merseburg University of Applied Sciences. The interdisciplinary degree program will enable students to develop AI systems and services in the industrial environment and beyond and to provide holistic support for the associated engineering process - from problem analysis to commissioning and maintenance / servicing. The AiEng curriculum provides comprehensive AI training, supplemented by basic engineering training and in-depth training in a selected application domain. In order to achieve a symbiosis of AI and engineering education, a new action-oriented framework is developed and taught, which describes the complete engineering process of AI solutions and methodically supports all phases. AIEng is characterized by a cross-module interlocking of teaching and learning content within a semester as well as by a cross-faculty and cross-university tandem teaching concept and pursues a student-centered didactic concept, which is supported by many practice-oriented (team) projects and a wide range of Open Educational Resources (OERs) with an (e)-tutor program.
This text was translated with DeepL

View project in the research portal

AnonymPrevent - AI-based Improvement of Anonymity for Remote Assessment, Treatment and Prevention against Child Sexual Abuse
Duration: 01.12.2021 to 31.07.2025

AnonymPrevent investigates both the use and improvement of innovative AI-based anonymization techniques for initial counseling and preventive remote treatment of people who are sexually attracted to children. We aim to anonymize the identity of a patient (given by voice and way of speaking), but at the same time we retain clinical-diagnostic information of, e.g., emotional and personality-related expression. Anonymization of telephone-based contacts, as well as for follow-up therapy possibly supplemented by video transmission, are implemented using latest neural models such as Variational Autoencoder with Differential Digital Signal Processing and avatar-based communication respectively. Since 2005 the Institute of Sexology and Sexual Medicine of Berlin’s Charité, here acting as both practical and research partner, has been leading nationally and internationally growing projects offering treatment to people with pedophilic or hebephilic inclination. Since these sexual inclinations are societally connotated with a high degree of shame and stigmatization, the topic child sexual abuse prevention proofs highly relevant. Ultimately, the project investigates whether and to what extent anonymization of verbal and visual communication channels can lead to increased acceptance of a preventive treatment offer and at the same time does not have an unfavorable influence on communication within the therapy, possibly even promotes open exchange.

View project in the research portal

Automated acoustic-prosodic speech analysis for psychotherapy research and the development of e-companion enhancement in psychotherapy (ASPIRE)
Duration: 01.06.2023 to 31.05.2025

Automated AI-supported speech analysis, which can potentially capture relevant construct markers in real time (intra-session) and enable their evaluation, has the potential to contribute to evidence-based situational intervention design in precision psychotherapy and to become effective as digital enhancement technology (e-companion) (Kučera & Mehl, 2022; Chekroud et al., 2021; Krüger, Siegert & Junne, 2022).
The aim of the project is to develop a valid prediction model for the central impact factor therapeutic relationship (as a model construct) based on speech content and prosodic-acoustic speech data as part of a proof-of-concept approach. This enables automated marker identification as a basis for future feedback to psychotherapists for further targeted intervention design. On the basis of automated discourse analyses and validated rating systems, cross-sectional analyses of the interpersonal robustness of content-analytical and acoustic-prosodic markers as well as longitudinal analyses of individual relationship trajectories will be made possible. In the data analysis, speech content and prosodic-acoustic markers are automatically extracted from audio data (especially those related to pitch, energy, voice quality and rhythm). In parallel, AI-based state-of-the-art anonymization methods are adapted to obtain the speech content and prosodic-acoustic markers and the extent to which the anonymized data is reliable for the evaluation of the therapeutic relationship is analyzed.
This text was translated with DeepL

View project in the research portal

Eaasy System - Electric Adaptive Autonomous Smart Delivery System
Duration: 01.02.2022 to 31.01.2025

The Eaasy System project aims to develop electric cargo bikes with automated driving functions that enable the environmentally friendly delivery of goods for use in "last mile" logistics. This new development aims to combine the flexibility of conventional cargo bikes with the ergonomic advantages and lean delivery processes of delivery robots (Follow-Me). The driving functions of the automated cargo bikes are geared towards unstructured traffic situations and equipped with a Come-With-Me function - an intuitive voice control system that allows delivery staff to direct the vehicle. The aim is to make logistics more sustainable overall, reduce the physical strain on delivery staff and significantly speed up the delivery of goods.
This text was translated with DeepL

View project in the research portal

Perception of the distinctiveness of pitch in cochlear implant users
Duration: 01.02.2024 to 31.01.2025

The pitch of acoustic signals can be perceived differently even if the perceived pitch is the same. For example, a pure tone of a certain frequency has a more pronounced pitch than a narrowband noise of a corresponding center frequency. The perception measure is the pitch strength. As cochlear implants reproduce the spectral information and dynamics of sound in a reduced form, it is of interest to what extent implanted persons perceive the distinctness of pitch in a similar way to people with normal hearing. For this purpose, paradigmatic signals such as harmonic tone complexes and band-pass filtered noise, but also short speech segments are presented for assessment.
This text was translated with DeepL on 29/12/2025

View project in the research portal

Emonymous - speaker anonymization while preserving the emotional expression effect
Duration: 01.08.2021 to 31.12.2023

Thanks to technological advances in the field of artificial intelligence (AI), interactive and intelligent voice assistants are increasingly finding their way into everyday social life. For data protection reasons, however, their use is mostly limited to applications in the private sphere. In particular, the ability to identify speakers on the basis of a large amount of collected data prevents the effective use of voice assistants in areas that are sensitive under data protection law, such as the healthcare sector or learning support. For many applications, however, the identity of the speaker is not necessarily relevant; it is only necessary to know exactly what was said. In addition to the content of what has been said, language also contains other indicators, such as emotionality or expression. However, preserving these linguistic subtleties after anonymizing the speaker is very important for the interpretation and comprehensive understanding of what has been said in many areas of application (e.g. to correctly assess a patient's state of health).
This text was translated with DeepL

View project in the research portal

MusIAs - Music-guided imagination and digital voice assistant - a pilot study
Duration: 01.01.2021 to 30.06.2023

Music-guided imagination is a resource-oriented music therapy technique which, in addition to music reception, includes therapeutic discussion about the significance of inner images for coping with psychological stress. Between therapy sessions, targeted listening to music supports self-regulation processes. This pilot study investigates the extent to which a common voice assistant can support the selection of music for a music-guided imagination and stimulate the reflection of inner processes and thus promote music-supported self-care. For this purpose, a skill for Amazon's Alexa is being developed, which is based on the so-called Short Music Journey (KMR) and comprises the modules "state of mind assessment", "music selection", "relaxation instructions" and "reflection". In a pilot study, the acceptance and subjective experience of the skill as well as changes in the experience of stress are investigated in comparison to a control condition using a mixed-methods approach in which quantifying measures, experience reports and the speech prosody of the users are analyzed. If a voice assistant for music-guided imagination is experienced as helpful, clinical applications may open up, e.g. technology-supported bridging of gaps in care or inter-session applications in ongoing therapies, if the risks and benefits are carefully weighed up.
This text was translated with DeepL

View project in the research portal

MusIAs - Music-guided imagination and digital voice assistant - a pilot study ...
Duration: 01.01.2021 to 30.06.2023

Music-guided imagination is a resource-oriented music therapy technique which, in addition to music reception, includes therapeutic discussion about the significance of inner images for coping with psychological stress. Between therapy sessions, targeted listening to music supports self-regulation processes. This pilot study investigates the extent to which a common voice assistant can support the selection of music for a music-guided imagination and stimulate the reflection of inner processes and thus promote music-supported self-care. For this purpose, a skill for Amazon's Alexa is being developed, which is based on the so-called Short Music Journey (KMR) and includes the modules "state of mind assessment", "music selection", "relaxation instructions" and "reflection". In a pilot study, the acceptance and subjective experience of the skill as well as changes in the experience of stress are investigated in comparison to a control condition using a mixed-methods approach in which quantifying measures, experience reports and the speech prosody of the users are analyzed. If a voice assistant for music-guided imagination is experienced as helpful, clinical applications may open up, e.g. technology-supported bridging of gaps in care or inter-session applications in ongoing therapies, if the risks and benefits are carefully weighed up.
This text was translated with DeepL on 05/02/2026

View project in the research portal

Perception of paraverbal information in data-reduced spoken language in users of cochlear implants
Duration: 15.08.2020 to 28.02.2022

Data reduction is not only essential for synthesized announcements, but also for speech-producing communication systems (e.g. Siri, Alexa, VoIP, mobile navigation systems) and for the transmission of telephony (Voice over IP, VoIP). Users of a cochlear implant are confronted with a strong impairment of spectral information in sound, which above all limits the exact perception of pitch. The project investigates the extent to which emotion in particular is perceived in spoken language and the effects of additional impairment through data reduction.
This text was translated with DeepL

View project in the research portal

Differences in users' speech behaviour between human-machine and human-human interactions ("Alexa studies")
Duration: 01.11.2018 to 30.06.2021

This interdisciplinary project deals with fundamental research on the speech behaviour of people with technical systems from an engineering and a psychological perspective. In particular, it is investigated to what extent the speech behaviour of humans in interpersonal interactions differs from the speech behaviour of humans in interactions with technical systems. For this purpose, several studies will be carried out using the specially developed data corpus, the Voice Assistant Conversation Corpus (VACC), which is based on interactions with Amazon's Alexa. Different interaction situations (formal vs. informal, dyadic vs. triadic) are investigated and comparisons between objective measurements of acoustic and lexical speech characteristics, user self-reports and external ratings are made. The major goal is to identify a set of distinctive speech features that will enable voice-controlled technical systems to detect whether they are addressed by the user or not. In addition, it will be investigated how the user's experience of the technical system (attributed more to human or more to technical characteristics and abilities) influences the user's speech behaviour.

View project in the research portal

"Find your degree program" - A language-guided guide to study information at the OvGU
Duration: 01.02.2020 to 28.02.2021

Study counseling at a distance? How can this work when interested parties are at home? This is where the current project aims to provide an answer. With just a few questions, prospective students should be presented with a suitable selection of degree programs that match their interests and are offered at the University of Magdeburg.
This text was translated with DeepL

View project in the research portal

ADAS&ME : Adaptive Advanced Driver Assistance Systems to support incapacitated drivers and Mitigate Effectively risks through tailor made Human Machine Interaction under automation
Duration: 01.09.2016 to 28.02.2020

ADAS&ME will develop adapted Advanced Driver Assistance Systems that incorporate driver/rider state, situational/environmental context, and adaptive interaction to automatically transfer control between vehicle and driver/rider and thus ensure safer and more efficient road usage. The work is based around 7 Use Cases, covering a large proportion of driving on European roads. Experimental research will be carried out on algorithms for driver state monitoring as well as on Human-Machine-Interaction and automation transitions. Robust detection/prediction algorithms will be developed for driver/rider state monitoring towards different driver states, such as fatigue, sleepiness, stress, inattention and impairing emotions, employing existing and novel sensing technologies, taking into account traffic and weather conditions and personalizing them to individual driver s physiology and driving behavior. Further, the core development includes multimodal and adaptive warning and intervention strategies based on current driver state and severity of scenarios. The final outcome is a driver/rider state monitoring system, integrated within vehicle automation. The system will be validated with a wide pool of drivers/riders under simulated and real road conditions and under different driver/rider states. This challenging task has been undertaken by a multidisciplinary European Consortium of 30 Partners, including an original equipment manufacturer per vehicle type and 7 direct suppliers.

The Cognitive Systems Group at Otto-von-Guericke-University will contribute to this consortium by providing analysis of emotional content of acoustic utterances in the car. We will also engage in information fusion of data from various modalities (acoustic, video, and others) and analyzing this data for identifying markers for detecting drowsiness or a loss of control state of the driver, thus contributing to driver assistance in several use cases, such as cars, busses, trucks, and motorcycles.

View project in the research portal

Emotion -based support for interactive applications in call centers
Duration: 15.04.2014 to 28.11.2015

The application-oriented research in the field " emotion -based support for interactive applications in call centers " will be further developed. Here is the phone dialogue ,
in which the call center operator is supported in his conversation design through feedback on the
emotional state (control , valence ).

View project in the research portal

Model for localisation of moods and personality traits in valence-pleasure-arousal-space
Duration: 01.01.2013 to 31.12.2013

A Model for localisation of moods and personality traits in valence-pleasure-arousal-space is developed. Experimental trals are located in this space and a trajectory is modelled, which is mood- and personality dependent.

View project in the research portal

Sommersemster

Sprachverarbeitung - Teil Sprachmodelle und Dialogsysteme

Medizinische Signal- und Informationsverarbeitung - Teil: Informationsvearbeitung

Wintersemester

Sprachdialogsysteme

I had the honor to sipervise the following students

2017

Mainampati Vasudeva Reddy
Overview and Comparison of Computer Vision Algorithms to Detect and Describe Local Features in Images'
Non-Technical Project Report

Tang Shuran
Analysis of acoustic features and automatic recognition experiments for conversation addressee detection
Masterarbeit

2016

Thomas Aab
Datenvorverarbeitung und Klassifizierung von Kopfdrehungen und Kopfbeschleunigung mittels MEMS Sensorik
Bachelorarbeit

Srinivasa Rao Peddi
Implementation and Investigations of Broad Phoneme Recognisers for Discourse Particle Detection
Masterarbeit

Linh Linda Duong
Untersuchung des Einflusses unterschiedlicher Audiospeicherformate und Kompressionsformate auf die Audioqualität
Bachelorarbeit (Betreuung als Kooperation mit Alicia Lotz)

Somtapa Bhattacharya
Implementation of improved Methods for Voice Activity Detection
Technical Project

Somtapa Bhattacharya
Evolution of Speech Processing
Non-technical Project

Daile Vera Poungue Wetoumdu
Überlappung in der Mensch-Maschine-Interaktion
Forschungsprojekt

2015

Fengjie Zhang
Comparison of Speech Emotion Recognition using Neural Networks and Deep Belief Networks having limited data material
Master Thesis

Yu Bi
Investigations on Wuality Asessment for Emotion Speech Data
Master Thesis

Bharath Bhat
Overlapping Speech
Non-technical Project

2014

Alicia Flores Lotz
Differentiation von Form-Funktions-Verläufen des Diskurspartikels "hm" über unterschiedliche mathematische Herangehensweisen
Masterarbeit

René Kallweit
Sprachsteuerung eines Roboters über eine Raspberry Pi bzw. Arduino Plattform
Masterarbeit

2012

Thomas Willner
Zerstörungsfreie Werkstoffprüfung von PMMA-Scheiben mit Hilfe von Lambwellen und digitalen Filtern
Studienarbeit

Christian Sporleder
Zerstörungsfreie Werkstoffprüfung von PMMA-Scheiben mit Hilfe von Lambwellen zur Detektion von Beschädigungen
Studienarbeit

Daniel Hellge-Theune
Erstellung und Evaluierung einer parametrisierbaren Onlineabfrage für ein Phonemlexikon der deutschen Sprache
Forschungsprojekt

Charité – Universitätsmedizin Berlin, Institut für Sexualwissenschaft und Sexualmedizin, Prof. Dr. Dr. Klaus Beier
DFKI Berlin Speech and Language Technology (SLT), Berlin
Hochschule Anhalt
Hochschule für Telekommunikation (HfTL), Leipzig, Prof. Dr. Oliver Jokisch
Hochschule Harz
Hochschule Magdeburg-Stendal
Hochschule Merseburg
Ludwig-Maximilians-Universität München, Department Psychologie, Lehrstuhl psychologische Methodenlehre und Diagnostik
NDR Kultur, Michael Becker
Otto-von-Guericke-Universität, AiLab, Prof. Sebastian Stober
Otto-von-Guericke-Universität, Arbeitsgruppe Logistische Systeme, Dr. Tobias Reggelin
Otto von Guericke Universität, Hochschulforschung und Professionalisierung der akademischen Lehre, Prof. Philipp Pohlenz
Otto von Guericke Universität, Institut für Strömungstechnik und Thermodynamik, apl. Prof. Gabor Janiga
Otto-von-Guericke-Universität Magdeburg Medizinische Fakultät Universitätsklinik für Hals-, Nasen- und Ohrenheilkunde, Kopf- und Halschirurgie Abteilung für Experimentelle Audiologie, Prof. Dr. Jesko Verhey
Prof. Dr. Katrin Giel, Sektion Translationale Psychotherapieforschung, Universitätsklinikum Tübingen
Prof. Dr. Susanne Metzner, Wiss. Leitung Studien- und Forschungsbereich Musiktherapie, Leopold-Mozart-Zentrum, Universität Augsburg
regiocom SE
Technische Universität Berlin, Quality and Usability Labs
Universitätsklinik für Psychosomatische Medizin und Psychotherapie, Dr. Julia Krüger, Prof. Dr. Jörg Frommer
University of Southern Queensland, Toowoomba, Australien, Dr. Rajib Rana