Numbers > Number 21 > Analysis of voice-generative artificial intelligence software applied to podcasting
Research
Licencia de Creative Commons
ISSN: 1885-365X
FITÓ-CARRERAS, María Contact 0000-0002-0500-4006
VIDAL-MESTRE, Montserrat Contact 0000-0001-6144-5386
FREIRE-SÁNCHEZ, Alfonso Contact 0000-0003-2082-1212

Analysis of voice-generative artificial intelligence software applied to podcasting

18 de octubre de 2024
3 de noviembre de 2024
Voice AI is capable of generating human language messages through deep learning algorithms, such as convolutional neural networks (CNN), which learn to imitate vocal patterns from speech data. In this context, the main objective is to provide an overview of voice AI applied to podcasting, aiming to answer whether the current technological offerings pose a threat to audio professionals’ jobs, particularly voice-over artists. To this end, the main software used by podcast creators for voice cloning is analyzed, and a comparative framework is established. Secondly, creators’ perceptions of the results are gathered by analyzing 10 titles. The main software provides specific tools that can enhance workflow and optimize production costs. Based on the findings about the current state of voice AI in podcasting, we have identified both the opportunities and limitations this technology offers to creators. It is observed that the voice AI industry is adapting to the sector’s needs, offering multiple tools through specialized platforms that allow for voice cloning, editing recordings, publishing podcasts, and distributing them in several languages. However, it is not perceived as an immediate threat due to the reproduction of inaccurate prosody and the absence of paralinguistic elements.
See full article (PDF)
<< Back to nº 21 index See next article >>
Colabora en los próximos números de Comunicación y Hombre
AVATARES
Replicantes, impostores y sustitutos: comunicación e inteligencia artificial
CALL FOR PAPER
Trascendencia
CALL FOR PAPERS